
An Open Dataset of Synthetic Speech
This paper introduces a multilingual, multispeaker dataset composed of synthetic and natural speech, designed to foster research and benchmarking in synthetic speech detection.

Word-Class Embeddings for Multiclass Text Classification
Code for Word-Class Embeddings (WCEs), a form of supervised embeddings especially suited for multiclass text classification.

Neighborhood Contrastive Learning for Novel Class Discovery
A holistic learning framework for Novel Class Discovery (NCD), which adopts contrastive learning to learn discriminate features with both the labeled and unlabeled data.

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation
we study the task of synthetic-to-real domain generalized semantic segmentation, which aims to learn a model that is robust to unseen real-world scenes using only synthetic data.

Novel Class Discovery in Semantic Segmentation (NCDSS)
We introduce a new setting of Novel Class Discovery in Semantic Segmentation (NCDSS), which aims at segmenting unlabeled images containing new classes given prior knowledge from a labeled set of disjoint classes.

ToDY: Time of Day/Year: Dataset for visual time of day and season classification
The dataset provides training and valiation data for classifying images by time of day and season (time of year). The images are taken from the Skyfinder dataset, containing webcam images along with timestamps and geolocation.