
Uncertainty-Based Learning of a Lightweight Model for Multimodal Emotion Recognition
In this paper, the authors propose a lightweight neural network architecture that extracts and performs the analysis of multimodal information using the same audio and visual networks across multiple temporal segments.

CO2A – Contrastive Conditional domain Alignment
A novel unsupervised domain adaptation approach for action recognition from videos, inspired by recent literature on contrastive learning.

pygrank
pygrank is an open source framework to define, run and evaluate node ranking algorithms.

InDistill
InDistill enchances the effectiveness of the Knowledge Distillation procedure by leveraging the properties of channel pruning to both reduce the capacity gap between the models and retain the information geometry.

ToDY: Time of Day/Year: Dataset for visual time of day and season classification
The dataset provides training and valiation data for classifying images by time of day and season (time of year). The images are taken from the Skyfinder dataset, containing webcam images along with timestamps and geolocation.

VISIONE Feature Repository for VBS: Multi-Modal Features and Detected Objects from MVK Dataset
This repository contains a diverse set of features extracted from the marine video (underwater) dataset (MVK) .