[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
-
Updated
Jul 11, 2023 - Python
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
This repository contains the implementation of Environmental Sound Classification on the ESC-50 dataset using the ACDNet.
Simplified PyTorch implementation of audio classification, support multi-gpu training and validating, automatic mixed precision training, knowledge distillation etc.
Model Deployment for HEAR4U Bangkit Capstone Project
This project focuses on the ESC50 Challenge. The ESC-50 dataset is a labeled collection of 2000 environmental audio recordings suitable for benchmarking methods of environmental sound classification.
SoundGuard is a GenAI agent that detects emergency sounds, explains what it hears, and responds like a smart assistant — built with YAMNet, Gradio, Google Cloud, and deployed on Hugging Face Spaces.
Add a description, image, and links to the esc50 topic page so that developers can more easily learn about it.
To associate your repository with the esc50 topic, visit your repo's landing page and select "manage topics."