Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
-
Updated
Feb 21, 2024 - Python
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Efficient Training of Audio Transformers with Patchout
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
Training code of Cornell Birdcall Identification Challenge 6th place solution
Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.
Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, and sound event detection. Implemented using PyTorch.
6th place solution to Freesound Audio Tagging 2019 kaggle competition
Easy to use Audio Tagging in PyTorch
A Swiss Army knife for programmatic music library management. Manages both local and music streaming service libraries.
Google's AudioSet consistently reformatted
Scripts to process Google's Audioset
Use shazam to rename and Auto filled the tag of a list of mp3 and opus files
Kaggle Freesound Audio Tagging 2019 Challenge (top 5%)
Automatically download and tag Deezer tracks, albums and playlists, using free-mp3-download.net
Extended repository w. Cnn14, ResNet38 & Wavegram-LogMel_Cnn14 models for Audio Tagging
Real Time audio tagger using Whisper.ai Audio Tagging (AT)
Add a description, image, and links to the audio-tagging topic page so that developers can more easily learn about it.
To associate your repository with the audio-tagging topic, visit your repo's landing page and select "manage topics."