Skip to content
View Kabanosk's full-sized avatar
📚
📚

Highlights

  • Pro

Organizations

@pavo-company

Block or report Kabanosk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

speech-deep-learning

21 repositories

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python 914 242 Updated Apr 13, 2024

A must-read paper for speech separation based on neural networks

770 137 Updated Apr 18, 2022

Deep learning for audio denoising

Python 682 128 Updated Oct 15, 2023

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of sep…

Jupyter Notebook 314 34 Updated Jul 6, 2023

A timeline of the latest AI models for audio generation, starting in 2023!

1,899 70 Updated Jan 4, 2024

End-to-End Speech Processing Toolkit

Python 8,761 2,212 Updated Feb 5, 2025

Speech Enhancement Generative Adversarial Network in TensorFlow

Python 830 281 Updated Mar 24, 2023

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 53,471 8,887 Updated Aug 14, 2024

speech enhancement\speech seperation\sound source localization

1,087 224 Updated Nov 14, 2023

Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.

Python 325 67 Updated Oct 4, 2022

Conformer-based Metric GAN for speech enhancement

Python 338 62 Updated May 3, 2024

Speech Enhancement Generative Adversarial Network in PyTorch

Python 384 110 Updated Aug 16, 2023

The Hugging Face Course on Transformers for Audio

MDX 369 105 Updated Jan 23, 2025

Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemented for the task of speech enhancement in the time-domain.

Python 218 40 Updated Mar 24, 2023

A neural network for end-to-end speech denoising

Python 684 164 Updated Jul 6, 2023

🐸 collection of TTS papers

666 68 Updated Jul 4, 2024

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 937 161 Updated Jul 5, 2023

Audio super resolution using neural networks

Python 1,211 208 Updated Oct 24, 2023

Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.

Jupyter Notebook 360 57 Updated Mar 25, 2023

AEC Challenge

396 132 Updated Jun 4, 2024

A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"

Python 139 41 Updated Oct 21, 2019