Skip to content
View Kabanosk's full-sized avatar
📚
📚

Highlights

  • Pro

Organizations

@pavo-company

Block or report Kabanosk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

ideas

12 repositories

Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM

Python 357 33 Updated Mar 24, 2023

Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - unofficial work in progress

Python 61 2 Updated Apr 2, 2020

Speaker embedding (d-vector) trained with GE2E loss

Python 274 47 Updated Jan 8, 2024

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 937 161 Updated Jul 5, 2023

Audio super resolution using neural networks

Python 1,211 208 Updated Oct 24, 2023

The Implementation of FastSpeech based on pytorch.

Python 862 213 Updated Jul 6, 2023

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,031 209 Updated Oct 23, 2024

A modified VITS that utilizes phoneme duration's ground truth for better robustness

Python 126 39 Updated Aug 27, 2023

This repository presents a subset of our proposed FSD dataset for song deepfake detection.

Python 22 Updated Sep 14, 2024

Curated list of python software and packages related to scientific research in audio

1,591 172 Updated Jul 14, 2023

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 990 215 Updated Aug 28, 2023