Speech Enhancement Tutorials

This repo collects speech enhancement tutorials, with a focus on time-frequency domain methods. You can use it to experiment with a range of speech enhancement techniques.

Update:

2024.05.15 Code uploaded

Coming soon:

Requirements

This repo has been tested with Ubuntu 22.04, PyTorch 2.0.1, Python 3.10, and CUDA 11.7. Install the package dependencies with:

pip install -r requirements.txt
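
After installing, it can help to confirm which versions are actually active. The following is a minimal sketch, not part of this repo: the function name `describe_environment` is hypothetical, and it only reports versions rather than enforcing the ones listed above.

```python
import importlib.util
import platform


def describe_environment():
    """Collect the versions mentioned above without requiring PyTorch."""
    info = {"python": platform.python_version(), "torch": None, "cuda": None}
    # Import torch only if it is installed, so the check itself never crashes.
    if importlib.util.find_spec("torch") is not None:
        import torch

        info["torch"] = torch.__version__
        info["cuda"] = torch.version.cuda  # None on CPU-only builds
    return info


if __name__ == "__main__":
    for name, version in describe_environment().items():
        print(f"{name}: {version}")
```

If `torch` or `cuda` prints `None`, PyTorch is either missing or is a CPU-only build.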

Getting started

Install the required libraries.
Download the VoiceBank+DEMAND database, or prepare your own database, and place it in the '../Dataset/' folder.

├── 📦 SE_Tutorials
│   ├── 📂 models
│   │   ├── 📂 ref
│   │   │   └── ...
│   │   ├── ED_FNN.py
│   │   └── ED_CNN.py
│   ├── options.py
│   ├── train_interface.py
│   └── ...
└── 📦 Dataset
    └── 📂 VBD (or ...)
        ├── 📂 train
        │   ├── clean
        │   └── noisy
        └── 📂 test
            ├── clean
            └── noisy
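
Before training, it can be useful to verify that the dataset follows the layout above. The sketch below is illustrative only and is not the repo's own data loader: the helper name `pair_files` is hypothetical, and it assumes that each noisy file has a clean counterpart with the same file name and that the audio files use the `.wav` extension.

```python
from pathlib import Path


def pair_files(split_dir, pattern="*.wav"):
    """Return (noisy, clean) path pairs for one split, matched by file name."""
    split_dir = Path(split_dir)
    clean = {p.name: p for p in (split_dir / "clean").glob(pattern)}
    noisy = {p.name: p for p in (split_dir / "noisy").glob(pattern)}
    # Any name present in only one of the two folders is an unpaired file.
    unpaired = sorted(set(clean) ^ set(noisy))
    if unpaired:
        raise FileNotFoundError(f"unpaired files in {split_dir}: {unpaired[:5]}")
    return [(noisy[name], clean[name]) for name in sorted(clean)]


if __name__ == "__main__":
    # Path assumed from the layout above; adjust it to your own dataset folder.
    train_dir = Path("../Dataset/VBD/train")
    if train_dir.is_dir():
        print(f"{len(pair_files(train_dir))} training pairs found")
```

Running the same check on the test folder catches missing or misnamed files before a long training run starts.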

Run train_interface.py

To adjust any parameter settings, edit options.py.

For easy start

We have prepared a notebook (.ipynb) that you can run directly.

Baseline model architecture

Techniques

The following techniques are available in this repo:

Noisy database generation
Normalization
Compression
Domain
Joint loss function
Perceptual loss function
Adversarial training
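
To make two of the items above concrete, here is a minimal NumPy sketch of noisy database generation (mixing a clean signal with noise at a target SNR) and of spectral compression. It is written under stated assumptions and is not the repo's implementation: the function names `mix_at_snr` and `compress_magnitude` are hypothetical, power-law compression is only one possible choice of compression, and the exponent 0.3 is an illustrative default.

```python
import numpy as np


def mix_at_snr(clean, noise, snr_db, eps=1e-12):
    """Scale `noise` so that clean + noise has the requested SNR in dB."""
    clean = np.asarray(clean, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    # Repeat or trim the noise so it covers the whole clean signal.
    if len(noise) < len(clean):
        noise = np.tile(noise, int(np.ceil(len(clean) / len(noise))))
    noise = noise[: len(clean)]
    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2) + eps  # eps avoids division by zero
    # SNR_dB = 10 * log10(P_clean / (g^2 * P_noise)), solved for the gain g.
    gain = np.sqrt(clean_power / (noise_power * 10 ** (snr_db / 10)))
    return clean + gain * noise


def compress_magnitude(spec, exponent=0.3):
    """Power-law compress a complex spectrogram's magnitude, keeping its phase."""
    return np.abs(spec) ** exponent * np.exp(1j * np.angle(spec))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.standard_normal(16000)  # stand-in for one second of speech
    noise = rng.standard_normal(4000)   # shorter noise clip, repeated as needed
    noisy = mix_at_snr(clean, noise, snr_db=5.0)
    print(noisy.shape)
```

Compressing the magnitude reduces the dynamic range of the spectrogram, so low-energy speech components contribute more to a spectral loss than they would on the raw magnitude.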

Performance ranks (using VoiceBank+DEMAND database)

The scores in this table are taken from the values reported in each model's paper.

Model	Params (M)	Causality	PESQ	CSIG	CBAK	COVL	STOI	SSNR	Year	Input	Code
Noisy	-	-	1.97	3.35	2.44	2.63	0.91	1.68	-	-	-
SEGAN	97.47	✗	2.16	3.48	2.94	2.80	0.92	7.73	2017	Time	✗
MetricGAN	-	✗	2.86	3.99	3.18	3.42	-	-	2019	Magnitude	✓
PHASEN	0.92	✗	2.99	4.21	3.55	3.62	-	10.08	2020	Magnitude+Phase	✗

Reference

Contact

Please get in touch with us if you have any questions or suggestions.
E-mail: allmindfine@yonsei.ac.kr
