📚 Sogang Univ. CSEG109/AIEG109 Lab - Audio processing, Language Models, FastSpeech2 TTS with PyTorch & Colab

june-oh/2024_cseg109

LAB for Audio Recognition and Audio Synthesis

Python PyTorch Matplotlib Torchaudio Librosa

Repository for the lab materials of the Sogang University CSE5109/CSEG109/AIE5109/AIEG109 courses.

Lab Contents

LAB 1: Basic Signal Processing and Audio File Handling

  • Basic data processing with NumPy and matplotlib
  • Basic signal processing with Python
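
As a rough illustration of the kind of basic signal processing LAB 1 covers, the sketch below synthesizes a sine tone with NumPy and locates its dominant frequency with a real FFT. It is not taken from the lab notebooks: the helper name `dominant_frequency`, the 16 kHz sample rate, and the 440 Hz tone are all assumptions chosen for the example.

```python
import numpy as np


def dominant_frequency(signal, sample_rate):
    """Return the frequency (Hz) of the largest-magnitude bin of the real FFT."""
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    return float(freqs[np.argmax(spectrum)])


# Assumed values for illustration only.
sample_rate = 16000   # samples per second
duration = 1.0        # seconds
tone_hz = 440.0       # frequency of the synthetic tone

# Time axis and a half-amplitude sine wave.
t = np.arange(int(sample_rate * duration)) / sample_rate
tone = 0.5 * np.sin(2 * np.pi * tone_hz * t)

peak_hz = dominant_frequency(tone, sample_rate)
print(f"dominant frequency: {peak_hz:.1f} Hz")
```

With a one-second signal the FFT bins are spaced 1 Hz apart, so the tone falls exactly on a bin. The same arrays can be passed to `matplotlib.pyplot.plot` to inspect the waveform or spectrum in Colab.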

LAB 2: Deep Learning Basics and Language Models

  • Deep learning basics with PyTorch
  • Implementing a simple language model with an MLP
  • Model training and evaluation

LAB 3: Speech Synthesis

  • Korean-FastSpeech2-Pytorch
    • (Forked) Korean-FastSpeech2-Pytorch
    • Based on the repository above, modified so that Korean speech synthesis can run in Colab.
  • FastSpeech2 model training and evaluation
  • FastSpeech2 model inference

Lab Notebooks

  1. Lab 1 - Basic Audio File Handling
  2. Lab 2-0 - PyTorch and Deep Learning
  3. Lab 2-1 - Language Model Exercise
  4. Lab 3-1 - FastSpeech2 Training
  5. Lab 3-2 - FastSpeech2 Inference

Requirements

Lab environment

  • Colab

Required libraries (pre-installed in the Colab environment)

  • numpy
  • matplotlib
  • torch
  • torchaudio

Notes

  • Each lab is set up to run in the Google Colab environment.
  • The lab materials will continue to be updated.
  • Solution notebooks will be added after the labs conclude.
