GCT634 Fall 2022

Musical Applications of Machine Learning

Course Description

This course aims to learn the new technology to learn machine learning with applications to the music and audio domains. Specificially, we handle various tasks in the topics of music and audio classification, automatic music transcription, source separation, sound synthesis, and music generation. Student will have hands-on experiences using audio processing and machine learning libraries through the assignments and gain experience of the full cycle of research through the final research project.

General Information

  • Instructor: Juhan Nam (남주한)
  • TAs: Seungheon Doh(도승헌), Jaekwon Im(임재권), Houn Su Kim(김현수)
  • Time: Mon/Wed 14:30 - 16:00
  • Place: N25#3229 Paik Nam June Hall (백남준홀)
  • Course Format: online or hybrid (onsite + online)

Grading Policy

  • Assignments: 50 %
  • Research Project: 50%
    • Paper review
    • Presentation
    • Report

Textbooks

                       

Schedule

Week Topics
1
  • [Aug-29, online] Course Introduction [slides]
  • [Aug-31, online] Audio Data Representations [slides]
  • [practice] audio_representations.ipynb
  • Suggested Readings
    • The FMP book (Chapter 1: Music Representations)
    • The FMP book (Chapter 2: Fourier Analysis of Signals)
2
3
  • [Sep-12] No Class (Chuseok Holiday)
  • [Sep-14, online] Music Classification - Traditional Machine Learning [slides]
  • [practice] Unsupervised Learning.ipynb
  • [Homework #1] Musical Instrument Recognition (Due Sep 25, 11:59 PM) [link]
  • Suggested Readings
    • The PRML book (Chapter 3: Linear Models for Regression)
    • The PRML book (Chapter 4: Linear Models for Classification)
    • The PRML book (Chapter 5: Neural Networks)
    • The PRML book (Chapter 6: Kernel Methods)
    • The PRML book (Chapter 9: Mixture Models and EM)
    • The PRML book (Chapter 12: Continuous Latent Variables)
    • Musical Genre Classification of Audio Signals, Tzanetakis and Cook (2002)
4
5
6
  • [Oct-3] No Class (National Holiday)
  • [Oct-5, hybrid] HW1 review, HW2 introduction, and Music Recommendation and Retrieval (Cont'd)
  • [Homework #2] Music Auto-Tagging (Due Oct 16, 11:59 PM) [link]
7
8
  • [Oct-17, online] Automatic Music Transcription - Polyphonic Pitch Estimation and Note Transcription (Make-Up) [slides]
  • Suggested Readings
    • The FMP book (Chapter 8: Musically Informed Audio Decomposition)
9
  • [Oct-24, hybrid] Automatic Music Transcription - Audio-to-Score Alignment [slides]
  • [Oct-26, hybrid] HW2 review, HW3 introduction, and Automatic Music Transcription - Audio-to-Score Alignment (Cont'd)
  • [Homework #3] Polyphonic Piano Transcription (Due Nov 9, 11:59 PM) [link]
  • Suggested Readings
    • The FMP book (Chapter 3: Music Synchronization)
10
11
12
  • [Nov-14, hybrid] Symbolic Music Generation - Overview and RNN Models [slides]
  • [Nov-16, hybrid] HW3 Review, HW4 Introduction, Symbolic Music Generation - RNN Models (Cont'd)
  • [Nov-18, video] Symbolic Music Generation - Transformer Models [slides]
  • Individual Project Meeting with Professor
  • [Homework #4] (Optional) Symbolic Music Generation [link]
13
  • [Nov-21, hybrid] Audio Generation - Overview and Deep Learning Models [slides]
  • [Nov-23, hybrid] Singing Voice Synthesis (Invited Talk: Soonbeom Choi, PhD Student)
  • Individual Project Meeting with Professor
14
  • [Nov-28 and Nov-30, hybrid] Student Presentation: Project Introduction
15
  • [Dec-7, online] Invited Talk: Dr. Taesu Kim (Neosapience CEO)
16