關於 ChordMini

為音樂家、研究人員和音樂愛好者打造的 AI 驅動和弦辨識與音樂分析平台。

研究專案

Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

Automatic Chord Recognition (ACR) is constrained by the scarcity of aligned chord labels, as well-aligned annotations are costly to acquire. At the same time, open-weight pre-trained models are currently more accessible than their proprietary training data. In this work, we present a two-stage training pipeline that leverages pre-trained models together with unlabeled audio. The proposed method decouples training into two stages. In the first stage, we use a pre-trained BTC model as a teacher to generate pseudo-labels for over 1,000 hours of diverse unlabeled audio and train a student model solely on these pseudo-labels. In the second stage, the student is continually trained on ground-truth labels as they become available, with selective knowledge distillation (KD) from the teacher applied as a regularizer to prevent catastrophic forgetting of the representations learned in the first stage. In our experiments, two models (BTC, 2E1D) were used as students. In stage 1, using only pseudo-labels, the BTC student achieves over 98% of the teacher's performance, while the 2E1D model achieves about 96% across seven standard mir_eval metrics. After a single training run for both students in stage 2, the resulting BTC student model surpasses the traditional supervised learning baseline by 2.5% and the original pre-trained teacher model by 1.55% on average across all metrics. The resulting 2E1D student model improves from the traditional supervised learning baseline by 3.79% on average and achieves almost the same performance as the teacher. Both cases show large gains on rare chord qualities.

arXiv:2602.19778 [cs.SD]

Nghia Phan, Rong Jin, Gang Liu, Xiao Dong

學術引用

如果您在研究或學術工作中使用了 ChordMini，請引用：

@misc{phan2026enhancingautomaticchordrecognition,
      title={Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation}, 
      author={Nghia Phan and Rong Jin and Gang Liu and Xiao Dong},
      year={2026},
      eprint={2602.19778},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2602.19778}, 
}

Nghia Phan, Rong Jin, Gang Liu, and Xiao Dong. “Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation.” arXiv preprint arXiv:2602.19778, 2026. https://arxiv.org/abs/2602.19778

應用專案

技術架構

前端

• Next.js 15 (React Framework)
• TypeScript
• Tailwind CSS
• Framer Motion
• Chart.js & D3.js

後端與機器學習

• Python Flask (Google Cloud Run)
• Firebase Firestore
• Vercel Blob Storage
• 自訂機器學習模型

功能特色

機器學習模型

• Beat-Transformer 節拍偵測
• Chord-CNN-LSTM 和弦辨識
• BTC 模型提升準確度
• 即時音訊處理

平台功能

• YouTube 整合
• 同步歌詞顯示
• 樂譜生成
• 多語言支援

致謝

ChordMini 建立在許多優秀的開源專案和服務之上。我們衷心感謝以下第三方函式庫與服務：

聯絡與合作

如有研究諮詢、合作機會或技術問題，請聯繫：

電子郵件： phantrongnghia510@gmail.com

GitHub： ChordMiniApp Repository

關於 ChordMini

研究專案

Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

學術引用

應用專案

技術架構

前端

後端與機器學習

功能特色

機器學習模型

平台功能

致謝

第三方函式庫與服務

@tombatossals/react-chords

LRClib

youtube-search-api

yt-dlp

Genius API

Music.AI

Google Gemini API

聯絡與合作

ChordMini