關於 ChordMini
為音樂家、研究人員和音樂愛好者打造的 AI 驅動和弦辨識與音樂分析平台。
研究專案
Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation
Automatic Chord Recognition (ACR) is constrained by the scarcity of aligned chord labels, as well-aligned annotations are costly to acquire. At the same time, open-weight pre-trained models are currently more accessible than their proprietary training data. In this work, we present a two-stage training pipeline that leverages pre-trained models together with unlabeled audio. The proposed method decouples training into two stages. In the first stage, we use a pre-trained BTC model as a teacher to generate pseudo-labels for over 1,000 hours of diverse unlabeled audio and train a student model solely on these pseudo-labels. In the second stage, the student is continually trained on ground-truth labels as they become available, with selective knowledge distillation (KD) from the teacher applied as a regularizer to prevent catastrophic forgetting of the representations learned in the first stage. In our experiments, two models (BTC, 2E1D) were used as students. In stage 1, using only pseudo-labels, the BTC student achieves over 98% of the teacher's performance, while the 2E1D model achieves about 96% across seven standard mir_eval metrics. After a single training run for both students in stage 2, the resulting BTC student model surpasses the traditional supervised learning baseline by 2.5% and the original pre-trained teacher model by 1.55% on average across all metrics. The resulting 2E1D student model improves from the traditional supervised learning baseline by 3.79% on average and achieves almost the same performance as the teacher. Both cases show large gains on rare chord qualities.
Nghia Phan, Rong Jin, Gang Liu, Xiao Dong
學術引用
如果您在研究或學術工作中使用了 ChordMini,請引用:
@misc{phan2026enhancingautomaticchordrecognition,
title={Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation},
author={Nghia Phan and Rong Jin and Gang Liu and Xiao Dong},
year={2026},
eprint={2602.19778},
archivePrefix={arXiv},
primaryClass={cs.SD},
url={https://arxiv.org/abs/2602.19778},
}Nghia Phan, Rong Jin, Gang Liu, and Xiao Dong. “Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation.” arXiv preprint arXiv:2602.19778, 2026. https://arxiv.org/abs/2602.19778
應用專案
技術架構
前端
- • Next.js 15 (React Framework)
- • TypeScript
- • Tailwind CSS
- • Framer Motion
- • Chart.js & D3.js
後端與機器學習
- • Python Flask (Google Cloud Run)
- • Firebase Firestore
- • Vercel Blob Storage
- • 自訂機器學習模型
功能特色
機器學習模型
- • Beat-Transformer 節拍偵測
- • Chord-CNN-LSTM 和弦辨識
- • BTC 模型提升準確度
- • 即時音訊處理
平台功能
- • YouTube 整合
- • 同步歌詞顯示
- • 樂譜生成
- • 多語言支援
致謝
ChordMini 建立在許多優秀的開源專案和服務之上。 我們衷心感謝以下第三方函式庫與服務:
第三方函式庫與服務
@tombatossals/react-chords
吉他和弦圖表視覺化元件,用於在吉他和弦分頁中顯示互動式指法圖。
LRClib
歌詞同步服務,為歌詞與和弦功能提供時間同步的歌詞資料。
youtube-search-api
YouTube 搜尋功能,用於直接從平台上尋找和分析音樂影片。
yt-dlp
YouTube 音訊擷取工具,用於下載和處理音訊內容以進行和弦分析。
Genius API
歌詞與歌曲元資料服務,提供完整的歌曲資訊和歌詞資料。
Music.AI
AI 驅動的音樂轉錄服務,用於逐字歌詞同步和音訊分析。
Google Gemini API
AI 語言模型,用於歌詞翻譯、異名同音和弦校正和智慧音樂分析。
我們衷心感謝所有這些專案的開發者和維護者, 感謝他們將作品貢獻給開源社群。
聯絡與合作
如有研究諮詢、合作機會或技術問題,請聯繫:
電子郵件: phantrongnghia510@gmail.com
GitHub: ChordMiniApp Repository