關於 ChordMini

為音樂家、研究人員和音樂愛好者打造的 AI 驅動和弦辨識與音樂分析平台。

研究專案

Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

Automatic Chord Recognition (ACR) is constrained by the scarcity of aligned chord labels, as well-aligned annotations are costly to acquire. At the same time, open-weight pre-trained models are currently more accessible than their proprietary training data. In this work, we present a two-stage training pipeline that leverages pre-trained models together with unlabeled audio. The proposed method decouples training into two stages. In the first stage, we use a pre-trained BTC model as a teacher to generate pseudo-labels for over 1,000 hours of diverse unlabeled audio and train a student model solely on these pseudo-labels. In the second stage, the student is continually trained on ground-truth labels as they become available, with selective knowledge distillation (KD) from the teacher applied as a regularizer to prevent catastrophic forgetting of the representations learned in the first stage. In our experiments, two models (BTC, 2E1D) were used as students. In stage 1, using only pseudo-labels, the BTC student achieves over 98% of the teacher's performance, while the 2E1D model achieves about 96% across seven standard mir_eval metrics. After a single training run for both students in stage 2, the resulting BTC student model surpasses the traditional supervised learning baseline by 2.5% and the original pre-trained teacher model by 1.55% on average across all metrics. The resulting 2E1D student model improves from the traditional supervised learning baseline by 3.79% on average and achieves almost the same performance as the teacher. Both cases show large gains on rare chord qualities.

arXiv:2602.19778 [cs.SD]

Nghia Phan, Rong Jin, Gang Liu, Xiao Dong

學術引用

如果您在研究或學術工作中使用了 ChordMini,請引用:

@misc{phan2026enhancingautomaticchordrecognition,
      title={Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation}, 
      author={Nghia Phan and Rong Jin and Gang Liu and Xiao Dong},
      year={2026},
      eprint={2602.19778},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2602.19778}, 
}

Nghia Phan, Rong Jin, Gang Liu, and Xiao Dong. “Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation.” arXiv preprint arXiv:2602.19778, 2026. https://arxiv.org/abs/2602.19778

應用專案

技術架構

前端

  • • Next.js 15 (React Framework)
  • • TypeScript
  • • Tailwind CSS
  • • Framer Motion
  • • Chart.js & D3.js

後端與機器學習

  • • Python Flask (Google Cloud Run)
  • • Firebase Firestore
  • • Vercel Blob Storage
  • • 自訂機器學習模型

功能特色

機器學習模型

  • • Beat-Transformer 節拍偵測
  • • Chord-CNN-LSTM 和弦辨識
  • • BTC 模型提升準確度
  • • 即時音訊處理

平台功能

  • • YouTube 整合
  • • 同步歌詞顯示
  • • 樂譜生成
  • • 多語言支援

致謝

ChordMini 建立在許多優秀的開源專案和服務之上。 我們衷心感謝以下第三方函式庫與服務:

第三方函式庫與服務

@tombatossals/react-chords

吉他和弦圖表視覺化元件,用於在吉他和弦分頁中顯示互動式指法圖。

LRClib

歌詞同步服務,為歌詞與和弦功能提供時間同步的歌詞資料。

youtube-search-api

YouTube 搜尋功能,用於直接從平台上尋找和分析音樂影片。

yt-dlp

YouTube 音訊擷取工具,用於下載和處理音訊內容以進行和弦分析。

Genius API

歌詞與歌曲元資料服務,提供完整的歌曲資訊和歌詞資料。

Music.AI

AI 驅動的音樂轉錄服務,用於逐字歌詞同步和音訊分析。

Google Gemini API

AI 語言模型,用於歌詞翻譯、異名同音和弦校正和智慧音樂分析。

我們衷心感謝所有這些專案的開發者和維護者, 感謝他們將作品貢獻給開源社群。

聯絡與合作

如有研究諮詢、合作機會或技術問題,請聯繫: