This repository mirrors version 1.0.0 of Google's Expanded Groove MIDI Dataset
(E-GMD) for access through the Hugging Face Hub.
E-GMD is a large dataset of human drum performances with audio recordings
annotated in MIDI. It contains 444.5 hours of audio from 43 drum kits, with the
same train, validation, and test split definitions as the original Groove MIDI
Dataset.
Quick Start
```python
from datasets import load_dataset

ds = load_dataset("schism-audio/e-gmd", split="train", streaming=True)
first = next(iter(ds))
print(first["audio"], first["midi_path"], first["split"])
```
Repository Layout
The original archive contains some session folders with more than 10,000 files,
which exceeds the Hugging Face Hub's limit on the number of files per folder.
This mirror keeps the original filenames and the original drummer/session paths,
but stages the files under split and kit directories:
The root metadata.csv follows the AudioFolder convention and contains one row
per WAV file. The file_name column points at the audio file, and the remaining
columns preserve E-GMD metadata plus the paired MIDI path.
The metadata/*.csv files are retained from the original mirror and add these
path columns:
file_name: audio path relative to the split folder
audio_path: audio path relative to the repository root
midi_path: paired MIDI path relative to the repository root
original_audio_filename: original archive audio path
original_midi_filename: original archive MIDI path
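
As a rough illustration of how the root-relative path columns above might be used, the sketch below resolves audio_path and midi_path against a local copy of the repository. The sample row, the directory names in it, the /data/e-gmd root, and the resolve_pairs helper are all hypothetical and chosen only for illustration; they are not taken from the actual dataset.

```python
import csv
import io
from pathlib import Path

# Hypothetical row standing in for a metadata/*.csv file. The paths are
# invented for illustration and do not come from the real dataset.
SAMPLE_CSV = (
    "file_name,audio_path,midi_path\n"
    "kit_a/session1/example.wav,"
    "train/kit_a/session1/example.wav,"
    "train/kit_a/session1/example.midi\n"
)


def resolve_pairs(csv_text, repo_root):
    """Return (audio, midi) Path pairs resolved against a local repo root."""
    root = Path(repo_root)
    pairs = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # audio_path and midi_path are relative to the repository root,
        # so joining them onto the root gives the on-disk locations.
        pairs.append((root / row["audio_path"], root / row["midi_path"]))
    return pairs


# "/data/e-gmd" is an assumed local download location.
pairs = resolve_pairs(SAMPLE_CSV, "/data/e-gmd")
for audio, midi in pairs:
    print(audio, midi)
```

Because file_name is relative to the split folder while audio_path is relative to the repository root, a helper like this would join only the root-relative columns onto the root.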
Loading Audio With Metadata
The default dataset is loadable with datasets through the AudioFolder
convention: