3D-Speaker
Datasets

A large-scale multi-Device, multi-Distance, and multi-Dialect audio dataset of human speech

10000+ Speakers

3D-Speaker Datasets contains more than 10,000 speakers.

Multiple Devices

Device labels of all utterances are provided. Devices include phones, recording pens, PC laptops, microphone arrays, etc.

Multiple Distances

The distance from the sound source to each recording device has been tracked and is provided. Distances range from 0.1 m to 4 m, covering most common scenarios.

Multiple Dialects

3D-Speaker Datasets covers 14 different Mandarin dialects.
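
The device, distance, and dialect labels described above lend themselves to simple per-utterance filtering. The following is a minimal sketch of what that could look like, not the dataset's actual metadata format: the `Utterance` class, its field names, the `select` helper, and the toy rows in `SAMPLE` are all hypothetical and invented for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Utterance:
    # Hypothetical record layout; the real metadata files may be organized differently.
    utt_id: str
    speaker_id: str
    device: str        # e.g. "phone", "recording_pen", "laptop", "mic_array"
    distance_m: float  # source-to-device distance in meters
    dialect: str       # placeholder dialect label


def select(
    utts: List[Utterance],
    device: Optional[str] = None,
    min_distance_m: float = 0.0,
    max_distance_m: float = 4.0,
    dialect: Optional[str] = None,
) -> List[Utterance]:
    """Return utterances matching the given device, distance range, and dialect."""
    return [
        u
        for u in utts
        if (device is None or u.device == device)
        and min_distance_m <= u.distance_m <= max_distance_m
        and (dialect is None or u.dialect == dialect)
    ]


# Toy rows made up for this example; they are not taken from the dataset.
SAMPLE = [
    Utterance("utt0001", "spk0001", "phone", 0.1, "dialect_a"),
    Utterance("utt0002", "spk0001", "mic_array", 4.0, "dialect_a"),
    Utterance("utt0003", "spk0002", "laptop", 1.5, "dialect_b"),
]

# Keep only recordings made at least 1 m from the sound source.
far_field = select(SAMPLE, min_distance_m=1.0)
print([u.utt_id for u in far_field])
```

Keeping the three labels as plain fields on one record makes it easy to build matched subsets, for example the same speaker recorded on different devices or at different distances.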