Speech Recognition Data
We provide a worldwide collection of speech datasets that are diverse, scalable, and meticulously transcribed, perfect for training machines to accurately recognize and understand different types of languages.
CONTACT US
How it Works
Algorithm
Development
Data Demand
Generation
Dataset
Definition/Design
Trial and
Improvement
Mass
production
Quality Control
Data Package
Delivery
Data Collection and Annotation
Our diverse facial data is collected from around the globe, under a wide variety of situations.
Environments
Indoor
Studio
Outdoor
Incar
Devices
Mobile (iOS/Android)
Computer (Desktop/Laptop)
Pro (Hi-Fi recorder/Mic Array)
Speakers
Language: Chinese/English/French/German…
Gender balanced: 1:1
Age: Children/Senior
Education Background
Machine
annotate
Human
transcribe / Validate
rounds QA by
human & machine
Data Annotation
Accuracy between 95%~98%
Surfing Tech applies its own algorithm during speech annotation to ensure high efficiency and accuracy. We achieve above 95% accuracy rate after three rounds of quality inspection, which makes the datasets more valuable for speech recognition, semantic understanding, and human-computer interaction.
Speech Data Portfolio
Basic
Chinese Mandarin: 10,000 speakers
Chinese Conversation: 500 speakers
Age
Children Mandarin: 10,000 speakers
Senior Mandarin: 800 speakers
Accent
Hakka Dialect: 2,000 speakers
Southwest China: 1,000 speakers
Central China: 1,000 speakers
Environment
In-car: Planning
Office: Planning
Language
Mandarin-English Mixed: 9,000 speakers
American English: 1,500 speakers
Singaporean English: 300 speakers
Singaporean English: 300 speakers