Datasets
Equalyz Crowd: Africa's largest network for ethical voice data collection.
Where Every Voice Finds Its Story
Our collection is hyperlocal in every sense of the word. Across homes, markets, schools, and farms all over Africa. By engaging everyday speakers in their communities, we capture the rich nuances, slangs, and contexts that make each language truly come alive.
Powering AI across key areas impacting everyday people
Our multimodal datasets (speech, text, images, video) are tailored to key domains
Health
Capturing patient–provider dialogues and community wellness narratives
Finance
Education
Agriculture
Three collection tools.
Built for how Africa actually accesses technology.
We meet contributors where they are, online, or offline. With smartphone or on a feature phone
Equalyz Crowd
Browser-based contribution platform for online communities
Contributors record voice, transcribe audio, and validate language data directly in the browser. Built for university partners, NGO research teams, and online communities with stable internet access.
- Multi-task workflows: recording, transcription, validation
- Project-specific dashboards for partners
- Real-time quality control and contributor feedback
- Automatic export to ML-ready formats
Equalyz Gram
Voice and text contribution through the messaging app people already use.
A Telegram-based bot that lets contributors record and submit voice samples through a chat interface they already know. No new app to download, no new account to create.
- Works on any smartphone with Telegram installed
- Familiar interface, low onboarding friction
- Voice notes, text prompts, and audio playback in one chat flow
- Realtime reviews delivered directly through the chat
VoiceBridge
Voice data contribution from any phone, no internet required.
Let contributors call a dedicated number and contribute speech in rich dialects and accents over feature phones. Built for the populations smartphone-first tools cannot reach: rural communities, and regions with limited connectivity
- Works on any mobile network across Africa
- Dedicated phone numbers per project
- Structured prompts and automatic capture
- Reaches populations no other tool can
Ethical by Design, Trusted by Communities
Community Partnership
Guided by cultural custodians, every contribution is fully consented.
Fair Benefit Sharing
Privacy First
Rigorous removal of personal identifiers and sensitive content filtering.
True Representation
Native speakers and experts validate dialects for genuine, inclusive coverage.
Real Results: Our Data in Action
Hours of Yoruba Audio
Hours of Accented Nigerian English Audio
Hausa Financial Conversations
Yoruba Health Conversations