Voice of Hausa

Discover the Hausa language, spoken widely across West Africa, as a rich expression of culture, heritage, and community traditions.

Overview

Hausa is a major West African language spoken by millions, with a rich oral tradition and cultural heritage. This page provides curated datasets of audio and text to support AI research, language learning, and preservation efforts.

Sample Audio

Transcript: Habari, unaendeleaje? (Hello, how are you doing?)

Play
Pause

Writing system

Hausa uses the Latin-based Boko script for most modern writing, and the traditional Ajami script, derived from Arabic, is still used in religious and cultural texts. This dual system reflects the language’s rich history and adaptability.

What’s Here Now

Urban Dialogue

Play
Pause

Transcript: Habari, unaendeleaje? (Hello, how are you doing?)

Market Talk

Play
Pause

Transcript: Habari, unaendeleaje? (Hello, how are you doing?)

Community Radio

Play
Pause

Transcript: Habari, unaendeleaje? (Hello, how are you doing?)

Why It Matters
for AI

Bukusu is under‑represented online; adding clear, community‑reviewed audio can help build tools for local broadcasting, education, and assistive technologies.

Speech Recognition

Speech recognition data captures real Hausa pronunciation and intonation, improving voice assistants, transcription tools, and accessibility systems.

Translation

Translation datasets enable AI to map Hausa words and meanings to other languages, supporting education, communication, and cultural exchange.

Information Access

Information access datasets allow AI to deliver Hausa-language content and resources, bridging the digital gap for native speakers.

Hausa Datasets

Hausa Corpus v1.0

Version: 1.0
Size: 2GB
License: CC-BY 4.0
DOI: 10.1234/swahili.001

Hausa Corpus v1.0

Version: 1.0
Size: 2GB
License: CC-BY 4.0
DOI: 10.1234/swahili.001

Our platform digitally preserves Africa’s rich linguistic diversity by collecting audio, text, and community contributions to build a comprehensive database for research, learning, and AI model training.

Collaborators

Contact us if interested in collaborations. 

© 2025 All Rights Reserved.

Scroll to Top

Request Access

Request access to the Ogiek language datasets. Sign in to view and download curated audio and text resources for AI research, language preservation, and educational purposes.

Request Access

Contribute Data

Contribute your recordings and transcripts to help preserve the Ogiek language. Submit audio, text, and consent forms to support AI research, education, and cultural preservation.

Contribution form