News Digital

Google launches WAXAL, an open-source voice dataset for African languages

Google launches WAXAL, an open-source voice dataset for African languages
Wednesday, 04 February 2026 14:38
  • Google launches WAXAL open-source African language voice database
  • Dataset offers 11,000 hours across 21 languages, free on Hugging Face
  • Project aims to boost voice AI access, led by African institutions

Google has officially launched WAXAL, an open-source voice database designed to support the development of artificial intelligence (AI) technologies capable of understanding and reproducing African languages. The project was developed over three years in partnership with institutions across the continent. It aims to address a long-standing shortage of linguistic data, widely seen as a major obstacle to the growth of voice AI in sub-Saharan Africa.

Now available for free on the Hugging Face platform, WAXAL contains more than 11,000 hours of voice recordings drawn from nearly two million audio files. The database covers 21 African languages, including Hausa, Yoruba, Luganda, Acholi, Swahili, Igbo, and Fulani.

African partners led the data collection effort. Makerere University in Uganda and the University of Ghana coordinated work on 13 languages, while the Rwandan initiative Digital Umuganda contributed five additional languages. Regional studios also helped produce high-quality recordings. The African Institute for Mathematical Sciences (AIMS) took part in developing multilingual corpora for future versions.

Built as a foundational resource, WAXAL provides around 1,250 hours of transcribed speech for automatic speech recognition. It also includes more than 20 hours of studio recordings intended for speech synthesis. The goal is to enable the development of voice-based applications, such as voice assistants, dictation tools, or public services accessible to people with limited literacy, particularly in the fields of health, education, and agriculture.

This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages, finally reaching over 100 million people,” said Aisha Walcott-Bryant, head of Google Research Africa.

The launch of WAXAL comes as efforts to advance African linguistic technologies continue to expand. In 2025, Nigeria presented N-ATLAS, an open-source linguistic model capable of transcribing speech in Yoruba, Hausa, Igbo, and Nigerian English. In the private sector, African startups are also developing voice recognition and translation solutions aimed at local needs.

Sub-Saharan Africa has more than 2,000 languages, but only a small number currently have the resources needed for natural language processing. This limits access to voice technologies for millions of people, even as such tools become widespread in other regions of the world.

Under the partnership model adopted, the African institutions that contributed to the data collection retain ownership of the corpora while making them available under an open license. For Joyce Nakatumba-Nabende, a professor and researcher at Makerere University, “For AI to have a real impact in Africa, it must speak our languages and understand our contexts.”

Fiacre E. Kakpo

On the same topic
UNCDF, Co-op Bank Kenya sign guarantee to boost digital lending Risk-sharing aims expand financing access for startups, platforms Deal supports...
Côte d’Ivoire plans 15 agri-tech hubs to support women in agribusiness The centers will focus on processing, training, and digital tools The project’s...
Kenya becomes the first African country to establish a formal digital dialogue framework with the European Union. The partnership targets...
Angola’s parliament unanimously approved a startup law to address legal gaps and support innovation. Authorities set a $3.5 million annual...
Most Read
01

CCR-UEMOA presents mid-term review of private sector competitiveness efforts Reforms, AfCFTA trai...

Strengthening the Business Climate in WAEMU Countries: CCR-UEMOA Reviews Its Midterm Record
02

Telecel Ghana to boost network investment by 150% in 2026 Expansion targets capacity, reliabi...

Telecel Ghana plans 150% investment increase in MTN-dominated market
03

Togo parliament adopts WAEMU law against currency counterfeiting Bill defines offences including ...

Togo Passes Law to Criminalize Counterfeiting of West African CFA Franc
04

Namibia and Russia agreed to expand cooperation across energy, mining, and agriculture. Both coun...

Namibia and Russia Expand Economic Cooperation Across Key Sectors
05

Cameroon signs MoUs for $1.5 billion waste-to-energy projects Plans target waste treat...

Cameroon Signs $1.5 Billion Waste-to-Energy MoUs Amid Urban Sanitation Strain
Enter your email to receive our newsletter

Ecofin Agency provides daily coverage of nine key African economic sectors: public management, finance, telecoms, agribusiness, mining, energy, transport, communication, and education.
It also designs and manages specialized media, both online and print, for African institutions and publishers.

SALES & ADVERTISING

regie@agenceecofin.com 
Tél: +41 22 301 96 11 
Mob: +41 78 699 13 72


EDITORIAL
redaction@agenceecofin.com

More information
Team
Publisher

ECOFIN AGENCY

Mediamania Sarl
Rue du Léman, 6
1201 Geneva
Switzerland

 

Ecofin Agency is a sector-focused economic news agency, founded in December 2010. Its web platform was launched in June 2011. ©Mediamania.

 
 

Please publish modules in offcanvas position.