2 down, keep it up!<playIcon></playIcon>
2 hasi, komeza!
Abkhaz
Icyabukazi
Abkhaz
Icyabukazi
Accent
Imvugo
Acehnese
Icyacenezi
Acehnese
Icyacenezi
{ $actionType }<playIcon></playIcon> did they accurately speak the sentence?
ese bavuze interuro ku buryo bunoze?
{ $actionType }<recordIcon></recordIcon> then read the sentence aloud
hanyuma usome interuro uranguruye ijwi
{ $actionType }<stopIcon></stopIcon> when done
igihe bikozwe
{ $actionType } submit when ready
ohereza niba witeguye
Add an avatar to your profile
Shyira ishusho ndanga ku isura ndanga yawe
Additional Language
Urundi rurimi
Add Language
Ongeraho ururimi
Adyghe
Icyadige
Adyghe
Icyadige
Afrikaans
Icyafurikanzi
Afrikaans
Icyafurikanzi
Age
Imyaka
Albanian
Ikinyarubaniya
Albanian
Ikinyarubaniya
All
Byose
All voice clips in the dataset are scrubbed of personally identifying information. When a contributor provides demographic data via their profile, that information is de-identified from their voice clips before being bundled for download in the dataset and is never made public on their profile page.
Amajwi yose ari mu ikusanyirizo atandukanywa n'irangamimerere ry'uwayatanze. Igihe utanga umusanzu atanze amakuru amwerekeyeho binyuze ku isura ndanga ye, ayo makuru atandukanywa n'amajwi atanze mbere yo kuyagena ngo ashyirwe aho ashobora kumanurwa na buri wese mu ikusanyirizo kandi ntiyigera ashyirwa ku mugaragaro kuri paji yabo iranga isura ndanga.
Amharic
Icyamuhari
Amharic
Icyamuhari
Anonymized user data like age, sex, and accent helps improve the audio data used to train the accuracy of speech recognition engines. Your username and email will never be associated with your submitted data, and you can choose whether to make your username public or anonymous.
Imbonwa zitagaragazwa nk'imyaka, igistina n'imvugo zifasha kunoza imbonwa z'amajwi zakoreshejwe mu kwitoza ukuboneza kw'imashini ntahura mvugo.
Arabic
Icyarabu
Arabic
Icyarabu
Aragonese
Icyaragonezi
Aragonese
Icyaragonezi
Assamese
Icyasamezi
Assamese
Icyasamezi
Asturian
Icyasituriya
Asturian
Icyasituriya
Audio Format
Uburyo amajwi ateye
Avatar
Ishusho ndanga
Azerbaijani
Icyazerebayija
Azerbaijani
Icyazerebayija
Back to Top
Subira hejuru
Basque
Ikibasike
Basque
Ikibasike
Benefits
Urwunguko
Bengali
Ikibengari
Bengali
Ikibengari
<bold>{ $count }</bold> Clips
Amajwi
<bold>Help us</bold> find more voices
Dufashe kubona andi majwi
<bold>iOS</bold> users can download our free app:
Abakoresha iOS bashobora kumanura porogaramu yacu ku buntu:
Breton
Ikibureto
Breton
Ikibureto
Build Profile
Gukora isura ndanga
Bulgarian
Ikinyaburugariya
Bulgarian
Ikinyaburugariya
Buryat
Ikiburyati
Buryat
Ikiburyati
<b>Why an email?</b> We may need to contact you in the future about changes to the dataset, an email provides us a point of contact.
Kuki imeri ari ngombwa? Tuzakenera kukubaza ibyahindutse ku ikusanyirizo ry'imbonwa, bityo imeri ikaba inzira yo kukugeraho.
By opting in to receive emails you state that you are okay with Mozilla handling this info as explained in Mozilla’s <privacyLink>Privacy Policy<privacyLink>.
Mu gihe wemeye kwakira imeri uba wemeje ko Mozilla ikoresha ayo makuru nk'uko bisobanuye muri poritiki bwite yayo.
By opting in to receive emails you state that you are okay with Mozilla handling this info as explained in Mozilla’s <privacyLink>Privacy Policy<privacyLink>.
Mu gihe wemeye kwakira imeri uba wemeje ko Mozilla ikoresha ayo makuru nk'uko bisobanuye muri poritiki bwite yayo.
<b>You agree</b> to not attempt to determine the identity of speakers in the Common Voice dataset
Wemeye ko utazagerageza kumenya irangamimerere ry'abavuze batanga amajwi mu ikusanyirizo ry'imbonwa rya porogaramu y'Ijwi Rusange.
By providing some information about yourself, the audio data you submit to Common Voice will be more useful to Speech
Recognition engines that use this data to improve their accuracy.
Mu gutanga amwe mu makuru akwerekeyeho, imbonwa z'amajwi watanze ku Ijwi Rusange (Common Voice) zizaba ingirakamaro cyane ku Mvugo. Imashini zitahura amajwi zikoresha izi mbonwa mu kuziboneza.
By using Common Voice, you agree to our <termsLink>Terms</termsLink> and <privacyLink>Privacy Notice</privacyLink>
Mu gukoresha porogaramu y'Ijwi Rusange, wemeye amabwiriza yacu n'ibirebana n'amakuru bwite adashyirwa ahagaragara.
Cancel Re-recording
Kuraho ibifashwe bundi bushya
Cancel Submission
Hagarika, reka kohereza.
Catalan
Igikatara
Catalan
Igikatara
Change your email via Settings under Login Identity
Hindura imeri yawe unyuze mu buryo mfashagena munsi y'ahagenewe irangamimerere ry'ahinjirirwa
Chinese (China)
Igishinwa (cyo mu Bushinwa)
Chinese (China)
Igishinwa (cyo mu Bushinwa)
Chinese (Hong Kong)
Igishinwa (cyo muri Hongukongo)
Chinese (Hong Kong)
Igishinwa (cyo muri Hongukongo)
Chinese (Taiwan)
Igishinwa (cyo muri Tayiwani)
Chinese (Taiwan)
Igishinwa (cyo muri Tayiwani)
Chuvash
Igicuvashi
Chuvash
Igicuvashi
Click
Kanda
Clips recorded
Amajwi yafashwe
Clips Uploaded
Amajwi yafashwe yinjijwemo
Clips validated
Amajwi yemejwe
Clips You've Recorded
Amajwi
Clips You've Validated
Amajwi wemeje
Close
Funga
Collecting sentences from the public domain, or writing new ones for the public domain.
Gukusanya interuro zikoreshwa mu bantu bose, cyangwa kwandika izindi nshya zakoreshwa na bose.
Common Voice data plus all other voice datasets above.
Porogaramu y'Ijwi Rusange hiyongereyeho ayandi makusanyirizo y'imbonwa z'amajwi ziri hejuru aha.
Common Voice is a crowdsourcing platform, and the languages were all added by volunteers.
We would love for you to add your language! <languageRequestLink>Ask about adding your language.</languageRequestLink>
Mozilla nta rurimi irutisha ururndi. Ahubwo Ijwi Rusange (Common Voice) ni igikorwa kerekeye rubanda, ariko kigira ibyiciro byinshi mbere yo kongeramo ururimi no gutangira kwegeranya amajwi atangwa n'abaruvuga. Icya mbere ni uko urubuga rw'Ijwi Rusange rukeneye guhindurwa mu rurimi kugira ngo abaruvuga bashobore gusobanukirwa n'ibyo abantu bashyiraho mu rurimi rwabo. Ikindi ni uko dukeneye kwegeranya interuro nyinshi zidakumirwa n'uburenganzira bw'umutungo mu by'ubwenge bw'abazivuze abantu basoma baranguruye.Iyo ibi byombi byubahirijwe ururimi rushyirwa ku Ijwi Rusange (Common Voice) abantu bagatangira gufata amajwi yabo cyangwa kwemeza ubuhame bw'amajwi yatanzwe n'abandi.
Common Voice is Mozilla's initiative to help teach machines how real people speak.
Ijwi Rusange (Common Voice) ni igikorwa cya Mozilla kigamije gufasha kwigisha amamashini uko abantu nyakuri bavuga.
Common Voice recordings are used by academics, small businesses, and voice recognition enthusiasts to help train and grow publicly available resources like voice models.
Can you let us know why you would like your recordings deleted?
Amajwi yafaswe ya porogaramu y'Ijwi Rusange akoreshwa n'abarimu ba za kaminuza, abafite ibikorwa by'ubucuruzi biciriritse, n'abashishikazwa no guhugukira gutahura amajwi bashaka gufasha mu gutoza no kumenyekanisha amavomo ariho nka za moderi z'amajwi. Ese watubwira impamvu wifuza ko amajwi yawe asibwa?
Connect with Gravatar
Ikonegite unyuze kuri Garavatari (Gravatar).
Contact
Aho umuntu abarizwa
Contact Form
Urupapuro rwuzuzwaho aho umuntu yaboneka, aho abarizwa
Content available under a <licenseLink>Creative Commons license</licenseLink>
Amakuru arimo asomwa gusa n'ufite uruhushya rw'isosiyete nyoroherezabahanzi (Creative Commons).
Contribute
Fasha, tanga umusanzu
Contribute
Tanga umusanzu/ fasha
Contribute
Fasha, tanga umusanzu
Contribute to { $lang }
Fasha mu, tanga umusanzu mu
Contribute to { $lang }
Fasha mu, tanga umusanzu mu
Contribute Your Voice
Tanga umusanzu w'ijwi
Contribution Activity
Igikorwa ntangamusanzu, igikorwa cyo gufasha
Contribution Experience
Uburambe mu gutanga umusanzu
Cookies
Amakuru nyoboranzira
Cornish
Igikorunishi
Cornish
Igikorunishi
{ $count }mo
Ikindi
{ $count }wk
Ikindi cyumweru
{ $count }y
Ikindi
Croatian
Ikinyakorowasiya
Czech
Igiceke
Czech
Igiceke
Danish
Ikinyadanemariki
Danish
Ikinyadanemariki
Dashboard
Imbonerahamwe ngenzuzi
Datasets
Ikusanyirizo ry'imbonwa
Days
Iminsi
De-identified
Itandukanya ry'amakuru y'irangamimerere n'amakuru yandi.
Delete my recordings
Gusiba amajwi nafashe
Delete Profile
Siba isura ndanga
Dhivehi
Ikimaridiviye/ Ikidivehi
Dhivehi
Ikimaridiviye/ Ikidivehi
Donate your voice
Tanga ijwi ryawe
Don't see your language on Common Voice yet?
Ntubona ururimi rwawe ku Ijwi Rusange (Common Voice)?
Don’t see your language reflected in the Dataset? To request a language head over to our Languages page.
Ntabwo ubona ururimi rwawe rugaragara mu ikusanyirizo ry'imbonwa? Kugira ngo usabe ururimi ko rujyaho jya ku ipaji yacu y'indimi.
Download Common Voice Data
Manura imbonwa zo muri porogaramu y'Ijwi Rusange (Common Voice)
Download Data
Manura imbonwa
Download Dataset Bundle
Manura umuzingo w'ikusanyirizo ry'imbonwa
Download { $language }
Manura
Download My Data
Kumanura/ Manura imbonwa zinyerekeyeho
Do you have ideas on how we can make the Common Voice dataset better? Let us know on Discourse
Hari ibitekerezo ufite by'uburyo twanoza ikusanyirizo ry'imbonwa za porogaramu y'Ijwi Rusange? Kitubwire ku rubuga duhuriraho rw'Ijambo.
Drag and drop or <browseWrap>Browse</browseWrap>
Kurura unarekure cyangwa utambagire
During contribution submission feedback will be skipped after clicking 'Submit. Contribution will continue directly with the next set of 5 recordings or validations.
Mu gutanga umusanzu amakuru ngarukira y'iyohereza azasimbukwa nyuma yo gukoanda "Ohereza". Ibyoherejwe nk'umusanzu mutanze bizakomezanya n'indundo 5 z'amajwi yafashwe cyangwa z'ibyemejwe.
Dutch
Igihorandi
Dutch
Igihorandi
Each entry in the dataset consists of a unique MP3 and corresponding text file. Many of the <b>{ $total }</b> recorded hours in the dataset also include demographic metadata like age, sex, and accent that can help train the accuracy of speech recognition engines.
The dataset currently consists of <b>{ $valid }</b> validated hours in <b>{ $languages }</b> languages, but we’re always adding more voices and languages. Take a look at our <languagesLink>Languages page</languagesLink> to request a language or start contributing.
Buri kintu kiri mu ikusanyirizo ry'imbonwa z'amajwi kigizwe n'ijwi ryafashwe n'idosiye yumwandiko waryo. Menshi mu masaha y'ibyafashwe ari mu ikusanyirizo ry'imbonwa na yo arimo amakuru bwite nk'imyaka, igitsina n'imvugo y'umuntu bishobora gufasha gutoza imashini ntahurajwi kuboneza neza.
Edit
Kosora/ hindura
Edit Profile
Kosora isura ndanga
Email
Imeri
Email
Imeri
Email
Imeri
Email
Imeri
Email is already used for a different account
Imeri yarakoreshejwe ku yindi konti
Email Subscriptions
Isaba ryo kohererezwa imeri.
English
Icyongereza
English
Icyongereza
English
Icyongereza
Enter Email to Download
Shyiramo imeri kugira ngo umanure
Enter your email
Injiza imeri yawe
Erzya
Icyeriziya
Erzya
Icyeriziya
Esperanto
Ikesiperanto
Esperanto
Ikesiperanto
Estonian
Ikinyesitoniya
Estonian
Ikinyesitoniya
Everyone
Buri wese
Exit & Delete clips
Sohoka & siba amajwi yafashwe
FAQ
Ibibazo bikunze kubazwa
Faroese
Ikinyaferowe
Faroese
Ikinyaferowe
Female
Gore
Finish recording
Kurangiza gufata amajwi
Finish recording first?
Mbere na mbere kurangiza gufata amajwi?
Finnish
Ikinyafinirande
Finnish
Ikinyafinirande
For these launched languages the website has been successfully localized, and has enough sentences collected, to allow for ongoing <italic>Speak</italic> and <italic>Listen</italic> contribution.
Ku ndimi zashyizweho urubuga rwahise rugaragara neza aho ruri, kandi rufite interuro zihagije zakusanyijwe, guha inzira ibigikorwa by'umusanzu wo Kuvuga no Kumva.
French
Igifaransa
French
Igifaransa
Frequently Asked Questions
Ibibazo bikunze kubazwa
Frisian
Igifirizone
Frisian
Igifirizone
GB
Jigabayiti
Georgian
Ikinyajeworujiya
Georgian
Ikinyajeworujiya
German
Ikidage
German
Ikidage
Get Involved
Gira uruhare
Get Involved
Gira uruhare
Get Started with Speech Recognition
Tangira gukoresha Itahurajwi
Glossary
Urutonde rw'amuga
Goals
Imigambi
Go to Discourse
Jya ku rubuga duhuriraho rw'Ijambo
Go to Languages Page
Jya kuri paji y'indimi
Go to { $name }
Jya kuri...
Great!<recordIcon></recordIcon> Record your next clip
Ni byiza cyane! Fata amajwi yawe akurikiraho.
Great work!<playIcon></playIcon> Listen again when you're ready
Wakoze neza! Ongera wumve igihe wumva witeguye
Greek
Ikigereki
Greek
Ikigereki
Hakha Chin
Igihakacini
Hakha Chin
Igihakacini
Have Feedback?
Ufite amakuru ngarukira?
Have questions about Common Voice? Join us on our <discourseLink>Discourse forum</discourseLink>.
Ufite ibibazo birebana na porogaramu y'Ijwi Rusange? Twegera ku rubuga duhuriraho rw'Ijambo
Have you read our Terms?
Wasomye amabwiriza yacu?
Having a profile is not required to contribute though it is helpful
Kugira ibigwi bikomeye si byo ngombwa nubwo byafasha.
Hebrew
Igiheburayo
Hebrew
Igiheburayo
Help
Fasha
Help teach machines how real people speak, donate your voice at { $link }
Fasha kwigisha imashini uko abantu nyakuri bavuga, tanga ijwi ryawe kuri...
Help us build a community around voice technology, stay in touch via email.
Dufashe gukora umuryango w'abakoresha ikoranabuhanga ry'amajwi, komeza ubane natwe ukoresheje imeri.
Help us build a high quality, publicly open dataset
Dufashe gukora ikusanyirizo rifite ireme rikomeye kandi rikoreshwa na buri wese.
Help us find others to donate their voice!
Dufashe kubona abandi gutanga ijwi ryabo
Help us get to { $goal }
Dufashe ku.....
Help us validate sentences!
Dufashe kwemeza interuro!
Help us validate voices
Dufashe kwemeza amajwi
Hidden
Bihishwe
Hill Mari
Igihirimari
Hill Mari
Igihirimari
Hours Recorded
Amasaha yafashwe
Hours Validated
Amasaha yemejwe
{ $hours } validated hours so far!
andi masaha yemejwe kugeza ubu!
How can I get the Common Voice data?
Nabona nte imbonwa z'Ijwi Rusange (Common Voice)?
How does Common Voice calculate hours?
Porogaramu y'Ijwi Rusange ibara ite amasaha?
How do you ensure anonymity and privacy of the people who donated their voices?
Mubigenza mute ngo abantu batanze amajwi yabo batamenywa na bose kandi amakuru yabo bwite abe ibanga?
Hungarian
Ikinyahongiriya
Hungarian
Ikinyahongiriya
I agree
Ndabyemeye
I am a non-native speaker and I speak with an accent, do you still want my voice?
Ntabwo ndi kavukire w'uru rurimi bigatuma ntaruvuga nka we, ese muracyashaka ko nabaha ijwi ryange?
Icelandic
Ikinyisirande
Icelandic
Ikinyisirande
I'd like updates and to keep current with what's happening with Common Voice.
Nifuza kumenya amakuru y'ibirimo gukorwa ku Ijwi Rusange (Common Voice).
I do not agree
Ntabwo mbyemeye
I’m afraid I don’t know what you’re looking for.
Mpagaritswe umutima n'uko ntazi icyo ushaka.
I'm okay with you handling this info as you explain in Mozilla's <privacyLink>Privacy Policy</privacyLink>
Uburyo mwasobanuye muri poritiki zirebana n'amakuru bwite za Mizilla mukoreshamo aya makuru ndabwemera
* Indicates required field
*Yemeje ingeri ikenewe
* Indicates required field
*Yemeje ingeri ikenewe
Indonesian
Ikinyendoneziya
Indonesian
Ikinyendoneziya
In Progress
Biracyakomeza
In progress languages are currently being built for contribution by our communities; their progress reflects where they are across the website localization and sentence collection phases.
Indimi zigishyirwa muri porogaramu y'Ijwi Rusange zirimo gutunganywa kugira ngo abazikoresha batange umusanzu; uko bigenda bikorwa bigaragaza aho ziri ku rubuga n'ibyiciro byo gukusanya interuro.
Interlingua
Uruhuzandimi
Interlingua
Uruhuzandimi
Irish
Ikinyirirande
Irish
Ikinyirirande
Is my account information public?
Amakuru yange yose abonwa na buri wese?
Is the goal of Common Voice to build a voice assistant?
Ese intego/ umugambi w'Ijwi Rusange ni ugukora porogaramu ifasha ijwi?
Italian
Igitariyani
Italian
Igitariyani
Japanese
Ikiyapani
Japanese
Ikiyapani
Kabyle
Igikabire
Kabyle
Igikabire
Kaqchikel
Igikacikeri
Kaqchikel
Igikacikeri
Kazakh
Igikazake
Kazakh
Igikazake
Keep
Kuyagumishamo
Keep it up, record again <recordIcon></recordIcon>
Komereza aho, ongera ufate amajwi!
Keep the recordings
Komeza ufate amajwi
Keep track of your progress and metrics across multiple languages.
Komeza ugenzure uko ibyo ukora bigenda mu ndimi nyinshi.
Keep track of your progress with a profile and help our voice data be more accurate.
Komeza ugenzure uko ibintu bigenda unyuze ku isura ndanga unadufashe kunoza imbonwa z'amajwi zacu.
Kinyarwanda
Ikinyarwanda
Komi-Zyrian
Igikomi
Komi-Zyrian
Igikomi
Korean
Igikoreya
Korean
Igikoreya
Kyrgyz
Igikirigize
Kyrgyz
Igikirigize
Language
Ururimi
Language
Ururimi
Language Request
Gusaba ururimi
Language request successfully submitted, thank you.
Gusaba ururimi byoherejwe neza, urakoze.
Languages
Indimi
Launched
Byatangijwe / zashyizweho
Leaderboard Visibility
Uko imbonerahamwe y'ubwitabire igaragarira bose
Leaving now means you'll lose your progress
Kuvaho ubu bisobanuye ko urabura amakuru y'aho wari ugeze
LibriSpeech is a corpus of approximately 1000 hours of 16Khz read English speech derived from read audiobooks from the LibriVox project.
Isomero-jambo (LibriSpeech) ni indundo y'amasaha hafi 1000 ya kiroheritse 16 z'imvugo z'Icyongereza zavuye mu majwi y'ibyasomwe mu bitabo by'umushinga w'Isomero-jwi (LibriVox).
License
Uburenganzira
License: <licenseLink>CC-0</licenseLink>
Uburenganzira
License: <licenseLink>{ $license }</licenseLink>
Uburenganzira
Link Copied
Inzira yakoporowe
Listen
Umva
Loading…
Birakinjira...
Loading…
Birakinjira...
Localization
Kugena ahantu/ ahantu
Localized
Cyagaragaye/hagaragaye
Login Identity
Injiza irangamimerere
Log In / Sign Up
Injira
Log Out
Funga imeri
Looks like there aren't any clips to listen to in this language. Help us fill the queue by recording some now.
Bisa nkaho nta majwi yo kumva muri uru rurimi. Dufashe kuzuza ahabugenewe wifate amajwi ubu.
Macedonian
Ikinyamasedoniya
Macedonian
Ikinyamasedoniya
Make your submitted data as rich as possible by providing some anonymous demographic data. We de-identify all demographic data before making it public.
Oherezanya imbonwa zawe hamwe n'andi makuru mbarurishamibare yose ya ngombwa yawe bwite. Tubanza kuyagenzura mbere yo kuyashyira ahagaragarira buri wese.
Male
Gabo
Manage Subscriptions
Genzura isaba wakoze
MB
Megabayiti
Meadow Mari
Ikimidowu
Meadow Mari
Ikimidowu
Message
Ubutumwa
Mixed
Bivanze/ ivanze
Moksha
Ikimokusha
Moksha
Ikimokusha
Mongolian
Ikimongori
Mongolian
Ikimongori
Most of the data used by large companies isn’t available to the majority of people. We think that
stifles innovation. So we’ve launched Common Voice, a project to help make voice recognition open
and accessible to everyone.
Imbonwa nyinshi zikoreshwa n'amasosiyete magari ntizigera ku bantu benshi. Dusanga ibyo ari ibyo kurushya abantu. Ni yo mpamvu twatangije uyu mushinga w'Amajwi Rusange (Common Voice) ugamije gufasha mu gutuma ijwi ryumvikana rikanamenywa na buri wese.
Most of the data used by large companies isn’t available to the majority of people. We think
that stifles innovation. So we’ve launched Project Common Voice, a project to help make voice
recognition open to everyone.
Imbonwa nyinshi zikoreshwa n'amasosiyete magari ntizigera ku bantu benshi. Dusanga ibyo ari ibyo kurushya abantu. Ni yo mpamvu twatangije uyu mushinga w'Amajwi Rusange (Common Voice) ugamije gufasha mu gutuma ijwi ryumvikana rikanamenywa na buri wese.
Most speech databases are trained with an overrepresentation of certain demographics which results in a bias towards <articleLink>male and middle class</articleLink>. Accents and dialects that tend to be under-represented in training data sets are typically associated with groups of people who are already marginalised. Many machines also struggle to understand female voices.
This is why in our voice database we want variety!
ikusanyirizo ry'imbonwa z'imvugo nyinshi zirimo amakuru bwite y'abantu atagira ingano bigatuma habaho gushyira abantu cyane ab'igitsina gabo cyangwa abacuriritse ku gihande runaka mu bijyanye n'imvugo. Imvugo n'indimi shami bisa n'ibitagwiriye mu mbonwa zigwaho bihuzwa n'amatsinda y'abantu asa n'ahabwa akato. Hari n'imashini nyinshi zihura n'ikibazo mu gutahura amajwi y'ab'igitsina gore. Iyi ni yo mpamvu dukeneye amajwi anyuranye mu mbonwa z'amajwi zacu.
Mozilla doesn’t pick or favor any one language over another. Instead, Common Voice is a purely community-driven initiative, but it takes <multilangLink>several steps to add a new language</multilangLink> and begin collecting voice donations. First, the Common Voice website needs to be translated so community members can access the contributor experience in their own language. Next, we need a large collection of copyright-free sentences for people to read outloud. Once both of those requirements are satisfied a language is “launched” on Common Voice for people to start recording their voice and validating others donations.
Mozilla nta rurimi irutisha ururndi. Ahubwo Ijwi Rusange (Common Voice) ni igikorwa kerekeye rubanda, ariko kigira ibyiciro byinshi mbere yo kongeramo ururimi no gutangira kwegeranya amajwi atangwa n'abaruvuga. Icya mbere ni uko urubuga rw'Ijwi Rusange rukeneye guhindurwa mu rurimi kugira ngo abaruvuga bashobore gusobanukirwa n'ibyo abantu bashyiraho mu rurimi rwabo. Ikindi ni uko dukeneye kwegeranya interuro nyinshi zidakumirwa n'uburenganzira bw'umutungo mu by'ubwenge bw'abazivuze abantu basoma baranguruye.Iyo ibi byombi byubahirijwe ururimi rushyirwa ku Ijwi Rusange (Common Voice) abantu bagatangira gufata amajwi yabo cyangwa kwemeza ubuhame bw'amajwi yatanzwe n'abandi.
Mozilla is dedicated to keeping the web open and accessible for everyone. To do that we need to empower web creators through projects like Common Voice. As voice technologies proliferate beyond niche applications, we believe they must serve all users equally. That means investing in more languages and accommodating diverse accents and demographics when building and testing voice technologies. Common Voice is a public resource available to everyone and Mozilla teams and developers around the world are already using it on our own projects as well.
Mozilla yiyemeje gukomeza gufungura urubuga no gutuma rugerwaho na buri wese. Kugira ngo bishoboke, ni ngombwa gufasha abakora imbuza za Webu binyuze mu mishinga nk'Ijwi Rusange. Mu gihe ikoranabuhanga ry'amajwi rirushaho kugera muri porogaramu zinyuranye, twumva zikwiye gufasha abazikoresha ku buryo bungana. Ibi bisobanuye ko ari ngombwa gukora ku ndimi zirenze rumwe no kugira imvugo n'amakuru byinshi mu gihe cyo gukora no kugerageza ikoranabuhanga ry'amajwi. Ijwi rusange ni ivomo rusange kuri buri wese, kandi itsinda rya Mozilla n'abatekinisiye bayo bari ku isi yose barayikoresha mu mishinga yacu.
n
n
Name
Izina
Native Language
Ururimi kavukire
Native Language
Ururimi kavukire
Nepali
Ikinepari
Nepali
Ikinepari
Next Goals: { $goal }
Imigambi ikurikiraho
No
Oya
No
Oya
No gravatar found for your email
Nta garavatari ibonetse ijyanye na imeri yanyu.
No microphone found.
Nta mikoro igaragaye
Norwegian Bokmål
Ikinyanoruveje cyanditse
Norwegian Bokmål
Ikinyanoruveje cyanditse
Norwegian Nynorsk
Ikinyanoruveje gishya
Norwegian Nynorsk
Ikinyanoruveje gishya
Note: You will still need to select between Speak or Listen to change contribution type.
Ikitonderwa: Muzaba mugikeneye guhitamo hagati yo Kuvuga cyangwa Kumva kugira ngo muhindure ubwoko bw'umusanzu/ ibyo muzatanga.
Not found
Ntibibonetse/ ntabonetse
No Thanks
Oya, urakoze.
Now you can donate your voice to help us build an open-source voice database that anyone can use
to make innovative apps for devices and the web. Read a sentence to help machines learn how real people speak. Check the work of other
contributors to improve the quality. It’s that simple!
Ubu rero mushobora kuduha amajwi mugafasha gushyiraho ikusanyirizo ry'imbonwa ryaguye buri wese ashobora kwifashisha agakora rojisiyeri nshya z'ibikoresho n'imbuga. Soma interuro ufashe imashini kwiga uko abantu bavuga. Genzura ibyakozwe n'abandi batanga umusanzu wabo muri uru rwego kugira ngo haboneke ibintu binoze. Ibyo biroroshye.
Number of Voices
Umubare w'amajwi (yafashwe)
Occitan
Icyogusita
Occitan
Icyogusita
Odia
Icyodiya
Odia
Icyodiya
Off
Ntibyakije/ birajimije
On
Birakije/ ntibijimije
On desktop computers, you can download the latest:
Kuri mudasobwa zitagendanwa, ushobora kumanura amakuru ya nyuma:
Optionally join on our email list for updates and new information about the project.
Ushobora kongeraho urutonde rwa imeri yacu kugira ngo ubone amakuru mashya arebana n'umushinga.
Optionally submitted demographic data (e.g. age, sex, language, and accent) is de-identified from your submitted voice data and will never be made public on your profile.
Ushobora gutanga amakuru mbarurishamibare yose ya ngombwa yawe bwite (urugero imyaka, igitsina, ururimi n'imvugo ikuranga) igenzurwa mu mbonwa z'amajwi kandi ntazigera ashyirwa ahagaragara ku isura ndanga yawe.
Other
Undi/ ikindi
Other voice datasets…
Ayandi makusanyirizo y'imbonwa z'amajwi.
Other Voice Datasets
Ayandi makusanyirizo y'imbonwa z'amajwi.
Our source text is made up of original contributor donations as well as dialogue from public domain movie scripts like <italic>It’s a Wonderful Life</italic>.
You can view our source sentences in this <githubLink>GitHub folder</githubLink>.
Imvano y'imyandiko yacu ni ibyo duhabwa n'abatanga umusanzu ku rubuga rwacu ndetse n'ibiganiro mu nyandiko firimi zo muri rubanda zishingiraho; interuro nka "Ni ubuzima buryoshye"!
Overall Accuracy
Ibiboneye byose hamwe/ Iboneza rya byose hamwe
Overall Hr. Total
Igiteranyo cy'amasaha yose
Overall project status: see how far we’ve come!
Aho umushinga ugeze muri rusange: reba iyo twatangiriye!
p
p
Persian
Igiperise
Persian
Igiperise
<playIcon></playIcon>Last one!
Rya nyuma/ ya nyuma/ cya nyuma!
Play/Stop
Vuza/Hagarika
Polish
Ikinyaporonye
Polish
Ikinyaporonye
Portuguese (Brazil)
Ikinyaporutigari (cyo muri Bureziri)
Portuguese (Brazil)
Ikinyaporutigari (cyo muri Bureziri)
Press play, listen & tell us: did they accurately speak the sentence below?
Kanda vuza, umva & utubwire: interuro zikurikira bazisomye neza?
Press { shortcut-play-toggle } to toggle play mode
Kanda...winjire mu buryo buvuza
Privacy
Amakuru yihariye bwite
Profile
Isura ndanga
Profile information improves the audio data used in training speech recognition accuracy.
Amakuru arebana n'utanga amajwi atuma imbonwa z'amajwi zikoreshwa mu kwimenyereza kumenya ukubonera kw'imvugo zinozwa.
Progress
Ibigikorwa
r
r
Read More
Soma ibindi birenzeho
Ready to do { $count } more?
Witeguye gukora birenzeho?
Ready to donate your voice?
Witeguye gutanga ijwi ryawe?
Ready to help validate sentences?
Witeguye kwemeza interuro?
Recorded Clips
Amajwi yafashwe
Recorded Hours
Amahasa y'amajwi yafashwe
<recordIcon></recordIcon> Last one!
Iya nyuma!
Recordings
Ibyafashwe
Recording voice clips is an integral part of building our open dataset; some would say it's the fun part too.
Gufata amajwi ni igice cyo gukora ikusanyirizo ryacu rikoreshwa na buri wese, bamwe bakaba basanga ari n'igice kinezeza.
Record/Stop
Fata amajwi/Hagarika
Record your voice
Fata ijwi ryawe
Remove
Kuyakuramo
Request a Language
Hamagaza ururimi
*required
*bikenewe, bisabwe
Re-record
Ongera ufate amajwi
Return to Common Voice
Subira ku Ijwi Rusange (Common Voice).
Return to Common Voice
Subira ku Ijwi Rusange (Common Voice).
Return to Common Voice Datasets
Subira ku ikusanyirizo rya porogaramu y'Ijwi Rusange
Return to Languages
Subira ku ndimi.
Return to Languages
Subira ku ndimi.
Review
Genzura, kosora, subiramo
Review & re-record clips here as you go
Subiramo ugenzure, ongera ufate amajwi hano buri gihe uhari.
Review & re-record clips if needed
Subiramo ugenzure & wongere ufate amajwi bundi bushya niba ari ngombwa
Review & Submit
Subiramo & Ohereza
Romanian
Ikirume
Romanian
Ikirume
Romansh Sursilvan
Ikiromanshe k'igisurusiriva
Romansh Sursilvan
Ikiromanshe k'igisurusiriva
Russian
Ikirusiya
Russian
Ikirusiya
s
s
Sakha
Igisaka
Sakha
Igisaka
Sardinian
Igisaride
Sardinian
Igisaride
Save
Bika/ Shyingura
Saved
Byabitswe/ Byashyinguwe
Search
Shakisha
See how your progress compares to other contributors all over the world.
Reba uko ibyo ukora bitera indi ntambwe ugereranyije n'abandi batanga umusanzu mu isi hose.
See Less
Reba bike
See More
Reba ibirenzeho
Sentence Collection
Gukusanya interuro
Sentences
Interuro
Serbian
Igiseribe
Serbian
Igiseribe
Settings
Uburyo mfashagena
Sex
Igitsina
Shortcuts
Iza bugufi
Sign up for an account
Injira ufungure konti
sign up for email updates
Injira utange andi makuru ya imeri
Sign up for { $lang } updates:
Hamya ko....amakuru mashya:
Sign up for { $lang } updates:
Hamya ko....amakuru mashya:
Size
Ingano
Skip
Simbuka
Skip Submission Feedback
Simbuka amakuru ngarukira y'iyohereza.
Slovak
Igisirovake
Slovak
Igisirovake
Slovenian
Ikinyasiroveniya
Slovenian
Ikinyasiroveniya
Sorbian, Lower
Igisorube cy'epfo
Sorbian, Lower
Igisorube cy'epfo
Sorbian, Upper
Igisorube cya ruguru
Sorbian, Upper
Igisorube cya ruguru
Spanish
Igihisipaniya
Spanish
Igihisipaniya
Speak
Vuga
Speak
Vuga
Speakers
Abavuze/ abavuga
Speak now
Vuga ubungubu
Speak now
Vuga ubungubu
Speak up, contribute here!
Vuga, tanga umusanzu hano!
<speechBlogLink>Get Started with Speech Recognition</speechBlogLink>
Tangira itahurajwi/ itahuramvugo
Speech is often the most natural way we communicate with each other and voice technologies are bringing that convenience to our computers and mobile devices. We want to empower developers to build amazing voice recognition applications like real-time translators and voice-enabled digital assistants. But right now most of the voice data required to build these kinds of apps is expensive and proprietary. We hope the Common Voice dataset will give developers what they need to innovate and make speech technology available in their own language.
To make voice recognition even more universal, we're collecting voice samples in widely spoken languages as well as those with a smaller population of speakers often underserved by commercial speech recognition services. Publishing a diverse dataset of voices will empower developers, entrepreneurs, and entire speech communities to address this gap themselves.
Imvugo ni bwo buryo kamere dukoresha duhana amakuru, ikoranabuhanga ry'amajwi na ryo rikaba rituma bwinjira no muri za mudasobwa zacu na terefone n'ibindi biteye nka yo. Turifuza guha abatekinisiye babikora ubushobozi bwo gukora porogaramu ntahurajwi zishimishije z'imisusire nk'iya za porogaramu zihindura indimi ako kanya, cyangwa izifasha mu kuboneza amajwi muri mudasobwa. Ariko kugeza ubu, imbonwa nyinshi z'amajwi zikenewe mu gukora izi porogaramu zirahenze kandi zihariwe na bene zo gusa. Twizeye neza ko ikusanyirizo ry'Ijwi Rusange (Common Voice) izafasha abakora porogaramu zacu mu byo bakeneye byose bakora ibintu bishya bakanatuma ikoranabuhanga mvugo rigera mu rurimi rwabo. Mu kugira ngo itahurajwi ribe hose, dukusanya amajwi mu ndimi zivugwa ku isi yose tutaretse n'izivugwa n'abantu bake yewe zidakenewe na za serivise ntahurajwi zicuruza. Gutangaza ikusanyirizo ry'amajwi anyuranye bizafasha abatekinisiye bakora za porogaramu, ba rwiyemezamirimo n'abakoresha indimi gukemura iki kibazo bo ubwabo.
Speech-to-text (STT)
Kuva ku mvugo bigana ku mwandiko.
Speech-to-text (STT) technologies convert voice data into text.
Ikoranabuhanga rijyana imvugo mu mwandiko rihindura imbonwa z'ijwi mo umwandiko.
Splits
Amatandukaniro manini
Start recording
Tangira ufate amajwi
Stats
Ibyerekanwa n'ibarurishamibare
Streaks
Inkora/ ibirari
Submit
Ohereza
Submit
Ohereza
Submit
Ohereza
Submit
Ohereza
Submit clips
Ohereza amajwi yafashwe
Subscribe
Iyandikishe
Success, profile created!
Byashobotse, isura ndanga yakozwe.
Swedish
Ikinyasuwede
Swedish
Ikinyasuwede
Tamil
Igitamuru
Tamil
Igitamuru
Tap
Andika
Tatar
Igitamari
Tatar
Igitamari
Tatoeba is a large database of sentences, translations, and spoken audio for use in language learning. This download contains spoken English recorded by their community.
Tatoyeba (Tatoeba) ni indundo y'imbonwa z'interuro, amagambo yahinduwe mu ndimi n'amajwi yavuzwe igomba gukoreshwa mu kwiga ururimi. Ibi byamanurwa kandi birimo Icyongereza mvugo byafashwe n'abavuga ururimi.
TED-LIUM Corpus
Indundo ya Tederiyumu (TED-LIUM)
Telugu
Igiterugu
Telugu
Igiterugu
Terms
Amuga/ amabwiriza
Thai
Igitayi
Thai
Igitayi
Thanks for confirming your account, now let's build your profile.
Urakoze kuba wemeje konti yawe, reka dukore isura ndanga yawe.
Thank you for recording!<lineBreak></lineBreak>Now review and submit your clips below.
Urakoze gufata amajwi! Subira mu majwi yawe wafashe akurikira uyagenzure kandi uyohereze.
Thank you for your interest in contributing to { $lang }. We work hard to get every language ready for launch and keep
the teams updated via email. If you want to contribute, please provide your email below.
Urakoze kuba wishimiye gufasha mu ( ). Dukora ibishoboka byose kugira ngo ibya buri rurimi bitangire no guhora duha amatsinda yose amakuru akenewe twifashishije imeri. niba ushaka gutanga umusanzu wawe, ohereza imeri yawe hasi aha.
Thank you for your interest in contributing to { $lang }. We work hard to get every language ready for launch and keep
the teams updated via email. If you want to contribute, please provide your email below.
Urakoze kuba wishimiye gufasha mu ( ). Dukora ibishoboka byose kugira ngo ibya buri rurimi bitangire no guhora duha amatsinda yose amakuru akenewe twifashishije imeri. niba ushaka gutanga umusanzu wawe, ohereza imeri yawe hasi aha.
The Common Voice dataset complements Mozilla’s open source voice recognition engine Deep Speech. The first version of Deep Speech was released in November 2017 and has continued to evolve ever since. Together with the Common Voice dataset, we believe this open source voice recognition technology should be available to everybody. It’s our hope these technologies will enable developers to build a wave of innovative products and services.
Ijwi Rusange (Common Voice) yuzuza ivomo rusange kuri bose rya porogaramu y'Imvugo Inimbitse (Deep Speech) ya Mozilla. Ubwoko bwa mbere bw'iyi porogaramu bwasohowe mu Ugushyingo 2017 hanyuma yakomeje kuvugururwa. Duhereye ku ikusanyirizo ry'Ijwi Rusange, dutekereza ko iri koranabuhanga ntahurajwi ry'ivomo rifunguriye buri wese rizagera kuri buri wese.Twizera neza ko iri koranabuhanga rizafasha abakora porogaramu gukora ibintu bishya na serivisi byinshi.
The Common Voice dataset complements Mozilla’s open source voice recognition engine Deep Speech, which you can use to build speech recognition applications. Read our <githubLink>Github overview</githubLink> or join the <discourseLink>DeepSpeech Discourse</discourseLink> to learn how to get started.
Ikusanyirizo ry'imbonwa za porogaramu y'Ijwi rusange ryuzuza ivomo rikinguriwe buri wese ry'imashini ntahuramvugo inimbitse, ukaba ushobora kurikoresha ukora porogaramu ntahuramvugo. Soma inshamake ya serivisi yacu ya GitHub cyangwa winjire muri porogaramu y'Ijambo ry'imvugo inimbitse kugira ngo umenye aho wahera.
The Common Voice dataset is an open and publicly available resource that can be used to train a wide variety of speech-enabled applications. To protect the security of our contributors, we ask everyone who downloads the Common Voice dataset to respect contributors’ privacy.
All voice clips in the dataset are scrubbed of personally identifying information. When you download the dataset, you agree to not attempt to determine the identity of any contributor. That means you cannot try to link information in the dataset to a contributor’s personal information. You may, however, use the dataset to train speech recognition, speaker recognition, or other applications, by, for instance, linking information in the dataset to other information already in the dataset.
Ikusanyirizo rya porogaramu y'Ijwi Rusange ni ivomo rusange kandi rifunguye kuri buri wese rishobora gukoreshwa mu gutoza za porogaramu zinyuranye nyinshi zikoreshwa muri aya majwi. Mu kwita ku mutekano w'abaduha umusanzu wabo, dusaba uwo ari we wese umanuye ibiri mu ikusanyirizo rya porogaramu y'Ijwi Rusange kubahiriza amakuru y'umwihariko y'ibanga y'abatangamusanzu. Amajwi yose ari mu ikusanyirizo atandukanywa n'amakuru bwite y'abatangamusanzu. Iyo umanuye ibyo mu ikusanyirizo, uba wiyemeje kutagira icyo ukora ku makuru y'umutangamusanzu wese. Bisobanuye ko udashobora kugerageza guhuza ibyo usanzemo n'amakuru areba uwatanze umusanzu mu gutanga ijwi. Ushobora gusa gukoresha ikusanyirizo kwitoza gutahura ijwi, gutahura uvuga, cyangwa izindi porogaramu, uhuza amakuru amwe usanze mu ikusanyirizo n'ayandi aririrmo.
The Common Voice dataset is available for download under the <licenseLink>CC0</licenseLink> license on <datasetLink>our Datasets page</datasetLink>. You can also download several other publicly available datasets from the same page.
Ikusanyirizo ry'Ijwi Rusange (Common data) ririho kuri buri wese washaka kumanura ibyo yifuza abifitiye uburenganzira kuri paji yacu y'amakusanyirizo. Ushobora kandi kumanura andi makusanyirizo yashyizwe ahagaragara ari kuri iyo paji.
The count of voice recording hours that have been validated by 2 out of 3 users with a vote of “Yes”. These mark progress toward the overall project 10k hours goal.
Ibara ry'amasaha y'ifata ry'ijwi ryemejwe n'abakoresha ururimi babiri kuri batatu bakoresheje gutora na "Yego". Ibi byerekana aho gahunda igeze yo kugera ku masaha ibihumbi 10.000 umushinga ufiteho umugambi.
The count of voice recording hours we have collected so far.
Ibara ry'amasaha y'ifata ry'ijwi yakusanyijwe yose.
The goal of the Common Voice dataset is to enable anyone in the world to build speech recognition, speaker recognition, or any other type of application that requires voice data. A voice assistant is just one of many types of applications you could use the dataset to build.
Umugambi w'Ikusanyirizo ryIjwi Rusange ni ugufasha buri wese ku isi gukora intahurajwi, intahura nyakuvuga, cyangwa indi porogaramu yose ikoresha imbonwa z'ijwi. Porogaramu mfashajwi ni imwe muri nyinshi wakoresha mu gukora ikusanyirizo majwi.
The multi-language version of the Common Voice dataset is currently undergoing community supported bundling and cleaning. If you would like to learn more about supporting this effort, please <contactLink>contact us</contactLink>. We are currently targeting a publish date of January 2019. After that, we’ll update the dataset periodically with new languages and voice clips as they become available. An iterative release cycle cadence is still to be determined.
Uburyo mpuzandimi bw'ikusanyirizo ry'Ijwi Rusange (Common Voice) burakorwaho ubu bugaterwa inkunga bunanozwa. Niba ushaka kumenya ibiruseho ku buryo buterwa inkunga, twandikire. Turashaka itariki twabitangazaho muri Mutarama 2019. Nyuma tuzajya dushyiraho amakuru buri gihe ku birebana n'indimi nshya n'amajwi mashya uko azajya atugeraho. Uburyo bwo guhanahana ayo makuru buracyatekerezwaho.
The number of recordings and which languages you contribute to will be public.
Umubare w'ibyo washyizeho n'indimi watanzeho umusanzu byo bizaba rusange (bigaragara).
The process by which a contributor’s profile information is obscured from their donated voice clips when packaged for download as a part of the dataset.
Guhisha amakuru bwite arebana n'abatanga amajwi agatandukana n'amajwi batanze bikorwa mu gihe aya majwi akubirwa hamwe ngo amanurwe n'ushaka kuyakoresha ayavanye mu ikusanyirizo.
The recording was too long.
Ibyafashwe bwayi birebire cyane.
The recording was too quiet.
Ibyafashwe byumvikanaga buhoro.
The recording was too short.
Ibyafashwe byari bigufi cyane.
The selected file is too large
Idosiye mwahisemo ni ngari cyane
The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website.
Indundo ya Tediriyumu yakozwe mu biganiro by'amajwi n'interuro zayo biboneka ku rubuga rwa Tedi.
This is approximately the number of hours required to train a production speech-to-text system.
Uyu ni umubare ugereranyije w'amasaha ngombwa ku gutoza gukora sisitemu njyanajwi ku mwandiko.
This is our process for translating and adapting our content for many locales (languages).
Ubu ni uburyo dukoresha mu guhindura ururimi mu rundi no guhuza ibyo dushyira hano n'indimi zikoreshwa mu gace runaka.
Three to go!
Hasigaye bitatu, eshatu, atatu
Today
None/ uyu munsi
Today's Common Voice progress on clips recorded
Aho iby'amajwi yafashwe kugeza ubu muri porogaramu y'Ijwi Rusange bigeze
Today's Common Voice progress on clips validated
Aho iby'amajwi yemejwe kugeza ubu muri porogaramu y'Ijwi Rusange bigeze
Today's Progress
Uko ibintu byifasha ubu.
To make the Common Voice dataset as useful as possible we have decided to only allow source text that is available under a Creative Commons (CC0) license. Using the CC0 standard means its more difficult to find and collect source text, but allows anyone to use the resulting voice data without usage restrictions or authorization from Mozilla. Ultimately, we want to make the multi-language dataset as useful as possible to everyone, including researchers, universities, startups, governments, social purpose organizations, and hobbyists.
Kugira ngi iyi porogaramu y'ikusanyirizo ry'Ijwi Rusange "Common Voice" ishobore gukoreshwa byoroshye cyane twemeje gukoresha gusa imvano y'imyandiko irinzwe n'isosiyete irinda uburenganzira bw'umuhanzi ya CCO. Gukoresha aya mabwiriza bisobanuye ko bigoye kugera ku mvano y'imyandiko, ariko biha buri wese uburenganzira bwo gukoresha imbonwa z'amajwi nta kumira ribayeho cyangwa kwaka uburenganzira bwa Mozilla. Ikiruseho ni uko dushaka ko ikusanyirizo mpuzandimi rigerwaho na buri wese harimo abashakashatsi, abarimu ba kaminuza, abafite imishinga iciriritse, za Leta, imiryango yita ku mibereho ya rubanda, n'abandi bishimishije.
Top Contributors
Abagira uruhare kurusha abandi
Total
Igiteranyo
Total Approved
Igiteranyo kemejwe
Toward next goal
Kwerekeza ku wundi mugambi/ ku yindi ntego
Turkish
Igituruke
Turkish
Igituruke
Ubykh
Icyubike
Ubykh
Icyubike
Udmurt
Icyudimuru
Udmurt
Icyudimuru
Ukrainian
Ikinyakereni
Ukrainian
Ikinyakereni
Unable to speak right now?
Gushoboza umuntu kuvuga ubu ngubu?
Upload aborted. Do you want to delete your recordings?
Kwinjiza birananiranye. Urashaka gusiba amajwi wafashe?
Upload an image file
Injiza idosiye y'ishusho
Urdu
Icyurudu
Urdu
Icyurudu
User Name
Izina ndanga
Uzbek
Icyuzubeki
Uzbek
Icyuzubeki
Validated Clips
Amajwi yemejwe
Validated Hours
Amasaha yemejwe
Validated Hrs
Amasaha yemejwe
Validated Hr. Total
Igiteranyo cy'amasaha yemejwe
Validating donated clips is equally important to the Common Voice mission. Take a listen and help us create quality open source voice data.
Kwemeza amajwi yatanzwe ni ingirakamaro ku rwego rumwe n'inshingano z'Ijwi Rusange (Common Voice). Umva neza maze udufashe gukora ivomo ry'imbonwa rinoze rikoreshwa na buri wese.
Validations
Iyemeza/ ibyemejwe
Vietnamese
IIkiviyetinamu
Vietnamese
IIkiviyetinamu
View your progress against personal and project goals.
Garagaza uko ibyo ukora bigenda ku bwawe cyangwa ugereranyije n'ibyo umushinga ugamije.
Visible
Bigaragara
Voice is natural, voice is human. That’s why we’re excited about creating usable voice technology
for our machines. But to create voice systems, developers need an extremely large amount of voice
data.
Ijwi ni kamere, ijwi ni iry'abantu. Ni yo mpamvu dushishikajwe no gukora ikoranabuhanga ry'amajwi rishobora gukoreshwa ku mashini zacu. Ariko mu gukora sisiteme z'amajwi, ababishinzwe bakeneye imbonwa z'amajwi nyinshi.
Voice is natural, voice is human. That’s why we’re fascinated with creating usable voice
technology for our machines. But to create voice systems, an extremely large amount of voice
data is required.
Ijwi ni kamere, ijwi ni iry'abantu. Ni yo mpamvu dushishikajwe no gukora ikoranabuhanga ry'amajwi ryakwifashishwa mu mashini zacu. ariko mu gukora sisiteme z'aya majwi, hakenewe imbonwa z'amajwi nyinshi.
Voice recognition technology is revolutionizing the way we interact with machines, but the currently available systems are expensive and proprietary. Common Voice is part of Mozilla’s initiative to make voice recognition technologies better and more accessible for everyone. Common Voice is a massive global database of donated voices that lets anyone quickly and easily train voice-enabled apps in potentially every language.
We're not only collecting voice samples in widely spoken languages but also in those with a smaller population of speakers. Publishing a diverse dataset of voices will empower developers, entrepreneurs, and communities to address this gap themselves. In addition to the Common Voice dataset, we’re also building an open source speech recognition engine called Deep Speech.
Ikoranabuhanga ntahurajwi rirahindura uburyo dukoresha mudasobwa, ariko uburyo buriho ubu burahenze kandi ni umutungo wihariwe na ba nyirawo. Ijwi Rusange (Common Voice) ni kimwe mu bitekerezo byatangijwe na Mozilla ngo irusheho kunoza ikoranabuhanga ntahurajwi no gutuma rigera kuri buri wese. Ijwi Rusange (Common Voice) ni indundo nyamunini ikubiyemo amajwi yatanzwe n'abantu atuma ushaka wese ashobora kwimenyereza vuba kandi mu buryo bworoshye rojisiyeri zikoresha amajwi mu rurimi urwo ari rwo rwose.
Voices Online Now
Amajwi ubu ari ku murongo wa murandasi.
Votic
Ikivotike
Votic
Ikivotike
VoxForge was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines.
Vogusiforuje (VoxForge) yakozwe ngo yifashishwe mu kwandukura ibiri mu majwi, mu mvugo kugira ko bikoreshwe n'imashini cyangwa porogaramu ntahuramajwi z'ubuntu z'ivomo rifunguriye bose.
Want updates when we release a new version of the Common Voice dataset? Subscribe to our newsletter.
Ukenera amakuru mashya y'ibyakozwe buri gihe twasohoye ubwoko bushya bw'ikusanyirizo ry'imbonwa z'amajwi rya porogaramu y'Ijwi Rusange? Iyandikishe ku kanyamakuru kacu.
We are building an open and publicly available dataset of voices that everyone can use to train speech-enabled applications.
Turakora ikusanyirizo rusange kuri bose ry'amajwi buri muntu ashobora gukoresha mu kwitoza gukoresha porogaramu ntahuramajwi.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
Kuri Mozilla turimo gushinga umuryango w'abantu bahuriye ku ikoranabuhanga ry'amajwi. Twifuza ko mukomeza kutugezaho amakuru, imvano nshya z'imbonwa no kumenya biruseho uko mukoresha izo mbonwa.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
Muri Mozilla turimo gushina umuryango w'abantu bahuriye ku ikoranabuhanga ry'amajwi. Dukeneye guhora tumenya amakuru, kubona imvano nshya y'imbonwa no kumenya uko mukoresha izi mbonwa.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
Kuri Mozilla turimo gushinga umuryango w'abantu bahuriye ku ikoranabuhanga ry'amajwi. Twifuza ko mukomeza kutugezaho amakuru, imvano nshya z'imbonwa no kumenya biruseho uko mukoresha izo mbonwa.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
Muri Mozilla turimo gushina umuryango w'abantu bahuriye ku ikoranabuhanga ry'amajwi. Dukeneye guhora tumenya amakuru, kubona imvano nshya y'imbonwa no kumenya uko mukoresha izi mbonwa.
We believe that large and publicly available voice datasets foster innovation and healthy commercial competition in machine-learning based speech technology. This is a global effort and we invite everyone to participate. Our aim is to help speech technology be more inclusive, reflecting the diversity of voices from around the world.
Dutekereza ko amakusanyirizo magari y'amajwi agerwaho na buri wese azateza imbere guhanga ibishya n'ihiganwa mu bucuruzi by'ikoranabuhanga ry'amajwi/ imvugo rifatiye ku kwiga kw'imashini. Iki ni igikorwa cyo ku isi yose, tukaba dusaba buri wese kukigiramo uruhare. Intego yacu ni ugufasha ikoranabuhanga rirebana n'imvugo/ amajwi kutagira uwo riheza, rikarangwa n'urunyurane rw'amajwi aturutse mu mpande enye z'isi.
We believe that large, publicly available voice datasets will foster innovation and healthy commercial competition in machine-learning based speech technology.
Common Voice’s multi-language dataset is already the largest publicly available voice dataset of its kind, but it’s not the only one.
Look to this page as a reference hub for other open source voice datasets and, as Common Voice continues to grow, a home for our release updates.
Dutekereza ko amakusanyirizo magari y'amajwi agerwaho na buri wese azateza imbere guhanga ibishya n'ihiganwa mu bucuruzi by'ikoranabuhanga ry'amajwi/ imvugo rifatiye ku kwiga kw'imashini. Ikusanyirizo mpuzandimi ry'imbonwa z'amajwi rikaba rimaze kuba ari ryo rigari rizwi mu yandi yose akora ibisa n'ibyaryo, ariko si ryo ryonyine ribaho.
We calculate hours by estimating the average length of each recording, and then multiplying that number by the total number of recordings across all languages.
Tubara amasaha mu kugenekereza impuzandengo y'uburebure bwa buri kintu cyafashwe n'ibyuma mfatamajwi tugakuba uwo mubare n'igiteranyo cyose k'ibyafashwe mu ndimi zose.
We don't have anything to validate in this language, help us fill the queue.
Nta kindi twemeza muri uru rurimi, dufashe kuzuza urutonde.
Welsh
Ikigaruwa
Welsh
Ikigaruwa
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Tubijeje kwifashisha amakuru muduha tubyitayeho. Soma ibindi mu byo twakwandikiye ku murongo wawe bwite.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Tubijeje kwifashisha amakuru muduha tubyitayeho. Soma ibindi birenzeho byoherejwe ku murongo wawe bwite.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Tubijeje kwifashisha amakuru muduha tubyitayeho. Soma ibindi mu byo twakwandikiye ku murongo wawe bwite.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Tubijeje kwifashisha amakuru muduha tubyitayeho. Soma ibindi birenzeho byoherejwe ku murongo wawe bwite.
We’re building an open source, multi-language dataset of voices that anyone can use to train speech-enabled applications.
Turakora ivomo rusange ry'ikusanyirizo mpuzandimi ry'amajwi uwo ari we wese ashobora gukoresha kwitoza porogaramu ntahurajwi
We’re sorry, but your platform is not currently supported.
Tubiseguyeho kuko ihuriro ryanyu ritashyizwe muri porogaramu
We want the Common Voice dataset to reflect the audio quality a speech-to-text engine will hear in the wild, so we’re looking for variety. In addition to a diverse community of speakers, a dataset with varying audio quality will teach the speech-to-text engine to handle various real-world situations, from background talking to car noise. As long as your voice clip is intelligible, it should be good enough for the dataset.
Twifuza ko ikusanyirizo ry'imbonwa ry'Ijwi Rusange (Common Voice) riba ryiza ku buryo ihura n'imashini ishobora kumva mu no mu bihuru, mbese twifuza ibinyuranye. Ikindi kijyanye bantu batandukanye bakoresha ururimi, ikusanyirizo ririmo amajwi anyuranye azatuma imashini imenya uko ikoresha amajwi atandukanye mu bihe bisanzwe bitandukanye birimo imvugo zirimo andi majwi cyangwa urusaku cyangwa imodoka zihinda. Igihe cyose ijwi ryawe ryumvikana, rizaba rihagije ari n'ingirakamaro ku ikusanyirizo ry'imbonwa z'amajwi.
We will be in touch with more information about how to add your language to Common Voice very soon.
Tuzakomeza twungurane ibitekerezo by'uko twakongera ururimi rwawe muri porogaramu y'Ijwi Rusange bidatinze.
We will be in touch with more information as it becomes available.
Tuzakomeza guhana amakuru uko azajya aboneka.
We will be in touch with more information as it becomes available.
Tuzakomeza guhana amakuru uko azajya aboneka.
We will not make your email public.
Ntituzagira rusange imeri yawe.
We will review your request to remove your voice recordings from the dataset. If your request is approved, we will contact those who have downloaded the dataset and request they remove your voice recordings as well.
Tuzasubiramo tugenzure ugusaba kwawe ko amajwi wafashe akurwa mu ikusanyirizo ry'imbonwa. Igihe byemejwe, tuzasaba abamanuye ibyo mu ikusanyirizo gukuramo amajwi yawe wafashe.
What does it mean that I can’t “determine the identity” of speakers in the Common Voice dataset?
Kugena irangamimerere ry'abavuga mu ikusanyirizo rya porogaramu y'Ijwi Rusange bisobanuye iki?
What is Common Voice?
Ijwi Rusange (Common Voice) ni iki?
What level of audio quality is required for a voice clip to be used in the dataset?
Ijwi rishyirwa mu ikusanyirizo ry'amajwi rigomba kuba riri ku ruhe rwego, rifite irihe reme?
What’s inside the Common Voice dataset?
Mu ikusanyirizo ry'imbonwa rya porogaramu y'Ijwi Rusange harimo iki?
What's Public?
Ni iki Rusange?
What’s the difference between Common Voice and Deep Speech?
Ni irihe tandukaniro hagati y'Ijwi Rusange (Common Voice) n'Imvugo Inimbitse (Deep Speech)?
When will you release Common Voice data in other languages?
Muzatangaza ryari imbonwa z'Ijwi Rusange (Common Voice) mu zindi ndimi?
Where does the source text come from?
Imvano y'imyandiko yo ituruka he?
Why a profile?
Kuki isura ndanga ari ngombwa?
Why does this matter?
Kubera iki ibi ari ngombwa/ ari ingenzi?
Why don’t you ask people to read from books or Wikipedia articles in different languages?
Kuki mudasaba abantu gusoma ibitabo cyangwa ibyandikwa kuri Wikipediyamu ndimi zinyuranye?
Why do you need so many different speakers per language?
Kuki dukeneye amavuga batandukanye benshi kuri buri rurimi?
Why is 10,000 validated hours the per language goal for capturing audio?
Kuki ari ngombwa kugeza ku masaha 10,000 yemejwe y'amajwi yafashwe kuri buri rurimi?
Why is Common Voice part of the Mozilla mission?
Kuki Ijwi Rusange ari imwe mu nshingano za Mozilla?
Why is it important?
Kubera iki ari ingirakamaro?
Why is my language not included yet?
Kuki ururimi rwange rutari rwashyirwamo?
Why should I sign up for an account?
Kuki ngomba kugira konti?
Would you like to request your voice recordings be deleted too, or do you prefer to keep them in the Common Voice dataset?
Ushaka gusaba ko amajwi yawe wafashe na yo asibwa, cyangwa usanga yagumishwa mu ikusanyirizo ry'imbonwa rya porogaramu Ijwi Rusange?
y
y
Yes
Yego
Yes
Yego
Yes, send me emails. I’d like to stay informed about the Common Voice Project.
Yego. Munyoherereze imeri. Ndashaka gukomeza kubona amakuru yerekeye umushinga wa "Common Voice".
Yes, send me emails. I’d like to stay informed about the Common Voice Project.
Yego. Munyoherereze imeri. Ndashaka gukomeza kubona amakuru yerekeye umushinga wa "Common Voice".
Yes, send me emails. I'd like to stay informed about the progress of this language on Common Voice.
Yego, munyoherereze imeri. Ndashaka guhora nzi amakuru y'urur rurimi muri "Common voice".
Yes, send me emails. I'd like to stay informed about the progress of this language on Common Voice.
Yego, munyoherereze imeri. Ndashaka guhora nzi amakuru y'urur rurimi muri "Common voice".
Yes, we especially want your voice! Part of the aim of Common Voice is to gather as many different accents as possible so that voice recognition services work equally well for everyone. This means donations from non-native speakers are particularly important.
Yego, dukeneye rwose ijwi ryawe! Imwe mu migambi y'Ijwi Rusange ni ukwegeranya imvugo zitandukanye zishoboka kugira ngo abashinzwe iby'itahurajwi babashe gukorera neza buri wese ku rwego rumwe.Ni yo mpamvu ijwi ry'utari kavukire w'ururimi rifite akamaro gakomeye.
You
Wowe
You are about to initiate a download of <size>{ $size }GB</size>, proceed?
Ugiye gutangira kumanura jigabayite, ukomeze?
You are prepared to initiate a download of <b>{ $size }</b>
Witeguye gutangira kumanura ....
You can choose to make your username public or anonymous.
Ushobora guhitamo kwerekana izina ndanga ukoresha cyangwa nturyerekane.
You must allow microphone access.
Ugomba kwemerera gukoresha mikoro
Your anonymous voice recordings will remain in the Common Voice dataset. Once you delete your profile you will no longer be able to submit a request to remove your recordings from the dataset
Amajwi yawe wafashe atagomba gushyirwa ahagaragara aguma mu ikusanyirizo ry'imbonwa rya porogaramu y'Ijwi Rusange. Iyo uhanaguye isura ndanga yawe, ntuzaba ugishoboye kugira ubwo usaba gukura amajwi wafashe mu ikusanyirizo ry'imbonwa.
Your download has started.
Imanura wakoze ryatangiye.
Your Languages
Indimi zawe
Your username and email will not be associated with the published data.
Izina ndanga ryawe na imeri ntibizajya hamwe n'imbonwa zizatangazwa.
You've helped Common Voice reach <goalPercentage></goalPercentage> of our daily { $goalValue } recording goal!
Wafashije porogaramu y'Ijwi Rusange kugeza ku mugambi wacu w'amajwi agomba gufatwa buri munsi.
You've helped Common Voice reach <goalPercentage></goalPercentage> of our daily { $goalValue } validation goal!
Wafashije porogaramu y'Ijwi Rusange kugeza ku mugambi wacu w'ibyemezwa ku munsi.
You've successfully signed up for contributing to { $language }. Thank you.
Washoboye neza guhamya gutanga umusanzu mu . Urakoze.
You've successfully signed up for contributing to { $language }. Thank you.
Washoboye neza guhamya gutanga umusanzu mu . Urakoze.