[1-5]
[1-5]
2019 End-of-Year Release
2019 Lokollo ya Mafelelong a Ngwaga
2 down, keep it up!<playIcon></playIcon>
Tše 2 di feditšwe, Swarang le tiiše!<playIcon></playIcon>
≥ 2 No votes
≥ 2 Aowa dikgetho
≥ 2 Yes votes
≥ 2 Ee dikgetho
Abbreviations and Acronyms. Abbreviations and acronyms like “USA” or “ICE” should be avoided in the source text because they may be read in a way that does not coincide with their spelling. Additionally, there may be multiple accurate readings for a single abbreviation. For example, the acronym “ICE” could be pronounced “I-C-E” or as a single word.
Dikhutsofatšo le mantšu a khutsofadišwego go swana le “USA” goba “ICE” di swanetše go phengwa mongwalong wa mothopo ka baka la gore di ka balwa ka tsela e sa dumelelanago le mopeleto wa tšona. Go oketša moo, go ka ba le dipalo tše dintši tše nepagetšego bakeng sa khutsofatšo e le nngwe. Ka mohlala, khutsofatšo “ICE” e ka bitšwa “I-C-E” goba e le lentšung letee.
Abkhaz
Se-Abkhaz
About
Mabapi le
Accent
Mokgwa
Account
Akhaonte
<accuracyLink>See more on accuracy criteria.</accuracyLink>
<accuracyLink>Bona se se fetago ka mo go nepagetšego ka kriteria.</accuracyLink>
Acehnese
Se-Acehnese
{ $actionType }<playIcon></playIcon> did they accurately speak the sentence?
{ $actionType }<playIcon></playIcon> na ba boletše lefoko ka nepo?
{ $actionType }<recordIcon></recordIcon> then read the sentence aloud
{ $actionType }<recordIcon></recordIcon> ke moka balela mafoko godimo
{ $actionType }<stopIcon></stopIcon> when done
{ $actionType }<stopIcon></stopIcon> ge o feditše.
{ $actionType } submit when ready
{ $actionType } tsenya ge o lokile
Add
Oketša
Add a language you want to contribute to
Tsenya leleme leo o nyakago go neela go lona
Add as much information as you can about the language. This
could include:
Tlaleletša tshedimošo e ntši ka moo o ka kgonago ka gona ka polelo. Se se ka akaretša:
Adding languages to work with
Tlaleletša maleme go mošomo wa gago
Additional Language
Leleme le oketšegilego
Add Language
Oketša polelo
Add Language
Oketša leleme
<addLink>Add more sentences now!</addLink>
<addLink>Tsenya mafoko a mangwe bjale!</addLink>
Add new custom accent "{ $inputValue }"
Tlaleletša mokgwa wa go bolela o tlwaelegilego o mpsha "{ $inputValue }"
Add new sentences
Tsenya mafoko a ma Mphsa
Add Sentences
Tsenya Mafoko
Add <wikipediaLink>public domain</wikipediaLink> sentences
Tlatša <wikipediaLink>mafelong a batho ka moka</wikipediaLink> le mafoko
Add Your Voice
Tsenya Lentšu la Gago
Adyghe
Se-Adyghe
Afaan Ormoo
Se-Afaan Ormoo
A few bytes
Di-bytes tše mmalwa
Afrikaans
Se-Afrikanse
Age
Mengwaga
Albanian
Se-Albania
All
Ka moka
All sentences you submit must be under <wikipediaLink>Public Domain (CC-0) license</wikipediaLink>. To support the inclusion of work not under public licence, we have created a <cc0WaiverLink>Contributions Agreement template</cc0WaiverLink> for works where the copyright owner would like to contribute the material to Common Voice.
Mafoko ka moka ao o wa rometšego a swanetše go ba ka tlase ga<wikipediaLink> laesense ya Mafelong a Batho Bohle (CC-0)</wikipediaLink>. Ho thekga go akeretšwa ga mošomo o sego ka tlase ga laesense ya setšhaba, re hlangwe <cc0WaiverLink>thempleite ya Dumelano ya Neelano</cc0WaiverLink> bakeng sa mešomo moo mong wa ditokelo tša molao a ka rata go neela dintlha go Common Voice.
All voice clips in the dataset are scrubbed of personally identifying information. When a contributor provides demographic data via their profile, that information is de-identified from their voice clips before being bundled for download in the dataset and is never made public on their profile page.
Dikarolo ka moka tša lentšu go dathasete di hlakotšwe boitsebišo bja motho. Ge moneedi a nea datha tša palo ya batho ka profaele ya bona, tsebišo yeo e itlhaotše go tšwa ditsopolwa tša lentšu pele e ka tlangwa go taneloutiwa go dathasete gomme ga go mohla e phatlalatšwa letlakaleng la boemo bja rena.
Amharic
Se-Amharic
Anonymized user data like age, gender, and accent helps improve the audio data used to train the accuracy of speech recognition engines. Your username and email will never be associated with your submitted data, and you can choose whether to make your username public or anonymous.
Datha ya modiriši ye o sa tsebjego bjalo ka mengwaga, le bong bja motho, le mmolelo ao a thuša go go kaonefatša datha ya odio e šomišago go eteletša nepagatšo ya di-entšene a tlhathollo ya mantšu. Leina la gago la mošomiši le e-meile di ka se itswalanye le datha yeo e rometšego, gomme o ka kgetha gore na leina la gago la mošomiši le phatlalatšwe goba le se tsebje.
Approve
Amogela
A quiet background hubbub is OK, but we don’t want additional voices that may cause a machine algorithm to identify words that are not in the written text. If you can hear distinct words apart from those of the text, the clip should be rejected. Typically this happens where the TV has been left on, or where there is a conversation going on nearby.
Lefelong la bokamorago le se nago lešata le LOKILE, eupša ga re nyake mantšu a mangwe a ka dirago gore mokgwa wa motšhene o kgetholwe mantšu ao a se be gona mongwalong. Ge e ba o ka kwa mantšu a phapano go ao a lego mongwalong, setsopolwana se swanetše go ganwa. Se se tlwaetše go diragala moo TV e tlogetšwego, goba moo poledišano e diragalago kgauswi le moo.
Arabic
Se-Arabic
Aragonese
Se-Aragonese
Are you coming? <strong>[called by another]</strong>
Wa tla na? <strong>[called by another]</strong>
Armenian
Se-Armenia
Armenian Western
Se-Armenian Western
Ask about a new language
Botšiša ka polelo e mphsa
Assamese
Se-Assamese
Asturian
Se-Asturia
Audio Format
Fomete ya Odio
Avatar
Motšeakarolo
Avatar uploaded
Tsenya Motšeakarolo
Average
Palo-gare
A voice clip is marked "valid" when a user gives it a Yes vote.
Pego ya lentšu e swailego ka "kgonthiša" ge mošomiši a kgethile Ee.
Avoid too much background noise - it should be easy to hear you.
Phema lešata le lentši la bokamorago - go swanetše go ba bonolo go go kwa.
Awards
Difoka
Azerbaijani
Se-Azerbaijani
Back
Morago
Background Noise
Lešata la Bokamorago
Background Voices
Mantšu a Bokamorago
Back to Top
Boela godimo
Basaa
Se-Basaa
Bashkir
Se-Bashkir
Basque
Se-Basque
Be cautious before rejecting a clip on the ground that the reader has mispronounced a word, has put the stress in the wrong place, or has apparently ignored a question mark. There are a wide variety of pronunciations in use around the world, some of which you may not have heard in your local community. Please provide a margin of appreciation for those who may speak differently from you.
Eba šedi pele o gana setsopolwana ka baka la gore mmadi ga a bitša lentšu ka tsela e fošagetšego, o beile kgateletšo lefelong le fošagetšego, goba go bonagala a hlokomologile letswao la potšišo. Go na le mehuta e mentši ya go bitšwa ga mantšu e didirišwago lefaseng ka bophara, yeo mohlomongwe o se ke wa ekwa setšhabeng sa geno. Ka kgopelo nea ka moedi wa tebogo go bao ba bolelago ka tsela e fapanego le ya gago.
Belarusian
Se-Belarusian
Benefits
Mehola
Bengali
Se-Bengali
[‘Beret’ is OK whether with stress on the first syllable (UK) or the second (US)]
[‘Beret’ is OK whether with stress on the first syllable (UK) or the second (US)]
<bold>{ $count }</bold> Clips
<bold>{ $count }</bold> Dipego
<bold>Help us</bold> find more voices
<bold>Re thuše </bold> re hwetše mantšu a mangwe
Bosnian
Se-Bosnian
Both
Bobedi
Both (Speak and Listen)
Bobedi (Bolela le go Theeletša)
Breton
Se-Breton
Browse Languages
Poraosa Maleme
Build a custom goal
Aga pakane e swanelegilego
Build Profile
Aga Profaele
Bulgarian
Se-Bulgaria
Burmese
Se-Burmese
Buryat
Se-Buryat
<b>Why an email?</b> We may need to contact you in the future about changes to the dataset, an email provides us a point of contact.
<b>Ke lebaka la eng ka e-meile?</b> Re ka hloka go ikgokaganya le wena nakong e tlago mabapi le diphetogo tša dathasete, e-meile e re fa sebaka sa go ikgokaganya.
By editing your goal, you may lose your existing progress.
Ka go thulaganya pakane ya gago, o ka lahlalelwa ke tšwelopelo ya gago e bego gona.
By opting in to receive emails you state that you are okay with Mozilla handling this info as explained in Mozilla’s <privacyLink>Privacy Policy</privacyLink>.
Ge o kgetha go amogela di-emeile o ka bolela gore ga go na molato ge Mozilla e diriša tsebišo ye bjalo ka ge go hlalositšwe ka <privacyLink>Molao wa Lekunutu</privacyLink> wa Mozilla.
<b>You agree</b> to not attempt to determine the identity of speakers in the Common Voice dataset
<b>O wa dumela</b>gore o se ke wa leka go utolla boitšhupo bja baboledi go dathasete ya Common Voice.
By providing some information about yourself, the audio data you submit to Common Voice will be more useful to Speech
Recognition engines that use this data to improve their accuracy.
Ka go nea boitsebišo bjo bo itšego ka wena, datha ya odio yeo o e romelago go Common Voice e tla thuša kudu go di-entšene tša Tlhathollo ya Mantšu ao a dirišago datha yeo go kaonefatša go nepagala ga tšona.
By using Common Voice, you agree to our <termsLink>Terms</termsLink> and <privacyLink>Privacy Notice</privacyLink>
Ka go šomiša Common Voice, o dumela go <termsLink>Mareo</termsLink> le <privacyLink>Tsebišo ya Sephiri</privacyLink> tša rena.
Cancel Re-recording
Khansela go rekhota gape
Cancel Submission
Khansela Tshedimošo
Can't decide?
Ga o kgone go kgetha?
Cantonese
Se-Cantonese
Catalan
Se-Catalan
Central Kurdish
Se-Central Kurdish
Change your email via Settings under Login Identity
Fetola e-meile ya gago go di selaolatirišo tše leng ka fase ga Boitšhupo bja Akhaonte ya Mošomiši
Check if your language is listed on the
<languagesPageLink>Common Voice Languages page</languagesPageLink>.
If not, fill out this form and we’ll <strong>send you a handy email
explaining how to get your language set up</strong>.
Lebelela gore na polelo ya geno e lokeletšwe letlakaleng la polelo ya<languagesPageLink> letlakaleng la dipolelo tša Common Voice</languagesPageLink>. Ge e ba go se bjalo, tlatša fomo ye gomme re tla <strong>go romela e-meile e hlalosago monkgwa wa go hloma polelo ya ya geno </strong>.
<chevron></chevron>See less
<chevron></chevron>Bona tše nyenyane
<chevron></chevron>See more
<chevron></chevron>Bona tša go feta
Chinese (China)
Se-China (China)
Chinese (Hong Kong)
Se-China (Hong Kong)
Chinese (Taiwan)
Se-China (Taiwan)
Choose language/localization
Kgetha polelo/polelo ya tikologong
Chuvash
Se-Chuvash
Click
Klika
{ $clipCount } voice clips, total archive size { $size }. Expires { $expires }.
{ $clipCount } ditsopolwa tša lentšu, palo-moka ya bogolo bja lotela ruri { $size }. E felelwa ke nako { $expires }.
Clip Graveyard
Setsopolwa sa Graveyard
Clips recorded
Ditsopolwa di begilwe
Clips Uploaded
Ditsopolwa di Tsene
Clips validated
Ditsopolwa di kgonthišitšwe
Clips You've Recorded
Di-Odio tše O di Gatišitšego
Clips You've Validated
Di-Odio tše O di Kgonthišitšego
Close
Tswalela
Close
Kgaufsi
Collecting sentences
Kgoboketša mafoko
Collecting sentences from the public domain, or writing new ones for the public domain.
Go kgoboketša mafoko go tšwa go tomeine e phatlaletšwego, goba go ngwala tše di mpsha tša tomeine ya phatlaletšo.
Collect sentences
Kgobokanya mafoko
Comment
Tshwayotshwayo
Common Voice data plus all other voice datasets above.
Datha ya Common Voice le di-dathasete tše dingwe tša lentšu ka godimo.
Common Voice Dataset
Dathasete ya Common Voice
Common Voice is a crowdsourcing platform, and the languages were all added by volunteers.
We would love for you to add your language! <languageRequestLink>Ask about adding your language.</languageRequestLink>
Common Voice ke sethaleng sa batho ba bantši, gomme dipolelo di ile tša tsenywa ke baithaopi.
Re ka rata gore o tsenye polelo ya geno! <languageRequestLink> Botšiša ka go tsenya polelo ya geno.</languageRequestLink>
Common Voice is Mozilla's initiative to help teach machines how real people speak.
Common Voice ke mogato wa pele wa Mozilla wa go ruta metšhene kamoo batho ba sebele ba bolelago ka gona.
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Common Voice ke karolo ya mathomo ga Mozilla go ruta metšhene gore batho ba sebele ba bolela bjang.
Common Voice is the world’s largest publicly available, multi-language voice dataset.
Common Voice ke dathasete e kgolo kudu go fetišiša ya mantšu a maleme a mantši a go hwetšagala lefaseng phatlalatšwa.
Common Voice recordings are used by academics, small businesses, and voice recognition enthusiasts to help train and grow publicly available resources like voice models.
Can you let us know why you would like your recordings deleted?
Dipego tša Common Voice di dirišwa ke barutegi, dikgwebo-potlana, le ba go fišegelwa go tseba ka mantšu go thuša go eteletša le go godiša methopo ye e hwetšagalago phatlalatša bjalo ka mehuta ya mantšu. Na o ka re tsebiša gore ke ka baka la eng o nyaka gore direkhoto tša gago di phumolwe?
Community participation and decision making.
Go tšea karolo ga setšhaba le go tšea dipheto.
Community Playbook
Puku ya Tshedimošo ya Setšhaba
Confirm
Kgonthišetša
Confirm Goal
Tiišetša Pakane
Confirm New Sentences
Tiišetša Mafoko a ma Mphsa
Connect with Gravatar
Kgokaganya le Gravatar
Contact
Kgokagana
Contact Form
Lengwalo la Dikgokaganyo
Content available under a <licenseLink>Creative Commons license</licenseLink>
Diteng di gona ka fase ga<licenseLink>Laesense ya Creative Commons</licenseLink>
Continue
Tšwela pele
Contribute
Thuša
Contribute
Neela
Contribute sentences
Neela ka mafoko
Contribute to { $lang }
Neela go { $lang }
Contribute Your Voice
Neela Lentšu la Gago
Contribution Activity
Meneelo ya Modiro
Contribution Criteria
Ditiragalotšeletšwa tša Kriteria
Contribution Experience
Go fana ka Ditiragalotšweletšwa
Contributors record voice clips by reading from a bank of donated sentences.
Baneedi ba gatiša dipego tša lentšu ka go bala go panka ya mafoko a neetšwego.
Cookies
Cookies
Cookies
Dikhuki
<coquiLink>Coqui</coquiLink> is dedicated to open speech technology. Their projects include deep learning based STT and TTS engines.
<coquiLink>Coqui</coquiLink> e neetšwego go bula thekinolotši ya mantšu. Mediro ya tšona o akaretša thuto e tseneletšego yeo e leng sedirišwa sa STT le TTS.
Cornish
Se-Cornish
Corsican
Se-Corsican
Could not add language
Ga ra kgona go tsenya leleme
Could not change settings. Please try again.
Ga se re kgone go fetola selaolatirišo. Re kgopela o leke gape.
Could not remove language
Ga ra kgona go tloša leleme
Could not remove language: language not found
Ga se re kgone go tloša polelo: polelo ga e hwetšagale
{ $count } clips
{ $count } ditsopolwa
{ $count }mo
{ $count }mo
{ $countOfInvalidated } rejected by you
{ $countOfInvalidated } di ganwego ke wena
{ $countOfReviewed } sentences are already reviewed. Great job!
{ $countOfReviewed } mafoko a šetše a hlahlobilwe. O šomile botse!
{ $countOfSentences } sentences found.
{ $countOfSentences } mafoko a hweditšwe.
{ $countOfUnreviewed } sentences are unreviewed. If you want, you can also review your sentences now before submitting them.
{ $countOfUnreviewed } mafoko a se a hlahlobwe. Ge e ba o nyaka, o ka hlahloba mafoko a gago gabjale pele o wa romela.
{ $count }wk
{ $count }wk
{ $count }y
{ $count }y
Create a Custom Goal
Bopa Pakane e Swanelegilego
{ $created }
{ $created }
Creating opportunities for a diversity of people to contribute to Common Voice ensures the dataset serves as many people as possible. We’ve created resources and templates that you can use!
Go hlagwa menyetla ya gore batho bao ba fapa-fapanego ba tsentšha letsogo go Common Voice go netefatša gore dathasete e hlankela batho ba bantši kamoo go ka kgonegago. Re hlangwe ditlabakelo le dithempleite tšeo o ka di šomišago!
Criteria
Kriteria
Croatian
Se-Croatia
Czech
Se-Czech
Daily Goal
Pakane ya ka Mehla
Danish
Se-Danish
Dashboard
Dashboard
Dataset Release
Lokollo ya Dathasete
Datasets
Di-dathasete
Datasets
Di-dathasete
Date
Letšatši
Days
Matšatši
De-identified
Kgetholla gape
Delete my recordings
Phumola dikgatišo tšaka
Delete Profile
Phumola Profaele
Delete selected sentences
Phumola mafoko a kgetilwego
Deleting selected sentences…
Re phumola mafoko a kgethilwego
Dhivehi
Se-Dhivehi
Different language
Polelo e fapanego
Difficult
Bothatha
Difficult to pronounce
Go thatha go wa bitša
Dioula
Se-Dioula
Discard ongoing recording
Phumola kgatišo e diragalago
Discourse
Discourse
Donate your voice
Neela lentšu la gago
Don't see your language on Common Voice yet?
Ga o bone polelo ya gago go Common Voice gabjale?
Don’t see your language reflected in the Dataset? To request a language head over to our Languages page.
Ga o bone polelo ya gago bonagatša go dathasete? Go kgopela polelo e ya go letlakala la rena la Dipolelo.
Download
Tanelouto
Download
Goga go tšwa inthaneteng
Download Common Voice Data
Tanelouta Datha ya Common Voice
Download Data
Tanelouto ya Datha
Download Dataset Bundle
Tanelouta dathasete di dikgobokane
Download { $language }
Tanelouto { $language }
Download Links
Tanelouta di-Linke
Download My Data
Tanelouta Datha Yaka
Download profile data
Tanelouta datha ya profaela
Download the Single Word Target Segment
Tanelouta Nepo ya Lentšu le Tee
Do you have ideas on how we can make the Common Voice dataset better? Let us know on Discourse
Afa o nale dikgopolo gore re ka dirang gore dathasete ya Common Voice e be kaone? Re tsebiše go Discourse
Do you want to continue?
O nyaka go tšwela pele?
Do you want to Speak, Listen or both?
O nyaka go bolela, go Theeletša goba ka bobedi?
Drag and drop or <browseWrap>Browse</browseWrap>
Go rwala lego bea goba <browseWrap>Praosa</browseWrap>
During contribution submission feedback will be skipped after clicking 'Submit'. Contribution will continue directly with the next set of 5 recordings or validations.
Nakong ya tlhagišo ya moneelo a maikutlo a tla fetišwa ka morago ga go tobetsa ‘Romela’. Moneelo o tla tšwelapele go lebanya le sete e latelago ya direkhoto tše 5 goba dinetefatšo.
Dutch
Se-Dutch
Each entry in the dataset consists of a unique MP3 and corresponding text file. Many of the <b>{ $total }</b> recorded hours in the dataset also include demographic metadata like age, sex, and accent that can help train the accuracy of speech recognition engines.
The dataset currently consists of <b>{ $valid }</b> validated hours in <b>{ $languages }</b> languages, but we’re always adding more voices and languages. Take a look at our <languagesLink>Languages page</languagesLink> to request a language or start contributing.
Engwe le engwe yeo e tsenego dathasete e na le MP3 e fapanego le faele e tsamaišanago le mongwalo. Bontši bja <b>{ $total }</b> diiri tšeo gatišitšwego ka gare ga dathasete di akaretša gape le boitsebošo bja dintlha tša palo ya batho bjalo ka mengwaga, le bong bja motho, le mmolelo seo a ka thušago go eteletša go nepagetša ga tlhathollo ya mantšu tša temogo.
Dathasete ga bjale di na le <b>{ $valid }</b> diiri tšeo di netegaditšwego ka dipolelo<b>{ $languages }</b>, eupša re dula ka mehla re okeditše mantšu le maleme a mangwe. Ela hlogo <languagesLink>letlakaleng la rena la Dipolelo</languagesLink> go kgopela polelo goba go thoma go neela.
Easy
Bonolo
Edit
Rulaganya
Edit Profile
Lokiša Profaele
Email
Lenaneo la poso
Email
E-meile
<emailFragment>Sign up</emailFragment> to our mailing list to learn how you can take part in campaigns, events and co-design features on Common Voice.
<emailFragment>Ngwadiša</emailFragment> go lethamong la rena la mangwalo go ithutha gore na o ka tšea karolo bjang masolong, ditiragalo le dikarolo tša moralo o kopanetšego go Common Voice.
Email is already used for a different account
E-meile ye e šomišwa ke akhaonte e fapanego.
Email Subscriptions
E-meile ya Boingwadišetšo
English
Sekgowa
English
Seisemane
Enter Email to Download
Tsenya E-meile go Tanelouta
Enter email to send a sign up link
Tsenya e-meile go romela linke ya go ngwadiša
Enter your email
Tsenya e-meile
Error { $code }
Phošo { $code }
Error while fetching rejected sentences. Please try again.
Phošo ge re lata mafoko a ganwego. Re kgopela o leke gape.
Error while fetching your sentences. Please try again.
Phošo ge re lata mafoko a gago. Re kgopela o leke gape.
Erzya
Se-Erzya
ESC
ESC
Esperanto
Se-Esperanto
Estonian
Se-Estonia
Events
Ditiragalo
Everyone
Yo mongwe le yo mongwe
Exit & Delete clips
Etšwa o Phumole dipego
Expert linguists can also help, try to reach out to local linguistic universities (both teachers and students) and see if they can help to gather a diverse set of sound-diverse sentences in your language.
Ditsebi tša maleme tše hlwahlwa le tšona di ka thuša, leka go ikopanya le setsebi sa maleme sa diyunibesithi ya lefelong leo (bobedi bja barutiši le baithuti) le go bona gore na ba ka thuša go kgoboketša mafoko e fapanego ya polelo ya geno.
Failed to delete selected sentences… Please try again!
Paletšwe go phumola mafoko a kgetegilwego... Re kgopela o leke gape!
FAQ
Dipotšišo tše di Botšišwago Gantši
Faroese
Se-Faroese
Female
Mosadi
Fields marked with an asterisk (*) are required.
Mafelwana a go swawa ka setlankana sa naletšana (*) di a hlokagala.
Filtered sentences due to requirements failing (please submit fixed versions as new sentences):
Mafoko a filtharelwego ka le baka la dinyakwa tše hlolegilwego (re kgopela o tliše tše lokišitšwego bjalo ka mafoka a ma mpsha.
Finding existing sentences in the Public Domain
Go hwetša mafoko a go ba gona Lefelong la Setšhaba.
Find others who care. That could be universities, language schools, advocacy groups or data science communities. Reach out and explain clearly how they can help and why. <outreachTemplates>See our template outreach emails</outreachTemplates>.
Hwetša ba bagwe bao ba tshwenyegago. Seo se ka ba diyunibesithi, dikolo tša maleme, dihlopha sa bathekgi goba setšhaba sa thutamahlale ya datha. Katanela go fihlilela ditokelelo gomme o hlalose gabotse gore na na ka thuša bjang le gore na ke ka baka la eng. <outreachTemplates>Lebelela di-emeile tša rena tša thempleite</outreachTemplates>.
Finish editing first?
Fetša go rulaganya pele?
Finish recording
Fetša kgatišo
Finish recording first?
Fetša go gatiša pele?
Finish Review
Fetša Hlahlobo
Finnish
Se-Finnish
First, check if your language <languageLink>already exists</languageLink>.
If it isn’t, you can <languageRequestLink>ask about adding your language</languageRequestLink>.
There are two stages. Translating the site, and collecting sentences
Sa pele, hlahloba gore na polelo ya geno <languageLink>e šetše e le gona</languageLink>. Ge e ba ga go bjalo, o ka <languageRequestLink>kgopela mabapi le go tsenya polelo ya geno</languageRequestLink>.
Go na le megato e mebedi. Go fetolela saete, le go kgoboketša mafoko.
Foreign letters. Letters must be valid in the language being spoken. For example, “ж” is a letter in the Russian alphabet but is never used in English and so should never appear in any English source text.
Mangwalo a tšwago ka ntle ga naga. Magwalo a swanetše go nepagala ka polelo yeo e bolelwago. Ka mohlala, “ж” ke tlhaka ya alefabete ya se Russia eupša le ka mohla ga e dirišwe ka Seisimane ka gona ga ya swanela go bonagala mongwalong ofe goba ofe a Seisimane.
For every voice clip donated, and every audio clip validated, your account dashboards are updated to reflect your latest progress in each language you contribute to. Yes, you can contribute to more than one!<br/><br/> Use dashboards to track your stats, see how you're doing alongside others in the community, and set daily or weekly contribution goals.
Bakeng sa setsopolwana se sengwe le sengwe sa lentšu le neetšwego, gomme setsopolwana se sengwe le se sengwe sa odio se kgonthišwe, di-dashpoto tša akhaonto ya gago di a kaonefatšwa go bontšha tšwelo pele ya gago ya moragorago go ye nngwe le ye nngwe ya lpolelong yeo o neelago go yona. Ee, o ka neela go tše fetago e tee!<br/><br/> Šomiša di-dashpoto go latela dipalo-palo tša gago, go bona gore na o šoma bjang le batho ba bangwe setšhabeng, gomme o beele dipakane tša letšatši le letšatši goba tša beke le beke.
for example
Ka mohlala
For public domain text not available online, you could use academic referencing style e.g Harvard style "Jess (2021) My Public licence poems"
Bakeng sa mangwalong a setšhaba ga a hwetšegale inthaneteng, o ka berekiša tshupišo ya thuto e.g setaele sa Harvard "Jess (2021) My Public licence poems"
For these launched languages the website has been successfully <localizationGlossaryLink>localized</localizationGlossaryLink>, and has enough <sentenceCollectionGlossaryLink>sentences collected</sentenceCollectionGlossaryLink> to allow for ongoing <speakLink>Speak</speakLink> and <listenLink>Listen</listenLink> contributions.
Bakeng sa dipolelo tše di thomilwego wepesaete di atlegile <localizationGlossaryLink>localized</localizationGlossaryLink>, gomme di na le <sentenceCollectionGlossaryLink>mafoko a lekanego ao a kgobokanetšego </sentenceCollectionGlossaryLink>go dumelela moneelo go tšwelago pele ka <speakLink>Go Bolela</speakLink> le <listenLink>Go Theeletša</listenLink>.
French
Sefora
Frequently Asked Questions
Dipotšišo tše di Botšišwago Gantši
Frisian
Se-Frisian
Fulah
Se-Fulah
Galician
Se-Galician
GB
GB
Gender
Bong
Georgian
Se-Georgia
German
Sejeremane
Get involved
Go ba karolo
Get involved
Go ba karolo
Get Involved
Akaretša
Get started with goals
Thoma ka dipakane
Get Started with Speech Recognition
Thoma ka go Tlhathollo ya Polelo
<githubLink>NVIDIA NeMo</githubLink>™ is an <docsLink>open-source toolkit</docsLink> for researchers developing state-of-the-art conversational AI models.
<githubLink>NVIDIA NeMo</githubLink>™ ke <docsLink> ditirišwa tša mothopo o bolegilego </docsLink> ya Banyakišiši ba go tšweletša mehlala ya poledišano ya AL ya di State-of-the-art.
Glossary
Pukuntšu
Goal reached
Go fihlelela pakane
Goals
Dipakane
Goan Konkani
Se-Goan Konkani
Go to Discourse
Eya Discourse
Go to Languages Page
Eya go Letlakala la Maleme
Go to { $name }
Eya go { $name }
<governanceLink>Read more about how we're governed</governanceLink>
<governanceLink>Bala ka mo go oketšegilego mabapi le kamoo re bušwago ka gona</governanceLink>
Government proceedings, books and articles are also great however since the text tends to be a little more formal they are less of a priority.
Mananeo a mmušo, dipuku le sehlogo le tšona di botse kudu lega go le bjalo ka ge sengwalwa se na le mokgwa o hlokilego ga tše di tlago pele.
Grammatical / spelling error
Popopolelo/ Phošo mopeletong
Great! How many clips a week?
Seo se a kgahiša! Ke ditsopolwa tše kae ka beke?
Great! How many clips per day?
Go ka botse! Ke di pego tše kae ka letšatši?
Great!<recordIcon></recordIcon> Record your next clip
Ga Botse! <recordlcon></recordlcon>Rekhota pego ya gago e latelago
Great work!<playIcon></playIcon> Listen again when you're ready
O šomile gabotse! !<playIcon></playIcon> Theeletša gape ge o ikemišeditše
Greek
Se-Greek
Guarani
Se-Guarani
Haitian
Se-Haitian
Hakha Chin
Se-Hakha Chin
Hausa
Se-Hausa
Have Feedback?
O nale Tshwayotshwayo
Have you read our Terms?
O badile Mareo a rena?
Having an account is not required to contribute, though it is helpful.
Ga go hlokagale o be le akhaonte go neela, le ge e thuša.
Having a profile is not required to contribute though it is helpful, see why below.
Ga go hlokagale gore o be le profaele le ge e thuša, bona lebaka leo ka lona ka fase.
Hebrew
Se-Heberu
Help
Thuša
Help Common Voice reach { $hours } hours in a language with a personal goal
Thuša Common Voice e fihle { $hours } diiri ka polelo ya pakane ya gago
Help create Common Voice’s first target segment in { $locale }
Re thuše re bope nepo ya karolo ya mathomo ya Common Voice { $locale }
Help reach { $hours } hours in { $language } with a personal goal
Re thuše go fihlela { $hours } diiri { $language } ka pakane ya gago
Help teach machines how real people speak, donate your voice at { $link }
Re thuše re rute metšhene gore batho ba bolela bjang, neela lentšu la gago go { $link }
Help us build a community around voice technology, stay in touch via email.
Re thuše go aga setšhaba mabapi le thekinolotši ya lentšu, dula re boledišana le rena ka e-meile.
Help us build a high quality, publicly open dataset
Re thuše re age dathasete ya khwalithi ya go phagama kudu, le e phatlaletšwego
Help us by reviewing sentences for correctness according to the guidelines.
Re thuše ka go hlahloba mafoko bakeng sa go nepagala go tsamayelana le ditlhahlo.
Help us by writing or collecting Public Domain sentences.
Re thuše ka go ngwala goba go kgoboketša mafoko a mafelong a batho bohle.
Help us find more voices, share your goal
Re thuše re hwetše mantšu a mangwe, abelana ka pakane ya gago
Help us find others to donate their voice!
Re thuše re hwetše ba bangwe ba go neela mantšu a bona!
Help us get to { $goal }
Re thuše re fihle go { $goal }
Help us validate sentences!
Re thuše re kgonthiše mafoko!
Help us validate voices
Re thuše re kgonthiše mantšu
Here are some tips to find sentences:
Tše dingwe tša ditšhišinyo tša go hwetša mafoko:
Here are the links to download your ZIP files.
Di-linke tša go tanelouta di faele tša ZIP ke tše.
Hidden
Fihlilwego
Hill Mari
Se-Hill Mari
Hindi
Se-Hindi
His hand was rais-ed.
Letsogo la gagwe le emišitšwe.
Home
Gae
Hours
Diiri
Hours Recorded
Diiri di Begilwego
Hours Validated
Diiri di Kgonthišitšwe
{ $hours } validated hours so far!
{ $hours } diiri tše kgonthašitšwego go fihlela bjale!
How ?
Bjang?
How are project decisions made?
Le tšea dipheto tša projeke bjang?
How can I get the Common Voice data?
Nka hwetša datha ya Common Voice bjang?
How can we effectively grow a language on Common Voice?
Re ka godiša polelo bjang ka katlego go Common Voice?
How does Common Voice calculate hours?
Common Voice e bala diiri bjang?
How does Common Voice work?
E šoma bjang Common Voice?
How does Common Voice work?
E bereka bjang Common Voice?
How does site localization work?
Bo bereka bjang bo kgauswi bja wepesaete?
How do I access and use the dataset?
Nka fihlelela le go šomiša bjang dathasete?
How do I add a language?
Ke tlaleletša leleme bjang?
How do I add sentences?
Nka oketša mafoko bjang?
How do I know whether to approve a voice clip?
Ke tseba bjang gore na nka amogela pego?
How do I record a high quality voice clip?
Ke gatiša bjang pego ya khwalithi e phagamego kudu?
How do I stay in touch?
Ke ka dula ke boledišana le lena bjang?
How do you ensure anonymity and privacy of the people who donated their voices?
O kgonthišetša bjang leina le Sephiri sa batho ba neetšego mantšu a bona.
How to
Dira bjang
How-to
Go ithuta go
How to Cite
Kamoo go Bontšhago ka gona
How would you describe your accent?
O ka hlalosa bjang mokgwa wa gago wa go bolela?
Hungarian
Se-Hungarian
I agree
Kea dumela
I am a non-native speaker and I speak with an accent, do you still want my voice?
Ga ke seboledi sa polelo ya lefelong le ebile ke bolela ka segalo, le sa nyaka lentšu la ka na?
Icelandic
Se-Icelandic
I confirm that these sentences are <wikipediaLink>public domain</wikipediaLink> and I have permission to upload them.
Ke tiišetša gore mafoko a <wikipediaLink>mafelong a batho bohle</wikipediaLink> le gore ke go nea tumelelo ya gwa romela.
I'd like to receive emails such as goal reminders, my progress updates and newsletters about Common Voice.
Ke tla rata go hwetša di e-meile tša go swana le dikgopotšo tša dipakane, sebontšhatšwelopele le magwalo a ditaba tša Common Voice.
I do not agree
Ga ke dumele
If a clip is rejected by 2 people, it is released in a different subset of the dataset.
Ge e le gore pego e ganwe ke batho ba babedi (2), e tla lokollwa ka gare ga sehlopha se fapanego sa dathasete.
If the recording breaks up, or has crackles, reject unless the entirety of the text can still be heard.
Ge e ba kgatišo e ka arogana, goba e nale go kgakgatha, e ganwe ntle le gore mongwalo o feletšego e sa kwagala.
If the recording breaks up, or has crackles, reject unless the entirety of the text can still be heard.
Ge e ba kgatišo e ka arogana, goba e nale go kgakgatha, e ganwe ntle le gore mongwalo o feletšego e sa kwagala.
If the sentence does not meet the above criteria, click the "Reject" button on the left. If you are unsure about the sentence, you may also skip it and move on to the next one.
Ge e ba lefoko ga le fihlelele kritheriya ya ka godimo, tobetsa; Gana; setobetšwa sa ka go letshadi. Ge e ba o sa kgonthišege ka lefoko, o ka boela wa e taboga gomme o fetele go e latelago.
If the sentence does not meet the above criteria, click the "Reject" button. If you are unsure about the sentence, you may also skip it and move on to the next one.
Ge e ba lefoko ga le fihlelele kritheriya tša ka godimo, tobetsa setobetšwa sa “Gana”. Ge e ba o sa kgonthišege ka lefoko, o ka boela wa e taboga gomme o fetele go e latelago.
If the sentence meets the criteria, click the "Approve" button.
Ge e ba lefoko le lebane le kriteria, klika konotswana ya "Amogela"
If the sentence meets the criteria, click the "Approve" button on the right.
Ge mafoko a wela go kriteria, tobetša "Amogela" konotswana e leng thoko la go ja.
If you come across something that these guidelines don’t cover, please vote according to your best judgement. If you really can’t decide, use the skip button and go on to the next recording.
Ge e ba o kopana le selo seo dikeletšo tšeo di se ke tša akaretšwa, ka kgopelo kgetha go ya ka kahlolo ya gago e botse. Ge e ba ruri o ka se kgone go tšea sephetho, diriša setobetšwa sa go taboga gomme o fetele go kgatišo e latelago.
If you could <strong>hear them</strong> and <strong>understand them</strong>, it’s usually <strong>best to approve</strong>.
Ge o ka <strong>ba kwa</strong> le <strong>go ba kwešiša</strong>, gantši <strong>go kaone go amogela</strong>.
If you don’t hear from us within a couple of days,
reach out at <emailLink>{ $email }</emailLink>
and we’ll get back to you!
Ge e ba o sa kwe go tšwa go rena ka morago ga matšatši a mmalwa a tlago, e kgokagane le rena go <emailLink>{ $email }</emailLink>
gomme re tla boela go wena!
If you’re looking for tools to build ASR models, you can connect to other people in the community <discourseLink>on Discourse</discourseLink>.
Ge e ba o nyaka didirišwa tša go aga mehuta ya ASR, o ka ikopanya le batho ba bangwe ba sehlopha se <discourseLink>go Discourse</discourseLink>.
If your language is not listed here, you can request it through <languageProcessLink>this process</languageProcessLink>.
Ge e ba polelo ya gago ga se e lokeletšwe mo, o ka e kgopela ka go šomiša<languageProcessLink>tiragalo e</languageProcessLink>.
If you run out of sentences to review, please help us collect more sentences!
Ge e ba o felelwa ke mafoko ao o swanetše go a ahlola, re kgopela o re thuše go kgoboketše mafoko a mangwe!
If you run out of sentences to review, please help us collect more sentences!
Ge e ba o felelwa ke mafoko ao o swanetše go a ahlola, re kgopela o re thuše go kgoboketše mafoko a mangwe!
If you think the pronunciation makes it <strong>impossible to understand</strong>, or there’s a <strong>lot of background noise</strong>, or there are <strong>other people speaking too</strong>, then you should reject the clip. <accuracyLink>See more information in our accuracy criteria</accuracyLink>.
Ge e ba o nagana gore mokgwa wa go piletšo ya mantšu e dira gore go<strong> se ke gwa kgonega go kwešišana</strong>, goba go na <strong> lešata le lentši</strong>, goba go ba le<strong>batho ba bagwe bao ba bolelago</strong>, bjale o ka gana setsopolwa seo. <accuracyLink>Bona tshedimošo ye nngwe ya kritheriya ya go nepagala</accuracyLink>.
Igbo
Se-Igbo
I just created a personal goal for voice donation to #CommonVoice -- join me and help teach machines how real people speak { $link }
Ke fetša go hlangwa pakane ya go neela ka lentšu go #CommonVoice -- a re hlakane gomme re thuše go ruta metšhene gore na batho ba nnete ba bolela bjang { $link }
I'm okay with you handling this info as you explain in Mozilla's <privacyLink>Privacy Policy</privacyLink>
Ke dumelana le wena gore o tla swara tshedimošo e ge go ba o hlalositše go Mozilla <privacyLink>Molao wa Sephiri</privacyLink>
Includes email, username & demographic info, available right away
Di akaretša e-meile, leina la mošomiši le tshedimošo ya teka tekano ya palo ya gago, e hwetšagala gona bjale
Includes mp3s and related sentences, may take some time to prepare
E akaretša di-mp3 le mafoko a tswalanego, di ka tšea nako go di lokiša
Indonesian
Se-Indonesia
Information about the language
Tsebišo mabapi le leleme
In Progress
E Tšwelopeleng
Interested in learning more and contributing to the project?
O kgahlegele go ithuta go feta le go neela go projeke.
Interface Language
Lefelokgokgano la Polelo
Interlingua
Se-Interlingua
Interlingue
Se-Interlingue
Irish
Se-Irish
IsiNdebele (North)
Setebele (Leboa)
IsiNdebele (South)
Setebele (Borwa)
Is my account information public?
Afa tshedimošo ya akhaonte yaka e phatlaleditšwe?
<isoCodeLink>ISO Codes</isoCodeLink> if known
<isoCodeLink>ISO Khoutu</isoCodeLink> ge e ba e tsebja.
Is the clip valid?
Pego ye e kgonthišitšwe?
Is the goal of Common Voice to build a voice assistant?
Ke pakane ya Common Voice go aga mothutši wa lentšu?
Italian
Setaliana
It contains words or phrases that are hard to read or pronounce.
E nale mantšu goba dipolelwane tšeo go lego thata go di bala goba go di bitša.
It is written in a language different than what I’m speaking.
E ngwetšwe ka polelo e fapanego le polelo yeo ke e bolelago.
It's important that you reference where you found the public licence to avoid plagiarism and to allow for follow ups on <copyrightIssuesLink>copyright issues</copyrightIssuesLink>. Here are a few ways you can cite the work.
Go bohlokwa gore o bolele moo o hweditšego lengwalo la tumelelo ya setšhaba go phema go ngwalolla tsebišo le go e utswa le go dumelela go boela bakeng sa <copyrightIssuesLink>mathata a tša ditokelo tša molao</copyrightIssuesLink>. Ditsela tše sego kae tše mmalwa tša go tsopola mošomo.
Izhorian
Se-Izhorian
Japanese
Se-Japane
Join the Common Voice mailing list
Ngwadiša go lenaneo la baamogedi ba e-meile ba Common Voice
Just Unsure?
Ga se o kgonthišege?
Kabardian
Se-Kabardian
Kabyle
Se-Kabyle
Kaqchikel
Se-Kaqchikel
Karakalpak
Se-Karakalpak
Kazakh
Se-Kazakh
{ $kb }kb max
{ $kb }kb max
Keep
Boloka
Keep it up, record again <recordIcon></recordIcon>
Tšwela pele, rekhota pego gape <recordIcon></recordIcon>
Keep the recordings
Boloka dipego
Keep track of your progress and metrics across multiple languages.
Dula o tseba boemo bja gago bja tšwelopele le di metrike tša dipolelo tše dintši.
Keep track of your progress with a profile
Dula o tseba boemo bja tšwelopele ka goba le profaele
Keep track of your progress with a profile and help our voice data be more accurate.
Dula o tseba boemo bja tšwelopele yagago ka profaele le go thuša datha ya rena ya lentšu go nepagala kudu.
Khmer
Se-khmer
K'iche'
Se-K'iche'
Kikuyu
Se-kikuyu
Kinyarwanda
Se-Kinyarwanda
Komi-Zyrian
Se-Komi-Zyrian
Konkani (Devanagari)
Se-Konkani (Devanagari)
Korean
Se-Korea
Kurmanji Kurdish
Se-Kurmanji Kurdish
Kyrgyz
Se-Kyrgyz
Language
Polelo
Language
Leleme
Language
Leleme
Language Request
Kgopelo ya leleme
Language request successfully submitted, thank you.
Kgopelo ya leleme e rometšwe ka Katlego, rea leboga.
Languages
Maleme
Languages can have their <validationRulesLink>own validation rules</validationRulesLink> with additional requirements. If there is no specific validation file for a language, we are using the generic English rules.
Dipolelo di ka ba le <validationRulesLink>melao ya yona ya go netefatša </validationRulesLink> ka dinyakwa tše oketšegilego. Ge e ba go se na faele e netefaditšwego ya polelo e itseng, re šomiša melao e tlwaelegilego ya Seisemane.
Latvian
Se-Latvian
Launched
Hloma
Leaderboard Visibility
Kelo ya Baetapele e Bonagalago
Learn how to take part
Ithuthe ka go tšea karolo
Learn More
Ithute ka mo go oketšegilego
Leaving now means you’ll lose your changes
Go tšwa gona bjale go bolela gore o tlo lahlagelwa ke diphetogo tša gago
Leaving now means you'll lose your progress
Go tšwa gona bjale go bolela gore o tlo lahlagelwa ke tšwelopele ya gago
Let's Get Started
Ga re Thome
LibriSpeech is a corpus of approximately 1000 hours of 16Khz read English speech derived from read audiobooks from the LibriVox project.
LibriSpeech ke pokello ya dipego ya diiri tše ka bago tše 1000 tša 16Khz tše di balwago ka polelo ya Seisemane e tšwerego dipukeng tša odio tše balwago tša go tšwa morerong wa LibriVox.
License
Laesense
License: <licenseLink>CC-0</licenseLink>
Laesense: <licenseLink>CC-0</licenseLink>
License: <licenseLink>{ $license }</licenseLink>
Laesense: <licenseLink>{ $license }</licenseLink>
Ligurian
Se-Ligurian
Link Copied
Linke e kopotšwe
Links to websites that can help us understand the language
Di-linke tša wepesaete tše di ka re thušago go kwešiša polelo
Listen
Ekwa
Listen
Theeletša
Listening
Theeleditše
Listen-Queue
Lenaneotetelo la go theeletša
Lithuanian
Ma-Lithuanian
Loading…
Loading...
Loading rejected sentences…
Tsenya mafoko a ganwego
Loading sentences…
Mafoko a tsena
Loading your sentences…
Tsenya mafoko a gago...
Localization
Go lekanyeditša
Localized
Lekanyeditšwego
Login failed
Go tsena go ganne
Login Identity
Tsenya Boitšhupo
Log in or sign up to get started
Hloma akhaonte ya mošomiši goba ngwadiša go thoma
Login / Signup
Tsena/Hloma akhaonte ya mošomiši
Log In / Sign Up
Tsena/Hloma akhaonte ya mošomiši
Log In / Sign Up with { $company } email
Tsena/Hloma akhaonte ya mošomiši ka { $company } e-meile
Logout
Go tšwa
Log Out
Etšwa
Looks like there aren't any clips to listen to in this language. Help us fill the queue by recording some now.
Go bonagala nkare ga gona ditsopolwana tše o ka di theeletšago ka polelo e. Re thuše go tlatša lenaneotetelo ka go rekhota gona bjale.
Luganda
Se-Luganda
Luxembourgish
Se-Luxembourgish
Macedonian
Se-Macedonian
Maithili
Se-Maithili
Make sure the platform is recording before you start speaking, and that it only stops once you’re finished.
Kgonthišetša gore sethaleng se ya rekhota pele o thoma go bolela, le gore se ema fela ge o feditše.
Make sure the sentence meets the following criteria:
Netefatša gore lefoko le latela kriteria ye:
Make sure the sentence meets the following criteria:
Netefatša gore lefoko le latela kriteria ye:
Make your submitted data as rich as possible by providing some anonymous demographic data. We de-identify all demographic data before making it public.
Dira gore datha ya gago yeo o e rometšwego e be e ntši kamoo go ka kgonegago ka gona ka go nea ka datha ya go se tsebje. Re itlhaola datha ka moka ka palo ya batho pele re e phatlalatša.
Malagasy
Se-Malagasy
Malay
Se-Malay
Malayalam
Se-Malayalam
Male
Monna
Maltese
Se-Maltese
Manage Email Subscriptions
Taolo ya Dipeeletšo tša e-meile
Manage Subscriptions
Taolo ya Dipeeletšo
Mapudungun
Se-Mapudungun
Marathi
Se-Marathi
Maybe our <homepageLink>homepage</homepageLink> will help? To ask a question, please join the <matrixLink>Matrix community chat</matrixLink>, monitor site issues via <githubLink>GitHub</githubLink> or visit <discourseLink>our Discourse forums</discourseLink>.
Mohlomongwe <homepageLink>letlakaka la rena la mathomo inthaneteng</homepageLink> le tla thuša? Go botšiša potšišo, ka kgopelo ikgokaganye le<matrixLink>lefelo la poledišano ya setšhaba ya Matrix</matrixLink>, lebelediša mathata a saete go<githubLink>GitHub</githubLink>goba etela<discourseLink>diforamo la rena la tša Discourse</discourseLink>.
MB
MB
Meadow Mari
Se-Meadow Mari
Meetei Lon
Se-Meetei Lon
Message
Molaetša
[Mismatched content]
[Mismatched content]
Misreadings
Di balo tše fošagetšego
Missing an <strong>'s'</strong> at the end of a word.
Go hloka<strong>'s'</strong> mafelelong a lentšu.
Missing <strong>'A'</strong> or <strong>'The'</strong> at the beginning of the recording.
Go hloka <strong>'A'</strong> goba <strong>'The'</strong> mathomong a pego.
Missing the end of the last word by cutting off the recording too quickly.
Go hlaela ke pheletšo ya mafelelo a lentšu gobane o timile pego kapela kudu.
Mixed
Tswakane
Moksha
Se-Moksha
Mongolian
Se-Mongolia
More
Fetago
[More has been recorded than the required text]
[More has been recorded than the required text]
Mossi
Se-Mossi
Most of the data used by large companies isn’t available to the majority of people. We think that
stifles innovation. So we’ve launched Common Voice, a project to help make voice recognition open
and accessible to everyone.
Bogolo bja datha e dirišwago ke dikhampani tše dikgolo ga e hwetšagale ke bontši bja batho. Re nagana gore seo se šitiša go tšweletša mekgwa e mefsa. Ka gona re hlomile Common Voice, morero wa go thuša go dira gore temošo ya lentšu e bolege le go hwetšwa ke motho yo mongwe le yo mongwe.
Most of the data used by large companies isn’t available to the majority of people. We think
that stifles innovation. So we’ve launched Project Common Voice, a project to help make voice
recognition open to everyone.
Bogolo bja datha e dirišwago ke dikhampani tše dikgolo ga e hwetšagale ke bontši bja batho. Re nagana gore seo se šitiša go tšweletša mekgwa e mefsa. Ka gona re hlomile Morero wa Common Voice, morero wa go thuša go dira gore temošo ya lentšu e bolege le go hwetšwa ke motho yo mongwe le yo mongwe.
Most recordings are of people talking in their natural voice. You can accept the occasional non-standard recording that is shouted, whispered, or obviously delivered in a ‘dramatic’ voice. Please reject sung recordings and those using a computer-synthesized voice.
Dikgatišo tše dintši ke tša batho bao ba bolelago lentšu ka bona la tlhago. O ka amogela rekhoto e sa tlwaelegago ya nako le nako e goeletša, e sebela, goba ka go hlakodiša molaleng ka lentšu le ‘makatšago’. Ka kgopelo gana direkhoto tšeo di opelwago le tšeo di dirišago lentšu leo le dirilwego ka khomphutha.
Most speech databases are trained with an overrepresentation of certain demographics which results in a bias towards <articleLink>male and middle class</articleLink>. Accents and dialects that tend to be under-represented in training data sets are typically associated with groups of people who are already marginalised. Many machines also struggle to understand female voices.
This is why in our voice database we want variety!
Bontši bja dathabeise di eteletšwa go bontšha palo e feteletšego ya palo ya batho e sekametšego go lebana le<articleLink> banna le maemo a magareng</articleLink>. Mmolelo le mebolelwana ao a atišago go sekametšego go emelwa ka tlase go di datha sete di eteletšwe gantši di tswalaganywago le dihlopha tša batho bao ba šetšego ba boeletšwe ka thoko. Metšene e mentši le yona e thatafalelwa go kwešiša mantšu a basadi.
Ke kamoo polokelong ya rena ya lentšu re nyakang mehuta e fapa-fapanego!"
Mozilla Common Voice is an initiative to help teach machines how real people speak.
Mathomo a Mozilla Common Voice ke go thuša go ruta metšhene gore batho ba sebele ba bololela bjang.
Mozilla Common Voice is made possible by a diverse community of activists, linguists, data scientists, academics and software engineers from all over the world. The project is stewarded by the Mozilla Foundation.
Common Voice ya Mozilla gore go kgonegale e dirilwe ke setšhaba se fapa-fapanego sa bahlohleletši, ditsebi tša maleme, bo-rathutamahlale a datha, barutegi le baentšineare ya mananeo go tšwa lefaseng ka moka. Morero o tsamaiša ke Motheo wa Mozilla.
Mozilla is dedicated to keeping the web open and accessible for everyone. To do that we need to empower web creators through projects like Common Voice. As voice technologies proliferate beyond niche applications, we believe they must serve all users equally. That means investing in more languages and accommodating diverse accents and demographics when building and testing voice technologies. Common Voice is a public resource available to everyone and Mozilla teams and developers around the world are already using it on our own projects as well.
Mozilla e ikemišeditše go boloka mologo o botšwe ebile o hwetšagala ke batho bohle. Go dira bjalo re hloka go matlafatša babopi ba wepesaete ka mediro ya go swana le Common Voice. Ge ditheknolotši tša go hlaboša lentšu di dutše di ata go matlafatša tswalano ya ditišwa, re dumela gore di swanetše go hlankela badiriši ka moka go lekana. Seo se bolela gore go beeletša karolo ya maleme a mantši le go amogela mmelelo o fapanego gammogo le teka-tekano ya palo ya batho ge o aga le go tlhahloba ditheknolotši tša mantšu. Common Voice ke mothopo wa setšhaba wo o hwetšagala ke motho yo mongwe le yo mongwe, gomme dihlopha tša Mozilla le batšweletši lefaseng ka bophara ba šetše ba šomiša seo mananeo a bona.
Mozilla’s open source voice recognition engine Deep Speech can be used to build speech recognition applications. Read our <githubLink>Github overview</githubLink> or join the <discourseLink>DeepSpeech Discourse</discourseLink> to learn how to get started.
Mothopo o bolegilego wa Mozilla ya entšene ya go lemoša lentšu. Polelong e Tseneletšego e ka šomišwa go aga lenaneoditirišo ya tlhathollo ya mantšu. Bala <githubLink>kakaretšo ya rena ya Github</githubLink>goba ikgokagane le <discourseLink>Discourse ya Polelong e Tseneletšegogo </discourseLink>go ithuta kamoo o ka thomago ka gona.
Mutual accountability.
Kwešišano ya boikarabelo.
My Sentences
Mafoko a ka
My Sentences
Mafoko a ka
n
n
N
N
N/A
Ga e gona
Name
Leina
Names of your language
Maina a leleme la gago
Native Language
Polelo ya Geno
Native Language
Leleme la Geno
Need some help with accent?
O hloka thušo ka mokgwa wa go bolela?
Need some help with variants?
O hloka thušo ka mebolelo?
Need to download your data?
O hloka go tanelouta datha ya gago?
Nepali
Se-Nepali
New Language Launch
Maleme a ma Mpsha a Hloma
Next
Latelago
Next Goals: { $goal }
Dipakane tše Latelang: { $goal }
Nias
Se-Nias
No
Aowa
No
Aowa
[No ‘a’ in the original text]
[No ‘a’ in the original text]
No gravatar found for your email
Ga go gravator e hwetšagalago ya e-meile ya gago
No microphone found.
Maekrofone ga e hwetšagale.
No rejected sentences found!
Mafoko a ganwego a hwetšagale!
Northern Sotho
Sesotho sa Leboa
Norwegian Bokmål
Se-Norwegian Bokmål
Norwegian Nynorsk
Se-Norwegian Nynorsk
No sentences found!
Mafoko ga a hwetšagale!
No sentences to review. <addLink>Add more sentences now!</addLink>
Ga gona mafoko a swanetšego go hlahlobja. <addLink>Tlaleletša mafoko amangwe gabjale!</addLink>
Note: When set to 'Visible', this setting can be changed from the <profileLink>Profile page</profileLink>
Keletšo: Ge e abetšwe go "Bonagala", selaolatirišo se se ka fetolwa go <profileLink> letlakala ka profaele</profileLink>
Note: You will still need to select between Speak or Listen to change contribution type.
Keletšo: O tla hloka go kgetha magareng ga go Bolela goba go Theeletša go fetola mohuta wa go neela.
No Thanks
Aowa kea Leboga
No variant selected
Ga gona le mebolelo e kgethilwego
Now that you know a little bit more about Common Voice, why not try it out? Click on the microphone icon to start reading sentences aloud. <br/><br/>If you prefer to review other people's voice contributions, click on the play icon. You’ll help confirm that recordings match the sentences written on screen.
Ka ge bjale o tseba gonyenyane ka Common Voice, ke ka baka la eng o sa e leke? Tobetsa aekhoneng ya maekrofouno go thoma go bala mafoko ka go hlaboša lentšu.<br/><br/>Ge e ba o kgetha go ahlahla ditlaleletšo tša lentšu la batho ba bangwe, tobetsa leswao la go bapala. O tla thuša go netefatša gore direkhoto di dumelelana le mafoko ao a ngwadilwego mo sekerining.
Now you can donate your voice to help us build an open-source voice database that anyone can use
to make innovative apps for devices and the web. Read a sentence to help machines learn how real people speak. Check the work of other
contributors to improve the quality. It’s that simple!
Gona bjale o ka nea ka lentšu la gago go re thuša go aga dathabeise ya lentšu e bolegielgo yeo mang le mang a ka e šomiša go dira mananeotiriši a mafsa a didirišwa le wepesaete. Bala mafoko go thuša metšhene go ithuta gore na batho ba kgonthe ba bolela bjang. Lebediša mošomo wa baneedi ba babgwe go kaonefatša boemo. Go bonolo gakaalo!
Number of Voices
Palo ya Mantšu
Numbers. There should be no digits in the source text because they can cause problems when read aloud. The way a number is read depends on context and might introduce confusion in the dataset. For example, the number “2409” could be accurately read as both “twenty-four zero nine” and “two thousand four hundred nine”.
Dinomoro. Ga se gwa swanela go ba le dinoromo mangwalong a mothopo gobane di ka baka mathata ge o hlaboša lentšu. Tsela yeo palo e balwago ka gona e ithekgile ka kamano ya taba gomme e ka hlagiša gakantšhago go dathasete. Ka mohlala, palo ya “2409” e ka balwa ka nepo e le “mašome-pedi le metšo nne lefeela le senyane” le “dikete tše pedi le makgolo a mane le metšo e senyane”.
Occitan
Se-Occitan
Odia
Se-Odia
Off
Tima
Offensive language
Leleme le kgopišago
Offensive speech
Polelo e kodutlago
On
Tšhuma
Once logged in you can select your languages from the profile section.
Ge o hloma akhaonte ya mošomiši o ka kgetha polelo ya gago go karolo ya profaele.
On desktop devices you can contribute by downloading…
Go katšete Teskhethopo o ka neela ka go tanelouta…
One sentence per line
Lefoko le tee go mothaladi o mo tee
On his head he wore a beret.
O apere berete godimo ga tlogo ya gagwe.
On iOS please continue with Safari to enable recording…
Go iOS re kgopela o tšwelepele ka Safari go kgona go gatiša pego...
On the other hand, if you think that the reader has probably never come across the word before, and is simply making an incorrect guess at the pronunciation, please reject. If you are unsure, use the skip button.
Ka lehlakoreng le lengwe, ge e ba o nagana gore mohlomogwe mmadi ga se ka ka ka gahlana le lentšu pele, gomme o dira kgakanyo e fošagetšego ka mokgwa wa piletšo ya mantšu, ka kgopelo gana. Ge e ba o sa kgonthišege, šomiša setobetšwa sa go taboga.
Optionally join on our email list for updates and new information about the project.
Ka boikgethelo ikopanye go lenaneo la baamogedi ka e-meile ya rena go hwetša mpshafatšo le ka tshedimošo mabapi le projeke.
Optionally submitted demographic data (e.g. age, gender, language, and accent) will never be made public on your profile, and will not be linked to your account in the dataset. Individual audio clips will be associated with demographic data for the purpose of more accurate analysis - for example, a researcher might want to target a training model to a specific demographic segment.
Tekano ya palo ya batho yeo e rometšwego ka datha (mohlala. mengwaga, bong, leleme, le mmolelo) di ka se tša phatlalatšwa go profaele ya gago, gomme di ka se tswalaganywa le akhaonte ya gago ka dathaseteng. Ditsopolwana tša odio di tla tswalanywa le datha tša palo ya batho ka morero wa tsheka-tsheko o nepagetšego kudu – ka mohlala, monyakišiši a ka nyaka go lebiša mohlala wa eteletšo go karolo e itseng ya palo ya batho.
Other
Bjo bongwe
Other Language
Malema a mangwe
Other people validate those voice clips.
Batho ba bangwe ba kgonthiša di pego tša mantšu.
Other voice datasets…
Di-dathasete tše dingwe tša mantšu...
Other Voice Datasets
Di-dathasete tše dingwe tša mantšu
Our community manager will reach out to you to offer
extra support, and to let you know if there is
already a community working on your language
Molaodi wa rena wa setšhaba o tla ikgokaganya le wena go neelana ka thekgo e oketšegilego, le ge go tsebiša ge e ba go šetše go na le setšhaba se šomago ka polelo ya geno.
Our governance is founded on the pillars of:
Taolo ya rena e hwetšwago godimo ga dikokwane tša:
Our source text is made up of original contributor donations as well as dialogue from public domain movie scripts like <italic>It’s a Wonderful Life</italic>.
You can view our source sentences in this <githubLink>GitHub folder</githubLink>.
Mothopo wa rena wa mangwalo o dirilwe ka meneelo wa batsenyaletsogo ba mathomo gotee le dipoledišano tšeo di tšwago go mengwalong ya difilimi tša setšhaba tša go swana le <italic>It's a Wonderful Life</italic>.
O ka lebelela mafoko a rena a mothopo ka <githubLink>foltareng yeo ya GitHub</githubLink>. "
Overall Accuracy
Nepagetša ka Moka
Overall Hr. Total
Palo-moka ya Diiri Ka Kakaretšo
Overall project status: see how far we’ve come!
Bontši ba boemo bja projeke: bona gore o fihla bokgoleng bofe!
p
p
Papiamento (Aruba)
Se-Papiamento (Aruba)
Partners
Badirišani
Partnerships and networks
Tirišano le dinetweke
Partner with local organizations or individuals
Kgokagana le mokgatlo goba batho ba kgauswi
[Part of the text can’t be heard]
[Part of the text can’t be heard]
Pashto
Se-Pashto
Past recordings download requests
Dikgopelo tša go tanelouta dikgatišo tše fetilego
<p>Common Voice is a publicly available voice dataset, powered by the voices of volunteer contributors around the world. People who want to build voice applications can use the dataset to train machine learning models.</p>
<p>At present, most voice datasets are owned by companies, which stifles innovation. Voice datasets also underrepresent: non-English speakers, people of colour, disabled people, women and LGBTQIA+ people. This means that voice-enabled technology doesn’t work at all for many languages, and where it does work, it may not perform equally well for everyone. We want to change that by mobilising people everywhere to share their voice.</p>
Theknolotši ya go tlhathollo lentšu e fetola tsela yeo re dirišago metšhene ya yona, eupša di tsamaišo tšeo di lego gona mo nakong ye di bitša tšhelete e ntši ebile e nale thepa. Mozilla Common Voice ke magato a go dira gore mahlale a go tlhathollo ya lentšu a be kaone le gore e hwetšagale go batho bohle. Common Voice bolekego ya dathabeise ya lefase ka bophara ya mantšu a bonwelo e dumelela mang goba mang gore a kwetliše ga bonolo didirišwa tša go ba le lentšu ka polelo ye nngwe le ye nngwe. Ga re kgoboketše disampole tša mantšu feela ka maleme a go bolelwa ke batho kudu eupša gape le ao a nago le diboledi a badudi ba banyenyane. Go phatlalatša dathasete ya dintlha tšeo di fapanego tša mantšu go tla matlafatša batšweletši, bo rakgwebo le setšhaba go rarolla sekgoba seo ka bo bona.
People come and contribute their voices.
Batho batla gomme ba neela ka mantšu a bona.
Persian
Seperesia
Personal
Ya motho ka noši
Personal dashboards keep you up-to-date with individual and community progress.
Boitšhupo bja dashboard bo tla go tsebiša seo se diregago ka tšwelopele ya motho le ya naga.
<playbookLink>Find helpful guidance</playbookLink> on the entire Common Voice journey, from localisation to dataset usage, as well as how to connect with our community.
<playbookLink>Hwetša keletšo ya maleba </playbookLink> go leeto le feletšego la Common Voice, go tšwa mo o dulago gona go ya go tirišo ya datha, le gore le kopana bjang le setšhaba sa rena.
<playIcon></playIcon>Last one!
<playIcon></playIcon>Ya mafelelo!
Play/Stop
Bapala/ Ema
Please add a source.
Ka kgopelo tsenya mothopo.
Please add sentences.
Ka kgopelo tsenya mafoko.
Please check the <howToLink>guidelines</howToLink>.
Ka kgopelo lebelela<howToLink>ditaelo</howToLink.
Please confirm that these sentences are public domain.
Ka kgopelo netefatša gore mafoko a ke ya mafelong a batho bohle.
Please select a language.
Ka kgopelo kgetha polelo.
Please select a language to review sentences.
Ka kgopelo kgetha leleme leo o tla hlahlobago mafoko.
Please try again.
Re kgopela o leke gape.
Polish
Mopoliši
<pontoonAccountLink>Create an account</pontoonAccountLink> if you don’t have one. Then, choose your language <strong>(‘Team’)</strong> and then choose the project, <pontoonCvLink>Common Voice</pontoonCvLink>. There will be files to translate. Click on one, then it shows words in English and a box to translate them.
<pontoonAccountLink>Hlangwa akhaonte</pontoonAccountLink> ge e ba o se na yona. Ke moka, o kgetha polelo <strong> (‘Sehlopha’)</strong> ebe o kgetha morero, <pontoonCvLink>Common Voice</pontoonCvLink>. Go tla ba le difaele tše tla fetolelwa. klika go e le nngwe, ke moka e bontšha mantšu a Seisemane lepokisana go e fetolela.
Portuguese
Sepotokisi
Press play, listen & tell us: did they accurately speak the sentence below?
Tobetša raloka, theeletša o re butše: a fa ba boletše lefoko le leng ka tlase ga botse.
Press { shortcut-play-toggle } to toggle play mode
Tobetša { shortcut-play-toggle } go raloka mokgwa wa tšhielanyo
Privacy
Sephiri
Privacy
Bo Noši
Privacy Policy
Khupamarama ya Molao
Privacy, security and transparency.
Sephiri, tšhireletso le Ponatšago.
Pro
Thekgo
Profile
Pego
Profile
Profaele
Profile
Setshwanso
Profile information improves the audio data used in training speech recognition accuracy.
Tsebo ya profaele e kaonefatša datha ya odio e šomišwago go tlwaetša tlhathollo ya mantšu.
Profile: { $username }
Profaele: { $username }
Progress
Tšwelopele
Punctuation. Special symbols and punctuation should only be included when absolutely necessary. For example, an apostrophe is included in English words like “don’t” and “we’re” and should be included in the source text, but it’s unlikely you’ll ever need a special symbol like “@” or “#.”
Maswao-dikga. Maswao a kgethegilego le maswao-dikga a polelo e swanetše go akaretšwa feela ge go hlokagala. Ka mohlala, aphostrofi e akaretšwa mantšung a Seisemane bjalo ka “o seke” le “re” gomme e swanetše go tsentšhwa mothopong wa tša mongwalong, eupša ga go bonolo gore o hloke leswao le ikgethegilego bjalo ka “@” goba “#”.
Punjabi
Se-punjabi
Quechua Chanka
Se-Quechua Chanka
r
r
[‘Raised’ in English is always pronounced as one syllable, not two]
[‘Raised’ in English is always pronounced as one syllable, not two]
Reader Effects
Khuetšo ya mobadi
Reading contractions that aren't actually there, such as "We're" instead of "We are", or vice versa.
Go bala mantšu a fokoditšwego a se ng wona bjale ka "Re' go e na le "Rea', goba bjalo le bjalo.
Read More
Bala ka mo go Oketšegilego
Read more on our About page
Bala ka mo go oketšegilego letlakaleng la wepesaete ka rena.
Read our How-to if unsure how to attribute
Bala gore o ka ithuta bjang ge e ba o sa kgonthišege kamoo o ka tlhaodi ya faele
Read the sentence carefully - don’t miss, change or add words.
Bala lefoko ka kelohloko - o se ke wa tshela, le go fetola goba wa oketša mantšu.
{ $readyCount } sentences ready for submission!
{ $readyCount } mafoko a loketše go romelwa!
Ready to add your voice or lend your ear?
O ikemišeditše go tlaleletša lentšu goba go adima tsebe?
Ready to contribute?
O ikemeditše go neela?
Ready to do { $count } more?
Loketše go dira { $count } go feta?
Ready to donate your voice?
O ikemišeditše go neela ka lentšu la gago?
Ready to help validate sentences?
O ikemišeditše go kgonthiša mafoko?
reCAPTCHA is required if you want to proceed
reCAPTCHA e ya nyakega ge e ba o nyaka go tšwelapele
Receive emails such as challenge and goal reminders, progress updates, and newsletters about Common Voice.
Hwetša di-emiele tša go swana le diphadišano le dikgopotšo tša pakane, boemo bja tšwelopele le mangwalo a ditaba mabapi le Common Voice.
Recorded Clips
Ditsopolwa tšeo di Gatišitšwego
Recorded Hours
Diiri tše Begilwego
<recordIcon></recordIcon> Last one!
<recordIcon></recordIcon> Ya mafelelo!
[Recording cut off before the end of the last word]
[Recording cut off before the end of the last word]
Recordings
Dikgatišo
Recordings
Dikgatišo
Recording voice clips is an integral part of building our open dataset; some would say it's the fun part too.
Go gatiša dipego tša lentšu ke karolo e bohlokwa ya go aga dathasete ya rena e bulegilego; ba bangwe ba ka re ke karolo ya go thabiša le yona.
Record/Stop
Pego/Ema
Record your voice
Gatiša lentšu la gago
Refresh
Mpshafatša
Reject
Gana
Rejected Sentences
Mafoko a Ganwego
Rejected Sentences
Mafoko a Ganetšego
{ $rejectedSentences } rejected sentences.
{ $rejectedSentences } mafoko a ganwego.
Remember that we need permission to publish those sentences, so always ensure that the text belongs to the <wikipediaLink>public domain</wikipediaLink>. If there is no explicit indication, reach out to the person that the text belongs to and ask if they want to contribute their text to the Public Domain. If they want to, help them go through the <cc0WaiverLink>Contributions Agreement</cc0WaiverLink> process. If they do not want to contribute the text, then you are not allowed to use that text.
Gopola gore re hloka tumelelo ya go phatlalatša dipolelo tšeo, ke kamoo re kgonthišetša gore mengwalo ke ya<wikipediaLink> batho ka moka</wikipediaLink>. Ge goba a gona dipontšho molaleng, boledišana le motho yo temana yeo e leng ya gagwe gomme o botšiša gore e ba ba nyaka go neela temana ya bona Sebakeng sa Setšhaba. Ge e ba ba nyaka, ba thuše go tsena mogatong a <cc0WaiverLink>Tumelelano ya Meneelo</cc0WaiverLink> . Ge e ba ga ba nyake go tsena tiragalo ya temana, ke moka wa dumelelwa go diriša temana yeo.
remove
Tloša
Remove
Ntšha
Remove Avatar
Tloša Motšeakarolo
Report
Pego
Report Bugs
Bega Phošo
Report copyright issues
Bega mathata a tokelo ya ngwalollo
Report was passed successfully
Pego e fetile ka katlego
Request a Language
Kgopela polelo
Request recordings
Kgopela dipego
*required
*nyakega
Re-record
Rekhota gape
Re-record clip
Rekhota pego ya odio gape
Reset skipped sentences
Thoma leswa mafoko ka moka ao wa tshetšego
Return
Bušetša
Return here to edit your goal anytime.
Boela mo go rulaganya pakane ya gago neng le neng.
Return to Common Voice
Boela go Common Voice
Return to Common Voice Datasets
Boela go dathasete ya Common Voice
Return to Languages
Boela go Maleme
Review
Ahla-ahla
Review
Hlahloba
Review
Poeletšo
Review could not be saved. Please try again later.
Hlahlobo e paletšwe ke go bolokwa> Re kgopela o leke gape ka morago.
ⓘ Review Criteria
ⓘ Hlahloba Kriteria
Review Criteria
Hlahloba Kriteria
Reviewed sentences not submitted, are sure?
Mafoka a ahlotšweng a se a tsene, na o kgodišegile?
<reviewLink>Review now!</reviewLink>
<reviewLink>Hlahloba bjale!</reviewLink>
Review & re-record clips here as you go
Hlahloba o gatiše pego gape mo ge o dutše o tšwelapele
Review & re-record clips if needed
Hlahloba o dira di pego gape ge e ba go hlokagala
Review sentences
Poeletšo ya mafoko
Review Sentences
Ahlola Mafoko
Review Sentences
Hlahloba Mafoko
Review & Submit
Hlahloba o Romele
Romanian
Se-Romania
Romansh Sursilvan
Se-Romansh Sursilvan
Romansh Vallader
Se-Romansh Vallader
Runyankole
Se-Runyankole
Russian
Se-Russia
s
s
S
S
Sakha
Se-Sakha
Santali (Ol Chiki)
Se-Santali (Ol Chiki)
Saraiki
Se-Saraiki
Sardinian
Se-Sardinia
Save
Boloka
Saved
Bolokegile
Search
Nyakiša
Search for answers
Nyaka dikarabo
Search for them on the Internet
Di nyake Inthaneteng
See how your progress compares to other contributors all over the world.
Bona kamoo tšwelopele ya gago e bapišwago le baneedi ba bangwe lefaseng ka bophara.
See Less
Bona ga Nyenyane
See More
Bona go Feta
Select a Language...
Kgetha Leleme...
Selected
Kgethilwego
Select Language
Kgetha Leleme
Send sign up link
Romela linke ya go ngwadiša
Sentence Collection
Kgobokanyo ya Mafoko
Sentence Collection
Kgobokanyo ya Mafoko
<sentenceCollectorFragment>See more on sentence collection</sentenceCollectorFragment>.
<sentenceCollectorFragment>Bona go feta ka kgobokanyo ya mafoko </sentenceCollectorFragment>.
<sentenceCollectorVideo>Watch our guide on using the Sentence Collector</sentenceCollectorVideo>.
<sentenceCollectorVideo>Lebelela ditaetši tša rena go tseba go šomiša Molekgetho wa Mafoko </sentenceCollectorVideo>.
Sentence must contain between 1 and 14 (inclusive) words
Lefoko le be le mantšu a go tloga go 1 go ya go 14 (ka go akaretša)
Sentences
Mafoko
{ $sentences } added by you
{ $sentences } a tsentšwego ke wena
Sentences are being uploaded. This can take several minutes depending on the number of sentences added. Please do not close this website.
Mafoko a tsenywa. Se se ka tšea metsotso e mmalwa go ya ka gore ke mafoko a ma kae a tsenywago. Re kgopela o se ke wa tswalela wepesaete.
Sentences are collected for people to read aloud.
Mafoko a kgobokeditšwe gore batho ba balele godimo.
Sentence should not contain abbreviations
Mafoko a se ke a ba le dikhutsofatšo
Sentence should not contain numbers
Mafoko a se ke a ba le dinomoro
Sentence should not contain symbols
Mafoko a se ke a ba le maswao
Sentences must be reviewed and accepted by two people to be included in Common Voice. You create guidelines for your language <languageGuidelinesLink>here</languageGuidelinesLink>. Sentences must be in the <licenseLink>public domain</licenseLink> and shorter than 15 words. You can ask the owner of a text to make it <waiverLink>CC0 using our waiver process</waiverLink>, and send to us on commonvoice@mozilla.com
"Mafoko a swanela go hlahlobja le go amagelwa ke batho ba babedi gore di tsenywe mo go Common Voice. O hlangwa tlhahlo mabapi le tša polelo ya geno
<languageGuidelinesLink> mo</languageGuidelinesLink>. Mafoko a swanetše go ba<licenseLink> lefelong la batho bohle</licenseLink> gomme le be kopana go fetwa/ka tlase ga mantšu a 15. O ka kgopela mong wa mongwalo go e dira <waiverLink>CC0 o šomiša mokgwa wa rena wa go tlogela</waiverLink>, gomme o re romele go commonvoice@mozilla.com"
Sentences not submitted, are you sure you want to leave?
Mafoko ga a romelwa, na o na le bonnete ba gore o nyaka go tšwa?
Sentences not submitted, are you sure you want to leave?
Ga se o tsenya mafoko, na o kgonthišiša gore o nyaka go tšwa?
{ $sentences } sentences failed
{ $sentences } mafoko a paletšwego
{ $sentences } sentences reviewed. Thank you!
{ $sentences } mafoko a šetše a hlahlobotšwe. Rea leboga!
Sentences that have been validated are exported on Wednesdays, and are available after the next release.
Mafoko a kgonthišitšweng a romelwa ka laboraro, a tle a be gona ka morago ga tše tlo lokollwa tše latelago.
Sentence text
Mafoko a lengwalo
Serbian
Se-Serbia
Set a goal
Ipeele pakane
Set my visibility
Bontšha ponagalo yaka
Setswana
Setswana
Settings
Dilaolatirišo
Settings
Dilaolatirišo
Share Common Voice
Abelana ka Common Voice
Share my goal
Abelana ka pakane yaka
Share your clip
Abelana ka pego ya gago
Share your { $count } Clip Daily Goal for { $type }
Abelana ka { $count } Pakane ya gago ya Letšatši ya Pego ya { $type }
Share your { $count } Clip Weekly Goal for { $type }
Abelana ka { $count } Pakane ya gago ya Beke ya Pego ya { $type }
Shilha
Se-Shilha
Shortcuts
Kgaoletšo
[Should be ‘dinosaurs’]
[Should be ‘dinosaurs’]
[Should be “We are”]
[Should be “We are”]
Show all skipped sentences again
Bontšha mafoko ka moka ao wa tshitšeng gape
Show my ranking
Bontšha boemo bjaka
Sicilian
Se-Sisilia
Sign up
Ngwadiša
Sign up for an account
Ngwadiša go hwetša akhaonte
Sign up for Common Voice newsletters, goal reminders and progress updates
Ngwadiša go lengwalo la ditaba la Common Voice, dikgopotšo tša pakane le di tsebišo ka tšwelopele
sign up for email updates
ngwadiša go hwetša mpshafatšo go e-meile
Sign up for { $lang } updates:
Ngwadiša { $lang } go mpshafatša:
Single ZIP file containing
Faele e tee ya ZIP e nang le
Sinhala
Se-Sinhala
Siswati
Seswati
Size
Bogolo
Size
Bogolo bja yona
Skip
Tshela
Skip
Taboga
Skip Submission Feedback
Tshela Tsebišo ya Tshedimošo
Slovak
Se-Slovak
Slovenian
Se-Slovenia
Social media
Poledišano ya inthaneteng
Somali
Se-Somali
Some languages have enormous variation in grammar, vocabulary and pronunciation. For this reason, we are <ctaLink>introducing ‘Variants’</ctaLink> in 2022. This gives communities a way to distinguish their languages within the larger dataset.
Dipolelo tše dingwe di na le phapano e kgolo ya popolelo, tlotlontšu le mmiletšo ya mantšu. Ka lona lebaka le, re hlagiša <ctaLink> ‘Mehuta e fapanego’ </ctaLink> ka 2022. Se se nea setšhaba mokgwa wa go kgetholla maleme bona ka gare ga dathasete e kgolwanyane.
Someone asks for a language to be added.
Go nale motho wo a kgopelang gore polelo e okeletšwe.
Something went wrong with reCAPTCHA. Please try again.
Go nale se phošo e diregilego ka reCAPTCHA. Re kgopela o leke gape.
Sorbian, Lower
Se-Sorbian, Lower
Sorbian, Upper
Se-Sorbian, Upper
Sorry, Common Voice is running slowly. Thanks for your interest.
Maswabi, Common Voice e šoma ka go nanya. Re leboga kgahlego ya gago.
Sorry, something went wrong
Maswabi, go nale phošo e diregilego
Source: { $sentenceSource }
Mothopo: { $sentenceSource }
Source: { $source }
Mothopo: { $source }
Southern Sotho
Sesotho
Spanish
Sepaniši
Speak
Bolela
Speak
Bolela
Speakers
Diboledi
Speaking
Go bolela
Speaking and Listening
Go Bolela le go Theeletša
Speak in your normal voice! The way you speak is welcome here - we want your accent as it is, and we want your usual volume, style and intonation.
Bolela ka lentšu la gago le le tlwaelegilego! Tsela yeo o bolelago ka yona e amogelegile mo - re nyaka mmolelo wa gago kamoo o lego ka gona, gomme re nyaka modumo wa gago wa lentšu o tlwaelegilego, setaele le segalo sa lentšu.
Speak now
Bjale Bolela
Speak up, contribute here!
Bolela, neela mo!
<speechBlogLink>Get Started with Speech Recognition</speechBlogLink>
<speechBlogLink>Thoma ka Tlhathollo ya Mantšu </speechBlogLink>
Speech is often the most natural way we communicate with each other and voice technologies are bringing that convenience to our computers and mobile devices. We want to empower developers to build amazing voice recognition applications like real-time translators and voice-enabled digital assistants. But right now most of the voice data required to build these kinds of apps is expensive and proprietary. We hope the Common Voice dataset will give developers what they need to innovate and make speech technology available in their own language.
To make voice recognition even more universal, we're collecting voice samples in widely spoken languages as well as those with a smaller population of speakers often underserved by commercial speech recognition services. Publishing a diverse dataset of voices will empower developers, entrepreneurs, and entire speech communities to address this gap themselves.
Gantši polelo ke tsela ya tlhago yeo re boledišanago ka yona ebile ditheknolotši tša lentšu di tliša boiketlo bjo go dikhomphutha tša rena le didirišwa sa elekroniki. Re nyaka go matlafatša bahlagiši go aga didirišwa tše di botse tša go kwa lentšu bjalo ka bafetoledi ba nako ya kgonthe le bathuši ba ditithale tša go hlaboša mantšu. Eupša ga bjale bogolo bja datha ya lentšu yeo e hlokagalago go aga mehuta ye ya disebedišwa di bitša tšhelete ya godimo ebile e na le thepa. Re holofela gore Datha-set ya Common Voice e tla nea batho ba go hlagiša seo ba se hlokago go thoma le go dira gore theknolotši ya polelo e hwetšegale ka maleme a bona.
Go dira gore tlhathollo ya mantšu e lemogwe lefaseng ka bophara, re kgoboketša disampole tša mantšu ka dipolelo tše di bolelwago ke batho ba bantši kudu le baboledi ba dihlopha tše nyenyane tšeo di sa hlokomelegilego ke ditshebeletšo tša dikgwebo. Go phatlalatša pokelelo ya datha tšeo di fapanego go tla matlafatša ba hlagišo, bo rakgwebo, le setšhaba ka moka sa dipolelo go rarolla sekgoba se ka bobona.
Speech-to-text (STT)
Sefetošetšadigwalwa mantšung (STT)
Speech-to-text (STT) technologies convert voice data into text.
Dithekinolotši tša Sefetošetšadigwalwa mantšung (STT) di fetola di datha tša lentšu go mangwalo.
Split into { $archiveCount } ZIP files containing
Aroganya go { $archiveCount } di faele tša ZIP tše swerego
Splits
Aroganago
Start recording
Thoma pego
Start typing to describe your accent
Thoma go thaepa go hlalosa mokgwa wa gago wa go bolela
Statistics
Dipego
Statistics
Dipalo-palo
Stats
Di-Stats
Status Page
Boemo bja Letlakala
Streaks
Mothaledi
<strong>[Crackle]</strong> giant dinosaurs of <strong>[crackle]</strong> -riassic.
<strong>[Crackle]</strong> dinosaurs e kgolo ya <strong>[crackle]</strong> -riassic.
<strong>Do not reject clips where the speaker ‘has an accent’ that is different to your own</strong> - this is important for voice recognition to work better for everyone.
<strong>O se ke wa gana ditsopolwa moo seboledi ‘seo se nago le mmolelo’ woo o fapanego le wa gago</strong> - seo se bohlokwa gore temogo ya lentšu e šome kaone bakeng sa batho bohle.
<strong>Get creative!</strong> Your language community will be unique, and these are just a few ways to get started.
<strong>Diriša bohlale!!</strong> Polelo ya geno e tla ba moswananoši, tše ke ditsela tše mmalwa tša go thoma.
<strong>Sentence Collector</strong> is a tool for collecting and validating public domain sentences. You will <scAccountLink>need to create an account</scAccountLink>. Then, add your language to your <strong>Profile</strong>. You can <strong>Add</strong> sentences or <strong>Review</strong> previously-added sentences.
<strong>Molekgetho wa Mafoko</strong> ke sedirišwa sa go kgoboketša le go netefatša dipolelo tša setšhaba. O tla hloka<scAccountLink>go hlangwa akhaonte</scAccountLink>. Ke moka, tsentšha polelo ya gago go <strong>profaeleng ya gago</strong>. O ka <strong>tsenya mafoko</strong> goba wa <strong>ahla-ahla </strong> mafoko a tsentšhitšego nakong e fetilego.
<strong>[Sneeze]</strong> The giant dinosaurs of the <strong>[cough]</strong> Triassic.
<strong>[Sneeze]</strong> Dinosaur e kgolo ya <strong>[cough]</strong> Triassic.
Submission: { $batchId }
Go tsenya tshedimošo: { $batchId }
Submission Error
Phošo ya go Romela
Submit
Romela
Submit
Tsenya
Submit
Lokela
Submit a report
Tliša pego
Submit clips
Tsenya dipego
Submit clips
Romela ditsopolwa
Submitted sentences. { $duplicates } sentences were rejected as duplicates.
Mafoko a rometšego. { $duplicates } mafoko a ile a ganwa ka ge e le dilo tše pedi tša go swana.
Subscribe
Ngwadiša
Success
Katlego
Success, profile created!
Katlego, profaele e hlangwe!
Swahili
Se-Swahili
Swedish
Se-Swedish
Swipe right to approve the sentence. Swipe left to reject it. Swipe up to skip it. <strong>Do not forget to submit your review!</strong>
Swaepa ka letsogong la letona go amogela lefoko. Swaepela ka go le tshadi go e gana. Swaepela godimo go e taboga. <strong>Se ke wa lebala go romela maikutlo a gago!</strong>
Syriac
Se-Syriac
Tagalog
Se-Tagalog
Tahitian
Se-Tahitia
Taiwanese (Minnan)
Se-Taiwanese (Minnan)
Tajik
Se-Tajik
Taking several attempts to read a word.
Tšea maiteko a mmalwa a go bala lentšu.
Tamil
Se-Tamil
Tap
Kgwatha
Tatar
Se-Tatar
Tatoeba is a large database of sentences, translations, and spoken audio for use in language learning. This download contains spoken English recorded by their community.
Tatoeba ke dathabeise e kgolo ya mafoko, diphetolelo, le mantšu a go bolelwago a odio bakeng sa go dirišwa go ithuta polelo. Tanelouto e e na le Seisemane se se bolelwago sa go rekhotiwa ke setšhaba sa gabo rena.
TED-LIUM Corpus
TED-LIUM Corpus
Tell us what you'd like to download:
Re botše gore o ka rata go tanelouta eng:
Telugu
Se-Telugu
Terms
Mantšu
Terms
Leina
Thai
Se-Thai
Thanks for confirming your account, now let's build your profile.
Rea leboga go tiišetša akhoante ya gago, bjale a re age profaele ya gago.
Thanks to contributions from over 259k people in over 50 languages, this data is being used to train speech-enabled applications to better respond to the human voice.
Ka baka la thušo ya batsenyaletsogo go tšwa go batho ba fetago 259k ka maleme a fetago 50, datha tšeo di dirišwa go eteletša di dirišwa tše di kgonago go bolela go arabela gabotse lentšung la motho.
Thank you for recording!<lineBreak></lineBreak>Now review and submit your clips below.
Re leboga pego ya gago!<lineBreak></lineBreak> Ga bjale hlahloba o tsenye pego tša gago mo tlase.
Thank you for your interest in contributing to { $lang }. We work hard to get every language ready for launch and keep
the teams updated via email. If you want to contribute, please provide your email below.
Ke leboga kgahlego ya gago ya go neela go{ $lang }. Re šoma ka matla go dira gore polelo ye nngwe le ye nngwe e itokišetše go thomiša le go dula dihlopha
beakanywa ka e-meile. Ge e ba o nyaka go neela, ka kgopelo nea ka e-meile ya gago ka mo tlase.
Thank you! You’ve sent a new language enquiry
Rea leboga! O rometše khwiri e mpsha ka polelo.
The best sources you can look for are podcasts, transcripts, movie scripts and anything that potential can contain everyday conversations.
Methopo e kaonekaone yeo o ka e nyakago ke di-podcast, di-transcript, mengwalong ya difilimi le eng goba eng e ka bago gona go polediišano tša letšatši le letšatši.
The bumblebee sped by.
Nose e ile ya feta ka lebelo.
The Clip Graveyard consists of voice clips that didn't make it into the Common Voice dataset. Just like the dataset, the Clip Graveyard is available for download.
We would like to thank the following people and organizations for their help with the project:
Setsopolwa sa Graveyard se na le dirakolo tša lentšu tše di sa fihlaho go dathasete ya Common Voice. Bjalo ka dathasete, setsopolwa sa Graveyard ya hwetšagala bakeng sa go tanelouta. Re ka rata go leboga batho le mekgatlo e latelago ka thušo ya bona morerong o:
The clip has disrespectful or offensive language.
Setsopolwa se na le polelo e hlokago hlompho goba e kgopišago.
The Common Voice dataset complements Mozilla’s open source voice recognition engine Deep Speech. The first version of Deep Speech was released in November 2017 and has continued to evolve ever since. Together with the Common Voice dataset, we believe this open source voice recognition technology should be available to everybody. It’s our hope these technologies will enable developers to build a wave of innovative products and services.
Dathasete ya Common Voice e dumelelana le entšene Deep South ya Mozilla e bolegilego ya go tlhatholla Polelong. Phetolelo ya mathomo ya polelong e tseneletšego ya Deep Speech e lokoloktšwe ka November 2017 gomme esale e tšwela pele ka go fetoga go tloga ka nako yeo. Gotee le dathasete ya Common Voice, re dumela gore theknolotši e e bulegilego go lemoša lentšu e swanetše go hwetšwa ke motho yo mongwe le yo mongwe. Re holofela gore ditheknolotši tše di tla thuša batšweletši go aga ditšweletšwa ke ditirelo tše di mpsha tše dintši.
The Common Voice Dataset contains hundreds of thousands of voice samples that help developers build voice recognition tools.
Dathasete ya Common Voice e nale di pego tše makgolo le dikete ya mehlala ya manntšu e thušago ba batšweletša ba agang sedirišwa sa temogo ya lentšu.
The Common Voice dataset is an open and publicly available resource that can be used to train a wide variety of speech-enabled applications. To protect the security of our contributors, we ask everyone who downloads the Common Voice dataset to respect contributors’ privacy.
All voice clips in the dataset are scrubbed of personally identifying information. When you download the dataset, you agree to not attempt to determine the identity of any contributor. That means you cannot try to link information in the dataset to a contributor’s personal information. You may, however, use the dataset to train speech recognition, speaker recognition, or other applications, by, for instance, linking information in the dataset to other information already in the dataset.
"The Common Voice dathasete ke se dirišwa seo se bolegilego ebile se hwetšagala phatlalatša se ka be rekišwa go eteletšwa mehuta e fapanego ya disebedišwa tša go kgona go bolela. Go sireletša tšhireletšo baneedi ba rena, re kgopela motho yo mongwe le yo mongwe yo a taoneloutago Common Voice go hlompha sephiring sa baneedi.
Dikarolo ka moka tša lentšu go dathasete di hlakotšwe boitsebišo bja motho. Ge o taonelouta dathasete, o dumela gore o ka se leke go tseba gore na mothuši ofe goba o fe ke mang. Seo se bolela gore o ka se leke go kgokaganya tsebišo ka dathasete ka boitsebišo bja mothuši. Lege go le bjalo, o ka šomiša dathasete go kwetliša tlhathollo ya mantšu, tlhathollo ya seboledi, goba ditsebišo tše dingwe, ka mohlala, go kgokaganya tsebišo goba boitsebišo bjo bogwe bjo bo šetšeng bo le gona ka dathasete. "
The Common Voice dataset is available for download under the <licenseLink>CC0</licenseLink> license on <datasetLink>our Datasets page</datasetLink>. You can also download several other publicly available datasets from the same page.
Dathasete ya Common Voice e hwetšagala bakeng sa go tanelouta ka tlase ga <licenseLink>CC0</licenseLink>laesense ya <datasetLink> letlakaleng la rena la Di-dathasete</datasetLink>. O ka tanelouta didirišwa tše dingwe tše mmalwa tšeo di hwetšagalago phatlalatša letlakaleng la go swana.
The Common Voice Sentence Collector has collected { $sentenceCount } sentences in { $languageCount } languages!
Molekgetho wa Mafoko a Common Voice o kgobokeditše { $sentenceCount } mafoko a { $languageCount } a maleme!
The count of voice recording hours that have been validated by 2 out of 3 users with a vote of “Yes”. These mark progress toward the overall project 10k hours goal.
Palo ya diiri tša go rekhota lentšu tše netefaditšwego ke bašomiši ba 2 go ba 3 ka bouto ya “Ee”. Tše di swaya tšwelopele go fihlela le go lebana le modiro wa kakaretšo ya diiri tše 10K.
The count of voice recording hours we have collected so far.
Dibalwa tša di pego tše re di kgobokeditšego go fihla ga bjale.
The giant dinosaur of the Triassic.
Dinosaur e kgolo ya Triassic.
The giant dinosaurs of the Triassi-.
Dinosaur e kgolo ya Triassi-.
The giant dinosaurs of the Triassic.
Dinosaur e kgolo ya Triassic.
The giant dinosaurs of the Triassic. <strong>[read by one voice]</strong>
Dinosaur e kgolo ya Trassic. <strong>[read by one voice]</strong>
The giant dinosaurs of the Triassic. Yes.
Dinosaur e kgolo ya Triassic. Ee.
The giant dino <strong>[cough]</strong> the Triassic.
Dino e kgolo <strong>[cough]</strong> di Triassic.
The goal of the Common Voice dataset is to enable anyone in the world to build speech recognition, speaker recognition, or any other type of application that requires voice data. A voice assistant is just one of many types of applications you could use the dataset to build.
Boikemišetšo bja dathasete ya Common Voice ke go thuša motho lege e le ofe goba ofe lefaseng go aga tlhathollo ya mantšu , tlhathollo ya diboledi, goba mohuta ofe goba ofe wa sedirišwa se hlokago datha ya lentšu. Mothuši wa lentšu ke e nngwe feela ya mehuta e mentši ya didirišwa tšeo di šomišago go aga dathasete.
The multi-language version of the Common Voice dataset is currently undergoing community supported bundling and cleaning. If you would like to help us bring Common Voice to new languages, go check out the <sentenceCollectorLink>Sentence Collection Tool</sentenceCollectorLink> for adding new sentences to the dataset, and Mozilla <pontoonLink>Pontoon</pontoonLink> for translating the website itself. New languages are added to Common Voice for voice contribution when 5000 approved sentences have been collected.
Mohuta wa maleme a mantši dathasete ya Common Voice e dutše e tšwelapele go hlakana le go hlwekiša. Ge e ba o rata go re thuša go tliša Common Voice malemeng a ma mpsha, eya o lebelela <sentenceCollectorLink>Sedirišwa sa go Kgobokanetša Mafoko</sentenceCollectorLink> bakeng sa go oketša mafoko a ma mpsha go dathasete le Mozilla <pontoonLink>Pontoon</pontoonLink> bakeng sa go fetolela wepesaete ya bosona. Dipolelo tše mpsha di tsenywa go Common Voice bakeng sa tlaleletšo ya lentšu ga go se go kgoboketšego mafoko a amogetšwego a 5000.
The number of recordings and which languages you contribute to will be public.
Palo ya dipego le dipolelo tšeo o di neetšego di tla phatlalatšwa.
The process by which a contributor’s profile information is obscured from their donated voice clips when packaged for download as a part of the dataset.
Nakong yeo ba tshedimošo ya profaele ya moneedi e širwa ke dipego tša go neealwa ge di phuthetšwe go tanelouto ka ge di le karolo ya datha-set.
There are a lot of public organizations that might want to collaborate and already have a lot of texts they can donate as public domain. Reach out to local universities, Governments and open source organizations to talk about the project and ask for their help.
Go na le mekgatlo e mentši ya setšhaba e ka nyakago go šomišana gomme e šetše e na le ditemana tše dintši tšeo ba ka neelago ka tšona go etša go lefelong la batho bohle. Ikgokaganye le diyunibesithi tša lefelong leo, Mebušo le mekgatlo e bulegilego go bolela ka morero le go kgopela thušo ya bona.
There are lots of ways to think about language. For the purposes of speech recognition models, Common Voice suggests focussing on ‘mutual intelligibility’, or ‘can speakers of this language mostly understand one another if they try to?’
Go na le ditsela tše dintši tše o ka naganago ka polelo. Ka morero wa mehlala wa tlhathollo ya mantšu, Common Voice e bontšha maikutlo tlhokomelo go ‘kwešišano ya bohlale’, goba ‘na diboledi tša dipolelo tše ba ka kwešišanago gantši ge ba ka leka go dira bjalo?’.
The recording was too long.
Pego e be e le etelele kudu.
The recording was too quiet.
Pego e be e homotše kudu.
The recording was too short.
Pego e be e fokotšegile kudu.
There will be natural variations in volume between readers. Reject only if the volume is so high that the recording breaks up, or (more commonly) if it is so low that you can’t hear what is being said without reference to the written text.
Go tla ba le phaphano ya tlhago ya modumo magareng ga babadi. Gana feela ge e ba modumo o le godimo kudu moo rekhotilwego e sa kwale botse, goba (seo se tlwaelegilego kudu) ge e ba e le tlase moo e sa kwale gore go boletšwe eng ka ntle le go lebelela mo go ngwadilego.
These languages are currently under community development. <localizationGlossaryLink>Website localization</localizationGlossaryLink> and <sentenceCollectionGlossaryLink>sentence collection</sentenceCollectionGlossaryLink> is needed to launch.
Maleme a gona bjale a ka tlase ga tšwelopele ya setšhaba. <localizationGlossaryLink>wepesaeteng ya kgauswi</localizationGlossaryLink> le <sentenceCollectionGlossaryLink> kgoboketšo ya mantšu</sentenceCollectionGlossaryLink> e nyakega go thoma.
The selected file is too large
Faele e kgethilweng e kgolo kudu
The Sentence Collector is part of <commonVoiceLink>Common Voice</commonVoiceLink>. It allows contributors to collect and validate sentences created by the community. You can use this tool also to import and clean-up small-to-medium-sized public domain corpus you have found or collected. All sentences need to be Public Domain. Approved sentences are exported every week to the Common Voice repository and are released on the Common Voice website on every new deployment.
Molekgetho wa Mafoko ke karolo ya<commonVoiceLink> Common Voice</commonVoiceLink>. E dumelela gore baneedi go kgoboketša le go netefatša mafoko a hlangwego ke setšhaba. O ka šomiša sedirišwa se go tsenya le go hlwekiša sehlopha sa setšhaba se senyenyane go iša bogareng bja seo se hwetšalago goba o se kgoboketšego. Mafoko ka moka di swanetše go ba Lefelong la Setšhaba. Mafoko ao a amogetšwego a romelwa beke e nngwe le e nngwe polokelong ya Common Voice gomme di lokolwa wepesaeteng ya Common Voice ga go romelwa selo sengwe le se sengwe se sefsa.
The sentence has a grammatical or spelling error.
Lefoko le nale phošo ya popopolelo le ya go peletwa.
The sentence has disrespectful or offensive language.
Lefoko le nale mantšu a hlokago hlompho goba a kgopišago.
The sentence must be grammatically correct.
Lefoko le swanetše go nepagala ka popopolelo.
The sentence must be grammatically correct.
Lefoko le swanetše go nepagala ka popopolelo.
The sentence must be speakable.
Lefoko le swanetše gore le kgone go bolelwa.
The sentence must be speakable.
Lefoko le swanetše gore le kgone go bolelwa.
The sentence must be spelled correctly.
Lefoko le swanetše go ngwalwa ka nepo.
The sentence must be spelled correctly.
Lefoko le swanetše go ngwalwa ka nepo.
The site is ready to be launched when it reaches 75% completion.
Saete e loketše go thomiša ge e fihla pheletšong ya 75%.
The site will be back up as soon as possible. For the latest information, please join the <matrixLink>Matrix community chat</matrixLink> or visit <githubLink>GitHub</githubLink> or <discourseLink>our Discourse forums</discourseLink> to submit and monitor site experience issues.
Saete e tla thekgwa kapejana ka mo go ka kgonegago. Bakeng sa tsebišo ya morago-rago bjale, ka kgopelo ikgokaganye le <matrixLink>poledišano ya setšhaba ya Matrix</matrixLink> goba etela <githubLink>GitHub</githubLink> goba <discourseLink>diforamo tša rena tša Discourse</discourseLink> go romela le go šetša tokollo ya boiphihlelo bja saete.
The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website.
TE-LIUM coepus e dirilwe go odio tša go bolela le direkhoto tša yona di gona go wepesaete ya TEB.
The website text is translated into that language.
Mengwalo ya wepesaete e fetoletšwe ka polelo yeo.
This is approximately the number of hours required to train a production speech-to-text system.
Tše ke dinomoro tša diiri tše hlokalago go ruta tšweletšo ya lenaneo la sefetošetšadigwalwa mantšung.
This is a use case driven segment containing data to power spoken digit recognition and yes / no detection.
Se ke karolo e hlohloletšwago ke seripa sa go ba le datha go matlafatša temogo ya dinomoro tše bolelwago le go bonwa ga ee/aowa.
This is our process for translating and adapting our content for many locales (languages).
Ye ke tiragalo ya rena ya go fetola le go itumelelanya le diteng tša rena tša bontši bja (dipolelo) tša kgauswi.
This page gives you an overview of all your submitted sentences. You may also delete already submitted sentences if needed, by marking the checkbox next to it and clicking on "Delete selected sentences" at the bottom. Please only remove sentences if absolutely necessary, for example if you noticed after the fact that a sentence is copyright protected.
Letlakaleng le le go nea kakaretšo ya mafoko ka moka ao o wa rometšego. Gape o ka phumola tšeo se šetšego di rometšwe ge e ba go nhlokagala, ka go swaya lepokisana la kgauswi le yona ke moka o tobetsa; Phumola mafoko a kgethilwego; ka tlase. Ka kgopelo tloša mafoko ge feela go hlokagala, ka mohlala ge e ba o hlokometše ka morago ga tiragalo gore lefoko le šireleditšwe ke ditokelo tša molao.
This setting controls your leaderboard visibility. When hidden, your progress will be private. This means your image, user name and progress will not appear on the leaderboard. Note that leaderboard refresh takes ~{ $minutes }min to populate changes.
Peakanyo ye e laola ke ponagalo ya kelo ya baetapele. Ge e fihlilwe, tšwelopele ya gago e tla ba sephiring. Se se bolela gore seswantšho sa gago, leina le tšwelopele di ka se bonagale godimo ga kelo ya baetapele. Hlokomela gore mpshafatšo ya kelo ya baetapele itšea ~{ $minutes }metsotso go hlagiša diphetogo.
Three to go!
Go setše tše tharo!
Tigre
Se-Tigre
Tigrinya
Se-Tigrinya
Today
Lehono
Today's Common Voice progress on clips recorded
Tšwelopele ya lehono ya Common Voice ya di pego tše di gatišitšwego
Today's Common Voice progress on clips validated
Tšwelopele ya lehono ya Common Voice ya di pego di kgonthašitšwe
Today's Progress
Tšwelopele ya Lehono
Toki Pona
Se-Toki Pona
To make it into the Common Voice dataset, a voice clip must be validated by two separate users.
Go e dira gore e tsene go dathasete ya Common Voice, pego ya lentšu e swanetše go netefatšwa ke bašomiši ba babedi ba go fapana.
To make the Common Voice dataset as useful as possible we have decided to only allow source text that is available under a Creative Commons (CC0) license. Using the CC0 standard means its more difficult to find and collect source text, but allows anyone to use the resulting voice data without usage restrictions or authorization from Mozilla. Ultimately, we want to make the multi-language dataset as useful as possible to everyone, including researchers, universities, startups, governments, social purpose organizations, and hobbyists.
Go dira gore dathasete ya Common Voice e be molemo ka moo go ka kgonegago, re tšere sephetho sa go dumelela mothopo wa mogwalo feela go hwetšagala ka tlase ga laesense ya Creative Commons (CC0). Go šomiša maemo a CC0 go bolela gore go boima kudu go hwetša le go kgoboketša mothopo wa mogwalo, eupša go dumelela mang le mang go šomiša dintlha tša lentšu ka ntle ga dithibelo tša ditirišo goba tumelelo go tšwa go Mozilla. Mafelelong, re nyaka go dira gore dathasete ya maleme a mantši/polelo e ntši e be molemo ka moo go ka kgonegago go motho yo mongwe le yo mongwe, go akaretša banyakišiši, diyunibesithi, ba thomago, mebušo, mekgatlo ya merero ya setšhaba, le batho ba ratago go itloša bodutu.
Top Contributors
Baneedi ba Phagamego
Total
Palo-moka
Total
Feletšego
Total Approved
Palo-moka e Amogetšwego
{ $totalHours } hours is achievable in just over { $periodMonths } months if { $people } people record { $clipsPerDay } clips a day.
{ $totalHours } diiri di ya fihlelelwa ka godimo ga { $periodMonths } dikgwedi ge e ba { $people } batho o rekhota { $clipsPerDay } dipego ka letšatši.
{ $totalInReview } sentences in review.
{ $totalInReview } mafoko a hlahlobjago.
{ $totalSentences } total sentences.
{ $totalSentences } palo-moka ya mafoko.
To the right we outline the benefits and clarify what information we make public. Use the links below to get started with a Common Voice account on your own device.
Ka la letona re hlalosa mehola le go tlhaolla gore na ke tshedimošo efe e re phatlalatšago. Diriša kgokagano tša ka tlase go thoma ka akhaonte ya Common Voice go sedirišwa sa gago.
Toward next goal
Go yela go pakane e latelago!
Track progress here and on your stats page.
Latela tšwelopele mo le letlakaleng la gago la dipalo-palo.
Translate this page
Fetolela letlakala le la wepsaete
<translateVideoLink>Watch our guide on how to use Pontoon.</translateVideoLink>
<translateVideoLink>Lebela ditaetši go tseba go šomiša Pontoon.</translateVideoLink>
Translating the site
Go fetolela saete
Translation is from English, but you can see <strong>Suggestions</strong> in other languages. Click the <strong>Profile</strong> icon, then <strong>Settings</strong> link and add any languages you speak. At the bottom right-hand corner will be a list of translations called <strong>Locales</strong>. Translations show on the site after one day.
Phetolelo e tšwa go Seisemane, eupša o ka bona <strong>Ditšhišinyo</strong> ka dipolelong tše dingwe. Tobetsa <strong>aekhone ya Profaele</strong> aekhone, ke moka <strong>Selaelatirišo</strong> linke yaebe o tsenya dipolelo tšeo o di bolelago. Ka tlase ga sona ka letsogong le letona go tla ba le lenaneo la diphetolelo e bitšwago <strong>Locales</strong>. Diphetolelo di bontšha saeteng ka morago ga letšatši letee.
Translation of the Common Voice site happens on <strong>Pontoon</strong>.
Go fetola ga saete ya Common Voice e diragala go <strong>Pontoon</strong>.
Tshivenda
Se-Venda
Turkish
Se-Turkish
Turkmen
Se-Turkmen
Twi
Se-Twi
Typically megabytes
Megabytes e tlwaelegilego
Ubykh
Se-Ubykh
Udmurt
Se-Udmurt
Ukrainian
Se-Ukrainia
Understand contribution criteria
Kwešiša kriteria ya go tlaleletša
Understand what to look for when listening to voice clips and help make your voice recordings richer too!
Kwešiša gore ke eng se o swanetše go se lebelela ge o theeletša dipego gomme o thutše go dira direkhoto tša gago tša lentšu di hume!
Unexpected response returned from server
Karabo yeo bego e sa letelwa e bušetšwe go seba
Unfortunately we can’t have Wikimedia articles yet. So do not copy paste from there.
Ka bomadimabe re ka se kgone go ba le di athekele tša Wikimedia gabjale. O se ke kopiša wa ba wa mometše e tšwago moo.
{ $unreviewedSentencesByYou } sentences left for you to review.
{ $unreviewedSentencesByYou } mafoko a šetšego a gore o wa tlhahlobe.
Updating…
E ya mpshafatšwa...
Upload aborted. Do you want to delete your recordings?
Go tsenya ga faele go ganne. O nyaka go phumola di pego tša gago?
Upload an image file
Tsenya faele ya senepe netwekeng.
Urdu
Se-Urdu
User Name
Leina la Mošomiši
Users validate the accuracy of donated clips, checking that the speaker read the sentence correctly.
Bašomiši ba kgonthiša go nepagala ga dipego tše ba di neetšego, ba lebelela gore moboledi o bala mafoko gabotse.
Using Common Voice
Go Berekiša Common Voice
Uyghur
Se-Uyghur
Uzbek
Se-Uzbek
Validated Clips
Dipego tše di Kgonthišitšwego
Validated Hours
Diiri tše Kgonthišitšwego
Validated Hr. Total
Palo- moka ya diiri tše kgonthišetšwego
{ $validatedSentences } validated sentences.
{ $validatedSentences } mafoko a kgonthašitšwego.
Validating donated clips is equally important to the Common Voice mission. Take a listen and help us create quality open source voice data.
Go netefatša dikarolwana tše neelwego ke mokgwa o bohlokwa kudu mo go Common Voice. Theeletša gomme o re thuše go hlangwa datha ya mothopo o bulegilego wa lentšu.
Validation Progress
Kgonthišišo ya Tšwelopele
Validations
Dikgonthišišo
Value and recognition.
Boleng le temogo.
Variants are a specific form of a language - for example shared by those living in a geography or commmunity. Sometimes these are called dialects.
Mehuta e fapa-fapanego ke mohuta o itseng wa polelo – ka mohlala hlakanetšwego ke bao ba phelago tikologong goba setšhabeng. Ka dinako tše dingwe di bitšwa mebolelo.
Varying Pronunciations
Dipiletšo tša Mantšu tše Fapanego
Venetian
Se-Venetian
Version
Bešene
Version
Version
Vietnamese
Se-Vietnamese
View your progress against personal and project goals.
Lebelela tšwelopele ya gago o e lebantše le di pakane tša gago le tša projeke.
Visible
Bonagala
Voice clips are entered into a submission queue that readies them for listening.
Dipego tša lentšu di tsenywa ka gare ga lenaneotetelo le di lokišwago go theeletšwa.
Voice Contribution
Go Neela ka Lentšu
Voice Dataset, Ready for Download
Dathasete ya lentšu, e Loketše go Taneloutwa
Voice is natural, voice is human. That’s why we’re excited about creating usable voice technology
for our machines. But to create voice systems, developers need an extremely large amount of voice
data.
Lentšu ke la tlhago, lentšu ke motho. Ke ka baka leo re kgahlišago ke go hlangwa theknolotši ya lentšu e dirišwago bakeng sa metšhene ya rena. Eupša go hlangwa thulaganyo ya lentšu, bahlagiši ba hloka palo e kgolo ya datha ya lentšu.
Voice is natural, voice is human. That’s why we’re fascinated with creating usable voice
technology for our machines. But to create voice systems, an extremely large amount of voice
data is required.
Lentšu ke la tlhago, lentšu ke motho. Ke ka le baka leo re kgahlišago ke go hlangwa theknolotši ya lentšu e dirišwago bakeng sa metšhini ya rena. Eupša go hlangwa tshepidišo ya lentšu, go hlokagala palo e kgolo ya datha ya lentšu.
Voice recognition technology is revolutionizing the way we interact with machines, but the currently available systems are expensive and proprietary. Common Voice is part of Mozilla’s initiative to make voice recognition technologies better and more accessible for everyone. Common Voice is a massive global database of donated voices that lets anyone quickly and easily train voice-enabled apps in potentially every language.
Theknolotši ya go tlhatholla lentšu e fetola tselayeo re dirišanago le metšhene ka gona, eupša ditshepedišo tšeo di be go gona nakong ye di bitša tšhelete e ntši ebile ke tša batho. Common Voice ke karolo ya magato a boikitlaetšo ba Mozilla ba go dira gore mahlale a go tlhatholla lentšu a be kaone le gore a hwetšagale ke motho yo mongwe le yo mongwe. Common Voice ke dathabeise ya lefase ka bophara ya mantšu a neetšwego e dumela mang goba mang gore a kwetletše ka pejana le ga bonolo didirišwa tše nago le lentšu ka polelo ye nngwe le ye nngwe.
Voice recognition technology is revolutionizing the way we interact with machines, but the currently available systems are expensive and proprietary. Mozilla Common Voice is an initiative to make voice recognition technologies better and more accessible for everyone. Common Voice is a massive global database of donated voices that lets anyone quickly and easily train voice-enabled apps in potentially every language.
We're not only collecting voice samples in widely spoken languages but also in those with a smaller population of speakers. Publishing a diverse dataset of voices will empower developers, entrepreneurs, and communities to address this gap themselves.
Theknolotši ya go tlhatholla lentšu e fetola tselayeo re dirišanago le metšhene ka gona, eupša ditshepedišo tšeo di be go gona nakong ye di bitša tšhelete e ntši ebile ke tša batho. Common Voice ke karolo ya magato a boikitlaetšo ba Mozilla ba go dira gore mahlale a go tlhatholla lentšu a be kaone le gore a hwetšagale ke motho yo mongwe le yo mongwe. Common Voice ke dathabeise ya lefase ka bophara ya mantšu a neetšwego e dumela mang goba mang gore a kwetletše ka pejana le ga bonolo didirišwa tše nago le lentšu ka polelo ye nngwe le ye nngwe.
Voices Online Now
Mantšu a Inthaneteng ga Bjale
Voice Validation
Kgonthišišo ya Lentšu
Volume
Bolumo
Votic
Se-Votic
VoxForge was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines.
VoxForge e beilwe go kgobokanya polelo e fetotšwego gore e berikišwe ka Mahala le Mothopo o Bulegilego wa Di-entšene tša Tlhathollo ya Polelo.
Want to help make Common Voice even better?
Great! Get in touch via email or <discourseLink>Discourse</discourseLink>
forums, submit site issues via <githubLink>GitHub</githubLink>, or join the
<matrixLink>Matrix</matrixLink> community chat.
O nyaka go thuša go dira Common Voice gore e be kaone?
Seo se a kgahliša! Ikgokaganye ka e-meile goba <discourseLink>Discourse </discourseLink>diforamo, romela ditokollo tša saete go <githubLink>GitHub</githubLink>, goba ikgokaganye le <matrixLink>poledišano ya setšhaba ya Matrix<matrixLink>.
Want to stay in touch with Common Voice?
O nyaka go dula o boledišana le Common Voice?
Want updates when we release a new version of the Common Voice dataset? Subscribe to our newsletter.
O nyaka di mpshafatšo ge re lokolla di bešene tše mpsha tša dathasete ya Common Voice? Ngwadiša go lengwalo la rena la ditaba.
Watch our video explainer to help
Lebelela vidio ya rena e tlalosang go go thuša
We are building an open and publicly available dataset of voices that everyone can use to train speech-enabled applications.
Re aga tatelano ya dathasete ya lentšu e bulegilego le go phatlalatšwa gore batho ka moka ba ka e berikšago go tlwaetša ditirišo tša go dumela polelo.
We are going out to get a coffee.
Re ya tšwa go hwetša kofi.
We are going out to get coffee.
Re ilo go hwetša kofi.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
Rena ba Mozilla re aga setšhaba mabapi le theknolotši ya lentšu. Re lakatša go dula re ikgokagantše le diphetogo, methopo e mpsha ya datha le go kwa ka mo go oketšegilego ka mokgwa wa go diriša datha tšeo.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
Rena ba Mozilla re aga setšhaba mabapi le theknolotši ya lentšu. Re lakatša go dula re ikgokagantše le diphetogo, methopo e mpsha ya datha le go kwa ka mo go oketšegilego ka mokgwa wa go diriša datha tšeo.
We believe that large and publicly available voice datasets foster innovation and healthy commercial competition in machine-learning based speech technology. This is a global effort and we invite everyone to participate. Our aim is to help speech technology be more inclusive, reflecting the diversity of voices from around the world.
Re dumela gore dathasete tša lentšu tše dikgolo di tla hwetšagala phatlalatša di kgothaletša go tšweletša mekgwa e mefsa le tlhodišano ya kgwebo e matla go theknolotši ya go mantšu e thekgilwego ke motšhene. Se ke boiteko ba lefase ka moka gomme re mema motho yo mongwe le yo mongwe go tšea karoko. Boikemišetšo bja rena ke go thuša theknolotši le poledišano gore e tsentšhwe batho ka moka, e bonagatše mehuta e fapa-fapanego ya mantšu a tšwago lefaseng ka bophara.
We believe that large, publicly available voice datasets will foster innovation and healthy commercial competition in machine-learning based speech technology.
Common Voice’s multi-language dataset is already the largest publicly available voice dataset of its kind, but it’s not the only one.
Look to this page as a reference hub for other open source voice datasets and, as Common Voice continues to grow, a home for our release updates.
Re dumela gore dathasete tša lentšu tše dikgolo di tla hwetšagala phatlalatša di kgothaletša go tšweletša mekgwa e mefsa le tlhodišano ya kgwebo e matla go theknolotši ya mantšu e thekgilwego ke motšhene.
Dathasete ya maleme a mantši ya Common Voice e šetše e le e kgolo kudu go fetišiša e hwetšagalago phatlalatša ya mehuta ya yona, eupša ga se yona feela.
Lebelela letlakaleng e le dišupišo bakeng sa mothopo o bulegilego wa dathasete ya lentšu, gomme ge Common Voice e tšwela pela, legae la mphsafatšo tša rena tša tokololo.
Website Localization
Wepesaete ya Lefelong
We calculate hours by estimating the average length of each recording, and then multiplying that number by the total number of recordings across all languages.
Re bala diiri ka go akanyetša palogare botelele bja rekhoto ye nngwe le ye nngwe, ke moka re atiša palo yeo ka kakaretšo ya direkhoto malemeng ka moka.
We could not fetch the stats. Please try again later.
Ga se re kgone go lata dipalo-palo. Re kgopela o leke gape ka morago.
We couldn’t find that page for you
Ga se re hwetše letlakala leo
We couldn’t get any audio clips for you to listen to.
Please try again later.
Re paletšwe ke go hwetša dipego tša odio tše o ka di theeletšago. Re kgopela o leke gape ka morago.
We couldn’t get any sentences for you to speak.
Please try again later.
Ga se re kgone go hwetša mafoko a o ka wa bolelago. Re kgopela o leke gape ka morago.
Weekly
Beke le beke
Weekly Goal
Pakane ya Beke
We failed to fetch available languages. Contributions are currently not possible. Please try again later.
Re paletšwe ke go lata maleme a lego gona. Ga go kgonagale go neela gabjale. Re kgopela o leke gape mankgapela.
We launch the Common Voice site in this language.
Re thomile saete ya Common Voice ka polelo ye.
Welcome { $company } staff!
Rea go amogela { $company } bašomi!
Welcome to Common Voice
Rea go amogela go Common Voice
Welcome to the Common Voice Sentence Collector
Re ya go amogela go Molekgetho wa Mafoko ya Common Voice
Welsh
Se-Welsh
We may not have received that request. Please email us directly
on <emailLink>{ $email }</emailLink> and we will get back to you!
Re ka be re sa hwetša kgopelo yeo. Ka kgopelo re romele e-meile go lebanya le rena go <emailLink>{ $email }</emailLink> re tla boela go wena!
We prefer natural/conversational sentences. While phonetic diversity and different words in sentences is important, we are trying to make recording sentences as much fun as possible. Therefore it would be great if you could try to keep your sentences as natural/engaging as possible.
Re kgetha mafoko a tlhago/poledišano. Lege phapano ya sefonetiki le mantšu a fapanego mafokong a bohlokwa, re leka go dira gore mafoko a go gatišwa di be bose kudu ka moo re ka kgonago. Ka gona go ka ba monate ge o ka leka go boloka mafoko a gago a be a tlhago/kopanelago kamoo go ka kgonegalago.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Re tshepiša go swara tshedimošo ya gago ka šedi. Bala ka mo go oketšegilego <privacyLink>Tsebišo ya Sephiri</privacyLink>.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Re tshepiša go swara tshedimošo ya gago ka šedi. Bala ka mo go oketšegilego <privacyLink>Tsebišo ya Sephiri</privacyLink>.
We’re building an open source, multi-language dataset of voices that anyone can use to train speech-enabled applications.
Re aga dathasete e bulegilego, ya go ba le dipolelo tše dintši ya mantšu a ka šomišago ke mang le mang go tlwaetša tirišo ye dumelang polelo.
We’re crowdsourcing an open-source dataset of voices. Donate your voice, validate the accuracy of other people’s clips, make the dataset better for everyone.
Re nyakana le batho ba bantši ba tšwago mothopong o bulegilego wa mantšu. Neelana ka lentšu la gago, netefatša bonnete bja dikarolwana tša batho ba bangwe, dira gore dathasete e be kaone bakeng sa motho yo mongwe le yo mongwe.
We’re experiencing unexpected downtime
Re lebeletšane le nako e sa letelwago.
We’re going out to get coffee.
Re ilo go hwetša kofi.
We release the dataset every 3 months.
Re ntšha dathasete ka morago ga dikgwedi tše dingwe le tše dingwe tše 3.
We're receiving a lot of traffic and are currently investigating the issues.
Re hwetša pitlagano e ntši ebile re nyakišiša bothata bjo.
We’re sorry, your platform is not currently supported.
Re maswabi, sethala se sa gago ga se se thekgwe ga bjale.
We use a Mozilla tool called <pontoonLink>Pontoon</pontoonLink> for translations. Pontoon has lots of languages,
but if it doesn’t have yours you can <pontoonRequestLink>request for your language to be added</pontoonRequestLink>.
Then, to make the language available on the Common Voice project,
<githubRepoLink>request the new language on GitHub</githubRepoLink>. <localizationFragment>See more on site translation</localizationFragment>.
Re šomiša sedirišwa sa Mozilla se bitšwago <pontoonLink>Pontoon</pontoonLink> bakeng sa diphetolelo. Pontoon e na le dipolelo tše dintši, eupša ge e ba e se na ya gago o ka <pontoonRequestLink>kgopela gore polelo ya geno e tsenywe</pontoonRequestLink>.
Ke moka, o dire gore polelo e hwetšegale modirong wa Common Voice, <githubRepoLink>kgopela polelo e mpsha go GitHub</githubRepoLink>. <localizationFragment>Bona diphetolelo tše oketšegilego mo saete</localizationFragment>.
We've run out of clips to validate in this language...
Re feletšwe ke di pego tša go kgonthišiša ka polelo e...
We've run out of sentences to record in this language...
Re feletšwe ke mafoko a gatiša pego ka polelo e...
We want speech models to be better at understanding a diverse range of speakers. For this to happen, a voice dataset must represent lots of different people.
Re nyaka gore mehlala ya polelo e be kaone go kwešiša mehuta e fapa-fapanego ya diboledi. Gore se se direge, lentšu la dathasete le swanetše go emela batho ba bantši ba fapanego.
We want the Common Voice dataset to reflect the audio quality a speech-to-text engine will hear in the wild, so we’re looking for variety. In addition to a diverse community of speakers, a dataset with varying audio quality will teach the speech-to-text engine to handle various real-world situations, from background talking to car noise. As long as your voice clip is intelligible, it should be good enough for the dataset.
Re nyaka gore dathasete ya Common Voice e bonagatše seka sa theetšwago temogo ya sefetošetšamantšung dingwalwa e tla kwala nageng, kamoo re nyaka mehuta e fapa-fapanego. Go tlaleletša ntle le diboledi tše dintši tša go fapana, dathasete e nago le odio yeo e fapanego e tla ruta temogo ya sefetošetšamantšung dingwalwa go berekišana le maemo a fapanego a lefase, go tloga bokamorago go bolela le lešata ka koloi. Ge feela o dikarolwana tša lentšu la gago le sa kwagale, le tla swanela go ba se se loketšego dathasete.
We want the machine learning algorithms to able to handle a variety of background noise, and even relatively loud noises can be accepted provided that they don’t prevent you from hearing the entirety of the text. Quiet background music is OK; music loud enough to prevent you from hearing each and every word is not.
Re nyaka gore alkoritheme ya go ithuta ya motšhene e kgone go šoma le lešata le fapanego, ešita le mašata a magolo ao a ka amogelwago ge feela le sa go thibele go kwa mongwalo ka moka. Mmino wa bokamorago wa llela fase o LOKILE; mmino o wa godimo ka mo go lekanego go go thibela go kwa lentšu le lengwe le lengwe ga wa lokela.
We will also <strong>let our community manager know
that you’re interested</strong>, so they can give you any
extra support or information you might need.
Le rena re tla <strong>tsebiša moetapele wa badudi gore o na le kgahlego </strong>, gore ba go fe thekgo e oketšegilego goba tsebo e o ka e hlokago.
We will be in touch with more information about how to add your language to Common Voice very soon.
Re tla ikgokaganya le wena ka tsebišo e oketšegilego mabapi le gore o ka tsenya polelo ya gago go Common Voice kapejana.
We will be in touch with more information as it becomes available.
Re tla ikgokaganya le wena ka tsebo e nngwe ge e ba gona.
We will not make your email public.
Re ka se phatlalatše e-meile ya gago.
We will review your request to remove your voice recordings from the dataset. If your request is approved, we will contact those who have downloaded the dataset and request they remove your voice recordings as well.
Re tla hlahloba kgopelo ya gago ya go tloša direkhoto tša lentšu la gago dathaseteng. Ge e ba kgopelo ya gago e amogetšwe, re tla ikgokaganya le bao ba taonelouteng dathasete gomme re tla ba kgopela gore le bona ba phumole direkhoto tša lentšu la gago.
What does it mean that I can’t “determine the identity” of speakers in the Common Voice dataset?
Go ra go reng gore ga ke kgone go "Utolla boitsebišo" ka diboledi go dathasete ya Common Voice?
What is a language on Common Voice?
Ke eng polelo go Common Voice?
What is Common Voice?
Ke eng Common Voice?
What is Common Voice?
Common Voice ke eng?
What issues are you experiencing with this sentence?
Ke mathata a feng ao o lebelanego le wona ka lefoko le?
What kind of goal do you want to build?
O nyaka go aga pakane e bjang?
What level of audio quality is required for a voice clip to be used in the dataset?
Ke kgato efe ya khwalithi ya odio e nyakegago go pego ya lentšu gore e berekišwe go dathasete?
What’s inside the Common Voice dataset?
Ke eng e leng ka gare ga dathasete ya Common Voice?
What's Public?
Ke eng ya go Phatlalatšwa?
What’s the difference between Common Voice and Deep Speech?
Phapano ke efe magareng a Common Voice le Deep Speech?
When a user rejects a voice clip it returns to the Queue. If rejected a second time, the voice clip is moved to the Clip Graveyard.
Ge modiriši a gana mananeo a lentšu le a boela Mothalading. Ge e ba go ganwa lekgetho la bobedi, mananeo a lentšu e išwa Clip Graveyard.
When listening, check very carefully that what has been recorded is exactly what has been written; reject if there are even minor errors. <br />Very common mistakes include:
Ge o theeditše, hlahloba ka kelohloko gore na seo se gatišitšwego ke sona seo se ngwadilwego; gana ge e ba go na le diphošo tše nyenyane. <br />Diphošo tše di tlwaelegilego kudu di akaretša:
When will you release Common Voice data in other languages?
Le tlo lokolla neng datha ya Common Voice ka dipolelo tše dingwe?
When you request your recordings, we compile them into one or multiple ZIP files. Here are your past requests:
Ge o kgopela dipego, re ya di bokanya ka gare ga faele ya ZIP e tee goba tše dintši. Mo ke dikgopelo tša gago tša nakong e fetilego.
Where are these <wikipediaLink>public domain</wikipediaLink> sentences from?
A tšwa kae <wikipediaLink>mafelong a batho bohle</wikipediaLink> mafoko a?
Where does the source text come from?
Mothopo wa mangwalo a tšwa kae?
Which variant of { $language } do you speak?
Ke se fe se swanego le{ $lang } se o se bolelago?
Why ?
Go baneng?
Why a profile?
Go baneng profaele?
Why Common Voice?
Go baneng Common Voice?
Why does this matter?
Ke ka baka la'ng e le bohlokwa?
Why don’t you ask people to read from books or Wikipedia articles in different languages?
Go baneng o sa kgopele batho ba bale tše tšwago dipukung goba di-athekele tša Wikipedia tša maleme a fapanego.
Why do you need so many different speakers per language?
Ke ka baka la'ng o hloka diboledi tše dintši tše di fapanego bakeng sa polelo e nngwe le e nngwe?
Why is 10,000 validated hours the per language goal for capturing audio?
Go baneng diiri tše 10,000 tše kgonthišitšwe e le pakane ya polelo ya go thopa di odio?
Why is Common Voice part of the Mozilla mission?
Go baneng Common Voice e le karolo ya morero wa Mozilla?
Why is it important?
Ke ka lebaka la'ng e le bohlokwa?
Why is my language not included yet?
Go baneng polelo yaka e sa akaretšwa?
Why should I sign up for an account?
Ke le baka la'ng la go ngwadiša akhaonto?
Would you like to request your voice recordings be deleted too, or do you prefer to keep them in the Common Voice dataset?
Na o ka rata go kgopela direkhoto tša lentšu la gago di phumolwe, goba o kgetha go di boloka go dathaseteng ya Common Voice?
Xhosa
SeXhosa
Xitsonga
Setsonga
y
y
Y
Y
Yes
Ee
Yes
Ee
Yes, send me emails. I’d like to stay informed about the Common Voice Project.
Ee, romela di-emeile. Ke tla rata go boledišana le wena ka projeke ya Common Voice.
Yes, send me emails. I'd like to stay informed about the progress of this language on Common Voice.
Ee, nromele di-emeile. ke tla rata go boledišana le lena mabapi le tšwelopele ya polelo mo go Common Voice.
Yes, we especially want your voice! Part of the aim of Common Voice is to gather as many different accents as possible so that voice recognition services work equally well for everyone. This means donations from non-native speakers are particularly important.
Ee, ka go kgethega re nyaka lentšu la gago! Karolo ya boikemišetšo bja Common Voice ke go kgobokanetša mmelelo e mentši ya go fapa-fapana ka moo go ka kgonegago e le gore tlhathollo e amogetšwego ya lentšu e šoma gabotse go batho bohle. Seo se bolela gore meneelo e tšwago go diboledi tšeo di sego tša matswalo a moo a bohlokwa kudu.
Yiddish
Se-Yiddish
Yoruba
Se-Yoruba
You
Wena
You are about to initiate a download of <size>{ $size }GB</size>, proceed?
O thoma tanelouto ya<size>{ $size }GB</size>, Re tšwele pele?
You are prepared to initiate a download of <b>{ $size }</b>
O lokišetše go thoma ka tanelouto ya <b>{ $size }</b>
You can also copy and paste the direct URLs into your favorite download manager. They will expire in 12 hours, but you can come back to this page to generate new ones any time.
O ka kopisa le go beya di-URL tše tobegilego go sesepetša digogwa wa gago o ratago. Di tla felelwa ke nako ka morago ga diiri tše 12, eupša o ka boela latlakaleng le go hlagiša tše di mpsha ka nako le ge e le efe.
You can also use Keyboard Shortcuts: { sc-review-form-button-approve-shortcut } to Approve, { sc-review-form-button-reject-shortcut } to Reject, { sc-review-form-button-skip-shortcut } to Skip
O ka šomiša Dikgaoletše tša Khiipote: { sc-review-form-button-approve-shortcut } go Dumela { sc-review-form-button-reject-shortcut } go Gana { sc-review-form-button-skip-shortcut } go Tshela
You can choose to make your username public or anonymous.
O ka kgetha go phatlaletša leina la mošomiši goba ya go se tsebje.
You can go to the <datasetsPage>datasets page</datasetsPage>, select the version and language(s) you want, and download it! The files have <metadataLink>associated metadata</metadataLink>, such as demographic information and validation data. You’ll need to provide an email address to download the dataset.
O ka ya go <datasetsPage>letlakaleng la di-dathasete</datasetsPage>, kgetha mohuta le (di)polelo tšeo o di nyakago, gomme o di taneloute! Difaele di na le <metadataLink>methadatha a linki</metadataLink>, bjalo ka tsebišo ya palo ya batho le datha e netefatšwego. O tla hloka go nea ka aterese ya e-meile go tanelouta dathasete.
You can help build a diverse, open-source dataset by creating a Common Voice profile and contributing your voice.
O ka thuša go aga di dathasete tše di fapanego tša mothopo o bulegilego ka go hlangwa profaele ya Common Voice le go neela lentšu la gago.
You can meet others in the Mozilla language communities by joining <discourseLink>Discourse</discourseLink> for topical conversations, or <matrixLink>Matrix</matrixLink> for quick advice.
O ka kopana le ba bangwe ba setšhaba sa polelo ya Mozilla ka go ikopanya go <discourseLink>Discourse</discourseLink> go hwetša poledišano ya ditaba, goba <matrixLink>Matrikse</matrixLink> go hwetša keletšo ya kapejana.
You cannot request your recordings while another request is already in progress.
O ka se kgone go kgopela dikgatišo tša gago mola go nale tšeo di kgopetšego.
You can request a new takeout of your recordings every { $days } days.
O ka kgopela go ntšha ga dikgatišo tša gago tše mpsha ka matšatši a mangwe le a mangwe a { $days }.
You can run events to help people contribute. It’s easier than you think. You could do it online with a videoconferencing tool, or in person if it’s safe. <eventTemplate>Check out our templates and resources for running events</eventTemplate>.
O ka tsamaiša ditiragalo go thuša batho go neela. Go bonolo go feta kamoo o naganago ka gona. O ka e dira inthaneteng ka go diriša lenaneo la bidio inthaneteng, goba motho ka nama ge e ba go bolokegile. <eventTemplate>Lebelela dithempleite tša rena le ditlabakelo tša go tsamaiša ditiragalo</eventTemplate>.
You can <sentenceCollectorLinkAdd>add small numbers of sentences</sentenceCollectorLinkAdd>,
or you can do <bulkImportDocsLink>bulk imports using Github</bulkImportDocsLink>.
Remember that sentences need to be <licenseLink>CC0 (or public domain)</licenseLink>, or you can write your own.
O ka <sentenceCollectorLinkAdd>oketša dipalo tše nyenyane tša mafoko</sentenceCollectorLinkAdd>, goba o ka <bulkImportDocsLink>tsenya bontši bja o diriša Github</bulkImportDocsLink>. Gopola gore mafoko ao a hloka go ba <licenseLink>CC0 (goba mafelong a setšhaba ka moka)</licenseLink>, goba o ka ngwala a gago.
You can use the <scraperLink>Sentence extractor</scraperLink> to leverage short sentences from Wikipedia.
O ka šomiša <scraperLink>leswao la kgaoganyo ya Lefoko</scraperLink> go šomiša mafoko a fokotšegilego a go tšwa go Wikipedia.
You could include the website, e.g "Common Voice - https://commonvoice.mozilla.org/"
O ka akaretša wepesaete, e.g "Common Voice- https://commonvoice.mozilla.org/"
You could use academic referencing style, e.g Harvard style "Mozilla (2021) Common Voice. Available at https://commonvoice.mozilla.org/ (Accessed: 15th September 2021)"
O ka berikiša tshupišo ya thuto, e.g setaele sa Harvard "Mozilla (2021) Common Voice. E lego gona go https://commonvoice.mozilla.org/ (Accessed: 15th September 2021)"
You could use social media platforms to get the message out. Share posts that explain why it matters, and get in touch with other people talking about issues like language rights, voice AI, or bias in tech. See more advice on <campaignLink>running a social campaign, including content you can re-use</campaignLink>.
O ka šomiša lekgotla la poledišano ya inthaneteng go kgona go iša molaetša go batho ka moka. Abelana melaetša e hlalosago gore na ke baka la eng go le bohlokwa, gomme o ikgokaganye le batho ba bangwe bao ba bolelago ka mathata a go swana le ditokelo tša polelo, lentšu la AI, goba tshekamelo go theknolotši. Bona dikeletšo tše dingwe mabapi le<campaignLink> go tsamaiša lesolo la setšhaba, go akaretša le diteng tšeo o ka di šomišago gape</campaignLink>.
You have not added any languages yet.
Ga se o tsenye dipolelo tše dingwe.
You have not selected any languages. Please go to your <profileLink>Profile</profileLink> to select languages.
Ga se o kgethe maleme lege e le lefe. <profileLink>Profile</profileLink> go kgetha maleme.
You must allow microphone access.
O swanetše o dumelele go berikišwa ga maekrofone.
You previously skipped sentences while reviewing. Resetting skipped sentences will show all skipped sentences again. This is independent of the language.
Nakong e fetilego o tshetše mafoko ge o dutše o hlahloba. Go setalefsa mafoko a fetilwego go tla bontšha mafoko ka moka a fetilwego. Se se ikemetše ka polelo.
Your accent is the way you pronounce words. It can be shaped by where you have lived, which other languages you speak and lots of other factors. You can share any information you feel is relevant here.
Mmolelo wa gago ke tsela yeo o bitšago matšu ka gona. Seo se ka bopša ke moo o dutšego gona, dipolelo tše dingwe tšeo o di bolelago le dibopego tše dingwe. O ka abelana tsebišo le ge e le efe yeo o naganago gore e bohlokwa mo.
Your anonymous voice recordings will remain in the Common Voice dataset. Once you delete your profile you will no longer be able to submit a request to remove your recordings from the dataset
Dipego tša gago tša lentšu tšeo di sa tsebjego di tla šala go dathasete ya Common Voice. Ka morago ga go phumola profaele ya gago o ka sa kgona go romela kgopelo ya go phumola direkhoto tša gago go dathasete.
Your daily goal has been created
Pakane ya gago ya letšatši e bopilwe
Your download has started.
Tanelouto ya gago e thomile.
You’re contributing to a target segment
O tlaleletša go karolo ya nepo
You’re contributing to our first target segment
O tlaleletša go karolo ya nepo ya rena ya mathomo
You're currently set to <bold>NOT</bold> receive emails such as goal reminders, my
progress updates and newsletters about Common Voice
Gona bjale o <bold>GANA</bold> go hwetša lenaneo la poso la dikgopotšo tša dipakane, sebontšhatšwelopele le mangwalo a ditaba tša Common Voice
You're currently set to receive emails such as goal reminders, my
progress updates and newsletters about Common Voice
Gona bjale o loketše go hwetša di-emeile tša go swana le dikgopotšo tša dipakane, sebontšhatšwelopele le mangwalo a ditaba tša Common Voice.
Your email address
E-meile ya gago
Your files are being assembled. Please check again later.
Difaele tša gago di ya kopantšwa. Re kgopela o lebelele gape ka morago.
Your languages:
Maleme a Gago:
Your Languages
Maleme a Geno
Your username and email will not be associated with the published data.
Leina la gago la mošomiši le e-meile di ka se kopantšhwe le datha e re tlo e phatlalatša.
Your weekly goal has been created
Pakane ya gago ya beke e hlangwe
You should receive an email shortly explaining
the steps to get your language added to Common Voice
O tla hwetša e-meile ka moragonyana e hlalosago megato ya go tsenya polelo ya gago go Common Voice
You've helped Common Voice reach <goalPercentage></goalPercentage> of our daily { $goalValue } recording goal!
O thušitše Common Voice e fihle <goalPercentage></goalPercentage> ya letšatši le letšatši ya { $goalValue } pakane ya go gatiša.
You've helped Common Voice reach <goalPercentage></goalPercentage> of our daily { $goalValue } validation goal!
O thušitše Common Voice e fihle <goalPercentage></goalPercentage> ya letšatši le letšatši ya{ $goalValue } pakane ya go gatiša.
You've successfully signed up for contributing to { $language }. Thank you.
O ngwaditšitše ka katlego go neela ka { $language }. Rea leboga.
Zip #{ $offset } of { $total }
Zip #{ $offset } tša { $total }
Zulu
SeZulu