[1-5]
[1-5]
2019 End-of-Year Release
Version de darrèrs de 2019
2 down, keep it up!<playIcon></playIcon>
2 mens! Mantietz eth ritme! <playIcon></playIcon>
≥ 2 No votes
2 vòts negatius o mès
≥ 2 Yes votes
2 vòts positius o mès
Abbreviations and Acronyms. Abbreviations and acronyms like “USA” or “ICE” should be avoided in the source text because they may be read in a way that does not coincide with their spelling. Additionally, there may be multiple accurate readings for a single abbreviation. For example, the acronym “ICE” could be pronounced “I-C-E” or as a single word.
Abreviatures e acronims. S’an d’evitar es abreviatures e es acronims coma «USA» o «IEC» en tèxte, pr'amor que se pòden liéger d'ua manèra que non coïncidís damb era sua grafia. Ath delà, i pòt auer diuèrses lectures corrèctes entà ua sola abreviatura. Per exemple, er acrònim «UAB» se pòt prononciar "u-a-be", "uab" o coma «Universitat Autonòma de Barcelona».
Abkhaz
abkhaz
About
Qui èm
Accent
Accent
Account
Compde
<accuracyLink>See more on accuracy criteria.</accuracyLink>
<accuracyLink>Vedetz mès informacion sus es critèris de precision.</accuracyLink>
Acehnese
atjeh
{ $actionType }<playIcon></playIcon> did they accurately speak the sentence?
{ $actionType } <playIcon></playIcon> An prononciat era frasa corrèctaments?
{ $actionType }<recordIcon></recordIcon> then read the sentence aloud
{ $actionType }<recordIcon></recordIcon> e liegetz era frasa en votz nauta
{ $actionType }<stopIcon></stopIcon> when done
{ $actionType }<stopIcon></stopIcon> quan ajatz acabat
{ $actionType } submit when ready
{ $actionType } Envie quan ajatz acabat
Add
Ahig
Add a language you want to contribute to
Ahigetz era lengua a on voletz contribuïr
Add an avatar to your profile
Ajustar un avatar al perfil
Add as much information as you can about the language. This could include:
Ahigetz tanta informacion coma pogatz sus era lengua. Açò poderie includir:
Adding languages to work with
Ahíger lengües entà trabalhar-i
Additional Language
Lengua addicionau
Add Language
Ajustar una lenga
Add Language
Ahig ua lengua
<addLink>Add more sentences now!</addLink>
<addLink>Ahigetz mès frases ara!</addLink>
Add new custom accent "{ $inputValue }"
Ahig un nau accent personalizat «{ $inputValue }»
Add new sentences
Ahíger frases naues
Add Sentences
Ahig frases
Add <wikipediaLink>public domain</wikipediaLink> sentences
Ahigetz frases de <wikipediaLink>domeni public</wikipediaLink>
Add Your Voice
Ahigetz era vòsta votz
Adyghe
adigué
A few bytes
Uns pòqui bytes
Afrikaans
Afrikaans
Afrikaans
afrikaans
Age
Atge
Age
Edat
Albanian
Albanés
Albanian
albanés
All
Totes
All
Totas
All Languages
Totas las lengas
All sentences you submit must be under <wikipediaLink>Public Domain (CC-0) license</wikipediaLink>. To support the inclusion of work not under public licence, we have created a <cc0WaiverLink>Contributions Agreement template</cc0WaiverLink> for works where the copyright owner would like to contribute the material to Common Voice.
Totes es frases qu’enviatz an d’auer <wikipediaLink>licéncia de domeni public (CC-0)</wikipediaLink>. Entà perméter er includiment de trabalh que non a licéncia de domeni public, auem creat ua <cc0WaiverLink>plantilha d'acòrd de collaboracion</cc0WaiverLink> entàs trabalhs en qu'eth proprietari des drets d'autor volgue aportar eth materiau ath Common Voice.
All voice clips in the dataset are scrubbed of personally identifying information. When a contributor provides demographic data via their profile, that information is de-identified from their voice clips before being bundled for download in the dataset and is never made public on their profile page.
Era informacion d'identificacion personau se suprimís de toti es talhs de votz deth conjunt de donades. S'un collaborador balhe donades demografiques en sòn perfil, aguesta informacion s’anonimize enes talhs de votz abans d'includir-les en conjunt de donades disponibles entà baishar, e aguesta informacion non ei jamès publica ena sua pagina de perfil.
Amharic
amharic
Anonymized user data like age, gender, and accent helps improve the audio data used to train the accuracy of speech recognition engines. Your username and email will never be associated with your submitted data, and you can choose whether to make your username public or anonymous.
Es donades d'usatgèr anonim coma era edat, eth genre e er accent ajuden a milhorar es donades d’àudio utilizades entà entrenar era precision des motors de reconeishença de votz. Es donades qu’enviatz non seràn jamès associades damb cap de nòm d'usatgèr ne adreça electronica, e podetz escuélher se voletz qu'eth vòste nòm d'usatgèr sigue public o anonim.
Anonymized user data like age, sex, and accent helps improve the audio data used to train the accuracy of speech recognition engines. Your username and email will never be associated with your submitted data, and you can choose whether to make your username public or anonymous.
Las donadas dels utilizaires coma l’atge, lo sexe e l’accent ajudan a melhorar las donadas àudios utilizadas per entrenar e far venir pertinents los motors de reconeissença vocala. Vòstre nom d’utilizaire e corrièl seràn pas jamai associats amb las donadas enviadas, e podètz causir de far venir public o anonim vòstre nom d’utilizaire.
Approve
Apròva
A quiet background hubbub is OK, but we don’t want additional voices that may cause a machine algorithm to identify words that are not in the written text. If you can hear distinct words apart from those of the text, the clip should be rejected. Typically this happens where the TV has been left on, or where there is a conversation going on nearby.
Un tapatge de hons tranquil va plan, mès non volem que i age votzes addicionaus que poguen hèr qu'un algoritme informatic identifique paraules que non apareishen en tèxte escrit. Se podetz sénter paraules diferentes a part des deth tèxte, calerie refusar eth clip. Normauments, açò passe se s'a deishat eth televisor alugat o se i a ua convèrsa apròp.
Arabic
arab
Arabic
Arabi
Aragonese
Aragonés
Aragonese
aragonés
Are you coming? <strong>[called by another]</strong>
E vies? <strong>[cridat per quauquarrés d’aute]</strong>
Armenian
armèni
Armenian Western
armèni occidentau
Artificial intelligence
Intelligéncia artificiala
Ask about a new language
Demana d'ahíger ua lengua
Assamese
Assamés
Assamese
assamès
Asturian
asturian
Asturian
Asturian
Audio Format
Format d'àudio
Avatar
Avatar
Avatar uploaded
S'a pujat er avatar
Average
Normau
A voice clip is marked "valid" when a user gives it a Yes vote.
Un talh de votz se mèrque coma «valid» quan un usatgèr lo vòte damb Òc.
Avoid too much background noise - it should be easy to hear you.
Evitatz massa tapatge de hons: aurie d’èster facil d'entener-vos.
Awards
Distincions
Azerbaijani
Azèri
Azerbaijani
azèri
Back
Endarrèr
Background Noise
Tapatge de hons
Background Voices
Votzes de hons
Back to Top
Naut dera pagina
Back to Top
Naut de la pagina
Bambara
Bambara
Basaa
basaa
Bashkir
baixkir
Basque
basc
Basque
Basc
Be cautious before rejecting a clip on the ground that the reader has mispronounced a word, has put the stress in the wrong place, or has apparently ignored a question mark. There are a wide variety of pronunciations in use around the world, some of which you may not have heard in your local community. Please provide a margin of appreciation for those who may speak differently from you.
Tietz compde abans de refusar un clip pr'amor qu'eth lector a prononciat mau ua paraula. Er aranés a ua ampla varietat de pronóncies, e dilhèu no les auetz sentudes jamès. Non se pronóncie parièr en aranés centrau, de Pujòlo o septentrionau. Hetz hidança ad aqueri que parlen d’ua manèra diferenta dera vòsta.
Become a partner
Venir sòci
Belarusian
bielorús
Benefits
Avantatges
Benefits
Beneficis
Bengali
bengalí
Bengali
Bengali
[‘Beret’ is OK whether with stress on the first syllable (UK) or the second (US)]
[«pòrta» Se pòt prononciar de diferentes manères, segons era varianta dialectau]
<bold>{ $count }</bold> Clips
<bold>{ $count }</bold> mòstras
<bold>{ $count }</bold> Clips
<bold>{ $count }</bold> talhs
<bold>Help us</bold> find more voices
<bold>Ajudatz-mos</bold> a trobar mès votzes
<bold>Help us</bold> find more voices
<bold>Ajudatz-nos</bold> a trobar mai de voses
<bold>iOS</bold> users can download our free app:
Los utilizaires <bold>iOS</bold> pòdon telecargar nòstra aplicacion gratuita :
Bosnian
bosnian
Both
Ambdús
Both of these projects are part of our efforts to bridge the digital speech divide. Voice recognition technologies bring a human dimension to our devices, but developers need an enormous amount of voice data to build them. Currently, most of that data is expensive and proprietary.
We want to make voice data freely and publicly available, and make sure the data represents the diversity of real people. Together we can make voice recognition better for everyone.
Aquestes dos projèctes fan partida de nòstres esfòrces per reduire las inegalitats dins lo mond de la votz digitala. Las tecnologias de reconeissença vocala porgisson una dimension umana a nòstres aparelhs, mas los desvolopaires an besonh d'una quantitat importanta de donadas vocalas per las concebre. Actualament, la màger part d'aquelas donadas son costosas e proprietàrias.
Volèm que las donadas vocalas sián liuras e publicas, e garantir que las donadas representen la diversitat de las personas realas. Amassa, podèm melhorar la reconeissença vocala per totòm.
Both (Speak and Listen)
Ambdús (Parla e Escota)
Breton
Breton
Breton
breton
Browse Languages
Explòra es lengües
Build a custom goal
Crea un objectiu personalizat
Build Profile
Crear un perfil
Build Profile
Crea un perfil
Bulgarian
bulgar
Bulgarian
Bulgar
Burmese
birman
Buryat
Buriat
Buryat
buriat
<b>Why an email?</b> We may need to contact you in the future about changes to the dataset, an email provides us a point of contact.
<b>Perqué ua adreça electronica?</b> Ei ua forma de contacte en cas de que mos calguesse méter en contacte en un futur per cambis en conjunt de donades.
By editing your goal, you may lose your existing progress.
En editar er objectiu, podetz pèrder eth progrès actuau.
By opting in to receive emails you state that you are okay with Mozilla handling this info as explained in Mozilla’s <privacyLink>Privacy Policy</privacyLink>.
En tot optar per recéber corrèus electronics, acceptatz que Mozilla gestione aguesta informacion enes tèrmes descriti ena sua <privacyLink>politica de privadança</privacyLink> de Mozilla.
By opting in to receive emails you state that you are okay with Mozilla handling this info as explained in Mozilla’s <privacyLink>Privacy Policy</privacyLink>.
En acceptar de recebre los corrièls, indicatz acceptar que Mozilla gerisca aquestas informacions segon la <privacyLink>politica de confidencialitat</privacyLink> de Mozilla.
<b>You agree</b> to not attempt to determine the identity of speakers in the Common Voice dataset
<b>Accèpti</b> non sajar determinar era identitat des parlants deth conjunt de donades deth Common Voice
By providing some information about yourself, the audio data you submit to Common Voice will be more useful to Speech Recognition engines that use this data to improve their accuracy.
Se balhatz informacion vòsta, es donades d’àudio qu’enviatz tath Common Voice seràn mès utiles entàs motors de reconeishença dera votz que les empleguen entà milhorar era sua precision.
By using Common Voice, you agree to our <termsLink>Terms</termsLink> and <privacyLink>Privacy Notice</privacyLink>
En usar Common Voice, acceptatz es <termsLink>condicions d'usatge</termsLink> e eth <privacyLink>avertiment de privadança</privacyLink>
CANCEL
ANULLAR
Cancel Re-recording
Anulla eth reenregistrament
Cancel Submission
Anulla er enviament
Can't decide?
Non sabetz de qué hèr?
Cantonese
cantonés
Catalan
Catalan
Catalan
catalan
Central Kurdish
curd centrau
Change your email via Settings under Login Identity
Cambiatz era vòsta adreça electronica ena seccion Identitat d'inici de session des paramètres
Check if your language is listed on the <languagesPageLink>Common Voice Languages page</languagesPageLink>.
If not, fill out this form and we’ll <strong>send you a handy email explaining how to get your language set up</strong>.
Verificatz s'era vòsta lengua apareish listada ena <languagesPageLink>pagina de lengües deth Common Voice</languagesPageLink>.
Se non, aumplitz aguest formulari e vos <strong>enviaram un corrèu electronic en tot explicar com ahíger era vòsta lengua</strong>.
<chevron></chevron>See less
<chevron></chevron>Mens
<chevron></chevron>See more
<chevron></chevron>Mès
Chinese (China)
Mandarin (China)
Chinese (China)
mandarin (China)
Chinese (Hong Kong)
Mandarin (Hong Kong)
Chinese (Hong Kong)
mandarin (Hong Kong)
Chinese (Taiwan)
mandarin (Taiwan)
Chinese (Taiwan)
Mandarin (Taiwan)
Choose language/localization
Causitz ua lengua/localizacion
Chuvash
chuvaish
Chuvash
Chuvash
Click
Hec clic a
{ $clipCount } voice clips, total archive size { $size }. Expires { $expires }.
{ $clipCount } talhs de votz, mesura totau der archiu: { $size }. Caduque: { $expires }.
Clip Graveyard
Cementèri de talhs
Clips recorded
Talhs enregistradi
Clips Uploaded
Talhs pujadi
Clips validated
Talhs validadi
Clips You've Recorded
Talhs qu’auetz enregistrat
Clips You've Validated
Talhs qu’auetz validat
Close
Barra
Close
Tampar
Collecting sentences
Recuelh frases
Collecting sentences from the public domain, or writing new ones for the public domain.
Recuélher frases deth domeni public o escriuer-ne de naues entath domeni public.
Collect sentences
Recuelh frases
Comment
Comentari
Common Voice data plus all other voice datasets above.
Donades deth Common Voice mès toti es conjunts de donades de votz anteriors.
Common Voice Dataset
Conjunt de donades deth Common Voice
Common Voice is a collaborative project, and we're depending on our community of partners and contributors to build the largest open-source dataset of voices ever.
We would like to thank the following people and organizations for their help with the project:
Common Voice es un projècte collaboratiu e dependem de nòstra comunitat de sòcis e collaborators per crear lo jòc de donadas de votz liure mai grand pas jamai fach.
Volriam mercejar las personas e organizacions seguentas per lor ajuda sul projècte :
Common Voice is Mozilla's initiative to help teach machines how real people speak.
Eth projècte Common Voice ei iniciativa de Mozilla entà ajudar as maquines a apréner com parlam es persones de vertat.
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Eth projècte Common Voice hè part dera iniciativa de Mozilla entà ajudar as maquines a apréner com parlam es persones de vertat.
Common Voice is part of Mozilla's initiative to help teach machines how real people speak. In addition to the Common Voice dataset, we’re also building an open source speech recognition engine called Deep Speech.
Common Voice fa part de l’initiativa de Mozilla per ajudar a aprendre a las maquinas cossí lo monde parla. En mai del jòc de donadas de Common Voice, construissèm un motor de reconeissença vocal liure apelat Deep Speech.
Common Voice is the world’s largest publicly available, multi-language voice dataset.
Eth Common Voice ei eth conjunt de donades de votz multilingua mès gran disponible publicaments.
Common Voice recordings are used by academics, small businesses, and voice recognition enthusiasts to help train and grow publicly available resources like voice models.
Can you let us know why you would like your recordings deleted?
Academics, petites enterpreses e arderoses dera reconeishença deth parlar empleguen es enregistraments deth Common Voice entà ajudar a entrenar e a hèr a créisher es ressorces disponibles publicaments, coma modèls de votz.
Mos poderíetz hèr a saber per qué voletz suprimir es vòsti enregistraments?
Community participation and decision making.
Participacion comunitària e presa de decisions.
Community Playbook
Libre dera comunitat
Confirm Goal
Confirmi er objectiu
Confirm New Sentences
Confirma es frases naues
Connect with Gravatar
Connectatz-vos amb Gravatar
Connect with Gravatar
Connècta damb Gravatar
Contact
Contacte
Contact
Contactatz
Contact Form
Formulari de contacte
Content available under a <licenseLink>Creative Commons license</licenseLink>
Contengut disponible jos <licenseLink>licéncia Creative Commons</licenseLink>
Content available under a <licenseLink>Creative Commons license</licenseLink>
Contengut disponible jos <licenseLink>licéncia Creative Commons</licenseLink>
Continue
Seguís
Contribute
Collaboratz‑i
Contribute
Contribuir
Contribute sentences
Aportatz frases
Contribute to { $lang }
Contribuir en { $lang }
Contribute to { $lang }
Collabòra en { $lang }
Contribute Your Voice
Donatz vòstra votz
Contribute Your Voice
Collaboratz-i damb era vòsta votz
Contribution Activity
Activitat de collaboracion
Contribution Criteria
Critèris de collaboracion
Contribution Experience
Experiéncia dera collaboracion
Contribution Experience
Experiéncia de contribucion
Contributors record voice clips by reading from a bank of donated sentences.
Es collaboradors enregistren talhs de votz en tot liéger tèxtes que provien d'un banc de frases dades.
Cookies
Cookies
<coquiLink>Coqui</coquiLink> is dedicated to open speech technology. Their projects include deep learning based STT and TTS engines.
<coquiLink>Coqui</coquiLink> ei dedicat ara tecnologia de votz dubèrta. Es sòns projèctes includissen motors de reconeishença de votz e sintèsi de tèxte basadi en aprendissatge prigond.
Cornish
cornic
Cornish
Cornic
Corsican
còrs
Could not add language
Non s'a pogut ahíger era lengua
Could not change settings. Please try again.
Non s'a pogut cambiar era configuracion. Tornatz-ac a provar.
Could not remove language
Non s'a podut suprimir era lengua
Could not remove language: language not found
Non s'a podut suprimir era lengua: era lengua non s'a trobat
{ $count } clips
{ $count } talhs
{ $count }mo
{ $count } meses
{ $count }mo
{ $count } mesi
{ $countOfInvalidated } rejected by you
N'auetz refusat { $countOfInvalidated }
{ $countOfReviewed } sentences are already reviewed. Great job!
Ja s'an revisat { $countOfReviewed } frases. Bon trabalh!
{ $countOfSentences } sentences found.
S'an trobat { $countOfSentences } frases.
{ $countOfUnreviewed } sentences are unreviewed. If you want, you can also review your sentences now before submitting them.
{ $countOfUnreviewed } frases son pendentes de revisar. Se voletz, tanben podetz revisar es vòstes frases ara abans d'enviar-les.
{ $count }wk
{ $count } setm.
{ $count }y
{ $count } ans
Create a Custom Goal
Creatz un objectiu personalizat
Create a profile
Crear un perfil
Create a voice clip as part of your profile avatar. Others will be able to hover and listen to your clip.
Creatz una mòstra per vòstra avatar. Las autras personas poiràn escotar vòstra mòstra en passant lor cursor dessús.
{ $created }
{ $created }
Creating opportunities for a diversity of people to contribute to Common Voice ensures the dataset serves as many people as possible. We’ve created resources and templates that you can use!
Era creacion d'oportunitats pr’amor qu'ua diuersitat de persones contribuïsque a Common Voice garantís qu'eth conjunt de donades servisque a tantes persones coma sigue possible. Auem creat recorsi e plantilhes que podetz emplegar!
Criteria
Critèris
Croatian
croat
Czech
chèc
Czech
Chèc
Daily Goal
Objectiu diari
Danish
danés
Danish
Danés
Dashboard
Taulèr
Dashboard
Tablèu de bòrd
Dataset Release
Publicacion deth conjunt de donades
Datasets
Jòc de donadas
Datasets
Conjunts de donades
Date
Data
Days
Dies
Days
Jorns
De-identified
Anonimizacion
DELETE
SUPRIMIR
Delete my recordings
Suprimís es mèns enregistraments
Delete Profile
Suprimir lo perfil
Delete Profile
Suprimís eth perfil
Delete selected sentences
Suprimís es frases seleccionades
Deleting selected sentences…
Se suprimissen es frases seleccionades…
Dhivehi
dhivehi
Different language
Ua auta lengua
Difficult
Malaisit
Difficult to pronounce
De mau prononciar
Discard ongoing recording
Descarta er enregistrament en cors
Discourse
Discourse
Donate your voice
Donatz vòstra votz
Donate your voice
Datz era votz
Don't see your language on Common Voice yet?
Trobatz pas encara vòstra lenga sus Common Voice ?
Don't see your language on Common Voice yet?
Non trobatz era vòsta lengua en Common Voice?
Don’t see your language reflected in the Dataset? To request a language head over to our Languages page.
Non vedetz era vòsta lengua en conjunt de donades? Entà sollicitar ua lengua, anatz tara pagina de lengües.
Download
Baisha
Download Common Voice Data
Baisha es donades deth Common Voice
Download Common Voice Data
Telecargar las donadas Common Voice
Download Data
Baisha es donades
Download Data
Telecargar las donadas
Download Dataset Bundle
Baisha eth paquet deth conjunt de donades
Download { $language }
Baisha ({ $language })
Download Links
Ligams de baishada
Download My Data
Baisha es mies donades
Download My Data
Telecargar mas donadas
Download profile data
Baisha es donades deth perfil
Download the Dataset
Baisha es conjunts de donades
Download the Single Word Target Segment
Baishatz eth segment especific damb paraules individuaus
Do you have ideas on how we can make the Common Voice dataset better? Let us know on Discourse
Auetz idèes sus com milhorar eth conjunt de donades deth Common Voice? Hetz-mos-ac a saber en Discourse
Do you want to continue?
Voletz seguir?
Do you want to Speak, Listen or both?
Geniau! Voletz parlar, escotar o es dues causes?
Drag and drop or <browseWrap>Browse</browseWrap>
Arrossegueu e deishatz anar o hetz clic en <browseWrap>Navèga</browseWrap>
During contribution submission feedback will be skipped after clicking 'Submit'. Contribution will continue directly with the next set of 5 recordings or validations.
Er enviament de comentaris se desactivarà en hèr clic sus «Envia». Era collaboracion passarà dirèctaments ath següent grop de 5 enregistraments o validacions.
Dutch
Neerlandés
Dutch
neerlandés
Each entry in the dataset consists of a unique MP3 and corresponding text file. Many of the <b>{ $total }</b> recorded hours in the dataset also include demographic metadata like age, sex, and accent that can help train the accuracy of speech recognition engines.
The dataset currently consists of <b>{ $valid }</b> validated hours in <b>{ $languages }</b> languages, but we’re always adding more voices and languages. Take a look at our <languagesLink>Languages page</languagesLink> to request a language or start contributing.
Cada entrada deth conjunt de donades consistís en un unic fichèr MP3 e eth fichèr de tèxte corresponent. Fòrça des <b>{ $total }</b> ores enregistrades deth conjunt de donades tanben includissen metadades demografiques, coma era edat, eth sèxe e er accent, que pòden ajudar a melhorar era precision des motors de reconeishença de votz.
Actuauments, eth conjunt de donades includís <b>{ $valid }</b> ores validades en <b>{ $languages }</b> lengües, mès constantaments s'i van en tot ahíger mès votzes e lengües. Visitatz era <languagesLink>pagina Lengües</languagesLink> entà sollicitar ua lengua o entà començar a collaborar-i.
Easy
Facil
Edit
Modificar
Edit
Edita
Edit Profile
Modificar lo perfil
Edit Profile
Edita eth perfil
Email
Adreça electronica
<emailFragment>Sign up</emailFragment> to our mailing list to learn how you can take part in campaigns, events and co-design features on Common Voice.
<emailFragment>Registratz-vos</emailFragment> ena nòsta lista de corrèu entà saber com podetz participar en campanhes, eveniments e foncions de dessenh conjunt en Common Voice.
Email is already used for a different account
Aguesta adreça electronica ja s'emplegue entà un aute compde
Email is already used for a different account
Aquesta adreça electronica es ja utilizada per un autre compte
Email Subscriptions
Soscripcions per corrèu electronic
English
Anglés
English
anglés
Enter Email to Download
Introdusitz era adreça electronica entà baishar
Enter email to send a sign up link
Introdusitz ua adreça electronica a on enviar un ligam de registre
Enter your email
Introdusitz era vòsta adreça electronica
Error { $code }
Error { $code }
Error while fetching rejected sentences. Please try again.
S'a produsit un error en obtier es frases refusades. Tornatz-ac a sajar.
Error while fetching your sentences. Please try again.
S'a produsit un error en obtier es frases. Tornatz-ac a sajar.
Erzya
erzya
ESC
ESC
Esperanto
esperanto
Esperanto
Esperanto
Estonian
Estonian
Estonian
estonian
Events
Eveniments
Everyone
Toti/es
Exit & Delete clips
Ges e suprimís es talhs
Expert linguists can also help, try to reach out to local linguistic universities (both teachers and students) and see if they can help to gather a diverse set of sound-diverse sentences in your language.
Es lingüistes expèrts tanben vos pòden ajudar, sajatz de méter-vos en contacte damb es universitats locaus (damb ensenhaires e estudiantes) e verificatz se pòden ajudar a amassar un conjunt diuèrs d'oracions damb diferenti sons ena vòsta lengua.
Failed to delete selected sentences… Please try again!
Non s'an pogut suprimir es frases seleccionades… Tornatz-ac a provar!
FAQ
FAQ
FAQ
QF
Faroese
feroés
Faroese
Faroés
Female
Femna
Female
Hemna
Fields marked with an asterisk (*) are required.
* Indique un camp de besonh
Filtered sentences due to requirements failing (please submit fixed versions as new sentences):
Frases filtrades a causa de bèth error enes requeriments (enviatz es frases corregides coma frases naues):
Finding existing sentences in the Public Domain
Trobar frases de domeni public
Find others who care. That could be universities, language schools, advocacy groups or data science communities. Reach out and explain clearly how they can help and why. <outreachTemplates>See our template outreach emails</outreachTemplates>.
Trobatz d’autes persones a qui les impòrte. Poderien èster universitats, escòles d'idiòmes, grops de defensa o comunitats de sciéncia de donades. Metetz-vos en contacte e explicatz claraments com vos pòden ajudar e per qué. <outreachTemplates>Consultatz era plantilha de corrèus electronics de divulgacion</outreachTemplates>.
Finish editing first?
Voletz acabar d'editar?
Finish recording
Finaliza er enregistrament
Finish recording first?
Voletz acabar er enregistrament prumèr?
Finish Review
Acaba era revision
Finnish
Finés
Finnish
finés
First, check if your language <languageLink>already exists</languageLink>.
If it isn’t, you can <languageRequestLink>ask about adding your language</languageRequestLink>.
There are two stages. Translating the site, and collecting sentences
Prumèr, verificatz s'era vòsta lengua <languageLink>ja existís</languageLink>. Se non ei atau, vos cau <strong>tradusir eth lòc e arremassar-ne frases</strong>.
Foreign letters. Letters must be valid in the language being spoken. For example, “ж” is a letter in the Russian alphabet but is never used in English and so should never appear in any English source text.
Letres estrangères. Es letres an d’èster valides ena lengua que se parle. Per exemple, «ж» ei ua letra der alfabet cirillic, mès non s'emplegue en catalan e, donques, non aurie d’aparéisher jamès en cap de tèxte en catalan.
For every voice clip donated, and every audio clip validated, your account dashboards are updated to reflect your latest progress in each language you contribute to. Yes, you can contribute to more than one!<br/><br/> Use dashboards to track your stats, see how you're doing alongside others in the community, and set daily or weekly contribution goals.
Per cada talh de votz dada e per cada talh d’àudio validat, es taulèrs deth vòste compde s'actualizen entà rebàter eth vòste progrès mès recent en cada lengua a on collaboratz. Òc, podetz participar en mès d'ua! <br/><br/> Emplegatz taulèrs entà susvelhar es vòstes estadistiques, veir eth vòste rendiment en relacion damb d’auti membres dera comunitat e definir objectius de collaboracion diària o setmanau.
for example
Per exemple
For public domain text not available online, you could use an academic referencing style, e.g. Harvard style: "Jess (2021) My Public licence poems"
Entath tèxte de domeni public que non ei disponible en linha, podetz utilizar un estil de referéncia academica. P. ex., er estil de Harvard «Jess (2021) Es mèns poèmes en domeni public»
For these launched languages the website has been successfully <localizationGlossaryLink>localized</localizationGlossaryLink>, and has enough <sentenceCollectionGlossaryLink>sentences collected</sentenceCollectionGlossaryLink> to allow for ongoing <speakLink>Speak</speakLink> and <listenLink>Listen</listenLink> contributions.
Entad aguestes lengües en produccion, s'a <localizationGlossaryLink>tradusit</localizationGlossaryLink> entèrament eth lòc web e an pro <sentenceCollectionGlossaryLink>frases arrecuelhudes</sentenceCollectionGlossaryLink> entà acceptar contribucions de <speakLink>Parlar</speakLink> e <listenLink>Escota</listenLink>.
French
Francés
French
francés
Frequently Asked Questions
Questions mès frequentes
Frisian
frison
Frisian
Frison
Fulah
fulah
Galician
galhec
GB
GB
Gender
Genre
Georgian
Georgian
Georgian
georgian
German
alemand
German
Alemand
Get involved
Collaboratz-i
Get Involved
Collaboratz‑i
Get Involved
Participatz
Get started with goals
Introduccion as objectius
Get Started with Speech Recognition
Introduccion ara reconeishença de votz
<githubLink>NVIDIA NeMo</githubLink>™ is an <docsLink>open-source toolkit</docsLink> for researchers developing state-of-the-art conversational AI models.
<githubLink>NVIDIA NeMo</githubLink>™ ei un <docsLink>jòc d'estruments de còdi dubèrt</docsLink> entà recercaires que desvolòpen modèls de convèrsa de IA de darrèra generacion.
Give your avatar a voice
Donatz una votz a vòstre avatar
Glossary
Glossari
Goal reached
S'a artenhut er objectiu
Goals
Objectius
Goan Konkani
konkani
Go to Discourse
Vè tath Discourse
Go to Languages Page
Vè tara pagina de lengües
Go to { $name }
Vè’n tà { $name }
Go to { $name }
Veire { $name }
<governanceLink>Read more about how we're governed</governanceLink>
<governanceLink>Liegetz mès sus com mos governam</governanceLink>
Government proceedings, books and articles are also great; however, since the text tends to be a little more formal, they are less of a priority.
Es actes de govèrn, es libres e es articles tanben son excellenti, mès, com qu'eth tèxte acostume a èster un shinhau mès formau, son mens prioritaris.
Grammatical / spelling error
Error gramaticau / ortografic
Great! How many clips a week?
Geniau! Guairi talhs per setmana?
Great! How many clips per day?
Geniau! Guairi talhs per dia?
Great!<recordIcon></recordIcon> Record your next clip
Fòrça ben! <recordIcon></recordIcon> Enregistratz un aute talh
Great work!<playIcon></playIcon> Listen again when you're ready
Bon trabalh! <playIcon></playIcon> Escotatz un aute viatge quan sigatz prèst/a
Greek
Grèc
Greek
grèc
Guarani
guarani
Haitian
haitian
Hakha Chin
hakha chin
Hausa
haussa
Have Feedback?
Auetz bèth comentari?
Have questions about Common Voice? Ideas for improvements or feedback about a specific language? Join us on our <discourseLink>Discourse forum</discourseLink> and let us know.
Avètz de questions tocant Common Voice ? D’idèas de melhoraments o comentaris sus una lenga en particular ? Trapatz-nos sus nòstre <discourseLink>forum Discourse</discourseLink> e avisatz-nos.
Have you read our Terms?
Auetz liejut es condicions d'usatge?
Have you read our Terms?
Avètz legit las condicions d’utilizacion ?
Having an account is not required to contribute, though it is helpful.
Non cau auer cap de compde entà collaborar-i, mès ei util.
Having a profile is not required to contribute, though it is helpful; see why below.
Non ei de besonh crear un perfil entà collaborar, mès ei util.
Hebrew
ebrèu
Hebrew
Ebrieu
Help
Ajudatz
Help
Ajuda
Help Common Voice reach { $hours } hours in a language with a personal goal
Ajudatz ath Common Voice a arribar en { $hours } ores en ua lengua damb un objectiu personau
Help Common Voice reach { NUMBER($hours) } hours in a language with a personal goal
Ajudatz a arribar a { NUMBER($hours) } oras en una lenga amb un objectiu personal
Help create Common Voice’s first target segment in { $locale }
Ajudatz a arténher eth prumèr segment objectiu deth Common Voice en { $locale }
Help reach { $hours } hours in { $language } with a personal goal
Ajudatz a arribar en { $hours } ores en { $language } damb un objectiu personau
Help reach { NUMBER($hours) } hours in { $language } with a personal goal
Ajudatz a arribar a { NUMBER($hours) } oras en { $language } amb un objectiu personal
Help report bugs
Senhalar de problèmas
Help teach machines how real people speak, donate your voice at { $link }
Ajudatz las maquinas a aprendre a parlar coma de vertadièras personas en donar vòstra votz sus { $link }
Help teach machines how real people speak, donate your voice at { $link }
Ajudatz as maquines a apréner a parlar coma persones de vertat en tot dar era vòsta votz sus { $link }
Help us build a community around voice technology, stay in touch via email.
Ajudatz-mos a bastir ua comunitat ath torn des tecnologies de votz, mantietz-vos eth contacte per corrèu electronic.
Help us build a high quality, publicly open dataset
Ajudatz-mos a crear un conjunt de donades de nauta qualitat dubèrt e disponible publicaments
Help us by reviewing sentences for correctness according to the guidelines.
Ajudatz-mos en tot revisar era correccion des frases segon ues directritzes.
Help us by writing or collecting Public Domain sentences.
Ajudatz-mos en tot escríuer o en tot recuélher frases de domeni public.
Help us find more voices, share your goal
Ajudatz-mos a trobar mès votzes, compartitz eth vòste objectiu
Help us find others to donate their voice!
Ajudatz-mos a trobar mès persones entà dar era votz !
Help us find others to donate their voice!
Ajudatz-nos a trobar mai de personas per donar lor votz !
Help us get to { $goal }
Ajudatz-mos a arribar en { $goal }
Help us get to { $goal }
Ajudatz-nos a arribar a { $goal }
Help us validate sentences!
Ajudatz-mos a validar frases!
Help us validate sentences!
Ajudatz-nos a validar las frasas !
Help us validate voices
Nos ajudar a validar las voses
Help us validate voices
Ajudatz-mos a validar votzes
Here are some tips to find sentences:
Ací auetz bèri conselhs entà trobar frases:
Here are the links to download your ZIP files.
Ací auetz es ligams entà baishar es archius ZIP.
Hidden
Amagat
Hidden
Amagat
Hill Mari
hill mari
Hindi
indi
Hindi
Indi
His hand was rais-ed.
A escrit era data.
Home
Inici
Hours
Ores
Hours Recorded
Ores enregistrades
Hours Recorded
Oras enregistradas
Hours Validated
Ores validades
Hours Validated
Oras validadas
{ $hours } validated hours so far!
{ $hours } oras validadas ja !
{ $hours } validated hours so far!
Ja s’an validat { $hours } ores!
How ?
Com?
How are project decisions made?
Com se prenen es decisions deth projècte?
How can I get the Common Voice data?
Com pogui obtier es donades deth Common Voice?
How can we effectively grow a language on Common Voice?
Com podem hèr a créisher ua lengua de manèra efectiva en Common Voice?
How does Common Voice calculate hours?
Com calcule es ores eth Common Voice?
How does Common Voice work?
Com foncione Common Voice?
How does it work?
Cossí fonciona ?
How does site localization work?
Com foncione era localizacion deth lòc web?
How do I access and use the dataset?
Com pogui accedir e utilizar eth conjunt de donades?
How do I add a language?
Com pogui ahíger ua lengua?
How do I add sentences?
Com i ahigi frases?
How do I know whether to approve a voice clip?
Com sabi s'aprovar un talh de votz?
How do I record a high quality voice clip?
Com enregistri un talh de votz de nauta qualitat?
How do I stay in touch?
Com me pogui mantier eth contacte?
How do you ensure anonymity and privacy of the people who donated their voices?
Com asseguratz er anonimat e era privadança des persones qu’an dat era votz?
How to
Com se hè entà
How-to
Com se hè
How to Cite
Com citar
How would you describe your accent?
Com descriuríetz eth vòste accent?
Hungarian
Ongrés
Hungarian
ongarés
I agree
D'acòrd
I agree
Accepti
I am a non-native speaker and I speak with an accent, do you still want my voice?
Non sò parlant/a nadiu/va e parli damb accent. Voletz era mia votz tot parièr?
Icelandic
Islandés
Icelandic
islandés
I confirm that these sentences are <wikipediaLink>public domain</wikipediaLink> and I have permission to upload them.
Confirmi qu'aguestes frases son de <wikipediaLink>domeni public</wikipediaLink> e è autorizacion entà pujar-les.
I'd like to receive emails such as goal reminders, my progress updates and newsletters about Common Voice.
M'agradarie recéber corrèus electronics entà rebrembar objectius, actualizacions deth mèn progrès e bulletins de notícies sus Common Voice.
I do not agree
Non i sò d'acòrd
I do not agree
Accepti pas
If a clip is rejected by 2 people, it is released in a different subset of the dataset.
Se 2 persones refusen un talh, se publicarà en un subconjunt diferent deth conjunt de donades.
If the recording breaks up, or has crackles, reject unless the entirety of the text can still be heard.
S'er enregistrament se talhe, o hè esclats, refusatz-lo, levat que se pogue enténer era totalitat deth tèxte.
If the recording breaks up, or has crackles, reject unless the entirety of the text can still be heard.
S'er enregistrament se talhe, o hè esclats, refusatz-lo, levat que se pogue enténer era totalitat deth tèxte.
If the sentence does not meet the above criteria, click the "Reject" button on the left. If you are unsure about the sentence, you may also skip it and move on to the next one.
S'era frasa non complís es critèris anteriors, hetz clic en boton «Refusa». Se non n'ètz segur/a, tanben la podetz sautar e passar ara següenta.
If the sentence does not meet the above criteria, click the "Reject" button. If you are unsure about the sentence, you may also skip it and move on to the next one.
S'era frasa non complís es critèris anteriors, hetz clic en boton «Refusa». Se non n'ètz segur/a, tanben la podetz sautar e passar ara següenta.
If the sentence meets the criteria, click the "Approve" button.
S'era frasa complís es critèris, hetz clic en boton «Apròva».
If the sentence meets the criteria, click the "Approve" button on the right.
S'era frasa complís es critèris, hetz clic en boton «Apròva».
If you come across something that these guidelines don’t cover, please vote according to your best judgement. If you really can’t decide, use the skip button and go on to the next recording.
Se vos trobatz damb quauquarren qu'aguestes directritzes non corbissen, votatz segons eth vòste critèri. Se reaument non ac podetz decidir, emplegatz eth boton Omet e passatz tar enregistrament següent.
If you could <strong>hear them</strong> and <strong>understand them</strong>, it’s usually <strong>best to approve</strong>.
Se podetz <strong>escotar-les</strong> e <strong>comprener-les</strong>, normauments ei <strong>milhor aprovar-les</strong>.
If you don’t hear from us within a couple of days,
reach out at <emailLink>{ $email }</emailLink>
and we’ll get back to you!
Se non sabetz arren de nosati en un parelh de dies, contactatz via <emailLink>{ $email }</emailLink> e mos meteram en contacte damb vos!
If you’re looking for tools to build ASR models, you can connect to other people in the community <discourseLink>on Discourse</discourseLink>.
Se cercatz estruments entà crear modèls RAP, vos podetz connectar damb d’autes persones dera comunitat <discourseLink>en Discourse</discourseLink>.
If your language is not listed here, you can request it through <languageProcessLink>this process</languageProcessLink>.
S'era vòsta lengua non apareish ací, la podetz sollicitar mejançant <languageProcessLink>aguest procès</languageProcessLink>.
If you run out of sentences to review, please help us collect more sentences!
Se demoratz sense frases entà revisar, ajudatz-mos a recuelher-ne mès!
If you think the pronunciation makes it <strong>impossible to understand</strong>, or there’s a <strong>lot of background noise</strong>, or there are <strong>other people speaking too</strong>, then you should reject the clip. <accuracyLink>See more information in our accuracy criteria</accuracyLink>.
Se credetz qu'era prononciacion hè que sigue <strong>impossible de compréner</strong>, o i a <strong>fòrça tapatge de hons</strong> o i a <strong>d’autes persones que tanben parlen</strong>, alavetz vos calerie refusar eth talh. <accuracyLink>Vedetz mès informacion enes critèris de precision</accuracyLink>.
Igbo
igbo
I just created a personal goal for voice donation to #CommonVoice -- join me and help teach machines how real people speak { $link }
Vengui de crear un objectiu personau entà dar era votz ath #CommonVoice #CommonVoiceARA. Junhetz-vos-i e ajudatz-mos a ensenhar es maquines com parlen es persones de vertat { $link }
I’m afraid I don’t know what you’re looking for.
Impossible de trobar çò que cercatz.
I'm okay with you handling this info as you explain in Mozilla's <privacyLink>Privacy Policy</privacyLink>
Accèpti que gestionetz aguesta informacion enes tèrmes descrits ena <privacyLink>politica de privadança</privacyLink> de Mozilla
Includes email, username & demographic info, available right away
Includís era adreça electronica, eth nòm d'usatgèr e era informacion demografica, disponibles immediataments
Includes mp3s and related sentences, may take some time to prepare
Includís es mp3 e es frases associades, pòt trigar un shinhau a premanir-se
* Indicates required field
* indica un camp obligatòri
Indonesian
indonesian
Indonesian
Indonesian
Information about the language
Informacion sus era lengua
In Progress
En cors
Interested in learning more and contributing to the project?
Auetz interès a apréner mès e collaborar en projècte?
Interface Language
Lengua dera interfície
Interlingua
Interlingua
Interlingua
interlingua
Interlingue
interlingue
Irish
irlandés
Irish
Irlandés
Is my account information public?
Era informacion deth mèn compde ei publica?
<isoCodeLink>ISO Codes</isoCodeLink> if known
<isoCodeLink>Còdis ISO</isoCodeLink> se les sabetz
Is the clip valid?
Eth talh ei valid?
Is the goal of Common Voice to build a voice assistant?
Er objectiu deth Common Voice ei bastir un assistent de votz?
Italian
Italian
Italian
italian
It contains words or phrases that are hard to read or pronounce.
Conten paraules o frases de mau liéger o prononciar.
It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.
Es un fach ben conegut qu'un lector serà distrach pel contengut legible d'una pagina quand examinarà sa mesa en pagina.
It is written in a language different than what I’m speaking.
Ei escrita en ua lengua diferenta dera que parli.
It's important that you reference where you found the public licence to avoid plagiarism and to allow for follow ups on <copyrightIssuesLink>copyright issues</copyrightIssuesLink>. Here are a few ways you can cite the work.
Ei important que hescatz referéncia a on auetz trobat era licéncia publica entà evitar eth plagi e entà perméter eth seguiment de <copyrightIssuesLink>problèmes de drets d'autor</copyrightIssuesLink>. Ací auetz bères manères de citar era òbra.
Izhorian
ingrian
Japanese
japonés
Japanese
Japonés
Join the Common Voice mailing list
Junhetz-vos ara lista de corrèu de Common Voice
Just Unsure?
Auetz dobtes?
Kabardian
kabardian
Kabyle
Cabil
Kabyle
cabil
Kannada
kannada
Kaqchikel
Kaqchikel
Kaqchikel
kaqtxikel
Karakalpak
karakalpak
Kazakh
cazac
Kazakh
Cazac
{ $kb }kb max
Maxima { $kb } kB
Keep
Gardar
Keep
Conserva
Keep it up, record again <recordIcon></recordIcon>
Seguitz atau! Enregistratz-ne un aute <recordIcon></recordIcon>
Keep the recordings
Conserva es enregistraments
Keep track of your progress and metrics across multiple languages.
Hetz eth seguiment deth vòste progrès e des metriques des lengües a on collaboratz.
Keep track of your progress with a profile
Hetz eth seguiment deth vòste progrès damb un perfil
Keep track of your progress with a profile and help our voice data be more accurate.
Un perfil permet de hèr eth seguiment deth vòste progrès e ajude a hèr qu'es donades de votz siguen mès precises.
Khmer
khmer
K'iche'
quiché
Kikuyu
kikuiu
Kinyarwanda
kinyarwanda
Komi-Zyrian
komi-zyrian
Konkani (Devanagari)
konkani (devanagari)
Korean
Corean
Korean
corean
Kurdish
Curd
Kurmanji Kurdish
curd deth nòrd
Kyrgyz
quirguiz
Language
Lengua
Language
Lenga
Language
Lenga
Language
Lengua
Language
Lengua
Language Request
Sollicitud de lengua
Language Request
Demanda de lenga
Language request successfully submitted, thank you.
Era sollicitud de lengua s'a enviat corrèctaments, gràcies.
Languages
Lengües
Languages
Lengas
Languages can have their <validationRulesLink>own validation rules</validationRulesLink> with additional requirements. If there is no specific validation file for a language, we are using the generic English rules.
Cada lengua pòt auer es sues <validationRulesLink>nòrmes de validacion</validationRulesLink> damb requeriments addicionaus. Se non i a cap de fichèr de validacion especific entà ua lengua, s’empleguen es nòrmes generiques der anglés.
Lao
lao
Latvian
leton
Launched
En produccion
Leaderboard Visibility
Visibilitat ath classament
Learn how to take part
Aprenetz a participar-i
Learn More
Mès informacion
Leaving now means you’ll lose your changes
S'ac deishatz ara, se perderàn toti es cambis
Leaving now means you'll lose your progress
Se gessetz ara, perderatz eth vòste progrès
Let's Get Started
Començam
LibriSpeech is a corpus of approximately 1000 hours of 16 kHz read English speech derived from read audiobooks from the LibriVox project.
LibriSpeech ei un còrpus d'apuprètz 1.000 ores de dialògs en anglés liejudi a 16 kHz derivat dera lectura d’audiolibres deth projècte LibriVox.
License
Licéncia
License: <licenseLink>CC-0</licenseLink>
Licéncia : <licenseLink>CC-0</licenseLink>
License: <licenseLink>CC-0</licenseLink>
Licéncia : <licenseLink>CC-0</licenseLink>
License: <licenseLink>{ $license }</licenseLink>
Licéncia : <licenseLink>{ $license }</licenseLink>
License: <licenseLink>{ $license }</licenseLink>
Licéncia : <licenseLink>{ $license }</licenseLink>
Ligurian
ligur
Lingala
lingala
Link Copied
Ligam copiat
Link Copied
Ligam copiat
Links to websites that can help us understand the language
Ligams a lòcs web que mos pòden ajudar a compréner era lengua
Listen
Escotar
Listen
Escota
Listen
Escota
Listening
en tot escotar
Listen-Queue
Coa d'escota
Lithuanian
lituanian
Loading…
Cargament…
Loading…
Cargament…
Loading rejected sentences…
Se carguen es frases refusades…
Loading sentences…
Se carguen es frases…
Loading your sentences…
Se carguen es frases…
Localization
Localizacion
Localized
Traduccion
Login failed
Er inici de session a falhat
Login Identity
Identitat d'inici de session
Log in or sign up to get started
Iniciatz session o creatz un compde entà començar
Login / Signup
Inici de session / Registre
Log In / Sign Up
Inici de session / Registre
Log In / Sign Up with { $company } email
Inici de session / Registre damb era adreça electronica de { $company }
Logout
Barra era session
Logout
Desconnexion
Log Out
Desconnexion
Looks like there aren't any clips to listen to in this language. Help us fill the queue by recording some now.
Semble que non i a cap de talh de votz entà revisar en aguesta lengua. Ajudatz-mos a ramplir era coa en tot enregistrar-ne bèri uns ara.
Luganda
luganda
Luxembourgish
luxemborgués
Macedonian
Macedonian
Macedonian
macedonian
Maithili
maithili
Make sure the platform is recording before you start speaking, and that it only stops once you’re finished.
Asseguratz-vos qu'era plataforma enregistre abans de començar a parlar e que sonque s'arture quan ajatz acabat.
Make sure the sentence meets the following criteria:
Asseguratz-vos qu'era frasa complís es critèris següents:
Make sure the sentence meets the following criteria:
Asseguratz-vos de qu'era frasa complís es critèris següents:
Make your submitted data as rich as possible by providing some anonymous demographic data. We de-identify all demographic data before making it public.
Enriquitz es donades qu’enviatz en tot ahiger-i informacion demografica anonima. Aguesta informacion s’anonimize abans de publicar-se.
Malagasy
malgach
Malay
malés
Malayalam
malaialam
Male
Òme
Male
Òme
Maltese
maltés
Manage Email Subscriptions
Gerís es soscripcions per corrèu electronic
Manage Subscriptions
Gestiona es soscripcions
Manage Subscriptions
Gerir los abonaments
Mapudungun
mapudungun
Marathi
marathi
Maybe our <homepageLink>homepage</homepageLink> will help? To ask a question, please join the <matrixLink>Matrix community chat</matrixLink>, monitor site issues via <githubLink>GitHub</githubLink> or visit <discourseLink>our Discourse forums</discourseLink>.
Dilhèu vos pòt ajudar era nòsta <homepageLink>pagina d'inici</homepageLink>? S'auetz bèra qüestion, junhetz-vos ath <matrixLink>chat dera comunitat a Matrix</matrixLink>, supervisatz es problèmes deth lòc mejançant eth <githubLink>GitHub</githubLink> o visitatz es nòstes <discourseLink>fòrums deth Discourse</discourseLink>.
MB
MB
Meadow Mari
meadow mari
Meetei Lon
meitei
Message
Messatges
Message
Messatge
[Mismatched content]
[Eth contengut non coïncidís]
Misreadings
Lectures erronèes
Missing an <strong>'s'</strong> at the end of a word.
Cambiar bèra forma verbau. Per exemple, hèr «cantèssem» en lòc de «cantèssen».
Missing <strong>'A'</strong> or <strong>'The'</strong> at the beginning of the recording.
Cambiar ua paraula per ua auta de semblabla. Per exemple, <strong>«a»</strong>, <strong>«en»</strong> o <strong>«damb»</strong>.
Missing the end of the last word by cutting off the recording too quickly.
Ométer era fin dera darrèra paraula, en talhar er enregistrament massa lèu.
Mixed
Mixta
Moksha
moksha
Mongolian
mongòl
Mongolian
Mongòl
More
Mès
[More has been recorded than the required text]
[S'an enregistrat mès paraules des indicades en tèxte]
Mossi
mossi
Most of the data used by large companies isn’t available to the majority of people. We think that
stifles innovation. So we’ve launched Common Voice, a project to help make voice recognition open
and accessible to everyone.
Era majoria des donades usades per granes companhies non son disponibles entà toti. Consideram qu'açò frene era innovacion. Per açò, auem iniciat eth projècte Common Voice, entà hèr qu'era reconeishença dera votz sigue dubèrta e accessible a toti e totes.
Most of the data used by large companies isn’t available to the majority of people. We think
that stifles innovation. So we’ve launched Project Common Voice, a project to help make voice
recognition open to everyone.
Era majoria des donades usades per granes companhies non son disponibles entà toti. Consideram qu'açò frene era innovacion. Per aquerò, auem iniciat eth projècte Common Voice, entà hèr qu'era tecnologia de reconeishença dera votz sigue dubèrta a toti e totes.
Most recordings are of people talking in their natural voice. You can accept the occasional non-standard recording that is shouted, whispered, or obviously delivered in a ‘dramatic’ voice. Please reject sung recordings and those using a computer-synthesized voice.
Era majoria des enregistraments son de gent que parle damb naturalitat. Podetz acceptar era gravadura ocasionau non estandard a on se cride, shevitege o, evidentaments, se hèsque damb votz dramatitzada. Refusatz es enregistraments cantadi, a on s'use ua votz sintetisada per ordenador o es que non agen ua entonacion corrècta dera frasa, mès que mès es interrogatives.
Most speech databases are trained with an overrepresentation of certain demographics which results in a bias towards <articleLink>male and middle class</articleLink>. Accents and dialects that tend to be under-represented in training data sets are typically associated with groups of people who are already marginalised. Many machines also struggle to understand female voices.
This is why in our voice database we want variety!
Era majoritat de bases de donades de votz s'entrenen damb ua sobrerepresentacion de bèra caracteristica demografica, açò genère resultats damb desviacion de <articleLink>genre masculin e classa mejana</articleLink>. Es accents e dialèctes qu’abituauments son infrarepresentadi enes conjunts de donades d'entrenament s'assòcien tipicament damb persones que ja son marginalizades. Fòrça maquines tanben an dificultat entà compréner votzes femenines.
Ei per açò qu'ena nòsta basa de donades de votz i volem varietat!
Mozilla Common Voice is an initiative to help teach machines how real people speak.
Mozilla Common Voice ei ua iniciativa entà ajudar a ensenhar as maquines com parlen es persones de vertat.
Mozilla Common Voice is made possible by a diverse community of activists, linguists, data scientists, academics and software engineers from all over the world. The project is stewarded by the Mozilla Foundation.
Mozilla Common Voice ei possible mercés a ua comunitat diuèrsa d'activistes, lingüistes, scientifics de donades, academics e enginhèrs de logiciau de tot eth mon. Eth projècte ei gestionat pera Fondacion Mozilla.
Mozilla doesn’t pick or favor any one language over another. Instead, Common Voice is a purely community-driven initiative, but it takes <multilangLink>several steps to add a new language</multilangLink> and begin collecting voice donations. First, the Common Voice website needs to be translated so community members can access the contributor experience in their own language. Next, we need a large collection of copyright-free sentences for people to read out loud. Once both of those requirements are satisfied, a language is “launched” on Common Voice for people to start recording their voice and validating others' donations. If you want to help launch a new language, head over to our <sentenceCollectorLink>sentence collection tool</sentenceCollectorLink> to get started.
Mozilla causís pas o favorís cap de lenga. A la plaça, Common Voice es una iniciativa menada per la comunautat, mas demanda <multilangLink>mantunas etapas per apondre una lenga novèla</multilangLink> e començar a collectar de dons de voses. D’en primièr, cal traduire lo site Common Voice per que los membres de la comunautat pòscan accedir a l’interfàcia pels contributors dins lor lenga. Aprèp, nos cal una brava colleccion de frasas liuras de drech a far prononciar al monde. Un còp aquestas doas causas realizadas una lenga es « lançada » sus Common Voice per que lo monde comencen d’enregistrar lor votz e validar çò dels autres. Se volètz ajudar a lançar una lenga novèla, anatz a <sentenceCollectorLink>l’aisina de recuèlh de frasas</sentenceCollectorLink> per començar.
Mozilla is dedicated to keeping the web open and accessible for everyone. To do that we need to empower web creators through projects like Common Voice. As voice technologies proliferate beyond niche applications, we believe they must serve all users equally. That means investing in more languages and accommodating diverse accents and demographics when building and testing voice technologies. Common Voice is a public resource available to everyone and Mozilla teams and developers around the world are already using it on our own projects as well.
Era mission de Mozilla ei mantier eth Web dubèrt e accessible a toti. Entà hè'c, auem de besonh aufrir estruments as creadors de contengut web mejançant projèctes coma eth Common Voice. Es tecnologies de votz prolifèren ath delà d'aplicacions fòrça especializades, e credem qu’an de servir a toti es usatgèrs en egalitat de condicions. Açò signifique includir mès lengües, mès diuersitat d'accents e de donades demografiques ara ora de crear e provar es tecnologies de votz. Eth Common Voice ei un recors public, disponible entà toti, e es equipes de Mozilla e es desvolupadors de pertot eth mon ja l’utilizen enes sòns pròpris projèctes.
Mozilla’s open source voice recognition engine Deep Speech can be used to build speech recognition applications. Read our <githubLink>GitHub overview</githubLink> or join the <discourseLink>DeepSpeech Discourse</discourseLink> to learn how to get started.
Eth motor de reconeishença de votz de còdi liure Deep Speech de Mozilla se pòt usar entà crear aplicacions de reconeishença de votz. Liegetz-ne era <githubLink>descripcion generau en GitHub</githubLink> o visitatz eth <discourseLink>fòrum DeepSpeech en Discourse</discourseLink> entà obtier informacion sus com començar.
Mutual accountability.
Responsabilitat mutuau.
Mycroft AI
Mycroft AI
Mycroft is the world’s first open source assistant.
Mycroft runs anywhere - on a desktop computer, inside an automobile, or on a Raspberry Pi.
Mycroft es lo primièr assistent liure del monde.
Mycroft fonciona pertot : sus un ordenador, dins una veitura o sus un Raspberry Pi.
My Sentences
Es mies frases
n
n
N
N
N/A
Non disponible
Name
Nòm
Name
Nom
Names of your language
Nòms dera vòsta lengua
Native Language
Lengua mairau
Native Language
Lenga mairala
Native Language
Lengua mairau
Need some help with accent?
Vos cau ajuda sus es accents?
Need some help with variants?
Vos cau ajuda damb es variantes?
Need to download your data?
Voletz baishar es vòstes donades?
Nepali
nepalés
Nepali
Nepali
New Language Launch
Lançament d'ua naua lengua
Next
Següent
Next Goals: { $goal }
Següent objectiu: { $goal }
Next Goals: { $goal }
Objectiu seguent : { $goal }
Nias
nias
No
Non
No
Non
No
Non
No
Non
[No ‘a’ in the original text]
[Eth tèxte originau indique «en»]
No gravatar found for your email
Cap de gravatar pas trobat per vòstra adreça electronica
No gravatar found for your email
Non s'a trobat cap de Gravatar entara vòsta adreça electronica
No microphone found.
Cap de microfòn pas trobat.
No microphone found.
Non s'a trobat cap de micròfon.
No rejected sentences found!
Non s'a trobat cap de frasa refusada!
Norwegian Bokmål
bokmål
Norwegian Nynorsk
Norvegian Nynorsk
Norwegian Nynorsk
noruec nynorsk
No sentences found!
Non s'a trobat cap de frasa!
No sentences to review. <addLink>Add more sentences now!</addLink>
Non i a cap de frasa entà revisar. <addLink>Ahigetz mès frases ara!</addLink>
Note: When set to 'Visible', this setting can be changed from the <profileLink>Profile page</profileLink>
Nòta: s'ei definit coma «Visible», açò se pòt cambiar ena <profileLink>pagina deth perfil</profileLink>
Note: You will still need to select between Speak or Listen to change contribution type.
Nòta: Tot parièr, auratz de seleccionar entre Parla o Escota entà cambiar eth tipe de collaboracion.
Not found
Pas trobat
Not found
Pas trobat
No Thanks
Non, gràcies
No variant selected
Non i a cap de varianta seleccionada
Now that you know a little bit more about Common Voice, why not try it out? Click on the microphone icon to start reading sentences aloud. <br/><br/>If you prefer to review other people's voice contributions, click on the play icon. You’ll help confirm that recordings match the sentences written on screen.
Ara que ja sabetz un shinhau mès sus eth Common Voice, perqué non ac sajatz? Hetz clic ena icòna deth microfòn entà començar a liéger frases en votz nauta. <br/><br/>Se preferitz revisar es contribucions de votz de d’autes persones, hetz clic ena icòna de reproduccion. D'aguesta manèra ajudaratz a confirmar s'es enregistraments coïncidissen damb es frases escrites ena pantalha.
Now you can donate your voice to help us build an open-source voice database that anyone can use
to make innovative apps for devices and the web. Read a sentence to help machines learn how real people speak. Check the work of other
contributors to improve the quality. It’s that simple!
Ara podetz dar era vòsta votz entà ajudar-mos a bastir ua basa de donades de votz damb licéncia dubèrta que quinsevolhe pòt usar entà desvolopar aplicacions innovadores entàs aparelhs e eth web. Liegetz ua frasa entà ajudar as maquines a apréner com parlam es persones de vertat. Revisatz eth trabalh de d’auti collaboradors entà milhorar era qualitat. Atau de simple!
{ NUMBER($count) } clips
{ NUMBER($count) } mòstras
Number of Voices
Nombre de votzes
Numbers. There should be no digits in the source text because they can cause problems when read aloud. The way a number is read depends on context and might introduce confusion in the dataset. For example, the number “2409” could be accurately read as both “twenty-four zero nine” and “two thousand four hundred nine”.
Chifres. Non i aurie d’auer cap de digit en tèxte, pr'amor que pòden causar problèmes quan se liegen en votz nauta. Era manèra en qué se lieg un nombre depen deth contèxte e pòt introdusir confusion en conjunt de donades. Per exemple, eth nombre «2409» se poderie liéger corrèctaments coma «vint-e-quate zèro nau» e coma «dus mil quate cents nau», segons eth contèxte.
{ NUMBER($totalHours) } hours is achievable in just over { NUMBER($periodMonths) }
months if { NUMBER($people) } people record { NUMBER($clipsPerDay) } clips a day.
Se { NUMBER($people) } personas enregistran { NUMBER($clipsPerDay) } mòstras per jorn, se pòt arribar a { NUMBER($totalHours) } oras en un pauc mai de { NUMBER($periodMonths) } meses.
Occitan
occitan
Occitan
Occitan
Odia
odia
Odia
Odia
Off
Desactivat
Offensive language
Lenguatge ofensiu
Offensive speech
Àudio ofensiu
On
Activat
Once logged in you can select your languages from the profile section.
Un còp auetz iniciat session, podetz escuélher es lengües ena seccion deth perfil.
On desktop devices you can contribute by downloading…
En ordenadors d’escritòri, podetz baishar eth darrèr:
One sentence per line
Ua frasa per linha
On his head he wore a beret.
Eri mingen peres.
On iOS please continue with Safari to enable recording…
En iOS, seguitz damb eth Safari entà activar er enregistrament…
On the other hand, if you think that the reader has probably never come across the word before, and is simply making an incorrect guess at the pronunciation, please reject. If you are unsure, use the skip button.
De ua auta part, se credetz que probablaments eth/era lector/a non s'a trobat jamès damb era paraula e simplaments hè ua supausicion incorrècta dera pronóncia, refusatz eth talh. Se non n'ètz segur/a, utilizatz eth boton Omet.
Optionally join our email list for updates and new information about the project.
Opcionaument, junhetz-vos ara nòsta lista de corrèu entà recéber actualizacions e nauetats sus eth projècte.
Optionally submitted demographic data (e.g. age, gender, language, and accent) will never be made public on your profile, and will not be linked to your account in the dataset. Individual audio clips will be associated with demographic data for the purpose of more accurate analysis - for example, a researcher might want to target a training model to a specific demographic segment.
Es donades demografiques enviades opcionaument (per exemple, edat, genre, lengua e accent) non se haràn jamès publiques en vòste perfil e non s’enlaçaràn damb eth vòste compde en conjunt de donades. De forma individuau, es talhs de votz s’associaràn as donades demografiques damb era finalitat d’analisis mès precises. Per exemple, un recercaire pòt voler orientar un modèl d’entrenament a un segment demografic especific.
Optionally submitted demographic data (e.g. age, sex, language, and accent) is de-identified from your submitted voice data and will never be made public on your profile.
Las informacions demograficas opcionalas (per exemple l'atge, lo sèxe, la lenga, e l'accent) son dissociadas de vòstres enregistraments àudio e seràn pas jamai afichadas publicament sul vòstre perfil.
Optionally submitted demographic data (e.g. age, sex, language, and accent) will never be made public on your profile, and will not be linked to your account in the dataset. Individual audio clips will be associated with demographic data for the purpose of more accurate analysis - for example, a researcher might want to target a training model to a specific demographic segment.
Las donadas demograficas eventualament somesas (per exemple, l’atge, lo sèxe, la lenga e l’accent) seràn pas jamai faches publiques sus vòstre compte e seràn pas ligats a vòstre compte dins l’ensemble de las donadas. De mòstras àudio individualas seràn ligadas a de donadas demograficas per de fins d’analisis mai precisas : per exemple, un cercaire o una cercaira poiriá voler trabalhar sus un modèl d’entraïnament de segment demografic precís.
or Delete voice clip
o Suprimir vòstra mòstra de votz
Other
Autre
Other
Un aute
Other Language
Ua auta lengua
Other people validate those voice clips.
D’autes persones validen es talhs de votz enregistradi.
Other voice datasets…
D’auti conjunts de donades de votz…
Other Voice Datasets
D’auti conjunts de donades de votz
Our community manager will reach out to you to offer
extra support, and to let you know if there is
already a community working on your language
Eth nòste gestor de comunitats se meterà en contacte damb vos entà aufrir-vos supòrt addicionau e entà hèr-vos a saber se ja i a ua comunitat que trabalhe ena vòsta lengua
Our governance is founded on the pillars of:
Era nòsta governança se base enes pilars de:
Our source text is made up of original contributor donations as well as dialogue from public domain movie scripts like <italic>It’s a Wonderful Life</italic>.
You can view our source sentences in this <githubLink>GitHub folder</githubLink>.
Es tèxtes provien de contribucions originaus, tanben d'òbres literàries de domeni public, arrepervèris e fraseologia comuna. Podetz veir es frases en aguest <githubLink>dossièr deth GitHub</githubLink>.
Overall Accuracy
Precision globau
Overall Hr. Total
Ores totaus
Overall project status: see how far we’ve come!
Estat generau deth projècte: vedetz enquia on auem arribat!
p
r
Papiamento (Aruba)
papiament (Aruba)
Partners
Sòcis
Partners
Sòcis
Partnerships and networks
Collaboracions e hilats
Partner with local organizations or individuals
Collaborar damb persones o organizacions locaus
[Part of the text can’t be heard]
[Non s'enten part deth tèxte]
Pashto
paishto
<p>Common Voice is a publicly available voice dataset, powered by the voices of volunteer contributors around the world. People who want to build voice applications can use the dataset to train machine learning models.</p>
<p>At present, most voice datasets are owned by companies, which stifles innovation. Voice datasets also underrepresent: non-English speakers, people of colour, disabled people, women and LGBTQIA+ people. This means that voice-enabled technology doesn’t work at all for many languages, and where it does work, it may not perform equally well for everyone. We want to change that by mobilising people everywhere to share their voice.</p>
<p>Common Voice ei un conjunt de donades de votz disponible publicament, possat pes votzes de collaboradors volontaris de pertot eth mon. Es persones que volgatz crear aplicacions de votz podetz emplegar eth conjunt de donades entà entrenar modèls d'aprendissatge automatic.</p>
<p>Actuaument, era majoritat des conjunts de donades de votz son proprietat d'enterpreses, causa que frene era innovacion. Es conjunts de donades de votz tanben representen en excès es òmes blanqui de parlar anglés. Açò vò díder qu'era tecnologia abilitada entà votz non foncione bric entà fòrça lengües e, a on foncione, pòt èster que non foncione parièr entà toti. Volem cambiar aquerò en tot mobilizar a gent de pertot entà compartir era sua votz.</p>
People come and contribute their voices.
Era gent ven e collabòre en tot enregistrar era sua votz.
Persian
pèrsa
Persian
Pèrse
Personal
Personau
Personal dashboards keep you up-to-date with individual and community progress.
Es taulèrs personaus vos informen sus eth progrès individuau e dera comunitat.
<playbookLink>Find helpful guidance</playbookLink> on the entire Common Voice journey, from localisation to dataset usage, as well as how to connect with our community.
<playbookLink>Trobatz orientacions utiles</playbookLink> sus tot eth recorregut de Common Voice, dera localizacion enquiar usatge de conjunts de donades, atau coma sus com connectar damb era nòsta comunitat.
<playIcon></playIcon>Last one!
<playIcon></playIcon> Eth darrèr!
Play/Stop
Lectura/Arrèst
Play/Stop
Reprodusís/artura
Please add a source.
Ahigetz ua hònt.
Please add sentences.
Ahigetz frases.
Please check the <howToLink>guidelines</howToLink>.
Revisatz es <howToLink>directritzes</howToLink>.
Please confirm that these sentences are public domain.
Confirmatz qu'aguestes frases son de domeni public.
Please select a language.
Seleccionatz ua lengua.
Please select a language to review sentences.
Escuelhetz ua lengua entà revisar-ne es frases.
Please try again.
Tornatz-ac a sajar.
Polish
Polonés
Polish
polonés
<pontoonAccountLink>Create an account</pontoonAccountLink> if you don’t have one. Then, choose your language <strong>(‘Team’)</strong> and then choose the project, <pontoonCvLink>Common Voice</pontoonCvLink>. There will be files to translate. Click on one, then it shows words in English and a box to translate them.
<pontoonAccountLink>Creatz un compde</pontoonAccountLink> se non n'auetz cap. Ara seguida, escuelhetz era vòsta lengua <strong>(«Equipa»)</strong> e, ara seguida, escuelhetz eth projècte, <pontoonCvLink>Common Voice</pontoonCvLink>. I aurà fichèrs entà tradusir. Hetz clic ath dessús d’un, dempús se mòstren es paraules en anglés e un quadre entà tradusir-les.
Portuguese
portugués
Portuguese (Brazil)
Portugués (Brasil)
Portuguese (Brazil)
Portugués (Brasil)
Press play, listen & tell us: did they accurately speak the sentence below?
Premetz eth boton de reproduccion, escotatz e didetz-mos s'er àudio correspon exactaments ara frasa de jos:
Press { shortcut-play-toggle } to toggle play mode
Premetz { shortcut-play-toggle } entà iniciar e arturar era reproduccion
Privacy
Vida privada
Privacy
Privadança
Privacy
Privadança
Privacy Policy
Politica de privadança
Privacy, security and transparency.
Privadança, seguretat e transparéncia.
Pro
Professionau
Profile
Perfil
Profile
Perfil
Profile
Perfil
Profile
Perfil
Profile information improves the audio data used in training speech recognition accuracy.
Era informacion deth perfil melhore es donades d’àudio utilizades entà entrenar era precision dera reconeishença de votz.
Profile: { $username }
Perfil: { $username }
Progress
En cors
Punctuation. Special symbols and punctuation should only be included when absolutely necessary. For example, an apostrophe is included in English words like “don’t” and “we’re” and should be included in the source text, but it’s unlikely you’ll ever need a special symbol like “@” or “#.”
Puntuacion. Es simbèus especiaus e es signes de puntuacion sonque s’an d’includir quan sigue absolutaments de besonh. Per exemple, en catalan s'usen eth punt volat ena ela geminada e er apostròf deuant de paraules començades per vocau, e se s’aurien d’includir en tèxte, mès ei pòc probable que jamès ajatz de besonh un simbèu especiau coma @ o #.
Punjabi
punjabi
Quechua Chanka
quíchua chanka
r
e
[‘Raised’ in English is always pronounced as one syllable, not two]
[En aranés, «escrit» tostemp se pronóncie dera madeisha manèra]
Reader Effects
Efèctes de lectura
Reading contractions that aren't actually there, such as "We're" instead of "We are", or vice versa.
Hèr contraccions o elisions que non se tròben en tèxte, o vice-vèrsa. Per exemple, hèr «eth oncle» en lòc de «er oncle», o hèr «'nar» alloc de «anar».
Read More
Mès informacion
Read More
Legir la seguida
Read more on our About page
Vedetz mès informacion ena pagina «Qui èm»
Read our How-to if unsure how to attribute
Liegetz es instruccions se non ètz segur(e)s de coma atribuïr
Read the sentence carefully - don’t miss, change or add words.
Liegetz atentaments era frasa: non ometetz, cambietz ne i ahijatz cap de paraula.
{ $readyCount } sentences ready for submission!
I a { $readyCount } frases preparades entar enviament!
Ready to add your voice or lend your ear?
Ètz prèst/a entà ahíger era votz o parar era aurelha?
Ready to contribute?
Prèst/a entà collaborar?
Ready to do { $count } more?
Prèst/a entà hèr-ne { $count } mès?
Ready to donate your voice?
Prèst a donar vòstra votz ?
Ready to donate your voice?
Ètz prèst entà dar era votz?
Ready to help validate sentences?
Ètz prèst/a entà validar frases?
reCAPTCHA is required if you want to proceed
reCAPTCHA ei de besonh se voletz seguir
Receive emails such as challenge and goal reminders, progress updates, and newsletters about Common Voice.
Recebetz corrèus coma rebrembes d'objectius e desfisi, actualizacions deth progrès e bulletins sus Common Voice.
Recorded Clips
Talhs enregistradi
Recorded Hours
Ores enregistrades
<recordIcon></recordIcon> Last one!
<recordIcon></recordIcon>Eth darrèr!
Recording
Enregistrament
[Recording cut off before the end of the last word]
[Talh d’enregistrament abans dera fin dera darrèra paraula]
Recordings
Enregistraments
Recordings
Enregistraments
Recordings
Enregistraments
Recording voice clips is an integral part of building our open dataset; some would say it's the fun part too.
Er enregistrament de talhs de votz ei ua part essenciau entà crear eth nòste conjunt de donades dubèrtes. Tanben ei era part mès divertida.
Record/Stop
Enregistra/artura
Record/Stop
Enregistrar/Arrestar
RECORD VOICEWAVE
ENREGISTRAR VÒSTRA PRESENTACION
Record your voice
Enregistratz era vòsta votz
Record your voice
Enregistrar vòstra votz
Recreate Voicewave
Tornar crear vòstra mòstra de votz
Refresh
Actualiza
Reject
Refusa
Rejected Sentences
Frases refusades
{ $rejectedSentences } rejected sentences.
{ $rejectedSentences } frases refusades.
Remember that we need permission to publish those sentences, so always ensure that the text belongs to the <wikipediaLink>public domain</wikipediaLink>. If there is no explicit indication, reach out to the person that the text belongs to and ask if they want to contribute their text to the Public Domain. If they want to, help them go through the <cc0WaiverLink>Contributions Agreement</cc0WaiverLink> process. If they do not want to contribute the text, then you are not allowed to use that text.
Arrebrembatz qu'auem de besonh autorizacion entà publicar aguestes frases, per tant asseguratz-vos cada viatge qu'eth tèxte aparten ath <wikipediaLink>domeni public</wikipediaLink>. Se non i a cap d’indicacion explicita, metetz-vos en contacte damb era persona a qui aparten eth tèxte e demanatz se vò aportar eth tèxte ath domeni public. S'ac vòlen, ajudatz-les a passar peth procès deth <cc0WaiverLink>Acòrd de collaboracions</cc0WaiverLink>. Se non vòlen aportar eth tèxte, non lo podetz emplegar.
remove
elimina
Remove
Elimina
Remove
Levar
Remove Avatar
Elimina er avatar
Report
Informa
Report Bugs
Informa de problèmes
Report copyright issues
Informa de problèmes de drets d'autor
Report was passed successfully
Er informe s'a enviat corrèctament
Request a Language
Sollicita ua lengua
Request a Language
Demandar una lenga
*required
* de besonh
*required
*requesit
Re-record
Torna a enregistrar
RE-RECORD
TORNAR ENREGISTRAR
Re-record clip
Torna a enregistrar eth talh
Reset skipped sentences
Restablís es frases ometudes
Retry
Tornar ensajar
RETRY
TORNAR FAR
Return
Retorn
Return here to edit your goal anytime.
Tornatz ací en moment que sigue entà editar eth vòste objectiu.
Return to Common Voice
Tornar a Common Voice
Return to Common Voice
Torna tà Common Voice
Return to Common Voice Datasets
Torna tàs conjunts de donades deth Common Voice
Return to Languages
Tornar a las lengas
Return to Languages
Torna tàs lengües
Review
Revisa
Review
Revisa
Review
Verificacion
Review
Revisa
Review could not be saved. Please try again later.
Non s'a pogut sauvar era revision. Tornatz-ac a provar mès tard.
ⓘ Review Criteria
ⓘ Critèris de revision
Review Criteria
Critèris de revision
Reviewed sentences not submitted, are you sure?
Non s’an enviat es frases revisades, n’ètz segur/a?
<reviewLink>Review now!</reviewLink>
<reviewLink>Revisatz ara!</reviewLink>
Review & re-record clips here as you go
Revisatz e tornatz a enregistrar talhs d’aciu estant
Review & re-record clips if needed
Revisatz es talhs e, se cau, tornatz-les a enregistrar
Review sentences
Revisa es frases
Review Sentences
Revisa es frases
Review Sentences
Revisar frases
Review & Submit
Revisatz e enviatz-ac
Review & Submit
Repassar e enviar
Romanian
Romanés
Romanian
romanés
Romansh Sursilvan
romanch sobreselvan
Romansh Vallader
romanch vallader
Runyankole
nyankole
Russian
rus
Russian
Rus
s
o
S
O
Sakha
sakha
Santali (Ol Chiki)
santali (ol chiki)
Saraiki
seraiki
Sardinian
sard
Save
Enregistrar
Save
Sauva
Saved
Enregistrat
Saved
S'a sauvat
Search
Cerca
Search
Recercar
Search for answers
Cèrca responsa
Search for them on the Internet
Cercatz-ne en Internet
See how your progress compares to other contributors all over the world.
Comparatz eth vòste progrès damb eth des auti collaboradors de tot eth mon.
See Less
Ne veire mens
See Less
Mòstra’n mens
See More
Ne veire mai
See More
Mòstra’n mès
Select a Language...
Seleccionatz ua lengua...
Selected
Seleccionat
Select Language
Escuelhetz ua lengua
Send sign up link
Envia eth ligam de registre
Sentence Collection
Recuelhuda de frases
Sentence Collection
Recuelhuda de frases
Sentence must contain between 1 and 14 (inclusive) words
Era frasa a de contier entre 1 e 14 paraules
Sentences
Frases
{ $sentences } added by you
N'auetz ahijut { $sentences }
Sentences are being uploaded. This can take several minutes depending on the number of sentences added. Please do not close this website.
Es frases se pugen. Açò pòt trigar bères menutes segon eth nombre de frases ahijudes. Non barretz aguesta pagina.
Sentences are collected for people to read aloud.
S'arrecuelhen frases entà qu'era gent les liege en votz nauta.
Sentence should not contain abbreviations
Era frasa non pòt contier abreviatures
Sentence should not contain numbers
Era frasa non pòt contier chifres
Sentence should not contain symbols
Era frasa non pòt contier simbèus
Sentences must be reviewed and accepted by two people to be included in Common Voice. You can create guidelines for your language <languageGuidelinesLink>here</languageGuidelinesLink>. Sentences must be in the <licenseLink>public domain</licenseLink> and shorter than 15 words. You can ask the owner of a text to make it <waiverLink>CC0 using our waiver process</waiverLink>, and send it to us at commonvoice@mozilla.com
Es frases an d’èster revisades e acceptades per dues persones entà èster includides en Common Voice. Creatz directritzes entara vòsta lengua <languageGuidelinesLink>ací</languageGuidelinesLink>. Es frases an d’èster de <licenseLink>domeni public</licenseLink> e an d’auer mens de 15 paraules. Podetz demanar ath proprietari d'un tèxte que lo hèsque <waiverLink>CC0 mejançant eth nòste procès d'exempcion</waiverLink> e mos l'envie tà commonvoice@mozilla.com
Sentences not submitted, are you sure you want to leave?
Es frases non s'an enviat, ètz segur/a de que voletz gésser?
{ $sentences } sentences failed
A falhat { $sentences } frases
{ $sentences } sentences reviewed. Thank you!
Auetz revisat { $sentences } frases. Mercés!
Sentences that have been validated are exported on Wednesdays, and are available after the next release.
Es frases que s'an validat s'expòrten eth dimèrcles e son disponibles dempús deth següent lançament.
Sentence text
Tèxte dera frasa
Serbian
Sèrbe
Serbian
sèrbi
Set a goal
Definís un objectiu
Set my visibility
Definís era mia visibilitat
Settings
Paramètres
Settings
Paramètres
Settings
Paramètres
Sex
Sèxe
Sex
Sèxe
Share Common Voice
Compartitz eth Common Voice
Share my goal
Compartís eth mèn objectiu
Share your clip
Compartitz eth vòste talh de votz
Share your { $count } Clip Daily Goal for { $type }
Compartitz eth vòste objectiu diari de { $count } talhs { $type }
Share your { $count } Clip Weekly Goal for { $type }
Compartitz eth vòste objectiu setmanau de { $count } talhs { $type }
Shilha
chelha
Shortcuts
Dreceres
Shortcuts
Acorchis
[Should be ‘dinosaurs’]
[Aurie d’èster «dinosaures»]
[Should be “We are”]
[Aurie d’èster «Anam»]
Show all skipped sentences again
Mòstra de nau totes es frases ometudes
Show my ranking
Mòstra eth mèn classament
Sicilian
sicilian
Sign up
Registre
Sign up for an account
Crear un compte
Sign up for an account
Crea un compde
Sign up for Common Voice newsletters, goal reminders and progress updates
Creatz un compde entà recéber es notícies de Common Voice, rebrembes des objectius e actualizacions de progrès
sign up for email updates
registratz-vos entà recéber actualizacions per corrèu
Sign up for { $lang } updates:
Recebre las actualitats del projècte en { $lang } :
Sign up for { $lang } updates:
Soscriuetz-vos entà recéber actualizacions sus eth { $lang } :
Single ZIP file containing
Un solet archiu ZIP que conten
Sinhala
sinhala
Siswati
{ "" }
Size
Mesura
Size
Mesura
Skip
Omet
Skip
Passar
Skip
Omet
Skip Submission Feedback
Omet er enviament de comentaris
Slovak
eslovac
Slovak
Eslovac
Slovenian
esloven
Slovenian
Eslovèn
Social media
Mejans de comunicacion sociau
Somali
somali
Some languages have enormous variation in grammar, vocabulary and pronunciation. For this reason, we are <ctaLink>introducing ‘Variants’</ctaLink> in 2022. This gives communities a way to distinguish their languages within the larger dataset.
Bères lengües an ua grana variacion en gramata, vocabulari e prononciacion. Per aguest motiu, <ctaLink>introdusim es «Variantes»</ctaLink> eth 2022. Açò aufrís as comunitats ua manèra de distinguir es lengües en conjunt de donades mès generaus.
Someone asks for a language to be added.
Quauquarrés demane que s’ahige ua lengua.
Something went wrong with reCAPTCHA. Please try again.
S'a produsit un error damb reCAPTCHA. Tornatz-ac a provar mès tard.
Sorbian, Lower
Sèrbe bas
Sorbian, Lower
baish sorab
Sorbian, Upper
Sèrbe naut
Sorbian, Upper
naut sorab
Sorry, Common Voice is running slowly. Thanks for your interest.
Mos sap mau, Common Voice trigue a cargar-se, gràcies per interès.
Sorry, something went wrong
I a agut un problèma
Source: { $sentenceSource }
Hònt: { $sentenceSource }
Source: { $source }
Hònt: { $source }
Spanish
espanhòu
Spanish
Castelhan
Speak
Parlatz
Speak
Parlar
Speak
Parla
Speakers
Locutors
Speaking
en tot parlar
Speaking and Listening
en tot parlar e en tot escotar
Speak in your normal voice! The way you speak is welcome here - we want your accent as it is, and we want your usual volume, style and intonation.
Parlatz damb era votz normau! Ací, era vòsta manèra de parlar ei benvenguda: volem eth vòste accent tau com ei, e volem eth vòste volum, estil e entonacion abituaus.
Speak now
Parlatz ara
Speak now
Parlatz ara
Speak up, contribute here!
Parlatz fòrt e collaboratz-i!
Speak up, contribute here!
Parlatz fòrt, contribuissètz aquí !
<speechBlogLink>Get Started with Speech Recognition</speechBlogLink>
<speechBlogLink>Prumèrs passi ena reconeishença dera votz</speechBlogLink>
Speech is often the most natural way we communicate with each other and voice technologies are bringing that convenience to our computers and mobile devices. We want to empower developers to build amazing voice recognition applications like real-time translators and voice-enabled digital assistants. But right now most of the voice data required to build these kinds of apps is expensive and proprietary. We hope the Common Voice dataset will give developers what they need to innovate and make speech technology available in their own language.
To make voice recognition even more universal, we're collecting voice samples in widely spoken languages as well as those with a smaller population of speakers often underserved by commercial speech recognition services. Publishing a diverse dataset of voices will empower developers, entrepreneurs, and entire speech communities to address this gap themselves.
Eth parlar ei soent era manèra mès naturau de comunicar-mos entre nosati e es tecnologies de votz apòrten aguesta comoditat as ordenadors e dispositius mobils. Volem capacitar as desvolupadors entà crear aplicacions susprenentes de reconeishença de votz, coma per exemple reviradors en temps reau e assistents digitaus controtladi pera votz. Ara madeish, totun, era majoria des donades de votz de besonh entà bastir aguesti tipes d'aplicacions son cares e privades. Demoram qu'eth conjunt de donades deth Common Voice proporcione as desvolupadors çò qu’an de besonh entà innovar e qu'era tecnologia deth parlar sigue disponibla ena sua respectiva lengua. Entà hèr qu'era reconeishença de votz sigue encara mès universau, recuelhem mòstres de votz en lengües damb fòrça parlants, atau coma enes qu’an ua poblacion mès petita de parlants e que soent non an accès as servicis de reconeishença de votz comerciau. Era publicacion d'un conjunt de donades de votz diuèrsa permeterà resòlver aguest uet as desvolupadors, as empresaris e a totes es comunitats de tecnologies de votz.
Speech-to-text (STT)
Votz a tèxte (Speech-to-text, STT)
Speech-to-text (STT) technologies convert voice data into text.
Es tecnologies de votz a tèxte (STT) convertissen es donades de votz en tèxte.
Split into { $archiveCount } ZIP files containing
Dividit en { $archiveCount } archius ZIP que contien
Splits
Divisions
Start recording
Enregistratz
Start typing to describe your accent
Començatz a escríuer entà descríuer eth vòste accent
Statistics
Estadistiques
Statistics
Estadistiques
Stats
Estadistiques
Status Page
Estat deth projècte
Streaks
Seguides
<strong>[Crackle]</strong> giant dinosaurs of <strong>[crackle]</strong> -riassic.
<strong>[esclat]</strong> grani dinosaures de <strong>[esclat]</strong> -riasic.
<strong>Do not reject clips where the speaker ‘has an accent’ that is different to your own</strong> - this is important for voice recognition to work better for everyone.
<strong>Non refusetz talhs a on eth locutor «a un accent» diferent deth vòste</strong>: açò ei important entà qu'era reconeishença de votz foncione ben entà toti.
<strong>Get creative!</strong> Your language community will be unique, and these are just a few ways to get started.
<strong>Sigatz creatius!</strong> Era vòsta comunitat lingüistica serà unica e aguestes son sonque bères manères de començar.
<strong>Sentence Collector</strong> is a tool for collecting and validating public domain sentences. You will <scAccountLink>need to create an account</scAccountLink>. Then, add your language to your <strong>Profile</strong>. You can <strong>Add</strong> sentences or <strong>Review</strong> previously-added sentences.
<strong>Sentence Collector</strong> ei un estrument entà recuélher e validar frases de domeni public. Auratz de <scAccountLink>crear un compde</scAccountLink>. Ara seguida, ahigetz era vòsta lengua ath <strong>perfil</strong>. I podetz <strong>ahíger</strong> frases o <strong>revisar</strong> es frases ahijudes anterioraments.
<strong>[Sneeze]</strong> The giant dinosaurs of the <strong>[cough]</strong> Triassic.
<strong>[esternut]</strong> Es grani dinosaures deth <strong>[estossegadera]</strong> Triasic.
Submission: { $batchId }
Enviament: { $batchId }
Submission Error
Error d'enviament
Submit
Mandar
Submit
Envia
Submit
Mandar
Submit
Envia
Submit
Envia
Submit a report
Enviatz un repòrt
Submit clips
Envia es talhs
Submit clips
Envia es talhs
Submitted sentences. { $duplicates } sentences were rejected as duplicates.
Frases enviades. S'an refusat { $duplicates } frases coma duplicades.
Subscribe
Soscriu-me-i
Success
Corrècte
Success, profile created!
Hèt. S'a creat eth perfil!
Swahili
swahili
Swedish
Suedés
Swedish
suedés
Swipe right to approve the sentence. Swipe left to reject it. Swipe up to skip it. <strong>Do not forget to submit your review!</strong>
Lisatz tara dreta entà aprovar era frasa. Lisatz tara quèrra entà refusar-la. Lisatz ensús entà ometer-la. <strong>Non desbrembetz d’enviar era revision!</strong>
Syriac
sirian
Tagalog
tagalog
Tahitian
tahitian
Taiwanese (Minnan)
taiwanés (min nan)
Tajik
tajik
Take a look
L’ensajar
Taking several attempts to read a word.
Hèr multiples assagi entà liéger ua paraula.
Tamil
Tamil
Tamil
tamil
Tap
Tocatz
Tatar
Tatar
Tatar
tatar
Tatoeba is a large database of sentences, translations, and spoken audio for use in language learning. This download contains spoken English recorded by their community.
Tatoeba ei ua grana basa de donades de frases, traduccions e enregistraments d’àudio entar usatge en aprendissatge de lengües. Aguesta baishada conten toti es enregistraments d'anglés parlat hèti pera comunitat.
TED-LIUM Corpus
Còrpus de TED-LIUM
Tell us what you'd like to download:
Didetz-mos qué voletz baishar:
Telugu
Telugu
Telugu
telugu
Terms
Condicions d'usatge
Terms
Condicions d'usatge
Terms
Condicions
Thai
Tai
Thai
tai
Thanks for confirming your account, now let's build your profile.
Gràcies per confirmar eth compde. Ara crearam eth vòste perfil.
Thanks to contributions from over 259k people in over 50 languages, this data is being used to train speech-enabled applications to better respond to the human voice.
Mercés ara collaboracion de mès de 259.000 persones en mès de 50 lengües, aguestes donades s'usen entà entrenar aplicacions de votz entà respóner milhor ara votz umana.
Thank you for recording!<lineBreak></lineBreak>Now review and submit your clips below.
Gràcies peth vòste enregistrament!<lineBreak></lineBreak>Ara seguida podetz revisar e enviar es vòsti talhs de votz.
Thank you for your interest in contributing to { $lang }. We work hard to get every language ready for launch and keep
the teams updated via email. If you want to contribute, please provide your email below.
Mercés de vòstre interès per contribuir en { $lang }. Trabalham dur per far que totas las lengas sián disponiblas e per téner
los contribuidors al fial per corrièl. Se volètz contribuir, picatz vòstra adreça.
Thank you for your interest in contributing to { $lang }. We work hard to get every language ready for launch and keep
the teams updated via email. If you want to contribute, please provide your email below.
Mercés peth vòste interès en collaborar en { $lang }. Trabalham plan entà arténher que totes es lengües siguen alestides entath lançament e mantiem es equipes informades per corrèu electronic. Se voletz collaborar, indicatz era vòsta adreça electronica ara seguida.
Thank you! You’ve sent a new language enquiry
Gràcies! Auetz enviat ua naua consulta de lengua
The best sources you can look for are podcasts, transcripts, movie scripts and anything that can potentially contain everyday conversations.
Es melhores hònts que podetz cercar son podcasts, transcripcions, guidons de pellicules e quinsevolhe causa que pogue contier convèrses quotidianes.
The bumblebee sped by.
Un elefant aucic ua aranha.
The Clip Graveyard consists of voice clips that didn't make it into the Common Voice dataset. Just like the dataset, the Clip Graveyard is available for download.
We would like to thank the following people and organizations for their help with the project:
Eth cementèri de talhs includís es talhs de votz que non an entrat en conjunt de donades deth Common Voice. Atau coma eth conjunt de donades, eth cementèri de talhs tanben se pòt descargar.
The clip has disrespectful or offensive language.
Eth talh a un lenguatge ofensiu o irrespectuós.
The Common Voice dataset complements Mozilla’s open source voice recognition engine Deep Speech. The first version of Deep Speech was released in November 2017 and has continued to evolve ever since. Together with the Common Voice dataset, we believe this open source voice recognition technology should be available to everybody. It’s our hope these technologies will enable developers to build a wave of innovative products and services.
Eth conjunt de donades deth Common Voice complète eth motor de reconeishença de votz de còdi dubèrt Deep Speech de Mozilla. Era prumèra version deth Deep Speech se publiquèc en noveme de 2017 e a seguit era evolucion d'alavetz ençà. Amassa damb eth conjunt de donades deth Common Voice, credem qu'aguesta tecnologia de reconeishença de votz de còdi dubèrt aurie d’èster disponibla entà toti. Demoram qu'aguestes tecnologies permeten as desvolupadors bastir ua onada de productes e servicis innovadors.
The Common Voice dataset complements Mozilla’s open source voice recognition engine Deep Speech, which you can use to build speech recognition applications. Read our <githubLink>Github overview</githubLink> or join the <discourseLink>DeepSpeech Discourse</discourseLink> to learn how to get started.
Lo jòc de donadas de Common Voice de Mozilla es un complementari al motor de reconeissença vocala Deep Speech, que podètz utilizar per crear d’aplicacions de reconeissença vocala. Consultatz nòstre <githubLink>apercebut sus GitHub</githubLink> o visitatz nòstre <discourseLink>Discourse DeepSpeech</discourseLink> per obténer d’informacions per ben començar.
The Common Voice Dataset contains hundreds of thousands of voice samples that help developers build voice recognition tools.
Eth conjunt de donades deth Common Voice conten centenats de milèrs de mòstres de votz qu’ajuden as desvolupadors a crear estruments de reconeishença de votz.
The Common Voice dataset is an open and publicly available resource that can be used to train a wide variety of speech-enabled applications. To protect the security of our contributors, we ask everyone who downloads the Common Voice dataset to respect contributors’ privacy.
All voice clips in the dataset are scrubbed of personally identifying information. When you download the dataset, you agree to not attempt to determine the identity of any contributor. That means you cannot try to link information in the dataset to a contributor’s personal information. You may, however, use the dataset to train speech recognition, speaker recognition, or other applications, by, for instance, linking information in the dataset to other information already in the dataset.
Eth conjunt de donades deth Common Voice ei un recors dubèrt e disponible publicaments que se pòt emplegar entà entrenar ua ampla varietat d’aplicacions qu’utilizen era votz. Entà protegir era seguretat des nòsti collaboradors, demanam a toti es qui baishen eth conjunt de donades deth Common Voice que respècten era privadança des collaboradors.
Toti es talhs de votz deth conjunt de donades son analisadi entà eliminar-ne quina informacion que sigue d'identificacion personau. Quan baishatz eth conjunt de donades, acceptatz de non sajar de determinar era identitat de cap de collaborador. Açò vò díder que non podetz sajar de ligar informacion deth conjunt de donades ara informacion personau deth collaborador. Totun, podetz utilizar eth conjunt de donades entà entrenar era reconeishença de votz, era reconeishença d’oradors o de d’autes aplicacions, per exemple, ligant informacion deth conjunt de donades a d’auta informacion que ja i age en conjunt de donades.
The Common Voice dataset is available for download under the <licenseLink>CC0</licenseLink> license on <datasetLink>our Datasets page</datasetLink>. You can also download several other publicly available datasets from the same page.
Eth conjunt de donades deth Common Voice se pòt baishar jos era licéncia <licenseLink>CC0</licenseLink> ena <datasetLink>nòsta pagina de conjunts de donades</datasetLink>. Tanben podetz baishar diuèrsi conjunts de donades disponibles publicaments ena madeisha pagina.
The Common Voice Sentence Collector has collected { $sentenceCount } sentences in { $languageCount } languages!
Eth Common Voice Sentence Collector a arrecuelhut { $sentenceCount } frases en { $languageCount } lengües!
The count of voice recording hours that have been validated by 2 out of 3 users with a vote of “Yes”. These mark progress toward the overall project 10k hours goal.
Recompde des ores d’enregistrament de votz que s'an validat. Ua frasa se considère validada quan dus de tres usatgèrs vòten «Òc». Aguestes ores compden entar objectiu generau deth projècte de 10.000 ores.
The count of voice recording hours we have collected so far.
Recompde des ores d’enregistrament de votz que s'an arrecuelhut enquia ara.
The giant dinosaur of the Triassic.
Es grani dinosaure deth Triasic.
The giant dinosaurs of the Triassi-.
Es grani dinosaures deth Triasi-.
The giant dinosaurs of the Triassic.
Es grani dinosaures deth Triasic.
The giant dinosaurs of the Triassic. <strong>[read by one voice]</strong>
Es grani dinosaures deth Triasic. <strong>[Liejut per ua votz]</strong>
The giant dinosaurs of the Triassic. Yes.
Es grani dinosaures deth Triasic. Òc.
The giant dino <strong>[cough]</strong> the Triassic.
Es grani dino <strong>[estossegadera]</strong> deth Triasic.
The goal of the Common Voice dataset is to enable anyone in the world to build speech recognition, speaker recognition, or any other type of application that requires voice data. A voice assistant is just one of many types of applications you could use the dataset to build.
Er objectiu deth conjunt de donades deth Common Voice ei de perméter que quinsevolh persona deth mon pogue crear aplicacions de reconeishença de votz, de reconeishença de parlants o quinsevolh aute tipe d'aplicacion que requerisque donades de votz. Un assistent de votz ei sonque un des molti tipes d'aplicacions que podetz crear en tot emplegar eth conjunt de donades.
The multi-language version of the Common Voice dataset is currently undergoing community supported bundling and cleaning. If you would like to help us bring Common Voice to new languages, go check out the <sentenceCollectorLink>Sentence Collection Tool</sentenceCollectorLink> for adding new sentences to the dataset, and Mozilla <pontoonLink>Pontoon</pontoonLink> for translating the website itself. New languages are added to Common Voice for voice contribution when 5000 approved sentences have been collected.
Era version multilingua deth conjunt de donades deth Common Voice actuauments ei en fasa d’empaquetatge e neteja damb eth supòrt dera comunitat. Se mos voletz ajudar a ahíger naues lengües ath Common Voice, vedetz eth <sentenceCollectorLink>estrument de recuelhuda de frases</sentenceCollectorLink> entà ahíger frases naues ath conjunt de donades e çò <pontoonLink>Pontoon</pontoonLink> de Mozilla entà tradusir eth lòc web deth projècte. Es lengües naues s'ahigeràn ath Common Voice e poderàn començar a recéber aportacions de votz quan s'agen recuelhut e aprovat 5000 frases.
The number of recordings and which languages you contribute to will be public.
Eth nombre d’enregistraments e es lengües a on auetz collaborat seràn publics.
The process by which a contributor’s profile information is obscured from their donated voice clips when packaged for download as a part of the dataset.
Procès peth quau era informacion deth perfil d'un collaborador s'amague des talhs de votz qu’an dat quan s’empaqueten entà baishar-les coma part deth conjunt de donades.
There are a lot of public organizations that might want to collaborate and already have a lot of texts they can donate as public domain. Reach out to local universities, Governments and open source organizations to talk about the project and ask for their help.
I a fòrça organizacions publiques que dilhèu vòlen collaborar e ja an fòrça tèxtes que pòden dar coma domeni public. Metetz-vos en contacte damb universitats, govèrns e organizacions de còdi dubèrt locau entà parlar deth projècte e demanar-les ajuda.
There are lots of ways to think about language. For the purposes of speech recognition models, Common Voice suggests focussing on ‘mutual intelligibility’, or ‘can speakers of this language mostly understand one another if they try to?’
I a fòrça manères de considerar qué ei ua lengua. A efèctes des modèls de reconeishença de votz, Common Voice suggerís de centrar-se ena «intelligibilitat mutuau» o «es parlants d'aguesta lengua se pòden compréner, entre eri, s'ac sagen?»
The recording was too long.
Er enregistrament ei massa long.
The recording was too quiet.
Eth volum der enregistrament ei massa baish.
The recording was too short.
Er enregistrament ei massa cuert.
There will be natural variations in volume between readers. Reject only if the volume is so high that the recording breaks up, or (more commonly) if it is so low that you can’t hear what is being said without reference to the written text.
Abituaument i a variacions en volum entre lectors. Refusatz un talh sonque s'eth volum ei tan naut qu'er enregistrament s’entrecope, o (mès abituau) s'ei tan baish que non podetz sénter se qué se i ditz sense era referéncia deth tèxte escrit.
These languages are currently under community development. <localizationGlossaryLink>Website localization</localizationGlossaryLink> and <sentenceCollectionGlossaryLink>sentence collection</sentenceCollectionGlossaryLink> is needed to launch.
Aguestes lengües son en desvolopament per part dera comunitat. Es barres de progrès indiquen guaire fauta entà completar era fasa de <localizationGlossaryLink>traduccion deth lòc web</localizationGlossaryLink> e era <sentenceCollectionGlossaryLink>recuelhuda de frases</sentenceCollectionGlossaryLink>.
The selected file is too large
Lo fichièr seleccionat es tròp pesuc
The selected file is too large
Eth fichèr seleccionat ei massa gran
The Sentence Collector is part of <commonVoiceLink>Common Voice</commonVoiceLink>. It allows contributors to collect and validate sentences created by the community. You can use this tool also to import and clean-up small-to-medium-sized public domain corpus you have found or collected. All sentences need to be Public Domain. Approved sentences are exported every week to the Common Voice repository and are released on the Common Voice website on every new deployment.
Eth Sentence Collector ei part deth <commonVoiceLink>Common Voice</commonVoiceLink>. Permet as collaboradors de recuélher e validar frases creades pera comunitat. Tanben podetz usar aguest estrument entà importar e netejar còrpus de domeni public de mesura petita e mejana qu’ajatz trobat o recuelhut. Totes es frases an d’èster de domeni public. Es frases aprovades s'expòrten cada setmana tath repositòri deth Common Voice e son disponibles en lòc web deth Common Voice en cada nau desplegament.
The sentence has a grammatical or spelling error.
Era frasa a un error gramaticau o ortografic.
The sentence has disrespectful or offensive language.
Era frasa a un lenguatge ofensiu o irrespectuós.
The sentence must be grammatically correct.
Era frasa ei gramaticaument corrècta.
The sentence must be grammatically correct.
Era frasa a d’èster gramaticaument corrècta.
The sentence must be speakable.
Era frasa ei pronunciabla.
The sentence must be speakable.
Era frasa a d’èster prononciabla.
The sentence must be spelled correctly.
Era frasa ei escrita corrèctament, sense fautes d'ortografia.
The site is ready to be launched when it reaches 75% completion.
Eth lòc ei prèst entà èster publicat quan arribe ath 75% dera finalizacion.
The site will be back up as soon as possible. For the latest information, please join the <matrixLink>Matrix community chat</matrixLink> or visit <githubLink>GitHub</githubLink> or <discourseLink>our Discourse forums</discourseLink> to submit and monitor site experience issues.
Eth lòc tornarà a èster disponible çò de mès lèu possible. Entà veir era informacion mès actuau, junhetz-vos ath <matrixLink>chat dera comunitat a Matrix</matrixLink>, o visitatz eth <githubLink>GitHub</githubLink> o es nòstes <discourseLink>fòrums deth Discourse</discourseLink> entà enviar e supervisar es problèmes qu’experimentetz en lòc.
The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website.
Eth còrpus de TED-LIUM se creèc a compdar d’enregistraments d’àudio e es sues transcripcions disponibles en lòc web de TED.
The website text is translated into that language.
Eth tèxte deth lòc web se tradusís ad aguesta lengua.
This is approximately the number of hours required to train a production speech-to-text system.
Aguest ei, apuprètz, eth nombre d'ores de besonh entà entrenar un sistèma de reconeishença dera votz en produccion.
This is a use case driven segment containing data to power spoken digit recognition and yes / no detection.
Açò ei segmentacion de casi d'usatge que conten donades entà potenciar era reconeishença des digits e era deteccion des paraules «òc» e «non».
This is open source software which can be freely remixed, extended, and improved. Mycroft may be used in anything from a science project to an enterprise software application.
Aquò es un logicial liure que se pòt remesclar, desvelopar e melhorar. Mycroft pòt èsser utilizat dins totes los domenis, del projècte scientific al logicial d’entreprisa.
This is our process for translating and adapting our content for many locales (languages).
Procès de traduccion e adaptacion deth contengut en d’autes lengües.
This page gives you an overview of all your submitted sentences. You may also delete already submitted sentences if needed, by marking the checkbox next to it and clicking on "Delete selected sentences" at the bottom. Please only remove sentences if absolutely necessary, for example if you noticed after the fact that a sentence is copyright protected.
Aguesta pagina vos da ua vision generau de totes es frases qu’auetz enviat. Tanben podetz eliminar frases ja trametudes, se cau, en tot mercar eth caleishon de seleccion deth costat e en tot hèr clic en "Suprimís es frases seleccionades" baish de tot. Sonque esfaçatz frases s'ei absolutaments de besonh, coma per exemple se vos auetz encuedat de qu'ua frasa ei protegida per drets d'autor.
This setting controls your leaderboard visibility. When hidden, your progress will be private. This means your image, user name and progress will not appear on the leaderboard. Note that leaderboard refresh takes ~{ $minutes }min to populate changes.
Aguesta opcion contròtle era vòsta visibilitat en classament. S'ei amagada, eth vòste progrès serà privat. Açò signifique qu'era vòsta imatge, nòm d'usatgèr e progrès non apareisheràn en classament. Ajatz en compde qu'era actualizacion deth classament trigue { $minutes } menutes a mostrar es cambis, apuprètz.
Three to go!
Sonque ne manque tres!
Tigre
tigre
Tigrinya
tigrinha
Today
Uèi
Today
Aué
Today's Common Voice progress on clips recorded
Progrès d'aué deth Common Voice en çò que tanh a talhs enregistradi
Today's Common Voice progress on clips validated
Progrès d'aué deth Common Voice en çò que tanh a talhs validadi
Today's Progress
Progrès d'aué
Toki Pona
toki pona
To make it into the Common Voice dataset, a voice clip must be validated by two separate users.
Entà includir un talh de votz en conjunt de donades deth Common Voice, cau que dus usatgèrs diferenti lo validen.
To make the Common Voice dataset as useful as possible we have decided to only allow source text that is available under a Creative Commons (CC0) license. Using the CC0 standard means it’s more difficult to find and collect source text, but allows anyone to use the resulting voice data without usage restrictions or authorization from Mozilla. Ultimately, we want to make the multi-language dataset as useful as possible to everyone, including researchers, universities, startups, governments, social purpose organizations, and hobbyists.
Entà qu'eth conjunt de donades deth Common Voice sigue çò de mès util possible, auem decidit de perméter sonque tèxte hònt disponible damb licéncia Creative Commons (CC0). Er usatge der estandard CC0 implique qu'ei mès malaisit de trobar e recuélher tèxte hònt, mès permet que tot usatgèr emplegue es donades de votz resultantes sense restriccions d’emplec ne era autorizacion de Mozilla. En darrèr tèrme, volem qu'eth conjunt de donades multilingües sigue çò de mès util possible entà toti, includidi investigadors, universitats, enterpreses emergentes, govèrns, organizacions sociaus e aficionats.
Top Contributors
Collaboradors mès actius
Total
Totau
Total Approved
Totau d'aprovadi
{ $totalHours } hours is achievable in just over { $periodMonths } months if { $people } people record { $clipsPerDay } clips a day.
Se { $people } persones enregistren { $clipsPerDay } talhs per dia, se pòt arribar en { $totalHours } ores en pòc mès de { $periodMonths } mesi.
{ $totalInReview } sentences in review.
I a { $totalInReview } frases en revision.
{ $totalSentences } total sentences.
{ $totalSentences } frases en totau.
To the right we outline the benefits and clarify what information we make public. Use the links below to get started with a Common Voice account on your own device.
Ara dreta exposam es auantatges e esclarim se quina informacion hèm publica. Emplegatz es ligams següents entà començar a utilizar un compde Common Voice en vòste dispositiu.
Toward next goal
Tath següent objectiu
Track progress here and on your stats page.
Seguitz eth vòste progrès ací e ena pagina d'estadistiques.
Translate this page
Revira aguesta pagina
<translateVideoLink>Watch our guide on how to use Pontoon.</translateVideoLink>
<translateVideoLink>Vedetz era guida de coma usar Pontoon.</translateVideoLink>
Translating the site
Tradusís eth site
Translation is from English, but you can see <strong>Suggestions</strong> in other languages. Click the <strong>Profile</strong> icon, then <strong>Settings</strong> link and add any languages you speak. At the bottom right-hand corner will be a list of translations called <strong>Locales</strong>. Translations show on the site after one day.
Era traduccion ei der anglés, mès podetz veir <strong>Suggeriments</strong> en d’autes lengües. Hetz clic ena icòna <strong>Perfil</strong>, dempús en ligam <strong>Configuracion</strong> e ahigetz es lengües que parlatz. En estrèm inferior dret i aurà ua lista de traduccions nomentades <strong>Locales</strong>. Es traduccions se mòstren en lòc web londeman.
Translation of the Common Voice site happens on <strong>Pontoon</strong>.
Era traduccion deth lòc web de Common Voice se hè en <strong>Pontoon</strong>.
Turkish
turc
Turkish
Turc
Turkmen
turcman
Twi
twi
Typically megabytes
Normauments son bèri megabytes
Ubykh
ubikh
Udmurt
udmurt
Ukrainian
ucraïnian
Ukrainian
Ucraïnian
Understand contribution criteria
Comprenetz es critèris de collaboracion
Understand what to look for when listening to voice clips and help make your voice recordings richer too!
Comprenetz de qué cau cercar quan escotatz clips de votz, e ajudatz a qu'es vòsti enregistraments de votz tanben siguen mès enriquidi.
Unexpected response returned from server
Eth servidor a retornat ua responsa inesperada
Unfortunately we can’t have Wikimedia articles yet. So do not copy paste from there.
Malerosament, encara non podem auer articles dera Wikipèdia. Ei a díder que non copietz e peguetz tèxte d'aquiu estant.
{ $unreviewedSentencesByYou } sentences left for you to review.
Auetz { $unreviewedSentencesByYou } frases entà revisar.
Updating…
S'actualize…
UPLOAD
ENVIAR
Upload aborted. Do you want to delete your recordings?
S'a anullat er enviament. Voletz suprimir es vòsti enregistraments?
Upload an image file
Enviar un imatge
Upload an image file
Puja un fichèr d'imatge
Urdu
Ordo
Urdu
ordo
User Name
Nom d’utilizaire
User Name
Nòm d'usatgèr
Users validate the accuracy of donated clips, checking that the speaker read the sentence correctly.
Es usatgèrs validen era precision des talhs en tot verificar qu’eth locutor liege era frasa corrèctament.
Using Common Voice
Emplec deth Common Voice
Uyghur
uigur
Uzbek
Uzbec
Uzbek
uzbec
Validated Clips
Talhs validadi
Validated Hours
Ores validades
Validated Hrs
Oras validadas
Validated Hr. Total
Ores validades totaus
{ $validatedSentences } validated sentences.
{ $validatedSentences } frases validades.
Validating donated clips is equally important to the Common Voice mission. Take a listen and help us create quality open source voice data.
Era validacion des talhs de votz tanben ei importanta entara mission de Common Voice. Escotatz es talhs e ajudatz-mos a crear donades de votz de qualitat damb licéncia liura.
Validations
Validacions
Validations
Validacions
Value and recognition.
Valor e reconeishença.
Variants are a specific form of a language - for example shared by those living in a geography or community. Sometimes these are called dialects.
Es variantes son formes especifiques d'ua lengua. Per exemple, compartides per persones que viuen en ua zòna geografica o comunitat. A viatges se’n ditz dialèctes.
Varying Pronunciations
Prononciacions variables
Venetian
venèt
Version
Version
Version
Version
Vietnamese
vietnamian
Vietnamese
Vietnamian
View your progress against personal and project goals.
Vedetz eth vòste progrès entàs objectius personaus e deth projècte.
Visible
Visible
Visible
Visible
Voice clips are entered into a submission queue that readies them for listening.
Es talhs de votz se meten en ua coa de talhs premanidi entà escotar.
Voice Contribution
Collaboracions damb era votz
Voice Dataset, Ready for Download
Conjunt de donades de votz, preparat entà baishar
Voice is natural, voice is human. That’s why we’re excited about creating usable voice technology
for our machines. But to create voice systems, developers need an extremely large amount of voice
data.
Era votz ei naturau, era votz ei umana. Ei per aquerò que volem crear tecnologies de votz de qualitat entàs maquines. Mès entà crear aguesti sistèmes de votz, es desvolopadors an de besonh un gran volum de donades de votz.
Voice is natural, voice is human. That’s why we’re fascinated with creating usable voice
technology for our machines. But to create voice systems, an extremely large amount of voice
data is required.
Era votz ei naturau, era votz ei umana. Ei per açò que volem crear tecnologies de votz de qualitat entàs maquines. Mès eth desvolopament d'aguesti sistèmes requerís un gran volum de donades de votz.
Voice recognition technology is revolutionizing the way we interact with machines, but the currently available systems are expensive and proprietary. Common Voice is part of Mozilla’s initiative to make voice recognition technologies better and more accessible for everyone. Common Voice is a massive global database of donated voices that lets anyone quickly and easily train voice-enabled apps in potentially every language.
Era tecnologia de reconeishença de votz ei a revolucionar era manèra qu’interactuam damb es maquines, mès es sistèmes disponibles actuauments son cars e privatius. Eth Common Voice hè part dera iniciativa de Mozilla entà hèr qu'es tecnologies de reconeishença de votz siguen milhors e mès accessibles entà toti. Eth Common Voice ei ua basa de donades mondiaus massives de donacions de votzes que permet qui sigue d’entrenar rapidament e facilament es aplicacions de votz en practicament era lengua que sigue. Non sonque recuelhem mòstres de votz de lengües fòrça estenudes; tanben de lengües damb ua poblacion de parlants mès petita. Era publicacion d'un conjunt de donades de diuèrses votzes permeterà as desvolupadors, empresaris e comunitats solucionar aguesta manca per eri madeishi. Ath delà deth conjunt de donades deth Common Voice, tanben bastim un motor de reconeishença de votz de còdi dubèrt nomentat Deep Speech.
Voice recognition technology is revolutionizing the way we interact with machines, but the currently available systems are expensive and proprietary. Mozilla Common Voice is an initiative to make voice recognition technologies better and more accessible for everyone. Common Voice is a massive global database of donated voices that lets anyone quickly and easily train voice-enabled apps in potentially every language.
We're not only collecting voice samples in widely spoken languages but also in those with a smaller population of speakers. Publishing a diverse dataset of voices will empower developers, entrepreneurs, and communities to address this gap themselves.
Era tecnologia de reconeishença de votz ei a revolucionar era manèra qu’interactuam damb es maquines, mès es sistèmes disponibles actuaument son cars e privatius. Mozilla Common Voice ei ua iniciativa entà hèr qu'es tecnologies de reconeishença de votz siguen milhors e mès accessibles entà toti. Eth Common Voice ei ua basa de donades mondiaus massives de donacions de votzes que permet a qui sigue d’entrenar rapidament e facilament es aplicacions de votz en practicaments era lengua que sigue. Non sonque recuelhem mòstres de votz de lengües fòrça estenudes; tanben de lengües damb ua poblacion de parlants mès petita. Era publicacion d'un conjunt de donades de diuèrses votzes permeterà as desvolupadors, empresaris e comunitats solucionar aguesta manca per eri madeishi.
Voice recording in progress
Enregistrament de votz en cors
Voices Online Now
Votzes en linha ara
Voice Validation
Validacions des votzes
Volume
Volum
Votic
vòt
VoxForge was set up to collect transcribed speech for use with Free and Open Source Speech Recognition Engines.
VoxForge se creèc entà arrecuélher transcripcions de dialògs que se poguen emplegar en motors de reconeishença dera votz de còdi dubèrt.
Want to change your voice clip?
Volètz cambiar vòstra mòstra de votz ?
Want to help make Common Voice even better?
Great! Get in touch via email or <discourseLink>Discourse</discourseLink>
forums, submit site issues via <githubLink>GitHub</githubLink>, or join the
<matrixLink>Matrix</matrixLink> community chat.
Voletz ajudar a milhorar eth Common Voice? Geniau! Metetz-vos en contacte per corrèu electronic o pes fòrums deth <discourseLink>Discourse</discourseLink>, notificatz problèmes en <githubLink>GitHub</githubLink> o junhetz-vos ath nòste chat dera comunitat en <matrixLink>Matrix</matrixLink>.
Want to help make Common Voice even better?
Great! Get in touch via email or on <discourseLink>Discourse</discourseLink>,
submit feedback through <githubLink>GitHub</githubLink>,
or join us on <matrixLink>Matrix</matrixLink>.
Volètz ajudar a melhorar Common Voice ?
Crane ! Contactatz-nos per corrièl o sus <discourseLink>Discourse</discourseLink>,
mandatz de comentaris sus <githubLink>GitHub</githubLink>,
o venètz sus <matrixLink>Matrix</matrixLink>.
Want to help make Common Voice even better?
Great! Get in touch via email or on <discourseLink>Discourse</discourseLink>,
submit feedback through <githubLink>GitHub</githubLink>,
or join us on <slackLink>Slack</slackLink>.
Volètz ajudar a melhorar Common Voice ?
Crane ! Contactatz-nos per mail o sus <discourseLink>Discourse</discourseLink>,
enviatz un comentari via <githubLink>GitHub</githubLink>,
o rejonhètz-nos sus <slackLink>Slack</slackLink>.
Want to stay in touch with Common Voice?
Voletz èster en contacte damb Common Voice?
Want updates when we release a new version of the Common Voice dataset? Subscribe to our newsletter.
Voletz arrecéber actualizacions quan publicam ua naua version deth conjunt de donades deth Common Voice? Soscriuetz-vos ath nòste bulletin.
Watch our video explainer to help
Guardatz eth vidèo explicatiu entà ajudar-vos
We are building an open and publicly available dataset of voices that everyone can use to train speech-enabled applications.
Cream un conjunt de donades de votzes dubèrtes e disponibles publicaments que toti pòden utilizar entà entrenar aplicacions qu’empleguen era votz.
We are going out to get a coffee.
Anam damb tren tà Tarragona.
We are going out to get coffee.
Anam en tren tà Tarragona.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
A Mozilla construissèm una comunitat tocant las tecnologias vocalas. Volriam èsser al jorn amb las novetats, las fonts de donadas e volriam saber cossí utilizatz aquestas donadas.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
En Mozilla bastim ua comunitat ath torn des tecnologies dera votz. Voleríem èster ath dia des nauetats, de naues hònts de donades e voleríem saber com les emplegatz.
We at Mozilla are building a community around voice technology. We would like to stay in touch with updates, new data sources and to hear more about how you're using this data.
En Mozilla bastim ua comunitat ath torn des tecnologies dera votz. Voleríem èster ath dia des nauetats, de naues hònts de donades e saber com les emplegatz.
We believe that large and publicly available voice datasets foster innovation and healthy commercial competition in machine-learning based speech technology. This is a global effort and we invite everyone to participate. Our aim is to help speech technology be more inclusive, reflecting the diversity of voices from around the world.
Credem qu'uns conjunts de donades grani e disponibles publicaments promoiràn era innovacion e era competéncia comerciau sana ena tecnologia dera votz basada en aprendissatge automatic. Aguest ei un esfòrç globau e convidam a toti a participar-i. Eth nòste objectiu ei hèr qu’era tecnologia dera votz sigue mès inclusiva e rebate era diuersitat de votzes de tot eth mon.
We believe that large, publicly available voice datasets will foster innovation and healthy commercial competition in machine-learning based speech technology.
Common Voice’s multi-language dataset is already the largest publicly available voice dataset of its kind, but it’s not the only one.
Look to this page as a reference hub for other open source voice datasets and, as Common Voice continues to grow, a home for our release updates.
Credem qu'uns conjunts de donades grani e disponibles publicaments promoiràn era innovacion e era competéncia comerciau sana ena tecnologia dera votz basada en aprendissatge automatic.
Eth conjunt de donades multilingües deth Common Voice ja ei eth conjunt de donades de votz disponible publicaments mès gran deth sòn tipe, mès non er unic.
Consideratz aguesta pagina coma punt centrau de referéncia sus es auti conjunts de donades de votz dubèrti e, a mesura qu'eth Common Voice vage creishent, era pagina a on anonciaram es actualizacions des versions.
Website Localization
Localizacion deth lòc web
We calculate hours by estimating the average length of each recording, and then multiplying that number by the total number of recordings across all languages.
Calculam es ores en tot estimar era durada mejana de cada enregistrament e, ara seguida, multiplicam aguest nombre peth nombre totau d’enregistraments en totes es lengües.
We could not fetch the stats. Please try again later.
Non s'an podut obtier es estadistiques. Tornatz-ac a sajar mès tard.
We couldn’t find that page for you
Non s’a trobat aguesta pagina
We couldn’t get any audio clips for you to listen to.
Please try again later.
Non auem podut arténher cap de talh de votz entà que lo revisetz.
Tornatz-ac a sajar mès tard.
We couldn’t get any sentences for you to speak.
Please try again later.
Non auem pogut cargar es oracions.
Se vos platz, tornatz-ac a sajar mès tard.
We don't have anything to validate in this language, help us fill the queue.
Avèm pas res a validar dins aquesta lenga, ajudatz-nos a ajustar d’enregistraments dins la fila.
Weekly
Setmanau
Weekly Goal
Objectiu setmanau
We failed to fetch available languages. Contributions are currently not possible. Please try again later.
Non auem podut obtier es lengües disponibles. Actuaument non se pòden hèr aportacions. Provatz-ac mès tard.
We launch the Common Voice site in this language.
Se publique eth lòc web de Common Voice en aguesta lengua.
Welcome { $company } staff!
Personau de { $company }, vos dam era benvenguda!
Welcome to Common Voice
Vos dam era benvenguda en Common Voice
Welcome to the Common Voice Sentence Collector
Vos dam era benvenguda ath Common Voice Sentence Collector
Welsh
Gallés
Welsh
gallés
We may have run out of clips - refresh the page, or try again later.
Non i a cap mès talh entà validar en aguesta lengua...
We may not have received that request. Please email us directly
on <emailLink>{ $email }</emailLink> and we will get back to you!
Dilhèu non auem recebut aguesta sollicitud. Enviatz-mos un corrèu electronic dirèctaments a <emailLink>{ $email }</emailLink> e vos arresponeram!
We prefer natural/conversational sentences. While phonetic diversity and different words in sentences is important, we are trying to make recording sentences as much fun as possible. Therefore it would be great if you could try to keep your sentences as natural/engaging as possible.
Preferim frases naturaus/conversacionaus. Maugrat qu'era diuersitat fonetica e es diferentes paraules enes frases son importantes, sajam qu'er enregistrament des frases sigue çò de mès de divertit possible. Donques, serie geniau se sajatz qu'es frases siguen tan naturaus e atractives coma sigue possible.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Mos comprometem a gestionar era vòsta informacion damb suenh. Entà mès informacion, vedetz er <privacyLink>Avertiment de privadança</privacyLink>.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Mos comprometem a gestionar era vòsta informacion damb suenh. Entà mès informacion, vedetz era <privacyLink>Politica de privadança</privacyLink>.
We promise to handle your information with care. Read more in our <privacyLink>Privacy Notice</privacyLink>.
Vos prometèm de prendre cura de vòstras donadas. Per ne saber mai podètz legir nòstra <privacyLink>Politica de confidencialitat</privacyLink>.
We’re building an open source, multi-language dataset of voices that anyone can use to train speech-enabled applications.
Cream un conjunt de donades de votzes dubèrt e multilingüe que toti pòden emplegar entà entrenar aplicacions qu’utilizen era votz.
We’re crowdsourcing an open-source dataset of voices. Donate your voice, validate the accuracy of other people’s clips, make the dataset better for everyone.
Cream de forma collaborativa un conjunt de donades de votz dubèrt. Datz era vòsta votz, validatz era precision des talhs de d’autes persones e hetz qu'eth conjunt de donades sigue milhor entà toti.
We’re experiencing unexpected downtime
Eth lòc web non ei disponible temporauments
We’re going out to get coffee.
Nam en tren tà Tarragona.
We release the dataset every 3 months.
Publicam eth conjunt de donades cada 3 mesi.
We're receiving a lot of traffic and are currently investigating the issues.
Era plataforma va fòrça cargada darrèrament, investigam es problèmes e demoram solucionar-les lèu.
We’re sorry, your platform is not currently supported.
Malerosaments, era vòsta plataforma non ei compatibla actuauments.
We've just launched a new contribution experience
Avèm ajustat una nòva interfàcia de collaboracion
We've run out of sentences to record in this language...
Non i a cap mès de frasa entà enregistrar en aguesta lengua...
We want speech models to be better at understanding a diverse range of speakers. For this to happen, a voice dataset must represent lots of different people.
Volem qu'es modèls de parlar siguen milhors entà compréner ua ampla gamma de parlants. Entà qu'aquerò passe, un conjunt de donades de votz a de representar fòrça persones diferentes.
We want the Common Voice dataset to reflect the audio quality a speech-to-text engine will hear in the wild, so we’re looking for variety. In addition to a diverse community of speakers, a dataset with varying audio quality will teach the speech-to-text engine to handle various real-world situations, from background talking to car noise. As long as your voice clip is intelligible, it should be good enough for the dataset.
Volem qu'eth conjunt de donades deth Common Voice rebate era qualitat der àudio qu’un motor de votz a tèxte entene en un entorn reau, e donques cercam varietat. Ath delà d'ua comunitat plurau de locutors, un conjunt de donades damb qualitat d’àudio variabla permeterà entrenar eth motor de votz a tèxte entà gestionar situacions diferentes deth mon reau, de votzes de hons enquiath tapatge deth coche. Mentre eth talh de votz sigue intel·ligible, aurie d’èster pro bon entath conjunt de donades.
We want the machine learning algorithms to be able to handle a variety of background noise, and even relatively loud noises can be accepted provided that they don’t prevent you from hearing the entirety of the text. Quiet background music is OK; music loud enough to prevent you from hearing each and every word is not.
Volem qu'es algoritmes d’aprendissatge automatic siguen capables de manejar diuèrsi tapatges de hons. Se pòden acceptar tapatges relativaments fòrts, tostemp que non empedisquen enténer era totalitat deth tèxte. Era musica de hons tranquilla va plan; era musica plan fòrta, qu’evite que sentetz totes es paraules, non va ben.
We will also <strong>let our community manager know that you’re interested</strong>, so they can give you any extra support or information you might need.
Tanben <strong>informaram ath nòste gestor de comunitats que i auetz interès</strong>, entà que vos pogue dar eth supòrt addicionau que sigue o informacion qu’auetz de besonh.
We will be in touch with more information about how to add your language to Common Voice very soon.
Lèu mos meteram en contacte damb mès informacion sus com ahíger era vòsta lengua ath Common Voice.
We will be in touch with more information as it becomes available.
Vos contactarem tre qu’aurem mai d’informacions disponiblas.
We will be in touch with more information as it becomes available.
Mos meteram en contacte damb mès informacion tanlèu que sigue disponibla.
We will not make your email public.
Non haram publica era vòsta adreça electronica.
We will not make your email public.
Farem pas venir vòstra adreça electronica publica.
We will review your request to remove your voice recordings from the dataset. If your request is approved, we will contact those who have downloaded the dataset and request they remove your voice recordings as well.
Revisaram era vòsta sollicitud d'eliminar es enregistraments de votz deth conjunt de donades. Se s'apròve era sollicitud, mos meteram en contacte damb qui age baishat eth conjunt de donades e les demanaram que tanben eliminen es vòsti enregistraments de votz.
What does it mean that I can’t “determine the identity” of speakers in the Common Voice dataset?
Qué vò díder que non pogui «determinar era identitat» des parlants en conjunt de donades deth Common Voice?
What is a language on Common Voice?
Qué ei ua lengua en Common Voice?
What is Common Voice?
Qu’es aquò Common Voice ?
What is Common Voice?
Qué ei eth Common Voice?
What issues are you experiencing with this sentence?
Quin problèma auetz damb aguesta frasa?
What kind of goal do you want to build?
Quin tipe d'objectiu voletz crear?
What level of audio quality is required for a voice clip to be used in the dataset?
Quin nivèu de qualitat d’àudio a d’auer un talh de votz entà poder-lo emplegar en conjunt de donades?
What’s inside the Common Voice dataset?
Qué includís eth conjunt de donades de votz?
What's Public?
Quina informacion es publica ?
What's Public?
Qué ei public?
What’s the difference between Common Voice and Deep Speech?
Quina diferéncia i a entre eth Common Voice e eth Deep Speech?
When a user rejects a voice clip it returns to the Queue. If rejected a second time, the voice clip is moved to the Clip Graveyard.
S'un usatgèr refuse un talh de votz, se torne a méter ena coa. Se se refuse un dusau còp, eth talh de votz passe ath «cementèri de talhs».
When listening, check very carefully that what has been recorded is exactly what has been written; reject if there are even minor errors. <br />Very common mistakes include:
En escotar, verificatz damb fòrça suenh que çò que s’a enregistrat ei exactaments çò que i a escrit; refusatz eth talh tanben se i a errors mendres. <br />Es errors mès abituaus son:
When will you release Common Voice data in other languages?
Quan publicaratz es donades deth Common Voice en d’autes lengües?
Where are these <wikipediaLink>public domain</wikipediaLink> sentences from?
D’a on son aguestes frases de <wikipediaLink>domeni public</wikipediaLink>?
Where does the source text come from?
D’a on provien es tèxtes?
Which variant of { $language } do you speak?
Quina varianta de { $language } parlatz?
Why ?
Perqué?
Why a profile?
Per qué un perfil?
Why a profile?
Perqué cal un perfil ?
Why a profile?
Perqué un perfil ?
Why Common Voice?
Perqué Common Voice?
Why does this matter?
Perqué ei important?
Why don’t you ask people to read from books or Wikipedia articles in different languages?
Per qué non demanatz ara gent que liege libres o articles dera Wikipèdia en diuèrses lengües?
Why do you need so many different speakers per language?
Perqué cau tanti parlants diferenti entà cada lengua?
Why is 10,000 validated hours the per language goal for capturing audio?
Perqué i é er objectiu de 10.000 ores validades d’enregistrament d’àudio entà cada lengua?
Why is Common Voice part of the Mozilla mission?
Perqué eth Common Voice hè part dera mission de Mozilla?
Why is it important?
Per qué ei important?
Why is my language not included yet?
Per qué encara non i é era mia lengua?
Why should I sign up for an account?
Perqué me calerie crear un compde?
Would you like to request your voice recordings be deleted too, or do you prefer to keep them in the Common Voice dataset?
Voletz demanar que tanben se suprimisquen es vòsti enregistraments de votz, o preferitz conservar-les en conjunt de donades deth Common Voice?
y
s
Y
S
Yes
Òc
Yes, send me emails. I’d like to stay informed about the Common Voice Project.
Òc, mandatz-me de corrièls. Vòli èsser avisat de l’actualitat del projècte Common Voice.
Yes, send me emails. I’d like to stay informed about the Common Voice Project.
Òc, enviatz-me corrèus. Voi èster informat/da deth projècte Common Voice.
Yes, send me emails. I'd like to stay informed about the progress of this language on Common Voice.
Òc, enviatz-me corrèus. Voi èster informat/da deth progrès d'aguesta lengua ath Common Voice.
Yes, send me emails. I'd like to stay informed about the progress of this language on Common Voice.
Òc-ben podètz m’escriure. Vòli èsser informat del progrés d’aquela lenga dins Common Voice.
Yes, we especially want your voice! Part of the aim of Common Voice is to gather as many different accents as possible so that voice recognition services work equally well for everyone. This means donations from non-native speakers are particularly important.
Òc, volem era vòsta votz en especiau! Part der objectiu deth Common Voice ei obtier tanti accents diferenti coma sigue possible entà qu'es servicis de reconeishença de votz foncionen parièr entà toti. Açò signifique qu'es donacions de parlants non nadius son particularaments importantes.
Yiddish
ídish
Yoruba
ioruba
You
Vos
You are about to initiate a download of <size>{ $size }GB</size>, proceed?
Ètz prèst entà iniciar ua baishada de <size>{ $size } GB</size>, voletz seguir?
You are about to initiate a download of <size>{ $size }GB</size>, proceed?
Sètz a man de telecargar un fichièr de <size>{ $size } Go</size>, volètz contunhar ?
You are prepared to initiate a download of <b>{ $size }</b>
Sò prèst entà iniciar ua baishada de <b>{ $size }</b>
You can also copy and paste the direct URLs into your favorite download manager. They will expire in 12 hours, but you can come back to this page to generate new ones any time.
Tanben podetz copiar e empegar es URL dirèctaments en vòste gestor de baishades preferit. Es ligams caducaràn en 12 ores, mès podetz tornar ad aguesta pagina entà generar-ne de naui en moment que sigue.
You can also use Keyboard Shortcuts: { sc-review-form-button-approve-shortcut } to Approve, { sc-review-form-button-reject-shortcut } to Reject, { sc-review-form-button-skip-shortcut } to Skip
Tanben podetz emplegar es dreceres de teclat: { sc-review-form-button-approve-shortcut } entà aprovar, { sc-review-form-button-reject-shortcut } entà refusar, { sc-review-form-button-skip-shortcut } entà ométer
You can choose to make your username public or anonymous.
Podetz escuélher se voletz qu'eth vòste nòm d'usatgèr sigue public o anonim.
You can go to the <datasetsPage>datasets page</datasetsPage>, select the version and language(s) you want, and download it! The files have <metadataLink>associated metadata</metadataLink>, such as demographic information and validation data. You’ll need to provide an email address to download the dataset.
Podetz anar tara <datasetsPage>pagina de conjunts de donades</datasetsPage>, seleccionar era version e es lengües que voletz e baishar-les! Es fichèrs an <metadataLink>metadades associades</metadataLink>, coma informacion demografica e donades de validacion. Auratz de balhar ua adreça electronica entà baishar eth conjunt de donades.
You can help build a diverse, open-source dataset by creating a Common Voice profile and contributing your voice.
Ajudatz a bastir un conjunt de donades liures e diuèrses en tot crear un perfil deth Common Voice e en tot collaborar damb era vòsta votz.
You can meet others in the Mozilla language communities by joining <discourseLink>Discourse</discourseLink> for topical conversations, or <matrixLink>Matrix</matrixLink> for quick advice.
Podetz conéisher d’autes persones enes comunitats lingüistiques de Mozilla en tot junher-vos a <discourseLink>Discourse</discourseLink> entà convèrses d'actualitat, o a <matrixLink>Matrix</matrixLink> entà obtier un conselh rapid.
You cannot request your recordings while another request is already in progress.
Non podetz sollicitar es vòsti enregistraments mentre i a ua auta peticion en cors.
You can request a new takeout of your recordings every { $days } days.
Podetz sollicitar ua naua exportacion des enregistraments cada { $days } dies.
You can run events to help people contribute. It’s easier than you think. You could do it online with a videoconferencing tool, or in person if it’s safe. <eventTemplate>Check out our templates and resources for running events</eventTemplate>.
Podetz organizar eveniments entà ajudar ara gent a collaborar. Ei mès facil que non pensatz. Ac podetz hèr en linha damb un estrument de videoconferéncia, o en persona s'ei segur. <eventTemplate>Consultatz es plantilhes e recorsi entà organizar eveniments</eventTemplate>.
You can use the <scraperLink>Sentence extractor</scraperLink> to leverage short sentences from Wikipedia.
Podetz emplegar eth <scraperLink>Extractor de frases</scraperLink> entà profitar frases cuertes dera Wikipèdia.
You could include the website, e.g. "Common Voice - https://commonvoice.mozilla.org/"
Podetz includir eth lòc web, per exemple, «Common Voice - https://commonvoice.mozilla.org/»
You could use academic referencing style, e.g. Harvard style "Mozilla (2021) Common Voice. Available at https://commonvoice.mozilla.org/ (Accessed: 15th September 2021)"
Podetz utilizar un estil de referéncia academica. P. ex., er estil de Harvard «Mozilla (2021) Common Voice. Disponible en https://commonvoice.mozilla.org/ (Consultat: 15 de seteme de 2021)»
You could use social media platforms to get the message out. Share posts that explain why it matters, and get in touch with other people talking about issues like language rights, voice AI, or bias in tech. See more advice on <campaignLink>running a social campaign, including content you can re-use</campaignLink>.
Podetz utilizar es plataformes de hilats sociaus entà enviar eth messatge. Compartitz publicacions qu’expliquen perqué ei importanta e metetz-vos en contacte damb d’autes persones que parlen d'ahèrs coma es drets lingüistics, era intelligéncia artificiau de votz o es tendéncies tecnologiques. Vedetz mès conselhs sus <campaignLink>com executar ua campanha sociau, includit eth contengut que podetz reutilizar</campaignLink>.
You have not added any languages yet.
Encara non auetz ahijut cap de lengua.
You have not selected any languages. Please go to your <profileLink>Profile</profileLink> to select languages.
Non auetz seleccionat cap de lengua. Anatz tath vòste <profileLink>Perfil</profileLink> entà escuélher bèra lengua.
You must allow microphone access.
Cau que permetetz er accès ath micròfon.
You previously skipped sentences while reviewing. Resetting skipped sentences will show all skipped sentences again. This is independent of the language.
Abans auetz ometut frases mentre les revisàuetz. Se restablitz es frases omeses, se vos tornaràn a mostrar. Açò ei independent dera lengua.
Your accent is the way you pronounce words. It can be shaped by where you have lived, which other languages you speak and lots of other factors. You can share any information you feel is relevant here.
Er accent ei era forma en qué prononciatz es paraules. Se pòt crear a partir des lòcs a on auetz viscut, quines autes lengües parlatz e fòrça d’auti factors. Podetz compartir ací era informacion que sigue que consideretz remercabla.
Your anonymous voice recordings will remain in the Common Voice dataset. Once you delete your profile you will no longer be able to submit a request to remove your recordings from the dataset
Es vòsti enregistraments anonims se conservaràn en conjunt de donades deth Common Voice. Un còp suprimit eth vòste perfil, ja non poderatz sollicitar d’eliminar es vòsti enregistraments deth conjunt de donades
Your avatar now includes a voice clip!
Vòstre avatar inclutz ara una mòstra de votz !
Your daily goal has been created
S'a creat eth vòste objectiu diari
Your download has started.
A començat era baishada.
You’re contributing to a target segment
Collaboratz en un segment objectiu
You’re contributing to our first target segment
Collaboratz en nòste prumèr segment objectiu
You're currently set to <bold>NOT</bold> receive emails such as goal reminders, my progress updates and newsletters about Common Voice
Actuauments, <bold>NON</bold> recebetz corrèus electronics coma rebrembes des objectius, actualizacions deth progrès personau o notícies sus eth Common Voice
You're currently set to receive emails such as goal reminders, my progress updates and newsletters about Common Voice
Actuauments, auetz configurat recéber corrèus electronics, coma rebrembes d’objectius, actualizacions deth vòste progrès e bulletins de notícies sus eth Common Voice
Your email address
Era vòsta adreça electronica
Your files are being assembled. Please check again later.
Se recuelhen es vòsti fichèrs. Tornatz-ac a verificar mès tà deuant.
Your languages:
Es vòstes lengües:
Your Languages
Es vòstes lengües
Your username and email will not be associated with the published data.
Vòstre nom d’utilizaire e adreça seràn pas associats a las donadas publicadas.
Your username and email will not be associated with the published data.
Eth vòste nòm d'usatgèr e adreça electronica non seràn associadi damb es donades publicades.
Your weekly goal has been created
S'a creat eth vòste objectiu setmanau
You should receive an email shortly explaining the steps to get your language added to Common Voice
Plan lèu auríetz de recéber un corrèu electronic en tot explicar es passi entà ahíger era vòsta lengua ath Common Voice
You've helped Common Voice reach <goalPercentage></goalPercentage> of our daily { $goalValue } recording goal!
Auetz ajudat ath Common Voice a arténher eth <goalPercentage></goalPercentage> der objectiu diari de { $goalValue } enregistraments!
You've helped Common Voice reach <goalPercentage></goalPercentage> of our daily { $goalValue } validation goal!
Auetz ajudat ath Common Voice a arténher eth <goalPercentage></goalPercentage> der objectiu diari de { $goalValue } validacions!
You've successfully signed up for contributing to { $language }. Thank you.
Vos auetz soscrit corrèctaments entà collaborar en { $language }. Mercés.
You've successfully signed up for contributing to { $language }. Thank you.
Sètz ara inscrich per contribuir en { $language }. Mercés.
Zip #{ $offset } of { $total }
ZIP num. { $offset } de { $total }