The contents of this card can be edited in the source repository.
    
    Dataset card
Description
Seed data in Spanish (Latin American).
License
CC-BY-SA-4.0
Attribution
@inproceedings{wmt24-seed-spanish,
    title="Spanish Corpus and Provenance with Computer-Aided Translation for the {WMT24} {OLDI} Shared Task",
    author="Jose Cols",
    booktitle = "Proceedings of the Ninth Conference on Machine Translation",
    month = nov,
    year = "2024",
    address = "Miami, USA",
    publisher = "Association for Computational Linguistics"
}
Language codes
- ISO 639-3: spa
- ISO 15924: Latn
- Glottocode: amer1254
Additional language information
- Reference dictionary: Diccionario de americanismos.
- Reference grammar: Nueva gramática de la lengua española.
Workflow
Data was translated from English by ten professional translators, all native speakers of the target language and proficient in English. 100% of the data was reviewed by a team of three translators.
Additional guidelines
- Numeral formatting: Diccionario panhispánico de dudas.