CommonsCaptions Corpora

Short description: 
Wikimedia Commons-eko irudien azalpen elebidunak. es-eu eta en-ga // Bilingual captions of images from Wikimedia Commons, en-ga and es-eu
Authors (IXA members): 
Authors (no IXA members): 
Alberto Poncelas, Meghan Dowling
Description: 
[eu]
Wikimedia Commons-eko irudietatik jasotako azalpen elebidinak (captions), 2018ko apirilean.
- CommonsCaptions_es_eu corpusa:
3560 irudik euskaraz eta espainieraz dituzten azalpen elebidunak.
- CommonsCaptions_en_ga corpusa:
434 irudik gaelikoz eta eta espingelesez dituzten azalpen elebidunak.
TaggedCommonsCaptions_es_eu corpusa:
CommonsCaptions_es_eu corpusa bezalakoa da baina irudi bakoitzari etiketa hauetako bat gehitu zaio: (Person, HumanGroup, Place/Location, Institution, Building, AnimalPlant, Event/sport, History, Map/Icon, Culture, and Others)
---------------------------------------------------------------------------------------------------------------------------------------------------------------------
[en]
Collection of bilingual captions from Wikimedia Commons images
- CommonsCaptions_es_eu corpus:
Billingual captions in Spanish and in Basque collected from 3560 images in Wikimedia Commons.
- CommonsCaptions_en_ga corpus:
Billingual captions in English and in Irish collected from 434 images in Wikimedia Commons.
- TaggedCommonsCaptions_es_eu corpus:
The same as CommonsCaptions_es_eu corpus: but adding to each of the images a tag to distinguish 11 different kinds of image (Person, HumanGroup, Place/Location, Institution, Building, AnimalPlant, Event/sport, History, Map/Icon, Culture, and Others).
Ownership: 
Wikimedia Commons
License: 
CC-BY-SA 4.0