Journal

Measuring diachronic language distance using perplexity. Application to English, Portuguese and Spanish.

The objective of this work is to set a corpus-driven methodology to quantify automatically diachronic language distance
between chronological periods of several languages. We apply a perplexity-based measure to written text representing
different historical periods of three languages: European English, European Portuguese and European Spanish. For this
purpose, we have built historical corpora for each period, which have been compiled from different open corpus sources

Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool

Lately, discourse structure has received considerable attention due to the benefits carried out by its application in several NLP task such as opinion mining, summarization, question answering, text simplification, among others.

Pages

Subscribe to RSS - Journal