Topic modelling characterization of Mudejar art based on document titles

AbstractText mining techniques were applied to a corpus consisting in the titles of 2,454 documents on Mudejar art, a style unique to Spanish art history. Probabilistic topic modelling was used to analyse the semantic structure underlying the suite of documents studied. Two classifications were obtained, an initial, generic division into five topics followed by a second more refined division into ten. These were compared to the preliminary subject categories found for the corpus with the guidance of an area specialist.

Correcting real-word spelling errors: A new hybrid approach

AbstractSpelling correction is one of the main tasks in the field of Natural Language Processing. Contrary to common spelling errors, real-word errors cannot be detected by conventional spelling correction methods. The real-word correction model proposed by Mays, Damerau, and Mercer showed a great performance in different evaluations. In this research, however, a new hybrid approach is proposed which relies on statistical and syntactic knowledge to detect and correct real-word errors.

Evaluating multi-criteria Connection mechanisms: A new algorithm for browsing digital archives

AbstractSearching for articles of interest in a digital archive need not be through a free-form text search. In fact, many authors have suggested that the best way to find relevant items in an archive is to browse its contents rather than to search for specific keywords.

At the crossroads of digital humanities and historical lexicography: The Middle Dutch ‘seemly play (abel spel) of Winter and Summer’ as a research case

AbstractDigitization has changed the concept of dictionaries from merely alphabetically ordered reference works into lexical databases providing flexible search systems with interconnected lemmas. This article investigates ensuing opportunities and useful design options of digitized historical dictionaries as research tools for the study of texts. It appears that we have arrived at an interesting intersection of digital humanities and historical lexicography. The 14th-century ‘seemly play of Winter and Summer’ serves as a research case.

Pages

Subscribe to ADHO RSS