Articles from Digital Scholarship in the Humanities (formerly LLC)

Do language combinations affect translators’ stylistic visibility in translated texts?

AbstractThe question of translators’ stylistic visibility in translated texts has been a recurring theme in translation studies. Recently, the employment of state-of-the-art stylometric methods such as multivariate statistical analysis or machine learning techniques has enabled important progress to be made in exploring the problem. Nevertheless, studies are conflicting in their findings. Some find evidence of translators’ stylistic presence, while others fail to do so.

Unsupervised identification of text reuse in early Chinese literature

AbstractText reuse in early Chinese transmitted texts is extensive and widespread, often reflecting complex textual histories involving repeated transcription, compilation, and editing spanning many centuries and involving the work of multiple authors and editors. In this study, a fully automated method of identifying and representing complex text reuse patterns is presented, and the results evaluated by comparison to a manually compiled reference work.

Visual meta-data in qualitative analysis

AbstractDr Anne Luther is a researcher, curator, and software developer whose work examines the contemporary art market and data visualization in qualitative research. She received her PhD from Central Saint Martins College of Art and Design, London, and is currently a researcher at the Department for Modern Art History at the Institute of Art Studies and Historical Urban Studies at TU Berlin and at The Center for Data Arts at The New School in New York.

Mining and discovery of hidden relationships between software source codes and related textual documents

AbstractNormally, software documentations are produced, informally. They are written in the unnatural and non-structural form of the language, such as user manuals, user requirements, design documentation, tutorials, support documentation, and so on. Recent studies show that 61% of software projects are subject to failure or challenges due to an increase in the costs and production time. Various factors may lead to this issue, and one of the major contributing factors is the lack of links between the software's source code and its related documents.

Stylometry approaching Parnassus

AbstractThe article gives a brief outline of the fruitless attempts in the past at finding the author(s) of the Parnassus Plays, an anonymous trilogy performed between 1597 and 1601 at St. John’s College in Cambridge as part of the Christmas festivities, and then moves on to employ the R Stylo features Rolling Delta and Rolling Classify, which in various approaches confirm John Marston and Thomas Nashe as the authors.

Topic modelling characterization of Mudejar art based on document titles

AbstractText mining techniques were applied to a corpus consisting in the titles of 2,454 documents on Mudejar art, a style unique to Spanish art history. Probabilistic topic modelling was used to analyse the semantic structure underlying the suite of documents studied. Two classifications were obtained, an initial, generic division into five topics followed by a second more refined division into ten. These were compared to the preliminary subject categories found for the corpus with the guidance of an area specialist.

Correcting real-word spelling errors: A new hybrid approach

AbstractSpelling correction is one of the main tasks in the field of Natural Language Processing. Contrary to common spelling errors, real-word errors cannot be detected by conventional spelling correction methods. The real-word correction model proposed by Mays, Damerau, and Mercer showed a great performance in different evaluations. In this research, however, a new hybrid approach is proposed which relies on statistical and syntactic knowledge to detect and correct real-word errors.

Evaluating multi-criteria Connection mechanisms: A new algorithm for browsing digital archives

AbstractSearching for articles of interest in a digital archive need not be through a free-form text search. In fact, many authors have suggested that the best way to find relevant items in an archive is to browse its contents rather than to search for specific keywords.

At the crossroads of digital humanities and historical lexicography: The Middle Dutch ‘seemly play (abel spel) of Winter and Summer’ as a research case

AbstractDigitization has changed the concept of dictionaries from merely alphabetically ordered reference works into lexical databases providing flexible search systems with interconnected lemmas. This article investigates ensuing opportunities and useful design options of digitized historical dictionaries as research tools for the study of texts. It appears that we have arrived at an interesting intersection of digital humanities and historical lexicography. The 14th-century ‘seemly play of Winter and Summer’ serves as a research case.

EVI-LINHD, a virtual research environment for the Spanish-speaking community

AbstractLaboratorio de Innovación en Humanidades Digitales (UNED) has developed Entorno Virtual de Investigación del Laboratorio de Innovación en Humanidades Digitales (EVI-LINHD), the first virtual research environment devoted mainly to Spanish speakers interested in digital scholarly edition. EVI-LINHD combines different open-source software for developing a complete digital project: (1) a Web-based application markup tool—TEIscribe—combined with an eXistdb solution and a TEIPublisher platform, (2) Omeka for digital libraries, and (3) WordPress for simple Web pages.