Responsive image





Detecting Environmental, Social and Governance (ESG) Topics Using Domain-Specific Language Models and Data Augmentation

Nugent, Tim; Stelea, Nicole; Leidner, Jochen L. (2021)

Proceedings of the 14th International Conference on Flexible Query Answering Systems (FQAS 2021), Bratislava, Slovakia, September 19–24, 2021, S. 157-169.
DOI: 10.1007/978-3-030-86967-0_12


Peer Reviewed
 

Despite recent advances in deep learning-based language modelling, many natural language processing (NLP) tasks in the financial domain remain challenging due to the paucity of appropriately labelled data. Other issues that can limit task performance are differences in word distribution between the general corpora – typically used to pre-train language models – and financial corpora, which often exhibit specialized language and symbology. Here, we investigate two approaches that can help to mitigate these issues. Firstly, we experiment with further language model pre-training using large amounts of in-domain data from business and financial news. We then apply augmentation approaches to increase the size of our data-set for model fine-tuning. We report our findings on an Environmental, Social and Governance (ESG) controversies data-set and demonstrate that both approaches are beneficial to accuracy in classification tasks.

mehr

A Survey of Textual Data & Geospatial Technology

Leidner, Jochen L. (2021)

Handbook of Big Geospatial Data, S. 429–457.
DOI: 10.1007/978-3-030-55462-0_16


mehr

Text Meets Space: Geographic Content Extraction, Resolution and Information Retrieval

Leidner, Jochen L.; Martins, Bruno; McDonough, Katherine; Purves, Ross S. (2020)

Proceedings of the 42nd European Conference on Information Retrieval Research (ECIR 2020), Lisbon, Portugal, April 14–17, 2020 II, S. 669-673.
DOI: 10.1007/978-3-030-45442-5_89


Peer Reviewed
 

In this half-day tutorial, we will review the basic concepts of, methods for, and applications of geographic information retrieval, also showing some possible applications in fields such as the digital humanities. The tutorial is organized in four parts. First we introduce some basic ideas about geography, and demonstrate why text is a powerful way of exploring relevant questions. We then introduce a basic end-to-end pipeline discussing geographic information in documents, spatial and multi-dimensional indexing [19], and spatial retrieval and spatial filtering. After showing a range of possible applications, we conclude with suggestions for future work in the area.

mehr

Prof. Dr. Jochen L. Leidner


Hochschule Coburg

Fakultät Wirtschaftswissenschaften (FW)

T +49 9561 317 422
Jochen.Leidner[at]hs-coburg.de