Responsive image





What is NLP and What Trends Can We See in 2022?

Leidner, Jochen L. (2022)

Invited Talk, Joint Webinar of the CFA organization (French Chapter) and the London Stock Exchange Group (LSEG), London/Paris/online, November 18, 2022.


mehr

The University of Sheffield at CheckThat! 2020: Claim Identification and Verification on Twitter

McDonald, Thomas; Dong, Ziqing; Zhang, Yingji; Hampson, Rebekah; Young, James...

Cross Language Evaluation Forum (CLEF) Working Notes 2020: Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, September 22-25, 2020. 2696, 162.


Open Access Peer Reviewed
mehr

Data to Value: An 'Evaluation-First' Methodology for Natural Language Projects

Leidner, Jochen L. (2022)

Proceedings of the 27th International Conference on Applications of Natural Language to Information Systems (NLDB 2022), Valencia, Spain, June 15-17, 2022, 517-523.


Peer Reviewed
 

While for data mining projects (for example in the context of e-commerce) some methodologies have already been developed (e.g. CRISP-DM, SEMMA, KDD), these do not account for (1) early evaluation in order to de-risk a project (2) dealing with text corpora (“unstructured” data) and associated natural language processing processes, and (3) non-technical considerations (e.g. legal, ethical, project management aspects). To address these three shortcomings, a new methodology, called “Data to Value”, is introduced, which is guided by a detailed catalog of questions in order to avoid a disconnect of large-scale NLP project teams with the topic when facing rather abstract box-and-arrow diagrams commonly associated with methodologies.

mehr

Literatur von Maschinen: Was kann künstliche Intelligenz?

Holtorf, Christian; Leidner, Jochen L. (2022)

Rahmen der Literaturtage „Coburg liest“ 2022. 2022.



Detecting Environmental, Social and Governance (ESG) Topics Using Domain-Specific Language Models and Data Augmentation

Nugent, Tim; Stelea, Nicole; Leidner, Jochen L. (2021)

Proceedings of the 14th International Conference on Flexible Query Answering Systems (FQAS 2021), Bratislava, Slovakia, September 19–24, 2021, 157-169.
DOI: 10.1007/978-3-030-86967-0_12


Peer Reviewed
 

Despite recent advances in deep learning-based language modelling, many natural language processing (NLP) tasks in the financial domain remain challenging due to the paucity of appropriately labelled data. Other issues that can limit task performance are differences in word distribution between the general corpora – typically used to pre-train language models – and financial corpora, which often exhibit specialized language and symbology. Here, we investigate two approaches that can help to mitigate these issues. Firstly, we experiment with further language model pre-training using large amounts of in-domain data from business and financial news. We then apply augmentation approaches to increase the size of our data-set for model fine-tuning. We report our findings on an Environmental, Social and Governance (ESG) controversies data-set and demonstrate that both approaches are beneficial to accuracy in classification tasks.

mehr

A Survey of Textual Data & Geospatial Technology

Leidner, Jochen L. (2021)

Handbook of Big Geospatial Data, 429–457.
DOI: 10.1007/978-3-030-55462-0_16


mehr

Text Meets Space: Geographic Content Extraction, Resolution and Information Retrieval

Leidner, Jochen L.; Martins, Bruno; McDonough, Katherine; Purves, Ross S. (2020)

Proceedings of the 42nd European Conference on Information Retrieval Research (ECIR 2020), Lisbon, Portugal, April 14–17, 2020 II, 669-673.
DOI: 10.1007/978-3-030-45442-5_89


Peer Reviewed
 

In this half-day tutorial, we will review the basic concepts of, methods for, and applications of geographic information retrieval, also showing some possible applications in fields such as the digital humanities. The tutorial is organized in four parts. First we introduce some basic ideas about geography, and demonstrate why text is a powerful way of exploring relevant questions. We then introduce a basic end-to-end pipeline discussing geographic information in documents, spatial and multi-dimensional indexing [19], and spatial retrieval and spatial filtering. After showing a range of possible applications, we conclude with suggestions for future work in the area.

mehr

Prof. Dr. Jochen L. Leidner


Hochschule Coburg

Fakultät Wirtschaftswissenschaften (FW)

T +49 9561 317 422
Jochen.Leidner[at]hs-coburg.de