Data to Value: An 'Evaluation-First' Methodology for Natural Language Projects

Abstract

Several methodologies already exist for data mining projects, for example in the context of e-commerce (e.g. CRISP-DM, SEMMA, KDD). However, these do not account for (1) early evaluation in order to de-risk a project, (2) text corpora (“unstructured” data) and the associated natural language processing steps, or (3) non-technical considerations (e.g. legal, ethical, and project management aspects). To address these three shortcomings, a new methodology called “Data to Value” is introduced. It is guided by a detailed catalog of questions, intended to keep large-scale NLP project teams from becoming disconnected from the topic when faced with the rather abstract box-and-arrow diagrams commonly associated with methodologies.

More about this title

Title Data to Value: An 'Evaluation-First' Methodology for Natural Language Projects
Published in Proceedings of the 27th International Conference on Applications of Natural Language to Information Systems (NLDB 2022), Valencia, Spain, June 15-17, 2022
Publisher Springer Nature
Issue ---
Volume ---
ISBN 978-3-031-08472-0
Author/Editor Prof. Dr. Jochen L. Leidner
Pages 517-523
Publication date 2022-06-13
Project title ---
Citation Leidner, Jochen L. (2022): Data to Value: An 'Evaluation-First' Methodology for Natural Language Projects. Proceedings of the 27th International Conference on Applications of Natural Language to Information Systems (NLDB 2022), Valencia, Spain, June 15-17, 2022, pp. 517-523.