Several methodologies already exist for data mining projects, for example in the context of e-commerce (e.g. CRISP-DM, SEMMA, KDD). However, these do not account for (1) early evaluation in order to de-risk a project, (2) text corpora (“unstructured” data) and the associated natural language processing steps, or (3) non-technical considerations (e.g. legal, ethical, and project management aspects). To address these three shortcomings, a new methodology called “Data to Value” is introduced. It is guided by a detailed catalog of questions, which is intended to keep large-scale NLP project teams connected to the subject matter rather than leaving them with the rather abstract box-and-arrow diagrams commonly associated with methodologies.
Title | Data to Value: An 'Evaluation-First' Methodology for Natural Language Projects |
---|---|
Published in | Proceedings of the 27th International Conference on Applications of Natural Language to Information Systems (NLDB 2022), Valencia, Spain, June 15-17, 2022 |
Publisher | Springer Nature |
Issue | --- |
Volume | --- |
ISBN | 978-3-031-08472-0 |
Author/Editor | Prof. Dr. Jochen L. Leidner |
Pages | 517-523 |
Publication date | 13 June 2022 |
Project title | --- |
Citation | Leidner, Jochen L. (2022): Data to Value: An 'Evaluation-First' Methodology for Natural Language Projects. Proceedings of the 27th International Conference on Applications of Natural Language to Information Systems (NLDB 2022), Valencia, Spain, June 15-17, 2022, pp. 517-523. |