Skip to search formSkip to main contentSkip to account menu
DOI:10.1145/3640018 - Corpus ID: 266873607
@article{KapugamaGeeganage2024Text2ELEG, title={Text2EL+: Expert Guided Event Log Enrichment Using Unstructured Text}, author={Dakshi Tharanga Kapugama Geeganage and Moe Thandar Wynn and Arthur H. M. ter Hofstede}, journal={ACM Journal of Data and Information Quality}, year={2024}, volume={16}, pages={1 - 28}, url={https://api.semanticscholar.org/CorpusID:266873607}}
- D. K. Kapugama Geeganage, M. Wynn, A. T. ter Hofstede
- Published in ACM Journal of Data and… 10 January 2024
- Computer Science, Business
Text2EL+ is introduced, a three-phase approach to enrich event logs using unstructured text that applies natural language processing techniques, sentence embeddings, training pipelines and models, as well as contextual and expression validation to improve data quality.
Figures and Tables from this paper
- table 1
- figure 1
- table 2
- figure 2
- table 3
- figure 3
- table 4
- figure 5
- table 5
- figure 6
- table 6
- figure 7
- table 7
- figure 8
- table 8
- figure 9
- table 9
- figure 10
- table 10
- figure 11
- figure 12
- table 12
- table 14
- figure 14
Topics
Unstructured Text (opens in a new tab)Event Logs (opens in a new tab)Named Entity Recognition (opens in a new tab)Data Quality (opens in a new tab)Process Mining (opens in a new tab)Sentence Embeddings (opens in a new tab)
58 References
- D. T. K. GeeganageM. WynnA. Hofstede
- 2022
Computer Science, Business
2022 16th International Conference on Signal…
Text2EL is introduced, a two-phase event log enrichment approach based on unstructured text that applies techniques from natural language processing, sentence embeddings, and contextual and expression validation before enriching the event log.
- 1
- Adrian RebmannHan van der Aa
- 2021
Computer Science
CAiSE
This paper combines the analysis of textual attribute values, based on a state-of-the-art language model, with a novel attribute classification technique, and extracts information about up to eight semantic roles per event through so-called semantic role labeling of event data.
- 20 [PDF]
- Rolf B. BänzigerA. BasukoskiT. Chaussalet
- 2018
Computer Science, Business
2018 IEEE 20th International Conference on High…
This paper proposes an original framework to mine processes from CRM data, by leveraging the unstructured part of the data, using Latent Dirichlet Allocation (LDA), an unsupervised machine learning technique, to automatically detect and assign labels to activities.
- 18
- Highly Influential
- PDF
- A. GhoseGeorge KoliadisA. Chueng
- 2007
Business, Computer Science
2007 IEEE Congress on Services (Services 2007)
The Rapid Business Process Discovery (R-BPD) framework and prototype tool is proposed that can query heterogeneous information resources and rapidly construct prototype models to be incrementally adjusted to correctness by an analyst, which will increase analyst productivity by significant orders of magnitude over traditional approaches.
- 84
- PDF
- Elena V. EpurePatricia Martín-RodillaCharlotte HugR. DeneckèreC. Salinesi
- 2015
Computer Science, Medicine
2015 IEEE 9th International Conference on…
A novel unsupervised technique TextProcessMiner is created that discovers process instance models from unstructured, text-based process traces using natural language processing with a focus on the verb semantics and emerged as domain independent.
- 42
- PDF
- Christoph KechtAndreas EggerWolfgang KratschMaximilian Röglinger
- 2021
Computer Science, Business
2021 3rd International Conference on Process…
An approach that utilizes NLI to derive topics and process activities from customer service conversations and that represents them in a standardized XES event log is developed and shows that NLI helps construct event logs of high accuracy for process mining purposes.
- 9
- Highly Influential
- PDF
- Monika GuptaPrerna AgarwalTarun TaterSampath DechuAlexander Serebrenik
- 2020
Computer Science
Business Process Management Workshops
This work proposes to extract topical phrases (keyphrases) from the unstructured data using an un-supervised graph-based approach and captures underlying process interactions which allows to understand in detail the process realities and identify opportunities for improvement.
- 5
- Highly Influential
- PDF
- Maximilian RieferS. TernisTom Thaler
- 2016
Computer Science, Business
An overview of the current state-of-the-art in text-to-model mining from natural language text is given, which covers both advantages and disadvantages of current techniques.
- 28
- PDF
- Han van der AaClaudio Di CiccioH. LeopoldH. Reijers
- 2019
Computer Science
CAiSE
This paper presents the first automated approach for the extraction of declarative process models from natural language techniques that identify activities and their inter-relations from textual constraint descriptions and provides automated support for an otherwise tedious and complex manual endeavor.
- 56
- PDF
- Sareh SadeghianaslA. HofstedeS. SuriadiSelen Turkay
- 2020
Computer Science
2020 2nd International Conference on Process…
This paper proposes a gamified crowdsourcing approach to the detection and repair of problematic activity labels, namely those with identical semantics but different syntax, which shows promising results in terms of quality improvements achieved.
- 13
- PDF
...
...
Related Papers
Showing 1 through 3 of 0 Related Papers
Fig. 11. Number of notes by category.
Published in ACM Journal of Data and Information Quality 2024
Text2EL+: Expert Guided Event Log Enrichment Using Unstructured Text
D. K. Kapugama GeeganageM. WynnA. T. ter Hofstede
Figure 20 of 24