European Proceedings Logo

System of Automated Text Messages Clustering by Semantic Proximity Based on NLP and Machine Learning Methods

Table 2: Examples of data tokenization in KHN dataset

Original documents from the corpus Tokenized documents
Todays headlines Obama Administration Simplifies Application For Health Insurance todays headlineobama administration simplify application health insurance
Medicare falls behind with project to allow patients to get hospice and treatments to cure them at the same time medicare fall behind with project allow patient get hospice and treatment cure same time
< Back to article