Text Analytics And NLP
- Concepts
- Computer Science And IT
- BlockChain
- Medical Informatics
- Multimedia Computing
- Digital Currencies – Bitcoin and cryptocurrencies
- Context-Aware Search System (CASS)
- Big Data
- Industrial Internet of things (IIoT)
- Assisted reality and virtual reality
- Deep learning - Artificial Intelligence and Machine Learning
- Cryptography
- Embedded System
- Databases and Data mining
- Computer Vision
- Wireless Body Area Network (WBAN)
- Computer Graphics and Visualization
- Operating Systems
- Data Privacy
- Programming Languages and Systems
- Scientific And Numerical computing
- Cyber-Security And AI
- Softwre Engineering
- Natural language Generation
- Producing Text From Computer Data
- AI Optimized Hardware
- Decision Management
- Deep Learning Platforms
- Pytorch
- Biometrics
- Robotic Process Automation
- Text Analytics And NLP
Text Analytics And NLP
Considering the existence of large volume of data on the web, it has been always seen as an issue that the search engines sometimes provide related data but it can also not be denied that sometimes the data given is not related to the search made by user. Thus it can be asserted that it often becomes difficult to access the related data and since the data available on web is mostly unstructured, it is time-consuming to retrieve the related data (Ergün, 2017). In order to optimize the process of accessing data and processing there have been proposed or implemented a large number of technologies. Some of the key technologies are natural language processing (NLP), text analytics, speech recognition etc.
Natural language processing can be defined as an important area of computer science, artificial intelligence and computation linguistics which are associated with processing the human languages. While NLP aims at processing the text, text analytics is associated with retrieving or obtaining relevant information from the searched or unorganized text. Thus, text analytics can be defined as a type of data analysis enabled using NLP methodologies.
The evolution of NLP took place several years ago and it includes several methods for processing text (Kao & Poteet, 2005). Some of these methods or approaches are:
Stemming: This approach aims at removing suffixes from the text.
Lemmatization: This approach aims at substituting the inflected word.
Multi-word phase grouping, etc.
NLP can be asserted as a division of artificial intelligence which is based on algorithms for automatic representation and processing of different forms of natural language, consequently facilitating communication with computer and humans (Kalyanathaya, Akila, & Rajesh, 2019).
Contrary to NLP, text analytics has been in trend recently. It is associated with the techniques that are employed in the field of text mining, information retrieval etc. Predominantly, the objective of text analytics is to obtain certain relevant text from the large volume of irrelevant text available across the web. The most common form of text analytics is text retrieval which is generally performed by the search engines. In addition the text analytics is also associated with classifying text as per certain defined categories. The text retrieval tasks generally employ techniques of NLP discussed previously. It is related with identifying certain defined patterns in natural language texts. In order to analyze these natural language texts, there are used several techniques, as depicted in the works performed for natural language processing (Jurafsky & Martin, 2008).
It has been reported by some researchers that text analytics is one of the simplest means to obtain relevant data from the large volumes of data is text analytics (Pejić Bach, Krstić, Seljan, & Turulja, 2019). Obtaining unrelated data from the web and converting it into relevant information in order to enable decision-making process (Pejić Bach et al., 2019).
Though applied on unstructured text, the primary function is to arrange or structure text in order to analyze the text. Similarly, the authors (Liu, Fei, Hou, Zhang, & Sun, 2007) asserted that the approach aims at extracting patterns from different text documents. With the similar motive, the authors (Yehia1, Ibrahim, & Abulkhair, 2016). Reported about certain techniques such as classification, information extraction, clustering of topics etc, sentiment analysis (Nakayama & Wan, 2017; Schumaker, Zhang, & Chun Neng Huang, 2012), keyword extraction (Ong, Chen, Sung, & Zhu, 2015), NLP (Klopotan, Zoroja, & Meško, 2018) etc.
Employed text mining for extracting topics using content analysis of e-commerce organizations while the authors (Reyes-Menendez, Saura, & Alvarez-Alonso, 2018) used it for social media and categorizing them as per the sentiments.Recently, the vast amount of data is termed under a common term, Big Data. This data is usually unstructured and is associated with commercial domain (Marriott, 2015). This has attracted the attention of researchers towards text analytics. It has been described as an extended version of data mining by some researchers since it performs the task of retrieving related task from the unrelated sources of text.
It is also known as Intelligent Text Analysis since it intelligently extracts the structured data from the unstructured one as desired or searched by the user. Thus, text analytics can be defined as a wide field which is associated with obtaining information from unstructured or even semi-structured data sets. Some of the examples of unstructured data sets are emails, blogs, articles etc. It is also noteworthy that in text analytics exhibits similarity with data mining in almost all cases. However, there exists a difference that while text analytics tools extract structured information from unstructured one, the data mining tools enable handling structured data from databases.
References:
Aguirre, S. & Rodriguez, A. (2017). Automation of a Business Process Using Robotic Process Automation (RPA): A Case Study. In: [Online]. pp. 65–71. Available from: http://link.springer.com/10.1007/978-3-319-66963-2_7.
Asatiani, A. & Penttinen, E. (2016). Turning robotic process automation into commercial success – Case OpusCapita. Journal of Information Technology Teaching Cases. [Online]. 6 (2). pp. 67–74. Available from: http://journals.sagepub.com/doi/10.1057/jittc.2016.5.
Ergün, M. (2017). Using the Techniques of Data Mining and Text Mining in Educational Research Eği̇ti̇m Araştirmalarinda Data Mining Ve Text MininTekni̇kleri̇nden Yararlanma. (6). pp. 12180–189.
Frank, C. (2015). Introduction To Robotic Process Automation. Institute for Robotic Process and Automation. [Online]. pp. 35. Available from: https://irpaai.com/wp-content/uploads/2015/05/Robotic-Process-Automation-June2015.pdf.
Ghosh, G. (2018). Automation with RPA (Robotic Process Automation). International Journal of Computer Sciences and Engineering. [Online]. 6 (8). pp. 475–477. Available from: http://www.ijcseonline.org/full_paper_view.php?paper_id=2718.
Jurafsky, D. & Martin, J. (2008). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. [Online]. Available from: https://www.researchgate.net/publication/200111340_Speech_and_Language_Processing_An_Introduction_to_Natural_Language_Processing_Computational_Linguistics_and_Speech_Recognition/citation/download.
Kalyanathaya, K.P., Akila, D. & Rajesh, P. (2019). Advances in natural language processing –a survey of current research trends, development tools and industry applications. International Journal of Recent Technology and Engineering. [Online]. 7 (5). pp. 199–201. Available from: https://www.ijrte.org/wp-content/uploads/papers/v7i5c/E10480275C19.pdf.
Kao, A. & Poteet, S. (2005). Text mining and natural language processing. ACM SIGKDD Explorations Newsletter. [Online]. 7 (1). pp. 1–2. Available from: http://portal.acm.org/citation.cfm?doid=1089815.1089816.
Klopotan, I., Zoroja, J. & Meško, M. (2018). Early warning system in business, finance, and economics. International Journal of Engineering Business Management. [Online]. 10. pp. 184797901879701. Available from: http://journals.sagepub.com/doi/10.1177/1847979018797013.
Liu, D., Fei, S., Hou, Z.-G., Zhang, H. & Sun, C. (2007). Advances in Neural Networks. Lecture Notes in Computer Science. D. Liu, S. Fei, Z. Hou, H. Zhang, & C. Sun (eds.). [Online]. Berlin, Heidelberg: Springer Berlin Heidelberg. Available from: http://link.springer.com/10.1007/978-3-540-72395-0.
Madakam, S., M. Holmukhe, R. & Kumar Jaiswal, D. (2019). The Future Digital Work Force: Robotic Process Automation (RPA). Journal of Information Systems and Technology Management. [Online]. 16. Available from: http://www.jistem.tecsi.org/index.php/jistem/article/view/3077.
Marriott (2015). 2015 Annual Report. Merriott 2015 Annual Report. [Online]. pp. 23. Available from: http://hugin.info/131/R/2004940/740409.pdf.
Nakayama, M. & Wan, Y. (2017). Exploratory Study on Anchoring: Fake Vote Counts in Consumer Reviews Affect Judgments of Information Quality. Journal of theoretical and applied electronic commerce research. [Online]. 12 (1). pp. 1–20. Available from: http://www.scielo.cl/scielo.php?script=sci_arttext&pid=S0718-18762017000100002&lng=en&nrm=iso&tlng=en.
Naveen Reddy, K.P., Harichandana, U., Alekhya, T. & S. M., R. (2019). A Study of Robotic Process Automation Among Artificial Intelligence. International Journal of Scientific and Research Publications (IJSRP). [Online]. 9 (2). pp. p8651. Available from: http://www.ijsrp.org/research-paper-0219.php?rp=P868266.
Ong, T.H., Chen, H., Sung, W.K. & Zhu, B. (2015). Newsmap: A knowledge map for online news. Decision Support Systems. [Online]. 39 (4). pp. 583–597. Available from: https://arizona.pure.elsevier.com/en/publications/newsmap-a-knowledge-map-for-online-news.
Pejić Bach, M., Krstić, Ž., Seljan, S. & Turulja, L. (2019). Text Mining for Big Data Analysis in Financial Sector: A Literature Review. Sustainability. [Online]. 11 (5). pp. 1277. Available from: https://www.mdpi.com/2071-1050/11/5/1277.
Penttinen, E., Kasslin, H. & Asatiani, A. (2018). How to Choose between Robotic Process Automation and Back-End System Automation? [Online]. Available from: https://www.researchgate.net/publication/324918928_How_to_Choose_between_Robotic_Process_Automation_and_Back-End_System_Automation/citation/download.
Reyes-Menendez, A., Saura, J.R. & Alvarez-Alonso, C. (2018). Understanding #WorldEnvironmentDay User Opinions in Twitter: A Topic-Based Sentiment Analysis Approach. International journal of environmental research and public health. [Online]. 15 (11). Available from: http://www.ncbi.nlm.nih.gov/pubmed/30428520.
Schumaker, R.P., Zhang, Y. & Chun Neng Huang, H.C. (2012). Evaluating sentiment in financial news articles. Decision Support Systems. [Online]. 53 (3). Available from: https://nau.pure.elsevier.com/en/publications/evaluating-sentiment-in-financial-news-articles.
Siebeck, R. & Oulabi, A. (2012). Besitzt das Adrenalinpräparat Eppy pupillomotorische Eigenschaften? Klinische Monatsblatter fur Augenheilkunde. [Online]. 152 (5). pp. 705–709. Available from: https://www.horsesforsources.com/wp-content/uploads/2016/06/RS-1210_Robotic-automation-emerges-as-a-threat-060516.pdf.
Willcocks, L.P. & Lacity, M.C. (2016a). A New Approach to Automating Services. MIT Sloan Management Review. [Online]. 58 (1). pp. 40–49. Available from: http://eprints.lse.ac.uk/68135/1/Willcocks_New approach_2016.pdf.
Willcocks, L.P. & Lacity, M.C. (2016b). Robotic process automation at telefónica O2. MIS Quarterly Executive. [Online]. 15 (1). pp. 21–35. Available from: http://www.scopus.com/inward/record.url?eid=2-s2.0-84964632089&partnerID=tZOtx3y1.
Yehia1, A.M., Ibrahim, L.F. & Abulkhair, M.F. (2016). Text Mining and Knowledge Discovery from Big Data: Challenges and Promise. International Journal of Computer Science Issues. [Online]. 13 (3). pp. 54–61. Available from: http://ijcsi.org/contents.php?volume=13&&issue=3.

