![]() |
个人信息Personal Information
教授
博士生导师
硕士生导师
任职 : 智能计算教研室主任
性别:男
毕业院校:吉林大学
学位:博士
所在单位:计算机科学与技术学院
学科:计算机应用技术. 计算机软件与理论
办公地点:创新园大厦A820
联系方式:13304609362
电子邮箱:lucos@dlut.edu.cn
论文成果
当前位置: 姚念民欢迎报考硕博士 >> 科学研究 >> 论文成果Big Data Analytics, Text Mining and Modern English Language
点击次数:
论文类型:期刊论文
发表时间:2019-06-01
发表刊物:JOURNAL OF GRID COMPUTING
收录刊物:SCIE、EI、SSCI
卷号:17
期号:2,SI
页面范围:357-366
ISSN号:1570-7873
关键字:Text mining; TF-IDF; English language; Speed of linguistic changes
摘要:The modern English Language took centuries to convert from old English. The word hath' of old English for example, has taken centuries to become have' in the modern English Language. If these changes had not been occurred there would not have been the possibility of modern words. A text written in fifteen century can be difficult to read and if we go back a couple of more centuries, it would be like reading a different language. In this paper, we have used the text mining techniques to analyze the old and modern English languages. We have introduced the Common-Words Counting algorithm that identifies common words of 15(th) century that diminishes gradually in the later centuries. We computed the speed of linguistic changes and identified the reasons behind them. For this purpose, 34000 text books were downloaded from Project Gutenberg of different authors, between 15(th) to 19(th) centuries. These books were categorized into five centuries in the range from 15(th) to 19(th) centuries. We selected most common words from the books of 15(th) century and calculated their frequencies in other centuries. We calculated the sum of Term Frequency-Inverse Document Frequency (TF-IDF) of these words and proved that frequencies of words were decreasing from 15(th) century to 19(th) century with some words even disappeared in other centuries, such as doth', hath', punt, guise and selfe'. We calculated the speed of changing of words using the slope formula. We proved that the words were changing during each century with the speed of changing of words being the lowest during 16(th) - 17(th) centuries and the highest during 18(th) - 19(th) centuries which shows that the old words or their spellings were changed to the modern words during 18(th) - 19(th) centuries. The industrialization, modernization, and British Empire invasion were the key factors, which changed the old English language into modern English language.