Technological Trends of Natural Language Processing Based Semantic Analysis: A Comparative Study of the US, the EU, and Korea Patents Data

Citation Author(s): Young Geun Hyun, Jindeuk Ko, Jeong Hyeon Han
Submitted by: Young Hyun
Last updated: Thu, 03/26/2020 - 07:37
DOI: 10.21227/wqem-sj55
Abstract: 

The age of Artificial Intelligence (AI) is coming. Since Natural Language Processing (NLP) is a core AI technology for communication between humans and devices, it is vital to understand its technological trends. Early NLP research focused on syntactic processing, such as information extraction and subject modeling, but later developed into semantically oriented analysis. To analyze technological trends in NLP, especially semantic analysis, this study analyzes patent data, which contains objective and extensive information. The procedure uses text mining to collect patent information, followed by pre-processing and analysis of keyword frequency, keyword networks, and time series. The results reveal that the direction of technological development differs among countries, as the core keywords differ in frequency and centrality. In addition, the time series analysis of five intervals over 20 years identifies twelve keywords with a rising or falling trend in the US, seven in the EU, and five in Korea. The greater number of such keywords suggests that the US underwent more technological progress than the EU and Korea. Moreover, based on keyword similarity over time, the technical linkage between the US and the EU is presumed to be stronger than that between the US and Korea. The results of this study can serve as a reference for future technical predictions related to NLP.
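The time series step described in the abstract can be illustrated with a minimal sketch. It is not the authors' program: the four-year interval width, the monotonic rule used to label a keyword as rising or falling, and the use of Jaccard overlap as the keyword similarity measure are all assumptions made for illustration, as are the function names.

```python
def interval_index(year, start_year, span=4, n_intervals=5):
    """Map a year to one of n_intervals consecutive buckets, or None if out of range."""
    idx = (year - start_year) // span
    return idx if 0 <= idx < n_intervals else None


def keyword_series(records, start_year, span=4, n_intervals=5):
    """Count, per interval, how many patents mention each keyword.

    records: iterable of (year, keywords) pairs (hypothetical input shape).
    Returns {keyword: [count for interval 0, ..., count for interval n-1]}.
    """
    series = {}
    for year, keywords in records:
        idx = interval_index(year, start_year, span, n_intervals)
        if idx is None:
            continue  # patent falls outside the studied period
        for kw in set(keywords):  # count each keyword once per patent
            series.setdefault(kw, [0] * n_intervals)[idx] += 1
    return series


def trend(counts):
    """Label a series 'rising', 'falling', or 'mixed' (assumed rule: monotonic change)."""
    diffs = [b - a for a, b in zip(counts, counts[1:])]
    if all(d >= 0 for d in diffs) and any(d > 0 for d in diffs):
        return "rising"
    if all(d <= 0 for d in diffs) and any(d < 0 for d in diffs):
        return "falling"
    return "mixed"


def jaccard(a, b):
    """Jaccard overlap of two keyword collections (assumed similarity measure)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0
```

With a 20-year window starting at a given year, `keyword_series` yields five counts per keyword, `trend` labels each series, and `jaccard` can compare the top keywords of two regions within the same interval.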

Instructions: 

The dataset is the raw data used in the thesis. The abstracts were extracted from the patent information with a Python program, and the CSV file contains the extracted abstracts.
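As a rough sketch of how such a CSV file might be read, the snippet below uses Python's standard `csv` module. The column names `year` and `abstract` are hypothetical, since the actual layout of the dataset file is not described here, and the sample rows are invented for demonstration only.

```python
import csv
import io


def read_abstracts(handle, text_col="abstract", year_col="year"):
    """Yield (year, abstract) pairs from a CSV file-like object.

    Column names are assumptions; rows missing either field are skipped.
    """
    for row in csv.DictReader(handle):
        text = (row.get(text_col) or "").strip()
        year = (row.get(year_col) or "").strip()
        if text and year.isdigit():
            yield int(year), text


# Invented sample data standing in for the real file.
sample = io.StringIO(
    "year,abstract\n"
    "2015,A method for semantic parsing of text.\n"
    ",Row without a year is skipped.\n"
)
rows = list(read_abstracts(sample))
```

For a real file, the same function can be given a handle opened with `open(path, newline="", encoding="utf-8")`.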

The Python programs cover abstract extraction, keyword network extraction, and time series analysis, with separate versions for Korean and English.
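The keyword frequency and keyword network steps can be sketched as follows. This is an illustrative outline under stated assumptions, not the authors' code: the regular-expression tokenizer handles English only (Korean text would need a morphological analyzer), the stopword list is a placeholder, and degree centrality is used as one possible centrality measure.

```python
import re
from collections import Counter
from itertools import combinations

# Placeholder stopword list, for illustration only.
STOPWORDS = {"a", "an", "the", "of", "for", "and", "to", "in", "is"}


def tokenize(text):
    """Lowercase an English abstract and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def keyword_frequency(abstracts):
    """Count the number of abstracts in which each keyword appears."""
    freq = Counter()
    for text in abstracts:
        freq.update(set(tokenize(text)))
    return freq


def cooccurrence_edges(abstracts):
    """Count keyword pairs that appear together in the same abstract."""
    edges = Counter()
    for text in abstracts:
        for pair in combinations(sorted(set(tokenize(text))), 2):
            edges[pair] += 1
    return edges


def degree_centrality(edges):
    """Share of the other keywords that each keyword co-occurs with."""
    neighbours = {}
    for a, b in edges:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    n = len(neighbours)
    return {k: (len(v) / (n - 1) if n > 1 else 0.0) for k, v in neighbours.items()}
```

Comparing the frequency and centrality rankings produced for each region's abstracts is one way to reproduce the kind of cross-country comparison described in the abstract.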

[1] Young Geun Hyun, Jindeuk Ko, Jeong Hyeon Han, "Technological Trends of Natural Language Processing Based Semantic Analysis: A Comparative Study of the US, the EU, and Korea Patents Data", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/wqem-sj55. Accessed: Apr. 04, 2020.