Chinese Personal Name Disambiguation Based on Vector Space Model Qing-hu Fan Hong-ying Zan Yu-mei Chai College of Information Engineering, Yu-xiang Jia Zhengzhou University, Zhengzhou, College of Information Engineering, Henan ,China Zhengzhou University, Zhengzhou,
[email protected] Henan ,China {iehyzan, ieymchai, ieyx- jia}@zzu.edu.cn Gui-ling Niu Foreign Languages School, Zhengzhou University, Zhengzhou, Henan ,China
[email protected] associate professor at the Department of Me- Abstract chanical and Electrical Engineering, Physics and Electrical and Mechanical Engineering College of Xiamen University, or it may be Peking Uni- versity Founder Group Corp that was established This paper introduces the task of Chinese per- by Peking University. It needs to associate with sonal name disambiguation of the Second context for disambiguating the entity Fang Zheng. CIPS-SIGHAN Joint Conference on Chinese For example, Fang Zheng who is an associate Language Processing (CLP) 2012 that Natural Language Processing Laboratory of professor at Xiamen University can be extracted Zhengzhou University took part in. In this task, with the feature that Xiamen University, Me- we mainly use the Vector Space Model to dis- chanical and Electrical Engineering College or ambiguate Chinese personal name. We extract associate professor, which can eliminate ambigu- different named entity features from diverse ity. names information, and give different weights to various named entity features with the im- 2 Related Research portance. First of all, we classify all the name documents, and then we cluster the documents In the early stages of Named Entity Disambigua- that cannot be mapped to names that have tion, Bagga and Baldwin (1998) use Vector been defined.