Welcome to the IKCEST

IEEE Transactions on Visualization and Computer Graphics | Vol.24, Issue.1 | | Pages 361-370

IEEE Transactions on Visualization and Computer Graphics

ConceptVector: Text Visual Analytics via Interactive Lexicon Building Using Word Embedding

Deokgun Park   Seungyeon Kim   Jurim Lee   Jaegul Choo   Niklas Elmqvist   Nicholas Diakopoulos  
Abstract

Central to many text analysis methods is the notion of a concept: a set of semantically related keywords characterizing a specific object, phenomenon, or theme. Advances in word embedding allow building a concept from a small set of seed terms. However, naive application of such techniques may result in false positive errors because of the polysemy of natural language. To mitigate this problem, we present a visual analytics system called ConceptVector that guides a user in building such concepts and then using them to analyze documents. Document-analysis case studies with real-world datasets demonstrate the fine-grained analysis provided by ConceptVector. To support the elaborate modeling of concepts, we introduce a bipolar concept model and support for specifying irrelevant words. We validate the interactive lexicon building interface by a user study and expert reviews. Quantitative evaluation shows that the bipolar lexicon generated with our methods is comparable to human-generated ones.

Original Text (This is the original text for your reference.)

ConceptVector: Text Visual Analytics via Interactive Lexicon Building Using Word Embedding

Central to many text analysis methods is the notion of a concept: a set of semantically related keywords characterizing a specific object, phenomenon, or theme. Advances in word embedding allow building a concept from a small set of seed terms. However, naive application of such techniques may result in false positive errors because of the polysemy of natural language. To mitigate this problem, we present a visual analytics system called ConceptVector that guides a user in building such concepts and then using them to analyze documents. Document-analysis case studies with real-world datasets demonstrate the fine-grained analysis provided by ConceptVector. To support the elaborate modeling of concepts, we introduce a bipolar concept model and support for specifying irrelevant words. We validate the interactive lexicon building interface by a user study and expert reviews. Quantitative evaluation shows that the bipolar lexicon generated with our methods is comparable to human-generated ones.

+More

Cite this article
APA

APA

MLA

Chicago

Deokgun Park, Seungyeon Kim, Jurim Lee, Jaegul Choo, Niklas Elmqvist, Nicholas Diakopoulos,.ConceptVector: Text Visual Analytics via Interactive Lexicon Building Using Word Embedding. 24 (1),361-370.

Disclaimer: The translated content is provided by third-party translation service providers, and IKCEST shall not assume any responsibility for the accuracy and legality of the content.
Translate engine
Article's language
English
中文
Pусск
Français
Español
العربية
Português
Kikongo
Dutch
kiswahili
هَوُسَ
IsiZulu
Action
Recommended articles

Report

Select your report category*



Reason*



By pressing send, your feedback will be used to improve IKCEST. Your privacy will be protected.

Submit
Cancel