MURAWAKI Yugo

Assistant Professor

Department of Intelligence Science and Technology,
Graduate School of Informatics,
Kyoto University
Yoshida-honmachi, Sakyo-ku, Kyoto, 606-8501, Japan

Phone: +81-75-753-5962
Fax: +81-75-753-5962
Email: murawaki (at) i (dot) kyoto-u (dot) ac (dot) jp

Research Interest

The Japanese language (1) does not delimit words by white space (like Chinese and Thai), (2) is written with several different character types such as kanji, hiragana and katakana, and (3) is agglutinative (rich in morphology). These features pose challenging problems in natural language processing. For example, we cannot use the split-on-space method to extract morphemes (words) from text. Simple string matching sometimes fails to find unknown morphemes because they are covered by shorter known morphemes. There is no orthographic distinction (i.e. capitalization) between common and proper nouns, and there seems no morphosyntactic (grammatical) distinction between them.

I have been working on automatic lexicon acquisition from text, as without it, we cannot correctly segment text into morphemes. I fully exploited the orthographical and linguistic features of Japanese: I used the mixed orthography to find unknown morphemes, and the agglutinative nature to identify their morphological categories. Currently I am working on classifying automatically acquired nouns into common and proper nouns with lexicosyntactic clues.

I am also interested in applying our findings in Japanese to typologically similar languages such as Mongolian, Uyghur and Manchu.

Education

Professional Experience

Academic Society

Skills

Publications

Journal
  • Yugo Murawaki. Spatial Structure of Evolutionary Models of Dialects in Contact. PLOS ONE, 10 (7), July 2015.
  • Yugo Murawaki. Exploiting Inter-label Dependencies in Hierarchical Multi-Label Document Classification. Journal of Natural Language Processing, Vol.21, No.1, March 2014. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Online Acquisition of Japanese Unknown Morphemes using Morphological Constraints. Journal of Natural Language Processing, Vol.17, No.1, January 2010. (in Japanese)
Conference
  • Kenji Yamauchi and Yugo Murawaki. Contrasting Vertical and Horizontal Transmission of Typological Features. In Proceedings of the 26th International Conference on Computational Linguistics (COLING2016), pp. 836-846, Osaka, Japan, December 2016.
  • Yugo Murawaki. Statistical Modeling of Creole Genesis. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2016), pp. 1329-1339, San Diego, U.S., June 2016.
  • Yugo Murawaki and Shinsuke Mori. Wikification for Scriptio Continua. In Proceedings of the 10th Edition of its Language Resources and Evaluation Conference (LREC 2016), pp. 1346-1351, Portoro┼ż, Slovenia, May 2016.
  • Yugo Murawaki. Continuous Space Representations of Linguistic Typology and their Application to Phylogenetic Inference. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2015), pp. 324-334, Denver, U.S., June 2015.
  • Yugo Murawaki. Global Model for Hierarchical Multi-Label Text Classification, In 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 46-54, Nagoya, Japan, October 2013.
  • Yugo Murawaki and Sadao Kurohashi. Semi-Supervised Noun Compound Analysis with Edge and Span Features. In Proceedings of COLING 2012: Technical Papers, pp. 1915-1931, Mumbai, India, December 2012.
  • Yugo Murawaki and Sadao Kurohashi. Non-parametric Bayesian Segmentation of Japanese Noun Phrases, In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP2011), pp. 605-615, Edinburgh, UK, July 2011.
  • Yugo Murawaki, Sadao Kurohashi. Semantic Classification of Automatically Acquired Nouns using Lexico-Syntactic Clues,. In COLING 2010: Posters, pp. 876-884, Beijing, China, August 2010.
  • Yugo Murawaki, Sadao Kurohashi. Online Japanese Unknown Morpheme Detection using Orthographic Variation. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), pp. 832-839, Malta, May 2010.
  • Yugo Murawaki, Sadao Kurohashi. Online Acquisition of Japanese Unknown Morphemes using Morphological Constraints. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing (EMNLP2008), pp. 429-437, Honolulu, Hawai'i, August 2008.
Unrefereed
  • Yugo Murawaki. Induction of Latent Binary Parameters from Linguistic Typological Features. In Proceedings of The 23rd Annual Meeting of The Association for Natural Language Processing (NLP2017), pp. ???-???, Tsukuba, March 2017. (in Japanese) (to appear).
  • Yudai Kishimoto, Shinnosuke Sawada, Yugo Murawaki, Daisuke Kawahara and Sadao Kurohashi. Improving the Annotation of Discourse Relations using Crowdsoursing. In Proceedings of The 23rd Annual Meeting of The Association for Natural Language Processing (NLP2017), pp. ???-???, Tsukuba, March 2017. (in Japanese) (to appear).
  • Yugo Murawaki. Mixture Models for Creole Genesis. In Proceedings of The 22nd Annual Meeting of The Association for Natural Language Processing (NLP2016), pp. 853-856, Sendai, March 2016. (in Japanese)
  • Yugo Murawaki. Continuous Space Representations of Linguistic Typology and their Application to Phylogenetic Inference. In Proceedings of The 21st Annual Meeting of The Association for Natural Language Processing (NLP2015), pp. 337-340, Kyoto, March 2015. (in Japanese)
  • Yugo Murawaki. Does the Lexicons of a Dialect Group Form a Phylogenetic Tree?. In the 9th Symposium for the Young Researcher Association for NLP Studies (YANS2014), 10 pages, Miura, September 2015. (in Japanese)
  • Yugo Murawaki, Shunsuke Aihara, Taisuke Harada, Makoto Nagao and Kumiko Tanaka-Ishii. The Scoring Method for the Reverse Dictionary Makoto. In Proceedings of The 20th Annual Meeting of The Association for Natural Language Processing (NLP2014), pp. 396-399, Sapporo, March 2014. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Noun Phrase Segmentation using Hybrid Type-based Sampling. In Proceedings of The Seventeenth Annual Meeting of The Association for Natural Language Processing (NLP2011), pp. 564-567, Toyohashi, March 2011. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Classification of Nouns Automatically Acquired from Text. In Proceedings of The Sixteenth Annual Meeting of The Association for Natural Language Processing (NLP2010), Tokyo, March 2010. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Processing Real-time Web Text with the aid of Online Lexicon Acquisition. In Proceedings of The Sixteenth Annual Meeting of The Association for Natural Language Processing (NLP2010, poster), Tokyo, March 2010. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Detecting Over-segmented Unknown Morphemes for Lexicon Acquisition. In Proceedings of The Fifteenth Annual Meeting of The Association for Natural Language Processing (NLP2009), pp. 324-327. Tottori, March 2009. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Unknown Morphemes Acquisition using Morphological Constraints. In Proceedings of The Fourteen Annual Meeting of The Association for Natural Language Processing (NLP2008), pp. 805-808. Tokyo, March 2008. (in Japanese)
  • Yugo Murawaki. Building Language Processing Base for Minority Languages: In the Case of Cyrillic Mongolian.In the Proceedings of the Forty-ninth Programming Symposium, pp. 141-148, Kanagawa, January 2008. (in Japanese)
  • Yugo Murawaki, Sadao Kurohashi. Constructing Dynamic Ontology with Predicate-Argument Structure for Information Analysis. In Proceedings of The Thirteen Annual Meeting of The Association for Natural Language Processing (NLP2008), pp. 867-870. Shiga, March 2007. (in Japanese)

Awards

Other Academic Activities

Last Updated: March 2017.