Ruey-Cheng Chen

National Taiwan University
1 Roosevelt Rd. Sec. 4, Taipei 106, Taiwan
+886 2 33664888 ext 303
rueycheng@turing.csie.ntu.edu.tw

I am a postdoctoral researcher at National Taiwan University, where I received my doctorate in 2013. During my Ph.D. studies, I worked closely with my advisor Jieh Hsiang at the Department of Computer Science and Information Engineering on natural language processing and information retrieval. I also spent a year visiting the University of Southern California, working with Andrew Gordon at the Institute for Creative Technologies on topic modeling and cluster analysis. Prior to that, I worked full-time as a research assistant at Academia Sinica, in Lee-Feng Chien's group at the Institute of Information Science, where our research focused on information retrieval and web mining.

I received my B.S. and M.S. from National Taiwan University in 2001 and 2003, respectively.

Publications

Ruey-Cheng Chen. An improved MDL-based compression algorithm for unsupervised word segmentation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Short Papers), ACL '13, to appear. preprint

Ruey-Cheng Chen and Chia-Jung Lee. An information-theoretic account of static index pruning. In Proceedings of the 36th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '13, to appear. preprint

Ruey-Cheng Chen. Information preservation and its application to natural language processing. PhD Dissertation. 2013. preprint slides

Ruey-Cheng Chen, Chia-Jung Lee, Chiung-Min Tsai, and Jieh Hsiang. Information preservation in static index pruning. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM '12, pages 2487-2490, New York, NY, USA, 2012. ACM. pdf poster

Ruey-Cheng Chen, Chiung-Min Tsai, and Jieh Hsiang. A regularized compression method to unsupervised word segmentation. In Proceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology, SIGMORPHON '12, pages 26-34, Montreal, Canada, 2012. Association for Computational Linguistics. pdf slides

Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, and Pu-Jen Cheng. Sampling the web as training data for text classification. International Journal of Digital Library Systems (IJDLS), 1(4):24-42, 2010. preprint

Ruey-Cheng Chen, Chiung-Min Tsai, and Jieh Hsiang. Relevance model revisited: With multiple document representations. In Proceedings of the 6th Asia Information Retrieval Societies Conference on Information Retrieval Technology, AIRS '10, pages 37-48, Berlin, Heidelberg, 2010. Springer-Verlag. preprint slides

Ruey-Cheng Chen, Reid Swanson, and Andrew Gordon. An adaptation of topic modeling to sentences. 2010. Unpublished. pdf

Chia-Jung Lee, Yi-Chun Lin, Ruey-Cheng Chen, Pei-Sen Liu, and Pu-Jen Cheng. Query formulation by selecting good terms. In Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, ROCLING '09, pages 69-84, Taichung, Taiwan, 2009. pdf

Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, Pu-Jen Cheng, and Pei-Sen Liu. Web mining for unsupervised classification. In Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, ROCLING '09, pages 53-68, Taichung, Taiwan, 2009. pdf

Chia-Jung Lee, Yi-Chun Lin, Ruey-Cheng Chen, and Pu-Jen Cheng. Selecting effective terms for query formulation. In Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology, AIRS '09, pages 168-180, Berlin, Heidelberg, 2009. Springer-Verlag. pdf

Chia-Jung Lee, Ruey-Cheng Chen, Shao-Hang Kao, and Pu-Jen Cheng. A term dependency-based approach for query terms ranking. In Proceedings of the 18th ACM conference on Information and knowledge management, CIKM '09, pages 1267-1276, New York, NY, USA, 2009. ACM. pdf

Shuo-Peng Liao, Pu-Jen Cheng, Ruey-Cheng Chen, and Lee-Feng Chien. LiveImage: Organizing Web images by relevant concepts. In Proceedings of the Workshop on the Sciences of the Artificial, WSA '05, pages 210-220, Hualien, Taiwan, 2005. pdf

Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, and Lee-Feng Chien. Translating unknown queries with web corpora for cross-language information retrieval. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '04, pages 146-153, New York, NY, USA, 2004. ACM. pdf

Pao-Ann Hsiung, Farn Wang, and Ruey-Cheng Chen. On the verification of wireless transaction protocol using SGM and RED. In Proceedings of the Seventh International Conference on Real-Time Systems and Applications, RTCSA '00, Washington, DC, USA, 2000. IEEE Computer Society. pdf

Software

I am actively developing and maintaining the following software packages:

Fast Facts

This is my curriculum vitae.

I am a classic rock fan, a bassist, and a Linux user.

I like beer and math. (Who wouldn't?)

Since late 2011, I have officially been the in-house babysitter for my little boy.