I am a postdoctoral researcher at National Taiwan University. I received my doctorate in 2013 from National Taiwan University. During my Ph.D. study, I worked closely with my advisor Jieh Hsiang at the Department of Computer Science and Information Engineering on natural language processing and information retrieval. I have also spent an year visiting University of Southern California, working with Andrew Gordon at the Institute for Creative Technologies on topic modeling and cluster analysis. Prior to that, I worked full-time as a research assistant at Academia Sinica, in Lee-Feng Chien's group at the Institute of Information Science. Our research work focused more on information retrieval and web mining.
I received my B.S. and M.S. from National Taiwan University in 2001 and 2003, respectively.
Ruey-Cheng Chen. An improved MDL-based compression algorithm for unsupervised word segmentation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Short Papers), ACL '13, to appear. preprint
Ruey-Cheng Chen and Chia-Jung Lee. An information-theoretic account of static index pruning. In Proceedings of the 36th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '13, to appear. preprint
Ruey-Cheng Chen. Information preservation and its application to natural language processing. PhD Dissertation. 2013. preprint slides
Ruey-Cheng Chen, Chia-Jung Lee, Chiung-Min Tsai, and Jieh Hsiang. Information preservation in static index pruning. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM '12, pages 2487-2490, New York, NY, USA, 2012. ACM. pdf poster
Ruey-Cheng Chen, Chiung-Min Tsai, and Jieh Hsiang. A regularized compression method to unsupervised word segmentation. In Proceedings of the Twelfth Meeting of the the Special Interest Group on Computational Morphology and Phonology, SIGMORPHON '12, pages 26-34, Montreal, Canada, 2012. Association for Computational Linguistics. pdf slides
Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, and Pu-Jen Cheng. Sampling the web as training data for text classification. International Journal of Digital Library Systems (IJDLS), 1(4):24-42, 2010. preprint
Ruey-Cheng Chen, Chiung-Min Tsai, and Jieh Hsiang. Relevance model revisited: With multiple document representations. In Proceedings of the 6th Asia Information Retrieval Societies Conference on Information Retrieval Technology, AIRS '10, pages 37-48, Berlin, Heidelberg, 2010. Springer-Verlag. preprint slides
Ruey-Cheng Chen, Reid Swanson, and Andrew Gordon. An adaptation of topic modeling to sentences. 2010. Unpublished. pdf
Chia-Jung Lee, Yi-Chun Lin, Ruey-Cheng Chen, Pei-Sen Liu, and Pu-Jen Cheng. Query formulation by selecting good terms. In Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, ROCLING '09, pages 69-84, Taichung, Taiwan, 2009. pdf
Wei-Yen Day, Chun-Yi Chi, Ruey-Cheng Chen, Pu-Jen Cheng, and Pei-Sen Liu. Web mining for unsupervised classification. In Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, ROCLING '09, pages 53-68, Taichung, Taiwan, 2009. pdf
Chia-Jung Lee, Yi-Chun Lin, Ruey-Cheng Chen, and Pu-Jen Cheng. Selecting effective terms for query formulation. In Proceedings of the 5th Asia Information Retrieval Symposium on Information Retrieval Technology, AIRS '09, pages 168-180, Berlin, Heidelberg, 2009. Springer-Verlag. pdf
Chia-Jung Lee, Ruey-Cheng Chen, Shao-Hang Kao, and Pu-Jen Cheng. A term dependency-based approach for query terms ranking. In Proceedings of the 18th ACM conference on Information and knowledge management, CIKM '09, pages 1267-1276, New York, NY, USA, 2009. ACM. pdf
Shuo-Peng Liao, Pu-Jen Cheng, Ruey-Cheng Chen, and Lee-Feng Chien. LiveImage: Organizing Web images by relevant concepts. In Proceedings of the Workshop on the Sciences of the Artificial, WSA '05, pages 210-220, Hualien, Taiwan, 2005. pdf
Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-Haur Wang, Wen-Hsiang Lu, and Lee-Feng Chien. Translating unknown queries with web corpora for cross-language information retrieval. In Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '04, pages 146-153, New York, NY, USA, 2004. ACM. pdf
Pao-Ann Hsiung, Farn Wang, and Ruey-Cheng Chen. On the verification of wireless transaction protocol using SGM and RED. In Proceedings of the Seventh International Conference on Real-Time Systems and Applications, RTCSA '00, Washington, DC, USA, 2000. IEEE Computer Society. pdf
I am actively developing and maintaining the following software packages:
This is my curriculum vitae.
I am a classic rock fan, a bassist, and a Linux user.
Like beers and math. (Who wouldn't?)
Since late 2011, I have officially become the in-house babysitter for my little boy.