Information Retrieval
Spring 2014
9:00 AM ~ 12:00 PM, Thursdays
Instructor: Prof. Berlin Chen (陳柏琳)
Tentative List of Topics:
02/20 | Book Chapter: Modern Information Retrieval, Ch. 1; Paper: The History of Information Retrieval Research | ||
02/27 | Classical Models | cf. Modern Information Retrieval, Ch. 3 | |
03/06 | Classical Models | ||
03/13 | Evaluation Metrics | ||
03/20 | Benchmark Collections | HW#1: Evaluations for IR (Due: 03/27) | |
03/27 | Extensions of Classic (Set, Algebra & Probabilistic) Models | HW#2: Classic Models for IR and Query Expansion (Due: 04/10) | |
04/03 | Exercise | ||
04/10 | Relevance Feedback and Query Expansion | ||
04/17 | Latent Semantic Analysis | HW#3: LSA for IR (Due: 05/16) | |
04/24 | Language Modeling for Information Retrieval | ||
05/01 | Language Modeling for Information Retrieval | ||
05/08 | Clustering: Metrics and Techniques | ||
05/15 | Clustering: Metrics and Techniques | ||
05/22 | Indexing and Searching | HW#4: LM for IR (Due: 06/05) | |
05/29 | Paper Presentation: | ||
• | 曾厚強: Learning to Predict Readability using Diverse Linguistic Features (COLING 2010) | |
• | 吳佳厚: Concept-Based Information Retrieval Using Explicit Semantic Analysis (ACM TOIS 2011) | |
• | 陳煥元: Entity-Centric Document Filtering: Boosting Feature Mapping through Meta-Features (CIKM 2013) | |
• | 張庭豪: CRF Framework for Supervised Preference Aggregation (CIKM 2013) | |
• | 朱聖池: Multi-Label Classification by Mining Label and Instance Correlations from Heterogeneous Information Networks | |
• | 劉書宇: Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets (WI&IAT 2012) | |
06/05 | Paper Presentation: | ||
• | 王思涵: Search Result Diversification in Resource Selection for Federated Search (SIGIR 2013) | |
• | 劉憶年: How Do Users Respond to Voice Input Errors? Lexical and Phonetic Query Reformulation in Voice Search (SIGIR 2013) | |
• | 彭紹峻: Using Micro-Reviews to Select an Efficient Set of Reviews (CIKM 2013) | |
• | 吳培豪: Toward Whole-Session Relevance: Exploring Intrinsic Diversity in Web Search (SIGIR 2013) | |
• | 施凱文: Graph-of-word and TW-IDF: New Approach to Ad Hoc IR (CIKM 2013) | |
• | 陳思澄: An Unsupervised Topic Segmentation Model Incorporating Word Order (SIGIR 2013) | |
06/12 | User Interfaces for Search | ||
06/19 | Web Search Basics | ||
Textbooks:
• | R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval: The Concepts and Technology behind Search (2nd Edition), ACM Press, 2011 | |
• | Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008 | |
• | W. Bruce Croft, Donald Metzler, and Trevor Strohman, Search Engines: Information Retrieval in Practice, Addison Wesley, 2009 | |
References:
Papers:
• | M. Sanderson and W. B. Croft, "The history of information retrieval research," Proceedings of the IEEE, Vol. 100, pp. 1444 - 1451, May 2012. | |
• | O. Kolomiyets and M.-F. Moens, "A survey on question answering technology from an information retrieval perspective," Information Sciences, Vol. 181, pp. 5412–5434, 2011. | |
• | Johan Schalkwyk et al., "Google Search by Voice: A case study," 2010. | |
• | D. Blei, A. Ng, and M. Jordan, "Latent Dirichlet allocation," Journal of Machine Learning Research, 3:993-1022, January 2003. | |
• | V. Lavrenko and W. B. Croft, "Relevance-Based Language Models," ACM SIGIR 2001. | |
• | C. H. Papadimitriou, P. Raghavan, H. Tamaki, and S. Vempala, "Latent semantic indexing: A probabilistic analysis." Analyzes an information retrieval technique related to principal components analysis. | |
• | X. Liu and W. B. Croft, "Statistical Language Modeling for Information Retrieval," Annual Review of Information Science and Technology, Vol. 39, 2005. | |
• | Lan Huang. A Survey On Web Information Retrieval Technologies. 2000. | |
• | Karen Spärck Jones, "Some Points in a Time," Computational Linguistics, Vol. 31, No. 1, 2005. | |
• | D. Hiemstra, "Information Retrieval Models," in A. Goker, J. Davies, and M. Graham (eds.), Information Retrieval: Searching in the 21st Century, Wiley, 2009. | |
• | M. Steyvers, T. Griffiths, "Probabilistic Topic Models," In T. K. Landauer, D. S. McNamara, S. Dennis, W. Kintsch (eds.). Handbook of Latent Semantic Analysis, Mahwah NJ: Lawrence Erlbaum, 2007. | |
• | X. Yi, J. Allan, "A Comparative Study of Utilizing Topic Models for Information Retrieval," in the Proceedings of ECIR'09. | |
• | Nallapati, Discriminative Models for Information Retrieval, in the Proceedings of SIGIR 2004 | |
• | T. Joachims and F. Radlinski, Search Engines that Learn from Implicit Feedback, IEEE Trans. on Computer 40(8), pp. 34-40, 2007 | |
• | B. Chen, H.M. Wang, L.S. Lee, “A discriminative HMM/N-gram-based retrieval approach for Mandarin spoken documents,” ACM Transactions on Asian Language Information Processing, Vol. 3, No. 2, pp. 128-145, June 2004. | |
Information Retrieval Resources
• SIGIR-Information Retrieval Resources