National Institute of Technology Rourkela

राष्ट्रीय प्रौद्योगिकी संस्थान राउरकेला

ଜାତୀୟ ପ୍ରଯୁକ୍ତି ପ୍ରତିଷ୍ଠାନ ରାଉରକେଲା

An Institute of National Importance

Syllabus

Course Details

Subject {L-T-P / C} : CS4328 : Information Retrieval { 3-0-0 / 3}

Subject Nature : Theory

Coordinator : Prof. Sambit Bakshi

Syllabus

1 Boolean retrieval
2 The term vocabulary and postings lists
3 Dictionaries and tolerant retrieval
4 Index construction
5 Index compression
6 Scoring, term weighting and the vector space model
7 Computing scores in a complete search system
8 Evaluation in information retrieval
9 Relevance feedback and query expansion
10 XML retrieval
11 Probabilistic information retrieval
12 Language models for information retrieval
13 Text classification and Naive Bayes
14 Vector space classification
15 Support vector machines and machine learning on documents
16 Flat clustering
17 Hierarchical clustering
18 Matrix decompositions and latent semantic indexing
19 Web search basics
20 Web crawling and indexes
21 Link analysis

Course Objectives

  • To make students aware of differeny types of retrieval process
  • To make students able to analyse evaluation power of a given system
  • To make students aware of web search methodologies behind the search engines

Course Outcomes

Students will learn the basics of information retrieval by popular search engines.

Essential Reading

  • R. Baeza-Yates and B. Ribiero-Neto, Modern Information Retrieval, Pearson , ISBN: 9788131709771
  • C.D. Manning, P. Raghavan, and H. Schutze, Introduction to Information Retrieval, Cambridge University Press , ISBN: 9780521865715

Supplementary Reading

  • C.D. Manning, Studyguide for an Introduction to Information Retrieval, Cram101 , ISBN: 9780521865715
  • W.B. Frakes and R. Baeza-Yates, Information Retrieval: Data Structures and Algorithms, Pearson India , ISBN: 9788131716922

Journal and Conferences

  • European Conference on Information Retrieval
  • Information Retrieval Journal, Springer