The aim of this project is to build a prototype of a large-scale search engine which works on millions of wikipedia pages(which are in xml format of more than 44GB) and retrieves the top-10 relevant wikipedia documents that matches the input query. This search engine takes wikipedia Corpus in XML format which is available at as input. Then it indices millions of wikipedia pages involving a comparable number of distinct terms. The final size of the index will be around 7-10 GB. Then given a query, it retrieves relevant ranked documents and their titles using this index.


Information Retrieval, Search engines

Technologies Needed

Project Structure

What we provide

  • Videos on Required technologies by Ravindrababu Ravula
  • Project Implementation Videos by Ravindrababu Ravula
  • Presentation Slides
  • Assignments with solutions
  • 12 weeks of expert guidance
  • Assistance to Complete the Documentation

Fee Structure

Fee for the Complete Project including the required technical courses is Rs. 20000.00(Please refer to the specific project to know the list of courses included)

Commencement and Validity

  • Programme commences from Jan 1, 2017
  • Validity for video lectures is 6-months from the commencement of the project
  • Expert guidance will be provided only for 12weeks from commencement of the project

Registration procedure

If you are interested in registering, you can make the payment in the following account either through net banking or at your nearest HDFC bank and email us the transaction id or scan copy of the pay-in-slip.
Account Name : Raudra Eduservices Private Limited
Account Number : 50200012182576
Account type : Current account
Bank : HDFC
After the payment is done, you can email us the screen shot or picture of transaction details or the pictures of the bank pay in slip at Once it is done, you will receive an acknowledgement mail regarding your payment status and will be given access to private lecture videos, assignments and source code from Jan 1 2017. You can watch the videos online anytime, anywhere and any number of times. Please note that the videos are not downloadable. Sharing your access or trying to sell or distribute videos is a legally punishable offense. Earlier we caught some people doing this and they were punished legally and a huge penalty was imposed on them.