Multi-threaded Extension of the IR Platform Terrier
Overview
Searching for patents on the large amount of available patents is a time consuming task. A parallel implementation of the Information Retrieval toolkit Terrier on a high-performance computer would increase the efficiency of the search process in very large document collections.
The goal of this project was to extend the Information Retrieval toolkit Terrier in such a way that it can be employed in a parallelised way on a supercomputing infrastructure. This included the parallelisation of the indexing process and resulting indexes as well as the parallelisation of query expansion algorithms and the query processing itself.
Project Partners
- Fondazione Ugo Bordoni, IT (Research & Development)