Open Source IR Tools
Indexing
- Lemur/Indri http://www.lemurproject.org/
- Lucene http://lucene.apache.org
- MG4J http://mg4j.dsi.unimi.it/
- Sphinx http://sphinxsearch.com/
- Terrier http://www.ir.dcs.gla.ac.uk/terrier/
- Wumpus http://www.wumpus-search.org/
Text annotation and Information Extraction
Document Categorization
Via data mining tools such as:
Image retrieval
Machine translation
NLP - full-scale parsers
- Stanford parser http://nlp.stanford.edu/software/lex-parser.shtml#Download
- Minipar http://ai.stanford.edu/~rion/parsing/minipar_viz.html (download: http://webdocs.cs.ualberta.ca/~lindek/minipar/)
- Combinatory Categorial Grammar and Boxer (generates semantic representations) http://svn.ask.it.usyd.edu.au/trac/candc/wiki/boxer
- Part-of-speech tagger: TreeTagger - a language-independent part-of-speech tagger http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html
Linguistic search corpus engines and linguistic annotation tools
- Manatee – a linguistic search corpus engine http://www.textforge.cz/
- TIGER - Corpus Query Tool with the goal of extending the NeGra corpus and its annotation. http://www.ims.uni-stuttgart.de/projekte/TIGER/
The annotation tool is http://www.coli.uni-saarland.de/projects/sfb378/negra-corpus/annotate.html
- Callisto (PATExpert used Callisto, by the MITRE Corporation, to annotate part-whole relations and motions in patents) http://callisto.mitre.org/index.html
- Annotation platforms for semantic role labeling used for Propbank http://code.google.com/p/propbank/
- Multilingual Propbank Annotation Tool Cornerstone and Jubilee http://www.aclweb.org/anthology/N/N10/N10-2004.pdf
- Introduction to Semantic roles http://www.ilc.cnr.it/EAGLES96/rep2/node8.html
- A toolkit for a Generative Lexicon http://hal.archives-ouvertes.fr/docs/00/27/99/36/PDF/06_Henry_Paper.pdf
Ontology platforms
- Protégé http://protege.stanford.edu/
- WebODE http://webode.dia.fi.upm.es/WebODEWeb/index.html (if you do not already have a WebODE password, you will be asked to use the NeOn toolkit instead: http://neon-toolkit.org/wiki/Main_Page)
IRF Survey
Download the Survey on Patent Users Search Behavior, Search Functionality and System Requirements