.. IEPY documentation master file, created by
   sphinx-quickstart on Wed Apr 23 20:02:15 2014.
   You can adapt this file completely to your liking, but it should at least
   contain the root `toctree` directive.

Welcome to IEPY's documentation!
================================

IEPY is an open source tool for Information Extraction focused on Relation Extraction.

To give an example of Relation Extraction, if we are trying to find a birth date in:

`"John von Neumann (December 28, 1903 – February 8, 1957) was a Hungarian and American pure and applied mathematician, physicist, inventor and polymath."`

then IEPY's task is to identify "``John von Neumann``" and "``December 28, 1903``" as the subject and object entities of the "``was born in``" relation.

It's aimed at:

- users needing to perform Information Extraction on a large dataset.
- scientists wanting to experiment with new IE algorithms.

You can follow the development of this project and report issues at http://github.com/machinalis/iepy or join the mailing list.

Features
--------

- A corpus annotation tool with a web-based UI.
- An active learning relation extraction tool pre-configured with convenient defaults.
- A rule-based relation extraction tool for cases where the documents are semi-structured or high precision is required.
- A web-based user interface that:

  - Allows lay users to control some aspects of IEPY.
  - Allows decentralization of human input.

- A shallow entity ontology with coreference resolution via Stanford CoreNLP.
- An easily hackable active learning core, ideal for scientists wanting to experiment with new algorithms.

Contents:
---------

.. toctree::
   :maxdepth: 2

   installation
   tutorial
   instantiation
   active_learning_tutorial
   rules_tutorial
   preprocess
   gazettes
   corpus_labeling
   how_to_hack
   troubleshooting
   language

Authors
-------

IEPY is © 2014 Machinalis in collaboration with the NLP Group at UNC-FaMAF.
Its primary authors are:

* Rafael Carrascosa (rafacarrascosa at github)
* Javier Mansilla (jmansilla at github)
* Gonzalo García Berrotarán (j0hn at github)
* Franco M. Luque (francolq at github)
* Daniel Moisset (dmoisset at github)

Changelog
---------

.. include:: Changelog