IEPY
====
IEPY is an open source tool for
`Information Extraction `_
focused on Relation Extraction.
To give an example of Relation Extraction, if we are trying to find a
birth date in:
`"John von Neumann (December 28, 1903 – February 8, 1957) was a Hungarian and
American pure and applied mathematician, physicist, inventor and polymath."`
then IEPY's task is to identify "``John von Neumann``" and
"``December 28, 1903``" as the subject and object entities of the "``was born in``"
relation.
It's aimed at:
- `users `_
needing to perform Information Extraction on a large dataset.
- `scientists `_
wanting to experiment with new IE algorithms.
Features
--------
- `A corpus annotation tool `_
with a `web-based UI `_
- `An active learning relation extraction tool `_
pre-configured with convenient defaults.
- `A rule based relation extraction tool `_
for cases where the documents are semi-structured or high precision is required.
- A web-based user interface that:
- Allows layman users to control some aspects of IEPY.
- Allows decentralization of human input.
- A shallow entity ontology with coreference resolution via `Stanford CoreNLP `_
- `An easily hack-able active learning core `_,
ideal for scientist wanting to experiment with new algorithms.
Installation
------------
Install the required packages:
.. code-block:: bash
sudo apt-get install build-essential python3-dev liblapack-dev libatlas-dev gfortran openjdk-7-jre
Then simply install with **pip**:
.. code-block:: bash
pip install iepy
Full details about the installation is available on the
`Read the Docs `__ page.
Running the tests
-----------------
If you are contributing to the project and want to run the tests, all you have to do is:
- Make sure your JAVAHOME is correctly set. `Read more about it here `_
- In the root of the project run `nosetests`
Learn more
----------
The full documentation is available on `Read the Docs `__.
Authors
-------
IEPY is © 2014 `Machinalis `_ in collaboration
with the `NLP Group at UNC-FaMAF `_. Its primary
authors are:
* Rafael Carrascosa (rafacarrascosa at github)
* Javier Mansilla (jmansilla at github)
* Gonzalo García Berrotarán (j0hn at github)
* Franco M. Luque (francolq at github)
* Daniel Moisset (dmoisset at github)
You can follow the development of this project and report issues at
http://github.com/machinalis/iepy
You can join the mailing list `here `__