Pattern based fact extraction Icon

Pattern based fact extraction

Tool for extracting structured information from Estonian language

Add a Review
0 Downloads (This Week)
Last Update:
  Browse Code Git Repository

Description

A lot of information is available in form of unstructured free texts. Pattern based fact extraction is one possible approach of information retrieval, which tries to extract information in structured form that is usable by other data mining algorithms. This software allows to build and apply models for extracting examples of different relations for Estonian language. A relation can describe any link between entities in the text. For instance, a birthday relation describes the connection between persons and their birth dates.

Features:
- preprocessing scripts with deep linguistic analysis
- GUI tool for making manual annotations, additionally using active learning to speed up the process
- scripts for training and applying relations on different corpora
- simple web front-end with embedded server for making using the software more convenient for users

Pattern based fact extraction Web Site

Categories

Linguistics

License

BSD License

Update Notifications





Write a Review

User Reviews

Be the first to post a review of Pattern based fact extraction!

Additional Project Details

Languages

English

Intended Audience

Information Technology, Science/Research

User Interface

Qt, Web-based

Programming Language

C++, Python, S/R

Registered

2012-04-21
Screenshots can attract more users to your project.
Features can attract more users to your project.

Icons must be PNG, GIF, or JPEG and less than 1 MiB in size. They will be displayed as 48x48 images.