corpus free download - SourceForge

Showing 5 open source projects for "corpus"

View related business solutions

Database Windows Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

WikiSQL

A large annotated semantic parsing corpus for developing NL interfaces

A large crowd-sourced dataset for developing natural language interfaces for relational databases. WikiSQL is the dataset released along with our work Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning. Regarding tokenization and Stanza, when WikiSQL was written 3-years ago, it relied on Stanza, a CoreNLP python wrapper that has since been deprecated. If you'd still like to use the tokenizer, please use the docker image. We do not anticipate switching...

Downloads: 0 This Week

Last Update: 2022-07-26
See Project
2

Arabic Named Entity Gazetteer

Arabic Named Entity Gazetteer

...This gazetteer is compiled using an xml tags such as <class_name>Arabic Named Entity</class_name>. Each line has an Arabic entity (UTF-8 encoding). This release of WikiFANE_Gazet consists of 68343 entities categorised into 50 classes. To use this corpus, please cite the following publication: F. Alotaibi and M. Lee, "Automatically Developing a Fine-grained Arabic Named Entity Corpus and Gazetteer by utilizing Wikipedia", In Proceedings of IJCNLP, p392-400. Nagoya, Japan, October, 2013. Author URL: http://www.cs.bham.ac.uk/~fsa081/index.html http://fsalotaibi.kau.edu.sa Email: fsalotaibi {AT} kau.edu.sa fsa081 {AT} cs.bham.ac.uk

Downloads: 0 This Week

Last Update: 2014-08-24
See Project
3

nardino1513

Latin texts from 1513: XML edition and XQueries

A TEI XML edition of a small corpus of Latin texts written in Dalmatian city of Zadar (today in Croatia, then under Venice rule) by two humanist teachers. The edition is accompanied by XQueries designed to discover interesting features of the corpus. Queries can be reproduced with an XML database (BaseX, eXist).

Downloads: 0 This Week

Last Update: 2015-05-24
See Project
4

CorpusSearch

CorpusSearch finds syntactic structures in a corpus of annotated sentence trees. It can be used as a research tool on a corpus, or as a development tool for building the corpus.

Downloads: 39 This Week

Last Update: 2013-06-26
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
5

Kura

Kura is a complete, professional system for the handling of linguistic data, especially fieldwork data from small-corpus languages. It allows users to enter texts in any language and analyze those texts

Downloads: 0 This Week

Last Update: 2013-04-25
See Project