corpus free download - SourceForge

Showing 3 open source projects for "corpus"

View related business solutions

Distributed Computing Clear Filters & Widen Search

Our Free Plans just got better! | Auth0 by Okta
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.

Try free now
Bright Data - All in One Platform for Proxies and Web Scraping
Say goodbye to blocks, restrictions, and CAPTCHAs

Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.

Get Started
1

SmartMuseum

Software for work with Corpus of Everyday life history Sources

... collected in many museums and document archives. In this project, we consider the problem of creating software infrastructure for collaborative semantic annotation, information relation, and personalized access to corpus of everyday life history sources. Project financially supported from Department for Humanities of Russian Fund for Basic Research according to project # 16-01-12033. Authors: Vdovenko A., Marchenkov S., Petrina O., Korzun D.

Downloads: 0 This Week

Last Update: 2018-01-30
See Project
2

GloVe

GloVe model for distributed word representation

GloVe is an unsupervised learning algorithm for obtaining vector representations for words. Training is performed on aggregated global word-word co-occurrence statistics from a corpus, and the resulting representations showcase interesting linear substructures of the word vector space. The links provided contain word vectors obtained from the respective corpora. If you want word vectors trained on massive web datasets, you need only download one of these text files! Pre-trained word vectors...

Downloads: 0 This Week

Last Update: 2021-09-30
See Project
3

Samudra-Manthan

Samudra Manthan uses C and MPI for finding interesting n-grams(terms) in a large corpus of data. We use the GigaWord corpus to find top m interesting n-grams using TF*IDF measure.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project