Showing 27 open source projects for "content analysis"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    MinerU

    MinerU

    A high-quality tool for convert PDF to Markdown and JSON

    MinerU is an open-source, high-quality document extraction toolkit focused on converting PDFs (and other document formats) into structured Markdown and JSON. It leverages OCR and layout analysis to preserve semantic structure and metadata, ideal for research and data science workflows.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    PDF4QT

    PDF4QT

    Open source PDF editor

    PDF4QT is open source PDF editor based on Qt framework. It contains a C++ library, applications for viewing/editing PDF documents, and a command line tool. PDF4QT is an open-source PDF editor for Windows/Linux. It is a modern solution for viewing/editing/rendering PDF documents, for users and developers alike. For developers, there is a C++ library and a command line tool for use in scripts. For users, there are four applications offering many features. The project is hosted on Github and...
    Downloads: 53 This Week
    Last Update:
    See Project
  • 3
    Vanilla.PDF

    Vanilla.PDF

    Cross-platform SDK for creating and modifying PDF documents

    ...The SDK offers full cross-platform support including Windows, Linux, macOS, and Android, with builds available for major compilers and architectures. Vanilla.PDF supports advanced PDF features such as adding CMS (PKCS#7) digital signatures, modifying content streams and metadata, and working with encryption and permissions based on standard PDF security models. It includes tools for parsing PDF internals like cross-reference tables and objects, providing fine-grained document analysis capabilities. The project is unit-tested with continuous integration pipelines, supporting sanitizers for enhanced code quality and stability.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Julia VS Code

    Julia VS Code

    Julia extension for Visual Studio Code

    ...With a completely live environment, Julia for VS Code aims to take the frustration and guesswork out of programming and put the fun back in. A hybrid “canvas programming” style combines the exploratory power of a notebook with the productivity and static analysis features of an IDE. VS Code is a powerful editor and customizable to your heart’s content (though the defaults are pretty good too). It has power features like multiple cursors, fuzzy file finding and Vim keybindings.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    DynamicalSystems.jl

    DynamicalSystems.jl

    Award winning software library for nonlinear dynamics timeseries

    DynamicalSystems.jl is an award-winning Julia software library for nonlinear dynamics and nonlinear time series analysis. To install DynamicalSystems.jl, run import Pkg; Pkg.add("DynamicalSystems"). To learn how to use it and see its contents visit the documentation, which you can either find online or build locally by running the docs/make.jl file. DynamicalSystems.jl is part of JuliaDynamics, an organization dedicated to creating high-quality scientific software. All implemented algorithms...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    ...For supported audio and video files, AI File Sorter can read embedded metadata (such as ID3, Vorbis, and MP4 tags) to suggest normalized names like year_artist_album_title.ext. AI analysis runs read-only, and all suggestions must be reviewed before being applied. AI File Sorter can run fully offline using local models like Mistral or LLaMA, so files and metadata stay on your device unless you configure a remote endpoint.
    Leader badge
    Downloads: 221 This Week
    Last Update:
    See Project
  • 7
    Catbird Linux

    Catbird Linux

    Linux for content creation, web scraping, coding, and data analysis.

    Catbird Linux is a USB pluggable Live Linux operating system built for media creation, web scraping, and software coding. It is the daily driver you want for retrieving data, making videos or podcasts, and making software tools to automate the repetitive tasks. It is ready for work in Python, Lua, and Go languages, with numerous packages for web scraping or downloading data via API calls. Using Catbird Linux, it is possible to accomplish in depth stock market analysis, track weather...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 8
    DVA-Profession

    DVA-Profession

    Mass video digitization workflow management for archiving

    Professional digital video archiving system solution, developed and used by the Austrian national audio/video archive, designed to handle vast amounts of video content from ingest to long-term storage, including analysis, transcoding and metadata.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 9
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    pdf combiner merger converter splitter

    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly open source free to use, GUI-based tool for combining, pdf to excel, pdf to word, image to pdf, zip, unzip annotate and splitting PDF files. It is easy to use, supports multiple file insert and delete and process, and allows you to adjust the order of files before combining.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Helpdezk

    Helpdezk

    HelpDEZk is a powerfull software that manages requests/incidents.

    HelpDEZk is a powerfull software that manages requests/incidents. It has all the needed requirements to an efficient workflow management of all processes involved in service execution. This control is done for internal demands and also for outsourced services. HelpDEZk can be used at any company’s area, serving as an support to the shared service center concept, beyond the ability to log all the processes and maintain the request’s history, it can pass it through many approval...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    HCS Analyzer

    HCS Analyzer

    High Content Screening Analyzer

    HCS Analyzer is an open source software dedicated to High Content Screening data processing and analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Semantic Assistants

    Natural Language Processing (NLP) for the Masses

    Semantic Assistants support users in content retrieval, analysis, and development, by offering context-sensitive NLP services directly integrated in standard desktop clients, like a word processor, and web information systems, like a wiki.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Texinfo Web Publisher

    Texinfo Web Publisher

    Multi-format web publishing system based on Texinfo

    Texinfo Web Publisher is a Makefile based publishing system featuring simultaneous con- tent creation into HTML, non-split HTML, Framed HTML, HTML Zip, XML, DocBook, PDF, DjVu, PostScript, DVI, Plain text, Info and EPUB book formats. All Texinfo Web Publisher output formats are from a single source. Texinfo Web Publisher can be used for website creation has FTP deployment capabilities and supports Cascading Style Sheets (CSS). Texinfo Web Publisher is a low maintenance solution for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DVCS-Ripper

    DVCS-Ripper

    Rip web accessible (distributed) version control systems: SVN/GIT/HG

    ...The utilities handle partial or blocked directory listings by guessing object paths and assembling missing pieces from loose files. Investigators and red-teamers use it to demonstrate the risk of deploying .git/ or .svn/ alongside web content, recovering not just code but also secrets committed by mistake. Outputs are normalized into usable repos so diffing, blame, and log analysis work as if you had pulled from the canonical remote. While powerful, it’s intended for legitimate testing and incident response, reinforcing best practices like build-time artifact generation and strict web server access rules.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    LaTeX Web Publisher

    LaTeX Web Publisher

    LaTeX Web Publisher is a Makefile based Web publishing system

    LaTeX Web Publisher is a Makefile based Web publishing system featuring content creation into HTML, non-split HTML, HTML Zip, PDF, DjVu, PostScript, DVI and Plain text formats. All LaTeX Web Publisher output formats are from a single LaTeX source and have indices. LaTeX Web Publisher can be used for website creation and has FTP deployment capabilities. A website created with LaTeX Web Publisher will have HTML, non-split HTML and PDF content formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    ETU SQL for MySQL

    ETU SQL for MySQL

    Powerful tool for data analysis and editing

    Changelog (Version 7 - Update 1) New Discard Partialy Changes New Edit Row in Window New List of User Connections New Show Image on Form New Quickly View of Cell Values Download also: 1) ETU SQL for MS SQL : https://sourceforge.net/projects/etusqltango/ 2) ETU SQL for Oracle: https://sourceforge.net/projects/etusqlfororacle/ 3) ETU SQL for DB2: https://sourceforge.net/projects/etusqlfordb2/ 4) ETU SQL for PostgreSQL: https://sourceforge.net/projects/etusqlforpostegresql/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    CMIS Input plugin for Pentaho

    CMIS Input plugin for Pentaho

    Allows querying Content Management Systems that use the CMIS.

    ...All this is possible within the Pentaho Suite, the Open Source Business Intelligence platform, which is useful to the extraction and analysis of structured and semi-structured data. With this goal (the extraction and analysis of data) has been designed and developed the CMIS Input plugin for Pentaho Data Integration (Kettle) that allows querying Content Management Systems that use the CMIS interoperability standard. The data, once extracted, can be stored and analyzed and perhaps presented in customized reports be published in various formats for the end user (PDF, Excel, etc..).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    TeleScope

    TeleScope

    XML Data Stream Broker/Replicator

    TeleScope is the efficient intensive-load XML data stream broker, replicator and simple event processing platform (SEP) written in C for the Fedora 17-18, Slackware 13-14, Red Hat Enterprise Linux 6 (RHEL-6) Linux distributions. The platform is intended to be operated upon the single number/word values and is not meant to be deployed for full-text XML stream analysis. TeleScope has internal query language with a set of standard logical operators that allows to construct relatively complex...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    RDF Content Provider for iQser GIN

    Plugin to connect RDF sources with the GIN Server

    GIN Server is a semantic middleware for easy data integration and automized analysis. The extendable architecture allows to plugin in data sources, analytics and event handling. This RDF Content Provider enables access to Semantic Web Content as an RDF file or SPAEQL endpoint.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Annoki is a set of extensions for MediaWiki that facilitates team collaboration by making the wiki easier to use and more secure.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Docgen plugin for Protege provides a quick export of all the content of an ontology (classes, instances and documentation) in various formats (html, pdf, fo...). Images, Graphs, URL are readyly included in reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This library implements self-organizing neural networks, also called Kohonen Nets. They can be used for high dimensional data analysis. Example: content based image recognition ( CBIR ).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Quantitative information engineering systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo