Showing 85 open source projects for "python data analysis"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Sales CRM and Pipeline Management Software | Pipedrive Icon
    Sales CRM and Pipeline Management Software | Pipedrive

    The easy and effective CRM for closing deals

    Pipedrive’s simple interface empowers salespeople to streamline workflows and unite sales tasks in one workspace. Unlock instant sales insights with Pipedrive’s visual sales pipeline and fine-tune your strategy with robust reporting features and a personalized AI Sales Assistant.
    Try it for free
  • 1
    **CODE MOVED TO GITHUB: https://github.com/bitextor ** Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    IdeoType is a book compiler that converts manuscript (XHTML) to book (PDF) on the fly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    EasyHTML

    A python package for building DOM of the HTML documents

    A python package that provides an easy access to elements of HTML and XHTML documents through the Document Object Model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 5

    dpanalyzer

    postprocessing tool for Project Gutenberg Distributed Proofreaders

    Specialized tool for PostProcessors of books produced by Project Gutenberg Distributed Proofreaders. Parses the markup structure of a project file out of the formatting rounds; reports about the text structure found, and identifies markup errors. Planned future features: generation of normalized dp output by rejoining split paragraphs and moving around footnotes, renumbering of pages; conversion to basic LaTeX and basic HTML markup for further processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    htmlarea

    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript, and suitable for use in any modern web browser, and any page on your web site. Current version is 4.0-2016-08-29
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Gumbo

    Gumbo

    An HTML5 parsing library in pure C99

    Gumbo is an implementation of the HTML5 parsing algorithm implemented as a pure C99 library with no outside dependencies. It's designed to serve as a building block for other tools and libraries such as linters, validators, templating languages, and refactoring and analysis tools. Gumbo gains some of this by virtue of being written in C, but it is not an important consideration for the intended use-case, and was not a major design factor. Gumbo is intentionally designed to turn an HTML document...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Markup.py is an intuitive, light-weight and customizable HTML/XML generator in Python. Development moved to github in 2017 and the name changed to MarkupPy in order to be more pip friendly. See http://github.com/tylerbakke/MarkupPy for all further details (documentation, new releases, etc.). This sourceforge page is left as a historic relic and won't be updated.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    WaTeX is a LaTeX HTML document renderer. Unlike other LaTeX to HTML translators which try to understand TeX, this takes a more high-level approach. You will not be able to convert very complex documents, but it is extensible and produces nice output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    GC-GeoStats

    Statistiques de profil pour géocacheur

    Génération de statistiques et autres badges pour incorporer dans le profil public sur geocaching.com
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    SQLDOM

    HTML parser and DOM-related procedures for Microsoft T-SQL

    SQLDOM is an easy and robust way to parse HTML directly into SQL tables, manipulate DOM nodes in a JQuery-like manner, and to render HTML from the SQL-based DOM. SQLDOM is written entirely in native T-SQL, and uses only temporary database objects (tempdb). No changes to user databases are required.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Wiko, the wiki compiler, compiles wiki like files into html and LaTeX, combining easy wiki syntax, your preferred non-web text editor and svn/cvs control to write static webs, cientific articles or even blogs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    entdec

    entdec

    This is the simple program lets you decode file, conains html entities

    ..., ed:"<©>". It isn't easy to edit such simbols code. Using this program, you may do this sequence of characters easy to manualy editing. In our example, you get"<©>". Usage: endec input_file output_file Programs gets only two parametres: input and output file names. Working, it indicates currient status and result of work. Tecnical details: This is a simple fixed buffer file processor. It reads and analysis every string for right entities and correct it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Aurora Application Server is a new Python Web Application Server and Framework. The main goal of the project is to provide the developer with a complete set of tools to speed up the application development process. See project wiki for more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    HTML DOM Parser

    HTML parser which can be used for screen-scraping applications

    htmldom parses the HTML file and provides methods for iterating and searching the parse tree in a similar way as Jquery. To report bugs please mail me at bhimsen.pes@gmail.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Angosso

    Angosso

    Performance and stability

    Develop: domain secure, and performing secure Domain Name System (DNS) dynamic updates. Domain Name System Security Extensions Servlet API Package The javax.servlet.http package contains a number of classes and interfaces that describe and define the contracts between a servlet class running under the HTTP protocol and the runtime environment provided for an instance of such a class by a conforming servlet container.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Html Assembler
    Html Assembler is a static site generator. It automatically integrates page content such as text and photos in a modifiable page template creating a complete set of html files ready for upload to your site.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    A python3 lib rendering html.parser into stack and callback models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A tool for autonomous and virtual topical data integration using the focused web-harvesting method.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Efficent WEB UI for wget utility written in python(twisted) . It's work without any web server. Script consists of a SINGLE file
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    XUProxy is an extensible multi-protocol proxy based on the Twisted framework. It supports multiple protocol plugins (currently only HTTP), and multiple "filter" plugins for things like logging, caching, and Proxomitron-compatible ad filtering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    HTTP functional and non-functional (load and performance) toolkit based on jython/grinder (http://grinder.sf.net) ...includes capabilities to support: SOA services, REST, json/xml encoding, AES and WS security ... and a stub to collect requests
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A python package to find repetitive format pattern in HTML pages and extract information from them using this pattern. The idea is that in pages that have some kind of a list, there will be a repetitive pattern for the human eye (the page format).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    now here: https://github.com/plastex/plastex plasTeX is a Python-based LaTeX document processing framework. It gives DOM-like access to a LaTeX document, as well as the ability to generate mulitple output formats (e.g. HTML, DocBook, tBook, etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.