align free download - SourceForge

Showing 36 open source projects for "align"

View related business solutions

Python Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
1

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models

DataDreamer is a tool designed to assist in the generation and manipulation of synthetic data for various applications, including testing and machine learning.

Downloads: 0 This Week

Last Update: 2025-02-02
See Project
2

WhisperJAV

Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

...Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. WhisperJAV introduces a specialized pipeline that separates text generation from timestamp alignment, allowing the system to generate transcripts and then align them with audio using forced alignment techniques. The framework supports several speech recognition models, including Qwen-based ASR systems and fine-tuned Whisper models trained on domain-specific dialogue.

Downloads: 22 This Week

Last Update: 2026-05-11
See Project
3

UI UX Pro Max

AI SKILL that provide design intelligence

...It uses an AI reasoning engine to generate complete design systems tailored to project requirements, recommending layouts, typography, colors, spacing, and component structures automatically based on natural language prompts. Users can ask for specific UI/UX tasks or design patterns, and the skill will produce guidelines, code snippets, and responsive implementation suggestions that align with industry best practices and accessibility standards. It supports a broad range of tech stacks including HTML/Tailwind, React, Vue, mobile UI frameworks, and more, making it versatile for designers and developers alike. The project also includes CLI tooling and template generators that streamline scaffold creation and design system persistence across sessions.

Downloads: 22 This Week

Last Update: 14 hours ago
See Project
4

/last30days

Claude Code skill that researches any topic across Reddit + X

...Rather than returning generic model responses, it intelligently analyzes social media and community discussions to identify what’s genuinely trending or working in practice across topics ranging from prompt techniques to tool usage or cultural trends. This makes it particularly useful for prompt engineers, content creators, and developers who want up-to-date prompts and insights that align with the most recent consensus and shared best practices in fast-moving fields like AI tooling.

Downloads: 10 This Week

Last Update: 7 days ago
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training

MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
6

abogen

Generate audiobooks from EPUBs, PDFs and text with captions

...In other words, it automates the pipeline of reading a digital book (or document), converting its text into speech via a TTS engine, and packaging the result into an audiobook format — likely along with timestamped captions or subtitles that align with the spoken audio. This can be very useful for accessibility, content consumption on the go, or for users who prefer audio over reading. The repository supports handling common ebook formats and generating outputs that combine audio plus caption metadata. By automating text-to-speech for arbitrary documents, abogen reduces the friction of producing audiobooks and could be integrated into larger workflows (e.g., batch converting a library of texts).

Downloads: 7 This Week

Last Update: 2026-02-06
See Project
7

GLM-4.6

Agentic, Reasoning, and Coding (ARC) foundation models

...Its reasoning capabilities have been strengthened, including improved tool usage during inference and more effective integration within agent frameworks. GLM-4.6 also enhances writing quality, producing outputs that better align with human preferences and role-playing scenarios. Benchmark evaluations demonstrate that it not only outperforms GLM-4.5 but also rivals leading global models such as DeepSeek-V3.1-Terminus and Claude Sonnet 4.

Downloads: 23 This Week

Last Update: 2026-02-01
See Project
8

GEO Content Writer

Backlog-row-first content production system for teams

GEO Content Writer is an AI-based content generation tool designed to create optimized content tailored for geographic and semantic search contexts. It focuses on producing articles, pages, and structured content that align with both traditional SEO requirements and emerging AI search patterns. The system leverages language models to generate content that is context-aware, location-specific, and optimized for discoverability. It supports automated workflows for generating large volumes of content while maintaining consistency and relevance. The tool is particularly useful for businesses targeting local markets or region-specific audiences. ...

Downloads: 0 This Week

Last Update: 2026-07-06
See Project
9

OpenLIT

OpenLIT is an open-source LLM Observability tool

...Whether you're working with popular LLM providers such as OpenAI and HuggingFace, or leveraging vector databases like ChromaDB, OpenLIT ensures your applications are monitored seamlessly, providing critical insights including GPU performance stats for self-hosted LLMs to improve performance and reliability. This project proudly follows the Semantic Conventions of the OpenTelemetry community, consistently updating to align with the latest standards in observability.

Downloads: 0 This Week

Last Update: 2026-07-15
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

LatentSync

Taming Stable Diffusion for Lip Sync

LatentSync is an open-source framework from ByteDance that produces high-quality lip-synchronization for video by using an audio-conditioned latent diffusion model, bypassing traditional intermediate motion representations. In effect, given a source video (with masked or reference frames) and an audio track, LatentSync directly generates frames whose lip motions and expressions align with the audio, producing convincing talking-head or animated lip-sync output. The system leverages a U-Net diffusion backbone, with cross-attention of audio embeddings (via an audio encoder) and reference video frames to guide generation, and applies a set of loss functions (temporal, perceptual, sync-net based) to enforce lip-sync accuracy, visual fidelity, and temporal consistency. ...

Downloads: 3 This Week

Last Update: 2025-12-02
See Project
11

SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow

...The system uses a combination of convolutional neural networks and recurrent neural networks to extract visual features and model sequential character patterns in handwriting. It also employs connectionist temporal classification (CTC) to align predicted character sequences with input images without requiring character-level segmentation. The repository provides code for training models, performing inference on handwritten text images, and evaluating recognition accuracy. SimpleHTR is commonly used as an educational example for understanding how modern handwriting recognition systems operate.

Downloads: 0 This Week

Last Update: 2026-06-25
See Project
12

EPLB

Expert Parallelism Load Balancer

EPLB is DeepSeek’s open implementation of a load balancing algorithm designed for expert parallelism (EP) settings in MoE architectures. In EP, different “experts” are mapped to different GPUs or nodes, so load imbalance becomes a performance bottleneck if certain experts are invoked much more often. EPLB solves this by duplicating heavily used experts (redundancy) and then placing those duplicates across GPUs to even out computational load. It uses policies like hierarchical load balancing...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
13

geo-seo-claude

GEO-first SEO skill for Claude Code

...It leverages AI to generate location-specific content tailored to different regions, allowing users to scale SEO efforts across multiple cities or markets without manual content creation. The system focuses on producing structured and keyword-optimized pages that align with search engine ranking factors, including localized relevance and semantic context. It is particularly useful for agencies, marketers, and businesses that need to manage large volumes of localized landing pages efficiently. Geo SEO Claude can integrate with existing content pipelines, enabling automated generation and deployment of SEO assets. ...

Downloads: 1 This Week

Last Update: 2026-05-27
See Project
14

verl

Volcano Engine Reinforcement Learning for LLMs

VERL is a reinforcement-learning–oriented toolkit designed to train and align modern AI systems, from language models to decision-making agents. It brings together supervised fine-tuning, preference modeling, and online RL into one coherent training stack so teams can move from raw data to aligned policies with minimal glue code. The library focuses on scalability and efficiency, offering distributed training loops, mixed precision, and replay/buffering utilities that keep accelerators busy. ...

Downloads: 1 This Week

Last Update: 2026-06-01
See Project
15

HunyuanWorld 1.0

Generating Immersive, Explorable, and Interactive 3D Worlds

...The architecture integrates panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction to produce high-quality scene-scale 3D worlds from both text and images. HunyuanWorld-1.0 surpasses existing open-source methods in visual quality and geometric consistency, demonstrated by superior scores in BRISQUE, NIQE, Q-Align, and CLIP metrics.

Downloads: 1 This Week

Last Update: 2026-04-15
See Project
16

The Alignment Handbook

Robust recipes to align language models with human and AI preferences

The Alignment Handbook is an open-source resource created to provide practical guidance for aligning large language models with human preferences and safety requirements. The project focuses on the post-training stage of model development, where models are refined after pre-training to behave more helpfully, safely, and reliably in real-world applications. It provides detailed training recipes that explain how to perform tasks such as supervised fine-tuning, preference modeling, and...

Downloads: 0 This Week

Last Update: 2026-03-08
See Project
17

USO

Open-sourced unified customization model

USO is ByteDance’s “Unified Style and Subject-Driven Generation” framework, open-sourced to allow customization in generative modeling by disentangling style and subject representation and using reward learning to guide generation. The system is designed such that users can control both “what” is generated (the subject: e.g. a person, object, scene) and “how” it is generated (the style: artistic style, color palette, aesthetic) separately, giving much more flexibility than conventional...

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
18

DreamO

A Unified Framework for Image Customization

...DreamO’s design introduces a feature routing constraint that helps disentangle different control conditions (like identity, style, clothing) when more than one is specified, which significantly reduces conflicts and artifacts when combining controls. It also uses a “placeholder strategy” to precisely align conditional inputs (e.g. where to place clothing or objects) in generated images, giving users fine-grained control over composition.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
19

Qwen2.5-Omni

Capable of understanding text, audio, vision, video

Qwen2.5-Omni is an end-to-end multimodal flagship model in the Qwen series by Alibaba Cloud, designed to process multiple modalities (text, images, audio, video) and generate responses both as text and natural speech in streaming real-time. It supports “Thinker-Talker” architecture, and introduces innovations for aligning modalities over time (for example synchronizing video/audio), robust speech generation, and low-VRAM/quantized versions to make usage more accessible. It holds...

Downloads: 0 This Week

Last Update: 2025-09-23
See Project
20

SoniTranslate

Synchronized Translation for Videos

...It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets the generated dub track stay in sync with the original video structure. The project supports a wide range of languages for translation, spanning major world languages (English, Spanish, French, German, Chinese, Arabic, etc.) and many regional or less widely spoken languages, making it suitable for broad internationalization. ...

Downloads: 16 This Week

Last Update: 2025-11-28
See Project
21

LM Human Preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

lm-human-preferences is the official OpenAI codebase that implements the method from the paper Fine-Tuning Language Models from Human Preferences. Its purpose is to show how to align language models with human judgments by training a reward model from human comparisons and then fine-tuning a policy model using that reward signal. The repository includes scripts to train the reward model (learning to rank or score pairs of outputs), and to fine-tune a policy (a language model) with reinforcement learning (or related techniques) guided by that reward model. ...

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
22

OptiMate

Libraries for optimizing AI models, inference speed, and GPU usage

...It groups several internal optimization tools developed by Nebuly AI into a single repository that focuses on improving inference speed, reducing infrastructure usage, and streamlining model training workflows. Its modules help developers automatically apply optimization techniques that better align AI models with the capabilities of the underlying hardware such as GPUs and CPUs. One of the core components, Speedster, focuses on accelerating model inference by applying state of the art optimization techniques to increase performance while lowering operational costs. Another component, Nos, targets infrastructure optimization by improving GPU utilization in Kubernetes clusters through dynamic partitioning and elastic resource quotas.

Downloads: 0 This Week

Last Update: 3 hours ago
See Project
23

Text-ly

Text.ly - An alternative for Notepad.

LOOKING FOR Text Editor? You've Come At The Right Place! Editing Your text for your simplicity A Text editor for Editing Text....! Just download and install and use as an alternative for typical Notepad. This application is compiled from the Pyinstaller library so don't mind there is a vulnerability or something the antivirus program might show it as malware or trojan this happens with most of the apps compiled from the Pyinstaller library. So No worries There is not any malware or virus...

Downloads: 2 This Week

Last Update: 2022-01-29
See Project
24

XLM (Cross-lingual Language Model)

PyTorch original implementation of Cross-lingual Language Model

XLM (Cross-lingual Language Model) is a family of multilingual pretraining methods that align representations across languages to enable strong zero-shot transfer. It popularized objectives like Masked Language Modeling (MLM) across many languages and Translation Language Modeling (TLM) that jointly trains on parallel sentence pairs to tighten cross-lingual alignment. Using a shared subword vocabulary, XLM learns language-agnostic features that work well for classification and sequence labeling tasks such as XNLI, NER, and POS without target-language supervision. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
25

FixRes

Reproduces results of "Fixing the train-test resolution discrepancy"

FixRes is a lightweight yet powerful training methodology for convolutional neural networks (CNNs) that addresses the common train-test resolution discrepancy problem in image classification. Developed by Facebook Research, FixRes improves model generalization by adjusting training and evaluation procedures to better align input resolutions used during different phases. The approach is simple but highly effective, requiring no architectural modifications and working across diverse CNN backbones such as ResNet, ResNeXt, PNASNet, and EfficientNet. FixRes demonstrates that a mismatch between training and testing resolutions often leads to suboptimal accuracy, and fine-tuning the classifier and batch normalization layers at higher test resolutions significantly enhances performance. ...

Downloads: 3 This Week

Last Update: 4 days ago
See Project