Data manipulation and transformation for audio signal processing
From Paper to Presentation in One Click
Chinese and English multimodal conversational language model
The library to build & auto-optimize LLM applications
PyTorch3D is FAIR's library of reusable components for deep learning
Automate native Android apps with AI using accessibility APIs
Contexts Optical Compression
Generate audiobooks from e-books
No-code LLM Platform to launch APIs and ETL Pipelines
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Elyra extends JupyterLab with an AI centric approach
Foundation model for image generation
Benchmarking Multimodal Agents for Open-Ended Tasks
Gracefully face hCaptcha challenge with multimodal llms
Zero-code platform for building AI agents from natural language input
Open-source platform for building enterprise-grade agents
Phi-3.5 for Mac: Locally-run Vision and Language Models
Gemma open-weight LLM library, from Google DeepMind
ComfyUI wrapper nodes for HunyuanVideo
Python package for AutoML on Tabular Data with Feature Engineering
Multilingual Document Layout Parsing in a Single Vision-Language Model
An on-premises, OCR-free unstructured data extraction
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A frontier, first-principles handbook
Marrying Grounding DINO with Segment Anything & Stable Diffusion