A gradio web UI for running Large Language Models like LLaMA
The Triton Inference Server provides an optimized cloud
Build AI-powered semantic search applications
State-of-the-art Multilingual Question Answering research
API for the GPT-J language mode. Including a FastAPI backend
Aseryla code repositories
A multi-modeling and simulation environment to study complex systems
Aims to enable researcher to tap in to mobile computing capability