Getting Started

Stimulir Documentation

Stimulir is an adaptive AI stack for self-improving workflows. The console gives every team three workspaces — Engineering, Lab, and Compute — backed by the HybrIE runtime.

Latest release

v0.1.170 publishes the Python SDK import surface and keeps prompt, data, and eval paths scriptable.

The current console release makes from stimulir import StimulirClient work from application code, keeps CLI installs separate from SDK dependencies, and connects curated assets and prompt versions to Lab evaluation runs before promotion.

Two API surfaces

Stimulir exposes two complementary APIs:

  • Console platform APIhttps://api.stimulir.com. Manages workspaces, API keys, BYOK credentials, usage, and billing, and serves OpenAI-compatible inference at /api/v1/inference/chat/completions. Platform endpoints authenticate with your session token; inference authenticates with hyb_* API keys. See Authentication & Workspaces.
  • HybrIE runtime API — the engine the Lab and Compute workspaces control. It serves an OpenAI-compatible HTTP API on port 8080 (gRPC on 9090) with local inference (Qwen3 / Qwen3-Coder via Candle on Metal or CUDA), training, evaluation, adapters, and compute orchestration. In BYOC deployments you run this runtime on your own nodes.

Latest capabilities

Explore the docs

Next steps

  • Follow the Quickstart to install the CLI, create an API key, and make your first inference call.
  • Use the Python SDK or prompts and data assets commands to seed client prompts, ingest traces, stage datasets, and create Lab-ready snapshots.
  • Run Lab evals against prompt versions, staged data, inference endpoints, adapters, and RL policies before promotion.