Sophon Forge | Sophon

Sophon's hosted, OpenAI-compatible LLM inference — managed models, API keys, and usage tracking on the Sophon Platform.

Sophon Forge is the hosted inference service of the Sophon Platform. It gives you an OpenAI-compatible API to a managed set of language models, so you can run Sophon — or any app — without managing your own model keys or GPU infrastructure. Forge handles hosting, scaling, API keys, rate limits, and usage tracking for you.

Create a Forge account at platform.sophon.buildersoft.io. The Free tier needs no credit card.

What Forge offers

OpenAI-compatible API — drop-in /v1/chat/completions, /v1/embeddings, and /v1/models endpoints. Point any OpenAI-compatible client at Forge by changing the base URL and API key.
Managed models — a curated lineup of hosted models across speed/quality tiers, plus premium models on higher plans. Hosting is handled server-side, so models can be upgraded without changing your code.
API key management — create, rotate, and revoke keys per workspace, with separate Live and Test environments. Keys are shown once and stored hashed.
Workspaces — keys, usage, and plans are scoped to a workspace, so teams can share access with Owner/Member roles.
Usage tracking — a dashboard showing current-month token usage by model and key, plus historical daily aggregates.
Plans & quotas — monthly token allowances and per-minute request limits enforced per plan, with rate-limit headers on every response.

Plans & limits

Every account starts on the Free plan automatically. Higher plans raise the monthly token allowance and request rate, and unlock premium models.

	Free	Starter	Pro	Enterprise
Monthly tokens	100K	1M	10M	Unlimited
Requests / minute	10	30	60	120
Premium models	—	—	Included	Included

Current pricing and plan availability are shown on the Forge dashboard.

Getting started

1. Create an account

Sign up at platform.sophon.buildersoft.io and verify your email. A personal workspace is created for you automatically.

2. Create an API key

Open your workspace and go to Forge → API Keys
Click Create API Key, give it a name, and choose Live or Test
Copy the key (format forge_sk_live_...) — it is shown only once

3. Call the API

Forge speaks the OpenAI protocol. Use your key as a Bearer token against the Platform base URL:

curl https://api.platform.sophon.buildersoft.io/v1/chat/completions \
  -H "Authorization: Bearer forge_sk_live_..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "<model-id>",
    "messages": [{ "role": "user", "content": "Hello" }]
  }'

List the model ids available on your plan with GET /v1/models, or browse them in the dashboard. Responses include rate-limit headers — X-RateLimit-Remaining, X-RateLimit-Reset, and X-Forge-Tokens-Remaining — so clients can back off before hitting a limit.

Use Forge in Sophon

Forge is available in the Sophon app as the sophon-forge model provider, so you can run your agents on Forge-hosted models instead of bringing your own provider key.

In the Dashboard, go to Settings → Models & Providers → Add Provider
Choose Sophon Forge
Paste your Forge API key (and, if prompted, the base URL https://platform.sophon.buildersoft.io)
Test the connection, then set its priority and budget

See the full provider list in Supported Providers and general model setup in Configuration.

Sophon ForgeNEW