DOC · MICRODC-DEV-001v3.4 · BETA

Submit LLM, embedding, document, or container jobs asynchronously. Get results at a fraction of the cost of dedicated GPU instances. No infrastructure to manage, no minimum commitment, pay per compute-second.

Everything you need to integrate.

§01 · CAPABILITIES
CAP · 01

Simple REST API

Clean, well-documented endpoints. Integrates with any language or framework.

CAP · 02

Pay per use

Only pay for compute time. No idle costs, no minimum commitments.

CAP · 03

Batch processing

Submit thousands of jobs at once. Built for data pipelines and bulk work.

CAP · 04

100+ models

Llama, Mistral, Qwen, and more. All major open-source LLMs.

§02 · QUICK START

Get started in minutes.

Install the Python SDK, point it at MicroDC.ai, submit a job. Poll, stream, or hand us a webhook.

Python SDK Get API Key →
from microdc import MicroDC

client = MicroDC(api_key="your-api-key")

# Submit an inference job
job = client.submit_job(
    model="llama-3.1-8b",
    prompt="Explain quantum computing in simple terms",
    max_tokens=500,
)

# Check job status
status = client.get_job(job.id)
print(f"Status: {status.state}")

# Get results when ready
result = client.wait_for_result(job.id)
print(result.output)

Four job types, one API.

§03 · JOB TYPES
TypeWhat it runsRoutingPricing
llmChat completions on any supported LLMBy model nameToken-based
embedVector embeddings for RAG, search, classificationBy model nameToken-based
documentSummarize, extract, analyze uploaded filesBy model nameFlat-rate
containerAny Docker image with your script or codeBy capability (docker)GPU-hour or CPU-core-hour
§04 · OPENAI-COMPATIBLE

Drop-in replacement for the
OpenAI client.

Point your existing openai client at MicroDC.ai and keep your code. Multimodal content lists supported. Works with LangChain, LlamaIndex, Instructor, and any OpenAI-shaped library.

  • No code rewrite — just change base_url
  • Multimodal content lists (text + image parts)
  • Async-native under the hood
from openai import OpenAI

client = OpenAI(
    api_key="your-microdc-key",
    base_url="https://api.microdc.ai/v1",
)

resp = client.chat.completions.create(
    model="llama-3.1-8b",
    messages=[
        {"role": "user",
         "content": "Summarize quantum tunneling."}
    ],
)
print(resp.choices[0].message.content)
§05 · CONTAINER JOBS

Bring your own Docker image.

Submit any image. Upload script files (.py, .sh, .js, .ts, .go, .rs, .java...) as inputs. Live log streaming via per-job heartbeat. Routes only to workers advertising the docker capability.

  • Batch ETL · scientific compute · headless browsers
  • Video transcoding · custom ML pipelines
  • GPU-hour or CPU-core-hour billing — your call
job = client.submit_job(
    type="container",
    model="my-registry/pdf-extract:v2",
    payload={
        "image": "my-registry/pdf-extract:v2",
        "args": ["--input", "/data/report.pdf"],
        "env": {"LOG_LEVEL": "info"},
    },
    files=["report.pdf"],
)

# Stream logs while it runs
for line in client.stream_logs(job.id):
    print(line)

result = client.wait_for_result(job.id)

End-to-end encrypted, zero-knowledge.

§06 · ENCRYPTION

The server never sees your prompts or results. Per-job symmetric keys, public-key result encryption, automatic key destruction on acknowledgment. For regulated industries, IP-sensitive workflows, and anyone who simply doesn't want their prompts logged.

STEP 01

You encrypt

Client generates a per-job symmetric key and encrypts the payload. A public key is sent alongside so results return encrypted to you.

STEP 02

Worker decrypts & runs

Only workers with the encryption capability can claim. They receive the key on claim, decrypt in memory, run the job.

STEP 03

Result re-encrypted

Worker encrypts the result with your public key and submits. You decrypt with your private key. Per-job keys deleted on acknowledgment.

§07 · BUILD

Ship your first job.

Free credits to start. No credit card. No minimum. Full API and SDK access from day one.