An asynchronous job queue spanning a continent of GPUs. Submit work, receive results. Built for enterprise batch inference — document pipelines, dataset enrichment, overnight runs — at a fraction of dedicated-cluster cost.
Point your base_url at MicroDC.ai and submit asynchronously.
Poll, stream, or hand us a webhook: your call. End-to-end encrypted payloads are available for sensitive workloads.
import os

from microdc import Client

client = Client(api_key=os.environ["MDC_KEY"])

# Submit asynchronously and get a job id back.
job = client.jobs.create(
    model="llama-3.3-70b",
    messages=[{"role": "user", "content": "..."}],
    encrypt=True,  # zero-knowledge
    webhook="https://api.acme.co/done",
)

# Or poll. Or stream. Your call.
result = client.jobs.wait(job.id)
print(result.choices[0].message.content)
curl https://api.microdc.ai/v3/jobs \
  -H "Authorization: Bearer $MDC_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.3-70b",
    "messages": [{"role":"user","content":"..."}],
    "encrypt": true,
    "webhook": "https://api.acme.co/done"
  }'
# Returns: { "id": "J-a1b2c3", "status": "queued", ... }

# Fetch the job by id to check its status.
curl https://api.microdc.ai/v3/jobs/J-a1b2c3 \
  -H "Authorization: Bearer $MDC_KEY"
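If you poll rather than use a webhook, a backoff loop keeps request volume low on long-running jobs. The sketch below is illustrative only: `wait_for_job`, its parameters, and the terminal status names are assumptions (the only status shown above is "queued"), and `fetch_status` stands in for whatever call you use to GET the job.

```python
import time
from typing import Callable

# Assumed terminal status names; only "queued" appears in the example response.
TERMINAL = {"completed", "failed", "cancelled"}


def wait_for_job(
    fetch_status: Callable[[], dict],
    initial_delay: float = 1.0,
    max_delay: float = 30.0,
    timeout: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll until the job reaches a terminal status, doubling the delay each round."""
    delay = initial_delay
    waited = 0.0
    while True:
        job = fetch_status()
        if job.get("status") in TERMINAL:
            return job
        # Give up before sleeping past the overall timeout.
        if waited + delay > timeout:
            raise TimeoutError(f"job still {job.get('status')!r} after {waited:.0f}s")
        sleep(delay)
        waited += delay
        delay = min(delay * 2, max_delay)
```

Pass in any zero-argument callable that returns the job as a dict, for example a small wrapper around the GET request shown above. The `sleep` parameter exists so the loop can be exercised without real waiting.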
REST, Python, OpenAI-compatible. Submit jobs, get results. Encrypted payloads for sensitive workloads. Custom-amount credits: no minimums, no monthly fees.
Pre-negotiated capacity tiers, dedicated worker groups, private VPC routing, SLA-backed throughput. We're onboarding significant compute to handle medium-to-large customers.
Connect a worker in five minutes. Context-tier routing means low-spec hardware still earns. Multi-GPU and concurrent jobs supported. Payouts in credits or PayPal.
POST your job — model, payload, optional encryption keys, optional webhook.
Our scheduler matches the job to a worker tier with the right VRAM, locality, and price ceiling.
A worker pulls, runs, and returns. Streaming token deltas where applicable.
Webhook fires, or your client retrieves. Receipt-of-compute logged for billing transparency.
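To make the matching step concrete, here is a minimal sketch of picking a worker tier by VRAM, locality, and price ceiling. It is not the MicroDC.ai scheduler: the `WorkerTier` and `Job` types, their field names, the price unit, and the tie-breaking rule (prefer the job's region, then the lowest price) are all assumptions for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class WorkerTier:
    name: str
    vram_gb: int
    region: str
    price: float  # hypothetical unit, e.g. credits per million tokens


@dataclass(frozen=True)
class Job:
    min_vram_gb: int
    preferred_region: str
    price_ceiling: float


def match_tier(job: Job, tiers: list[WorkerTier]) -> Optional[WorkerTier]:
    """Return the best eligible tier, or None if nothing fits."""
    # Hard constraints: enough VRAM and a price at or under the ceiling.
    eligible = [
        t for t in tiers
        if t.vram_gb >= job.min_vram_gb and t.price <= job.price_ceiling
    ]
    if not eligible:
        return None
    # Soft preference: same-region tiers first, then cheapest.
    return min(eligible, key=lambda t: (t.region != job.preferred_region, t.price))
```

Treating VRAM and price as hard filters and locality as a preference is one reasonable reading of the step above; a real scheduler may weigh them differently.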
Summarize tens of thousands of PDFs, extract structure from contracts, normalize OCR output. Queue at 2am, deliver at 6am.
Run grids of experiments across model × prompt × dataset. No reservation, no spin-up time, no idle burn.
Classify, tag, embed, and score records at warehouse scale. Webhook back into your ETL.
Personalized summaries, briefs, and translations. Rate-limit-free batches.
Plug LLM steps into Airflow, Temporal, or n8n. Idempotent retries, signed receipts.
Submit during business hours; results are on your desk the next morning. Cheapest tier.
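For the pipeline use case above, idempotent retries and signed receipts can be sketched as follows. The scheme here is assumed, not documented: it derives an idempotency key by hashing the canonical request and checks a receipt with HMAC-SHA256 over the raw body. The function names, the shared secret, and the hex-encoded signature format are all hypothetical.

```python
import hashlib
import hmac
import json


def idempotency_key(model: str, messages: list) -> str:
    """Derive a stable key so a retried submission can be recognized as a duplicate."""
    canonical = json.dumps(
        {"model": model, "messages": messages},
        sort_keys=True,
        separators=(",", ":"),
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def sign(body: bytes, secret: bytes) -> str:
    """Compute a hex HMAC-SHA256 signature (assumed scheme)."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify_receipt(body: bytes, signature_hex: str, secret: bytes) -> bool:
    """Check a receipt signature using a constant-time comparison."""
    return hmac.compare_digest(sign(body, secret), signature_hex)
```

In an Airflow or Temporal task, the key would be computed once per logical request and reused on every retry, and the receipt would be verified against the exact bytes received before the result is recorded.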
No credit card to start. No monthly minimum. Free tier for evaluation, custom contracts for production. We're shipping reserved-capacity tiers for medium-to-large enterprise workloads this quarter — talk to us if you're sizing.