DOC · MICRODC-FAQ-001v3.4 · BETA

Three audiences, one platform. Jump to your section below.

For business leaders.

§01 · BUSINESS
1.1 · COMPANY & PLATFORM

What is AI inference and why does it matter?

AI inference is the process of using a trained AI model to generate outputs from new inputs — running the AI to get answers, summaries, or predictions.

Examples: asking ChatGPT a question, summarizing a 50-page document, classifying support tickets, generating product descriptions, extracting info from contracts.

Inference requires powerful GPU hardware. At scale, costs add up quickly — which is exactly the problem MicroDC.ai solves. If your business processes text or data using AI at scale, you're paying for inference. We help you pay up to 90% less.

What is MicroDC.ai and what problem does it solve?

MicroDC.ai is a distributed AI inference platform — a two-sided marketplace connecting businesses that need AI compute with individuals and organizations that have idle GPU resources.

Problem: AI inference costs are skyrocketing. Major cloud providers charge premium prices and building internal infrastructure requires significant CapEx.

Solution: We aggregate distributed GPU resources globally, providing enterprise-grade inference at up to 90% lower cost than traditional cloud, with no infrastructure management.

Who founded MicroDC.ai?

MicroDC.ai was founded by Ray Sisson (CEO) and Jeffrey Rivero (CTO), with Neil Quarterman serving as CFO.

  • Ray Sisson — MBA from Chicago Booth, JD from Georgetown. Transportation, infrastructure, and corporate board leadership.
  • Jeffrey Rivero — 25+ years in cybersecurity and AI/ML, CISSP. Built platforms handling 1M+ events/sec, AI for the US military.
  • Neil Quarterman — Yale and Georgia Tech MBA. Managed financials through 140% growth and M&A across 11 entities in 8 countries.

What differentiates MicroDC.ai from AWS, Google Cloud, or Azure?

FactorMicroDC.aiMajor Cloud
Business modelMarketplace (asset-light)Infrastructure owner
Cost savingsUp to 90% lowerPremium pricing
Processing modelAsynchronous batchReal-time (expensive)
Minimum commitmentPay-per-use, no minimumOften required
Setup timeMinutesHours to days
1.2 · BUSINESS VALUE & ROI

What cost savings can we expect?

Organizations typically see 50–90% cost reduction compared to traditional cloud GPU instances:

  • Asynchronous model eliminates idle compute charges
  • Zero cost for DevOps, maintenance, or scaling
  • Pay-per-use — only for tokens processed
  • Scale from $10 to $10,000/month seamlessly

Example: A document-processing company processing 100,000 documents/month saved 78% by switching from dedicated GPU instances to MicroDC.ai's batch processing.

What are the ideal use cases?

Ideal: document summarization, content generation at scale, data enrichment, research/batch analytics, overnight processing, report generation.

Consider alternatives for: sub-second real-time chatbots, interactive user-facing AI, streaming responses, mission-critical real-time systems.

How does pricing and billing work?

Simple credit-based system. 1 credit = $0.01 USD. Welcome bonus credits for new accounts. No setup, monthly, or platform fees. Per-job pricing based on model complexity and tokens processed.

Tiers: Economy (lower cost, flexible timing) · Standard (balanced) · Premium (priority).

1.3 · SECURITY & COMPLIANCE

How is our data protected on a distributed network?

Security is foundational. Our CTO is CISSP-certified with 25+ years in cybersecurity:

  • End-to-end encryption (TLS 1.3 in transit, encrypted at rest)
  • Worker isolation — workers can't see customer identities or other jobs
  • Data minimization — only job payloads sent to workers
  • Automatic deletion of job data within configurable retention
  • Complete audit trails of all transactions and access
  • JWT auth with short-lived tokens

For zero-knowledge guarantees, opt into end-to-end encrypted jobs — the server never sees plaintext.

What compliance certifications do you have?

Current: security-first architecture from CISSP-certified leadership, regular audits and pen tests, secure development lifecycle.

Planned: SOC 2 Type II, ISO 27001, GDPR documentation. Contact us for current compliance status.

Can we use MicroDC.ai for sensitive or regulated data?

For sensitive workloads we recommend: anonymize/pseudonymize before processing, remove PII when possible, use retention settings to minimize exposure, consider end-to-end encrypted jobs (zero-knowledge), evaluate dedicated worker pools.

For HIPAA, PCI-DSS, or regulated workloads, contact sales to discuss dedicated infrastructure options.

1.4 · ENTERPRISE SUPPORT

Do you offer enterprise agreements and volume pricing?

Yes — customized enterprise packages: volume discounts, priority processing, SLA guarantees, dedicated support, custom integration, NET-30 invoice billing for qualified customers. Email sales@microdc.ai.

What SLAs and uptime guarantees do you provide?

Platform availability: 99.9% API uptime target.

Job processing SLA by tier:

  • Economy — best-effort, typically < 4 hours
  • Standard — 95% completed within 1 hour
  • Premium — 99% completed within 15 minutes

Enterprise customers receive customized SLAs with credits for missed targets.

How can we become a GPU provider?

Connect resources on your schedule. Workers receive a generous revenue share. Real-time earnings dashboard. Multiple payout options (credits with bonus, PayPal cash). See the Providers page.

1.5 · ROADMAP & BLOCKCHAIN

What is MicroDC.ai's blockchain integration strategy?

We're developing blockchain integration for transparency, trust, and decentralized payments:

  • Transparent job verification — immutable record of completed work and payments
  • Smart-contract payments — automated, trustless payouts to providers
  • Decentralized reputation — on-chain worker scores
  • Token-based incentives — potential utility token for participation
  • Cross-border payments via crypto

Traditional payment methods will remain available alongside any blockchain features.

How will blockchain benefit enterprise customers?

  • Auditable compute — cryptographic proof of work for compliance
  • Cost transparency — verifiable pricing
  • Dispute resolution — smart-contract arbitration
  • Global settlement without banking delays
  • Tamper-proof logs of all platform interactions

What is MicroDC.ai's product roadmap?

Now: Batch inference, 100+ models, credit system, web dashboard.

Next: Streaming responses, custom model hosting, enhanced analytics.

Future: On-chain verification, token payments, global expansion.

For engineers & developers.

§02 · ENGINEERS
2.1 · RECENT FEATURES

Can I use the OpenAI SDK with MicroDC.ai?

Yes — we provide an OpenAI-compatible Chat Completions endpoint. Point your existing openai client at https://api.microdc.ai/v1 and keep your code. Multimodal content lists supported. Works with LangChain, LlamaIndex, Instructor, and any OpenAI-shaped library.

Is my data private? Can the server see my prompts?

By default, jobs are submitted in plaintext so the server can assist with routing and pricing. For zero-knowledge guarantees, use end-to-end encrypted jobs: client encrypts payload with a per-job symmetric key; only the claiming worker receives key material; result is re-encrypted with your public key; per-job keys deleted on acknowledgment. Only workers with the encryption capability can claim encrypted jobs.

Can I run my own Docker image on MicroDC.ai?

Yes — use the container job type. Provide image name, optional args/env, and any script files (.py, .sh, .js, .ts, .go, .rs, .java...) as inputs. Routes only to workers advertising the docker capability. Live log streaming via per-job heartbeat.

How is container job pricing structured?

Metered pricing — GPU-hour rate × runtime, or CPU-core-hour rate × cores × runtime, plus a $2.00 platform fee per run. Minimum charges apply. CPU-only jobs default to $0.005/core-hour when the worker's CPU model isn't tier-priced.

What are context tiers — do they affect my jobs?

Every job is labeled with a context tier (1–4) at submission based on prompt length. Workers declare a max_context_tier via heartbeat — your job will only be claimed by a capable worker. Transparent to your code; you don't set the tier, the server computes it.

Tier 1 = 0–1.5K chars, tier 2 = 1.5K–6K, tier 3 = 6K–24K, tier 4 = 24K+.

2.2 · ARCHITECTURE

What is the overall system architecture?

Three components:

Client (SDK/API)     Server (Hub)         Workers (Compute)
      |                   |                     |
      |  Submit Job       |                     |
      |------------------>|                     |
      |                   |  Queue Job          |
      |                   |-------------------->|
      |                   |                     |
      |                   |  Poll for Jobs      |
      |                   |<--------------------|
      |                   |                     |
      |                   |  Return Results     |
      |                   |<--------------------|
      |  Get Results      |                     |
      |<------------------|                     |

Server: API backend, PostgreSQL, scheduler. Workers: distributed GPU nodes running inference. Client: Python SDK and REST API.

What is the technology stack?

Backend: FastAPI, PostgreSQL, JWT auth, Docker.
Frontend (console): HTMX, Alpine.js, Tailwind CSS, Jinja2.
Workers: Python runtime, vLLM / llama.cpp, CUDA/ROCm GPU support.
Infra: Docker Compose (dev), Kubernetes (prod), Nginx, Redis (planned).

How does job scheduling and distribution work?

  1. Client submits job with model requirements
  2. Scheduler finds workers with required GPU/VRAM
  3. Jobs ordered by priority tier and submission time
  4. Best-fit selection considers worker performance history
  5. Failed jobs rescheduled to a different worker

Smart retry: a failed job won't be reassigned to the same worker.

2.3 · API & SDK

How do I submit a job using the Python SDK?

pip install git+https://gitlab.com/microdc/python-client.git

from microdc import Client, LLMCall

client = Client(api_key="mDC_your_api_key")

job = LLMCall(model="llama3.3")
job.add_user_message("Explain quantum computing in simple terms")

job_id = client.send_job(job)
result = client.wait_for_job(job_id)
print(result.output)

Full SDK docs: gitlab.com/microdc/python-client

What REST API endpoints are available?

Base URL: https://api.microdc.ai/v1 — Authorization: Bearer mDC_your_api_key

MethodEndpointDescription
POST/jobsSubmit new job
GET/jobs/{id}Get job status / result
GET/jobsList your jobs
GET/modelsList available models
GET/account/balanceCheck credit balance
DELETE/jobs/{id}Cancel a pending job

What job types are supported?

  • LLMCall — text generation, chat completions, system prompts, temperature/top_p
  • EmbedCall — text embeddings for search/RAG, batch embedding, multiple models
  • ImageGeneration — Stable Diffusion, custom dimensions, negative prompts
  • CustomJob / Container — flexible custom workloads, arbitrary Docker payloads

How do I handle webhooks?

job = LLMCall(model="llama3.3")
job.add_user_message("Process this document...")

job_id = client.send_job(
    job,
    callback_url="https://your-api.com/webhook/microdc"
)

# Your webhook will receive POST with:
# { "job_id": "uuid", "status": "completed", "result": {...} }

Or use polling with client.wait_for_job(job_id).

2.4 · MODELS & PERFORMANCE

What models are available?

100+ open-source models.

LLM: Llama 3.3 (8B, 70B), Mistral (7B, Mixtral), Qwen 2.5, DeepSeek, Phi-3, Gemma 2.
Embedding: BGE, E5, GTE, Nomic Embed.
Image: Stable Diffusion XL, SDXL Turbo, Flux.

Live catalog: /models.html

Can I deploy custom or fine-tuned models?

Yes. Custom model hosting is available — upload weights, automatic validation, deployment to compatible workers, version control, private access. Contact us for requirements and pricing.

What are typical job completion times?

Most jobs complete within 30 seconds to 5 minutes. Premium tier averages under 1 minute. Factors: priority tier, model size, token count, queue depth.

2.5 · SECURITY & AUTH

How does API authentication work?

export MICRODC_API_KEY="mDC_your_api_key"

client = Client()  # auto-reads from env
# or
client = Client(api_key="mDC_...")

# HTTP header:
Authorization: Bearer mDC_your_api_key

Never commit API keys to version control. Use env vars or secrets management.

How is data encrypted and protected?

  • TLS 1.3 in transit
  • AES-256 at rest
  • Worker isolation in Docker
  • Data minimization — only necessary data sent to workers
  • Automatic purging after configurable retention
  • Job content not logged server-side

What about rate limiting?

Default limits (raisable for enterprise):

LimitValue
Job submissions100/minute
Status checks300/minute
Concurrent jobs50 (default)
Burst allowance2× for 10 seconds

Rate-limit headers: X-RateLimit-Remaining

2.6 · BECOMING A WORKER

Hardware requirements?

ComponentMinimumRecommended
GPUNVIDIA GTX 1060 6GBRTX 3080+ / A100
VRAM6GB16GB+
RAM16GB32GB+
Storage100GB SSD500GB+ NVMe
Internet50 Mbps100+ Mbps

AMD GPUs (ROCm) supported. CPU-only workers can handle smaller models.

How do I set up a worker node?

git clone https://gitlab.com/microdc/worker.git
cd worker
pip install -r requirements.txt
export MICRODC_WORKER_TOKEN="mdc_wrk_your_token"
python worker.py --models llama3-8b,mistral-7b

The worker will auto-register, download required models, and start polling for jobs. Setup guide: gitlab.com/microdc/worker

How are worker earnings calculated?

Per-job ledger entry on the JobAssignment record — your source of truth, not a drift-prone counter. Aggregated to your Earnings page in three windows: lifetime, 90-day, 30-day. Factors: model complexity, job volume, performance/reliability score, geographic latency preferences.

2.7 · BLOCKCHAIN (FUTURE)

What blockchain technology will MicroDC.ai use?

Under evaluation: Ethereum L2s (Arbitrum, Optimism), Solana, Polygon, custom Cosmos-SDK chain. Selection criteria: low fees, high throughput, dev ecosystem, regulatory clarity.

How will smart contracts be used?

  • Payment escrow — locked, released on verified completion
  • Proof of work — cryptographic verification
  • Reputation system — on-chain worker scores
  • Dispute resolution — decentralized arbitration

Will there be a MicroDC.ai token?

Exploring a utility token: payment currency, worker staking, governance, fee discounts, rewards. Token economics are under development. Traditional fiat payments will always be supported.

When will blockchain features be available?

Phased rollout: Phase 1 — accept stablecoin payments (USDC/USDT). Phase 2 — on-chain job-completion proofs. Phase 3 — full decentralization, token launch, staking, governance.

For GPU providers.

§03 · PROVIDERS
3.1 · GETTING STARTED

What hardware do I actually need?

Less than you'd think. We support a wide range via context-tier routing — workers declare capacity and only receive jobs they can handle.

  • Tier 1 (short prompts) — ARM SBCs, Orange Pi Max 16GB, old laptops, CPU-only
  • Tier 2 — Entry-level GPUs (GTX 1660, RTX 3060 8GB) or high-RAM CPU rigs
  • Tier 3 — Mid-range GPUs with 16GB+ VRAM (RTX 4070, A4000)
  • Tier 4 — High-end GPUs with 24GB+ VRAM (RTX 4090, A100, H100)

Stable internet (~50 Mbps recommended), Linux or Windows, Docker for container jobs.

Can I run a worker without a GPU?

Yes. CPU-only workers heartbeat cleanly and can serve embedding jobs and small-LLM / short-prompt jobs efficiently. Embedding workloads in particular run at acceptable latency without a GPU at all.

How long does setup take?

About 5 minutes from sign-up to first heartbeat. Create an account, generate a worker token in the console, install the worker, start it. Jobs start flowing automatically.

Setup guide: gitlab.com/microdc/worker

3.2 · EARNINGS & PAYOUTS

How do I get paid — credits or cash?

Your choice, per payout request:

  • Credits (with bonus) — instant. A bonus percentage on top. Great if you also submit jobs.
  • PayPal cash — real USD to your PayPal account. Recorded as a PAYOUT transaction.

Head to the Earnings page in your dashboard (under Workers).

How am I actually paid per job?

For each completed job the customer is charged and you receive a payout share (the platform retains a margin). Your per-job payout is recorded in the JobAssignment ledger — your source of truth. Earnings aggregated in lifetime, 90-day, and 30-day windows.

What about failed jobs or timeouts?

Only successfully completed jobs earn a payout. If a job fails or times out, the customer isn't charged and you aren't paid for it.

3.3 · CONTEXT TIERS & ROUTING

What is max_context_tier and should I set it?

An integer (1–4) your worker advertises via heartbeat. The scheduler then only sends you jobs at or below that tier. Set it if your hardware struggles with long prompts. Don't set it if you have plenty of headroom — you'll see all jobs.

Setting a tier doesn't reduce your earning ceiling — it lets you earn consistently on jobs you can actually complete instead of timing out on oversized prompts.

Can I run multiple jobs at once on a big GPU?

Yes. Set max_concurrent_jobs > 1 in your worker config. The server will only mark you BUSY when at full capacity. Defaults to 1.

Why isn't my worker getting jobs?

  • Model mismatch — jobs route to workers advertising the requested model
  • Context tier set too low — check your heartbeat payload
  • Heartbeat stale — server considers you offline after missing heartbeats
  • Already BUSY — at full max_concurrent_jobs capacity
3.4 · WORKER GROUPS

Can I organize my machines?

Yes — use Worker Groups. Named groups ("Home-Lab", "Colo-Rack-A", "Customer-X") with aggregate stats per group including 30-day, 90-day, and all-time earnings sourced from the real ledger.

What happens if I delete a group?

Workers stay. Group membership is just a label — deleting the group sets the FK to NULL on members. Nothing else changes.

3.5 · DOCKER, ENCRYPTION, SPECIAL CAPABILITIES

How do I accept Docker container jobs?

Advertise capabilities: ["docker"] in your heartbeat. You'll start receiving customer container jobs. Each container job includes a per-job heartbeat endpoint to stream logs and reset timeouts. Metered pricing — GPU-hour or CPU-core-hour rate × runtime, plus $2.00 platform fee.

Is running random customer containers safe?

The worker runs jobs in Docker with standard container isolation. For maximum safety, run on a dedicated machine or VM. If you don't want to accept arbitrary code, simply don't advertise docker — you'll only receive LLM / embedding / document jobs.

How do encrypted jobs work on the worker side?

Advertise capabilities: ["encryption"] to opt in. On claim you receive the symmetric key + IV. You decrypt the payload in memory, run inference, then re-encrypt the result with the customer's public key before submitting. Per-job keys are deleted on customer acknowledgment.

3.6 · RELIABILITY & MONITORING

What if my worker goes offline mid-job?

The zombie monitor detects stalled jobs using activity-based tracking (no progress for 10 minutes). Stalled jobs are requeued to another worker, and your worker is freed. You don't get paid for the stalled attempt. Repeated stalls (retry exhaustion) result in a FAILED job — preventing infinite retry loops.

Can I set an availability schedule?

Today: just start and stop the worker process. When running, it heartbeats and receives jobs; when stopped, it's offline. Many providers run workers overnight or during spare hours via cron, systemd timers, or Task Scheduler.

What if I delete a worker from the console?

The worker's registration token is deactivated. Any further login or heartbeat receives a clear 403. Historical earnings stay in your ledger — records aren't deleted.

§04 · STILL HAVE QUESTIONS?

Talk to a human.

Our team is here to help. Reach out for personalized assistance with your specific use case.