Pre-configured clusters shipping now

Your AI Cluster. Delivered Ready.

Pre-configured Apple Silicon clusters with autonomous ops built in. Plug in, power on, run inference. No Linux. No cloud bills. No setup.

500 W total draw vs 3,000 W GPU

2.2 yr breakeven vs cloud

Zero setup time: plug in, run

Choose your cluster

Every tier ships burn-tested, ABM-enrolled, and ready to run inference on day one.

Start here

Single Unit

from $2,499 one-time

Mac Mini, Mac Studio, or MacBook Pro
r1o stack pre-loaded
72-hour burn-tested
ABM enrolled
1 model pre-loaded
Add more units anytime

Browse Hardware

Best value, most popular

Pro Cluster

$15,999 one-time

4x Mac Mini M4 Pro 64 GB
256 GB total unified memory
3x TB5 cables included
RDMA mesh + JACCL distributed inference
r1o Agent Stack + asmi monitoring
3 models pre-loaded and serving
72-hour burn-in per unit

Get Started

2,456 GB/s bandwidth

Max Cluster

$27,999 one-time

4x MacBook Pro M5 Max 128 GB
512 GB total unified memory
3x TB5 cables included
RDMA mesh + JACCL distributed inference
r1o Agent Stack + asmi monitoring
5 models pre-loaded and serving
Portable — take your cluster anywhere

Get Started

Maximum inference

Ultra Cluster

$42,999 one-time

4x Mac Studio M3 Ultra 256 GB
1 TB total unified memory
6x TB5 cables included
RDMA mesh + JACCL distributed inference
r1o Agent Stack + asmi monitoring
Unlimited models pre-loaded
90-day post-deploy support

Contact Sales

2 TB unified memory

Sovereign

$89,999 one-time

4x Mac Studio M3 Ultra 512 GB
2 TB total unified memory
6x TB5 cables included
Runs 400B+ parameter models natively
r1o Agent Stack + full automation
White-glove on-site installation
1-year dedicated support
Discontinued config — sourced exclusively

Contact Sales

What's included

Feature	Starter	Pro	Max	Ultra	Sovereign
Hardware	1 unit	4 units	4 units	4 units	4 units
Total unified memory	128 GB	256 GB	512 GB	1 TB	2 TB
Memory bandwidth	546 GB/s	273 GB/s ×4	614 GB/s ×4	800 GB/s ×4	800 GB/s ×4
TB5 cables included		3x	3x	6x	6x
TB5 RDMA
r1o Stack
Agent Stack	Add-on	Included	Included	Included	Included
Models pre-loaded	1	3	5	Unlimited	Unlimited
400B+ models
Portable
Support	Email	30 days	30 days	90 days	1 year
On-site setup				Available	Included
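
As a rough way to read the memory row against a model size, the sketch below estimates the footprint of the weights alone and compares it to a tier's total unified memory. It is a back-of-envelope illustration, not a sizing guarantee: the function names and the `headroom` fraction (memory left over for KV cache, the OS, and runtime overhead) are assumptions made for this example, and sharding overhead across units is ignored.

```python
def weight_footprint_gb(params_billion: float, bits_per_weight: int) -> float:
    """Approximate size of the model weights alone, in decimal GB.

    params_billion * 1e9 weights * (bits / 8) bytes each, divided by 1e9.
    """
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: int,
         memory_gb: float, headroom: float = 0.75) -> bool:
    """True if the weights fit in `headroom` * memory_gb.

    `headroom` is a hypothetical planning margin, not a measured figure.
    """
    return weight_footprint_gb(params_billion, bits_per_weight) <= memory_gb * headroom


if __name__ == "__main__":
    # Total unified memory per tier, taken from the table (1 TB treated as 1,000 GB).
    tiers = {"Pro": 256, "Max": 512, "Ultra": 1000, "Sovereign": 2000}
    for bits in (4, 8, 16):
        size = weight_footprint_gb(400, bits)
        ok = [name for name, gb in tiers.items() if fits(400, bits, gb)]
        print(f"400B at {bits}-bit: ~{size:.0f} GB of weights, fits: {ok}")
```

Changing `headroom` or the quantization level shifts which tiers qualify, so treat the result as a starting point for scoping rather than a spec.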

Why not cloud?

The math is straightforward. Own your compute.

Cloud GPU

$18,400 / year recurring
Data leaves your network
Scaling = more bills
NVIDIA dependency & allocation lottery

RackStudio

One-time $13,999
Data stays on-prem, always
Scales with more units
Your hardware, forever
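
The comparison above comes down to simple arithmetic, sketched below: divide the one-time price by the yearly cloud spend it replaces. The optional power term is an assumption added for illustration (the electricity rate is hypothetical and the cluster is assumed to run around the clock), and other inputs, such as a different tier price or cloud bill, will give a different breakeven.

```python
HOURS_PER_YEAR = 24 * 365


def breakeven_years(upfront_usd: float, cloud_usd_per_year: float,
                    power_watts: float = 0.0, usd_per_kwh: float = 0.0) -> float:
    """Years until a one-time purchase costs less than recurring cloud spend.

    Power cost is subtracted from the yearly cloud saving; both power
    arguments default to zero, which ignores electricity entirely.
    """
    power_usd_per_year = power_watts / 1000 * HOURS_PER_YEAR * usd_per_kwh
    net_saving_per_year = cloud_usd_per_year - power_usd_per_year
    if net_saving_per_year <= 0:
        raise ValueError("No breakeven: running costs meet or exceed cloud spend")
    return upfront_usd / net_saving_per_year


if __name__ == "__main__":
    # Figures from the comparison above; power cost ignored.
    print(f"{breakeven_years(13_999, 18_400):.2f} years")
    # Same figures with a 500 W draw at a hypothetical $0.15 per kWh.
    print(f"{breakeven_years(13_999, 18_400, power_watts=500, usd_per_kwh=0.15):.2f} years")
```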

Included with Pro & Ultra

The r1o Agent Stack

Your cluster's brain. Autonomous operations that monitor, heal, and manage your inference fleet so you don't have to.

Apple Business MCP

Device enrollment, MDM, compliance

Cluster Intelligence

Real-time GPU/RAM/power monitoring via asmi

Self-Healing

Auto-restart failed inference, recovery agents

Model Management

Deploy, quantize, shard across units

Hermes Notifications

iMessage/Slack alerts for health events

NanoMDM

Push profiles, manage your fleet remotely

Ready to own your inference?

Reach out and we will scope your cluster, ship it configured, and get you running inference the day it arrives.

m@r1o.ai

Browse hardware