Hosting & Deployment

Your data, your rules. SentientOne AI runs wherever your security and compliance requirements demand — in the cloud, on your own servers, or a hybrid of both. You choose where your data lives.

How every request flows

Request flow · your app → SentientOne → LLM
POST /v1/chatCloudOn-PremHybrid200 OK
Your App

Any platform — web, mobile, internal tooling.

API Gateway

Authenticates, rate-limits, routes to the right region.

Agent Config

System prompt · model · tools · LLM key.

LLM + MCP Tools

Anthropic · OpenAI · Gemini · Groq + your MCP servers.

Zero AI code in your appResponse: conversation_id · message · tool_calls

Cloud Hosted

Fastest way to get started.

  • Fully managedZero infrastructure to maintain. SentientOne handles uptime, scaling, security patches, and platform upgrades.
  • Auto-scalingHandles traffic spikes without config changes. Cold starts are rare thanks to warm instance pooling.
  • Global edgeEdge routing for low-latency API calls. Requests land at the nearest PoP and are routed to the closest region for processing.
  • Automatic updatesUpdates, patches, and security fixes ship continuously. No maintenance windows, no breaking changes without prior notice.
  • 99.9% uptime SLAMulti-region failover with health-checked routing. Higher SLAs available on Enterprise.

On-Premise

Maximum control & compliance.

  • Your network, your rulesDeploy on your own servers, VPC, or private cloud. The platform is shipped as a versioned container bundle with a documented upgrade procedure.
  • Data sovereigntyData never leaves your network. Outbound LLM provider calls can be routed through your own egress proxy so even the LLM hop is observable.
  • SSO / LDAP / IAMIntegrate with your existing identity provider. SAML 2.0 and OIDC are first-class; LDAP supported via adapter.
  • Air-gapped optionFor regulated industries — full air-gapped deployment with offline updates and signed bundle verification.
  • Custom retentionDefine your own retention policies and data residency controls. Default is plan-aligned but every value is tunable.

Hybrid

Need the best of both worlds? Run the orchestration layer in the cloud for simplicity while keeping sensitive data processing on-premise. Or use cloud for development and staging, with on-premise for production.

Operational guarantees

Data residency

Choose where your data is stored — US, EU, APAC, or your own data centre. Meet regional compliance requirements without compromising performance.

Zero-downtime updates

Platform updates roll out with blue-green deployments. No maintenance windows, no service interruptions. On-premise customers control their own update schedule.

Disaster recovery

Automated backups, point-in-time recovery, and cross-region replication. Your agent configurations and conversation history are always recoverable.

Picking a deployment

Start in Cloud for the fastest path to production. Move to Hybrid when you need to keep specific data sets in your network. Move to On-Premise only when regulation demands it — every other model trades off some operational simplicity.