Seldon Core

Seldon Core

Seldon Core 2 is a Kubernetes-native MLOps and LLMOps framework for deploying machine learning models and Large Language Model systems at scale.

Use it when

  • You need a Kubernetes-native solution for ML model deployment.
  • You're deploying both traditional ML models and Large Language Models (LLMs).
  • You require advanced MLOps features like A/B testing, canary deployments, and experiment routing.
  • Your infrastructure spans multiple environments (on-premise, hybrid, multi-cloud).
  • You need sophisticated monitoring and observability for ML systems with auditable prediction data.
  • You want to reduce infrastructure costs through multi-model serving and resource optimization.
  • You have a team with strong Kubernetes and MLOps expertise.
  • You need to compose complex AI applications through pipelines.

Watch out

  • Auto-scaling limitations: Requires additional setup (like KEDA) and does not support scaling to zero when idle.
  • Kubernetes expertise required: Not suitable for companies with limited MLOps capabilities.
  • Community support: Only average community support available compared to more established platforms.
  • Configuration complexity: Common deployment issues include incorrect configuration settings and accessibility problems.
  • Local model deployment issues: Problems with syncing/copying local models to persistent volumes.
  • Network timing issues: Istio VirtualServices may not be ready immediately after container startup.
  • Metrics integration challenges: Issues with integrating Triton metrics and port recognition.
  • Resource-intensive operations: May exceed allocated limits if not properly configured.

Available in stages

Model Serving

Installation

helm repo add seldon-charts https://seldonio.github.io/helm-charts helm install seldon-core-v2-crds seldon-charts/seldon-core-v2-crds helm install seldon-core-v2 seldon-charts/seldon-core-v2-setup --namespace seldon-mesh

Example stacks

Example stacks coming soon...