Seldon Core

Seldon Core 2 is a Kubernetes-native MLOps and LLMOps framework for deploying machine learning models and Large Language Model systems at scale.

Use it when

•You need a Kubernetes-native solution for ML model deployment.
•You're deploying both traditional ML models and Large Language Models (LLMs).
•You require advanced MLOps features like A/B testing, canary deployments, and experiment routing.
•Your infrastructure spans multiple environments (on-premise, hybrid, multi-cloud).
•You need sophisticated monitoring and observability for ML systems with auditable prediction data.
•You want to reduce infrastructure costs through multi-model serving and resource optimization.
•You have a team with strong Kubernetes and MLOps expertise.
•You need to compose complex AI applications through pipelines.

Watch out

⚠Auto-scaling limitations: Requires additional setup (like KEDA) and does not support scaling to zero when idle.
⚠Kubernetes expertise required: Not suitable for companies with limited MLOps capabilities.
⚠Community support: Only average community support available compared to more established platforms.
⚠Configuration complexity: Common deployment issues include incorrect configuration settings and accessibility problems.
⚠Local model deployment issues: Problems with syncing/copying local models to persistent volumes.
⚠Network timing issues: Istio VirtualServices may not be ready immediately after container startup.
⚠Metrics integration challenges: Issues with integrating Triton metrics and port recognition.
⚠Resource-intensive operations: May exceed allocated limits if not properly configured.

Available in stages

Model Serving

Installation

helm repo add seldon-charts https://seldonio.github.io/helm-charts helm install seldon-core-v2-crds seldon-charts/seldon-core-v2-crds helm install seldon-core-v2 seldon-charts/seldon-core-v2-setup --namespace seldon-mesh

Example stacks

Example stacks coming soon...

Visit Official Website →