NannyML

NannyML is an open-source post-deployment data science library that detects silent model failures in production and estimates ML model performance without ground truth.

Use it when

•Ground truth delays: When actual outcome labels are delayed or completely absent in production.
•Performance estimation needed: When you need to estimate model performance without waiting for target labels.
•Meaningful alerts required: When you want alerts focused on actual performance impact rather than just data drift.
•Silent failure detection: When you need to detect model performance degradation that occurs without obvious warning signs.
•Business impact tracking: When you need to tie model performance to monetary or business-oriented outcomes.
•Multi-model type support: When working with binary classification, multiclass, or regression models.
•Production model reliability: When maintaining model reliability and performance in real-world deployments is critical.

Watch out

⚠Reference dataset requirements: Requires stable reference datasets that meet evaluation metrics; common mistake is using training data as reference.
⚠False alarm potential: Can overwhelm teams with false alarms if not properly configured, though focuses on meaningful alerts.
⚠Chunk size sensitivity: Requires careful chunk size configuration - too small chunks lead to unreliable statistical results.
⚠Univariate detection limitations: May miss complex system changes when monitoring individual variables.
⚠Drift-performance misalignment: Not every data drift affects model performance, and performance degradation can result from other causes.
⚠Statistical sensitivity: Drift detection methods can be overly sensitive and require careful configuration.
⚠Multivariate complexity: Detecting multivariate drift is more complex than single variable monitoring.
⚠Outlier sensitivity: May be sensitive to extreme values leading to false alarms or missed detections.

Available in stages

Model Monitoring

Installation

pip install nannyml

Example stacks

Example stacks coming soon...

Visit Official Website →