Docs/AI Backup
AI Infrastructure Backup
Protect AI models, vector databases, prompts, and AI workspace conversations.
Overview
AI infrastructure is often treated as disposable or easily reproducible. In reality, AI projects represent months of fine-tuning, experimentation, and optimization that cannot be quickly recreated. AI Infrastructure Backup (AI Infra Backup tier: $395/month, 10 TB storage, 20 devices) protects your most valuable AI assets against data loss, account suspensions, model drift, and disaster scenarios.
What You Get
- •Model Registry Backup with FastCDC dedup — Back up model weights from HuggingFace, Ollama, and LM Studio with content-aware chunking (50 checkpoint versions cost ~3× one full model size).
- •Vector DB Connectors — Schema-aware backup for Pinecone, Weaviate, Qdrant, Milvus, pgvector, and Chroma without costly re-embedding.
- •Training Dataset Vault — Version and protect raw datasets, training splits, and augmented datasets used in model training.
- •Prompt & Agent Library Versioning — Backup all prompts, system instructions, and custom agent definitions across all platforms.
- •Notebook & Experiment Snapshots — Automatic backup of Jupyter, Observable, and Google Colab notebooks along with their execution history.
- •RAG Source Vault — Protect embeddings, chunked documents, and metadata for retrieval-augmented generation (RAG) pipelines.
- •Natural-Language Restore Assistant — Query backups in plain English: 'Restore the Jan 15 version of my customer_support_agent before the prompt rewrites' — AI finds and restores the exact version.
- •AI Anomaly Detection — Detects suspicious model changes (weight drift, injection attacks, poisoning) and alerts you to drift before deployment.
- •Model Lineage Graph — Track which models depend on which datasets, prompts, and fine-tuning runs for full traceability.
- •EU AI Act / ISO 42001 / NIST AI RMF Audit Reports — Auto-generated compliance reports for regulated AI deployments.
Who It's For
AI Infrastructure Backup is designed for organizations that depend on AI as a critical business function:
- •AI-native startups — Protect fine-tuned models and prompt libraries that define your product.
- •In-house AI teams at regulated companies — Finance, pharma, legal firms that must audit and archive every model version for compliance.
- •AI consultancies — Back up client models, datasets, and configurations across multiple projects.
- •Research groups — Preserve training datasets and model checkpoints that took months to prepare.
- •LLM application teams — Protect RAG indexes, prompt chains, and agent definitions from accidental or malicious deletion.
Quick Start
Get started in three steps:
- •Upgrade to AI Infra Backup tier in your customer portal (Dashboard → Billing & Plans).
- •Open the BackupEngine desktop agent and navigate to the AI Assets tab.
- •Click 'Scan & Protect' to detect local AI models, notebooks, prompts, and datasets automatically.
💡 Tip
Auto-scan detects models from HuggingFace cache, Ollama, LM Studio, Jupyter directories, and common RAG framework paths. You can add custom paths manually if needed.
Next Steps
Explore the full AI Backup documentation:
- •ML Model Registry Backup — Learn how FastCDC dedup reduces storage costs for model checkpoints.
- •Vector Database Backup — Set up schema-aware backup for your vector database without re-embedding.
- •AI Workspace Backup — Protect Claude.ai, ChatGPT, Gemini conversations and custom agents.