문의를 보내주셔서 감사합니다! 팀원이 곧 연락드리겠습니다.
예약을 보내주셔서 감사합니다! 저희 팀 멤버 중 한 분이 곧 연락드리겠습니다.
코스 개요
Foundations of Agentic Systems in Production
- Agentic architectures: loops, tools, memory, and orchestration layers
- Lifecycle of agents: development, deployment, and continuous operation
- Challenges of production-scale agent management
Infrastructure and Deployment Models
- Deploying agents in containerized and cloud environments
- Scaling patterns: horizontal vs vertical scaling, concurrency, and throttling
- Multi-agent orchestration and workload balancing
Monitoring and Observability
- Key metrics: latency, success rate, memory usage, and agent call depth
- Tracing agent activity and call graphs
- Instrumenting observability using Prometheus, OpenTelemetry, and Grafana
Logging, Auditing, and Compliance
- Centralized logging and structured event collection
- Compliance and auditability in agentic workflows
- Designing audit trails and replay mechanisms for debugging
Performance Tuning and Resource Optimization
- Reducing inference overhead and optimizing agent orchestration cycles
- Model caching and lightweight embeddings for faster retrieval
- Load testing and stress scenarios for AI pipelines
Cost Control and Governance
- Understanding agent cost drivers: API calls, memory, compute, and external integrations
- Tracking agent-level costs and implementing chargeback models
- Automation policies to prevent agent sprawl and idle resource consumption
CI/CD and Rollout Strategies for Agents
- Integrating agent pipelines into CI/CD systems
- Testing, versioning, and rollback strategies for iterative agent updates
- Progressive rollouts and safe deployment mechanisms
Failure Recovery and Reliability Engineering
- Designing for fault tolerance and graceful degradation
- Retry, timeout, and circuit breaker patterns for agent reliability
- Incident response and post-mortem frameworks for AI operations
Capstone Project
- Build and deploy an agentic AI system with full monitoring and cost tracking
- Simulate load, measure performance, and optimize resource usage
- Present final architecture and monitoring dashboard to peers
Summary and Next Steps
요건
- Strong understanding of MLOps and production machine learning systems
- Experience with containerized deployments (Docker/Kubernetes)
- Familiarity with cloud cost optimization and observability tools
Audience
- MLOps engineers
- Site Reliability Engineers (SREs)
- Engineering managers overseeing AI infrastructure
21 시간
회원 평가 (3)
좋은 지식과 실습의 조합
Ion Mironescu - Facultatea S.A.I.A.P.M.
코스 - Agentic AI for Enterprise Applications
기계 번역됨
이론과 실습, 고수준과 저수준의 관점을 결합한 접근
Ion Mironescu - Facultatea S.A.I.A.P.M.
코스 - Autonomous Decision-Making with Agentic AI
기계 번역됨
실습 문제
Daniel - Facultatea S.A.I.A.P.M.
코스 - Agentic AI in Multi-Agent Systems
기계 번역됨