문의를 보내주셔서 감사합니다! 팀원이 곧 연락드리겠습니다.
예약을 보내주셔서 감사합니다! 저희 팀 멤버 중 한 분이 곧 연락드리겠습니다.
코스 개요
Introduction to Reinforcement Learning and Agentic AI
- Decision-making under uncertainty and sequential planning
- Key components of RL: agents, environments, states, and rewards
- Role of RL in adaptive and agentic AI systems
Markov Decision Processes (MDPs)
- Formal definition and properties of MDPs
- Value functions, Bellman equations, and dynamic programming
- Policy evaluation, improvement, and iteration
Model-Free Reinforcement Learning
- Monte Carlo and Temporal-Difference (TD) learning
- Q-learning and SARSA
- Hands-on: implementing tabular RL methods in Python
Deep Reinforcement Learning
- Combining neural networks with RL for function approximation
- Deep Q-Networks (DQN) and experience replay
- Actor-Critic architectures and policy gradients
- Hands-on: training an agent using DQN and PPO with Stable-Baselines3
Exploration Strategies and Reward Shaping
- Balancing exploration vs. exploitation (ε-greedy, UCB, entropy methods)
- Designing reward functions and avoiding unintended behaviors
- Reward shaping and curriculum learning
Advanced Topics in RL and Decision-Making
- Multi-agent reinforcement learning and cooperative strategies
- Hierarchical reinforcement learning and options framework
- Offline RL and imitation learning for safer deployment
Simulation Environments and Evaluation
- Using OpenAI Gym and custom environments
- Continuous vs. discrete action spaces
- Metrics for agent performance, stability, and sample efficiency
Integrating RL into Agentic AI Systems
- Combining reasoning and RL in hybrid agent architectures
- Integrating reinforcement learning with tool-using agents
- Operational considerations for scaling and deployment
Capstone Project
- Design and implement a reinforcement learning agent for a simulated task
- Analyze training performance and optimize hyperparameters
- Demonstrate adaptive behavior and decision-making in an agentic context
Summary and Next Steps
요건
- Strong proficiency in Python programming
- Solid understanding of machine learning and deep learning concepts
- Familiarity with linear algebra, probability, and basic optimization methods
Audience
- Reinforcement learning engineers and applied AI researchers
- Robotics and automation developers
- Engineering teams working on adaptive and agentic AI systems
28 시간
회원 평가 (3)
좋은 지식과 실습의 조합
Ion Mironescu - Facultatea S.A.I.A.P.M.
코스 - Agentic AI for Enterprise Applications
기계 번역됨
이론과 실습, 고수준과 저수준의 관점을 결합한 접근
Ion Mironescu - Facultatea S.A.I.A.P.M.
코스 - Autonomous Decision-Making with Agentic AI
기계 번역됨
실습 문제
Daniel - Facultatea S.A.I.A.P.M.
코스 - Agentic AI in Multi-Agent Systems
기계 번역됨