Markov Decision Process (MDP) is a formal framework for decision-making where outcomes depend solely on the current state (Markov property). \

architecture

Markov Decision Processes (MDPs): The mathematical framework for modelling decision-making, characterized by states, actions, transition probabilities, and rewards. Your understanding of probability theory and stochastic processes will be crucial here.