Markov Decision Process (MDP) is a formal framework for decision-making where outcomes depend solely on the current state (Markov property). \

architecture