Markov Decision Process Explorer
Visualize, simulate, and optimize Markov Decision Processes with advanced reinforcement learning algorithms
Load Preset Examples
Choose from pre-built MDP examples to get started quickly
Simple 3-State MDP
beginnerA basic 3-state Markov chain with deterministic transitions. Great for learning the basics.
2x2 Grid World
beginnerA classic grid world problem where an agent navigates to reach a goal while avoiding obstacles.
Gambler's Problem
intermediateA classic reinforcement learning problem where a gambler tries to reach a target amount.
Robot Navigation
intermediateA robot navigating through a simple environment with obstacles and goals.
MDP Configuration
Configure your Markov Decision Process using the interactive controls below
States
Define the possible states in your MDP
Actions
Define the available actions in your MDP
Discount Factor (γ)
Controls how much future rewards are valued relative to immediate rewards
Transitions
Configure state transitions, probabilities, and rewards for each action
State: S0
Action: a
Action: b
State: S1
Action: a
Action: b
State: S2
Action: a
Action: b
Monte Carlo Simulation
Configure simulation parameters and run Monte Carlo analysis
Configure your MDP using the visual configurator above to see the graph.