Dhruv Kumar Yadav

11253
dhruvkr@
CS365A :: Assignment #1

Cartoon

Click Here to go to image.

Task Identification

Playing soccer Brushing your teeth Playing a tennis match Practicing tennis against
a wall
Deciding what item to take
at hostel lunch
Deterministic vs Stochastic Stochastic Deterministic Stochastic Deterministic Deterministic
Episodic vs Sequential Sequential Episodic Sequential Sequential Sequential
Static vs Dynamic Dynamic Static Dynamic Dynamic Static
Discrete vs Continuous Continuous Continuous Continuous Continuous Discrete
Fully observable vs
Partially observable
Partially observable Fully observable Partially observable Fully observable Fully observable
Single agent vs Multiagent Multiagent Single agent Multiagent Single agent Single agent

Playing Soccer
Stochastic :: In a soccer match next state is not completely determined by the action executed by the agent. The environment is complex. One can never predict the behaviour of other players exactly and thus have to worry about uncertainty in passing/shooting/defending. Therefore, it is stochastic in nature.
Sequential :: In a soccer match, a success i.e. a goal can be result of someone passing from midfield to striker few minutes ago. Thus the choice current action i.e. passing another mid fielder/ striker can affect my future decision. Short term consequences ( eg pass is correct or not) have long term consequences making them sequential.
Dynamic :: In a soccer match, other players keep moving while a algorithm computes what to do next. As the environment can change while an agent is deliberating; the environment is dynamic for the agent.
Continuous :: Soccer match is a continuous state and continuous time problem ; as the speed and position of other players keeps on changing through continuous range of values, smoothly over time.
Partially observable :: In a soccer match, an agent can not determine what other agents are thinking and thus is partially observable.
Multi-agent :: In a soccer match, a striker needs to account position, speed and direction of defenders of opposite team. Each player needs to account for the action of the other player making it multiagent.

Reason for few of the labels
Practicing tennis against a wall - Deterministic :: The next state is completely determined by the current state and action executed by agent i.e force applied, direction in which it is applied etc.
Practicing tennis against a wall - Dynamic :: As the player keeps on moving according to the ball while computing what to do next.
NOTE In deciding what item to take at hostel lunch there is assumption that there is enough food for every student.
Deciding what item to take at hostel lunch - Sequential :: If an agent takes dry vegetable then it prefer chapatti else if I take vegetable with curry it prefers rice with them. Thus choice of current action can effect th future action making it sequential.