Reinforcement Learning

Data Skeptic

In many real-world situations, a person or agent doesn't necessarily know their own objectives or the mechanics of the world they're interacting with. However, if the agent receives rewards that are correlated with both its actions and the state of the world, then reinforcement learning can be used to discover behaviors that maximize the reward earned.
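As a rough illustration of that idea, here is a minimal sketch of tabular Q-learning, one common reinforcement learning method (the episode description does not name a specific algorithm, so this choice is an assumption). The five-state corridor world, the `step` function, and the hyperparameters `alpha`, `gamma`, and `epsilon` are all hypothetical and chosen only to keep the example small. The agent never inspects the rules inside `step`; it only observes the resulting state and reward.

```python
import random

N_STATES = 5          # hypothetical toy world: states 0..4, state 4 is the goal
ACTIONS = (-1, +1)    # action index 0 moves left, index 1 moves right


def step(state, action):
    """World mechanics, unknown to the agent: returns (next_state, reward)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values purely from observed rewards (tabular Q-learning)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state = 0
        while state != N_STATES - 1:
            # Explore with probability epsilon, or when both actions look equal;
            # otherwise exploit the action with the higher estimated value.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward = step(state, ACTIONS[a])
            # Move the estimate toward reward plus discounted best future value.
            target = reward + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best known action for each non-goal state."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this toy setting the learned policy should be to move right from every non-goal state, which the agent discovers only from the rewards it receives, not from any knowledge of the corridor's layout.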

Next Episodes

Data Skeptic

Beer-in-Hand Data Science @ Data Skeptic

📆 2018-02-05 01:00


Data Skeptic

Evolutionary Computation @ Data Skeptic

📆 2018-02-02 01:00



Data Skeptic

Markov Decision Processes @ Data Skeptic

📆 2018-01-26 18:27