
I don't think so. Reinforcement learning is about teaching by pairing correct actions with rewards, like giving a dog a treat for doing a trick.

Dynamic programming is a technique that breaks a problem into overlapping subproblems and caches their results, typically filling in a table from the bottom up.
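To make "caches partial results from the bottom up" concrete, here is a minimal sketch using Fibonacci numbers. The function name `fib_bottom_up` is made up for illustration; the point is only that each table entry is built from smaller entries that were already computed.

```python
def fib_bottom_up(n: int) -> int:
    """Return the n-th Fibonacci number by filling a table from the bottom up."""
    if n < 2:
        return n
    table = [0] * (n + 1)   # table[i] will hold the i-th Fibonacci number
    table[1] = 1
    for i in range(2, n + 1):
        # Each entry reuses the two cached smaller results instead of recomputing them.
        table[i] = table[i - 1] + table[i - 2]
    return table[n]
```

Calling `fib_bottom_up(10)` walks the table once, so the work grows linearly with `n` rather than exponentially as in the naive recursion.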

Apples and oranges



I'm no expert, but there are approaches to reinforcement learning where you build a table of the expected payoff for each scenario, and these can recursively reference other entries in the table. Take a look at Q-learning, for example.
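A rough sketch of what that table looks like in tabular Q-learning, under assumptions of my own: the five-state corridor, the `step` and `train` helpers, and the constants are all hypothetical, and the agent explores with uniformly random actions (Q-learning is off-policy, so that still works here). Note how the update for one entry reads the best entry of the next state's row.

```python
import random

N_STATES = 5          # hypothetical toy corridor: states 0..4, state 4 is terminal
ACTIONS = (0, 1)      # 0 = step left, 1 = step right
ALPHA, GAMMA = 0.5, 0.9   # learning rate and discount factor (arbitrary choices)

def step(state, action):
    """Deterministic corridor: reward 1.0 only on reaching the last state."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done

def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]   # q[state][action] = expected payoff
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = rng.choice(ACTIONS)         # purely random exploration
            nxt, reward, done = step(state, action)
            # The target references another table entry: the best value at `nxt`.
            best_next = 0.0 if done else max(q[nxt])
            q[state][action] += ALPHA * (reward + GAMMA * best_next - q[state][action])
            state = nxt
    return q

q_table = train()
# Greedy action for each non-terminal state, read straight off the table.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
```

The recursive reference is the `max(q[nxt])` term, which is the same shape as a dynamic programming recurrence, except the entries are estimated from sampled transitions instead of computed from a known model.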


Dynamic programming is one of the most widely used techniques in reinforcement learning, and in fact the tic-tac-toe example using dynamic programming is a classic demo in reinforcement learning classes and textbooks.
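For what it's worth, here is one way such a tic-tac-toe demo could look. This is a sketch of my own and not taken from any particular class or textbook: it computes the exact value of every reachable position by backward induction and caches each result, using top-down memoization rather than an explicit bottom-up table. The names `winner` and `value` and the 9-character string board encoding are assumptions.

```python
from functools import lru_cache

# The eight winning lines on a board stored as a 9-character string.
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
         (0, 3, 6), (1, 4, 7), (2, 5, 8),
         (0, 4, 8), (2, 4, 6)]

def winner(board):
    """Return "X" or "O" if that player has three in a row, else None."""
    for a, b, c in LINES:
        if board[a] != " " and board[a] == board[b] == board[c]:
            return board[a]
    return None

@lru_cache(maxsize=None)   # the cache is the table of solved positions
def value(board, player):
    """Value for X under optimal play: +1 X wins, 0 draw, -1 O wins.

    `player` is the side to move next.
    """
    w = winner(board)
    if w is not None:
        return 1 if w == "X" else -1
    if " " not in board:
        return 0
    nxt = "O" if player == "X" else "X"
    results = [value(board[:i] + player + board[i + 1:], nxt)
               for i, cell in enumerate(board) if cell == " "]
    # X picks the best outcome for X, O picks the worst outcome for X.
    return max(results) if player == "X" else min(results)

empty_value = value(" " * 9, "X")   # value of the empty board with X to move
```

Each position's value is defined in terms of its successors' values, and positions reached by different move orders are solved only once, which is the dynamic programming part.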



