cvl-robot's diary

研究ノート メモメモ https://github.com/dotchang/

逆強化学習の勉強を始めるためのリンク集

[0] https://people.eecs.berkeley.edu/~pabbeel/cs287-fa12/slides/inverseRL.pdf
[1] 逆強化学習を理解する - Qiita

[2] 逆強化学習のC言語による実装
逆強化学習をC言語で実装してみた - mabonki0725の日記
GitHub - mabonki0725/IRL_16cell
[3] DNNによる逆強化学習
逆強化学習の深層学習版をC言語で実装してみた - mabonki0725の日記
GitHub - mabonki0725/IRL_DNN
[4] ベイズによる逆強化学習
UC.Berklayの協業強化学習の論文を読む - mabonki0725の日記
GitHub - mabonki0725/IRL_bayes
[5] ガウス過程による逆強化学習
ガウス過程による逆強化学習の論文を読む - mabonki0725の日記
ガウス過程による逆強化学習を実装(python)してみる - mabonki0725の日記
GitHub - mabonki0725/IRL_GP

[6] [4]で紹介されていたベイズによる逆強化学習
ノンパラメトリックベイズを用いた逆強化学習
https://www.aaai.org/Papers/IJCAI/2007/IJCAI07-416.pdf
GitHub - erensezener/aima-based-irl: IRL implementation based on Norvig's AIMA code.
[7] GANとIRL
UC.Berkeleyの敵対的逆強化学習の論文を読む - mabonki0725の日記
[DL輪読会]逆強化学習とGANs

[8] Linear programming IRL, Maximum entropy IRL, Deep maximum entropy IRLのGridworldとObjectworldを対象としたPython実装
GitHub - MatthewJA/Inverse-Reinforcement-Learning: Implementations of selected inverse reinforcement learning algorithms.

[9] 読むべき論文まとめ
irl_rocks/papers at master · sjchoi86/irl_rocks · GitHub
https://github.com/sjchoi86/irl_rocks/blob/master/IRL_survey.pdf

http://ai.stanford.edu/~ang/papers/icml00-irl.pdf
www.andrewng.org
https://rl-tokyo.github.io/resource/20160518-RodeoBoy24420.pdf
1. Algorithms for Inverse Reinforcement Learning 2 - ppt video online download

http://ai.stanford.edu/~ang/papers/icml04-apprentice.pdf
http://www.r.dl.itc.u-tokyo.ac.jp/study_ml/pukiwiki/index.php?plugin=attach&refer=schedule%2F2007-06-07&openfile=Apprenticeship-pub.pdf
Apprenticeship learning - Wikipedia
www.youtube.com
github.com

http://martin.zinkevich.org/publications/maximummarginplanning.pdf
MMP - Optimized Robotics
Robot Intelligence Technology and Applications 4: Results from the 4th ... - Google ブックス

https://www.aaai.org/Papers/AAAI/2008/AAAI08-227.pdf
Max's Blog | Revisit Maximum Entropy Inverse Reinforcement Learning
GitHub - MatthewJA/Inverse-Reinforcement-Learning: Implementations of selected inverse reinforcement learning algorithms.

https://homes.cs.washington.edu/~zoran/gpirl.pdf
github.com

[1507.04888] Maximum Entropy Deep Inverse Reinforcement Learning
エネルギーベースの逆強化学習の論文を再読する - mabonki0725の日記
github.com
GitHub - MatthewJA/Inverse-Reinforcement-Learning: Implementations of selected inverse reinforcement learning algorithms.

www.slideshare.net

[1611.03852] A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models

www.slideshare.net
www.youtube.com

[1606.03476] Generative Adversarial Imitation Learning

www.slideshare.net
http://sssslide.com/speakerdeck.com/takoika/lun-wen-shao-jie-generative-adversarial-imitation-learningsssslide.com
github.com
github.com

[1603.00448] Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
www.youtube.com
機械学習論文読みメモ_24 - Qiita

[1612.04318] Incorporating Human Domain Knowledge into Large Scale Cost Function Learning

http://www.cs.ox.ac.uk/people/shimon.whiteson/pubs/shiarlisaamas16.pdf
Shimon Whiteson: Inverse Reinforcement Learning from Failure

[1612.02179] Model-based Adversarial Imitation Learning

[1607.02329] Watch This: Scalable Cost-Function Learning for Path Planning in Urban Environments
www.youtube.com

[1612.06699] Unsupervised Perceptual Rewards for Imitation Learning
sermanet.github.io
www.youtube.com

[10] Udacity
www.youtube.com