Inverse RL

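Max-ent IRL: learn a reward function from expert demonstrations. The reward is assumed linear in state features, and each update of the reward weights requires solving a forward RL problem under the current reward.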
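A minimal tabular sketch of the loop described above, assuming a known transition tensor and a fixed horizon. The names `P`, `features`, `demos`, `p0`, `lr`, `iters`, and the 5-state chain are illustrative assumptions, not part of the notes. The forward RL step is done here with finite-horizon soft value iteration, and the weights follow the gradient "expert feature counts minus expected feature counts".

```python
import numpy as np

def soft_policy(P, reward, horizon):
    """Forward RL step: finite-horizon soft value iteration.
    P has shape (S, A, S'); returns pi[t, s, a] for the horizon-1 transitions."""
    n_states, n_actions, _ = P.shape
    V = reward.copy()  # value at the final time step (no action taken there)
    pi = np.zeros((horizon - 1, n_states, n_actions))
    for t in reversed(range(horizon - 1)):
        Q = reward[:, None] + P @ V                  # shape (S, A)
        m = Q.max(axis=1)
        V = m + np.log(np.exp(Q - m[:, None]).sum(axis=1))  # stable log-sum-exp
        pi[t] = np.exp(Q - V[:, None])               # softmax over actions
    return pi

def expected_feature_counts(P, pi, p0, features):
    """Propagate the state distribution forward and sum features over time."""
    d = p0.copy()
    total = d.copy()
    for t in range(pi.shape[0]):
        d = np.einsum("s,sa,sap->p", d, pi[t], P)
        total += d
    return features.T @ total

def maxent_irl(P, features, demos, p0, lr=0.1, iters=200):
    """Gradient ascent on the max-ent log-likelihood with reward = features @ theta."""
    horizon = len(demos[0])  # assumes all demonstrations have the same length
    expert = np.mean([features[traj].sum(axis=0) for traj in demos], axis=0)
    theta = np.zeros(features.shape[1])
    for _ in range(iters):
        reward = features @ theta                    # linear reward
        pi = soft_policy(P, reward, horizon)         # solve forward RL
        theta += lr * (expert - expected_feature_counts(P, pi, p0, features))
    return theta

# Hypothetical toy problem: 5-state chain, action 0 = left, action 1 = right.
n = 5
P = np.zeros((n, 2, n))
for s in range(n):
    P[s, 0, max(s - 1, 0)] = 1.0
    P[s, 1, min(s + 1, n - 1)] = 1.0
features = np.eye(n)                 # one-hot state features
demos = [[0, 1, 2, 3, 4, 4, 4, 4]]   # expert walks right and stays at the end
p0 = np.zeros(n)
p0[0] = 1.0

theta = maxent_irl(P, features, demos, p0)
print(np.round(theta, 2))
```

In this toy setup the learned weight for the last state should come out largest, since the expert spends most of its time there. With a single deterministic demonstration the weights keep growing as `iters` increases, so a real run would likely add regularization or stop early.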