endtoend.ai

Studying Artificial Intelligence, from backbone to application.

All Stories

RL Weekly 6: AlphaStar, Rectified Nash Response, and Causal Reasoning with Meta RL

RL Weekly 6: AlphaStar, Rectified Nash Response, and Causal Reasoning with Meta RL

This week, we look at AlphaStar, a Starcraft II AI, PSRO_rN, an evaluation algorithm encouraging diverse population of well-trained agents, and a novel Meta-RL approach...

RL Weekly 5: Robust Control of Legged Robots, Compiler Phase-Ordering, and Go Explore on Sonic the Hedgehog

RL Weekly 5: Robust Control of Legged Robots, Compiler Phase-Ordering, and Go Explore on Sonic the Hedgehog

This week, we look at impressive robust control of legged robots by ETH Zurich and Intel, compiler phase-ordering by UC Berkeley and MIT, and a...

RL Weekly 4: Generating Problems with Solutions, Optical Flow with RL, and Model-free Planning

RL Weekly 4: Generating Problems with Solutions, Optical Flow with RL, and Model-free Planning

In this issue, we introduce new curriculum learning algorithm by Uber AI Labs, model-free planning algorithm by DeepMind, and optical-flow based control algorithm by Intel...

RL Weekly 3: Learning to Drive through Dense Traffic, Learning to Walk, and Summarizing Progress in Sim-to-Real

RL Weekly 3: Learning to Drive through Dense Traffic, Learning to Walk, and Summarizing Progress in Sim-to-Real

In this issue, we introduce the DeepTraffic competition from Lex Fridman's MIT Deep Learning for Self-Driving Cars course. We also review a new paper on...

PyTorch Implementations of Policy Gradient Methods

PyTorch Implementations of Policy Gradient Methods

A well-written baseline is crucial to research. We compare and recommend popular open source implementations of reinforcement learning algorithms in PyTorch.

RL Weekly 2: Tuning AlphaGo, Macro-strategy for MOBA, Sim-to-Real with conditional GANs

RL Weekly 2: Tuning AlphaGo, Macro-strategy for MOBA, Sim-to-Real with conditional GANs

In this issue, we discuss hyperparameter tuning for AlphaGo from DeepMind, Hierarchical RL model for a MOBA game from Tencent, and GAN-based Sim-to-Real algorithm from...

Never miss an issue of RL Weekly from us, subscribe to our newsletter

Explore →

learn (2) personal (4) explore (5) paper-unraveled (3) rl-weekly (44) blogging (2) tutorial (9) release (1)