WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning | Qwik News

WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning

22 points by theredsix 2 days ago | 1 comments

HellsMaddy 2 days ago |
Repo seems to be here: https://github.com/THUDM/WebRL