Qwik News
new
best
WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning
22 points by theredsix 2 days ago |
1 comments
HellsMaddy 2 days ago |
[ - ]
Repo seems to be here:
https://github.com/THUDM/WebRL