Show HN: I built an integration for RL training of browser agents for everyone

7 points
a day ago
by filtr12

Comments


nithisha2201

Interesting. How do you handle the observability side during training? One thing I ran into with multi-agent RL is that reward signals alone don't tell you much about why an agent is failing. Curious if you've built any tooling around that.

a day ago