meditation

Recent Learnings

RLVR

Reinforcement Learning with Verifiable Rewards (RLVR) is something I very much want to try.
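
To make "verifiable reward" concrete for myself, here is a minimal sketch, assuming a toy single-turn arithmetic task. The names `verify_answer` and `ArithmeticEnv` are made up for illustration and do not come from any framework. The point is only that the reward comes from a programmatic check rather than a learned reward model.

```python
import re
from dataclasses import dataclass


def verify_answer(completion: str, expected: str) -> float:
    """Return 1.0 if the last integer in the completion matches `expected`, else 0.0."""
    numbers = re.findall(r"-?\d+", completion)
    if not numbers:
        return 0.0  # nothing to check, so no reward
    return 1.0 if numbers[-1] == expected else 0.0


@dataclass
class ArithmeticEnv:
    """Hypothetical single-turn environment: one prompt, one checked answer."""
    a: int
    b: int

    def prompt(self) -> str:
        return f"What is {self.a} + {self.b}?"

    def step(self, completion: str) -> float:
        # The reward is computed by a deterministic verifier, not a model.
        return verify_answer(completion, str(self.a + self.b))


env = ArithmeticEnv(2, 3)
reward_correct = env.step("The answer is 5")
reward_wrong = env.step("I think it is 6")
```

In a real training loop the completion would come from the policy being trained, and the reward would feed a policy-gradient update; both are left out here.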

The open-source ecosystem offers several resources and frameworks for building and integrating environments for agent training.

Other things