Hi everyone,
I am preparing to submit an independent research paper to arXiv under the cs.LG (Machine Learning) category, and I am looking for an endorser so that I can meet arXiv's endorsement requirement for new submitters.
The paper formalizes “Hopper,” a variant of the Muon optimizer that I adapted specifically for RL fine-tuning pipelines such as GRPO. I recently shared some of my empirical findings on the forum, specifically how reducing the Newton-Schulz step count to ns_steps=1 creates a “lazy orthogonality” that accelerates early reasoning discovery. You can see the thread here: Hopper — partial orthogonalization changes early reasoning behavior in RL.
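For anyone who has not used the ns_steps knob, here is a minimal sketch of what it controls. This is not the Hopper code. It is a dependency-free illustration of the quintic Newton-Schulz iteration that Muon uses to approximately orthogonalize an update matrix. The coefficients are assumed to match the public Muon reference implementation, the helper names (`newton_schulz`, `orthogonality_error`) are hypothetical, and the sketch assumes the matrix has no more rows than columns.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    return [list(row) for row in zip(*a)]


def newton_schulz(g, ns_steps, eps=1e-7):
    """Approximately orthogonalize g with ns_steps quintic iterations.

    Assumption: coefficients taken from the public Muon reference code.
    Assumption: g has no more rows than columns.
    """
    a, b, c = 3.4445, -4.7750, 2.0315
    # Scale by the Frobenius norm so every singular value is at most 1.
    norm = math.sqrt(sum(v * v for row in g for v in row))
    x = [[v / (norm + eps) for v in row] for row in g]
    for _ in range(ns_steps):
        gram = matmul(x, transpose(x))   # X X^T
        gram_sq = matmul(gram, gram)     # (X X^T)^2
        poly = [[b * p + c * q for p, q in zip(r1, r2)]
                for r1, r2 in zip(gram, gram_sq)]
        correction = matmul(poly, x)
        # X <- a*X + (b*XX^T + c*(XX^T)^2) X
        x = [[a * u + v for u, v in zip(r1, r2)]
             for r1, r2 in zip(x, correction)]
    return x


def orthogonality_error(x):
    """Largest absolute entry of X X^T - I (0 for exactly orthonormal rows)."""
    gram = matmul(x, transpose(x))
    n = len(gram)
    return max(abs(gram[i][j] - (1.0 if i == j else 0.0))
               for i in range(n) for j in range(n))


if __name__ == "__main__":
    g = [[1.0, 0.0], [0.0, 0.01]]  # toy, badly conditioned "gradient"
    for steps in (1, 5):
        print(steps, orthogonality_error(newton_schulz(g, steps)))
```

On a badly conditioned toy matrix like the one above, a single step leaves the weak direction far from unit scale, while more steps pull all singular values toward a band around 1. That partially orthogonalized single-step regime is what the ns_steps=1 setting refers to.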
You might also recognize me from my technical deep-dive earlier this month on numerical divergence in hybrid models.
If anyone here is an active arXiv author with cs.LG endorsement privileges and would be willing to take a quick look at my draft to endorse it, please let me know! I am more than happy to share the full PDF and the open-source training scripts via DM.
Thanks so much, Jen