> tell me about sara's research

here are my recent publications

Papers

Code-Game Setter def f(x): return x**2+3 f(7)=? Solver 7**2 + 3 = 52 exec verified Math-Game Setter ∫ x² dx from 0..2 solve? Solver x³/3 |₀² = 8/3 LLM judged vs

SAGE: Self-play Adversarial Games Enhance Large Language Model Reasoning Capabilities

Amjith, S., Wang, M.X., Lynch, J., Gundlach, H., & Thompson, N.

RSI @ ICLR 2026

A framework for improving LLM reasoning through adversarial self-play where a Setter generates challenging problems and a Solver attempts to solve them, achieving up to +10% on MATH and +8% on MBPP with cross-domain transfer.

Image + Q VLM Qwen SQ1: What vehicle? A1 SQ2: How many seats? A2 SQ3: People per seat? A3 Final: 2 GRPO Reward Format + Correct +1 Wrong / No format -1 RL update CLEVR 52.2% A-OKVQA 51.6%

Self-Questioning Vision-Language Models: Reinforcement Learning for Compositional Visual Reasoning

Amjith, S.

E23D @ CVPR 2026

A self-questioning framework that trains a VLM to decompose compositional visual questions into sub-questions using GRPO, without any reasoning demonstrations. Applied to a 3B-parameter model on CLEVR and A-OKVQA, both self-questioning and standard RL substantially improve accuracy over the untrained model.