Research
A curated list of my publications and invited talks. Author order and venues mirror the CV.
2025
-
arXivRubrics as Rewards: Reinforcement Learning Beyond Verifiable DomainsarXiv preprint, 2025.
-
ICLRRefusal-Trained LLMs Are Easily Jailbroken As Browser AgentsInternational Conference on Learning Representations (ICLR), 2025.
-
Under ReviewAdaptive Guidance Accelerates Reinforcement Learning of Reasoning ModelsUnder review, 2025.
2024
2023
-
NeurIPS WS
-
ICML WSAn Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNetsICML 2023 Workshop.
-
AAAIDeep Conservative Reinforcement Learning for Personalization of Mechanical Ventilation TreatmentAAAI, 2023. *Equal contributions.
2022
-
EMNLP (Ind.)Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service SupportEMNLP 2022 Industry Track.
2021
-
Long-PEGASUS
Patent
-
USHeuristic-Systematic Decision-Making Model for User Feedback Based on User Behavior and System TelemetryUS Patent App. US20240037346A1.
Invited Talks
-
AAAI WSRefusal-Trained LLMs Are Easily Jailbroken As Browser AgentsAAAI @ Web Agents, 2025 — Invited Talk.