publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

  1. ICML
    DenseSteer: Steering Small Language Models towards Dense Math Reasoning
    Yang Ouyang, Shuhang Lin, and Jung-Eun Kim
    2026

2025

  1. ICML
    Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
    Zhihao Dou, Qinjian Zhao, Zhongwei Wan, Dinggen Zhang, Weida Wang, Towsif Raiyan, Benteng Chen, Qingtao Pan, Yang Ouyang, Zhiqiang Gao, and others
    arXiv preprint arXiv:2510.01833, 2025
  2. NAACL
    Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense
    Yang Ouyang, Hengrui Gu, Shuhang Lin, Wenyue Hua, Jie Peng, Bhavya Kailkhura, Meijun Gao, Tianlong Chen, and Kaixiong Zhou
    In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
  3. ICLR
    Min-k%++: Improved baseline for detecting pre-training data from large language models
    Jingyang Zhang*, Jingwei Sun*, Eric Yeats, Yang Ouyang, Martin Kuo, Jianyi Zhang, Hao Frank Yang, and Hai Li
    The Thirteenth International Conference on Learning Representations, 2025