2025
- Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in RoboticsIn Advances in Neural Information Processing Systems (NeurIPS), 2025
- Improving Chemical Understanding of LLMs via SMILES ParsingIn Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
- Structural Reasoning Improves Molecular Understanding of LLMIn Annual Meeting of the Association for Computational Linguistics (ACL), 2025
- Alternative Mixed Integer Linear Programming Optimization for Joint Job Scheduling and Data Allocation in Grid ComputingFuture Generation Computer Systems, 2025
- Align to Misalign: Automatic LLM Jailbreak with Meta-Optimized LLM JudgesarXiv preprint, 2025
- Revisiting the Uniform Information Density Hypothesis in LLM Reasoning TracesarXiv preprint, 2025
- TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRAarXiv preprint, 2025
- Prior-based Noisy Text Data Filtering: Fast and Strong Alternative for PerplexityarXiv preprint, 2025
-
- LLMs Think, But Not in Your Flow: Reasoning-Level Personalization for Black-Box Large Language ModelsarXiv preprint, 2025
2024
- Can LLMs Generate Diverse Molecules? Towards Alignment with Structural DiversityarXiv preprint, 2024
-
-
- Meta-Crafting: Improved Detection of Out-of-distributed Texts via Crafting Metadata SpaceAAAI 2024 Student Abstract, 2024
2023
2022
- What Makes Better Augmentation Strategies? Augment Difficult but Not too DifferentIn International Conference on Learning Representations (ICLR), 2022
- Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute EstimationIn International Conference on Learning Representations (ICLR), 2022
2020
- M2m: Imbalanced Classification via Major-to-minor TranslationIn Conference on Computer Vision and Pattern Recognition (CVPR), 2020