publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2025

  1. DataMan: Data Manager for Pre-training Large Language Models
    Anonymous
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. TableGPT2: A Large Multimodal Model with Tabular Data Integration
    Aofeng Su, Aowen Wang, Chao Ye, and 8 more authors
    arXiv preprint arXiv:2411.02059, 2024
  2. Navigate Complex Physical Worlds via Geometrically Constrained LLM
    Yongqiang Huang, Wentao Ye, Liyao Li, and 1 more author
    In Proceedings of the 1st Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U), 2024
  3. From Laws to Motivation: Guiding Exploration through Law-Based Reasoning and Rewards
    Ziyu Chen, Zhiqing Xiao, Xinbei Jiang, and 1 more author
    In Intrinsically-Motivated and Open-Ended Learning Workshop @ NeurIPS 2024, 2024
  4. On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
    Lin Long, Rui Wang, Ruixuan Xiao, and 4 more authors
    In Findings of the Association for Computational Linguistics: ACL 2024, 2024
  5. Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection
    Xiaomeng Hu, Yiming Zhang, Ru Peng, and 4 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
  6. A Comparative Study on Reasoning Patterns of OpenAI’s o1 Model
    Siwei Wu, Zhongyuan Peng, Xinrun Du, and 8 more authors
    arXiv preprint arXiv:2410.13639, 2024
  7. When Quantum Computing Meets Database: A Hybrid Sampling Framework for Approximate Query Processing
    Sai Wu, Meng Shi, Dongxiang Zhang, and 3 more authors
    IEEE Transactions on Knowledge and Data Engineering, 2024
  8. FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents
    Ruixuan Xiao, Wentao Ma, Ke Wang, and 5 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2024, 2024