Publications

Publications by categories in reversed chronological order.

2024

  1. LLM
    look.png
    Look before you leap: An exploratory study of uncertainty measurement for large language models
    Yuheng Huang, Jiayang Song, Zhijie Wang, and 4 more authors
    IEEE Transactions on Software Engineering., 2024
  2. LLM-Robotics
    ISR.png
    ISR-llm: Iterative self-refined large language model for long-horizon sequential task planning
    Zhehua Zhou, Jiayang Song, Kunpeng Yao, and 2 more authors
    In 2024 IEEE International Conference on Robotics and Automation (ICRA), 2024
  3. AI-CPS
    Robotics-benchmark.png
    Towards building AI-CPS with NVIDIA Isaac sim: An industrial benchmark and case study for robotics manipulation
    Zhehua Zhou, Jiayang Song, Xuan Xie, and 5 more authors
    In Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Practice, 2024
  4. LLM
    Online-Safety-Analysis.png
    Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
    Xuan Xie, Jiayang Song, Zhehua Zhou, and 3 more authors
    Under Review, 2024
  5. VLA
    vla-testing.png
    Towards testing and evaluating vision-language-action models for robotic manipulation: An empirical study
    Zhijie Wang, Zhehua Zhou, Jiayang Song, and 3 more authors
    Under Review, 2024
  6. LLM
    luna.png
    LUNA: A Model-Based Universal Analysis Framework for Large Language Models
    Da Song, Xuan Xie, Jiayang Song, and 4 more authors
    IEEE Transactions on Software Engineering, 2024
  7. AI-CPS
    mortar.png
    MORTAR: A Model-based Runtime Action Repair Framework for AI-enabled Cyber-Physical Systems
    Renzhi Wang, Zhehua Zhou, Jiayang Song, and 3 more authors
    Under Review, 2024
  8. LLM
    actracer.png
    Active Testing of Large Language Model via Multi-Stage Sampling
    Yuheng Huang, Jiayang Song, Qiang Hu, and 2 more authors
    Under Review, 2024
  9. LLM
    multilingual.png
    Multilingual blending: Llm safety alignment evaluation with language mixture
    Jiayang Song, Yuheng Huang, Zhehua Zhou, and 1 more author
    Under Review, 2024
  10. LLM
    rag.png
    Towards Understanding Retrieval Accuracy and Prompt Quality in RAG Systems
    Shengming Zhao, Yuheng Huang, Jiayang Song, and 3 more authors
    Under Review, 2024
  11. Robotics-RL
    tnnls.png
    GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
    Zhehua Zhou, Xuan Xie, Jiayang Song, and 2 more authors
    IEEE Transactions on Neural Networks and Learning Systems, 2024
  12. LLM
    LADEV.png
    LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation
    Zhijie Wang, Zhehua Zhou, Jiayang Song, and 3 more authors
    Under Review, 2024
  13. LLM
    LADEV.png
    LeCov: Multi-level Testing Criteria for Large Language Models
    Xuan Xie, Jiayang Song, Yuheng Huang, and 4 more authors
    Under Review, 2024

2023

  1. LLM-Robotics
    reward.png
    Self-refined large language model as automated reward function designer for deep reinforcement learning in robotics
    Jiayang Song, Zhehua Zhou, Jiawei Liu, and 3 more authors
    Under Review, 2023
  2. AI-CPS
    siege.png
    SIEGE: A Semantics-Guided Safety Enhancement Framework for AI-Enabled Cyber-Physical Systems
    Jiayang Song, Xuan Xie, and Lei Ma
    IEEE Transactions on Software Engineering, 2023
  3. AI-CPS
    mosaic.png
    Mosaic: Model-based Safety Analysis Framework for AI-enabled Cyber-Physical Systems
    Xuan Xie, Jiayang Song, Zhehua Zhou, and 2 more authors
    Under Review, 2023
  4. AI-CPS
    auto-repair.png
    Autorepair: Automated repair for ai-enabled cyber-physical systems under safety-critical conditions
    Deyun Lyu, Jiayang Song, Zhenya Zhang, and 4 more authors
    Under Review, 2023

2022

  1. AI-CPS
    ai-cps-benchmark.png
    When cyber-physical systems meet AI: A benchmark, an evaluation, and a way forward
    Jiayang Song, Deyun Lyu, Zhenya Zhang, and 3 more authors
    In Proceedings of the 44th International Conference on Software Engineering: Software Engineering in Practice, 2022