<h1 align = "center">Large Language Models for Software Engineering</h1> <p align="center"> <a href="https://arxiv.org/abs/2312.15223"><img src="https://img.shields.io/badge/arXiv-2312.15223-blue.svg"></a> <img src="https://img.shields.io/github/stars/iSEngLab/AwesomeLLM4SE?color=yellow&label=Stars"> </p>

Title: A Survey on Large Language Models for Software Engineering

Authors: Quanjun Zhang, Chunrong Fang, Yang Xie, Yaxin Zhang, Yun Yang, Weisong Sun, Shengcheng Yu, Zhenyu Chen

A collection of academic publications and methodologies on the classification of Code Large Language Models' pre-training tasks, downstream tasks, and the application of Large Language Models in the field of Software Engineering (LLM4SE).

We welcome all researchers to contribute to this repository and help expand the body of knowledge on Large Language Models for Software Engineering. If you know of any related references, please let us know via a GitHub issue or pull request.

👏 Citation

@article{zhang2023survey,
  title={A Survey on Large Language Models for Software Engineering},
  author={Zhang, Quanjun and Fang, Chunrong and Xie, Yang and Zhang, Yaxin and Yang, Yun and Sun, Weisong and Yu, Shengcheng and Chen, Zhenyu},
  journal={arXiv preprint arXiv:2312.15223},
  year={2023}
}

🔥🔥 New Papers

🔥PyGen: A Collaborative Human-AI Approach to Python Package Creation ——Package Creation [2024-arXiv]
🔥Code-mixed LLM: Improve Large Language Models' Capability to Handle Code-Mixing through Reinforcement Learning from AI Feedback [2024-arXiv]
🔥Fuzzing Robotic Software Using HPC [2024-arXiv]
🔥InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models [2024-NeurIPS]
🔥Transforming the field of Vulnerability Prediction: Are Large Language Models the key?
🔥IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code Programs [2024-NeurIPS]
🔥A Comprehensive Survey of AI-Driven Advancements and Techniques in Automated Program Repair and Code Generation [2024-arXiv]
🔥Towards the Integration of Large Language Models and Automatic Assessment Tools: Enhancing Student Support in Programming Assignments [2024-Koli Calling]
🔥Semantic Error Detection in Code Translation Using Knowledge-Driven Static Analysis with AI Chain
🔥RUG: Turbo LLM for Rust Unit Test Generation
🔥ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Models for Code Generation [2024-arXiv]
🔥Automatically Write Code Checker: An LLM-based Approach with Logic-guided API Retrieval and Case by Case Iteration [2024-arXiv]
🔥LeDex: Training LLMs to Better Self-Debug and Explain Code [2024-NeurIPS]
🔥EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization [2024-NeurIPS]
🔥SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents [2024-NeurIPS]
🔥ACES: Generating a Diversity of Challenging Programming Puzzles with Autotelic Generative Models [2024-NeurIPS]
🔥LProtector: An LLM-driven Vulnerability Detection System [2024-arXiv]
🔥An Empirical Study on the Potential of LLMs in Automated Software Refactoring [2024-arXiv]
🔥DSLXpert: LLM-driven Generic DSL Code Generation [2024-arXiv]
🔥Assessing the Answerability of Queries in Retrieval-Augmented Code Generation [2024-arXiv]
🔥Smart-LLaMA: Two-Stage Post-Training of Large Language Models for Smart Contract Vulnerability Detection and Explanation [2024-arXiv]
🔥Escalating LLM-based Code Translation Benchmarking into the Class-level Era [2024-arXiv]
🔥CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement [2024-arXiv]
🔥CoCoP: Enhancing Text Classification with LLM through Code Completion Prompt [2024-arXiv]
🔥ProConSuL: Project Context for Code Summarization with LLMs [2024-arXiv]
🔥CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models [2024-arXiv]
🔥CORE: Resolving Code Quality Issues using LLMs [2024-FSE]
🔥A deep dive into large language models for automated bug localization and repair [2024-FSE]
🔥Prompt Fix: Vulnerability Automatic Repair Technology Based on Prompt Engineering [2024-ICNC]
🔥Evaluating Large Language Models for Real-World Vulnerability Repair in C/C++ Code [2024-IWSPA]
🔥Investigating large language models capabilities for automatic code repair in Python [2024-Cluster Computing]
🔥LPR: Large Language Models-Aided Program Reduction [2024-ISSTA]
🔥A Case Study of LLM for Automated Vulnerability Repair: Assessing Impact of Reasoning and Patch Validation Feedback [2024-AIware]
🔥When Large Language Models Confront Repository-Level Automatic Program Repair: How Well They Done? [2024-ICSE]
🔥Automated Validation of COBOL to Java Transformation [2024-ASE]
🔥LLM4Workflow: An LLM-based Automated Workflow Model Generation Tool [2024-ASE]
🔥LLM-Based Java Concurrent Program to ArkTS Converter [2024-ASE]
🔥PACGBI: A Pipeline for Automated Code Generation from Backlog Items [2024-ASE]
🔥Attacks and Defenses for Large Language Models on Coding Tasks [2024-ASE]
🔥Unity Is Strength: Collaborative LLM-Based Agents for Code Reviewer Recommendation [2024-ASE]
🔥Bridging Gaps in LLM Code Translation: Reducing Errors with Call Graphs and Bridged Debuggers [2024-ASE]
🔥Using LLM for Mining and Testing Constraints in API Testing [2024-ASE]
🔥AdvSCanner: Generating Adversarial Smart Contracts to Exploit Reentrancy Vulnerabilities Using LLM and Static Analysis [2024-ASE]
🔥iSMELL: Assembling LLMs with Expert Toolsets for Code Smell Detection and Refactoring [2024-ASE]
🔥Effective Vulnerable Function Identification based on CVE Description Empowered by Large Language Models [2024-ASE]
🔥WaDec: Decompiling WebAssembly Using Large Language Model [2024-ASE]
🔥Exploring Parameter-Efficient Fine-Tuning of Large Language Model on Automated Program Repair [2024-ASE]
🔥Understanding Code Changes Practically with Small-Scale Language Models [2024-ASE]
🔥Leveraging Large Language Model to Assist Detecting Rust Code Comment Inconsistency [2024-ASE]
🔥Semantic Sleuth: Identifying Ponzi Contracts via Large Language Models [2024-ASE]
🔥Test-Driven Development and LLM-based Code Generation [2024-ASE]
🔥Enhancing Software Design and Developer Experience Via LLMs [2024-ASE]
🔥Test smells in LLM-Generated Unit Tests [2024-arXiv]
🔥Effi-Code: Unleashing Code Efficiency in Language Models [2024-arXiv]
🔥Agent-as-a-Judge: Evaluate Agents with Agents [2024-arXiv]
🔥Unraveling the Potential of Large Language Models in Code Translation: How Far Are We? [2024-arXiv]
🔥Generalized Adversarial Code-Suggestions: Exploiting Contexts of LLM-based Code-Completion [2024-arXiv]
🔥A Model Is Not Built By A Single Prompt: LLM-Based Domain Modeling With Question Decomposition [2024-arXiv]
🔥Advancing Bug Detection in Fastjson2 with Large Language Models Driven Unit Test Generation [2024-arXiv]
🔥Large Language Models for Energy-Efficient Code: Emerging Results and Future Directions [2024-arXiv]
🔥Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation [2024-arXiv]
🔥Towards Trustworthy LLMs for Code: A Data-Centric Synergistic Auditing Framework [2024-arXiv]
🔥COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement [2024-arXiv]
🔥Decoding Secret Memorization in Code LLMs Through Token-Level Characterization [2024-arXiv]
🔥Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs [2024-arXiv]
🔥Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example [2024-arXiv]
🔥PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents [2024-arXiv]
🔥AutoComply: Automating Requirement Compliance in Automotive Integration Testing [2024-CSE]
🔥TableAnalyst: an LLM-agent for tabular data analysis-Implementation and evaluation on tasks of varying complexity [2024-CSE]
🔥Using Learning from Answer Sets for Robust Question Answering with LLM [2024-LPNMR]
🔥Mitigating Gender Bias in Code Large Language Models via Model Editing [2024-arXiv]
🔥RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? [2024-arXiv]
🔥VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search [MATH-AI]
🔥Exploring and Lifting the Robustness of LLM-powered Automated Program Repair with Metamorphic Testing [2024-arXiv]
🔥REDO: Execution-Free Runtime Error Detection for COding Agents [2024-arXiv]
🔥Checker Bug Detection and Repair in Deep Learning Libraries [2024-arXiv]
🔥Automating and Validating Agent and Environment Code Generation with Large Language Models [2024-NeurIPS]
🔥Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach[2025-ICLR]
🔥Assessing Code Clone Detection Capabilities of Large Language Models on Human and AI-Generated Code: Zero-Shot and Fine-Tuning Approaches
🔥Codepori: Large-Scale System for Autonomous Software Development Using Multi-Agent Tec
