AIware 2025
Wed 19 - Thu 20 November 2025
co-located with ASE 2025
Wed 19 Nov 2025 09:20 - 09:28 at Grand Hall 1 - AIware & Security Chair(s): Weiyi Shang

Modern software package registries like PyPI have become critical infrastructure for software development, but are increasingly exploited by threat actors distributing malicious packages with sophisticated multi-stage attack chains. While Large Language Models (LLMs) offer promising capabilities for automated code analysis, their application to security-critical malware detection faces fundamental challenges, including hallucination and context confusion, which can lead to missed detections or false alarms. We present CHASE (Collaborative Hierarchical Agents for Security Exploration), a high-reliability multi-agent architecture that addresses these limitations through a Plan-and-Execute coordination model, specialized Worker Agents focused on specific analysis aspects, and integration with deterministic security tools for critical operations. Our key insight is that reliability in LLM-based security analysis emerges not from improving individual model capabilities but from architecting systems that compensate for LLM weaknesses while leveraging their semantic understanding strengths. Evaluation on a dataset of 3,000 packages (500 malicious, 2,500 benign) demonstrates that CHASE achieves 98.4% recall with only 0.08% false positive rate, while maintaining a practical median analysis time of 4.5 minutes per package, making it suitable for operational deployment in automated package screening. Furthermore, we conducted a survey with cybersecurity professionals to evaluate the generated analysis reports, identifying their key strengths and areas for improvement. This work provides a blueprint for building reliable AI-powered security tools that can scale with the growing complexity of modern software supply chains. Our project page is available at: https://t0d4.github.io/CHASE-AIware25/

CHASE_AIware25.pdf (CHASE_AIware25.pdf)726KiB

Wed 19 Nov

Displayed time zone: Seoul change

09:20 - 10:30
AIware & SecurityMain Track at Grand Hall 1
Chair(s): Weiyi Shang University of Waterloo
09:20
8m
Talk
CHASE: LLM Agents for Dissecting Malicious PyPI Packages
Main Track
Takaaki Toda Waseda University, Tatsuya Mori Waseda University
File Attached
09:28
8m
Talk
CFCEval: Evaluating Security Aspects in Code Generated by Large Language Models
Main Track
Cheng Cheng Concordia University, Jinqiu Yang Concordia University
Pre-print
09:36
8m
Talk
Security in the Wild: An Empirical Analysis of LLM-Powered Applications and Local Inference Frameworks
Main Track
Julia Gomez-Rangel Texas A&M University - Corpus Christi, Young Lee Texas A & M University - San Antonio, Bozhen Liu Texas A&M University - Corpus Christi
Pre-print
09:44
8m
Talk
How Quantization Impacts Privacy Risk on LLMs for Code?
Main Track
Md Nazmul Haque North Carolina State University, Hua yang North Carolina State University, Zhou Yang University of Alberta, Alberta Machine Intelligence Institute , Bowen Xu North Carolina State University
Pre-print
09:52
8m
Talk
Securing the Multi-Chain Ecosystem: A Unified, Agent-Based Framework for Vulnerability Repair in Solidity and Move
Main Track
Rabimba Karanjai University of Houston, Lei Xu Kent State University, Weidong Shi University of Houston
10:00
8m
Talk
SEALGuard: Safeguarding the Multilingual Conversations in Southeast Asian Languages for AI-Powered Software
Main Track
Wenliang Shan Monash University, Michael Fu The University of Melbourne, Rui Yang Monash University and Transurban, Kla Tantithamthavorn Monash University and Atlassian
Pre-print File Attached
10:10
20m
Live Q&A
Joint Q&A and Discussion #AISecurity
Main Track