AIware 2025
Wed 19 - Thu 20 November 2025
co-located with ASE 2025

Thu 20 Nov

Displayed time zone: Seoul

15:00 - 15:29
Evaluation Frameworks, and Quantitative Assessment of LLMs (Part 1)
Benchmark & Dataset Track / Main Track at Grand Hall 1
Chair(s): Zhou Yang University of Alberta, Alberta Machine Intelligence Institute
15:00
8m
Talk
Automated Extract Method Refactoring with Open-Source LLMs: A Comparative Study
Main Track
Sivajeet Chand Technical University of Munich, Melih Kilic Technical University of Munich, Roland Würsching Technical University of Munich, Sushant Kumar Pandey University of Groningen, The Netherlands, Alexander Pretschner TU Munich
Pre-print
15:08
8m
Talk
Benchmarking Web API Integration Code Generation
Benchmark & Dataset Track
Daniel Maninger TU Darmstadt, Leon Chemnitz TU Darmstadt, Amir Molzam Sharifloo, Mira Mezini TU Darmstadt; hessian.AI; National Research Center for Applied Cybersecurity ATHENE
Pre-print
15:16
8m
Talk
From Search to Reasoning: A Five-Level RAG Capability Framework for Enterprise Data
Benchmark & Dataset Track
Gurbinder Gill, Ritvik Gupta Carnegie Mellon University, USA, Denis Lusson, Anand Chandrashekar, Donald Nguyen Corvic AI
Pre-print
15:24
5m
Talk
SWE-Sharp-Bench: A Reproducible Benchmark for C# Software Engineering Tasks
Benchmark & Dataset Track
Sanket Mhatre Microsoft, Yasharth Bajpai Microsoft, Sumit Gulwani Microsoft, Emerson Murphy-Hill Microsoft, Gustavo Soares Microsoft
Pre-print