Can LLMs model real-world systems in TLA+? 

Editors’ note: AI has been actively pushing the frontier of applied formal methods for computing systems. In this article, the Specula team describes its experience evaluating LLMs on modeling system code, the basic capability behind agentic model checking, using TLA+, a specification language for concurrent and distributed systems. The article is the 7th …

Lessons from Five Years of Artifact Evaluation at EuroSys

Reproducibility is a cornerstone of scientific progress, yet in systems research, it remains a persistent challenge. Over the past five years, the Artifact Evaluation (AE) process at the European Conference on Computer Systems (EuroSys) has aimed to address this challenge head-on. In this blog post, we reflect on what we have learned from organizing AE …

Artifact evaluation, theory and practice

Previously on the SIGOPS blog, we discussed the current status of artifact evaluation (AE) and how to improve it. At HotOS’23, we took another step in this direction: attendees discussed artifact evaluation in a panel led by Shriram Krishnamurthi (Brown), Margo Seltzer (UBC), and Neeraja Yadwadkar (UT Austin), and organized by Roberta De Viti (MPI-SWS), …