The benchmark for measuring how well AI models understand Roblox — from Luau scripting and the Roblox API to game architecture and Studio workflows.
Overall scores across all Roblox knowledge categories. Scored out of 100. Higher is better.
Strengths, weaknesses, and key findings from each evaluated model.
Each model is evaluated across six core areas of Roblox development knowledge.
Luau syntax, type annotations, idiomatic patterns, and correct usage of Roblox services, instances, methods, and events.
RemoteEvents, RemoteFunctions, client-server architecture, replication, and secure communication patterns.
Identifying bugs, reading error output, diagnosing common Roblox issues, and producing correct fixes.
DataStoreService reliability, session locking, retry logic, data migration, and preventing data loss.
ScreenGui structure, UIListLayout, responsive scaling, tween animations, and polished player-facing interfaces.
Optimization patterns, memory management, efficient loops, throttling, and building games that scale with player count.
A structured approach to evaluating Roblox-specific AI capabilities.
Questions crafted by experienced Roblox developers covering real-world scenarios and edge cases.
Each model receives identical prompts with no prior context. Responses collected via official APIs.
Responses scored by Roblox developers on correctness, completeness, and best practices.