Contest hosted at the FPL'26 conference.
Teams will be scored as follows:
inf) assign the same rank to each
ℹ️ NOTE
The contest organizers reserve the right to disqualify poorly performing submissions.
For example, if a contestant's submission improves a particular design by 50.0 MHz, uses $0.25 USD in OpenRouter tokens, and runs in 1200 seconds, the benchmark score would be:
α = 50.0 (MHz improvement), β = 0.25 (USD spent), γ = 1200/3600 (runtime in hours)
Benchmark Score = 50.0 - (0.1 * 50.0) * 0.25 - (0.1 * 50.0) * 1200/3600 = 47.083
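The arithmetic in this example can be reproduced with a short Python sketch. The formula is inferred from the worked example only (each of the cost and runtime terms subtracts 10% of the improvement per USD and per hour, respectively); the function and parameter names are hypothetical and not part of any contest tooling.

```python
def benchmark_score(fmax_gain_mhz: float, api_cost_usd: float, runtime_s: float) -> float:
    """Score for one benchmark, as inferred from the worked example.

    alpha: frequency improvement in MHz
    beta:  OpenRouter spend in USD
    gamma: runtime in hours (seconds / 3600)
    """
    alpha = fmax_gain_mhz
    beta = api_cost_usd
    gamma = runtime_s / 3600.0
    # Each penalty term scales with 10% of the improvement itself.
    return alpha - (0.1 * alpha) * beta - (0.1 * alpha) * gamma


if __name__ == "__main__":
    # Same inputs as the example: 50.0 MHz, $0.25, 1200 s.
    print(round(benchmark_score(50.0, 0.25, 1200), 3))  # prints 47.083
```

Under this reading, a submission that spends nothing and finishes instantly would score exactly its MHz improvement, and both penalties grow in proportion to that improvement.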
Over time, we plan to release a number of additional public benchmarks on which all competing submissions will be evaluated. Contestants will also be evaluated on a set of hidden benchmarks which will not be made public until after the contest has concluded.
Since testing and validation will occur on AWS instances, runtime is limited to 1 hour of wall-clock time per benchmark in the contest runtime environment. When the hour expires, the last solution design written by the team's submission will be scored and validated. Teams should therefore write the best solution they have found so far to the output DCP file location as they go.
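One way to keep a valid best-so-far result at the output location is to write each improved candidate to a temporary file and then swap it into place, so a timeout never catches a half-written file. The sketch below is an assumption about how a team might structure this, not contest-provided code: the helper name, the raw-bytes stand-in for a DCP, and the score bookkeeping are all hypothetical, and a real flow would have its own tool write the checkpoint to the temporary path instead.

```python
import os
import tempfile


def save_if_better(candidate: bytes, score: float, best_score: float,
                   output_path: str) -> float:
    """Replace the file at output_path only when `score` beats `best_score`.

    `candidate` stands in for the contents of a design checkpoint.
    Returns the best score after the call.
    """
    if score <= best_score:
        return best_score  # keep the existing output untouched

    # Create the temporary file in the same directory so that os.replace
    # is a same-filesystem rename and the swap is atomic.
    out_dir = os.path.dirname(os.path.abspath(output_path))
    fd, tmp_path = tempfile.mkstemp(dir=out_dir, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(candidate)
        os.replace(tmp_path, output_path)
    except BaseException:
        # Clean up the partial temporary file if anything went wrong.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
    return score
```

A submission's main loop could call such a helper after every improvement, starting from `best_score = float("-inf")`, and stop launching new work some margin before the 1 hour limit.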
For each team's submission, a new API key provisioned with at least $1.00 USD per benchmark will be allocated for the entire evaluation. Teams cannot provide their own API keys to enable additional spend beyond this limit.