An official leaderboard for efficient neural network operations.
| Rank | Model | Author | Score ▾Overall efficiency score. Higher is better! | Test AccAccuracy on CIFAR-10 | Ω_proxyOperational Complexity. Lower is better. | C_DDescriptive Complexity | C_P (M)Parametric Complexity (in Millions) | Latency (ms)Inference time per batch | Tier |
|---|---|---|---|---|---|---|---|---|---|
| 1 | StandardConv | Official Baseline | 990.14 | 69.31% | 6.00 | 6 | 0.000 | 0.50 | B |
| 2 | DeformableConv | Official Baseline | 771.40 | 69.45% | 8.00 | 8 | 0.003 | 1.63 | C |
| 3 | DepthwiseConv | Official Baseline | 681.67 | 61.35% | 8.00 | 8 | 0.000 | 0.53 | C |
To ensure that all submissions to "The Architect's Arena" are fair, consistent, and reproducible, all participants must adhere to the following official benchmarking rules.
Your submission will be evaluated based on its performance under this specific, standardized environment.
CIFAR-10
CIFAR10Dataset class in the benchmark package are applied.gwo_benchmark.Evaluator
Evaluator provided within the package. Custom evaluators are not permitted for official leaderboard submissions to maintain a level playing field in the training and evaluation loop.train_config dictionary passed to the Evaluator must exactly match the following parameters:
train_config = {
"epochs": 30,
"lr": 0.001,
"momentum": 0.9,
"batch_size": 64
}
Evaluator is Stochastic Gradient Descent (SGD). No learning rate schedulers, weight decay modifications, or alternative optimizers (e.g., Adam) are allowed in the official track.GWOModule)Your model must adhere to the GWOModule interface to allow for automated complexity calculation and evaluation.
gwo_benchmark.base.GWOModule.C_D): You must define the C_D class variable as a static integer. This value should be calculated honestly, based on the Official Primitive Dictionary provided in the README. Your reasoning for the chosen C_D value should be included as comments in your code or in your submission description.C_P): You must correctly implement the get_parametric_complexity_modules() method. This method should return a list of all nn.Modules whose parameters contribute to the dynamic generation of the operation (e.g., offset prediction networks), not the primary weights of the operation itself (like the convolutional kernel).GWOModule implementation is clean, commented, and ready for review.GWOModule code.C_D) and a brief justification for it.C_D calculation.Latency measurements.By standardizing these conditions, we can confidently compare the architectural efficiency of diverse neural network operations. We look forward to seeing your innovative designs in the Arena.
If you want to add your own operation to the leaderboard, please contact us using the methods below.
Email: dafaafafaf33@gmail.com
Discord: https://discord.gg/tfuYKTGTk5
You can also add it from this link.