nebius
Qwen2.5-Coder-7B
In this workload, the estimated monthly API cost is $0.17. The route has a listed context window of 32.8K.
Open model detailsData-led article
Coding-agent cost can move quickly when every task repeats the same repo context across several turns. This comparison keeps the scope narrow: it ranks a small set of checked-in model records by estimated API cost for one workload, then links back into the database for deeper inspection.
Scenario
This page uses the site's current model data. Always verify final pricing with the provider before production use.
2k prompt plus 5k repo context.
Enough room for a multi-step reply.
Repeated turns on the same task.
A small recurring automation workload.
Cost screen
Models are sorted by estimated monthly API cost. The table does not say which model writes better code.
| Model | Context | Input / 1M | Output / 1M | Monthly cost |
|---|---|---|---|---|
| Qwen2.5-Coder-7B nebius | 32.8K | $0.0100 / 1M tokens | $0.0300 / 1M tokens | $0.17 |
| llama3.2-11b-vision-instruct lambda_ai | 131.1K | $0.0150 / 1M tokens | $0.0250 / 1M tokens | $0.21 |
| llama3.2-3b-instruct lambda_ai | 131.1K | $0.0150 / 1M tokens | $0.0250 / 1M tokens | $0.21 |
| Llama-3.2-3B-Instruct deepinfra | 131.1K | $0.0200 / 1M tokens | $0.0200 / 1M tokens | $0.26 |
| Meta-Llama-3.1-8B-Instruct-Turbo deepinfra | 131.1K | $0.0200 / 1M tokens | $0.0300 / 1M tokens | $0.28 |
| Mistral-Nemo-Instruct-2407 deepinfra | 131.1K | $0.0200 / 1M tokens | $0.0400 / 1M tokens | $0.30 |
nebius
In this workload, the estimated monthly API cost is $0.17. The route has a listed context window of 32.8K.
Open model detailslambda_ai
In this workload, the estimated monthly API cost is $0.21. The route has a listed context window of 131.1K.
Open model detailslambda_ai
In this workload, the estimated monthly API cost is $0.21. The route has a listed context window of 131.1K.
Open model detailsChange token counts, iterations, and task volume on the coding-agent page before choosing a model route.
Caveats
The site has sparse coding benchmark coverage for this low-cost slice, so this article does not rank models by coding quality. It also does not include provider discounts, cache behavior, region-specific pricing, rate limits, or tool-use charges. Treat it as a shortlist for cost inspection, then test the model on your own coding tasks.