Oops you specified provider but forgot quant

GitHub - qpwo/openrouter_triples: decide which fucking quant you are using. Same provider may serve multiple quant! · GitHub

/" data-turbo-transient="true" />

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Clear

Search syntax tips

Provide feedback

--> We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

/;ref_cta:Sign up;ref_loc:header logged out"}" Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

qpwo

openrouter_triples

Public

Notifications You must be signed in to change notification settings

Fork

Star

main

BranchesTags

Go to file

CodeOpen more actions menu

Folders and files NameNameLast commit message Last commit date Latest commit

History 3 Commits 3 Commits

models.jsonl

openrouter_openapi.json

openrouters_triples_cli.mjs

openrouters_triples_cli.py

readme.md

View all files

Repository files navigation

decide which fucking quant you are using. Same provider may serve multiple quant!

$ python openrouters_triples_cli.py -h usage: openrouters_triples_cli.py [-h] {fetch-models,prompt} ...

positional arguments: {fetch-models,prompt}

options: -h, --help show this help message and exit

$ python openrouters_triples_cli.py fetch-models [1/358] qwen/qwen3.7-max endpoints=1 total_endpoints=1 [2/358] deepseek/deepseek-v4-pro endpoints=12 total_endpoints=13 [3/358] google/gemini-3.5-flash endpoints=2 total_endpoints=15 ... [356/358] openai/gpt-4 endpoints=2 total_endpoints=854 [357/358] openai/gpt-4-0314 endpoints=1 total_endpoints=855 [358/358] gryphe/mythomax-l2-13b endpoints=3 total_endpoints=858 saved 358 models and 858 endpoint triples to models.jsonl in 0.6s

$ python openrouters_triples_cli.py -h usage: openrouters_triples_cli.py [-h] {fetch-models,prompt} ...

positional arguments: {fetch-models,prompt}

options: -h, --help show this help message and exit

$ python openrouters_triples_cli.py prompt -h usage: openrouters_triples_cli.py prompt [-h] --model MODEL --provider PROVIDER [--quant QUANT] --prompt PROMPT [--system SYSTEM] [--models MODELS] [--provider-order PROVIDER_ORDER] [--max-tokens MAX_TOKENS] [--temperature TEMPERATURE] [--raw]

options: -h, --help show this help message and exit --model MODEL --provider PROVIDER --quant QUANT --prompt PROMPT --system SYSTEM --models MODELS --provider-order PROVIDER_ORDER --max-tokens MAX_TOKENS --temperature TEMPERATURE --raw

$ ag bf16 models.jsonl | shuf -n1 {"row_type":"model_endpoint","fetched_at":"2026-05-22T21:21:50Z","model":"meta-llama/llama-3-8b-instruct","provider":"novita","provider_order":"novita","provider_name":"Novita","provider_slug":"novita","quant":"bf16","quantization":"bf16","model_obj":{"id":"meta-llama/llama-3-8b-instruct","canonical_slug":"meta-llama/llama-3-8b-instruct","hugging_face_id":"meta-llama/Meta-Llama-3-8B-Instruct","nam...

$ ./openrouters_triples_cli.py prompt --model meta-llama/llama-3-8b-instruct --provider novita --quant bf16 --prompt 'hi, this is a test. say exactly "read u loud and clear"' --max-tokens 32 Read you loud and clear.

About

decide which fucking quant you are using. Same provider may serve multiple quant!

Resources

Readme

Uh oh!

There was an error while loading. Please reload this page.

Activity

Stars

star

Watchers

watching

Forks

forks

Report repository

Contributors

Uh oh!

There was an error while loading. Please reload this page.

Languages

JavaScript 51.4%

Python 48.6%

You can’t perform that action at this time.

Oops you specified provider but forgot quant

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

SpaceX not the behemoth everyone thought

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play