模型 | 提供方 | 上下文窗口 | 调用限制 |
Qwen/Qwen2-7B-Instruct | siliconflow | 32K | RPM:100,QPS:3 |
Qwen/Qwen2-1.5B-Instruct | siliconflow | 32K | RPM:100,QPS:3 |
Qwen/Qwen1.5-7B-Chat | siliconflow | 32K | RPM:100,QPS:3 |
THUDM/gIm-4-9b-chat | siliconflow | 32K | RPM:100,QPS:3 |
THUDM/ChatGLM3-6b | siliconflow | 32K | RPM:100,QPS:3 |
01-ai/Yi-1.5-9B-Chat-16K | siliconflow | 16K | RPM:100,QPS:3 |
01-ai/Yi-1.5-6B-Chat | siliconflow | 4K | RPM:100,QPS:3 |
gemini 1.5 flash | Google | / | 15 RPM,100万个 TPM,1500 RPD |
Gemini 1.5 Pro | Google | / | 2 RPM,32,000TPM,50 RPD |
Gemini 1.0 Pro | Google | / | 15 RPM,32,000TPM,1500 RPD |
LLaMA3 8b | Groq | 8,192 tokens | 14400 RPD,18000 TPM,14370 RPD,17997 TPM |
LLaMA3 70b | Groq | 8,192 tokens | 14400 RPD,18000 TPM,14370 RPD,17997 TPM |
Mixtral 8x7b | Groq | 32,768 tokens | 14400 RPD,18000 TPM,14370 RPD,17997 TPM |
Gemma 7b | Groq | 8,192 tokens | 14400 RPD,18000 TPM,14370 RPD,17997 TPM |
Whisper | Groq | 25 MB | 14400 RPD,18000 TPM,14370 RPD,17997 TPM |
GLM3-130B | 火山引擎 | 8K | / |
GLM3-130B金融模型 | 火山引擎 | 8K | / |
Lama3-8B(开源) | 火山引擎 | 8K | / |
Llama3-70B(开源) | 火山引擎 | 32K | / |
Mistral-7B(开源) | 火山引擎 | 4K | / |
baichuan-7B(开源) | 火山引擎 | 2K | / |
Dolly-V2-12B(开源) | 火山引擎 | Free | / |
豆包·Function call模型(32K) | 扣子 | 32K | QPS:2,QPM:60.QPD:3000 |
通义千问-Max(8K) | 扣子 | 8K | QPS:2,QPM:60.QPD:3000 |
MiniMax 6.5s(245K) | 扣子 | 245K | QPS:2,QPM:60.QPD:3000 |
Moonshot(8K) | 扣子 | 8K | QPS:2,QPM:60.QPD:3000 |
Moonshot(32K) | 扣子 | 32K | QPS:2,QPM:60.QPD:3000 |
Moonshot(128K) | 扣子 | 128K | QPS:2,QPM:60.QPD:3000 |
hunyuan-lite | 腾讯 |