Claude 3 Haiku vs GPT-3.5 Turbo 1106: Benchmarks, Pricing, and Context Window Comparison
This page compares Claude 3 Haiku and GPT-3.5 Turbo 1106 side by side on provider, context window, token pricing, benchmark performance, and release timeline. Use it to identify which model better fits your production constraints, quality targets, and estimated cost per request.
Verdict
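Claude 3 Haiku has lower listed token pricing ($0.25 input and $1.25 output per million tokens, versus $1.00 and $2.00), a larger input context window, and a higher Arena Elo. GPT-3.5 Turbo 1106 can still be preferable if it performs better on your own workload; note that this page lists no benchmark scores for it.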
Author: Mirai Minds Research Team
Overview
Claude 3 Haiku was released about four months after GPT-3.5 Turbo 1106 (Mar 13, 2024 versus Nov 6, 2023).
| | Claude 3 Haiku | GPT-3.5 Turbo 1106 |
|---|---|---|
| Provider (the entity that provides this model) | Anthropic | OpenAI |
| Input Context Window (the number of tokens supported by the input context window) | 200K tokens | 16.4K tokens |
| Maximum Output Tokens (the number of tokens the model can generate in a single request) | 4,096 tokens | 16.4K tokens |
| Release Date (when the model was first released) | Mar 13, 2024 | Nov 6, 2023 |
Leaderboard
| | Claude 3 Haiku | GPT-3.5 Turbo 1106 |
|---|---|---|
| Rank | 11 | 46 |
| Arena Elo | 1181 | 1072 |
| 95% CI | +3/-3 | +5/-4 |
| Votes | 66065 | 17819 |
| License | Proprietary | Proprietary |
| Knowledge Cutoff | 8/2023 | 9/2021 |
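To give the Elo gap a concrete meaning, the sketch below converts two ratings into an expected head-to-head preference rate. It assumes the Arena scores behave like standard Elo ratings on a 400-point logistic scale and ignores ties; the function name `expected_win_rate` is illustrative and not part of any leaderboard tooling.

```python
def expected_win_rate(rating_a: float, rating_b: float) -> float:
    """Expected share of head-to-head votes won by model A.

    Assumes the standard Elo logistic curve (base 10, scale 400).
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


# Arena Elo values from the leaderboard table.
claude_3_haiku = 1181
gpt_35_turbo_1106 = 1072

p = expected_win_rate(claude_3_haiku, gpt_35_turbo_1106)
print(f"Expected preference rate for Claude 3 Haiku: {p:.2f}")
```

Under these assumptions, the 109-point gap corresponds to Claude 3 Haiku being preferred in roughly 65% of direct comparisons. This is an aggregate preference across all leaderboard prompts, not a guarantee for any specific workload.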
Pricing
| | Claude 3 Haiku | GPT-3.5 Turbo 1106 |
|---|---|---|
| Input (cost of input data provided to the model) | $0.25 per million tokens | $1.00 per million tokens |
| Output (cost of output tokens generated by the model) | $1.25 per million tokens | $2.00 per million tokens |
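The listed prices translate into an estimated cost per request once you know your typical token counts. The following is a minimal sketch using only the prices in the table; the workload of 2,000 input tokens and 500 output tokens is a hypothetical example, and the `PRICES` dictionary and `cost_per_request` helper are illustrative names.

```python
# Listed prices in USD per million tokens, taken from the pricing table.
PRICES = {
    "Claude 3 Haiku": {"input": 0.25, "output": 1.25},
    "GPT-3.5 Turbo 1106": {"input": 1.00, "output": 2.00},
}


def cost_per_request(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 2,000 input tokens and 500 output tokens per request.
for model in PRICES:
    cost = cost_per_request(model, 2_000, 500)
    print(f"{model}: ${cost:.6f} per request")
```

For this example workload, the estimate is $0.001125 per request for Claude 3 Haiku and $0.003000 for GPT-3.5 Turbo 1106. Substitute your own token counts, since the ratio between the two models shifts with the input-to-output mix.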
Benchmarks
Compare relevant benchmarks between Claude 3 Haiku and GPT-3.5 Turbo 1106.
| | Claude 3 Haiku | GPT-3.5 Turbo 1106 |
|---|---|---|
| MMLU (evaluating LLM knowledge acquisition in zero-shot and few-shot settings) | 76.7 (5-shot) | Benchmark not available. |
| MMMU (a wide-ranging multi-discipline and multimodal benchmark) | 50.2 | Benchmark not available. |
| HellaSwag (a challenging sentence completion benchmark) | 85.9 (10-shot) | Benchmark not available. |
