Discover AI models for every task
Showing 1-15 of 15 models
by Deepseek
DeepSeek V3.2 is the latest iteration of the DeepSeek V3 series with significant performance improvements. Features enhanced reasoning, coding capabilities, and better instruction following across diverse tasks.
Highly efficient DeepSeek flagship engineered for fast, capable reasoning and low-cost inference.
DeepSeek V3.1 is an optimized variant of DeepSeek V3 with enhanced chat capabilities. Offers excellent cost-efficiency with 685B MoE architecture and improved response quality for conversational tasks.
DeepSeek R1 0528 is an upgraded 685B parameter reasoning model with significantly enhanced depth of reasoning and inference capabilities. Achieves 87.5% on AIME 2025 (up from 70%), 91.4% on AIME 2024, 73.3% on LiveCodeBench, and 1930 Codeforces rating. Features system prompt support, averages 23K thinking tokens per question for deeper analysis, and reduced hallucination rate. Released under MIT license supporting commercial use and distillation. Performance approaching O3 and Gemini 2.5 Pro levels.
by Black Forest Labs
Black Forest Labs FLUX.2 [dev] is the latest generation text-to-image model with significant improvements over FLUX.1. Features enhanced prompt following, superior image quality, and faster generation. Built on the proven rectified flow transformer architecture with optimizations for better detail, composition, and text rendering. Excellent for creative workflows, concept art, and high-quality image generation with both text-to-image and image-to-image capabilities.
by OpenAI
OpenAI Whisper Large V3 is a state-of-the-art automatic speech recognition model with 1550M parameters supporting 99 languages. Achieves 10-20% WER reduction compared to V2, trained on 1M hours weakly labeled + 4M hours pseudo-labeled audio. Features 128 Mel frequency bins (increased from 80), improved robustness to accents and background noise, and new Cantonese language support. Supports speech transcription and speech-to-English translation with sentence and word-level timestamps. Optimized with torch.compile for 4.5x speedup. Ideal for accessibility tools, multilingual transcription, and enterprise ASR applications.
OpenAI Whisper Large V3 Turbo is an optimized variant of Whisper V3 with significantly faster inference while maintaining high accuracy across 99 languages. Features architectural optimizations for reduced latency including faster encoder-decoder inference and efficient attention mechanisms. Delivers near-V3 accuracy with 2-3x speed improvement, ideal for real-time transcription applications, live subtitling, and high-throughput ASR workloads. Supports full multilingual capabilities, timestamps, and speech translation to English. Perfect for production deployments requiring both quality and speed.
by Mistral
Devstral 2 123B is Mistral AI's flagship agentic coding model, featuring 123B parameters optimized for software engineering tasks. Achieves 72.2% on SWE-bench Verified and 61.3% on SWE-bench Multilingual. Excels at codebase exploration, multi-file editing, and agentic workflows with tool use. Supports 200K context window with enhanced function calling and structured output. Designed for IDE integration via Mistral Vibe CLI. Released under modified MIT license for unrestricted commercial use.
Mistral Small 3.2 24B Instruct is a multimodal instruction-tuned model supporting both vision and text with 24B parameters and 128K context. Major improvements over 3.1 include better instruction following (84.78%), 2x reduction in repetition errors, and robust function calling. Achieves 65.33% on Wildbench v2, 43.1% on Arena Hard v2, 92.90% on HumanEval Pass@5. Vision benchmarks: 87.4% ChartQA, 94.86% DocVQA, 62.50% MMMU. Supports up to 10 images per prompt with integrated vision-based function calling.
Black Forest Labs FLUX.1 [dev] with LoRA adapter support. This variant enables fine-tuned generation with custom trained LoRA weights for specialized styles, characters, or concepts. Based on the full 12B parameter FLUX.1 [dev] model with all its capabilities including high-resolution generation, accurate text rendering, and detailed composition. Perfect for custom workflows and specialized image generation tasks.
Black Forest Labs FLUX.1 [dev] is a cutting-edge 12 billion parameter rectified flow transformer for text-to-image generation. Second only to FLUX.1 [pro] with strong prompt following matching closed-source alternatives. Features guidance distillation for efficient inference, high-resolution generation (1024x1024), accurate text rendering, and detailed composition. Supports both text-to-image and image-to-image generation. Open weights enable scientific research and innovative workflows.
Black Forest Labs FLUX.2 [klein] 9B is a balanced image generation model offering excellent quality-to-speed ratio. With 9 billion parameters, it provides better detail and composition than the 4B variant while remaining faster than full-size models. Ideal for production workloads requiring a balance between quality, speed, and cost. Supports both text-to-image and image-to-image generation.
Black Forest Labs FLUX.2 [klein] 4B is a lightweight, fast image generation model optimized for speed and efficiency. With 4 billion parameters, it delivers quick image generation while maintaining good quality. Perfect for rapid prototyping, bulk generation, and applications requiring low latency. Supports both text-to-image and image-to-image generation with excellent cost-efficiency.
by Google
Larger Gemma model delivering high-quality chat and coding with efficient inference.
by Qwen
Qwen3 VL 235B A22B Instruct is Alibaba's vision-language MoE model with 235B total / 22B active parameters. Combines state-of-the-art text and vision understanding with excellent performance on multimodal reasoning tasks.