Discover AI models for every task
Showing 1-11 of 11 models
by Black Forest Labs
Black Forest Labs FLUX.2 [klein] 4B is a lightweight, fast image generation model optimized for speed and efficiency. With 4 billion parameters, it delivers quick image generation while maintaining good quality. Perfect for rapid prototyping, bulk generation, and applications requiring low latency. Supports both text-to-image and image-to-image generation with excellent cost-efficiency.
Black Forest Labs FLUX.2 [klein] 9B is a balanced image generation model offering excellent quality-to-speed ratio. With 9 billion parameters, it provides better detail and composition than the 4B variant while remaining faster than full-size models. Ideal for production workloads requiring a balance between quality, speed, and cost. Supports both text-to-image and image-to-image generation.
Black Forest Labs FLUX.2 [dev] is the latest generation text-to-image model with significant improvements over FLUX.1. Features enhanced prompt following, superior image quality, and faster generation. Built on the proven rectified flow transformer architecture with optimizations for better detail, composition, and text rendering. Excellent for creative workflows, concept art, and high-quality image generation with both text-to-image and image-to-image capabilities.
Black Forest Labs FLUX.1 [schnell] is the fastest variant of the FLUX.1 family, optimized for rapid text-to-image generation with fewer inference steps. Built on the same 12B parameter rectified flow transformer architecture as FLUX.1 [dev] but distilled for maximum speed. Generates high-quality 1024x1024 images in 1-4 steps compared to 20-50 steps for standard models. Ideal for real-time applications, interactive tools, and high-throughput image generation scenarios. Apache 2.0 licensed for unrestricted use including commercial applications.
Black Forest Labs FLUX.1 [dev] with LoRA adapter support. This variant enables fine-tuned generation with custom trained LoRA weights for specialized styles, characters, or concepts. Based on the full 12B parameter FLUX.1 [dev] model with all its capabilities including high-resolution generation, accurate text rendering, and detailed composition. Perfect for custom workflows and specialized image generation tasks.
Black Forest Labs FLUX.1 [dev] is a cutting-edge 12 billion parameter rectified flow transformer for text-to-image generation. Second only to FLUX.1 [pro] with strong prompt following matching closed-source alternatives. Features guidance distillation for efficient inference, high-resolution generation (1024x1024), accurate text rendering, and detailed composition. Supports both text-to-image and image-to-image generation. Open weights enable scientific research and innovative workflows.
by Mistral
Mistral Small 3.2 24B Instruct is a multimodal instruction-tuned model supporting both vision and text with 24B parameters and 128K context. Major improvements over 3.1 include better instruction following (84.78%), 2x reduction in repetition errors, and robust function calling. Achieves 65.33% on Wildbench v2, 43.1% on Arena Hard v2, 92.90% on HumanEval Pass@5. Vision benchmarks: 87.4% ChartQA, 94.86% DocVQA, 62.50% MMMU. Supports up to 10 images per prompt with integrated vision-based function calling.
Mistral Voxtral Small 24B is a multimodal model supporting both text and audio inputs with 24B parameters. Enables natural voice conversations and audio understanding alongside text processing. Features audio transcription, audio-based reasoning, and voice-to-text capabilities. Built on Mistral architecture with specific training for audio modalities. Ideal for voice assistants, audio analysis applications, and multimodal AI systems requiring combined text and speech processing.
Devstral 2 123B is Mistral AI's flagship agentic coding model, featuring 123B parameters optimized for software engineering tasks. Achieves 72.2% on SWE-bench Verified and 61.3% on SWE-bench Multilingual. Excels at codebase exploration, multi-file editing, and agentic workflows with tool use. Supports 200K context window with enhanced function calling and structured output. Designed for IDE integration via Mistral Vibe CLI. Released under modified MIT license for unrestricted commercial use.
by Google
Larger Gemma model delivering high-quality chat and coding with efficient inference.
by Deepseek
DeepSeek V3.2 is the latest iteration of the DeepSeek V3 series with significant performance improvements. Features enhanced reasoning, coding capabilities, and better instruction following across diverse tasks.