Każdy wiodący model AI — jedna subskrypcja. Przeglądaj, porównuj i uruchamiaj ponad 100 modeli z czołowych laboratoriów.
Koniec z żonglowaniem subskrypcjami. Kunya daje Ci dostęp do każdego wiodącego modelu AI — czat, obraz, wideo, audio i kod — z jednym logowaniem.
GPT, Claude, Gemini, Grok, Llama, Mistral, DeepSeek i więcej. Przełączaj modele w trakcie rozmowy.
GPT Image, FLUX, Stable Diffusion, Seedream, Grok Imagine — każdy główny model obrazów w jednym studio.
Sora, Veo, Kling, Luma, Runway. Tekst na wideo, obraz na wideo, podmiana twarzy, synchronizacja ust.
Generowanie pełnych piosenek, klonowanie głosu, TTS w 50+ językach, transkrypcja, podcasty.
Anthropic
Most capable Opus — enhanced coding, agentic workflows, and long-horizon reasoning with 1M context
Anthropic
Previous Opus — enhanced SWE, vision, and long-horizon agentic reasoning with 1M context
Anthropic
Hybrid reasoning model with 1M context, top-tier coding and agentic performance
Anthropic
Previous premium model with maximum intelligence
Anthropic
Best combination of speed and intelligence, near-flagship performance
Anthropic
Previous smart model for complex agents and coding
Anthropic
Fastest model with near-frontier intelligence
OpenAI
Newest frontier model — highest reasoning for coding and professional work
OpenAI
Highly capable GPT model for coding and agentic tasks
OpenAI
Most powerful GPT model with maximum compute for complex reasoning
OpenAI
The best model for coding and agentic tasks across industries
OpenAI
Intelligent reasoning model with configurable reasoning effort
OpenAI
Previous intelligent reasoning model for coding and agentic tasks
Most advanced reasoning model with complex problem-solving
Cheapest frontier-class model — half the cost of Gemini 3 Flash with strong tool calling
Low-latency Live API model for real-time dialogue and voice-first AI applications
Frontier intelligence optimized for agentic workflows, coding, and video at higher speed
Frontier intelligence with superior search and grounding
State-of-the-art thinking model for complex problems
Best price-performance for large scale processing
xAI
Fastest, most intelligent Grok — 1M context, 3 reasoning levels, top agentic tool calling
xAI
Latest Grok beta optimized for multi-agent orchestration
xAI
Latest Grok beta with extended reasoning
DeepSeek
1M context, thinking + non-thinking modes, tool calls
DeepSeek
Flagship model — 1M context, thinking + non-thinking modes
Qwen
Alibaba's flagship general-purpose LLM via DashScope - top-tier reasoning and coding
Qwen
Automated deep research - plans research steps, performs web searches, generates structured reports
MiniMax
Recursive self-improvement — SOTA in software engineering, tool calling, and office productivity
MiniMax
M2.7 at ~100 tps — same performance, faster and more agile
MiniMax
Peak performance and ultimate value — master the complex
MiniMax
M2.5 at ~100 tps — same performance, faster and more agile
MiniMax
Polyglot programming mastery with precision code refactoring
MiniMax
Agentic capabilities with function calling and advanced reasoning
ByteDance
ByteDance flagship — 76.5% SWE-Bench, 98.3% AIME 2025, hour-long video understanding
Z-AI
Latest Z-AI flagship — enhanced long-horizon coding and autonomous agent tasks
Z-AI
Fast inference model optimized for agentic workflows and tool use
Xiaomi
Xiaomi's 1T-parameter flagship — agentic workflows, tool calling, and advanced reasoning with 1M context
Moonshot
Long-horizon coding, UI/UX generation, and multi-agent orchestration with parallel sub-agents
Moonshot
State-of-the-art visual coding and agentic tool-calling with multimodal reasoning
StepFun
196B MoE reasoning model — activates 11B per token, extremely fast
Qwen
Hybrid attention + MoE vision-language model with 1M context
OpenRouter
1T parameter frontier model built for agentic multi-step reasoning
OpenRouter
Omni-modal frontier model with vision, hearing, reasoning, and action
Nous Research
Flagship uncensored reasoning model from Nous Research — hybrid think/respond mode, low refusal rates, strong at math, code, and structured output
Nous Research
Efficient uncensored reasoning model from Nous Research — hybrid think/respond mode, low refusal rates, strong at math, code, and structured output
ByteDance
Versatile multimodal model with low latency for agent and vision tasks
High-efficiency image generation optimized for speed and volume, up to 4K with thinking
OpenAI
Latest state-of-the-art image generation with fast, high-quality output and flexible sizes
Sourceful
Most powerful Riverflow V2 preview - unified text-to-image and image-to-image
Stability AI
Latest SD with improved quality, typography, and prompt understanding
ByteDance
ByteDance Seedream 5.0 Lite image editing — intelligent multi-image editing with reasoning, style transfer, and beautification (2K-3K)
Qwen
Alibaba's flagship image generation - high realism, fine detail, excellent text rendering
Qwen
Alibaba's image editing model - modify text, add/remove objects, style transfer, detail enhancement
Wan
Alibaba Wan 2.6 text-to-image generation - photorealistic to illustrative styles
ByteDance
ByteDance Seedream 5.0 Lite — high-quality 2K/3K image generation with text-to-image and image editing
Midjourney
Midjourney V7 — industry-leading image generation with stunning aesthetics. 4 images per generation. Supports --ar, --s, --c and all V7 parameters.
ByteDance (Dreamina)
ByteDance Seedream 5.0 via Dreamina/ModelArk — high-quality 2K image generation. Admin-only for comparison with Evolink provider.
ByteDance (Dreamina)
ByteDance Seedream 5.0 Lite via Dreamina/ModelArk — fast 2K image generation. Admin-only for comparison with Evolink provider.
OpenAI
OpenAI Sora 2 Pro — highest quality image animation (up to 12s, 1080p)
Google Veo 3.1 Extend — continue an existing video up to ~30s total (720p/1080p)
Google Veo 3.1 — generate video from up to 3 reference images (up to 8s, 1080p)
Google Veo 3.1 — animate between a first and last keyframe (up to 8s, 1080p)
MiniMax
Latest MiniMax model — cinematic motion, expressive faces, anime & illustration styles, 15 camera commands
Easel
Premium face swap with hair preservation, 2x upscale, and detail enhancement
ByteDance
ByteDance motion transfer — full body, expressions, lip movement from driving video to any character (humans, animals, cartoons)
ByteDance
ByteDance OmniHuman — audio-driven avatar animation with emotion and cognitive simulation
ByteDance
ByteDance OmniHuman 1.5 — film-grade talking avatar from photo + audio with micro-expressions and cognitive simulation
Sync
Most powerful lipsync — native visual intelligence for professional-quality video-to-video
Sync
High-quality realistic lipsync preserving natural teeth and unique facial features
Kling
Kling audio-to-video lip sync — realistic lip movements from audio (2-60s audio, 720p/1080p)
Alibaba
Wan 2.2 A14B — high-quality anime/artistic video with improved motion and expressions (480p-720p)
Alibaba
Wan 2.2 motion transfer — replicate expressions and movements from a reference video onto a character image
Alibaba
Wan 2.2 character replacement — replace the character in a video while preserving scene lighting and motion
Kling
Kling v2.5 lip sync — superseded by Kling LipSync audio-to-video endpoint
Kling
Kling O3 Pro — generate the next shot from a reference video, preserving motion & camera style (3-15s, 1080p)
Kling
Kling O3 Standard — generate the next shot from a reference video (3-15s, 720p)
Kling
Kling O3 Pro — edit existing videos with element injection and style transfer (3-15s, 1080p)
Kling
Kling V3 Pro — cinematic text-to-video with multi-shot and native audio (3-15s, 1080p)
Kling
Kling V3 Pro — animate images with multi-shot storyboarding (3-15s, 1080p)
Kling
Kling O3 Standard — text-to-video with multi-shot and audio (3-15s, 720p)
Kling
Kling O3 Standard — animate images with start/end frame control (3-15s, 720p)
Kling
Kling O3 Standard — reference-to-video with @Element character locking + @Image style refs (3-15s, 720p)
Kling
Kling O3 Pro — reference-driven text-to-video with character consistency (3-15s, 1080p)
Kling
Kling O3 Pro — best-in-class image-to-video with element referencing (3-15s, 1080p)
Kling
Kling O3 Pro — reference-to-video with @Element character locking (frontal+multi-angle refs) + @Image style refs (3-15s, 1080p)
Kling 4K
Kling O3 4K — reference-to-video with @Element character locking at native 4K. Up to 7 refs (3-15s)
Kling 4K
Kling V3 Native 4K — professional-grade 4K video from text (3-15s)
Kling 4K
Kling V3 Native 4K — professional-grade 4K video from images (3-15s)
Kling 4K
Kling O3 Native 4K — professional-grade 4K video with reference support (3-15s)
Kling 4K
Kling O3 Native 4K — professional-grade 4K video from images with references (3-15s)
Kling Direct
Kling V3 Standard via direct API — 720p text-to-video (5/10/15s)
Kling Direct
Kling V3 Standard via direct API — 720p image-to-video (5/10s)
Kling Direct
Kling V3 Pro via direct API — 1080p image-to-video (5/10s)
Kling Direct
Kling O3 Standard via direct API — 720p text-to-video (3-15s)
Kling Direct
Kling O3 Standard via direct API — 720p image-to-video (3-15s)
Kling Direct
Kling O3 Pro via direct API — 1080p image-to-video (3-15s)
Kling Direct
Kling V3 native 4K image-to-video via direct API (3-10s)
Kling Direct
Kling O3 native 4K image-to-video via direct API (3-15s)
Seedance
ByteDance Seedance 2.0 via FAL — cinematic T2V with native audio, up to 15s at 1080p
Seedance
ByteDance Seedance 2.0 via FAL — animate images with native audio, start/end frame control, up to 15s
Seedance
ByteDance Seedance 2.0 via FAL — multimodal ref system: up to 9 images + 3 videos + 3 audio, native audio
Seedance
ByteDance Seedance 2.0 Fast via FAL — lower latency and cost, up to 15s
Seedance
ByteDance Seedance 2.0 Fast via FAL — fast image-to-video with native audio
Seedance
ByteDance Seedance 2.0 Fast via FAL — fast multimodal reference, up to 9 images + 3 videos + 3 audio
Happy Horse
Alibaba Happy Horse 1.0 — #1 ranked AI video model, native audio + lip-sync, up to 15s at 1080p
Happy Horse
Alibaba Happy Horse 1.0 — #1 ranked I2V with native audio, multilingual lip-sync, up to 15s at 1080p
Happy Horse
Alibaba Happy Horse 1.0 — reference-driven video with character consistency (1-9 images), native audio, 1080p
Happy Horse
Alibaba Happy Horse 1.0 — natural language video editing with up to 5 reference images, 1080p
Wan
Alibaba Wan 2.6 - cinematic multi-shot text-to-video with audio, up to 15s at 1080p
Wan
Alibaba Wan 2.2 - generate video from first and last frame images, 5s at 1080p
Wan
Alibaba Wan 2.6 - replicate character appearance from reference videos, multi-character support, up to 10s
Wan
Alibaba Wan 2.1 - multi-image reference, video redraw, local editing, extension, frame expansion
Wan
Alibaba Wan 2.2 - animate a person image using motion from a reference video, up to 30s
Wan
Alibaba Wan 2.2 - replace people in videos with people from images, keeping original background, up to 30s
Seedance
ByteDance Seedance 2.0 — text-driven video with synchronized audio, lip-sync, web search, up to 15s
Seedance
ByteDance Seedance 2.0 — first/last frame image-driven video with synchronized audio, up to 15s
Seedance
ByteDance Seedance 2.0 — multimodal @-reference system: up to 9 images + 3 videos + 3 audio tracks
Seedance
ByteDance Seedance 2.0 Fast — faster text-driven video at lower cost, synchronized audio, up to 15s
Seedance
ByteDance Seedance 2.0 Fast — faster image-driven video at lower cost, synchronized audio, up to 15s
Seedance
ByteDance Seedance 2.0 Fast — faster multimodal @-reference at lower cost, up to 9 images + 3 videos + 3 audio
Seedance
ByteDance Seedance 1.5 — synchronized audio+video generation with lip-sync and foley (up to 12s)
Kling
Kling V3 — standard text-to-video with multi-shot and sound effects (5s or 10s)
Kling
Kling V3 — image-to-video with first/last frame, multi-shot, and sound effects (5s or 10s)
Kling
Kling V3 — motion transfer from reference video to character in reference image (up to 10s per render)
Kling
Kling O3 (V3 Omni) — highest quality text-to-video with multi-shot and sound (3-15s)
Kling
Kling O3 (V3 Omni) — best-in-class image-to-video with reference images, elements, and multi-shot (3-15s)
Wan
Alibaba Wan 2.7 — multi-shot narrative, auto BGM/SFX or driving-audio lip-sync, 2-15s
HappyHorse
Alibaba Happy Horse 1.0 — #1 ranked text-to-video, native audio + lip-sync, 3-15s
HappyHorse
Alibaba Happy Horse 1.0 — image-to-video with native audio, 3-15s
HappyHorse
Alibaba Happy Horse 1.0 — reference-driven video with 1-9 images, native audio, 3-15s
HappyHorse
Alibaba Happy Horse 1.0 — natural language video editing with up to 5 reference images
Kling
Kling O1 — style-focused image-to-video with first/last frame support (5s or 10s)
Powerful, low-latency speech generation with expressive audio tags for precise narration control — 70+ languages
Google Neural2 voices — highly natural-sounding TTS using novel synthesis methods
ElevenLabs
ElevenLabs Eleven v3 — ultra-realistic voice synthesis with 30+ languages and voice cloning
ElevenLabs
ElevenLabs Flash v2.5 — lowest latency TTS for real-time applications, 32 languages
Qwen
Alibaba's multilingual TTS with 49 voices, 10+ languages - ElevenLabs alternative
Qwen
Instruction-controllable TTS - control speech style via text instructions, 10+ languages
Qwen
Generate custom voices from text descriptions - design unique voices without audio samples
Qwen
Clone voices from 10-20 second audio samples - highly natural voice replication
CosyVoice
Next-gen generative TTS model - high-quality real-time streaming synthesis
ElevenLabs
Studio-grade music with vocals or instrumentals, up to 10 min, multilingual lyrics
Suno (Kunya)
Latest Suno model — superior musical expression, fast generation, vocals + instrumentals
Qwen
Alibaba's flagship code model via DashScope - code generation, completion, and debugging
Qwen
Fast, cost-effective code model via DashScope for rapid code tasks
Zacznij z Kunya i uruchom dowolny model natychmiast. Bez zarządzania kluczami API, bez osobnych subskrypcji.