AI Library 2025 | 21medien

🔧 GPUs & Hardware

NVIDIA A100

NVIDIA's workhorse AI GPU with 40GB/80GB memory—the most widely deployed GPU for machine learning training and inference.

Provider: NVIDIA

gpus hardware nvidia a100 ai-training inference

AMD Instinct MI300X

AMD's proven CDNA 3 data center GPU with 192GB HBM3 memory, predecessor to MI325X and MI350 series

gpu datacenter inference training amd cdna3

AMD Instinct MI325X

Mid-generation AMD GPU with 256GB HBM3E memory, 6TB/s bandwidth, bridging MI300X and MI350 series

gpu datacenter inference training amd cdna3 hbm3e

AMD Instinct MI350X

AMD's latest flagship data center GPU with 288GB HBM3E memory, 8TB/s bandwidth, and 4x performance over MI300X

gpu datacenter inference training amd cdna4 2025

AMD Instinct MI355X

AMD's extreme performance liquid-cooled GPU with 288GB HBM3E, 1,400W TDP, matching NVIDIA B200

gpu datacenter inference training amd cdna4 liquid-cooling 2025

Apple M4 Max

Apple's high-performance SoC with 40 GPU cores, 38 TOPS Neural Engine, and 128GB unified memory for AI

soc apple-silicon neural-engine ai unified-memory 2025

Apple Silicon

Apple's custom ARM-based chips for Mac computers with ML acceleration.

Provider: Apple

hardware arm ml-acceleration apple neural-engine

NVIDIA B200

NVIDIA's next-generation Blackwell GPU with 192GB HBM3e—2.5× faster than H100 for AI training, launching 2024-2025.

Provider: NVIDIA

gpus hardware nvidia b200 blackwell next-gen

NVIDIA H100

NVIDIA's Hopper-generation AI GPU with 80GB HBM3 memory and roughly 3TB/s bandwidth, widely used for training large language models and diffusion models.

Provider: NVIDIA

gpus hardware nvidia h100 ai-training llm

NVIDIA H200

NVIDIA's enhanced H100 with 141GB HBM3e memory, the highest-capacity Hopper GPU, suited to massive models and long-context inference.

Provider: NVIDIA

gpus hardware nvidia h200 large-memory llm

🔧 Development Tools

Accelerate

Hugging Face library for effortless distributed PyTorch training with FSDP, DeepSpeed, and FP8 support

pytorch distributed-training huggingface fsdp deepspeed mixed-precision

Apple MLX

Apple's array framework for machine learning on Apple Silicon with unified memory and NumPy-like API

framework ml apple-silicon open-source python swift

ComfyUI

Advanced node-based Stable Diffusion workflow interface with 45,000+ GitHub stars and 1,600+ custom nodes—the professional standard for complex image generation pipelines.

Provider: comfyanonymous (Open Source)

development-tools comfyui stable-diffusion node-based-workflow image-generation workflow-automation open-source

Diffusers

Hugging Face's state-of-the-art library for diffusion models supporting FLUX, Stable Diffusion, and more

diffusion image-generation huggingface stable-diffusion flux pytorch

LangChain

Open-source framework for building LLM-powered applications with modular components for chains, agents, memory, and tool integration—the industry standard for production AI workflows.

Provider: Harrison Chase (LangChain Inc.)

development-tools langchain llm-framework ai-orchestration python javascript

LlamaIndex

Data framework for connecting custom data sources to LLMs—specialized for RAG applications with advanced indexing, retrieval strategies, and production-ready data ingestion pipelines.

Provider: Jerry Liu (LlamaIndex Inc.)

development-tools llamaindex rag-framework data-framework document-indexing python

PyTorch

The dominant deep learning framework, powering 70%+ of AI research papers with intuitive Python-first design, dynamic computation graphs, and production-ready deployment. It is the foundation for training everything from GPT models to computer vision systems.

Provider: Meta AI (Open Source)

development-tools pytorch deep-learning machine-learning neural-networks open-source

smolagents

Hugging Face's minimalist agentic framework where agents write Python code, reducing steps by 30%

agents llm huggingface code-agents multi-agent python

TensorFlow

Google's production-grade ML framework powering Search, YouTube, and Gmail with 180,000+ GitHub stars—offering comprehensive tools from research to deployment including TensorFlow Lite for mobile and TensorFlow.js for browsers.

Provider: Google (Open Source)

development-tools tensorflow deep-learning machine-learning google open-source

vLLM

High-throughput LLM inference engine reporting up to 24x higher serving throughput than Hugging Face Transformers, using PagedAttention to manage KV-cache memory efficiently. Widely used in production for deploying large language models at scale.

Provider: UC Berkeley (Open Source)

development-tools vllm llm-inference model-serving performance-optimization open-source

🌐 Cloud AI Providers

Leading cloud providers offer managed AI platforms and APIs for scalable, enterprise-grade deployment

Anthropic API

Direct API access to Claude models including Sonnet 4.5, Opus, and Haiku for building safe and reliable AI applications.

Provider: Anthropic

api claude anthropic llm-api claude-sonnet claude-opus constitutional-ai

Amazon Bedrock

Fully managed service providing access to leading foundation models from multiple AI providers.

Provider: Amazon Web Services

Cloud AI AWS Foundation Models Managed Service Multi-Provider

Azure OpenAI Service

Enterprise-grade OpenAI models with Microsoft's security, compliance, and global infrastructure.

Provider: Microsoft

Cloud AI Azure OpenAI Enterprise AI Microsoft Compliance

Claude Sonnet 4.5

Anthropic's best coding model, scoring 61.4% on the OSWorld benchmark and able to sustain focus for 30+ hours on complex multi-step tasks.

Provider: Anthropic

LLM Claude Anthropic AI Safety Long Context Coding Extended Thinking

Gemini 2.5 Pro

Google's advanced multimodal AI with hybrid reasoning capabilities, generally available since June 2025.

Provider: Google

LLM Gemini Google Multimodal Real-time AI Hybrid Reasoning

Gemini 2.5

Google's advanced multimodal AI model with exceptional speed and native multimodal understanding.

Provider: Google

LLM Gemini Google Multimodal Real-time AI

GPT-5

OpenAI's most advanced large language model released in August 2025 with exceptional reasoning and coding capabilities.

Provider: OpenAI

LLM GPT OpenAI Multimodal Chat Reasoning Coding

HyperStack

Scalable GPU cloud infrastructure with dedicated resources optimized for AI/ML workloads and cost-effective pricing.

Provider: Nexgen Cloud

gpu-cloud ai-infrastructure cloud-computing nvidia-gpus ml-platform

OpenAI API

Direct API access to OpenAI's foundation models including GPT-5, GPT-4, DALL-E 3, and Whisper for building AI applications.

Provider: OpenAI

api gpt-5 gpt-4 dall-e whisper llm-api openai

Google Vertex AI

Unified AI platform for building, deploying, and scaling machine learning models with Google Cloud's managed infrastructure.

Provider: Google Cloud

ml-platform google-cloud automl model-serving mlops gemini vertex-ai

🔧 Audio AI

AudioCraft

Meta's comprehensive open-source audio generation toolkit featuring MusicGen for music synthesis, AudioGen for sound effects, and EnCodec neural audio codec for high-quality compression and generation.

Provider: Meta AI

audio-generation music-synthesis sound-effects neural-codec open-source text-to-audio

Bark

Open-source text-to-audio model from Suno AI that generates realistic, multilingual speech with emotional prosody, laughter, sound effects, and music, offering 100+ speaker presets across its supported languages.

Provider: Suno AI

text-to-speech audio-generation voice-cloning emotional-speech multilingual-tts open-source

Stable Audio

Text-to-audio model for music and sound effects with 44.1kHz output.

Provider: Stability AI

audio-generation music-ai sound-effects

🔧 Image Generation

AUTOMATIC1111 Stable Diffusion Web UI

Popular open-source web interface for Stable Diffusion with extensive features, extensions, and local deployment capabilities.

Provider: AUTOMATIC1111 (Open Source Community)

stable-diffusion image-generation ai-art webui local-deployment open-source

Fooocus

Simplified Stable Diffusion interface focusing on prompt and image generation with minimal configuration and automatic optimization.

Provider: lllyasviel (Open Source)

fooocus stable-diffusion image-generation simplified-ui ai-art local-deployment

Ideogram 2.0

AI image generator specialized in accurate text rendering, typography, and design-focused visual creation with photorealistic quality.

Provider: Ideogram AI

ideogram ai-art image-generation text-rendering typography design

Midjourney v7

Leading cloud-based AI image generation service known for artistic quality, photorealism, and intuitive Discord-based interface.

Provider: Midjourney, Inc.

midjourney ai-art image-generation cloud-service discord photorealism

Stable Diffusion 3.5

Latest open-source diffusion model with improved quality, prompt adherence, and multimodal capabilities for text-to-image generation.

Provider: Stability AI

stable-diffusion open-source image-generation diffusion-model text-to-image ai-art

🔧 Event Bus

AWS EventBridge

Serverless event bus service for application integration.

Provider: Amazon Web Services

event-bus serverless aws integration

🔧 Secrets Management

AWS Secrets Manager

Managed secrets storage and rotation service.

Provider: Amazon Web Services

secrets-management aws security encryption

HashiCorp Vault

Secrets management and data protection platform.

Provider: HashiCorp

secrets-management security devops encryption

🔧 Cloud Infrastructure

Amazon Web Services (AWS)

Global cloud infrastructure with comprehensive AI/ML services—SageMaker AI, Bedrock, EC2 P5 instances, and 200+ services.

Provider: Amazon

cloud-infrastructure aws amazon ml-platform gpu-cloud

Microsoft Azure

Microsoft's cloud platform with comprehensive AI services—Azure OpenAI, Azure Machine Learning, 11,000+ models.

Provider: Microsoft

cloud-infrastructure azure microsoft azure-openai enterprise-ai

Google Cloud Platform (GCP)

Google's cloud platform with advanced AI services—Vertex AI, TPUs, Gemini models, and 200+ foundation models.

Provider: Google

cloud-infrastructure gcp google-cloud vertex-ai tpu

Lambda Labs

GPU cloud optimized for AI/ML—rent H100, A100, or A6000 GPUs by the hour for training, fine-tuning, and inference at competitive prices.

Provider: Lambda Labs

cloud-infrastructure gpu-cloud lambda-labs h100 a100

RunPod

Community GPU cloud with spot and on-demand instances—rent GPUs from $0.20/hr with global availability and serverless inference.

Provider: RunPod

cloud-infrastructure gpu-cloud runpod spot-instances serverless

🔧 Cloud AI

Azure AI Services

Microsoft's comprehensive cloud AI services and APIs.

Provider: Microsoft

cloud-ai azure microsoft

📄 Document AI

Extract, understand, and automate document workflows with AI

Azure Document Intelligence

Microsoft's AI service for extracting text, structure, and insights from documents.

Provider: Microsoft

Document AI OCR Azure Data Extraction Microsoft

Google Document AI

Google's comprehensive document understanding and processing platform powered by AI.

Provider: Google

Document AI OCR Google Cloud Data Extraction NLP

🔧 AI Company

Black Forest Labs

AI research company behind FLUX image generation models.

Provider: Black Forest Labs

ai-company image-generation flux text-to-image

Stability AI

Leading AI company behind Stable Diffusion and open-source generative models.

Provider: Stability AI

ai-company image-generation audio-generation stable-diffusion open-source

🔧 AI Concepts

Chain-of-Thought Prompting

Prompting technique that dramatically improves AI reasoning by asking models to show their work step-by-step—unlocking complex problem-solving abilities.

Provider: Industry Standard

ai-concepts prompting chain-of-thought reasoning zero-shot few-shot

Constitutional AI

Anthropic's technique for training helpful, harmless AI using principles rather than human feedback—scaling AI alignment through AI self-critique.

Provider: Anthropic

ai-concepts constitutional-ai ai-alignment ai-safety anthropic claude

Diffusion Models

Revolutionary generative AI technique powering Stable Diffusion, DALL-E, and Midjourney—achieving photorealistic image generation by learning to reverse a gradual noising process, fundamentally transforming creative industries and visual AI.

Provider: Research Community (Multiple Origins)

ai-concepts diffusion-models generative-ai image-generation stable-diffusion machine-learning

Distributed Training

Training neural networks across multiple GPUs or machines to handle massive models and datasets—essential for modern AI at scale.

Provider: Industry Standard

ai-concepts distributed-training model-parallelism data-parallelism gpu-cluster

Few-Shot Learning

Training AI models to perform tasks with only a handful of examples—enabling rapid adaptation to new tasks without extensive retraining.

Provider: Industry Standard

ai-concepts few-shot-learning in-context-learning meta-learning transfer-learning

Fine-tuning

Process of adapting pre-trained AI models to specific tasks or domains by continuing training on custom datasets, creating specialized models without training from scratch.

Provider: AI Research Community

ai-concepts fine-tuning model-training transfer-learning customization

Knowledge Distillation

Model compression technique that transfers knowledge from large 'teacher' models to smaller 'student' models—enabling 10-100x faster inference while retaining 95-99% accuracy, revolutionizing edge AI deployment and cost optimization.

Provider: Research Community (Hinton et al.)

ai-concepts knowledge-distillation model-compression edge-ai optimization deployment
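
A minimal sketch of the teacher-to-student transfer described above, assuming the common soft-target formulation from Hinton et al.: both models' logits are softened with a temperature and the student is penalized by the KL divergence from the teacher's distribution. The function names, the example logits, and the temperature of 2.0 are illustrative assumptions, not part of any specific library.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)                      # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits for a 3-class problem.
teacher = [4.0, 1.0, 0.5]
good_student = [3.8, 1.1, 0.4]   # close to the teacher
poor_student = [0.5, 1.0, 4.0]   # ranks the classes in the opposite order

loss_good = distillation_loss(teacher, good_student)
loss_poor = distillation_loss(teacher, poor_student)
```

In practice this term is usually combined with an ordinary cross-entropy loss on the ground-truth labels; the weighting between the two is a tuning choice.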

LoRA (Low-Rank Adaptation)

Parameter-efficient fine-tuning technique that adapts large language models using low-rank matrix decomposition, reducing trainable parameters by 99% while maintaining quality.

Provider: Microsoft Research

ai-concepts fine-tuning parameter-efficient model-adaptation peft optimization
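
To make the parameter reduction concrete, here is a small sketch assuming a single 4096 x 4096 weight matrix and a rank of 8 (both values are illustrative, not taken from any particular model). Instead of updating the full matrix W, LoRA trains two thin matrices B (d_out x r) and A (r x d_in) and applies W + B·A.

```python
def lora_param_counts(d_out, d_in, rank):
    """Return (full, lora) trainable-parameter counts for one d_out x d_in matrix."""
    full = d_out * d_in              # fine-tuning W directly
    lora = rank * (d_out + d_in)     # B is d_out x rank, A is rank x d_in
    return full, lora

def lora_forward(W, A, B, x, scale=1.0):
    """Compute (W + scale * B @ A) @ x without materializing the merged matrix."""
    base = [sum(w * xi for w, xi in zip(row, x)) for row in W]   # frozen path
    ax = [sum(a * xi for a, xi in zip(row, x)) for row in A]     # project down to rank
    bax = [sum(b * v for b, v in zip(row, ax)) for row in B]     # project back up
    return [y + scale * d for y, d in zip(base, bax)]

# Assumed sizes for illustration only.
full, lora = lora_param_counts(4096, 4096, 8)
reduction = 1 - lora / full

# Tiny worked example: 2 x 2 identity base matrix with a rank-1 update.
y = lora_forward(W=[[1, 0], [0, 1]], A=[[1, 1]], B=[[1], [2]], x=[1, 2])
```

With these assumed sizes the adapter holds well under 1% of the parameters of the full matrix, which is where reductions of the order quoted above come from; the exact figure depends on the rank and on which layers receive adapters.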

Model Serving

Infrastructure and techniques for deploying AI models in production—handling requests, scaling, optimization, and monitoring at enterprise scale.

Provider: Industry Standard

ai-concepts model-serving mlops inference deployment production-ai

Prompt Engineering

Technique of crafting effective instructions for large language models to optimize output quality, accuracy, and desired behavior through structured prompts.

Provider: AI Research Community

ai-concepts prompt-engineering llm-optimization ai-interaction best-practices

Quantization (Model Compression)

Model compression technique that reduces memory and computational requirements by converting high-precision weights (32-bit) to lower precision (8-bit, 4-bit) with minimal accuracy loss.

Provider: AI Research Community

ai-concepts model-compression optimization edge-deployment inference efficiency
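
As a rough illustration of the precision conversion described above, the sketch below applies symmetric per-tensor 8-bit quantization to a handful of made-up weights. This is one simple scheme among many (production toolchains typically use per-channel or block-wise variants), and the function names and sample values are assumptions for the example.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0   # one float shared by all weights
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the stored integers."""
    return [v * scale for v in q]

# Made-up weights for illustration.
weights = [0.5, -1.27, 0.003, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Rounding error per weight is bounded by half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Each weight is stored as one byte plus a shared scale, a 4x memory saving over 32-bit floats; very small weights (such as 0.003 here) collapse to zero, which is the kind of accuracy loss finer-grained schemes try to limit.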

RAG (Retrieval-Augmented Generation)

Technique that enhances LLM responses by retrieving relevant information from external knowledge bases before generating answers.

Provider: AI Research Community

ai-concepts retrieval llm-enhancement knowledge-base vector-search enterprise-ai

RLHF (Reinforcement Learning from Human Feedback)

Training AI models using human preferences to align outputs with human values—the technique behind ChatGPT's helpfulness and Claude's safety.

Provider: Industry Standard

ai-concepts rlhf reinforcement-learning alignment human-feedback ppo

Synthetic Data

Artificially generated training data that mimics real-world patterns—solving privacy, cost, and data scarcity challenges in AI development.

Provider: Industry Standard

ai-concepts synthetic-data data-generation privacy augmentation training-data

Transfer Learning

Fundamental deep learning paradigm enabling models trained on large datasets to be adapted for specific tasks with minimal data—accelerating development cycles from months to days and reducing compute requirements by 90%+.

Provider: Research Community (Multiple Origins)

ai-concepts transfer-learning fine-tuning pretrained-models machine-learning deep-learning

Transformer Architecture

Revolutionary neural network architecture using self-attention mechanisms—powering GPT, BERT, Claude, and 95%+ of modern language models, fundamentally transforming NLP by eliminating recurrence and enabling massive parallelization.

Provider: Google Research (2017)

ai-concepts transformer attention-mechanism neural-networks deep-learning nlp

Vector Embeddings

Numerical representations of text, images, or other data that capture semantic meaning in high-dimensional space for similarity search and AI applications.

Provider: AI Research Community

ai-concepts embeddings semantic-search vector-space nlp machine-learning
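
The similarity search mentioned above can be sketched with cosine similarity over a tiny in-memory index. The three-dimensional vectors below are hand-made stand-ins; real embeddings come from a model and typically have hundreds or thousands of dimensions, and the `nearest` helper is a hypothetical name for this example.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def nearest(query, index, k=1):
    """Return the k item names whose vectors are most similar to the query."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]

# Toy "embeddings", invented for illustration.
index = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.0, 0.1, 0.9],
}
query = [0.85, 0.15, 0.05]
top = nearest(query, index, k=2)
```

This brute-force scan is linear in the number of stored vectors; the vector databases listed in this library exist largely to replace it with approximate indexes that scale to millions or billions of items.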

Zero-Shot Learning

Enabling AI models to perform tasks they've never explicitly seen during training—the ultimate in generalization and transfer learning.

Provider: Industry Standard

ai-concepts zero-shot-learning transfer-learning generalization prompting

🔧 Vector Databases

ChromaDB

The AI-native embedded vector database designed for developers—offering zero-configuration setup, built-in embeddings, and production deployment in minutes, powering 50,000+ AI applications from startups to enterprises.

Provider: Chroma (Open Source)

vector-databases chromadb embedded-database rag vector-search open-source

FAISS

Meta's library for billion-scale vector similarity search with GPU acceleration—the fastest, most memory-efficient vector search solution powering production systems at Meta, OpenAI, and thousands of AI applications.

Provider: Meta AI

vector-databases faiss meta similarity-search gpu-acceleration high-performance

Milvus

Cloud-native vector database built for trillion-scale vector search achieving 10,000+ QPS with Kubernetes-native architecture—the enterprise-grade open-source solution powering billion-vector deployments at eBay, Walmart, and NVIDIA.

Provider: LF AI & Data Foundation (Open Source)

vector-databases milvus cloud-native kubernetes vector-search open-source

Pinecone

Fully managed vector database optimized for similarity search at scale, powering RAG systems, recommendation engines, and semantic search with millisecond latency for billions of vectors.

Provider: Pinecone Systems Inc.

vector-databases pinecone similarity-search vector-search embeddings rag

PostgreSQL pgvector

Add vector similarity search to PostgreSQL with ACID compliance—leverage your existing SQL database for embeddings, combining relational integrity with semantic search without deploying specialized vector infrastructure.

Provider: PostgreSQL Community

vector-databases postgresql pgvector relational-database sql acid-compliance

Qdrant

High-performance vector similarity search engine built in Rust achieving sub-10ms queries on 100M+ vectors—the open-source vector database engineered for production AI applications with advanced filtering and payload storage.

Provider: Qdrant Solutions GmbH (Open Source)

vector-databases qdrant vector-search similarity-search rust open-source

Redis Vector Search

High-performance vector similarity search built into Redis Stack—combining sub-millisecond vector queries with Redis's legendary caching, real-time operations, and 1M+ ops/second throughput for hybrid AI applications.

Provider: Redis Ltd.

vector-databases redis redis-stack in-memory real-time hybrid-search

Weaviate

Open-source vector database with GraphQL API, native multi-modal search, and cloud-native architecture—powering semantic search, RAG systems, and AI applications at scale.

Provider: Weaviate B.V.

vector-databases weaviate open-source graphql semantic-search rag

🔧 Language Model

Claude Haiku

Fast, cost-effective Claude model optimized for speed and efficiency.

Provider: Anthropic

language-model claude anthropic fast-model

Claude Opus

Most capable Claude model for complex tasks requiring deep reasoning.

Provider: Anthropic

language-model claude anthropic advanced-reasoning

Gemini 2.5 Flash

Fast, efficient Gemini model with multimodal capabilities.

Provider: Google

language-model gemini google multimodal fast-model

GPT-4

OpenAI's flagship large language model with advanced reasoning capabilities.

Provider: OpenAI

language-model gpt openai multimodal

Llama 4 Maverick

Advanced Llama 4 model for complex reasoning and extended context.

Provider: Meta

language-model llama meta open-source

Llama 4 Scout

Efficient Llama 4 variant optimized for speed and resource efficiency.

Provider: Meta

language-model llama meta open-source

🔧 LLM Platform

Cohere

Enterprise-focused LLM platform with Command R+ models and RAG capabilities.

Provider: Cohere

language-models enterprise-ai embeddings rag

Groq

Ultra-fast LLM inference powered by custom LPU hardware, achieving 500+ tokens/second.

Provider: Groq

language-models inference lpu ultra-fast

Mistral AI

European AI company with open-source and commercial LLMs including Mistral Large and Mixtral MoE.

Provider: Mistral AI

language-models open-source european-ai

Together AI

Fast, cost-effective inference platform for open-source LLMs with competitive pricing.

Provider: Together AI

language-models inference-platform open-source

🔧 Inference Optimization

Continuous Batching

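Scheduling technique for LLM serving that admits new requests into a running batch as soon as earlier sequences finish, rather than waiting for the whole batch to complete, improving throughput.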

Provider: Research

inference optimization batching throughput llm-serving

🎨 Text-to-Image

Generate high-quality images from text descriptions using cutting-edge diffusion models

DALL-E 3

OpenAI's advanced text-to-image model with exceptional prompt understanding and ChatGPT integration.

Provider: OpenAI

Text-to-Image DALL-E OpenAI Image Generation AI Art ChatGPT

FLUX.1

Leading text-to-image model family from Stability AI alumni, delivering photorealistic quality with superior prompt adherence across the API-only Pro variant and the open-weight Dev and Schnell variants.

Provider: Black Forest Labs

image-generation text-to-image photorealistic open-source diffusion-models commercial-use

Google Imagen 3

Google's advanced text-to-image model with photorealistic quality and responsible AI features.

Provider: Google

Text-to-Image Google Imagen Image Generation Google AI

Midjourney v6

Leading AI art generation platform known for exceptional aesthetic quality and artistic capabilities.

Provider: Midjourney

Text-to-Image Midjourney AI Art Image Generation Creative AI

Recraft V3

#1 ranked AI image generator for design-focused images, the only model supporting long text generation and vector art, with precise style control and positioning.

Provider: Recraft AI

image-generation text-to-image design-tools vector-graphics text-generation professional-design brand-assets

SDXL Lightning

Sub-second image generation model from ByteDance using progressive adversarial distillation, generating 1024px images in 1-8 steps with quality superior to SDXL Turbo.

Provider: ByteDance

image-generation text-to-image fast-inference distillation stable-diffusion real-time-ai open-source

Stable Diffusion SDXL

Open-source text-to-image model producing high-quality, photorealistic images with commercial licensing.

Provider: Stability AI

Text-to-Image Stable Diffusion Image Generation Open Source Diffusion Models SDXL

🔧 AI Models

DeepSeek R1

Open-source reasoning model matching OpenAI o1 performance at 96% lower cost through pure reinforcement learning

Provider: DeepSeek

reasoning open-source mathematics coding reinforcement-learning

OpenAI o4-mini

Cost-efficient reasoning model optimized for fast, accurate problem-solving at 10x lower cost than o3

Provider: OpenAI

reasoning cost-efficient mathematics coding openai

🔧 Infrastructure

Docker

Containerization platform enabling consistent application deployment across environments with isolated, portable containers.

Provider: Docker Inc.

docker containerization devops containers deployment infrastructure

Elasticsearch

Distributed search and analytics engine built on Apache Lucene for full-text search, log analytics, and real-time data exploration.

Provider: Elastic N.V.

elasticsearch search-engine analytics lucene full-text-search elk-stack

NVIDIA GB200 Grace Blackwell

NVIDIA's most advanced AI superchip combining Grace CPU and Blackwell GPU, delivering 25x more energy efficiency than H100 for AI inference.

Provider: NVIDIA

nvidia-gpu ai-hardware grace-cpu blackwell superchip ai-infrastructure

Apache Kafka

Distributed event streaming platform for high-throughput, fault-tolerant message publishing, storage, and real-time processing.

Provider: Apache Software Foundation

kafka event-streaming message-broker distributed-systems real-time data-pipeline

Kubernetes

Open-source container orchestration platform for automating deployment, scaling, and management of containerized applications.

Provider: Cloud Native Computing Foundation

kubernetes k8s orchestration containers cloud-native devops

MongoDB

Document-oriented NoSQL database with flexible schema, horizontal scalability, and JSON-like document storage for modern applications.

Provider: MongoDB Inc.

mongodb nosql database document-database json bson

PostgreSQL

Advanced open-source relational database with robust features, ACID compliance, and extensive SQL support for enterprise applications.

Provider: PostgreSQL Global Development Group

postgresql database sql relational-database postgres acid

RabbitMQ

Open-source message broker implementing AMQP for reliable asynchronous communication between distributed systems and microservices.

Provider: VMware (Broadcom)

rabbitmq message-broker amqp queue messaging microservices

Redis

In-memory data store used as database, cache, message broker, and streaming engine with sub-millisecond latency.

Provider: Redis Ltd.

redis cache in-memory-database key-value-store message-broker data-structures

🔊 Speech & Audio

Speech recognition, text-to-speech, and audio generation technologies

ElevenLabs TTS

Advanced AI text-to-speech platform with ultra-realistic voices and emotion control.

Provider: ElevenLabs

Text-to-Speech TTS Voice Synthesis Voice Cloning Audio AI

OpenAI Whisper

OpenAI's robust speech recognition model supporting 99+ languages with exceptional accuracy and noise resistance.

Provider: OpenAI

Speech Recognition ASR OpenAI Transcription Multilingual Audio AI

🔧 Attention Mechanism

Flash Attention

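Exact attention algorithm for transformers that tiles the computation to minimize GPU memory reads and writes, making attention both faster and more memory-efficient.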

Provider: Research

attention transformer efficiency optimization gpu

🔧 Data Privacy

GDPR Compliance

The EU's General Data Protection Regulation, governing the processing of personal data, including by AI systems.

Provider: European Union

privacy compliance gdpr regulations data-protection

🎬 Text-to-Video

Create videos from text prompts with AI-powered video generation

Google Veo 3

World's first AI video generator with native audio generation, creating synchronized soundtracks with dialogue, sound effects, and ambient noise alongside 720p/1080p video.

Provider: Google DeepMind

video-generation audio-generation text-to-video google-deepmind multimodal-ai youtube-shorts

Google Veo

Google's advanced text-to-video model generating high-quality 1080p videos with cinematic effects.

Provider: Google

Text-to-Video Video Generation Google AI Video Google DeepMind

HunyuanVideo

Open-source 13 billion parameter video generation model from Tencent, the largest open-source video model with 720p HD output and advanced camera controls.

Provider: Tencent

video-generation open-source text-to-video diffusion-models 3d-vae large-model tencent

Kling AI

Chinese AI video generation platform with 22M+ users and 168M+ videos generated, featuring advanced diffusion transformer architecture with 3D VAE.

Provider: Kuaishou Technology

video-generation diffusion-models text-to-video chinese-ai content-creation

LTX Video

First DiT-based real-time video generation model generating 30 FPS at 1216×704, now supporting 60+ second clips with ethical training on fully-licensed data.

Provider: Lightricks

video-generation real-time-ai open-source ethical-ai text-to-video diffusion-transformer long-form-video

Mochi 1

10 billion parameter open-source video generation model with Apache 2.0 license, featuring novel AsymmDiT architecture for photorealistic 30fps video with advanced physics simulation.

Provider: Genmo AI

video-generation open-source text-to-video diffusion-models photorealistic physics-simulation commercial-license

Pika 2.0

Advanced AI video generation platform with Scene Ingredients feature, enabling 10-second 1080p videos with custom scene composition from uploaded images and enhanced temporal stability.

Provider: Pika Labs

video-generation text-to-video image-to-video ai-animation content-creation scene-composition

Runway Gen-4

Next-generation AI video model from Runway with world consistency, character persistence across scenes, and superior motion physics simulation, released March 2025.

Provider: Runway

video-generation text-to-video image-to-video character-consistency cinematic-ai motion-physics

Runway Gen-2

Advanced AI video generation platform with comprehensive creative tools for professional filmmakers.

Provider: Runway

Text-to-Video Video Generation Runway AI Video Creative Tools

OpenAI Sora

OpenAI's groundbreaking text-to-video model creating realistic, cinematic videos up to 60 seconds from text descriptions.

Provider: OpenAI

Text-to-Video Video Generation OpenAI AI Video Generative AI Cinematic AI

Veo 3 Fast

Low-latency video generation for YouTube Shorts with 480p output.

Provider: Google

video-generation youtube google-ai

Wan 2.1

Open-source AI video generation model with diffusion transformer architecture, generating 5-second 480P videos on consumer GPUs like RTX 4090.

Provider: Alibaba / Tongyi Lab

video-generation text-to-video open-source ai-video alibaba diffusion-transformer

Wan 2.2

Advanced open-source AI video model with Mixture-of-Experts architecture (27B total parameters, 14B active), supporting 720P video generation with text-to-video, image-to-video, speech-to-video, and character animation.

Provider: Alibaba / Tongyi Lab

video-generation text-to-video open-source ai-video alibaba mixture-of-experts 720p-video

Wan 2.5

Revolutionary AI video model with native audio-video synchronization (second only to Google Veo 3), generating 4K videos up to 10 seconds with automatic voiceovers, sound effects, and background music.

Provider: Alibaba / Tongyi Lab

video-generation audio-generation text-to-video ai-video alibaba 4k-video audio-video-sync

🔧 Training Technique

Gradient Checkpointing

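Memory-efficient training technique for large models that stores only a subset of activations and recomputes the rest during the backward pass, trading extra compute for lower memory use.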

Provider: Research

training memory-optimization efficiency deep-learning

🔧 Inference Server

Hugging Face TGI

Text Generation Inference server for optimized LLM serving.

Provider: Hugging Face

inference serving llm optimization huggingface

🤝 Open Source Platforms

Open-source AI frameworks and model repositories

Hugging Face

Leading open-source platform hosting 500,000+ AI models, 100,000+ datasets, and comprehensive ML tools.

Provider: Hugging Face

Open Source AI Platform Transformers Model Hub ML Community AI Infrastructure

Meta Llama 4

Meta's advanced open-source LLM released April 2025 with multimodal capabilities and MoE architecture.

Provider: Meta

Open Source LLM Meta Llama Foundation Model Multimodal MoE

🔧 Networking

InfiniBand

High-performance networking technology for HPC and AI clusters.

Provider: InfiniBand Trade Association

networking hpc high-performance

🔧 Caching

Memcached

High-performance distributed memory caching system for accelerating web applications with sub-millisecond response times.

Provider: Memcached

caching distributed-systems performance in-memory key-value-store

🔧 Video Generation AI

Midjourney V1 Video

First video generation model from Midjourney, creating 5-21 second cinematic videos through image-to-video workflow with signature artistic quality.

Provider: Midjourney

video-generation image-to-video ai-cinematography creative-tools artistic-video midjourney

🔧 Model Architecture

Mixture of Experts

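Neural network architecture that routes each input to a small subset of specialized expert sub-networks, so only part of the model's parameters is active per token.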

Provider: Research

model-architecture moe efficiency scaling sparse-models

🔧 AI Architecture

Multi-Agent Systems

AI systems with multiple collaborating agents for complex tasks.

Provider: Research

multi-agent ai-architecture collaboration autonomous-agents

🔧 Storage Protocol

NVMe

High-performance storage protocol for SSDs and data centers.

Provider: NVM Express

storage protocol performance

🔧 Model Training

QLoRA

Efficient fine-tuning technique that trains LoRA adapters on top of a frozen, 4-bit quantized base model.

Provider: Research

fine-tuning quantization lora efficiency peft

🔧 ML Platform

Replicate

Cloud platform for running AI models via API with pay-per-use pricing and thousands of models.

Provider: Replicate

ml-platform model-deployment api-service

🔧 Video Generation

Runway Gen-3 Alpha

Advanced AI video generation platform offering text-to-video, image-to-video, and creative video editing tools for filmmakers and creators.

Provider: Runway AI

runway video-generation ai-video gen-3 text-to-video filmmaking

Veo 3

Google's production-ready AI video generation model creating high-quality videos with synchronized audio from text prompts

Provider: Google DeepMind

video-generation text-to-video multimodal google creative-ai
