πŸ”§ GPUs & Hardware

NVIDIA A100

NVIDIA A100

NVIDIA's workhorse AI GPU with 40GB/80GB memoryβ€”the most widely deployed GPU for machine learning training and inference.

Provider: NVIDIA
gpus hardware nvidia a100 ai-training inference
NVIDIA B200

NVIDIA B200

NVIDIA's next-generation Blackwell GPU with 192GB HBM3eβ€”2.5Γ— faster than H100 for AI training, launching 2024-2025.

Provider: NVIDIA
gpus hardware nvidia b200 blackwell next-gen
NVIDIA H100

NVIDIA H100

NVIDIA's flagship AI GPU with 80GB HBM3 memory and 3TB/s bandwidthβ€”the fastest GPU for training large language models and diffusion models.

Provider: NVIDIA
gpus hardware nvidia h100 ai-training llm
NVIDIA H200

NVIDIA H200

NVIDIA's enhanced H100 with 141GB HBM3e memoryβ€”the highest-capacity GPU for massive models and long-context inference.

Provider: NVIDIA
gpus hardware nvidia h200 large-memory llm

🌐 Cloud AI Providers

Leading cloud providers offer managed AI platforms and APIs for scalable, enterprise-grade deployment

Anthropic API

Anthropic API

Direct API access to Claude models including Sonnet 4.5, Opus, and Haiku for building safe and reliable AI applications.

Provider: Anthropic
api claude anthropic llm-api claude-sonnet claude-opus constitutional-ai
Amazon Bedrock

Amazon Bedrock

Fully managed service providing access to leading foundation models from multiple AI providers.

Provider: Amazon Web Services
Cloud AI AWS Foundation Models Managed Service Multi-Provider
Azure OpenAI Service

Azure OpenAI Service

Enterprise-grade OpenAI models with Microsoft's security, compliance, and global infrastructure.

Provider: Microsoft
Cloud AI Azure OpenAI Enterprise AI Microsoft Compliance
Claude Sonnet 4.5

Claude Sonnet 4.5

Anthropic's best coding model with 61.4% OSWorld benchmark and 30+ hours extended thinking capabilities.

Provider: Anthropic
LLM Claude Anthropic AI Safety Long Context Coding Extended Thinking
Gemini 2.5 Pro

Gemini 2.5 Pro

Google's advanced multimodal AI with hybrid reasoning capabilities, generally available October 2025.

Provider: Google
LLM Gemini Google Multimodal Real-time AI Hybrid Reasoning
Gemini 2.5

Gemini 2.5

Google's advanced multimodal AI model with exceptional speed and native multimodal understanding.

Provider: Google
LLM Gemini Google Multimodal Real-time AI
GPT-5

GPT-5

OpenAI's most advanced large language model released in August 2025 with exceptional reasoning and coding capabilities.

Provider: OpenAI
LLM GPT OpenAI Multimodal Chat Reasoning Coding
HyperStack

HyperStack

Scalable GPU cloud infrastructure with dedicated resources optimized for AI/ML workloads and cost-effective pricing.

Provider: Nexgen Cloud
gpu-cloud ai-infrastructure cloud-computing nvidia-gpus ml-platform
OpenAI API

OpenAI API

Direct API access to OpenAI's foundation models including GPT-5, GPT-4, DALL-E 3, and Whisper for building AI applications.

Provider: OpenAI
api gpt-5 gpt-4 dall-e whisper llm-api openai
Google Vertex AI

Google Vertex AI

Unified AI platform for building, deploying, and scaling machine learning models with Google Cloud's managed infrastructure.

Provider: Google Cloud
ml-platform google-cloud automl model-serving mlops gemini vertex-ai

πŸ”§ Hardware

Apple Silicon

Apple Silicon

Apple's custom ARM-based chips for Mac computers with ML acceleration.

Provider: Apple
hardware arm ml-acceleration apple

πŸ”§ Audio AI

AudioCraft

AudioCraft

Meta's comprehensive open-source audio generation toolkit featuring MusicGen for music synthesis, AudioGen for sound effects, and EnCodec neural audio codec for high-quality compression and generation.

Provider: Meta AI
audio-generation music-synthesis sound-effects neural-codec open-source text-to-audio
Bark

Bark

Open-source text-to-audio model from Suno AI that generates realistic, multilingual speech with emotional prosody, laughter, sound effects, and music, supporting 100+ languages with zero-shot voice cloning.

Provider: Suno AI
text-to-speech audio-generation voice-cloning emotional-speech multilingual-tts open-source
Stable Audio

Stable Audio

Text-to-audio model for music and sound effects with 44.1kHz output.

Provider: Stability AI
audio-generation music-ai sound-effects

πŸ”§ Image Generation

AUTOMATIC1111 Stable Diffusion Web UI

AUTOMATIC1111 Stable Diffusion Web UI

Popular open-source web interface for Stable Diffusion with extensive features, extensions, and local deployment capabilities.

Provider: AUTOMATIC1111 (Open Source Community)
stable-diffusion image-generation ai-art webui local-deployment open-source
Fooocus

Fooocus

Simplified Stable Diffusion interface focusing on prompt and image generation with minimal configuration and automatic optimization.

Provider: lllyasviel (Open Source)
fooocus stable-diffusion image-generation simplified-ui ai-art local-deployment
Ideogram 2.0

Ideogram 2.0

AI image generator specialized in accurate text rendering, typography, and design-focused visual creation with photorealistic quality.

Provider: Ideogram AI
ideogram ai-art image-generation text-rendering typography design
Midjourney v7

Midjourney v7

Leading cloud-based AI image generation service known for artistic quality, photorealism, and intuitive Discord-based interface.

Provider: Midjourney, Inc.
midjourney ai-art image-generation cloud-service discord photorealism
Stable Diffusion 3.5

Stable Diffusion 3.5

Latest open-source diffusion model with improved quality, prompt adherence, and multimodal capabilities for text-to-image generation.

Provider: Stability AI
stable-diffusion open-source image-generation diffusion-model text-to-image ai-art

πŸ”§ Event Bus

AWS EventBridge

AWS EventBridge

Serverless event bus service for application integration.

Provider: Amazon Web Services
event-bus serverless aws integration

πŸ”§ Secrets Management

AWS Secrets Manager

AWS Secrets Manager

Managed secrets storage and rotation service.

Provider: Amazon Web Services
secrets-management aws security
HashiCorp Vault

HashiCorp Vault

Secrets management and data protection platform.

Provider: HashiCorp
secrets-management security devops

πŸ”§ Cloud Infrastructure

Amazon Web Services (AWS)

Amazon Web Services (AWS)

Global cloud infrastructure with comprehensive AI/ML servicesβ€”SageMaker AI, Bedrock, EC2 P5 instances, and 200+ services.

Provider: Amazon
cloud-infrastructure aws amazon ml-platform gpu-cloud
Microsoft Azure

Microsoft Azure

Microsoft's cloud platform with comprehensive AI servicesβ€”Azure OpenAI, Azure Machine Learning, 11,000+ models.

Provider: Microsoft
cloud-infrastructure azure microsoft azure-openai enterprise-ai
Google Cloud Platform (GCP)

Google Cloud Platform (GCP)

Google's cloud platform with advanced AI servicesβ€”Vertex AI, TPUs, Gemini models, and 200+ foundation models.

Provider: Google
cloud-infrastructure gcp google-cloud vertex-ai tpu
Lambda Labs

Lambda Labs

GPU cloud optimized for AI/MLβ€”rent H100, A100, or A6000 GPUs by the hour for training, fine-tuning, and inference at competitive prices.

Provider: Lambda Labs
cloud-infrastructure gpu-cloud lambda-labs h100 a100
RunPod

RunPod

Community GPU cloud with spot and on-demand instancesβ€”rent GPUs from $0.20/hr with global availability and serverless inference.

Provider: RunPod
cloud-infrastructure gpu-cloud runpod spot-instances serverless

πŸ”§ Cloud AI

Azure AI Services

Azure AI Services

Microsoft's comprehensive cloud AI services and APIs.

Provider: Microsoft
cloud-ai azure microsoft

πŸ“„ Document AI

Extract, understand, and automate document workflows with AI

Azure Document Intelligence

Azure Document Intelligence

Microsoft's AI service for extracting text, structure, and insights from documents.

Provider: Microsoft
Document AI OCR Azure Data Extraction Microsoft
Google Document AI

Google Document AI

Google's comprehensive document understanding and processing platform powered by AI.

Provider: Google
Document AI OCR Google Cloud Data Extraction NLP

πŸ”§ AI Company

Black Forest Labs

Black Forest Labs

AI research company behind FLUX image generation models.

Provider: Black Forest Labs
ai-company image-generation flux
Stability AI

Stability AI

Leading AI company behind Stable Diffusion and Stable Audio models.

Provider: Stability AI
ai-company image-generation audio-generation

πŸ”§ AI Concepts

Chain-of-Thought Prompting

Chain-of-Thought Prompting

Prompting technique that dramatically improves AI reasoning by asking models to show their work step-by-stepβ€”unlocking complex problem-solving abilities.

Provider: Industry Standard
ai-concepts prompting chain-of-thought reasoning zero-shot few-shot
Constitutional AI

Constitutional AI

Anthropic's technique for training helpful, harmless AI using principles rather than human feedbackβ€”scaling AI alignment through AI self-critique.

Provider: Anthropic
ai-concepts constitutional-ai ai-alignment ai-safety anthropic claude
Diffusion Models

Diffusion Models

Revolutionary generative AI technique powering Stable Diffusion, DALL-E, and Midjourneyβ€”achieving photorealistic image generation by learning to reverse a gradual noising process, fundamentally transforming creative industries and visual AI.

Provider: Research Community (Multiple Origins)
ai-concepts diffusion-models generative-ai image-generation stable-diffusion machine-learning
Distributed Training

Distributed Training

Training neural networks across multiple GPUs or machines to handle massive models and datasetsβ€”essential for modern AI at scale.

Provider: Industry Standard
ai-concepts distributed-training model-parallelism data-parallelism gpu-cluster
Few-Shot Learning

Few-Shot Learning

Training AI models to perform tasks with only a handful of examplesβ€”enabling rapid adaptation to new tasks without extensive retraining.

Provider: Industry Standard
ai-concepts few-shot-learning in-context-learning meta-learning transfer-learning
Fine-tuning

Fine-tuning

Process of adapting pre-trained AI models to specific tasks or domains by continuing training on custom datasets, creating specialized models without training from scratch.

Provider: AI Research Community
ai-concepts fine-tuning model-training transfer-learning customization
LoRA (Low-Rank Adaptation)

LoRA (Low-Rank Adaptation)

Parameter-efficient fine-tuning technique that adapts large language models using low-rank matrix decomposition, reducing trainable parameters by 99% while maintaining quality.

Provider: Microsoft Research
ai-concepts fine-tuning parameter-efficient model-adaptation peft optimization
Model Serving

Model Serving

Infrastructure and techniques for deploying AI models in productionβ€”handling requests, scaling, optimization, and monitoring at enterprise scale.

Provider: Industry Standard
ai-concepts model-serving mlops inference deployment production-ai
Prompt Engineering

Prompt Engineering

Technique of crafting effective instructions for large language models to optimize output quality, accuracy, and desired behavior through structured prompts.

Provider: AI Research Community
ai-concepts prompt-engineering llm-optimization ai-interaction best-practices
Quantization (Model Compression)

Quantization (Model Compression)

Model compression technique that reduces memory and computational requirements by converting high-precision weights (32-bit) to lower precision (8-bit, 4-bit) with minimal accuracy loss.

Provider: AI Research Community
ai-concepts model-compression optimization edge-deployment inference efficiency
RAG (Retrieval-Augmented Generation)

RAG (Retrieval-Augmented Generation)

Technique that enhances LLM responses by retrieving relevant information from external knowledge bases before generating answers.

Provider: AI Research Community
ai-concepts retrieval llm-enhancement knowledge-base vector-search enterprise-ai
RLHF (Reinforcement Learning from Human Feedback)

RLHF (Reinforcement Learning from Human Feedback)

Training AI models using human preferences to align outputs with human valuesβ€”the technique behind ChatGPT's helpfulness and Claude's safety.

Provider: Industry Standard
ai-concepts rlhf reinforcement-learning alignment human-feedback ppo
Synthetic Data

Synthetic Data

Artificially generated training data that mimics real-world patternsβ€”solving privacy, cost, and data scarcity challenges in AI development.

Provider: Industry Standard
ai-concepts synthetic-data data-generation privacy augmentation training-data
Transformer Architecture

Transformer Architecture

Revolutionary neural network architecture using self-attention mechanismsβ€”powering GPT, BERT, Claude, and 95%+ of modern language models, fundamentally transforming NLP by eliminating recurrence and enabling massive parallelization.

Provider: Google Research (2017)
ai-concepts transformer attention-mechanism neural-networks deep-learning nlp
Vector Embeddings

Vector Embeddings

Numerical representations of text, images, or other data that capture semantic meaning in high-dimensional space for similarity search and AI applications.

Provider: AI Research Community
ai-concepts embeddings semantic-search vector-space nlp machine-learning
Zero-Shot Learning

Zero-Shot Learning

Enabling AI models to perform tasks they've never explicitly seen during trainingβ€”the ultimate in generalization and transfer learning.

Provider: Industry Standard
ai-concepts zero-shot-learning transfer-learning generalization prompting

πŸ”§ Vector Databases

ChromaDB

ChromaDB

The AI-native embedded vector database designed for developersβ€”offering zero-configuration setup, built-in embeddings, and production deployment in minutes, powering 50,000+ AI applications from startups to enterprises.

Provider: Chroma (Open Source)
vector-databases chromadb embedded-database rag vector-search open-source
FAISS

FAISS

Meta's library for billion-scale vector similarity searchβ€”the fastest, most memory-efficient vector search for research and production.

Provider: Meta AI
vector-databases faiss meta similarity-search gpu-acceleration
Milvus

Milvus

Cloud-native vector database built for trillion-scale vector search achieving 10,000+ QPS with Kubernetes-native architectureβ€”the enterprise-grade open-source solution powering billion-vector deployments at eBay, Walmart, and NVIDIA.

Provider: LF AI & Data Foundation (Open Source)
vector-databases milvus cloud-native kubernetes vector-search open-source
Pinecone

Pinecone

Fully managed vector database optimized for similarity search at scale, powering RAG systems, recommendation engines, and semantic search with millisecond latency for billions of vectors.

Provider: Pinecone Systems Inc.
vector-databases pinecone similarity-search vector-search embeddings rag
PostgreSQL pgvector

PostgreSQL pgvector

Add vector similarity search to PostgreSQLβ€”leverage your existing database for embeddings without deploying specialized vector infrastructure.

Provider: PostgreSQL Community
vector-databases postgresql pgvector relational-database sql
Qdrant

Qdrant

High-performance vector similarity search engine built in Rust achieving sub-10ms queries on 100M+ vectorsβ€”the open-source vector database engineered for production AI applications with advanced filtering and payload storage.

Provider: Qdrant Solutions GmbH (Open Source)
vector-databases qdrant vector-search similarity-search rust open-source
Redis Vector Search

Redis Vector Search

Redis with vector similarity searchβ€”combine caching, key-value storage, and vector search in one blazingly fast in-memory database.

Provider: Redis
vector-databases redis in-memory caching semantic-search
Weaviate

Weaviate

Open-source vector database with GraphQL API, native multi-modal search, and cloud-native architectureβ€”powering semantic search, RAG systems, and AI applications at scale.

Provider: Weaviate B.V.
vector-databases weaviate open-source graphql semantic-search rag

πŸ”§ Language Model

Claude Haiku

Claude Haiku

Fast, cost-effective Claude model optimized for speed and efficiency.

Provider: Anthropic
language-model claude anthropic
Claude Opus

Claude Opus

Most capable Claude model for complex tasks requiring deep reasoning.

Provider: Anthropic
language-model claude anthropic
Gemini 2.5 Flash

Gemini 2.5 Flash

Fast, efficient Gemini model with multimodal capabilities.

Provider: Google
language-model gemini google multimodal
GPT-4

GPT-4

OpenAI's flagship large language model with advanced reasoning capabilities.

Provider: OpenAI
language-model gpt openai
Llama 4 Maverick

Llama 4 Maverick

Advanced Llama 4 model for complex reasoning and extended context.

Provider: Meta
language-model llama meta open-source
Llama 4 Scout

Llama 4 Scout

Efficient Llama 4 variant optimized for speed and resource efficiency.

Provider: Meta
language-model llama meta open-source

πŸ”§ LLM Platform

Cohere

Cohere

Enterprise-focused LLM platform with Command R+ models and RAG capabilities.

Provider: Cohere
language-models enterprise-ai embeddings rag
Groq

Groq

Ultra-fast LLM inference powered by custom LPU hardware, achieving 500+ tokens/second.

Provider: Groq
language-models inference lpu ultra-fast
Mistral AI

Mistral AI

European AI company with open-source and commercial LLMs including Mistral Large and Mixtral MoE.

Provider: Mistral AI
language-models open-source european-ai
Together AI

Together AI

Fast, cost-effective inference platform for open-source LLMs with competitive pricing.

Provider: Together AI
language-models inference-platform open-source

πŸ”§ Development Tools

ComfyUI

ComfyUI

Powerful node-based interface for Stable Diffusion workflows with 89,000+ GitHub stars and 1,600+ supported nodes.

Provider: Community (Open Source)
workflow-builder stable-diffusion node-based-interface open-source image-generation video-generation gui-tool
LangChain

LangChain

Open-source framework for building LLM-powered applications with modular components for chains, agents, memory, and tool integrationβ€”the industry standard for production AI workflows.

Provider: Harrison Chase (LangChain Inc.)
development-tools langchain llm-framework ai-orchestration python javascript
LlamaIndex

LlamaIndex

Data framework for connecting custom data sources to LLMsβ€”specialized for RAG applications with advanced indexing, retrieval strategies, and production-ready data ingestion pipelines.

Provider: Jerry Liu (LlamaIndex Inc.)
development-tools llamaindex rag-framework data-framework document-indexing python
PyTorch

PyTorch

The dominant deep learning framework powering 70%+ of AI research papers with intuitive Python-first design, dynamic computation graphs, and production-ready deploymentβ€”the foundation for training everything from GPT models to computer vision systems.

Provider: Meta AI (Open Source)
development-tools pytorch deep-learning machine-learning neural-networks open-source
TensorFlow

TensorFlow

Google's production-grade ML framework powering Search, YouTube, and Gmail with 180,000+ GitHub starsβ€”offering comprehensive tools from research to deployment including TensorFlow Lite for mobile and TensorFlow.js for browsers.

Provider: Google (Open Source)
development-tools tensorflow deep-learning machine-learning google open-source
vLLM

vLLM

High-throughput LLM inference engine achieving 24x faster serving with PagedAttention for memory efficiencyβ€”the production standard for deploying large language models at scale.

Provider: UC Berkeley (Open Source)
development-tools vllm llm-inference model-serving performance-optimization open-source

πŸ”§ Inference Optimization

Continuous Batching

Continuous Batching

Dynamic batching technique for improved LLM serving throughput.

Provider: Research
inference optimization batching throughput

🎨 Text-to-Image

Generate high-quality images from text descriptions using cutting-edge diffusion models

DALL-E 3

DALL-E 3

OpenAI's advanced text-to-image model with exceptional prompt understanding and ChatGPT integration.

Provider: OpenAI
Text-to-Image DALL-E OpenAI Image Generation AI Art ChatGPT
FLUX.1

FLUX.1

Leading open-source text-to-image model from Stability AI alumni, delivering photorealistic quality with superior prompt adherence across Pro, Dev, and Schnell variants.

Provider: Black Forest Labs
image-generation text-to-image photorealistic open-source diffusion-models commercial-use
Google Imagen 3

Google Imagen 3

Google's advanced text-to-image model with photorealistic quality and responsible AI features.

Provider: Google
Text-to-Image Google Imagen Image Generation Google AI
Midjourney v6

Midjourney v6

Leading AI art generation platform known for exceptional aesthetic quality and artistic capabilities.

Provider: Midjourney
Text-to-Image Midjourney AI Art Image Generation Creative AI
Recraft V3

Recraft V3

#1 ranked AI image generator for design-focused images, the only model supporting long text generation and vector art, with precise style control and positioning.

Provider: Recraft AI
image-generation text-to-image design-tools vector-graphics text-generation professional-design brand-assets
SDXL Lightning

SDXL Lightning

Sub-second image generation model from ByteDance using progressive adversarial distillation, generating 1024px images in 1-8 steps with quality superior to SDXL Turbo.

Provider: ByteDance
image-generation text-to-image fast-inference distillation stable-diffusion real-time-ai open-source
Stable Diffusion SDXL

Stable Diffusion SDXL

Open-source text-to-image model producing high-quality, photorealistic images with commercial licensing.

Provider: Stability AI
Text-to-Image Stable Diffusion Image Generation Open Source Diffusion Models SDXL

πŸ”§ Infrastructure

Docker

Docker

Containerization platform enabling consistent application deployment across environments with isolated, portable containers.

Provider: Docker Inc.
docker containerization devops containers deployment infrastructure
Elasticsearch

Elasticsearch

Distributed search and analytics engine built on Apache Lucene for full-text search, log analytics, and real-time data exploration.

Provider: Elastic N.V.
elasticsearch search-engine analytics lucene full-text-search elk-stack
NVIDIA GB200 Grace Blackwell

NVIDIA GB200 Grace Blackwell

NVIDIA's most advanced AI superchip combining Grace CPU and Blackwell GPU, delivering 25x more energy efficiency than H100 for AI inference.

Provider: NVIDIA
nvidia-gpu ai-hardware grace-cpu blackwell superchip ai-infrastructure
Apache Kafka

Apache Kafka

Distributed event streaming platform for high-throughput, fault-tolerant message publishing, storage, and real-time processing.

Provider: Apache Software Foundation
kafka event-streaming message-broker distributed-systems real-time data-pipeline
Kubernetes

Kubernetes

Open-source container orchestration platform for automating deployment, scaling, and management of containerized applications.

Provider: Cloud Native Computing Foundation
kubernetes k8s orchestration containers cloud-native devops
MongoDB

MongoDB

Document-oriented NoSQL database with flexible schema, horizontal scalability, and JSON-like document storage for modern applications.

Provider: MongoDB Inc.
mongodb nosql database document-database json bson
PostgreSQL

PostgreSQL

Advanced open-source relational database with robust features, ACID compliance, and extensive SQL support for enterprise applications.

Provider: PostgreSQL Global Development Group
postgresql database sql relational-database postgres acid
RabbitMQ

RabbitMQ

Open-source message broker implementing AMQP for reliable asynchronous communication between distributed systems and microservices.

Provider: VMware (Broadcom)
rabbitmq message-broker amqp queue messaging microservices
Redis

Redis

In-memory data store used as database, cache, message broker, and streaming engine with sub-millisecond latency.

Provider: Redis Ltd.
redis cache in-memory-database key-value-store message-broker data-structures

πŸ”Š Speech & Audio

Speech recognition, text-to-speech, and audio generation technologies

ElevenLabs TTS

ElevenLabs TTS

Advanced AI text-to-speech platform with ultra-realistic voices and emotion control.

Provider: ElevenLabs
Text-to-Speech TTS Voice Synthesis Voice Cloning Audio AI
OpenAI Whisper

OpenAI Whisper

OpenAI's robust speech recognition model supporting 99+ languages with exceptional accuracy and noise resistance.

Provider: OpenAI
Speech Recognition ASR OpenAI Transcription Multilingual Audio AI

πŸ”§ Attention Mechanism

Flash Attention

Flash Attention

Fast and memory-efficient attention mechanism for transformers.

Provider: Research
attention transformer efficiency optimization

πŸ”§ Data Privacy

GDPR Compliance

GDPR Compliance

European data protection regulations for AI and data processing.

Provider: European Union
privacy compliance gdpr regulations

🎬 Text-to-Video

Create videos from text prompts with AI-powered video generation

Google Veo 3

Google Veo 3

World's first AI video generator with native audio generation, creating synchronized soundtracks with dialogue, sound effects, and ambient noise alongside 720p/1080p video.

Provider: Google DeepMind
video-generation audio-generation text-to-video google-deepmind multimodal-ai youtube-shorts
Google Veo

Google Veo

Google's advanced text-to-video model generating high-quality 1080p videos with cinematic effects.

Provider: Google
Text-to-Video Video Generation Google AI Video Google DeepMind
HunyuanVideo

HunyuanVideo

Open-source 13 billion parameter video generation model from Tencent, the largest open-source video model with 720p HD output and advanced camera controls.

Provider: Tencent
video-generation open-source text-to-video diffusion-models 3d-vae large-model tencent
Kling AI

Kling AI

Chinese AI video generation platform with 22M+ users and 168M+ videos generated, featuring advanced diffusion transformer architecture with 3D VAE.

Provider: Kuaishou Technology
video-generation diffusion-models text-to-video chinese-ai content-creation
LTX Video

LTX Video

First DiT-based real-time video generation model generating 30 FPS at 1216Γ—704, now supporting 60+ second clips with ethical training on fully-licensed data.

Provider: Lightricks
video-generation real-time-ai open-source ethical-ai text-to-video diffusion-transformer long-form-video
Mochi 1

Mochi 1

10 billion parameter open-source video generation model with Apache 2.0 license, featuring novel AsymmDiT architecture for photorealistic 30fps video with advanced physics simulation.

Provider: Genmo AI
video-generation open-source text-to-video diffusion-models photorealistic physics-simulation commercial-license
Pika 2.0

Pika 2.0

Advanced AI video generation platform with Scene Ingredients feature, enabling 10-second 1080p videos with custom scene composition from uploaded images and enhanced temporal stability.

Provider: Pika Labs
video-generation text-to-video image-to-video ai-animation content-creation scene-composition
Runway Gen-4

Runway Gen-4

Next-generation AI video model from Runway with world consistency, character persistence across scenes, and superior motion physics simulation, released March 2025.

Provider: Runway
video-generation text-to-video image-to-video character-consistency cinematic-ai motion-physics
Runway Gen-2

Runway Gen-2

Advanced AI video generation platform with comprehensive creative tools for professional filmmakers.

Provider: Runway
Text-to-Video Video Generation Runway AI Video Creative Tools
OpenAI Sora

OpenAI Sora

OpenAI's groundbreaking text-to-video model creating realistic, cinematic videos up to 60 seconds from text descriptions.

Provider: OpenAI
Text-to-Video Video Generation OpenAI AI Video Generative AI Cinematic AI
Veo 3 Fast

Veo 3 Fast

Low-latency video generation for YouTube Shorts with 480p output.

Provider: Google
video-generation youtube google-ai
Wan 2.1

Wan 2.1

Open-source AI video generation model with diffusion transformer architecture, generating 5-second 480P videos on consumer GPUs like RTX 4090.

Provider: Alibaba / Tongyi Lab
video-generation text-to-video open-source ai-video alibaba diffusion-transformer
Wan 2.2

Wan 2.2

Advanced open-source AI video model with Mixture-of-Experts architecture (27B/14B active), supporting 720P video generation with text-to-video, image-to-video, speech-to-video, and character animation.

Provider: Alibaba / Tongyi Lab
video-generation text-to-video open-source ai-video alibaba mixture-of-experts 720p-video
Wan 2.5

Wan 2.5

Revolutionary AI video model with native audio-video synchronization (second only to Google Veo 3), generating 4K videos up to 10 seconds with automatic voiceovers, sound effects, and background music.

Provider: Alibaba / Tongyi Lab
video-generation audio-generation text-to-video ai-video alibaba 4k-video audio-video-sync

πŸ”§ Training Technique

Gradient Checkpointing

Gradient Checkpointing

Memory-efficient training technique for large models.

Provider: Research
training memory-optimization efficiency

πŸ”§ Inference Server

Hugging Face TGI

Hugging Face TGI

Text Generation Inference server for optimized LLM serving.

Provider: Hugging Face
inference serving llm optimization

🀝 Open Source Platforms

Open-source AI frameworks and model repositories

Hugging Face

Hugging Face

Leading open-source platform hosting 500,000+ AI models, 100,000+ datasets, and comprehensive ML tools.

Provider: Hugging Face
Open Source AI Platform Transformers Model Hub ML Community AI Infrastructure
Meta Llama 4

Meta Llama 4

Meta's advanced open-source LLM released April 2025 with multimodal capabilities and MoE architecture.

Provider: Meta
Open Source LLM Meta Llama Foundation Model Multimodal MoE

πŸ”§ Networking

InfiniBand

InfiniBand

High-performance networking technology for HPC and AI clusters.

Provider: InfiniBand Trade Association
networking hpc high-performance

πŸ”§ Caching

Memcached

Memcached

High-performance distributed memory caching system.

Provider: Memcached
caching distributed-systems performance

πŸ”§ Video Generation AI

Midjourney V1 Video

Midjourney V1 Video

First video generation model from Midjourney, creating 5-21 second cinematic videos through image-to-video workflow with signature artistic quality.

Provider: Midjourney
video-generation image-to-video ai-cinematography creative-tools artistic-video midjourney

πŸ”§ Model Architecture

Mixture of Experts

Mixture of Experts

Neural network architecture with specialized expert modules.

Provider: Research
model-architecture moe efficiency

πŸ”§ AI Architecture

Multi-Agent Systems

Multi-Agent Systems

AI systems with multiple collaborating agents for complex tasks.

Provider: Research
multi-agent ai-architecture collaboration

πŸ”§ Storage Protocol

NVMe

NVMe

High-performance storage protocol for SSDs and data centers.

Provider: NVM Express
storage protocol performance

πŸ”§ Model Training

QLoRA

QLoRA

Efficient fine-tuning technique using quantization and LoRA.

Provider: Research
fine-tuning quantization lora efficiency

πŸ”§ ML Platform

Replicate

Replicate

Cloud platform for running AI models via API with pay-per-use pricing and thousands of models.

Provider: Replicate
ml-platform model-deployment api-service

πŸ”§ Video Generation

Runway Gen-3 Alpha

Runway Gen-3 Alpha

Advanced AI video generation platform offering text-to-video, image-to-video, and creative video editing tools for filmmakers and creators.

Provider: Runway AI
runway video-generation ai-video gen-3 text-to-video filmmaking