AI compute

From GISAXS

Revision as of 11:29, 19 February 2025 by KevinYager (talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to: navigation, search

Contents

1 Cloud GPU
2 Cloud Training Compute
3 Cloud LLM Routers & Inference Providers
- 3.1 Multi-model Web Chat Interfaces
- 3.2 Multi-model Web Playground Interfaces
4 Local Router
5 Acceleration Hardware

Cloud GPU

Cloud Training Compute

Cloud LLM Routers & Inference Providers

OpenRouter (open and closed models, no Enterprise tier)
LiteLLM (closed models, Enterprise tier)
Cent ML (open models, Enterprise tier)
Fireworks AI (open models, Enterprise tier)
Abacus AI (open and closed models, Enterprise tier)
Portkey (open? and closed models, Enterprise tier)
Huggingface Inference Providers Hub

Multi-model Web Chat Interfaces

Multi-model Web Playground Interfaces

Local Router

Acceleration Hardware

Nvidia GPUs
Google TPU
Etched: Transformer ASICs
Cerebras
Untether AI
Graphcore
SambaNova Systems
Groq
Tesla Dojo
Deep Silicon: Combined hardware/software solution for accelerated AI (e.g. ternary math)

Retrieved from "http://gisaxs.com/index.php?title=AI_compute&oldid=6982"