Difference between revisions of "AI compute"
KevinYager (talk | contribs) (→Cloud LLM Routers & Inference Providers)
Revision as of 15:29, 18 February 2025
Cloud GPU
Cloud Training Compute
Cloud LLM Routers & Inference Providers
- OpenRouter (https://openrouter.ai/): open and closed models, no Enterprise tier
- LiteLLM (https://www.litellm.ai/): closed models, Enterprise tier
- CentML (https://centml.ai/): open models, Enterprise tier
- Fireworks AI (https://fireworks.ai/): open models, Enterprise tier
- Abacus AI (https://abacus.ai/): open and closed models, Enterprise tier
- Portkey (https://portkey.ai/): open? and closed models, Enterprise tier
- Hugging Face Inference Providers Hub (https://huggingface.co/blog/inference-providers)
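Most of these routers expose an OpenAI-compatible chat-completions API, so switching providers is largely a matter of changing the base URL and model string. A minimal sketch using only the Python standard library, assuming OpenRouter's documented endpoint and an illustrative model name (the model string and the `OPENROUTER_API_KEY` environment variable are assumptions, not part of the original list):

```python
import json
import os
import urllib.request

# OpenRouter exposes an OpenAI-compatible chat-completions endpoint.
OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Assemble an OpenAI-style chat-completion request for OpenRouter."""
    payload = {
        "model": model,  # e.g. an open model routed by OpenRouter (illustrative)
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        OPENROUTER_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )

def chat(model: str, prompt: str) -> str:
    """Send the request and return the assistant's reply text."""
    req = build_request(model, prompt, os.environ["OPENROUTER_API_KEY"])
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

Because the payload shape is the OpenAI one, the same `build_request` works against other OpenAI-compatible providers in the list by swapping the URL and key.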
Multi-model Web Chat Interfaces
Multi-model Web Playground Interfaces
Acceleration Hardware
- Nvidia GPUs
- Google TPU
- Etched: Transformer ASICs
- Cerebras
- Untether AI
- Graphcore
- SambaNova Systems
- Groq
- Tesla Dojo
- Deep Silicon: Combined hardware/software solution for accelerated AI (e.g. ternary math)
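The "ternary math" mentioned for Deep Silicon refers to constraining weights to {-1, 0, +1}, which lets hardware replace multiplications with additions and subtractions. Deep Silicon's actual scheme is not public; the sketch below uses the generic mean-absolute-value scaling popularized by BitNet b1.58, purely as an illustration:

```python
import numpy as np

def ternary_quantize(w: np.ndarray, eps: float = 1e-8):
    """Quantize a weight tensor to {-1, 0, +1} plus one per-tensor scale.

    Mean-absolute-value scaling (BitNet b1.58 style); illustrative only,
    not Deep Silicon's actual (unpublished) scheme.
    """
    scale = np.abs(w).mean()
    q = np.clip(np.round(w / (scale + eps)), -1, 1)
    return q, scale

def ternary_matvec(q: np.ndarray, scale: float, x: np.ndarray) -> np.ndarray:
    # With ternary weights the "multiplies" reduce to adds/subtracts/skips;
    # dedicated hardware exploits that, here we just emulate it numerically.
    return scale * (q @ x)
```

The payoff is that a ternary matrix-vector product needs no multiplier array at all, which is the kind of hardware/software co-design the entry alludes to.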