Difference between revisions of "AI compute"
KevinYager (talk | contribs) (→Cloud LLM Routers)
* [https://glaive.ai/ Glaive AI]
==Cloud LLM Routers & Inference Providers==
* [https://openrouter.ai/ OpenRouter]
* [https://www.litellm.ai/ LiteLLM]
* [https://centml.ai/ CentML]
* [https://fireworks.ai/ Fireworks AI]
* Huggingface [https://huggingface.co/blog/inference-providers Inference Providers Hub]
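Routers such as OpenRouter expose an OpenAI-compatible chat-completions API, which is what makes switching between the providers above straightforward. A minimal sketch of building such a request, assuming the standard endpoint path; the model name is illustrative, not a real listing:

```python
import json

# OpenRouter's OpenAI-compatible chat-completions endpoint.
OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_chat_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-style chat-completions payload."""
    return {
        "model": model,  # provider/model slug; "some-provider/some-model" below is hypothetical
        "messages": [{"role": "user", "content": prompt}],
    }

payload = build_chat_request("some-provider/some-model", "Hello!")
body = json.dumps(payload)
# To send: POST `body` to OPENROUTER_URL with an "Authorization: Bearer <key>" header.
```

The same payload shape works against any of the OpenAI-compatible providers; only the base URL and API key change.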
==Acceleration Hardware==
Revision as of 09:51, 29 January 2025
Contents
Cloud GPU
Cloud Training Compute
Cloud LLM Routers & Inference Providers
- OpenRouter
- LiteLLM
- CentML
- Fireworks AI
- Huggingface Inference Providers Hub
Acceleration Hardware
- Nvidia GPUs
- Google TPU
- Etched: Transformer ASICs
- Cerebras
- Untether AI
- Graphcore
- SambaNova Systems
- Groq
- Tesla Dojo
- Deep Silicon: Combined hardware/software solution for accelerated AI (e.g. ternary math)
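As background on the "ternary math" mentioned for Deep Silicon: ternary accelerators operate on weights restricted to {−1, 0, +1}, which a quantizer produces from full-precision values. A plain-Python sketch of absmean-style ternary quantization, as an illustration of the general idea only (not Deep Silicon's actual scheme):

```python
def ternarize(weights):
    """Absmean ternary quantization: map each weight to {-1, 0, +1}.

    Scale by the mean absolute value, then round and clip. With weights
    constrained this way, matrix multiplies reduce to additions and
    subtractions, which is what ternary hardware exploits.
    """
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0  # guard all-zero input
    q = [max(-1, min(1, round(w / scale))) for w in weights]
    return q, scale

q, s = ternarize([0.9, -0.1, 0.4, -0.8])  # q is a list of -1/0/+1 values
```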