Difference between revisions of "AI compute"
KevinYager (talk | contribs) |
KevinYager (talk | contribs) (→Cloud LLM Routers & Inference Providers) |
||
(3 intermediate revisions by the same user not shown) | |||
Line 11: | Line 11: | ||
* [https://glaive.ai/ Glaive AI] | * [https://glaive.ai/ Glaive AI] | ||
− | ==Cloud LLM Routers== | + | ==Cloud LLM Routers & Inference Providers== |
* [https://openrouter.ai/ OpenRouter] | * [https://openrouter.ai/ OpenRouter] | ||
* [https://www.litellm.ai/ LiteLLM] | * [https://www.litellm.ai/ LiteLLM] | ||
+ | * [https://centml.ai/ Cent ML] | ||
+ | * [https://fireworks.ai/ Fireworks AI] | ||
+ | * Huggingface [https://huggingface.co/blog/inference-providers Inference Providers Hub] | ||
+ | |||
+ | ===Multi-model Web Chat Interfaces=== | ||
+ | * [https://simtheory.ai/ SimTheory] | ||
+ | * [https://abacus.ai/ Abacus AI] | ||
==Acceleration Hardware== | ==Acceleration Hardware== |
Latest revision as of 09:08, 2 February 2025
Contents
Cloud GPU
Cloud Training Compute
Cloud LLM Routers & Inference Providers
- OpenRouter
- LiteLLM
- Cent ML
- Fireworks AI
- Huggingface Inference Providers Hub
Multi-model Web Chat Interfaces
Acceleration Hardware
- Nvidia GPUs
- Google TPU
- Etched: Transformer ASICs
- Cerebras
- Untether AI
- Graphcore
- SambaNova Systems
- Groq
- Tesla Dojo
- Deep Silicon: Combined hardware/software solution for accelerated AI (e.g. ternary math)