Large language models and the platforms that power them.
Meta's open-weight instruction-following model. Available in 8B and 70B parameter sizes.
A 7B parameter model with strong performance on math and coding benchmarks.
Mixture-of-experts model that activates only 12B of its 46B total parameters per token.
Alibaba's multilingual LLM family with strong code and math capabilities.
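
To make the active-versus-total distinction in the mixture-of-experts entry concrete, here is a minimal arithmetic sketch. It uses only the two figures quoted in that entry (12B active, 46B total); the helper name `active_fraction` is hypothetical and introduced purely for illustration.

```python
def active_fraction(active_params_b: float, total_params_b: float) -> float:
    """Return the share of parameters used per token (both arguments in billions)."""
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# Figures taken from the mixture-of-experts entry: 12B active, 46B total.
fraction = active_fraction(12, 46)
print(f"{fraction:.0%} of parameters are active per token")  # prints "26% ..."
```

Roughly a quarter of the weights take part in each token's forward pass, so per-token compute is closer to that of a 12B dense model, while all 46B parameters still have to be held in memory.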