llama-3.3-70b

模型描述

Meta Llama 3.3 is a state-of-the-art 70 billion parameter multilingual large language model (LLM) designed for text generation tasks. As an instruction-tuned variant of the Llama architecture, it specializes in assistant-like dialogue applications across English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai. The model employs an optimized transformer architecture with Grouped-Query Attention (GQA) for efficient inference, trained on over 15 trillion tokens of publicly available data with a knowledge cutoff in December 2023. It leverages both supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align responses with human preferences for helpfulness and safety. Notable features include a 128k token context window, tool calling capabilities, and compliance with Meta’s custom commercial license (Llama 3.3 Community License). The model demonstrates strong performance on industry benchmarks while explicitly prohibiting unlawful uses or applications in unsupported languages without proper safety measures.

全文结束

推荐模型

DeepSeek-V3-0324

深度寻求-V3-0324 是一个升级的人工智能模型,具有增强的推理、编码、中文写作和网络搜索能力,在某些任务中超越了 GPT-4.5,同时保持 128K 上下文支持和开源 MIT 许可。

o3

Our most powerful reasoning model with leading performance on coding, math, science, and vision

gpt-4o-mini-rev

使用逆向工程在官方应用程序中调用模型并将其转换为 API。