bge-m3

2024-02-02
嵌入
By 开源

Input: ￥0.14 / M tokens Output: ￥0.14 / M tokens
特征：文本输入
上下文： 8K

Input: ￥0.14 / M tokens Output: ￥0.14 / M tokens
特征：文本输入
上下文： 8K

模型描述

BGE-M3 stands out for its Multi-Functionality (simultaneous dense, sparse, and multi-vector retrieval), Multi-Linguality (100+ languages), and Multi-Granularity (up to 8,192-token documents). It enhances retrieval pipelines by enabling hybrid retrieval (e.g., combining dense embeddings with BM25-like sparse weights) and re-ranking for higher accuracy. The model integrates seamlessly with tools like Vespa and Milvus, and its unified fine-tuning supports diverse retrieval methods. Recent updates include improved MIRACL benchmark performance and multilingual long-document datasets (MLDR).

推荐模型

QwQ-32B

对话, 推理
Qwen

QwQ-32B 是 Qwen 系列中的一个 32.5B 参数推理模型，具有先进的架构和 131K 令牌上下文长度，旨在在复杂任务中超越像 DeepSeek-R1 这样的最先进模型。

2025-03-06

DeepSeek-R1

对话, 推理
DeepSeek

与 OpenAI-o1 相当的性能，完全开源模型和技术报告，代码和模型在 MIT 许可证下发布：自由提炼和商业化。

2025-01-20

gemini-2.0-flash

对话, 识图
Gemini

Gemini 2.0 Flash 提供了下一代功能和改进的能力，包括更快的速度、原生工具使用、多模态生成和 1M 令牌上下文窗口。

2025-02-14