[Request] 自定义嵌入模型 Custom Embedding model #3785

ProAlexUSC · 2024-09-05T19:02:26Z

🥰 需求描述

目前嵌入模型仅支持 text-embedding-3-small，能否支持其他模型，或者提供模型 remap 的能力?

At present, the embedded model only supports text-embedding-3-small. Can it support other models, or provide the ability for model remapping?

🧐 解决方案

我在 OpenAI 模型配置下，使用其他兼容 OpenAI 接口的大模型提供商，它提供了 netease-youdao/bce-embedding-base_v1 模型。
希望能使用这个模型作为嵌入模型，或者直接重命名 netease-youdao/bce-embedding-base_v1 为 text-embedding-3-small

Under the OpenAI model configuration, I am using a large model provider that is compatible with the OpenAI API, which offers the netease-youdao/bce-embedding-base_v1 model.
I hope to use this model as the embedding model, or directly rename netease-youdao/bce-embedding-base_v1 to text-embedding-3-small.

📝 补充信息

No response

The text was updated successfully, but these errors were encountered:

lobehubbot · 2024-09-05T19:02:40Z

👀 @ProAlexUSC

Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible.
Please make sure you have given us as much context as possible.
非常感谢您提交 issue。我们会尽快调查此事，并尽快回复您。请确保您已经提供了尽可能多的背景信息。

dosubot · 2024-09-05T19:03:48Z

@ProAlexUSC 你好！我是Dosu，我是一个可以帮助你解决 bug、回答问题，并成为贡献者的机器人。如果你有任何问题，我会很乐意帮助你。

You can add support for other embedding models or remap existing models within the system. The models.json file in the src/libs/agent-runtime/togetherai/fixtures/ directory allows you to specify model remapping. For example, you can rename netease-youdao/bce-embedding-base_v1 to text-embedding-3-small as follows:

{
  "name": "netease-youdao/bce-embedding-base_v1",
  "renamed": "text-embedding-3-small",
  ...
}

This will allow the system to use the netease-youdao/bce-embedding-base_v1 model under the name text-embedding-3-small ^[1].

To continue talking to Dosu, mention @dosu.

arvinxx · 2024-09-06T01:45:21Z

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

lobehubbot · 2024-09-06T01:45:34Z

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

It will be supported in the future, but the vector of this model must be able to set 1024 dimensions before it can be used.

Lostsite · 2024-09-08T05:16:18Z

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

Why must be 1024 dimensions? Also, based on OpenAI, "text-embedding-3-small" is 1,536 dimensions.

ref: https://platform.openai.com/docs/models/embeddings

arvinxx · 2024-09-08T06:43:48Z

@Lostsite Just read the offical docuement.

https://openai.com/index/new-embedding-models-and-api-updates/

https://platform.openai.com/docs/guides/embeddings/use-cases

exqmjmz · 2024-09-11T14:08:01Z

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

BAAI/bge-m3 这个向量支持1024纬度在siliconflow有这个模型，目前是免费
https://docs.siliconflow.cn/reference/createembedding-1

CaiJingLong · 2024-09-12T05:05:16Z

So, must have OpenAI API to use it?
Why not support ollama embedding.
Or, if it support ollama embedding, how to set it?

Btw, I use self host server mode to deploy with docker-compose.

exqmjmz · 2024-09-12T07:12:07Z

智谱的embedding-3也支持1024纬度以上的向量
https://open.bigmodel.cn/dev/api/vector/embedding-3

alwqx · 2024-09-22T06:16:29Z

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

赞，希望支持 ollama 运行的嵌入模型

lobehubbot · 2024-09-22T06:16:40Z

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

It will be supported in the future, but the vector of this model must be able to set 1024 dimensions before it can be used.

Like, I hope to support the embedded model run by ollama

JunerLee · 2024-09-26T16:04:41Z

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

目前也是遇到了这个问题，服务端只能用openai的词嵌入向量化，国内很多模型也支持价格也不贵，希望可以支持更多家的向量化模型，比如ChatGLM的

lobehubbot · 2024-09-26T16:04:56Z

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

It will be supported in the future, but the vector of this model must be able to set 1024 dimensions before it can be used.

We are currently encountering this problem as well. The server can only use openai's word embedding vectorization. Many domestic models also support it and the price is not expensive. We hope to support more vectorization models, such as ChatGLM's.

Salmon-Bard · 2024-09-27T12:14:51Z

How can I get lobe-chat to a call a Rest API that exposes my local hosted embedding model ??
My be there should be some webhook or plugin to pass embedding created via any model

AlexBlack2202 · 2024-10-29T08:18:47Z

hello, is there any plain for support my own custom embedding, i think this is not a very hard problem

arvinxx · 2024-10-29T13:19:56Z

@AlexBlack2202 No. It's a hard work for the architecture design to make it easy to intergate more embedding providers. Here is PR for it: #4370. Stay tuned

thunderbug1 · 2024-12-16T09:19:29Z

Any news on this?
This is very important for us since we only use Azure OpenAI models and we cannot use the knowledgebase feature without configuring the embedding model.

CaiJingLong · 2024-12-17T03:20:39Z

在我看来，这个功能甚至应该是优先级最高的功能之一，因为它影响了所有无法使用OpenAI或只能本地运行 embedding 的用户

lobehubbot · 2024-12-17T03:20:51Z

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

In my opinion, this feature should even be one of the highest priority features since it affects all users who don't have access to OpenAI or can only run embedding locally

steinvenic · 2024-12-18T06:49:23Z

Hope to support this feature as soon as possible.

changjiong · 2025-01-14T06:40:23Z

Any news on this?

lobehubbot · 2025-01-15T03:17:34Z

✅ @ProAlexUSC

This issue is closed, If you have any questions, you can comment and reply.
此问题已经关闭。如果您有任何问题，可以留言并回复。

wxxsfxyzm · 2025-03-02T08:54:18Z

想要使用其他embedding模型的可以考虑走oneapi转一下：

ninggu2008 · 2025-03-09T15:25:19Z

想要使用其他embedding模型的可以考虑走oneapi转一下：

按照你的办法折腾了一下，成功了。从这个角度看，官方实现应该没啥难度，就是改个名字，不明白半年过去了为啥一点进度都没有。

lobehubbot · 2025-03-09T15:25:31Z

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

If you want to use other embedding models, you can consider going to oneapi:

I tried my best to do it and it was successful. From this perspective, it should be no difficulty in implementing the official implementation, just change the name, and don’t understand why there is no progress after half a year.

ProAlexUSC added the 🌠 Feature Request New feature or request | 特性与建议 label Sep 5, 2024

arvinxx added the ollama Relative to Ollama Provider and ollama models label Sep 18, 2024

This was referenced Sep 27, 2024

[Request] 文件向量化支持 GitHub 的 OpenAI Text Embedding 3 #4128

Closed

[Request] Add support for other text-embedding model providers #4180

Closed

[Request] 内嵌功能适配Azure OpenAI #4017

Closed

This was referenced Oct 13, 2024

[Request] 向量模型是否可以支持使用智谱的？ #4207

Closed

[Request] 知识库Ollama Embedding模型支持 #4408

Closed

This was referenced Oct 29, 2024

[Request] 是否可以提供自定義 bedrock embedding model 的範例 #4457

Closed

[Bug] 自定义embedding模型为智谱清言，但是依旧无法进行向量化 #4486

Closed

RS-Imagine mentioned this issue Dec 29, 2024

[Request] Custom Embedded Model RS-Imagine/lobe-chat#15

Closed

This was referenced Jan 14, 2025

[Request] 什么时候能够支持更多的向量化模型 #5396

Closed

[Request] 向量化模型 #5435

Closed

[Request] 向量化模型现只支持openai？ #5225

Closed

✨ feat: support to customize Embedding model with env #5177

Merged

arvinxx closed this as completed in #5177 Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Request] 自定义嵌入模型 Custom Embedding model #3785

[Request] 自定义嵌入模型 Custom Embedding model #3785

ProAlexUSC commented Sep 5, 2024

lobehubbot commented Sep 5, 2024

dosubot bot commented Sep 5, 2024

arvinxx commented Sep 6, 2024

lobehubbot commented Sep 6, 2024

Lostsite commented Sep 8, 2024 •

edited

Loading

arvinxx commented Sep 8, 2024

exqmjmz commented Sep 11, 2024

CaiJingLong commented Sep 12, 2024 •

edited

Loading

exqmjmz commented Sep 12, 2024

alwqx commented Sep 22, 2024

lobehubbot commented Sep 22, 2024

JunerLee commented Sep 26, 2024

lobehubbot commented Sep 26, 2024

Salmon-Bard commented Sep 27, 2024 •

edited

Loading

AlexBlack2202 commented Oct 29, 2024

arvinxx commented Oct 29, 2024

thunderbug1 commented Dec 16, 2024 •

edited

Loading

CaiJingLong commented Dec 17, 2024

lobehubbot commented Dec 17, 2024

steinvenic commented Dec 18, 2024

changjiong commented Jan 14, 2025

lobehubbot commented Jan 15, 2025

wxxsfxyzm commented Mar 2, 2025

ninggu2008 commented Mar 9, 2025

lobehubbot commented Mar 9, 2025

[Request] 自定义嵌入模型 Custom Embedding model #3785

[Request] 自定义嵌入模型 Custom Embedding model #3785

Comments

ProAlexUSC commented Sep 5, 2024

🥰 需求描述

🧐 解决方案

📝 补充信息

lobehubbot commented Sep 5, 2024

dosubot bot commented Sep 5, 2024

arvinxx commented Sep 6, 2024

lobehubbot commented Sep 6, 2024

Lostsite commented Sep 8, 2024 • edited Loading

arvinxx commented Sep 8, 2024

exqmjmz commented Sep 11, 2024

CaiJingLong commented Sep 12, 2024 • edited Loading

exqmjmz commented Sep 12, 2024

alwqx commented Sep 22, 2024

lobehubbot commented Sep 22, 2024

JunerLee commented Sep 26, 2024

lobehubbot commented Sep 26, 2024

Salmon-Bard commented Sep 27, 2024 • edited Loading

AlexBlack2202 commented Oct 29, 2024

arvinxx commented Oct 29, 2024

thunderbug1 commented Dec 16, 2024 • edited Loading

CaiJingLong commented Dec 17, 2024

lobehubbot commented Dec 17, 2024

steinvenic commented Dec 18, 2024

changjiong commented Jan 14, 2025

lobehubbot commented Jan 15, 2025

wxxsfxyzm commented Mar 2, 2025

ninggu2008 commented Mar 9, 2025

lobehubbot commented Mar 9, 2025

Lostsite commented Sep 8, 2024 •

edited

Loading

CaiJingLong commented Sep 12, 2024 •

edited

Loading

Salmon-Bard commented Sep 27, 2024 •

edited

Loading

thunderbug1 commented Dec 16, 2024 •

edited

Loading