SemanticKernel/C#：实现接口，接入本地嵌入模型

前言

本文通过Codeblaze.SemanticKernel这个项目，学习如何实现ITextEmbeddingGenerationService接口，接入本地嵌入模型。

项目地址：https://github.com/BLaZeKiLL/Codeblaze.SemanticKernel

实践

SemanticKernel初看以为只支持OpenAI的各种模型，但其实也提供了强大的抽象能力，可以通过自己实现接口，来实现接入不兼容OpenAI格式的模型。

Codeblaze.SemanticKernel这个项目实现了ITextGenerationService、IChatCompletionService与ITextEmbeddingGenerationService接口，由于现在Ollama的对话已经支持了OpenAI格式，因此可以不用实现ITextGenerationService和IChatCompletionService来接入Ollama中的模型了，但目前Ollama的嵌入还没有兼容OpenAI的格式，因此可以通过实现ITextEmbeddingGenerationService接口，接入Ollama中的嵌入模型。

查看ITextEmbeddingGenerationService接口：

代表了一种生成浮点类型文本嵌入的生成器。

再看看IEmbeddingGenerationService<string, float>接口：

[Experimental("SKEXP0001")]
public interface IEmbeddingGenerationService<TValue, TEmbedding> : IAIService where TEmbedding : unmanaged
{Task<IList<ReadOnlyMemory<TEmbedding>>> GenerateEmbeddingsAsync(IList<TValue> data, Kernel? kernel = null, CancellationToken cancellationToken = default(CancellationToken));
}

再看看IAIService接口：

说明我们只要实现了

Task<IList<ReadOnlyMemory<TEmbedding>>> GenerateEmbeddingsAsync(IList<TValue> data, Kernel? kernel = null, CancellationToken cancellationToken = default(CancellationToken));IReadOnlyDictionary<string, object?> Attributes { get; }

这个方法和属性就行。

学习Codeblaze.SemanticKernel中是怎么做的。

添加OllamaBase类：

 public interface IOllamaBase{Task PingOllamaAsync(CancellationToken cancellationToken = new());}public abstract class OllamaBase<T> : IOllamaBase where T : OllamaBase<T>{public IReadOnlyDictionary<string, object?> Attributes => _attributes;private readonly Dictionary<string, object?> _attributes = new();protected readonly HttpClient Http;protected readonly ILogger<T> Logger;protected OllamaBase(string modelId, string baseUrl, HttpClient http, ILoggerFactory? loggerFactory){_attributes.Add("model_id", modelId);_attributes.Add("base_url", baseUrl);Http = http;Logger = loggerFactory is not null ? loggerFactory.CreateLogger<T>() : NullLogger<T>.Instance;}/// <summary>/// Ping Ollama instance to check if the required llm model is available at the instance/// </summary>/// <param name="cancellationToken"></param>public async Task PingOllamaAsync(CancellationToken cancellationToken = new()){var data = new{name = Attributes["model_id"]};var response = await Http.PostAsJsonAsync($"{Attributes["base_url"]}/api/show", data, cancellationToken).ConfigureAwait(false);ValidateOllamaResponse(response);Logger.LogInformation("Connected to Ollama at {url} with model {model}", Attributes["base_url"], Attributes["model_id"]);}protected void ValidateOllamaResponse(HttpResponseMessage? response){try{response.EnsureSuccessStatusCode();}catch (HttpRequestException){Logger.LogError("Unable to connect to ollama at {url} with model {model}", Attributes["base_url"], Attributes["model_id"]);}}}

注意这个

public IReadOnlyDictionary<string, object?> Attributes => _attributes;

实现了接口中的属性。

添加OllamaTextEmbeddingGeneration类：

#pragma warning disable SKEXP0001public class OllamaTextEmbeddingGeneration(string modelId, string baseUrl, HttpClient http, ILoggerFactory? loggerFactory): OllamaBase<OllamaTextEmbeddingGeneration>(modelId, baseUrl, http, loggerFactory),ITextEmbeddingGenerationService{public async Task<IList<ReadOnlyMemory<float>>> GenerateEmbeddingsAsync(IList<string> data, Kernel? kernel = null,CancellationToken cancellationToken = new()){var result = new List<ReadOnlyMemory<float>>(data.Count);foreach (var text in data){var request = new{model = Attributes["model_id"],prompt = text};var response = await Http.PostAsJsonAsync($"{Attributes["base_url"]}/api/embeddings", request, cancellationToken).ConfigureAwait(false);ValidateOllamaResponse(response);var json = JsonSerializer.Deserialize<JsonNode>(await response.Content.ReadAsStringAsync().ConfigureAwait(false));var embedding = new ReadOnlyMemory<float>(json!["embedding"]?.AsArray().GetValues<float>().ToArray());result.Add(embedding);}return result;}}

注意实现了GenerateEmbeddingsAsync方法。实现的思路就是向Ollama中的嵌入接口发送请求，获得embedding数组。

为了在MemoryBuilder中能用还需要添加扩展方法：

#pragma warning disable SKEXP0001public static class OllamaMemoryBuilderExtensions{/// <summary>/// Adds Ollama as the text embedding generation backend for semantic memory/// </summary>/// <param name="builder">kernel builder</param>/// <param name="modelId">Ollama model ID to use</param>/// <param name="baseUrl">Ollama base url</param>/// <returns></returns>public static MemoryBuilder WithOllamaTextEmbeddingGeneration(this MemoryBuilder builder,string modelId,string baseUrl){builder.WithTextEmbeddingGeneration((logger, http) => new OllamaTextEmbeddingGeneration(modelId,baseUrl,http,logger));return builder;}       }

开始使用

 public async Task<ISemanticTextMemory> GetTextMemory3(){var builder = new MemoryBuilder();var embeddingEndpoint = "http://localhost:11434";var cancellationTokenSource = new System.Threading.CancellationTokenSource();var cancellationToken = cancellationTokenSource.Token;builder.WithHttpClient(new HttpClient());builder.WithOllamaTextEmbeddingGeneration("mxbai-embed-large:335m", embeddingEndpoint);IMemoryStore memoryStore = await SqliteMemoryStore.ConnectAsync("memstore.db");builder.WithMemoryStore(memoryStore);var textMemory = builder.Build();return textMemory;}

  builder.WithOllamaTextEmbeddingGeneration("mxbai-embed-large:335m", embeddingEndpoint);

实现了WithOllamaTextEmbeddingGeneration这个扩展方法，因此可以这么写，使用的是mxbai-embed-large:335m这个向量模型。

我使用WPF简单做了个界面，来试试效果。

找了一个新闻嵌入：

文本向量化存入数据库中：

现在测试RAG效果：

回答的效果也还可以。

大模型使用的是在线api的Qwen/Qwen2-72B-Instruct，嵌入模型使用的是本地Ollama中的mxbai-embed-large:335m。

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.rhkb.cn/news/397043.html

如若内容造成侵权/违法违规/事实不符，请联系长河编程网进行投诉反馈email:809451989@qq.com，一经查实，立即删除！

SemanticKernel/C#：实现接口，接入本地嵌入模型

前言

实践

开始使用

相关文章

近似算法：求Π的近似值（迭代法）

git系统学习

重启人生计划-大梦方醒

MybatisPlus——扩展功能（一）

软考高级：真实的程序、核心程序、小型基准程序、合成基准程序

使用Selenium调试Edge浏览器的常见问题与解决方案

运行时数据区

某MDM主数据管理系统与微软Dynamic CRM系统（新加坡节点）集成案例

基于Java中的SSM框架实现在线收银系统项目【项目源码+论文说明】

气膜建筑的抗风与防火性能：保障仓储的安全—轻空间

ZLM+wvp-pro使用错误记录

一站搞定原型链：深入理解JavaScript的继承机制

冥想第一千二百四十八天(12478)

Linux应用层开发（7）：网络编程

STM32的USB接口介绍

LVS负载均衡集群部署之—NAT模式的介绍及搭建步骤

一行实现88个群智能算法优化混合核极限学习机HKELM的多特征输入单输出的数据回归预测Matlab程序全家桶

【exgcd 扩展欧几里得算法】[ABC340F] S = 1 题解

DC-3靶机打靶练习！！！！

Basic‘ attribute type should not be a container解决方法