偷偷摘套内射激情视频,久久精品99国产国产精,中文字幕无线乱码人妻,中文在线中文a,性爽19p

<sub id="gep0j"><p id="gep0j"></p></sub>

51CTO首頁(yè)

AI.x社區(qū)

軟考社區(qū)

免費(fèi)課

企業(yè)培訓(xùn)

鴻蒙開(kāi)發(fā)者社區(qū)

信創(chuàng)認(rèn)證

公眾號(hào)矩陣

移動(dòng)端

視頻課免費(fèi)課排行榜短視頻直播課軟考學(xué)堂

全部課程軟考信創(chuàng)認(rèn)證華為認(rèn)證廠商認(rèn)證 IT技術(shù)PMP項(xiàng)目管理免費(fèi)題庫(kù)

在線學(xué)習(xí)

文章資源問(wèn)答課堂專欄直播

51CTO

鴻蒙開(kāi)發(fā)者社區(qū)

51CTO技術(shù)棧

51CTO官微

51CTO學(xué)堂

51CTO博客

CTO訓(xùn)練營(yíng)

鴻蒙開(kāi)發(fā)者社區(qū)訂閱號(hào)

51CTO軟考

51CTO學(xué)堂APP

51CTO學(xué)堂企業(yè)版APP

鴻蒙開(kāi)發(fā)者社區(qū)視頻號(hào)

51CTO軟考題庫(kù)

賬號(hào)設(shè)置退出

SpringAI + Redis：構(gòu)建高性能RAG問(wèn)答系統(tǒng)的架構(gòu)設(shè)計(jì)與實(shí)戰(zhàn)

作者：farerboy 2025-09-26 02:15:00

Spring AI與Redis的結(jié)合為Java開(kāi)發(fā)者提供了構(gòu)建高性能RAG系統(tǒng)的理想方案。通過(guò)本文介紹的架構(gòu)設(shè)計(jì)和實(shí)現(xiàn)方案，企業(yè)可以快速搭建屬于自己的智能問(wèn)答系統(tǒng)，顯著提升知識(shí)管理效率。

引言：RAG技術(shù)為何成為企業(yè)AI應(yīng)用首選

實(shí)現(xiàn)成本降低千倍、響應(yīng)速度秒級(jí)的企業(yè)級(jí)知識(shí)庫(kù)解決方案

在當(dāng)前AI技術(shù)飛速發(fā)展的背景下，企業(yè)面臨著一個(gè)核心挑戰(zhàn)：如何讓大語(yǔ)言模型（LLM）準(zhǔn)確掌握企業(yè)內(nèi)部知識(shí)并避免產(chǎn)生幻覺(jué)（Hallucination）？檢索增強(qiáng)生成（Retrieval-Augmented Generation，RAG）技術(shù)應(yīng)運(yùn)而生，它通過(guò)將信息檢索與生成模型相結(jié)合，有效解決了這一難題。

Spring AI作為Spring官方推出的AI開(kāi)發(fā)框架，為Java開(kāi)發(fā)者提供了構(gòu)建AI應(yīng)用的標(biāo)準(zhǔn)化方案。結(jié)合Redis這一高性能向量數(shù)據(jù)庫(kù)，我們可以構(gòu)建出響應(yīng)迅速、成本可控、易于維護(hù)的RAG問(wèn)答系統(tǒng)。本文將深入探討這一技術(shù)組合的架構(gòu)設(shè)計(jì)、核心實(shí)現(xiàn)和優(yōu)化策略。

一、RAG技術(shù)架構(gòu)設(shè)計(jì)

1.1 系統(tǒng)整體架構(gòu)

基于Spring AI和Redis的RAG系統(tǒng)主要包含以下組件：

圖片

1.2 技術(shù)棧選型依據(jù)

Spring AI：提供統(tǒng)一的AI應(yīng)用開(kāi)發(fā)接口，支持多種大模型和向量數(shù)據(jù)庫(kù)
Redis Stack：具備向量搜索能力的高性能內(nèi)存數(shù)據(jù)庫(kù)，適合實(shí)時(shí)檢索場(chǎng)景
OpenAI API/本地模型：平衡性能與成本的需求

二、環(huán)境準(zhǔn)備與核心配置

2.1 項(xiàng)目依賴配置

<dependencies>
    <dependency>
        <groupId>org.springframework.boot</groupId>
        <artifactId>spring-boot-starter-web</artifactId>
    </dependency>
    <dependency>
        <groupId>org.springframework.ai</groupId>
        <artifactId>spring-ai-openai-spring-boot-starter</artifactId>
    </dependency>
    <!-- Redis 向量存儲(chǔ) -->
    <dependency>
        <groupId>org.springframework.ai</groupId>
        <artifactId>spring-ai-starter-vector-store-redis</artifactId>
    </dependency>
    <dependency>
        <groupId>org.springframework.ai</groupId>
        <artifactId>spring-ai-redis-spring-boot-starter</artifactId>
    </dependency>
    <dependency>
        <groupId>org.springframework.ai</groupId>
        <artifactId>spring-ai-pdf-document-reader</artifactId>
    </dependency>
    <!-- 文檔解析（支持 Word、Excel 等） -->
    <dependency>
        <groupId>org.springframework.ai</groupId>
        <artifactId>spring-ai-tika-document-reader</artifactId>
    </dependency>
</dependencies>

2.2 應(yīng)用配置文件

# application.yml
spring:
  ai:
    openai:
      embedding:
        options:
          model: text-embedding-v4 # 使用百煉平臺(tái)的嵌入模型
    vectorstore:
      redis:
        uri: redis://localhost:6379
        index: knowledge-base
        prefix: "doc:"
        initialize-schema: true


server:
  port: 8080

三、核心實(shí)現(xiàn)源碼解析

3.1 數(shù)據(jù)加載服務(wù)實(shí)現(xiàn)

知識(shí)庫(kù)的初始化是RAG系統(tǒng)的基礎(chǔ)，需要將文檔轉(zhuǎn)換為向量并存儲(chǔ)到Redis中。

@Service
@Slf4j
public class DataLoaderService {


    @Value("classpath:knowledge/*.pdf")
    private Resource[] knowledgeResources;


    @Autowired
    private VectorStore vectorStore;


    @PostConstruct
    public void initializeKnowledgeBase() {
        log.info("開(kāi)始初始化知識(shí)庫(kù)...");


        for (Resource resource : knowledgeResources) {
            try {
                // 使用PDF文檔閱讀器
                PagePdfDocumentReader pdfReader = new PagePdfDocumentReader(
                    resource,
                    PdfDocumentReaderConfig.builder()
                        .withPagesPerDocument(1)
                        .build()
                );


                // 文本分割器，確保文檔塊大小合適
                TokenTextSplitter textSplitter = new TokenTextSplitter(
                    1000,  // 最大token數(shù)
                    200,   // 重疊token數(shù)
                    true   // 分段存儲(chǔ)
                );


                // 讀取、分割并存儲(chǔ)文檔
                List<Document> documents = pdfReader.get();
                List<Document> chunks = textSplitter.apply(documents);


                vectorStore.add(chunks);
                log.info("已加載文檔: {}，分割為 {} 個(gè)塊", 
                    resource.getFilename(), chunks.size());


            } catch (Exception e) {
                log.error("加載文檔失敗: {}", resource.getFilename(), e);
            }
        }
        log.info("知識(shí)庫(kù)初始化完成");
    }
}

3.2 RAG服務(wù)核心邏輯

RAG服務(wù)的核心在于實(shí)現(xiàn)檢索與生成的協(xié)同工作。

@Service
@Slf4j
public class RagService {


    @Autowired
    private VectorStore vectorStore;


    @Autowired
    private ChatClient chatClient;


    // 相似度搜索配置
    private static final int TOP_K = 5;
    private static final double SIMILARITY_THRESHOLD = 0.7;


    public Generation retrieve(String userQuery) {
        // 1. 向量相似度搜索
        SearchRequest searchRequest = SearchRequest.query(userQuery)
            .withTopK(TOP_K)
            .withSimilarityThreshold(SIMILARITY_THRESHOLD);


        List<Document> relevantDocs = vectorStore.similaritySearch(searchRequest);


        if (relevantDocs.isEmpty()) {
            return new Generation("未找到相關(guān)信息，請(qǐng)嘗試其他問(wèn)題。");
        }


        // 2. 構(gòu)建增強(qiáng)提示
        String context = buildContext(relevantDocs);
        String enhancedPrompt = buildEnhancedPrompt(userQuery, context);


        // 3. 調(diào)用LLM生成回答
        Prompt prompt = new Prompt(enhancedPrompt);
        ChatResponse response = chatClient.call(prompt);


        return response.getResult();
    }


    private String buildContext(List<Document> documents) {
        StringBuilder contextBuilder = new StringBuilder();
        contextBuilder.append("相關(guān)參考信息：\n\n");


        for (int i = 0; i < documents.size(); i++) {
            Document doc = documents.get(i);
            contextBuilder.append(String.format("[%d] %s\n\n", i + 1, doc.getText()));
        }


        return contextBuilder.toString();
    }


    private String buildEnhancedPrompt(String userQuery, String context) {
        return String.format("""
            你是一個(gè)專業(yè)的客服助手，請(qǐng)根據(jù)以下參考信息回答問(wèn)題。
            如果參考信息不足以回答問(wèn)題，請(qǐng)明確說(shuō)明。
            不要編造信息，保持回答準(zhǔn)確、簡(jiǎn)潔。


            %s


            用戶問(wèn)題：%s


            請(qǐng)根據(jù)以上信息提供回答：
            """, context, userQuery);
    }
}

3.3 控制器層實(shí)現(xiàn)

@RestController
@RequestMapping("/api/rag")
@Slf4j
public class RagController {


    @Autowired
    private RagService ragService;


    @PostMapping("/chat")
    public ResponseEntity<ChatResponse> chat(@RequestBody ChatRequest request) {
        try {
            long startTime = System.currentTimeMillis();


            Generation generation = ragService.retrieve(request.getQuestion());


            long responseTime = System.currentTimeMillis() - startTime;
            log.info("問(wèn)題處理完成: 問(wèn)題長(zhǎng)度={}, 響應(yīng)時(shí)間={}ms", 
                request.getQuestion().length(), responseTime);


            ChatResponse response = new ChatResponse(
                generation.getOutput().getContent(),
                responseTime
            );


            return ResponseEntity.ok(response);


        } catch (Exception e) {
            log.error("處理問(wèn)題時(shí)發(fā)生錯(cuò)誤", e);
            return ResponseEntity.status(HttpStatus.INTERNAL_SERVER_ERROR)
                .body(new ChatResponse("系統(tǒng)繁忙，請(qǐng)稍后重試", -1));
        }
    }


    // DTO類
    @Data
    @AllArgsConstructor
    public static class ChatRequest {
        private String question;
    }


    @Data
    @AllArgsConstructor
    public static class ChatResponse {
        private String answer;
        private long responseTimeMs;
    }
}

四、高級(jí)特性與優(yōu)化策略

4.1 使用QuestionAnswerAdvisor優(yōu)化RAG流程

Spring AI提供了Advisor接口來(lái)標(biāo)準(zhǔn)化RAG流程的實(shí)現(xiàn)。

@Configuration
@Slf4j
public class RagAdvisorConfig {


    @Bean
    public QuestionAnswerAdvisor questionAnswerAdvisor(
            VectorStore vectorStore, 
            ChatClient chatClient) {


        return new QuestionAnswerAdvisor(vectorStore, chatClient) {
            @Override
            public Prompt before(String question) {
                // 自定義檢索邏輯
                SearchRequest request = SearchRequest.query(question)
                    .withTopK(5)
                    .withSimilarityThreshold(0.75)
                    .withFilterExpression("category == 'technical'");


                List<Document> docs = vectorStore.similaritySearch(request);


                // 構(gòu)建系統(tǒng)消息
                SystemMessage systemMessage = new SystemMessage(
                    "你是一個(gè)技術(shù)專家，請(qǐng)根據(jù)以下文檔回答問(wèn)題：\n" + 
                    docs.stream()
                        .map(Document::getText)
                        .collect(Collectors.joining("\n\n"))
                );


                UserMessage userMessage = new UserMessage(question);


                return new Prompt(List.of(systemMessage, userMessage));
            }


            @Override
            public String after(ChatResponse response) {
                // 后處理：添加引用和驗(yàn)證
                String answer = response.getResult().getOutput().getContent();
                return answer + "\n\n*以上信息僅供參考*";
            }
        };
    }
}

4.2 性能優(yōu)化實(shí)踐

向量索引優(yōu)化

spring:
  ai:
    vectorstore:
      redis:
        index-type: HNSW  # 使用分層導(dǎo)航小世界算法
        distance-metric: COSINE  # 余弦相似度
        index-options: |
          {
            "EF_CONSTRUCTION": 200,
            "M": 16
          }

緩存策略實(shí)現(xiàn)

@Service
@Slf4j
public class CachingRagService {


    @Autowired
    private RagService ragService;


    @Autowired
    private RedisTemplate<String, String> redisTemplate;


    private static final long CACHE_TTL = 3600; // 1小時(shí)


    public Generation retrieveWithCache(String userQuery) {
        // 生成查詢指紋作為緩存鍵
        String cacheKey = generateCacheKey(userQuery);


        // 嘗試從緩存獲取
        String cachedAnswer = redisTemplate.opsForValue().get(cacheKey);
        if (cachedAnswer != null) {
            log.debug("緩存命中: {}", cacheKey);
            return new Generation(cachedAnswer);
        }


        // 緩存未命中，執(zhí)行RAG流程
        Generation generation = ragService.retrieve(userQuery);


        // 緩存結(jié)果
        if (shouldCache(generation)) {
            redisTemplate.opsForValue().set(
                cacheKey, 
                generation.getOutput().getContent(),
                Duration.ofSeconds(CACHE_TTL)
            );
        }


        return generation;
    }


    private String generateCacheKey(String query) {
        return "rag:cache:" + Integer.toHexString(query.hashCode());
    }


    private boolean shouldCache(Generation generation) {
        // 只緩存高質(zhì)量的回答
        String content = generation.getOutput().getContent();
        return !content.contains("不確定") && !content.contains("無(wú)法回答");
    }
}

五、實(shí)戰(zhàn)案例：企業(yè)知識(shí)庫(kù)問(wèn)答系統(tǒng)

5.1 系統(tǒng)特色功能

基于Spring AI和Redis的RAG系統(tǒng)在實(shí)際應(yīng)用中表現(xiàn)出色：

精準(zhǔn)問(wèn)答：針對(duì)"公司請(qǐng)假流程是什么？"等問(wèn)題，能直接從員工手冊(cè)中檢索相關(guān)信息生成準(zhǔn)確回答
多文檔支持：支持PDF、Word、HTML等多種格式文檔的自動(dòng)處理和向量化
實(shí)時(shí)更新：知識(shí)庫(kù)更新后，系統(tǒng)能夠立即感知并提供最新信息

5.2 性能對(duì)比數(shù)據(jù)

六、總結(jié)與展望

Spring AI與Redis的結(jié)合為Java開(kāi)發(fā)者提供了構(gòu)建高性能RAG系統(tǒng)的理想方案。通過(guò)本文介紹的架構(gòu)設(shè)計(jì)和實(shí)現(xiàn)方案，企業(yè)可以快速搭建屬于自己的智能問(wèn)答系統(tǒng)，顯著提升知識(shí)管理效率。

未來(lái)，隨著Spring AI生態(tài)的不斷完善，我們可以期待更多高級(jí)特性的出現(xiàn)：

多模態(tài)RAG：支持圖像、表格等非文本內(nèi)容的檢索與生成
自適應(yīng)學(xué)習(xí)：系統(tǒng)能夠根據(jù)用戶反饋?zhàn)詣?dòng)優(yōu)化檢索策略
邊緣部署：支持在資源受限環(huán)境中運(yùn)行輕量級(jí)RAG系統(tǒng)

責(zé)任編輯：武曉燕來(lái)源：小林聊編程

SpringAI Redis RAG

點(diǎn)贊

51CTO技術(shù)棧公眾號(hào)

業(yè)務(wù)
速覽

媒體

51CTO CIOAge HC3i

社區(qū)

51CTO博客鴻蒙開(kāi)發(fā)者社區(qū) AI.x社區(qū)

教育

51CTO學(xué)堂精培企業(yè)培訓(xùn) CTO訓(xùn)練營(yíng)

<ol id="y75xh"></ol>

<tfoot id="y75xh"></tfoot>