RAG from Beginner to Practice (Part 3): A Complete Guide to PDF Parsing and Text Chunking

Series: RAG from Beginner to Practice (5 parts)
Difficulty: ⭐⭐⭐ Intermediate
Estimated time: 60 minutes
Code verified: ✅ Python 3.9+


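1. Introduction

1.1 Why Does Document Processing Matter?

Garbage in, garbage out: if documents are processed poorly, the quality of the whole RAG system drops sharply.

Common problems:

  • ❌ PDF tables are parsed into scrambled text
  • ❌ Chunking cuts through the middle of a semantic unit
  • ❌ Special characters are handled incorrectly
  • ❌ Chinese word segmentation is inaccurate

1.2 What You Will Learn

  • ✅ PDF parsing (pypdf, formerly PyPDF2, and pdfplumber)
  • ✅ Text chunking strategies
  • ✅ Chinese embedding models
  • ✅ A complete data processing pipeline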

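2. PDF Parsing in Practice

2.1 Installing Dependencies

# Basic PDF parsing
pip install pypdf==3.17.0
# Advanced PDF parsing (recommended)
pip install pdfplumber==0.10.3
# Table extraction
pip install tabula-py==2.8.2
# OCR (optional, for scanned documents)
pip install paddleocr==2.7.3
# Also imported by later scripts in this article (versions not pinned here)
pip install pandas langchain sentence-transformers scikit-learn chromadb tiktoken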

2.2 Basic Usage with pypdf (formerly PyPDF2)

Create pdf-parser.py:

# -*- coding: utf-8 -*-
"""
Basic PDF parsing tutorial
Run: python pdf-parser.py
"""
from pypdf import PdfReader

# ========== 1. Read the PDF ==========
pdf_path = "sample.pdf"  # Replace with your PDF file
reader = PdfReader(pdf_path)
print(f"📄 PDF pages: {len(reader.pages)}")

# ========== 2. Extract text ==========
full_text = ""
for i, page in enumerate(reader.pages, 1):
    text = page.extract_text() or ""  # pages without a text layer yield nothing
    full_text += text + "\n"          # keep a line break between pages
    print(f"\n--- Page {i} ---")
    print(text[:500])  # Show only the first 500 characters

# ========== 3. Save the text ==========
with open("output.txt", "w", encoding="utf-8") as f:
    f.write(full_text)

print("\n✅ Text saved to output.txt")
print(f"Total characters: {len(full_text)}")

2.3 Advanced Parsing with pdfplumber

Create advanced-pdf-parser.py:

# -*- coding: utf-8 -*-
"""
Advanced PDF parsing with pdfplumber
Run: python advanced-pdf-parser.py
"""
import pdfplumber
import pandas as pd

pdf_path = "sample.pdf"

# ========== 1. Extract text (formatting preserved) ==========
with pdfplumber.open(pdf_path) as pdf:
    for i, page in enumerate(pdf.pages, 1):
        text = page.extract_text()
        print(f"\n=== Page {i} ===")
        print(text)

# ========== 2. Extract tables ==========
with pdfplumber.open(pdf_path) as pdf:
    for i, page in enumerate(pdf.pages, 1):
        tables = page.extract_tables()
        for j, table in enumerate(tables):
            print(f"\n📊 Page {i} - table {j+1}")
            # Convert to a DataFrame
            df = pd.DataFrame(table)
            print(df.head())
            # Save as CSV
            df.to_csv(f"table_page{i}_table{j+1}.csv", index=False)

# ========== 3. Extract text from a specific region ==========
with pdfplumber.open(pdf_path) as pdf:
    page = pdf.pages[0]
    # Define the region (left, top, right, bottom); it must lie inside the page
    crop_box = (50, 100, 500, 300)
    cropped = page.crop(crop_box)
    text = cropped.extract_text()
    print("\n🎯 Text in the selected region:")
    print(text)

# ========== 4. Extract metadata ==========
with pdfplumber.open(pdf_path) as pdf:
    metadata = pdf.metadata
    print("\n📋 PDF metadata:")
    for key, value in metadata.items():
        print(f"{key}: {value}")
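pdfplumber's `extract_tables()` returns each table as a list of rows, where every row is a list of cell strings and empty cells come back as `None`. Before saving, it often helps to tidy those cells. The following is a minimal sketch using only the standard library; the helper names `clean_table` and `table_to_csv` and the sample `raw_table` are hypothetical and only mimic the shape of real extractor output.

```python
import csv
import io

def clean_table(table):
    """Replace None cells with "" and collapse line breaks inside cells."""
    cleaned = []
    for row in table:
        cleaned.append([
            " ".join(cell.split()) if cell is not None else ""
            for cell in row
        ])
    return cleaned

def table_to_csv(table):
    """Serialize a cleaned table to CSV text, one line per row."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(table)
    return buffer.getvalue()

# Hypothetical sample shaped like page.extract_tables()[0]
raw_table = [
    ["模型", "维度", None],
    ["text2vec-base-chinese", "768", "入门\n快速"],
]

rows = clean_table(raw_table)
print(table_to_csv(rows))
```

Cleaning before export keeps multi-line cells from breaking the CSV into extra rows when the file is opened in a spreadsheet.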

2.4 Handling Chinese PDFs

Create chinese-pdf-parser.py:

# -*- coding: utf-8 -*-
"""
Chinese PDF parsing (handles encoding issues)
Run: python chinese-pdf-parser.py
"""
from pypdf import PdfReader
import re


def clean_text(text):
    """Clean extracted text."""
    if not text:
        return ""
    # Collapse runs of whitespace
    text = re.sub(r"\s+", " ", text)
    # Drop special characters, keeping word characters, CJK ideographs,
    # and common full-width Chinese punctuation
    text = re.sub(r"[^\w\s\u4e00-\u9fff，。！？；：“”‘’、]", "", text)
    return text.strip()


def parse_chinese_pdf(pdf_path):
    """Parse a Chinese PDF and return the cleaned text of each page."""
    reader = PdfReader(pdf_path)
    pages_text = []
    for page in reader.pages:
        text = page.extract_text()
        cleaned = clean_text(text)
        if cleaned:
            pages_text.append(cleaned)
    return pages_text


# Usage example
if __name__ == "__main__":
    pdf_path = "chinese_document.pdf"
    pages = parse_chinese_pdf(pdf_path)
    print(f"✅ Parsed {len(pages)} pages")

    # Merge all pages
    full_text = "\n".join(pages)
    print(f"Total characters: {len(full_text)}")

    # Save
    with open("chinese_output.txt", "w", encoding="utf-8") as f:
        f.write(full_text)
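Chinese PDFs often mix full-width and half-width forms of the same letters and digits. One possible extra cleaning step, not part of the script above, is Unicode NFKC normalization from the standard library. This is a sketch under that assumption; the helper name `normalize_width` is hypothetical.

```python
import unicodedata

def normalize_width(text):
    """Apply NFKC so full-width letters and digits become their ASCII forms."""
    return unicodedata.normalize("NFKC", text)

sample = "ＲＡＧ 模型，版本１２３。"
print(normalize_width(sample))
```

Note that NFKC also turns full-width punctuation such as "，" into ",", while "。" and "、" stay unchanged. If you normalize this way, the separators passed to a text splitter have to match the normalized punctuation.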

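3. Text Chunking Strategies

3.1 Why Chunk at All?

Reasons:

  1. Embedding models have input limits: most models cap the input length (for example, 512 tokens)
  2. Retrieval precision: smaller chunks make precise matches easier to find
  3. Context window: the LLM can only accept a limited amount of input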
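To make the idea concrete, here is a minimal sketch of the simplest possible chunker: fixed-size character windows that overlap. It ignores sentence boundaries entirely, and the function name `chunk_by_chars` is made up for illustration; it is not how any particular library does it.

```python
def chunk_by_chars(text, chunk_size=200, chunk_overlap=20):
    """Split text into fixed-size character windows that overlap."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
    return chunks

sample = "人工智能是计算机科学的一个分支。" * 10  # 160 characters
chunks = chunk_by_chars(sample, chunk_size=50, chunk_overlap=10)
for i, chunk in enumerate(chunks, 1):
    print(i, len(chunk), chunk[:12])
```

Each window repeats the last `chunk_overlap` characters of the previous one, which is what lets a sentence cut at a boundary still appear whole in at least one chunk.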

3.2 LangChain Text Splitters

Create text-splitter-demo.py:

# -*- coding: utf-8 -*-
"""
Comparing text splitters
Run: python text-splitter-demo.py
"""
# In newer LangChain releases these classes live in the langchain_text_splitters package
from langchain.text_splitter import (
    RecursiveCharacterTextSplitter,
    CharacterTextSplitter,
    TokenTextSplitter,
)

# Sample text (kept in Chinese, with full-width punctuation, to show Chinese splitting)
sample_text = """
人工智能（Artificial Intelligence，简称 AI）是计算机科学的一个分支，它企图了解智能的实质，并生产出一种新的能以人类智能相似的方式做出反应的智能机器。
该领域的研究包括机器人、语言识别、图像识别、自然语言处理和专家系统等。
人工智能从诞生以来，理论和技术日益成熟，应用领域也不断扩大。可以设想，未来人工智能带来的科技产品，将会是人类智慧的“容器”。
"""

# ========== 1. Character splitter ==========
print("=" * 60)
print("Character splitter (CharacterTextSplitter)")
print("=" * 60)
char_splitter = CharacterTextSplitter(
    separator="\n",
    chunk_size=50,
    chunk_overlap=10,
)
char_chunks = char_splitter.split_text(sample_text)
for i, chunk in enumerate(char_chunks, 1):
    print(f"\nChunk {i} ({len(chunk)} characters):")
    print(chunk)

# ========== 2. Recursive character splitter (recommended) ==========
print("\n" + "=" * 60)
print("Recursive character splitter (RecursiveCharacterTextSplitter)")
print("=" * 60)
rec_splitter = RecursiveCharacterTextSplitter(
    # Full-width punctuation, as it appears in Chinese text
    separators=["\n\n", "\n", "。", "！", "？", "；", "，", " ", ""],
    chunk_size=50,
    chunk_overlap=10,
    length_function=len,
)
rec_chunks = rec_splitter.split_text(sample_text)
for i, chunk in enumerate(rec_chunks, 1):
    print(f"\nChunk {i} ({len(chunk)} characters):")
    print(chunk)

# ========== 3. Token splitter (requires the tiktoken package) ==========
print("\n" + "=" * 60)
print("Token splitter (TokenTextSplitter)")
print("=" * 60)
token_splitter = TokenTextSplitter(
    chunk_size=20,
    chunk_overlap=5,
)
token_chunks = token_splitter.split_text(sample_text)
for i, chunk in enumerate(token_chunks, 1):
    print(f"\nChunk {i}:")
    print(chunk)
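The idea behind recursive splitting can be sketched in plain Python: try the coarsest separator first, and fall back to finer ones only for pieces that are still too long. The version below is a simplified illustration without overlap, not LangChain's actual implementation, and `recursive_split` is a hypothetical name.

```python
def recursive_split(text, separators, chunk_size):
    """Split on the coarsest separator first; recurse only on oversized pieces."""
    if len(text) <= chunk_size:
        return [text] if text else []
    if not separators:
        # No separators left: fall back to hard cuts every chunk_size characters.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

    sep, rest = separators[0], separators[1:]
    if sep not in text:
        return recursive_split(text, rest, chunk_size)

    # Keep each separator attached to the piece that precedes it.
    parts = text.split(sep)
    pieces = [p + sep for p in parts[:-1]] + [parts[-1]]

    chunks, current = [], ""
    for piece in pieces:
        if len(piece) > chunk_size:
            if current:
                chunks.append(current)
                current = ""
            chunks.extend(recursive_split(piece, rest, chunk_size))
        elif len(current) + len(piece) <= chunk_size:
            current += piece  # pack small pieces together
        else:
            chunks.append(current)
            current = piece
    if current:
        chunks.append(current)
    return chunks

text = (
    "人工智能是计算机科学的一个分支。"
    "它研究如何使计算机能够模拟人类智能。"
    "机器学习是人工智能的核心技术之一，深度学习是机器学习的重要分支。"
)
chunks = recursive_split(text, ["\n\n", "\n", "。", "，"], chunk_size=20)
for i, chunk in enumerate(chunks, 1):
    print(i, len(chunk), chunk)
```

Here the first two sentences fit within 20 characters and stay whole, while the third is too long and is split again at the comma.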

3.3 A Splitter Tuned for Chinese

Create chinese-text-splitter.py:

# -*- coding: utf-8 -*-
"""
Text splitter tuned for Chinese
Run: python chinese-text-splitter.py
"""
from langchain.text_splitter import RecursiveCharacterTextSplitter


class ChineseTextSplitter(RecursiveCharacterTextSplitter):
    """Text splitter tuned for Chinese."""

    def __init__(self, **kwargs):
        # Chinese separators in priority order (full-width punctuation)
        separators = [
            "\n\n",  # paragraph
            "\n",    # line break
            "。",    # full stop
            "！",    # exclamation mark
            "？",    # question mark
            "；",    # semicolon
            "，",    # comma
            "、",    # enumeration comma
            " ",     # space
            "",      # last resort: split by character
        ]
        super().__init__(
            separators=separators,
            chunk_size=kwargs.get("chunk_size", 200),
            chunk_overlap=kwargs.get("chunk_overlap", 20),
            length_function=len,
        )


# Usage example
if __name__ == "__main__":
    text = """
    人工智能是计算机科学的一个分支。它研究如何使计算机能够模拟人类智能。
    机器学习是人工智能的核心技术之一。深度学习是机器学习的重要分支。
    自然语言处理让计算机能够理解和生成人类语言。计算机视觉让计算机能够“看”懂图像。
    """
    splitter = ChineseTextSplitter(chunk_size=50, chunk_overlap=10)
    chunks = splitter.split_text(text)
    print(f"Number of chunks: {len(chunks)}\n")
    for i, chunk in enumerate(chunks, 1):
        print(f"Chunk {i}: {chunk}")

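3.4 Chunking Parameter Tuning Guide

| Parameter | Small value | Large value | Effect |
|---|---|---|---|
| chunk_size | 100 | 500 | Smaller = more precise; larger = more context |
| chunk_overlap | 0 | 50 | Overlap preserves semantic continuity |
| separators | - | - | More separators = more natural chunk boundaries |

Recommended configuration:

# For Chinese documents
from langchain.text_splitter import RecursiveCharacterTextSplitter

splitter = RecursiveCharacterTextSplitter(
    chunk_size=200,      # 200 characters
    chunk_overlap=20,    # 10% overlap
    separators=["\n\n", "\n", "。", "！", "？", "，", ""],  # full-width punctuation
)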
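When comparing settings, it helps to know roughly how many chunks a document will produce. For plain sliding windows the count is ceil((L - overlap) / (chunk_size - overlap)) for a text of L characters. The sketch below assumes that simple model; separator-aware splitters cut at sentence boundaries, so their real counts will differ somewhat. The function name `estimate_chunk_count` is hypothetical.

```python
import math

def estimate_chunk_count(text_length, chunk_size, chunk_overlap):
    """Estimate how many fixed-size windows cover text_length characters."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    if text_length <= 0:
        return 0
    if text_length <= chunk_size:
        return 1
    step = chunk_size - chunk_overlap
    return math.ceil((text_length - chunk_overlap) / step)

# A 10,000-character document under three settings with 10% overlap
for size, overlap in [(100, 10), (200, 20), (500, 50)]:
    n = estimate_chunk_count(10_000, size, overlap)
    print(f"chunk_size={size}, chunk_overlap={overlap}: about {n} chunks")
```

Fewer, larger chunks mean fewer vectors to store and search, at the cost of less precise matches.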

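4. Embedding Models in Practice

4.1 Comparing Chinese Embedding Models

| Model | Dimensions | Speed + quality (stars, combined) | Recommended use |
|---|---|---|---|
| text2vec-base-chinese | 768 | ⭐⭐⭐⭐⭐⭐⭐⭐⭐ | Getting started / fast |
| text2vec-large-chinese | 1024 | ⭐⭐⭐⭐⭐⭐⭐⭐ | Production |
| m3e-base | 768 | ⭐⭐⭐⭐⭐⭐⭐⭐ | General purpose |
| bge-large-zh | 1024 | ⭐⭐⭐⭐⭐⭐⭐⭐ | High accuracy |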

4.2 Using sentence-transformers

Create embedding-demo.py:

# -*- coding: utf-8 -*-
"""
Embedding model usage example
Run: python embedding-demo.py
"""
import pickle

from sentence_transformers import SentenceTransformer
import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

# ========== 1. Load the model ==========
print("Loading model...")
model = SentenceTransformer('shibing624/text2vec-base-chinese')
print("✅ Model loaded")

# ========== 2. Encode text ==========
sentences = [
    "人工智能是未来",
    "AI 技术发展前景广阔",
    "今天天气不错",
    "机器学习需要大量数据",
]
print("\nEncoding...")
embeddings = model.encode(sentences)
print(f"✅ Encoding done, embedding shape: {embeddings.shape}")

# ========== 3. Compute similarity ==========
print("\n📊 Text similarity matrix:")
similarity = cosine_similarity(embeddings)
for i, s1 in enumerate(sentences):
    print(f"\nSimilarity between '{s1}' and the other texts:")
    for j, s2 in enumerate(sentences):
        if i != j:
            print(f"  - '{s2}': {similarity[i][j]:.4f}")

# ========== 4. Save and load ==========
# Save
with open("embeddings.pkl", "wb") as f:
    pickle.dump(embeddings, f)
print("\n✅ Embeddings saved")

# Load
with open("embeddings.pkl", "rb") as f:
    loaded_embeddings = pickle.load(f)
print("✅ Embeddings loaded")
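What `cosine_similarity` computes for each pair is the dot product divided by the product of the two vector lengths. A small pure-Python sketch makes that explicit; the three-dimensional vectors are toy values chosen for illustration, not real model output.

```python
import math

def cosine(a, b):
    """Cosine similarity: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # define similarity with a zero vector as 0
    return dot / (norm_a * norm_b)

# Toy 3-dimensional vectors standing in for real 768-dimensional embeddings
v_ai = [0.9, 0.1, 0.0]
v_ml = [0.8, 0.2, 0.1]
v_weather = [0.0, 0.1, 0.9]

print(f"ai vs ml:      {cosine(v_ai, v_ml):.4f}")
print(f"ai vs weather: {cosine(v_ai, v_weather):.4f}")
```

Vectors that point in a similar direction score close to 1, and unrelated ones score close to 0, regardless of vector length.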

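4.3 Batch Encoding Optimization

# Batch encoding (faster); assumes `model` is the SentenceTransformer loaded in 4.2
sentences = [f"文本 {i}" for i in range(1, 101)]  # placeholder texts

# ❌ Slow: encode one text at a time
embeddings = [model.encode(s) for s in sentences]

# ✅ Fast: encode in batches
embeddings = model.encode(sentences, batch_size=32, show_progress_bar=True)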
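To see what `batch_size` means in practice, here is a sketch of how a list is grouped into batches. It only illustrates the batching idea and is not how sentence-transformers works internally; the helper name `batched` is hypothetical. A helper like this can also be handy when a corpus is too large to encode in one call and you want to pass it group by group.

```python
def batched(items, batch_size):
    """Yield consecutive slices of items, each at most batch_size long."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]

texts = [f"文本 {i}" for i in range(1, 71)]  # 70 placeholder texts
sizes = [len(batch) for batch in batched(texts, 32)]
print(sizes)  # [32, 32, 6]
```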

5. The Complete Data Processing Pipeline

Create data-pipeline.py:

# -*- coding: utf-8 -*-
"""
Complete RAG data processing pipeline
Run: python data-pipeline.py
"""
import os

from pypdf import PdfReader
# These import paths match older LangChain releases; newer releases move the
# classes into packages such as langchain_text_splitters and langchain_community.
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import Chroma
from langchain.schema import Document


class RAGDataPipeline:
    """RAG data processing pipeline."""

    def __init__(self, pdf_path, output_dir="./rag_data"):
        self.pdf_path = pdf_path
        self.output_dir = output_dir
        os.makedirs(output_dir, exist_ok=True)

        # Initialize components
        self.splitter = RecursiveCharacterTextSplitter(
            chunk_size=200,
            chunk_overlap=20,
            separators=["\n\n", "\n", "。", "！", "？", "，", ""],
        )
        print("Loading embedding model...")
        # Chroma needs a LangChain Embeddings object (embed_documents / embed_query),
        # so the sentence-transformers model is wrapped with HuggingFaceEmbeddings.
        self.embeddings = HuggingFaceEmbeddings(
            model_name="shibing624/text2vec-base-chinese"
        )
        print("✅ Model loaded")

    def extract_text_from_pdf(self):
        """Extract text from the PDF, page by page."""
        print(f"\n📄 Parsing PDF: {self.pdf_path}")
        reader = PdfReader(self.pdf_path)
        pages_text = []
        for i, page in enumerate(reader.pages, 1):
            text = page.extract_text()
            if text:
                pages_text.append({
                    "page": i,
                    "text": text.strip(),
                })
        print(f"✅ Extracted text from {len(pages_text)} pages")
        return pages_text

    def split_text(self, pages_text):
        """Split page text into chunks, keeping the page number as metadata."""
        print("\n✂️  Chunking...")
        chunks = []
        for page_data in pages_text:
            page_chunks = self.splitter.split_text(page_data["text"])
            for chunk in page_chunks:
                chunks.append(Document(
                    page_content=chunk,
                    metadata={"page": page_data["page"]},
                ))
        print(f"✅ Chunking done: {len(chunks)} chunks in total")
        return chunks

    def create_vectorstore(self, chunks):
        """Create the vector database."""
        print("\n🗄️  Creating vector database...")
        db = Chroma.from_documents(
            documents=chunks,
            embedding=self.embeddings,
            persist_directory=self.output_dir,
        )
        print("✅ Vector database created")
        print(f"📁 Storage location: {self.output_dir}")
        return db

    def run(self):
        """Run the full pipeline."""
        print("=" * 60)
        print("RAG data processing pipeline")
        print("=" * 60)

        # 1. Extract text
        pages_text = self.extract_text_from_pdf()
        # 2. Chunk
        chunks = self.split_text(pages_text)
        # 3. Build the vector store
        db = self.create_vectorstore(chunks)

        # 4. Test retrieval
        print("\n🔍 Testing retrieval...")
        query = "什么是人工智能"
        results = db.similarity_search(query, k=2)
        print(f"\nQuery: '{query}'")
        print("\nResults:")
        for i, doc in enumerate(results, 1):
            print(f"\n[{i}] (page {doc.metadata['page']})")
            print(doc.page_content)

        print("\n" + "=" * 60)
        print("✅ Pipeline finished!")
        print("=" * 60)
        return db


# Usage example
if __name__ == "__main__":
    # Replace with your PDF file
    pdf_file = "sample.pdf"
    if os.path.exists(pdf_file):
        pipeline = RAGDataPipeline(pdf_file)
        db = pipeline.run()
    else:
        print(f"❌ File not found: {pdf_file}")
        print("Put the PDF file in the current directory and try again")
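Conceptually, a similarity search scores every stored chunk against the query vector and returns the best k. The brute-force sketch below shows that idea with toy vectors and page metadata; it is not how Chroma is implemented (real vector stores use indexes to avoid scanning everything), and `top_k` and the sample `index` are hypothetical.

```python
import heapq
import math

def top_k(query_vec, indexed_chunks, k=2):
    """Return (score, chunk) pairs for the k chunks most similar to the query."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm else 0.0

    scored = ((cosine(query_vec, c["vector"]), c) for c in indexed_chunks)
    return heapq.nlargest(k, scored, key=lambda pair: pair[0])

# Toy index: each entry keeps its text, page metadata, and a made-up vector
index = [
    {"page": 1, "text": "人工智能是计算机科学的一个分支。", "vector": [0.9, 0.1, 0.0]},
    {"page": 2, "text": "今天天气不错。", "vector": [0.0, 0.2, 0.9]},
    {"page": 3, "text": "机器学习是人工智能的核心技术之一。", "vector": [0.7, 0.3, 0.1]},
]

results = top_k([1.0, 0.0, 0.0], index, k=2)
for score, chunk in results:
    print(f"page {chunk['page']}  score={score:.4f}  {chunk['text']}")
```

Keeping the page number alongside each vector is what lets the pipeline report where a retrieved passage came from.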

6. Common Problems

6.1 Garbled Text from PDF Parsing

Cause: font encoding issues

Solution:

# Try pdfplumber instead
import pdfplumber

with pdfplumber.open("file.pdf") as pdf:
    text = pdf.pages[0].extract_text()
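To decide automatically when a fallback parser is worth trying, one option is a rough heuristic: measure how much of the extracted text consists of expected characters. The sketch below is one such heuristic under stated assumptions; the 0.8 threshold is arbitrary, and the names `readable_ratio` and `looks_garbled` are hypothetical.

```python
import string

# ASCII punctuation plus common full-width Chinese punctuation
COMMON_PUNCT = set(string.punctuation) | set("，。！？；：、“”‘’（）《》")

def readable_ratio(text):
    """Share of non-space characters that are CJK ideographs, ASCII
    letters or digits, or common punctuation."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return 0.0

    def is_expected(ch):
        return (
            "\u4e00" <= ch <= "\u9fff"
            or (ch.isascii() and ch.isalnum())
            or ch in COMMON_PUNCT
        )

    return sum(is_expected(ch) for ch in chars) / len(chars)

def looks_garbled(text, threshold=0.8):
    """Flag text whose readable share falls below the (arbitrary) threshold."""
    return readable_ratio(text) < threshold

good = "人工智能是计算机科学的一个分支。"
bad = "\ufffd\ufffd\ufffd äº¥ \ufffd"
print(looks_garbled(good), looks_garbled(bad))
```

An empty extraction also counts as garbled here, which is usually what you want for scanned pages that need OCR.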

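6.2 Chunks Lose Semantic Coherence

Solutions:

  1. Increase chunk_overlap (10-20% of chunk_size is a reasonable range)
  2. Use finer-grained separators
  3. Post-process by merging related chunks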
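The third option can be sketched in its simplest form: fold very short fragments back into the chunk before them. This version judges "related" purely by length, which is a simplifying assumption; a semantic merge would compare embeddings of neighbouring chunks instead. The name `merge_short_chunks` and the size limits are hypothetical.

```python
def merge_short_chunks(chunks, min_size=30, max_size=200):
    """Append chunks shorter than min_size to the previous chunk
    when the merged result still fits within max_size."""
    merged = []
    for chunk in chunks:
        fits = merged and len(merged[-1]) + len(chunk) <= max_size
        if fits and len(chunk) < min_size:
            merged[-1] += chunk
        else:
            merged.append(chunk)
    return merged

chunks = [
    "机器学习是人工智能的核心技术之一。",
    "详见第三章。",  # short fragment that belongs with the sentence before it
    "深度学习是机器学习的重要分支。",
]
merged = merge_short_chunks(chunks, min_size=10, max_size=50)
for i, chunk in enumerate(merged, 1):
    print(i, len(chunk), chunk)
```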

6.3 Embedding Model Loads Slowly

Solution:

# The first load downloads the model (slow)
# Later loads use the local cache (fast)
# To pre-download manually:
from sentence_transformers import SentenceTransformer

SentenceTransformer('shibing624/text2vec-base-chinese')

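7. Exercises

Exercise 1: PDF Parsing

  1. Find a Chinese PDF document
  2. Extract its text and tables with pdfplumber
  3. Save them as TXT and CSV

Exercise 2: Chunk Tuning

  1. Try different chunk_size values (100/200/500)
  2. Compare the retrieval results
  3. Find the best parameters for your documents

Exercise 3: The Full Pipeline

  1. Run data-pipeline.py
  2. Test it with your own PDF
  3. Improve the retrieval results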

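8. Summary

8.1 Key Points

| Component | Recommended choice | Key parameter |
|---|---|---|
| PDF parsing | pdfplumber | - |
| Text chunking | RecursiveCharacterTextSplitter | chunk_size=200 |
| Embedding | text2vec-base-chinese | 768 dimensions |
| Vector store | Chroma | Persistent storage |

8.2 Best Practices

  1. PDF parsing: prefer pdfplumber, which handles Chinese better
  2. Chunking: 200 characters with a 20-character overlap
  3. Embedding: use the text2vec family for Chinese
  4. Batching: batch encoding improves performance

Next: RAG from Beginner to Practice (Part 4): Retrieval Optimization and Rerank Techniques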


Last updated: 2026-04-03
Code verification environment: Python 3.9.18, pypdf 3.17.0, sentence-transformers 2.2.2
