langchain-data-handling

Featured

Implement LangChain RAG pipelines with document loaders, text splitters, embeddings, and vector stores (Chroma, Pinecone, FAISS). Trigger: "langchain RAG", "langchain documents", "langchain vector store", "langchain embeddings", "document loaders", "text splitters", "retrieval".

AI & Automation 2,274 stars 319 forks Updated today MIT

Install

View on GitHub

Quality Score: 99/100

Stars 20%

100

Recency 20%

100

Frontmatter 20%

Documentation 15%

100

Issue Health 10%

License 10%

100

Description 5%

100

Skill Content

# LangChain Data Handling: RAG & Document Processing ## Overview Build Retrieval-Augmented Generation (RAG) pipelines: load documents, split into chunks, embed with OpenAI/Cohere, store in vector databases (FAISS, Chroma, Pinecone), and query with retrieval chains. ## Prerequisites - `@langchain/core`, `@langchain/openai` installed - For vector stores: `npm install @langchain/community` (FAISS) or `npm install @langchain/pinecone @pinecone-database/pinecone` ## Step 1: Document Loaders ```typescript import { TextLoader } from "langchain/document_loaders/fs/text"; import { PDFLoader } from "@langchain/community/document_loaders/fs/pdf"; import { DirectoryLoader } from "langchain/document_loaders/fs/directory"; import { CSVLoader } from "@langchain/community/document_loaders/fs/csv"; // Load a single file const textDocs = await new TextLoader("./data/readme.md").load(); const pdfDocs = await new PDFLoader("./data/manual.pdf").load(); // Load entire directory with type-based routing const dirLoader = new DirectoryLoader("./data/", { ".txt": (path) => new TextLoader(path), ".pdf": (path) => new PDFLoader(path), ".csv": (path) => new CSVLoader(path), }); const allDocs = await dirLoader.load(); console.log(`Loaded ${allDocs.length} documents`); ``` ## Step 2: Text Splitting ```typescript import { RecursiveCharacterTextSplitter } from "@langchain/textsplitters"; const splitter = new RecursiveCharacterTextSplitter({ chunkSize: 1000, // max chars per chunk c...

Details

Author: jeremylongshore
Repository: jeremylongshore/claude-code-plugins-plus-skills
Created: 7 months ago
Last Updated: today
Language: Python
License: MIT

Integrates with

OpenAI · AI Anthropic · AI LangChain · AI Pinecone · Database npm · DevTools

Similar Skills

Semantically similar based on skill content — not just same category

AI & Automation Solid

langchain4j-rag-implementation-patterns

Provides Retrieval-Augmented Generation (RAG) implementation patterns with LangChain4j for Java. Generates document ingestion pipelines, embedding stores, vector search, and semantic search capabilities. Use when building chat-with-documents systems, document Q&A over PDFs or text files, AI assistants with knowledge bases, semantic search over document repositories, or knowledge-enhanced AI applications with source attribution.

263 Updated 1 weeks ago

giuseppe-trisciuoglio

AI & Automation Listed

langchain

Build LLM applications with LangChain and LangGraph. Use when creating RAG pipelines, agent workflows, chains, or complex LLM orchestration. Triggers on LangChain, LangGraph, LCEL, RAG, retrieval, agent chain.

2 Updated today

Makiya1202

AI & Automation Solid

langchain-retriever

LangChain retriever implementation with various retrieval strategies for RAG applications

1,160 Updated today

a5c-ai

AI & Automation Solid

rag-architect

Designs and implements production-grade RAG systems by chunking documents, generating embeddings, configuring vector stores, building hybrid search pipelines, applying reranking, and evaluating retrieval quality. Use when building RAG systems, vector databases, or knowledge-grounded AI applications requiring semantic search, document retrieval, context augmentation, similarity search, or embedding-based indexing.

9,537 Updated 1 weeks ago

Jeffallan

AI & Automation Solid

rag

Implements document chunking, embedding generation, vector storage, and retrieval pipelines for Retrieval-Augmented Generation systems. Use when building RAG applications, creating document Q&A systems, or integrating AI with knowledge bases.

263 Updated 1 weeks ago

giuseppe-trisciuoglio