Retriever drop-in per pipeline RAG LangChain con supporto completo al tracciamento.
SeiznRetriever è un sostituto drop-in per qualsiasi retriever LangChain. Fornisce ricerca vettoriale con tracciamento, caching e reranking integrati.
Installa l'SDK Seizn insieme a LangChain.
# TypeScript / JavaScript
npm install seizn @langchain/core
# Python
pip install seizn langchainCrea una chain RAG con SeiznRetriever in poche righe di codice.
import { SeiznRetriever } from 'seizn/langchain';
import { ChatOpenAI } from '@langchain/openai';
import { createRetrievalChain } from 'langchain/chains/retrieval';
import { createStuffDocumentsChain } from 'langchain/chains/combine_documents';
import { ChatPromptTemplate } from '@langchain/core/prompts';
// Initialize the Seizn retriever
const retriever = new SeiznRetriever({
apiKey: process.env.SEIZN_API_KEY,
dataset: 'my-docs',
topK: 5,
threshold: 0.7,
});
// Create a RAG chain
const llm = new ChatOpenAI({ model: 'gpt-4' });
const prompt = ChatPromptTemplate.fromTemplate(`
Answer the question based on the following context:
{context}
Question: {input}
`);
const documentChain = await createStuffDocumentsChain({ llm, prompt });
const retrievalChain = await createRetrievalChain({
combineDocsChain: documentChain,
retriever,
});
// Run the chain
const response = await retrievalChain.invoke({
input: 'How do I configure rate limiting?',
});
console.log(response.answer);
// Trace ID available for debugging
console.log('Trace:', response.seiznTrace);import os
from seizn.langchain import SeiznRetriever
from langchain_openai import ChatOpenAI
from langchain.chains import create_retrieval_chain
from langchain.chains.combine_documents import create_stuff_documents_chain
from langchain_core.prompts import ChatPromptTemplate
# Initialize the Seizn retriever
retriever = SeiznRetriever(
api_key=os.environ["SEIZN_API_KEY"],
dataset="my-docs",
top_k=5,
threshold=0.7,
)
# Create a RAG chain
llm = ChatOpenAI(model="gpt-4")
prompt = ChatPromptTemplate.from_template("""
Answer the question based on the following context:
{context}
Question: {input}
""")
document_chain = create_stuff_documents_chain(llm, prompt)
retrieval_chain = create_retrieval_chain(retriever, document_chain)
# Run the chain
response = retrieval_chain.invoke({
"input": "How do I configure rate limiting?"
})
print(response["answer"])
# Trace ID available for debugging
print("Trace:", response.get("seizn_trace"))Riduci latenza e costi memorizzando le query ripetute.
const retriever = new SeiznRetriever({
apiKey: process.env.SEIZN_API_KEY,
dataset: 'my-docs',
cache: {
enabled: true,
ttl: 3600, // 1 hour
},
});Migliora la qualità dei risultati con il reranking cross-encoder.
const retriever = new SeiznRetriever({
apiKey: process.env.SEIZN_API_KEY,
dataset: 'my-docs',
rerank: {
enabled: true,
model: 'cohere-rerank-v3',
topN: 3,
},
});Filtra i risultati per campi metadati prima della ricerca vettoriale.
const retriever = new SeiznRetriever({
apiKey: process.env.SEIZN_API_KEY,
dataset: 'my-docs',
filter: {
category: 'api-docs',
language: 'en',
},
});| Errore | Causa | Soluzione |
|---|---|---|
SEIZN_AUTH_ERROR | Chiave API non valida o mancante | Verifica la variabile d'ambiente SEIZN_API_KEY |
SEIZN_RATE_LIMIT | Troppe richieste al secondo | Implementa backoff esponenziale o aggiorna il piano |
Empty results | Soglia troppo alta o nessun documento corrispondente | Abbassa la soglia o controlla i contenuti del dataset |