Home / File/ retrievers.py — langchain Source File

retrievers.py — langchain Source File

Architecture documentation for retrievers.py, a python file in the langchain codebase. 6 imports, 0 dependents.

Entity Profile

Dependency Diagram

graph LR
  3dc07534_4362_fcff_0fb8_62009ef45c51["retrievers.py"]
  8e2034b7_ceb8_963f_29fc_2ea6b50ef9b3["typing"]
  3dc07534_4362_fcff_0fb8_62009ef45c51 --> 8e2034b7_ceb8_963f_29fc_2ea6b50ef9b3
  f3bc7443_c889_119d_0744_aacc3620d8d2["langchain_core.callbacks"]
  3dc07534_4362_fcff_0fb8_62009ef45c51 --> f3bc7443_c889_119d_0744_aacc3620d8d2
  c554676d_b731_47b2_a98f_c1c2d537c0aa["langchain_core.documents"]
  3dc07534_4362_fcff_0fb8_62009ef45c51 --> c554676d_b731_47b2_a98f_c1c2d537c0aa
  38bc5323_3713_7377_32f8_091293bea54b["langchain_core.retrievers"]
  3dc07534_4362_fcff_0fb8_62009ef45c51 --> 38bc5323_3713_7377_32f8_091293bea54b
  6e58aaea_f08e_c099_3cc7_f9567bfb1ae7["pydantic"]
  3dc07534_4362_fcff_0fb8_62009ef45c51 --> 6e58aaea_f08e_c099_3cc7_f9567bfb1ae7
  706126c9_5306_8b22_9189_e911bde5257c["langchain_perplexity._utils"]
  3dc07534_4362_fcff_0fb8_62009ef45c51 --> 706126c9_5306_8b22_9189_e911bde5257c
  style 3dc07534_4362_fcff_0fb8_62009ef45c51 fill:#6366f1,stroke:#818cf8,color:#fff

Relationship Graph

Source Code

from __future__ import annotations

from typing import Any, Literal

from langchain_core.callbacks import CallbackManagerForRetrieverRun
from langchain_core.documents import Document
from langchain_core.retrievers import BaseRetriever
from pydantic import Field, SecretStr, model_validator

from langchain_perplexity._utils import initialize_client


class PerplexitySearchRetriever(BaseRetriever):
    """Perplexity Search retriever."""

    k: int = Field(default=10, description="Max results (1-20)")
    max_tokens: int = Field(default=25000, description="Max tokens across all results")
    max_tokens_per_page: int = Field(default=1024, description="Max tokens per page")
    country: str | None = Field(default=None, description="ISO country code")
    search_domain_filter: list[str] | None = Field(
        default=None, description="Domain filter (max 20)"
    )
    search_recency_filter: Literal["day", "week", "month", "year"] | None = None
    search_after_date: str | None = Field(
        default=None, description="Date filter (format: %m/%d/%Y)"
    )
    search_before_date: str | None = Field(
        default=None, description="Date filter (format: %m/%d/%Y)"
    )

    client: Any = Field(default=None, exclude=True)
    pplx_api_key: SecretStr = Field(default=SecretStr(""))

    @model_validator(mode="before")
    @classmethod
    def validate_environment(cls, values: dict) -> Any:
        """Validate the environment."""
        return initialize_client(values)

    def _get_relevant_documents(
        self, query: str, *, run_manager: CallbackManagerForRetrieverRun
    ) -> list[Document]:
        params = {
            "query": query,
            "max_results": self.k,
            "max_tokens": self.max_tokens,
            "max_tokens_per_page": self.max_tokens_per_page,
            "country": self.country,
            "search_domain_filter": self.search_domain_filter,
            "search_recency_filter": self.search_recency_filter,
            "search_after_date": self.search_after_date,
            "search_before_date": self.search_before_date,
        }
        params = {k: v for k, v in params.items() if v is not None}
        response = self.client.search.create(**params)

        return [
            Document(
                page_content=result.snippet,
                metadata={
                    "title": result.title,
                    "url": result.url,
                    "date": result.date,
                    "last_updated": result.last_updated,
                },
            )
            for result in response.results
        ]

Subdomains

Dependencies

  • langchain_core.callbacks
  • langchain_core.documents
  • langchain_core.retrievers
  • langchain_perplexity._utils
  • pydantic
  • typing

Frequently Asked Questions

What does retrievers.py do?
retrievers.py is a source file in the langchain codebase, written in python. It belongs to the CoreAbstractions domain, Serialization subdomain.
What does retrievers.py depend on?
retrievers.py imports 6 module(s): langchain_core.callbacks, langchain_core.documents, langchain_core.retrievers, langchain_perplexity._utils, pydantic, typing.
Where is retrievers.py in the architecture?
retrievers.py is located at libs/partners/perplexity/langchain_perplexity/retrievers.py (domain: CoreAbstractions, subdomain: Serialization, directory: libs/partners/perplexity/langchain_perplexity).

Analyze Your Own Codebase

Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.

Try Supermodel Free