test_tokenization() — langchain Function Reference

Architecture documentation for the test_tokenization() function in test_schema.py from the langchain codebase.

Entity Profile

Dependency Diagram

    graph TD
      04ec2c54_5612_70ab_f4f8_451e702a60c4["test_tokenization()"]
      76ce3c98_8b2b_4359_3dd9_9cadd328e8f1["TestTokenCountingWithGPT2Tokenizer"]
      04ec2c54_5612_70ab_f4f8_451e702a60c4 -->|defined in| 76ce3c98_8b2b_4359_3dd9_9cadd328e8f1
      style 04ec2c54_5612_70ab_f4f8_451e702a60c4 fill:#6366f1,stroke:#818cf8,color:#fff

Source Code

libs/langchain/tests/integration_tests/test_schema.py lines 7–9

    def test_tokenization(self) -> None:
        # Check that the tokenization is consistent with the GPT-2 tokenizer
        assert _get_token_ids_default_method("This is a test") == [1212, 318, 257, 1332]
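
The expected token IDs can be reproduced independently with the Hugging Face GPT-2 tokenizer, which is the reference the test checks against. Below is a minimal sketch, assuming the transformers package is available; it is not part of the langchain test suite.

    # Reproduce the expected GPT-2 token IDs independently of langchain.
    # Assumes the `transformers` package is installed (an assumption of this
    # sketch, which is not taken from the repository).
    from transformers import GPT2TokenizerFast

    tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
    assert tokenizer.encode("This is a test") == [1212, 318, 257, 1332]

Because the GPT-2 vocabulary is fixed, these four IDs are stable across tokenizer releases, which is what makes the hard-coded assertion in the test reliable.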

Frequently Asked Questions

What does test_tokenization() do?
test_tokenization() is an integration test in the langchain codebase that asserts _get_token_ids_default_method("This is a test") returns the GPT-2 token IDs [1212, 318, 257, 1332], i.e. that the default token-ID method stays consistent with the GPT-2 tokenizer. A sketch of invoking the method directly follows these questions.
Where is test_tokenization() defined?
test_tokenization() is defined in libs/langchain/tests/integration_tests/test_schema.py at line 7.
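
For reference, the method under test can be invoked directly. The sketch below assumes the import path used by recent langchain-core releases; the exact module may differ in the langchain version this test file targets.

    # Call the default token-ID method directly (a sketch, not from the repo).
    # The import path below is an assumption based on recent langchain-core
    # releases and may differ in older versions of the codebase.
    from langchain_core.language_models.base import _get_token_ids_default_method

    assert _get_token_ids_default_method("This is a test") == [1212, 318, 257, 1332]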
