Home / Function/ test_md_header_text_splitter_2() — langchain Function Reference

test_md_header_text_splitter_2() — langchain Function Reference

Architecture documentation for the test_md_header_text_splitter_2() function in test_text_splitters.py from the langchain codebase.

Entity Profile

Dependency Diagram

graph TD
  4cd52b0a_ab31_13de_2e8d_97b6aaf4417a["test_md_header_text_splitter_2()"]
  6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"]
  4cd52b0a_ab31_13de_2e8d_97b6aaf4417a -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135
  style 4cd52b0a_ab31_13de_2e8d_97b6aaf4417a fill:#6366f1,stroke:#818cf8,color:#fff

Relationship Graph

Source Code

libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 1291–1327

def test_md_header_text_splitter_2() -> None:
    """Test markdown splitter by header: Case 2."""
    markdown_document = (
        "# Foo\n\n"
        "    ## Bar\n\n"
        "Hi this is Jim\n\n"
        "Hi this is Joe\n\n"
        " ### Boo \n\n"
        " Hi this is Lance \n\n"
        " ## Baz\n\n"
        " Hi this is Molly"
    )

    headers_to_split_on = [
        ("#", "Header 1"),
        ("##", "Header 2"),
        ("###", "Header 3"),
    ]
    markdown_splitter = MarkdownHeaderTextSplitter(
        headers_to_split_on=headers_to_split_on,
    )
    output = markdown_splitter.split_text(markdown_document)
    expected_output = [
        Document(
            page_content="Hi this is Jim  \nHi this is Joe",
            metadata={"Header 1": "Foo", "Header 2": "Bar"},
        ),
        Document(
            page_content="Hi this is Lance",
            metadata={"Header 1": "Foo", "Header 2": "Bar", "Header 3": "Boo"},
        ),
        Document(
            page_content="Hi this is Molly",
            metadata={"Header 1": "Foo", "Header 2": "Baz"},
        ),
    ]
    assert output == expected_output

Domain

Subdomains

Frequently Asked Questions

What does test_md_header_text_splitter_2() do?
test_md_header_text_splitter_2() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_md_header_text_splitter_2() defined?
test_md_header_text_splitter_2() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 1291.

Analyze Your Own Codebase

Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.

Try Supermodel Free