test_md_header_text_splitter_2() — langchain Function Reference
Architecture documentation for the test_md_header_text_splitter_2() function in test_text_splitters.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD 4cd52b0a_ab31_13de_2e8d_97b6aaf4417a["test_md_header_text_splitter_2()"] 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"] 4cd52b0a_ab31_13de_2e8d_97b6aaf4417a -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135 style 4cd52b0a_ab31_13de_2e8d_97b6aaf4417a fill:#6366f1,stroke:#818cf8,color:#fff
Relationship Graph
Source Code
libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 1291–1327
def test_md_header_text_splitter_2() -> None:
"""Test markdown splitter by header: Case 2."""
markdown_document = (
"# Foo\n\n"
" ## Bar\n\n"
"Hi this is Jim\n\n"
"Hi this is Joe\n\n"
" ### Boo \n\n"
" Hi this is Lance \n\n"
" ## Baz\n\n"
" Hi this is Molly"
)
headers_to_split_on = [
("#", "Header 1"),
("##", "Header 2"),
("###", "Header 3"),
]
markdown_splitter = MarkdownHeaderTextSplitter(
headers_to_split_on=headers_to_split_on,
)
output = markdown_splitter.split_text(markdown_document)
expected_output = [
Document(
page_content="Hi this is Jim \nHi this is Joe",
metadata={"Header 1": "Foo", "Header 2": "Bar"},
),
Document(
page_content="Hi this is Lance",
metadata={"Header 1": "Foo", "Header 2": "Bar", "Header 3": "Boo"},
),
Document(
page_content="Hi this is Molly",
metadata={"Header 1": "Foo", "Header 2": "Baz"},
),
]
assert output == expected_output
Domain
Subdomains
Source
Frequently Asked Questions
What does test_md_header_text_splitter_2() do?
test_md_header_text_splitter_2() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_md_header_text_splitter_2() defined?
test_md_header_text_splitter_2() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 1291.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free