test_md_header_text_splitter_3() — langchain Function Reference
Architecture documentation for the test_md_header_text_splitter_3() function in test_text_splitters.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD 876a9f52_828c_8bd2_9bfb_7b7fc1c9e740["test_md_header_text_splitter_3()"] 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"] 876a9f52_828c_8bd2_9bfb_7b7fc1c9e740 -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135 style 876a9f52_828c_8bd2_9bfb_7b7fc1c9e740 fill:#6366f1,stroke:#818cf8,color:#fff
Relationship Graph
Source Code
libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 1330–1381
def test_md_header_text_splitter_3() -> None:
"""Test markdown splitter by header: Case 3."""
markdown_document = (
"# Foo\n\n"
" ## Bar\n\n"
"Hi this is Jim\n\n"
"Hi this is Joe\n\n"
" ### Boo \n\n"
" Hi this is Lance \n\n"
" #### Bim \n\n"
" Hi this is John \n\n"
" ## Baz\n\n"
" Hi this is Molly"
)
headers_to_split_on = [
("#", "Header 1"),
("##", "Header 2"),
("###", "Header 3"),
("####", "Header 4"),
]
markdown_splitter = MarkdownHeaderTextSplitter(
headers_to_split_on=headers_to_split_on,
)
output = markdown_splitter.split_text(markdown_document)
expected_output = [
Document(
page_content="Hi this is Jim \nHi this is Joe",
metadata={"Header 1": "Foo", "Header 2": "Bar"},
),
Document(
page_content="Hi this is Lance",
metadata={"Header 1": "Foo", "Header 2": "Bar", "Header 3": "Boo"},
),
Document(
page_content="Hi this is John",
metadata={
"Header 1": "Foo",
"Header 2": "Bar",
"Header 3": "Boo",
"Header 4": "Bim",
},
),
Document(
page_content="Hi this is Molly",
metadata={"Header 1": "Foo", "Header 2": "Baz"},
),
]
assert output == expected_output
Domain
Subdomains
Source
Frequently Asked Questions
What does test_md_header_text_splitter_3() do?
test_md_header_text_splitter_3() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_md_header_text_splitter_3() defined?
test_md_header_text_splitter_3() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 1330.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free