test_experimental_markdown_syntax_text_splitter_split_lines() — langchain Function Reference
Architecture documentation for the test_experimental_markdown_syntax_text_splitter_split_lines() function in test_text_splitters.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD 25791de2_af1e_d801_4246_1fe08c2fa870["test_experimental_markdown_syntax_text_splitter_split_lines()"] 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"] 25791de2_af1e_d801_4246_1fe08c2fa870 -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135 style 25791de2_af1e_d801_4246_1fe08c2fa870 fill:#6366f1,stroke:#818cf8,color:#fff
Relationship Graph
Source Code
libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 1854–1925
def test_experimental_markdown_syntax_text_splitter_split_lines() -> None:
"""Test experimental markdown syntax splitter."""
markdown_splitter = ExperimentalMarkdownSyntaxTextSplitter(return_each_line=True)
output = markdown_splitter.split_text(EXPERIMENTAL_MARKDOWN_DOCUMENT)
expected_output = [
Document(
page_content="Content for header 1", metadata={"Header 1": "My Header 1"}
),
Document(
page_content="Content for header 2",
metadata={"Header 1": "My Header 1", "Header 2": "Header 2"},
),
Document(
page_content="Content for header 3",
metadata={
"Header 1": "My Header 1",
"Header 2": "Header 2",
"Header 3": "Header 3",
},
),
Document(
page_content="This should be tagged with Header 1 and Header 2 Again",
metadata={"Header 1": "My Header 1", "Header 2": "Header 2 Again"},
),
Document(
page_content="```python",
metadata={
"Code": "python",
"Header 1": "My Header 1",
"Header 2": "Header 2 Again",
},
),
Document(
page_content="def func_definition():",
metadata={
"Code": "python",
"Header 1": "My Header 1",
"Header 2": "Header 2 Again",
},
),
Document(
page_content=" print('Keep the whitespace consistent')",
metadata={
"Code": "python",
"Header 1": "My Header 1",
"Header 2": "Header 2 Again",
},
),
Document(
page_content="```",
metadata={
"Code": "python",
"Header 1": "My Header 1",
"Header 2": "Header 2 Again",
},
),
Document(
page_content="We should also split on the horizontal line",
metadata={"Header 1": "Header 1 again"},
),
Document(
page_content="This will be a new doc but with the same header metadata",
metadata={"Header 1": "Header 1 again"},
),
Document(
page_content="And it includes a new paragraph",
metadata={"Header 1": "Header 1 again"},
),
]
assert output == expected_output
Domain
Subdomains
Source
Frequently Asked Questions
What does test_experimental_markdown_syntax_text_splitter_split_lines() do?
test_experimental_markdown_syntax_text_splitter_split_lines() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_experimental_markdown_syntax_text_splitter_split_lines() defined?
test_experimental_markdown_syntax_text_splitter_split_lines() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 1854.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free