Home / Function/ test_html_splitter_with_nested_div_preserved() — langchain Function Reference

test_html_splitter_with_nested_div_preserved() — langchain Function Reference

Architecture documentation for the test_html_splitter_with_nested_div_preserved() function in test_text_splitters.py from the langchain codebase.

Entity Profile

Dependency Diagram

graph TD
  71175d3e_b369_6a34_f559_8b81626211eb["test_html_splitter_with_nested_div_preserved()"]
  6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"]
  71175d3e_b369_6a34_f559_8b81626211eb -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135
  style 71175d3e_b369_6a34_f559_8b81626211eb fill:#6366f1,stroke:#818cf8,color:#fff

Relationship Graph

Source Code

libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 3520–3546

def test_html_splitter_with_nested_div_preserved() -> None:
    """Test HTML splitter preserving nested div elements.

    Nested div elements should be preserved when specified in elements_to_preserve
    """
    html_content = """
    <div>
        <h1>Header</h1>
        <p>outer text</p>
        <div>inner div content</div>
        <p>more outer text</p>
    </div>
    """
    with suppress_langchain_beta_warning():
        splitter = HTMLSemanticPreservingSplitter(
            headers_to_split_on=[("h1", "Header 1")],
            elements_to_preserve=["div"],
            max_chunk_size=1000,
        )
    documents = splitter.split_text(html_content)

    assert len(documents) == 1
    content = documents[0].page_content
    # The inner div content should be preserved
    assert "inner div content" in content
    assert "outer text" in content
    assert "more outer text" in content

Domain

Subdomains

Frequently Asked Questions

What does test_html_splitter_with_nested_div_preserved() do?
test_html_splitter_with_nested_div_preserved() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_html_splitter_with_nested_div_preserved() defined?
test_html_splitter_with_nested_div_preserved() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 3520.

Analyze Your Own Codebase

Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.

Try Supermodel Free