test_html_code_splitter() — langchain Function Reference
Architecture documentation for the test_html_code_splitter() function in test_text_splitters.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD f6220ce0_d090_afb7_be27_f6ed531e534f["test_html_code_splitter()"] 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"] f6220ce0_d090_afb7_be27_f6ed531e534f -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135 style f6220ce0_d090_afb7_be27_f6ed531e534f fill:#6366f1,stroke:#818cf8,color:#fff
Relationship Graph
Source Code
libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 1224–1257
def test_html_code_splitter() -> None:
splitter = RecursiveCharacterTextSplitter.from_language(
Language.HTML, chunk_size=60, chunk_overlap=0
)
code = """
<h1>Sample Document</h1>
<h2>Section</h2>
<p id="1234">Reference content.</p>
<h2>Lists</h2>
<ul>
<li>Item 1</li>
<li>Item 2</li>
<li>Item 3</li>
</ul>
<h3>A block</h3>
<div class="amazing">
<p>Some text</p>
<p>Some more text</p>
</div>
"""
chunks = splitter.split_text(code)
assert chunks == [
"<h1>Sample Document</h1>\n <h2>Section</h2>",
'<p id="1234">Reference content.</p>',
"<h2>Lists</h2>\n <ul>",
"<li>Item 1</li>\n <li>Item 2</li>",
"<li>Item 3</li>\n </ul>",
"<h3>A block</h3>",
'<div class="amazing">',
"<p>Some text</p>",
"<p>Some more text</p>\n </div>",
]
Domain
Subdomains
Source
Frequently Asked Questions
What does test_html_code_splitter() do?
test_html_code_splitter() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_html_code_splitter() defined?
test_html_code_splitter() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 1224.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free