test_character_text_splitter_discard_regex_separator_on_merge() — langchain Function Reference
Architecture documentation for the test_character_text_splitter_discard_regex_separator_on_merge() function in test_text_splitters.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD a596bf61_a33e_1dc4_4de6_7819ff491f27["test_character_text_splitter_discard_regex_separator_on_merge()"] 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135["test_text_splitters.py"] a596bf61_a33e_1dc4_4de6_7819ff491f27 -->|defined in| 6d6b8ad4_1cfe_fbb0_e58e_76a50487c135 style a596bf61_a33e_1dc4_4de6_7819ff491f27 fill:#6366f1,stroke:#818cf8,color:#fff
Relationship Graph
Source Code
libs/text-splitters/tests/unit_tests/test_text_splitters.py lines 4050–4061
def test_character_text_splitter_discard_regex_separator_on_merge() -> None:
"""Test that regex lookahead separator is not re-inserted when merging."""
text = "SCE191 First chunk. SCE103 Second chunk."
splitter = CharacterTextSplitter(
separator=r"(?=SCE\d{3})",
is_separator_regex=True,
chunk_size=200,
chunk_overlap=0,
keep_separator=False,
)
output = splitter.split_text(text)
assert output == ["SCE191 First chunk. SCE103 Second chunk."]
Domain
Subdomains
Source
Frequently Asked Questions
What does test_character_text_splitter_discard_regex_separator_on_merge() do?
test_character_text_splitter_discard_regex_separator_on_merge() is a function in the langchain codebase, defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py.
Where is test_character_text_splitter_discard_regex_separator_on_merge() defined?
test_character_text_splitter_discard_regex_separator_on_merge() is defined in libs/text-splitters/tests/unit_tests/test_text_splitters.py at line 4050.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free