JSFrameworkTextSplitter Class — langchain Architecture
Architecture documentation for the JSFrameworkTextSplitter class in jsx.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD 1ab21219_4509_1529_5d8c_68eef426fddc["JSFrameworkTextSplitter"] bb2ab62d_6801_f254_be09_f34b99245781["RecursiveCharacterTextSplitter"] 1ab21219_4509_1529_5d8c_68eef426fddc -->|extends| bb2ab62d_6801_f254_be09_f34b99245781 ec07a042_d848_6245_6d7d_ce7e30d34fd5["jsx.py"] 1ab21219_4509_1529_5d8c_68eef426fddc -->|defined in| ec07a042_d848_6245_6d7d_ce7e30d34fd5 567f4ba3_fb63_f0fd_04b6_e4012f63ea1d["__init__()"] 1ab21219_4509_1529_5d8c_68eef426fddc -->|method| 567f4ba3_fb63_f0fd_04b6_e4012f63ea1d 5a1c72d3_bbc1_f993_4ee6_a14531b0f159["split_text()"] 1ab21219_4509_1529_5d8c_68eef426fddc -->|method| 5a1c72d3_bbc1_f993_4ee6_a14531b0f159
Relationship Graph
Source Code
libs/text-splitters/langchain_text_splitters/jsx.py lines 9–102
class JSFrameworkTextSplitter(RecursiveCharacterTextSplitter):
"""Text splitter that handles React (JSX), Vue, and Svelte code.
This splitter extends `RecursiveCharacterTextSplitter` to handle React (JSX), Vue,
and Svelte code by:
1. Detecting and extracting custom component tags from the text
2. Using those tags as additional separators along with standard JS syntax
The splitter combines:
* Custom component tags as separators (e.g. `<Component`, `<div`)
* JavaScript syntax elements (function, const, if, etc)
* Standard text splitting on newlines
This allows chunks to break at natural boundaries in React, Vue, and Svelte
component code.
"""
def __init__(
self,
separators: list[str] | None = None,
chunk_size: int = 2000,
chunk_overlap: int = 0,
**kwargs: Any,
) -> None:
"""Initialize the JS Framework text splitter.
Args:
separators: Optional list of custom separator strings to use
chunk_size: Maximum size of chunks to return
chunk_overlap: Overlap in characters between chunks
**kwargs: Additional arguments to pass to parent class
"""
super().__init__(chunk_size=chunk_size, chunk_overlap=chunk_overlap, **kwargs)
self._separators = separators or []
def split_text(self, text: str) -> list[str]:
"""Split text into chunks.
This method splits the text into chunks by:
* Extracting unique opening component tags using regex
* Creating separators list with extracted tags and JS separators
* Splitting the text using the separators by calling the parent class method
Args:
text: String containing code to split
Returns:
List of text chunks split on component and JS boundaries
"""
# Extract unique opening component tags using regex
# Regex to match opening tags, excluding self-closing tags
opening_tags = re.findall(r"<\s*([a-zA-Z0-9]+)[^>]*>", text)
component_tags = []
for tag in opening_tags:
if tag not in component_tags:
component_tags.append(tag)
component_separators = [f"<{tag}" for tag in component_tags]
js_separators = [
"\nexport ",
" export ",
"\nfunction ",
"\nasync function ",
" async function ",
"\nconst ",
"\nlet ",
"\nvar ",
"\nclass ",
" class ",
"\nif ",
" if ",
"\nfor ",
" for ",
"\nwhile ",
" while ",
"\nswitch ",
" switch ",
Extends
Source
Frequently Asked Questions
What is the JSFrameworkTextSplitter class?
JSFrameworkTextSplitter is a class in the langchain codebase, defined in libs/text-splitters/langchain_text_splitters/jsx.py.
Where is JSFrameworkTextSplitter defined?
JSFrameworkTextSplitter is defined in libs/text-splitters/langchain_text_splitters/jsx.py at line 9.
What does JSFrameworkTextSplitter extend?
JSFrameworkTextSplitter extends RecursiveCharacterTextSplitter.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free