test_extract_sub_links_exclude() — langchain Function Reference
Architecture documentation for the test_extract_sub_links_exclude() function in test_html.py from the langchain codebase.
Entity Profile
Dependency Diagram
graph TD 5d8ee210_99c6_98ba_cef9_3542471df142["test_extract_sub_links_exclude()"] 89d29efc_3f6a_dceb_4c69_b201fa975bdd["test_html.py"] 5d8ee210_99c6_98ba_cef9_3542471df142 -->|defined in| 89d29efc_3f6a_dceb_4c69_b201fa975bdd style 5d8ee210_99c6_98ba_cef9_3542471df142 fill:#6366f1,stroke:#818cf8,color:#fff
Relationship Graph
Source Code
libs/core/tests/unit_tests/utils/test_html.py lines 132–158
def test_extract_sub_links_exclude() -> None:
html = (
'<a href="https://foobar.com">one</a>'
'<a href="http://baz.net">two</a>'
'<a href="//foobar.com/hello">three</a>'
'<a href="/how/are/you/doing">four</a>'
'<a href="alexis.html"</a>'
)
expected = sorted(
[
"http://baz.net",
"https://foobar.com",
"https://foobar.com/hello",
"https://foobar.com/hello/alexis.html",
]
)
actual = sorted(
extract_sub_links(
html,
"https://foobar.com/hello/bill.html",
base_url="https://foobar.com",
prevent_outside=False,
exclude_prefixes=("https://foobar.com/how", "http://baz.org"),
)
)
assert actual == expected
Domain
Subdomains
Source
Frequently Asked Questions
What does test_extract_sub_links_exclude() do?
test_extract_sub_links_exclude() is a function in the langchain codebase, defined in libs/core/tests/unit_tests/utils/test_html.py.
Where is test_extract_sub_links_exclude() defined?
test_extract_sub_links_exclude() is defined in libs/core/tests/unit_tests/utils/test_html.py at line 132.
Analyze Your Own Codebase
Get architecture documentation, dependency graphs, and domain analysis for your codebase in minutes.
Try Supermodel Free