Skip to contents

Parsing

Parse one or many URLs into structured components.

safe_parse_url()
Parse a URL comprehensively, extracting and deriving all relevant components.
safe_parse_urls()
Parse multiple URLs and return a data.frame of components

Accessors

Extract individual URL components from character vectors.

get_clean_url()
Get cleaned URLs
get_parse_status()
Get the parse status of URLs
get_scheme()
Get URL schemes
get_host()
Get URL hosts
get_domain()
Get domain names
get_tld()
Extract the top-level domain (TLD) from a URL
get_subdomain()
Get URL subdomains
get_path()
Get URL paths
get_query()
Get URL query strings
get_fragment()
Get URL fragments
get_port()
Get URL ports
get_user()
Get URL user names
get_password()
Get URL passwords
get_userinfo()
Get URL userinfo

Joining

Join datasets on canonicalized URL keys.

canonical_join()
Canonical Join of Two URL Sets (Base R Version)

Cache management

Inspect and control the memoization caches.

rurl_cache_info()
Inspect the rurl memoization caches
rurl_cache_config()
Configure the rurl memoization caches
rurl_clear_caches()
Clear all rurl caches