URL: The complete web address of a page, for example, http://yellow.car.com/model/2009
Scheme: The protocol used, such as HTTP or HTTPS.
Subdomain: An optional prefix to the domain,for example, yellow
Domain: The main address of the website,for example, car.com
Top-Level Domain (TLD): The suffix indicating the domain type, such as ".com" or
".org".
Loading Data:
You can load your data into the textbox by typing, pasting your data or loading it from a csv file.
To load data from a CSV file, follow these steps:
Select Data Source: Choose "CSV" from the dropdown labeled "Data Source".
Load CSV File: Click the "Load File" button and select the CSV file you want to load data from.
Select Column (Optional): Once the file is loaded, the dropdown menu will display the column headers from the CSV file. If there are no headers, default names like "Column 1," "Column 2," etc., will be used.
Example: Let's say you have a CSV file with two columns and headers (URLs and Descriptions). After loading the file, the dropdown menu will show "URL" and Descriptions"as selectable columns.
Clearing Data:
Click the "Clear" button to remove all data from the textbox.
Functions:
Trim to Root: Remove any path and parameters from the URL, retaining the subdomain (if present). For example: Trimming the URL 'http://abc.ilovepumpkin.com/yellow/ripe' to its root will result in 'http://abc.ilovepumpkin.com'.
Trim to Root [Remove Subdomain]: Remove any subdomain and path parameters from the URL, retaining only the root domain. For example: Trimming the URL 'http://abc.ilovepumpkin.com/yellow/ripe' to its root without the subdomain will result in 'http://ilovepumpkin.com'.
Remove Duplicates: Remove duplicate words or URLs from the list. For example: If you have two identical URLs in your list, such as "http://example.com" and "http://example.com", this function will retain only one instance.
Remove Duplicate Domain: Remove all occurrences of URLs that have the same domain, leaving only the first one. For example: If you have multiple URLs with the same domain, such as "http://example.com/page1" and "http://example.com/page2", only the first URL will be retained.
Add/Remove Scheme: Adds the scheme (HTTP) to URLs without one; removes schemes if all URLs have them.
Toggle Scheme: Switch between HTTP, HTTPS, and HTTP with WWW for the URLs (HTTP://WWW).
Remove URLs with TLD: Remove URLs containing a specified Top-Level Domain (TLD), such as ".com" or ".org."
Exclude URLs Containing Word:Remove URLs containing a specified word.