Skip to content

RandomRhythm/LevelUpDomain

Repository files navigation

Level Up Domain

Takes a list of domain names and creates a sampling list by only outputting one subdomain for each primary domain.

This script will parse a provided list of domain names to perform the following:

  • Remove invalid domains
  • Separate IP addresses from the domains
  • Output primary domain name prevalence
  • Output the list of primary domain names
  • Output a domain sampling

Domain names have a top-level domain (TLD) and then subdomains. However, not all subdomains immediately beneath a TLD are unique to an owner/operator. Such a subdomain is known as a public suffix, which is a domain that allows for domain registration beneath it. This script utilizes lists from surbl.org, iana, and internal checks to determine the highest level of a domain structure that is unique to an owner. In other words, subdomains are removed to identify the primary domain name that a registrar would sell for registration. After acquiring the primary domain name, then a sampling of full domain structures can be obtained.

The Level Up Domain script will create the following files:

  • Domain_Sample.txt - Sample of domain names.
  • DomainPrev.csv - Prevalence based on parent domain.
  • LevelUp_Domains.txt - List of primary domain names.
  • IP_Addresses - IP addresses identified in the list of domains.
  • Invalid_Domain_IP.txt - List of items provided that were an invalid IP address or domain name.

Warning! The input file only supports ANSI and UTF-16 LE encoding. Use notepad Save As dialog or other method to change the encoding beforehand.

About

Takes a list of domains and output one unique domain structure for each unique second (or third) level domain

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published