URL Categorization Database visualization

The Definitive URL Categorization Database

Fuel your platforms with a constantly updated database of 25 million domains, enriched with IAB categories, company firmographics, technology stacks, and more. The foundational data layer for modern AdTech, Cybersecurity, and Market Intelligence.


Data That Drives Decisions

One comprehensive database to power a universe of applications. Accurate, deep, and always fresh, our data provides the context you need to build smarter, safer, and more effective products.

Comprehensive Domain Categorization
Comprehensive Categorization

Leverage dual-taxonomy classification with both IAB Content Taxonomy for advertising and a multi-level Web Filtering Taxonomy for security and compliance.

Unmatched Web Coverage
Unmatched Coverage & Freshness

Our database covers over 99% of active internet usage, sourced from the Google Chrome UX Report, ensuring you have data on the domains that truly matter. Updated weekly.

Rich Domain Intelligence Data
Rich Domain Intelligence

Go beyond categories with invaluable metadata: company details, 3000+ web technologies, domain age, popularity rank, country, and inferred user personas.

Our Data in Numbers

The scale and quality of our data are unmatched, providing the bedrock for industry-leading platforms.

Go Beyond the URL: A Multi-Layered Data Approach

Every domain in our database is a rich tapestry of information. We provide multiple layers of data to give you the most complete picture for any application.

Dual-Taxonomy Categorization

We classify every domain against two leading industry standards: the IAB Tech Lab Content Taxonomy for contextual ad targeting and a granular, multi-tiered Web Filtering taxonomy for security and parental controls. This dual approach provides maximum flexibility and precision, whether you're placing an ad or blocking a threat.

Company & Firmographic Data

Transform domains into business opportunities. With 17 million domains linked to company profiles, you can access firmographic data such as company name, industry, size, and location. Supercharge your lead generation, account-based marketing (ABM), and B2B market research.

Web Technology Stack Tracking

Gain a competitive edge by knowing what technologies a website is built on. We track over 3,000 different web technologies, from CMS and e-commerce platforms to advertising networks and analytics tools. Ideal for sales intelligence, competitive analysis, and tech-based market segmentation.

Popularity & Traffic Insights

Understand a domain's relevance in the real world. By leveraging data from the Google Chrome UX Report and our proprietary OpenPageRank algorithm, we provide metrics that reflect a domain's popularity and authority. This helps you prioritize high-traffic domains and assess a site's influence.

Developer-First API & Delivery

Integrate our data seamlessly into your workflow. We offer a high-performance REST API for real-time lookups and bulk delivery via downloadable CSV files or direct cloud storage sync (S3, GCS). Our clear documentation and flexible delivery options get you up and running in minutes.

Global Coverage & Domain Metadata

Context is everything. Our database includes essential metadata for each domain, including its creation date (domain age), primary country of operation, and TLD information. This data provides crucial signals for fraud detection, SEO analysis, and international market assessment.

Inferred User Persona & Demographics

Understand the audience behind the domain. Using a combination of content analysis and traffic patterns, we provide inferred user personas and demographic profiles for a significant portion of our domains. This is a game-changer for marketers looking to refine audience targeting and product positioning.

Powering the Platforms of Tomorrow

From startups to Fortune 500s, leading companies across multiple industries rely on our data to build innovative products, mitigate risk, and uncover new opportunities. Discover how our URL Categorization Database can revolutionize your business.

Use Case: AdTech & Programmatic Advertising

Precision Targeting, Total Brand Safety

In the fast-paced world of digital advertising, context and safety are paramount. Our database allows ad exchanges, DSPs, and SSPs to classify inventory in real-time, ensuring ads are placed in relevant, brand-safe environments. Move beyond simple keyword blocking to true contextual understanding.

  • Use IAB categories for highly effective contextual targeting campaigns that boost engagement and ROI.

  • Prevent ad placement on sites with inappropriate or harmful content, protecting brand reputation.

  • Enrich bid stream data with domain categories and tech stacks to make smarter, data-driven bidding decisions.

Learn More
AdTech dashboard showing contextual targeting
Cybersecurity threat intelligence map
Use Case: Cybersecurity & Threat Intelligence

Identify Threats Before They Strike

The majority of cyber threats, from phishing to malware distribution, originate from malicious domains. Our database is a critical first line of defense for firewalls, Secure Web Gateways (SWG), and threat intelligence platforms. By providing up-to-date categorization of malicious and high-risk sites, we help you block threats proactively.

  • Classify domains into categories like 'Phishing', 'Malware', 'Spam', and 'Newly Registered Domains' to inform security policies.

  • Integrate our data feed into SIEM and SOAR platforms to enrich alerts and accelerate incident response.

  • Use domain age and popularity metrics as key indicators in your risk scoring algorithms to detect suspicious activity.

Learn More
Use Case: Web & Content Filtering

Create Safe & Productive Online Environments

From K-12 schools to large enterprises, creating a safe and productive internet experience is essential. Our granular Web Filtering taxonomy allows you to build powerful filtering solutions that comply with regulations like CIPA and meet corporate acceptable use policies. Protect users from harmful content while maximizing productivity.

  • Easily block categories such as 'Adult Content', 'Gambling', and 'Violence' to ensure a safe browsing environment.

  • Allow IT administrators to create custom policies, blocking time-wasting sites like social media or streaming services during work hours.

  • Provide a foundational dataset for ISPs and MSPs offering managed filtering services to their customers.

Learn More
Web filtering policy management interface
Sales intelligence platform showing leads
Use Case: Lead Generation & Sales Intelligence

Uncover Your Next Best Customer

Stop prospecting in the dark. Our database links 17 million domains to rich company data, turning the web into your ultimate lead list. Identify companies that fit your ideal customer profile (ICP) based on their industry, size, location, and—most importantly—the technology they use.

  • Find all websites using a competitor's technology (e.g., Shopify, HubSpot) to create targeted outreach campaigns.

  • Filter for companies in a specific industry (e.g., 'Financial Services') and size ('100-500 employees') to build high-quality lead lists.

  • Enrich your existing CRM records with technology stack and company data to improve lead scoring and personalization.

Learn More
Use Case: SEO & Competitor Analysis

Decode Your Competitors' Digital Strategy

Gain an unfair advantage in the SERPs. Our data provides a panoramic view of your competitive landscape. Understand how competitors categorize their content, what technologies they use to drive growth, and where their digital authority comes from. This intelligence is crucial for building a winning SEO and content strategy.

  • Analyze the topical authority of competing domains by looking at their primary and secondary IAB categories.

  • Discover link-building opportunities by identifying high-authority sites (via OpenPageRank) in your niche.

  • Benchmark your technology stack against competitors to find gaps in your marketing, analytics, or sales tools.

Learn More
SEO competitor analysis tool
Market research report with graphs
Use Case: Market Research & Investment Analysis

Track Market Trends at Web-Scale

Our database serves as a powerful economic indicator for the digital world. Analysts, investors, and researchers can track the adoption rate of new technologies, measure the growth of specific market sectors, and identify emerging industry trends long before they become mainstream. It's macro-level insight derived from micro-level data.

  • Measure the market share of competing e-commerce platforms (e.g., Shopify vs. Magento) over time.

  • Identify the fastest-growing web categories to spot new investment opportunities or market shifts.

  • Analyze the technology stacks of recently funded startups to understand where venture capital is flowing.

Learn More
Use Case: Domain Investing & Parking

Maximize Your Domain Portfolio's Value

For domain investors, accurate categorization is the key to monetization. Our database allows you to automatically classify entire portfolios of domains, enabling more effective parking page monetization through contextually relevant ads. It also helps in valuing domains based on their category, popularity, and age.

  • Automatically categorize thousands of domains to optimize ad feeds on parking pages, increasing CTR and revenue.

  • Use domain age, OpenPageRank, and category to build more accurate valuation models for buying and selling.

  • Identify high-value expired domains in lucrative niches by filtering our database for specific keywords and categories.

Learn More
Domain portfolio management dashboard
Brand safety shield protecting a logo
Use Case: Brand Safety & Reputation Management

Protect Your Brand Across the Web

Your brand's reputation is your most valuable asset. Our database helps brand managers and PR agencies monitor where their brand is being mentioned and ensure it's not associated with undesirable content. It's a proactive tool for managing online reputation and mitigating brand risk.

  • Power media monitoring tools to classify the context of brand mentions, separating positive from negative environments.

  • Create inclusion/exclusion lists for programmatic advertising to guarantee ads only appear on pre-vetted, safe domains.

  • Identify and track counterfeit or brand-jacking websites by monitoring newly registered domains related to your brand.

Learn More
Use Case: Data Enrichment Services

Add a Layer of Context to Your Data

If your product works with lists of domains or companies, our database is the perfect enrichment solution. Whether you have a CRM, a marketing automation platform, or a custom analytics solution, you can use our API to append valuable context—like industry, tech stack, and content category—to your existing data.

  • Enrich user profiles in your CDP with domain categories based on their browsing history.

  • Append firmographic and technology data to company domains in your CRM to create richer, more actionable records.

  • Augment your own data products by licensing our categorization and intelligence layers.

Learn More
Data enrichment workflow diagram
Academic research showing network graphs
Use Case: Academic & Non-Profit Research

A Foundational Dataset for Web Science

Our database provides a structured, large-scale snapshot of the web, making it an invaluable resource for researchers in computer science, communications, sociology, and economics. Study the internet's structure, the spread of information, and the evolution of online discourse with a reliable, longitudinal dataset.

  • Analyze the topical interconnectivity of the web by creating graphs based on domain categories.

  • Track the prevalence of certain types of content (e.g., 'Misinformation') over time.

  • Study the global distribution of web technologies and their correlation with economic factors.

Learn More
Use Case: Content Delivery Networks (CDNs)

Optimize Caching and Routing with Content-Awareness

For CDNs and hosting providers, performance is everything. By understanding the type of content a domain serves, you can make smarter decisions about caching strategies and traffic routing. Our database provides the content-awareness needed to squeeze out every last millisecond of performance.

  • Apply different caching policies based on content category (e.g., more aggressive caching for 'News' sites vs. 'E-commerce').

  • Prioritize and route traffic for high-value categories like 'Financial Services' or 'Healthcare' over dedicated, low-latency networks.

  • Enhance security offerings by integrating our threat categories directly into your CDN's WAF and bot management solutions.

Learn More
Global CDN network map with optimized routes

Frequently Asked Questions

Have questions? We’ve got answers. Here are some of the most common questions we receive about our URL Categorization Database.

Our database is built upon a multi-source approach to ensure maximum coverage and accuracy. The core of our database, especially for determining active web usage, is the Google Chrome User Experience (CrUX) Report, which includes data from 18 million of the most visited websites, covering over 99% of web traffic. We supplement this with data from our own proprietary web crawlers, domain registration feeds, and partnerships with data providers. This combination allows us to have both depth on popular sites and breadth across the entire domain landscape.

The entire database is refreshed on a weekly basis. This includes re-crawling and re-analyzing domains to detect changes in content, technology stacks, or threat status. Newly registered domains and data from the latest CrUX report are incorporated during this weekly cycle. For our threat intelligence customers, we offer an optional daily feed of newly identified malicious domains to provide more immediate protection.

Accuracy is our top priority. We use a sophisticated, multi-stage process. It begins with machine learning models trained on a massive, human-verified dataset. These models analyze page content, metadata, link structure, and other signals. The initial ML-driven classification is then passed through a rules-based engine for validation and refinement. Finally, we have a team of human analysts who continually review and audit the classifications, with a particular focus on ambiguous or sensitive categories. This human-in-the-loop system allows us to achieve over 98% accuracy across our primary categories.

We offer flexible delivery options to suit your needs. For bulk data access, the most common format is a compressed CSV file, delivered via a secure download link or synced directly to your cloud storage bucket (Amazon S3, Google Cloud Storage, or Azure Blob Storage). For real-time lookups, we provide a high-availability REST API that returns data in a clean JSON format. We can also accommodate custom formats or delivery mechanisms for enterprise clients.

Absolutely. We encourage you to evaluate our data quality firsthand. We offer a generous free sample of the database, which includes 100,000 domains with all associated data points (categories, tech stack, company info, etc.). This sample provides a representative cross-section of our full database. You can request your free sample directly from our website, and it will be delivered to you instantly.

The two taxonomies serve different purposes. The IAB Tech Lab Content Taxonomy is the industry standard for digital advertising; its categories are designed to describe the content of a page for contextual ad targeting (e.g., 'Automotive', 'Sports', 'Healthy Living'). The Web Filtering Taxonomy is designed for security and policy enforcement; its categories focus on identifying potentially harmful, inappropriate, or unproductive content (e.g., 'Adult Content', 'Phishing', 'Social Networking', 'Gambling'). By providing both, we support a wider range of use cases from a single data source.