WebsiteCategorizationAPI
Home
Demo Tools - Categorization
Website Categorization Text Classification URL Database Taxonomy Mapper
Demo Tools - Website Intel
Technology Detector Quality Score Competitor Finder
Demo Tools - Brand Safety
Brand Safety Checker Brand Suitability Quality Checker
Demo Tools - Content
Sentiment Analyzer Context Aware Ads
MCP Servers
MCP Real-Time API MCP Database Lookup
AI Agents
Map of Internet for AI Agents 100 Use Cases
Domains By
Domains for your ICP Domains by Vertical Domains by Country Domains by Technologies
Resources
API Documentation Pricing Login
Try Categorization
Option B: Database Lookup MCP Server

MCP Database Lookup Server

Get instant domain-level categorization lookups from a pre-classified database of 30 million or more domains. Connect your AI assistant directly to our database via the Model Context Protocol and retrieve IAB categories, web filtering data, buyer personas, and PageRank scores in milliseconds.

30M+
Pre-Classified Domains
< 50ms
Average Response Time
99%+
Internet Traffic Coverage
100
Domains Per Batch

What Is the Model Context Protocol?

The Model Context Protocol is an open standard developed by Anthropic that enables AI assistants to communicate with external tools, databases, and APIs in a structured, secure way.

The Model Context Protocol, commonly known as MCP, was created to solve a fundamental challenge in the AI ecosystem: how do you give large language models reliable, real-time access to external data and functionality without compromising security or requiring complex custom integrations? Before MCP, every tool integration required bespoke code, API wrappers, and fragile prompt engineering. MCP changes this by providing a universal communication layer between AI applications and the tools they need.

At its core, MCP follows a client-server architecture. The AI application, whether that is Claude Desktop, Claude Code, Cursor, or any other MCP-compatible assistant, acts as the client. The MCP server runs locally on your machine and exposes a set of tools that the AI can discover and call. Communication happens over standard input and output streams using a JSON-based protocol, which means the server process stays contained within your local environment and never needs to be exposed to the internet.

When you ask your AI assistant a question like "What category is stripe.com?", the assistant recognizes that it needs external data. It sends a structured tool call to the MCP server. The server processes that request, connects to the websitecategorizationapi.com Database API, retrieves the categorization data, and returns the result to the assistant. The entire round trip happens in milliseconds for database lookups, making the experience feel seamless and instantaneous.

AI Assistant Claude / Cursor
Local MCP Server Python on your machine
Database API websitecategorizationapi.com
Instant Results Categorization data

Because MCP servers run locally, your API key stays on your machine and is never transmitted to the AI model provider. The server only communicates with the specific API endpoints it was designed to use. This architecture gives you full control over what tools are available, how they are configured, and what data flows through them. You can inspect every request and response, stop the server at any time, and configure it to fit your specific workflow.

What the Database Lookup Server Does

The Database Lookup MCP server connects your AI assistant directly to our pre-classified database of over 30 million domains, providing instant categorization results for domain-level queries.

This MCP server is designed for speed and efficiency. Instead of analyzing webpages in real time, it retrieves pre-computed categorization data from our database, which means lookups return in milliseconds rather than seconds. Every domain in the database has already been crawled, analyzed, and classified across multiple taxonomy standards, so the data you receive is comprehensive and immediately usable.

The Database Lookup server supports domain-level queries only. You can look up domains like stripe.com or bbc.co.uk, but you cannot look up individual subpages such as bbc.co.uk/news/technology. If you need subpage-level classification, use the Real-Time API MCP server instead.

Data Available Per Domain

Each domain record in the database contains a rich set of categorization signals and metadata. When you look up a domain, the server returns all of the following data fields, giving you a multi-dimensional understanding of any website in a single request:

IAB v2 Categories (Tiers 1-4)

Full IAB Content Taxonomy version 2 classifications across all four tiers of depth. This is the industry standard for digital advertising and content categorization, ranging from broad top-level categories down to granular sub-categories that precisely describe the domain's content focus.

IAB v3 Categories (Tiers 1-4)

The latest IAB Content Taxonomy version 3 classifications with updated category structure and expanded coverage. Version 3 introduces new categories for emerging content types and refines the existing hierarchy, ensuring your data stays aligned with current industry standards.

Web Filtering Category

A content-filtering classification designed for security, compliance, and parental control applications. This identifies domains by content type in categories such as news, social media, adult content, gambling, malware, and dozens more, enabling automated policy enforcement.

Buyer Personas

Audience and buyer persona classifications that describe the typical visitors and customers of each domain. This data is useful for ad targeting, lead enrichment, and market research, helping you understand who is likely interacting with a given website.

OpenPageRank Score

A domain authority metric based on the Open PageRank initiative. This numerical score indicates the relative importance and authority of each domain on the web, useful for filtering out low-quality sites and prioritizing high-traffic, reputable domains in your workflows.

Country

The primary country associated with each domain, determined through a combination of hosting location, content language, top-level domain, and audience geo-distribution analysis. This enables geographic filtering, regional compliance checks, and market segmentation.

The database contains over 30 million pre-classified domains, covering an estimated 99% or more of active internet traffic by volume. The most popular domains, long-tail niche sites, and regional web properties are all represented, making it suitable for the vast majority of domain-level categorization needs.

Available Tools in the Server

The Database Lookup MCP server exposes five tools that your AI assistant can discover and call automatically. Each tool is designed for a specific type of query.

lookup_domain Single Domain Lookup

This is the most commonly used tool in the server. It takes a single domain name as input and returns the complete categorization record from the database. The response includes IAB v2 and v3 categories across all tiers, the web filtering classification, buyer persona data, the OpenPageRank score, and country information. Use this when you want detailed intelligence on one specific domain.

Example conversation with your AI assistant:

Example Prompt
// You ask Claude:
"Look up the domain stripe.com in the database."

// Claude calls lookup_domain with domain="stripe.com"
// and returns something like:

Domain: stripe.com
IAB v2 Tier 1: Technology & Computing
IAB v2 Tier 2: Computing > Internet Technology
Web Filtering: Financial Services
Buyer Persona: Tech-Savvy Business Decision Maker
PageRank: 8.42
Country: US

bulk_lookup Batch Domain Lookup (up to 100)

The bulk lookup tool accepts an array of up to 100 domain names in a single request and returns the full categorization record for each domain found in the database. This is significantly more efficient than calling lookup_domain repeatedly, both in terms of network overhead and credit consumption time. Any domains not found in the database will return null values, which indicates you should fall back to the Real-Time API for those specific domains.

Example conversation:

Example Prompt
// You ask Claude:
"Look up these domains in bulk: google.com, stripe.com,
shopify.com, github.com, and notion.so"

// Claude calls bulk_lookup with all 5 domains
// Returns categorization data for each domain

search_domains Search by Category or Country

This tool enables reverse lookups: instead of querying a known domain, you can search the database by category or country to discover domains that match specific criteria. For example, you can find all domains classified as "Financial Services" in Germany, or search for e-commerce domains in the United Kingdom. This is useful for market research, competitive analysis, lead generation, and building targeted domain lists.

Example Prompt
// You ask Claude:
"Search the database for e-commerce domains based in Germany."

// Claude calls search_domains with the appropriate parameters
// Returns a list of matching domains with their categorization data

database_stats View Database Statistics

Returns high-level statistics about the database, including the total number of classified domains, category distribution breakdowns, geographic coverage information, and the timestamp of the most recent database update. This tool takes no input parameters and is useful for understanding the scope and freshness of the data you are working with.

Example Prompt
// You ask Claude:
"Show me the database statistics."

// Claude calls database_stats
// Returns total domain count, category coverage, last update date, etc.

check_credits Monitor Your Usage

Queries your account to display the current credit balance and usage information. Each domain lookup (single or within a bulk request) costs one credit. This tool helps you monitor consumption and plan ahead, especially when running large batch operations. It takes no input parameters and returns your remaining credit balance alongside recent usage data.

Example Prompt
// You ask Claude:
"How many credits do I have left?"

// Claude calls check_credits and returns your balance

Complete Setup Instructions

Follow these steps to install and configure the Database Lookup MCP server on your machine. The entire setup process typically takes about five minutes.

1

Create an Account

If you do not already have an account, go to websitecategorizationapi.com and create a free account. Registration is quick and only requires an email address. Once your account is active, you will have access to the API dashboard and credit management system. New accounts include a set of free trial credits so you can test the service before purchasing additional credits.

2

Get Your API Key

After logging in, navigate to the Profile page from your dashboard. Your API key is displayed in the API credentials section. Copy this key and keep it in a secure location. You will need this key to authenticate the MCP server when it connects to the database API. Treat your API key like a password and do not share it publicly or commit it to version control repositories.

3

Install Python 3.10 or Later

The MCP server is built in Python and requires version 3.10 or higher. Check your current Python version by opening a terminal and running the command below. If you need to install or upgrade Python, visit python.org/downloads for the latest installer. On macOS, you can also install Python via Homebrew with brew install python. On Ubuntu or Debian-based Linux distributions, use sudo apt install python3.

Terminal
python3 --version
# Should output: Python 3.10.x or higher
4

Download the MCP Server Files

Download the Database Lookup MCP server package from the provided link or clone the repository. The server consists of a small set of Python files and a requirements file. Place the files in a directory where you want the server to live. A common location would be a directory within your home folder, such as ~/mcp-servers/database-lookup/. The directory should contain the main server script, the requirements.txt file, and a template for the environment configuration.

Terminal
# Create a directory and navigate into it
mkdir -p ~/mcp-servers/database-lookup
cd ~/mcp-servers/database-lookup

# Copy or download the server files here
5

Install Dependencies

The server depends on three Python packages: mcp (the Model Context Protocol SDK for Python), httpx (a modern async HTTP client), and python-dotenv (for loading environment variables from a .env file). Install all dependencies at once using the included requirements file. It is recommended to use a virtual environment to keep these packages isolated from your system Python installation.

Terminal
# Optional: create and activate a virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install the required packages
pip install -r requirements.txt

# The requirements.txt contains:
#   mcp
#   httpx
#   python-dotenv
6

Configure Your API Key

The server needs your API key to authenticate requests. The recommended approach is to create a .env file in the server directory with your key. Alternatively, you can set an environment variable in your shell profile. Either method works, but the .env file approach is simpler and keeps the configuration contained within the server directory.

.env file
# Create a .env file in the server directory
WEBSITE_CATEGORIZATION_API_KEY=your_api_key_here
Alternative: Shell Environment Variable
# Add to ~/.bashrc, ~/.zshrc, or equivalent
export WEBSITE_CATEGORIZATION_API_KEY="your_api_key_here"
7

Configure in Claude Desktop

Claude Desktop discovers MCP servers through its configuration file. The location of this file differs by operating system. Open the file in a text editor and add the server configuration in the mcpServers object. If the file does not exist, create it. After saving the configuration, restart Claude Desktop to load the new server.

macOS
Windows
Linux
~/Library/Application Support/Claude/claude_desktop_config.json
{
  "mcpServers": {
    "website-categorization-db": {
      "command": "python3",
      "args": [
        "/Users/YOUR_USERNAME/mcp-servers/database-lookup/server.py"
      ],
      "env": {
        "WEBSITE_CATEGORIZATION_API_KEY": "your_api_key_here"
      }
    }
  }
}
%APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "website-categorization-db": {
      "command": "python",
      "args": [
        "C:\\Users\\YOUR_USERNAME\\mcp-servers\\database-lookup\\server.py"
      ],
      "env": {
        "WEBSITE_CATEGORIZATION_API_KEY": "your_api_key_here"
      }
    }
  }
}
~/.config/Claude/claude_desktop_config.json
{
  "mcpServers": {
    "website-categorization-db": {
      "command": "python3",
      "args": [
        "/home/YOUR_USERNAME/mcp-servers/database-lookup/server.py"
      ],
      "env": {
        "WEBSITE_CATEGORIZATION_API_KEY": "your_api_key_here"
      }
    }
  }
}

Replace YOUR_USERNAME with your actual system username and your_api_key_here with the API key from Step 2. If you are using a virtual environment, replace python3 with the full path to the Python binary inside the venv, such as /Users/YOUR_USERNAME/mcp-servers/database-lookup/venv/bin/python3.

8

Configure in Claude Code

Claude Code, the CLI version of Claude for software development, supports MCP servers through its own configuration system. You can add a server from the command line using the claude mcp add command, or by editing the .mcp.json file in your project root. Both methods achieve the same result.

Terminal - Claude Code CLI
# Add the MCP server via the CLI
claude mcp add website-categorization-db \
  python3 /path/to/mcp-servers/database-lookup/server.py \
  -e WEBSITE_CATEGORIZATION_API_KEY=your_api_key_here
.mcp.json (in your project root)
{
  "mcpServers": {
    "website-categorization-db": {
      "command": "python3",
      "args": ["/path/to/mcp-servers/database-lookup/server.py"],
      "env": {
        "WEBSITE_CATEGORIZATION_API_KEY": "your_api_key_here"
      }
    }
  }
}
9

Configure in Cursor

Cursor, the AI-powered code editor, also supports MCP servers. Open Cursor and navigate to Settings, then find the MCP Servers section. Click to add a new server and fill in the configuration details. The command should be python3 (or the full path to your venv Python), with the server script path as the argument. Set the environment variable for your API key in the environment configuration field.

Cursor MCP Configuration
// In Cursor: Settings → MCP Servers → Add Server

Name:      website-categorization-db
Command:   python3
Args:      /path/to/mcp-servers/database-lookup/server.py
Env:       WEBSITE_CATEGORIZATION_API_KEY=your_api_key_here
10

Test It Out

After configuring your AI assistant, restart it to load the new MCP server. You should see the Database Lookup tools appear in the available tools list. Try the following example prompts to verify everything is working correctly:

Example Test Prompts
# Test single domain lookup
"What category is bbc.co.uk?"

# Test bulk lookup
"Categorize these domains: amazon.com, etsy.com, shopify.com,
walmart.com, and target.com"

# Test search
"Find technology domains from Japan in the database."

# Test database stats
"Show me the current database statistics."

# Test credit check
"How many API credits do I have remaining?"

If the AI assistant responds with categorization data, the server is working correctly. If you encounter errors, double-check that your API key is set correctly, the file path to the server script is accurate, and all dependencies are installed.

Use Cases for the Database Lookup Server

The combination of instant lookups, bulk processing capability, and rich categorization data makes this server suitable for a wide range of professional and technical applications.

Bulk Domain Analysis and Research

Analyze large lists of domains quickly by feeding them through the bulk lookup tool. Researchers, analysts, and data teams can categorize thousands of domains in minutes by batching them into groups of 100, creating categorized datasets for further analysis without waiting for real-time classification of each individual site.

Ad-Tech Bid Stream Enrichment

Enrich bid request data in programmatic advertising pipelines with IAB category information and brand safety signals. Because database lookups return in milliseconds, they can be integrated into real-time bidding workflows where latency budgets are tight and decisions need to be made in under 100 milliseconds per impression opportunity.

Web Content Filtering

Build parental controls, corporate web filters, and safe browsing solutions using the web filtering category data. Each domain is classified into categories like adult content, gambling, social media, news, and more, enabling you to block or allow domains based on content policy rules without inspecting the actual page content in real time.

Lead Enrichment and Sales Intelligence

Enrich prospect lists and CRM data with domain categorization, buyer persona information, and authority scores. Sales teams and marketing platforms can automatically classify prospect websites to understand their industry, target audience, and web authority, enabling better lead scoring, segmentation, and personalized outreach strategies.

Domain Portfolio Categorization

Domain investors, registrars, and marketplace operators can categorize entire portfolios of domains to understand distribution across industries, identify undervalued domains by category, and create filtered browsing experiences for buyers. The bulk lookup tool makes it practical to classify portfolios of thousands of domains efficiently.

Compliance Screening

Financial institutions, payment processors, and regulated businesses can screen merchant and partner websites against compliance policies. The web filtering categories and content classifications help identify websites involved in gambling, adult content, pharmaceuticals, or other regulated industries that may require additional review or restrictions.

Database Lookup vs Real-Time API

We offer two MCP servers with different strengths. Choose the right one for your workflow, or use both together for maximum flexibility.

Feature Database Lookup Real-Time API
Response Speed Instant (milliseconds) 2 to 10 seconds per URL
Coverage 30M+ pre-classified domains Any URL on the internet
URL Depth Domain-level only (e.g. stripe.com) Full subpage support (e.g. bbc.co.uk/news/tech)
Data Freshness Quarterly database updates Real-time analysis of live page content
Taxonomy Support IAB v2/v3 + Web Filtering IAB v2/v3, IPTC, Web Filtering, and more
Enrichment Data Personas, PageRank, Country Full enriched data including technologies, quality scores
Bulk Operations Up to 100 domains per batch One URL per request
Search by Category Yes, reverse lookups supported Not available
Best For Speed-critical and bulk analysis workflows Subpage classification and fresh content analysis

Use Database Lookup When:

  • You need instant responses for latency-sensitive applications
  • You are processing large lists of domains in bulk
  • Domain-level classification is sufficient for your use case
  • You want to search the database by category or country
  • You need buyer personas and PageRank data alongside categories

Use Real-Time API When:

  • You need to classify individual subpages, not just domains
  • The domain is not found in the 30M+ database
  • You need the most up-to-date classification of a live page
  • You need additional taxonomies beyond IAB and Web Filtering
  • You need technology detection and full quality scores

Many teams use both servers together. The Database Lookup server handles the fast, high-volume domain-level work, while the Real-Time API server handles subpage classification, newly registered domains, and cases where the database returns no result. You can install both MCP servers simultaneously in Claude Desktop, Claude Code, or Cursor.

Database Coverage and Example Data

The database contains over 30 million pre-classified domains, representing the vast majority of active websites on the internet by traffic volume.

Our database is built by continuously crawling and classifying the most visited websites globally, along with a comprehensive sweep of the long-tail web. The result is a collection of over 30 million domain records, each enriched with multiple layers of categorization data. In terms of real-world traffic, these 30 million domains account for an estimated 99 percent or more of all HTTP requests on the internet, because the distribution of web traffic follows a power law where the top millions of domains receive the overwhelming majority of visits.

The database includes major global websites, regional and country-specific sites, niche industry portals, e-commerce storefronts, news publications, blogs, corporate sites, SaaS platforms, government domains, educational institutions, and virtually every other category of web property. Below are example records for a few well-known domains to illustrate the type and depth of data returned by the lookup tools.

stripe.com
IAB v2 T1 Technology & Computing
Web Filter Financial Services
Persona Tech Decision Maker
PageRank 8.42
Country US
bbc.co.uk
IAB v2 T1 News and Politics
Web Filter News / Media
Persona Informed News Reader
PageRank 9.18
Country GB
shopify.com
IAB v2 T1 Technology & Computing
Web Filter E-Commerce
Persona E-Commerce Entrepreneur
PageRank 8.95
Country CA

Note: The example data above is illustrative. Actual data fields and values may vary. The database is updated quarterly to ensure domain classifications stay accurate as websites evolve and new domains are added.

Simple Credit-Based Pricing

Every domain lookup through the Database MCP server costs one credit. If you look up a single domain with lookup_domain, that is one credit. If you run a bulk_lookup of 100 domains, that is 100 credits. The database_stats and check_credits tools do not consume credits. This pricing model is identical whether you use the MCP server or the API directly.

New accounts receive free trial credits to get started. Visit the pricing page for full details on credit packages, volume discounts, and enterprise plans.

View Pricing Plans

Frequently Asked Questions

Common questions about the Database Lookup MCP server, its capabilities, and how it compares to other options.

Can I look up subpages like bbc.co.uk/news/technology?
No. The Database Lookup server is designed for domain-level queries only. It accepts root domains such as bbc.co.uk, stripe.com, or nytimes.com, and returns the stored categorization data for that domain. It does not support individual subpages or URLs with paths. If you need to classify specific subpages, such as a particular article or product page, use the Real-Time API MCP server instead. The Real-Time server crawls and classifies the actual content of any URL on the internet, including deeply nested subpages.
How fresh is the data in the database?
The database is updated on a quarterly cycle. During each update, our crawlers re-visit all indexed domains, add newly discovered domains, and reclassify any sites whose content has changed significantly. For most established websites, categorization data remains stable between updates. However, if you need the absolute latest classification for a domain that has recently changed its content focus, you can use the Real-Time API MCP server for a fresh, on-the-fly analysis of the current page content.
What happens if a domain is not in the database?
If you look up a domain that is not present in the database, the server returns null values for that domain, indicating no record was found. This typically happens with very new domains, extremely low-traffic sites, or recently registered domains that have not yet been crawled. In these cases, you should fall back to the Real-Time API MCP server, which can classify any live URL regardless of whether it appears in the pre-classified database. Many users configure both MCP servers and use the database as the primary source with the real-time API as a fallback.
Is the Database Lookup faster than the Real-Time API?
Yes, significantly. The Database Lookup server returns results in milliseconds because it is retrieving pre-computed data from an indexed database. There is no page crawling, content parsing, or machine learning classification happening at query time. The Real-Time API, by contrast, needs to fetch the target URL, download its content, parse the HTML, extract features, and run the classification model, which typically takes between 2 and 10 seconds per URL. If speed is your priority and domain-level data is sufficient, the database approach is substantially faster.
Can I use both MCP servers at the same time?
Absolutely. Claude Desktop, Claude Code, and Cursor all support multiple MCP servers running simultaneously. You can configure both the Database Lookup server and the Real-Time API server in the same configuration file. The AI assistant will see all available tools from both servers and can decide which one to call based on the nature of your request. For example, if you ask about a domain, it might use the database. If you ask about a specific article URL, it might use the real-time server. You can also instruct the assistant which server to prefer in your prompts.
How many domains can I look up in a single batch request?
The bulk_lookup tool accepts up to 100 domains per request. Each domain in the batch counts as one credit. If you need to process more than 100 domains, you can make multiple batch requests sequentially. For example, to categorize 500 domains, you would make 5 batch requests of 100 domains each. The AI assistant can automate this for you by splitting a larger list into batches and processing them in sequence, then combining the results into a single response.
Do I need separate API keys for the Database and Real-Time servers?
No. The same API key and credit balance work across both MCP servers as well as direct API calls. Your account is unified, so any credits you purchase can be used with any of our services. This makes it simple to manage billing and usage even if you are using multiple integration methods simultaneously.
Which AI assistants are compatible with this MCP server?
The server follows the open MCP standard and works with any MCP-compatible AI client. Currently supported and tested clients include Claude Desktop (the GUI application from Anthropic), Claude Code (the CLI tool for developers), and Cursor (the AI-powered code editor). As the MCP ecosystem grows, additional clients and editors are expected to add MCP support. Because MCP is an open protocol, any application that implements the MCP client specification can connect to this server without modification.

Start Looking Up Domains in Milliseconds

Create your free account, grab your API key, and connect the Database Lookup MCP server to your AI assistant. You can be running domain categorization queries in under five minutes.