Data Dictionary

Complete reference for all data fields, response attributes, and schema used in API responses.

Response Schema

The Website Categorization API returns structured JSON responses containing domain categorization data. This dictionary defines all fields and their meanings to help you correctly interpret and use the data in your applications.

Categorization Response

The main categorization endpoint returns the following structure:

{
  "domain": "example.com",
  "categories": [...],
  "meta": {...},
  "enrichment": {...}
}

Root Fields

Field Type Description
domain string The domain that was categorized, normalized to lowercase without protocol or path.
categories array Array of category objects representing the domain's content classifications. Contains 1-5 categories ordered by confidence score.
meta object Metadata about the API request including request ID, processing time, and usage information.
enrichment object Optional additional data about the domain (available on higher-tier plans).

Category Object Fields

Field Type Description
id string IAB category identifier (e.g., "IAB19-6"). Unique identifier for the category in the taxonomy.
name string Human-readable category name (e.g., "Technology & Computing > Software").
tier integer Category depth in the taxonomy hierarchy. Tier 1 is top-level, Tier 2 is sub-category, etc. Range: 1-4.
confidence float Confidence score for this categorization. Range: 0.0 to 1.0, where 1.0 indicates highest confidence.
parent_id string ID of the parent category (null for Tier 1 categories). Enables traversing the category hierarchy.

Meta Object Fields

Field Type Description
request_id string Unique identifier for this API request. Include in support requests for troubleshooting.
processing_time_ms integer Server-side processing time in milliseconds.
credits_used integer Number of API credits consumed by this request.
cached boolean Whether the result was served from cache (true) or computed in real-time (false).
last_updated string ISO 8601 timestamp of when this domain was last analyzed.

Enrichment Object Fields

Available on Professional and higher plans:

Field Type Description
business_model string Detected business model (e.g., "saas", "ecommerce", "publisher", "marketplace").
audience string Primary target audience (e.g., "b2b", "b2c", "enterprise", "smb").
traffic_tier string Relative traffic level (e.g., "high", "medium", "low", "minimal").
company_size string Estimated company size (e.g., "enterprise", "mid_market", "small_business", "startup").
technologies array Array of detected technologies on the website (CMS, analytics, frameworks, etc.).

Confidence Score Interpretation

Use confidence scores to filter results based on your quality requirements:

  • 0.90 - 1.00: Very high confidence, suitable for automated decisions
  • 0.75 - 0.89: High confidence, reliable for most use cases
  • 0.50 - 0.74: Moderate confidence, consider human review for critical decisions
  • Below 0.50: Lower confidence, treat as suggestions rather than definitive classifications

Ready to Explore the Data?

Start using our rich categorization data in your applications.

View Full Documentation