This document provides detailed explanations for each field returned when using expanded_categories option with our URL Categorization API. The data is generated through advanced contextual analysis of a URL's content to provide deep insights into its purpose, technology, and audience.
Performs a security scan on the URL to check against known threats. This field indicates whether the URL is flagged for distributing malware, engaging in social engineering (e.g., phishing), or hosting other malicious software.
Identifies the front-end and back-end web technologies used on the page. Our system scans for signatures of over 4,000 frameworks, analytics tools, advertising networks, and other libraries to profile the site's tech stack.
"{ "technologies": { "urls": { "https://www.apple.com/": { "status": 200 } }, "technologies": [ { "slug": "cart-functionality", "name": "Cart Functionality", "description": "Websites that have a shopping cart or checkout page, either using a known ecommerce platform or a custom solution.", "confidence": 100, "version": null, "icon": "Cart-generic.svg", "website": "https://www.wappalyzer.com/technologies/ecommerce/cart-functionality", "cpe": null, "categories": [ { "id": 6, "slug": "ecommerce", "name": "Ecommerce" } ], "rootPath": true }, { "slug": "apple-mapkit-js", "name": "Apple MapKit JS", "description": "Apple MapKit JS lets you embed interactive maps directly into your websites across platforms and operating systems, including iOS and Android.", "confidence": 100, "version": null, "icon": "Apple.svg", "website": "https://developer.apple.com/maps/web/", "cpe": null, "categories": [ { "id": 35, "slug": "maps", "name": "Maps" } ], "rootPath": true }, { "slug": "adobe-target", "name": "Adobe Target", "description": "Adobe Target is an A/B testing, multi-variate testing, personalisation, and optimisation application", "confidence": 100, "version": "2.3.2", "icon": "Adobe.svg", "website": "https://www.adobe.com/marketing/target.html", "cpe": null, "categories": [ { "id": 74, "slug": "a-b-testing", "name": "A/B Testing" }, { "id": 76, "slug": "personalisation", "name": "Personalisation" } ], "rootPath": true }, { "slug": "adobe-analytics", "name": "Adobe Analytics", "description": "Adobe Analytics is a web analytics, marketing and cross-channel analytics application.", "confidence": 100, "version": null, "icon": "Adobe Analytics.svg", "website": "https://www.adobe.com/analytics/adobe-analytics.html", "cpe": null, "categories": [ { "id": 10, "slug": "analytics", "name": "Analytics" } ], "rootPath": true }, { "slug": "preact", "name": "Preact", "description": "Preact is a JavaScript library that describes itself as a fast 3kB alternative to React with the same ES6 API.", "confidence": 100, "version": null, "icon": "Preact.svg", "website": "https://preactjs.com", "cpe": null, "categories": [ { "id": 59, "slug": "javascript-libraries", "name": "JavaScript libraries" } ], "rootPath": true }, { "slug": "hsts", "name": "HSTS", "description": "HTTP Strict Transport Security (HSTS) informs browsers that the site should only be accessed using HTTPS.", "confidence": 100, "version": null, "icon": "default.svg", "website": "https://www.rfc-editor.org/rfc/rfc6797#section-6.1", "cpe": null, "categories": [ { "id": 16, "slug": "security", "name": "Security" } ], "rootPath": true }, { "slug": "open-graph", "name": "Open Graph", "description": "Open Graph is a protocol that is used to integrate any web page into the social graph.", "confidence": 100, "version": null, "icon": "Open Graph.png", "website": "https://ogp.me", "cpe": null, "categories": [ { "id": 19, "slug": "miscellaneous", "name": "Miscellaneous" } ], "rootPath": true } ] } }"
Extracts the main, high-level subjects or themes discussed on the web page. This provides a quick, categorical understanding of the page's core subject matter, distinct from specific keywords.
Provides a list of domains that are considered similar in terms of industry, content, and market position. This is ideal for competitive analysis and market research.
Detects and categorizes important named entities mentioned in the content. Entities can include organizations, products, services, locations, people, and specific technologies. This helps in quickly identifying the key players and subjects of discussion.
Infers potential customer profiles or "buyer personas" that the page content and offers are targeting. This analysis is based on the products, language, and promotions present on the page.
A list of important keywords and search terms that are highly relevant to the page's content. This is valuable for SEO analysis, content strategy, and pay-per-click (PPC) ad campaign planning.
"Related Keywords": [ "macbook air", "iphone 16", "airpods pro 2", "apple intelligence", "trade-in", "apple card", "apple tv+", "apple music", "hearing aid feature", "education savings" ],Identifies brands mentioned on the page and provides a high-level reputation analysis (e.g., Positive, Neutral, Negative) based on the context in which they are mentioned. This differs from general sentiment by focusing specifically on the brand's portrayal.
Provides an estimated demographic profile of the target audience. This includes attributes like age ranges and potential roles (e.g., student, professional) based on the page's content, offers, and language.
Evaluates the overall emotional tone of the content related to key entities on the page. The sentiment is typically classified as Positive, Neutral, or Negative, giving insight into how subjects are being portrayed.
Detects the primary natural language of the text content on the page.
Identifies the primary legal entity or organization associated with the website's content. If an address is explicitly mentioned in a clear context (like a footer or contact page), it will be extracted here.
A set of categorical tags derived from the page content, similar to blog post tags. These tags help in filtering, grouping, and organizing URLs based on fine-grained topics and product categories.
Lists companies that are direct competitors or operate in a similar space. This field is based on an analysis of the primary entity's market, products, and services mentioned on the page.