WebsiteCategorizationAPI
Home
Demo Tools - Categorization
Website Categorization Text Classification URL Database Taxonomy Mapper
Demo Tools - Website Intel
Technology Detector Quality Score Competitor Finder
Demo Tools - Brand Safety
Brand Safety Checker Brand Suitability Quality Checker
Demo Tools - Content
Sentiment Analyzer Context Aware Ads
MCP Servers
MCP Real-Time API MCP Database Lookup
Agentic Workflows
AI Agent Database 100 Use Cases Hedge Fund Workflows Banking Workflows Healthcare Workflows E-Commerce Workflows SaaS Workflows View All 47 Industries →
Domains By
Domains for your ICP Domains by Vertical Domains by Country Domains by Technologies
Resources
API Documentation Pricing Login
Try Categorization
Persian Web Dataset

Persian Language Website Database

Navigate a distinct digital ecosystem. Our database captures websites written in Farsi (فارسی), covering the enclosed web of Iran (using local alternatives to global platforms) and the vibrant international diaspora.

650K+

Active Persian Sites

Local Apps

Unique Tech Stack

Poetry

Rich Cultural Content

Inside the "Halal Net"

The Persian web is one of the most isolated yet active ecosystems in the world. Blocked from many global services, local developers have built robust alternatives for ride-hailing (Snapp), e-commerce (Digikala), and video (Aparat).

Our database uses Right-to-Left (RTL) analysis and specific Farsi character detection (gaf, che, pe, zhe) to differentiate it from Arabic. We help you understand this self-contained market and the diverse content created by millions of Persian speakers worldwide.

Local Platform Detection

Understanding the Iranian stack:

  • Shaparak Integration: Identification of the national payment gateway system, a sure sign of a domestic business.
  • Hosting Location: We flag sites hosted on domestic Iranian servers (Intranet) vs international hosting.

Understanding the Persian Digital Landscape

The Persian digital landscape is one of the most fascinating and complex on the internet, shaped by a combination of high technical literacy, government-imposed internet restrictions, and a deeply entrepreneurial culture. With over 80 million people in Iran alone and tens of millions more in Afghanistan, Tajikistan, and the global diaspora, the Persian web represents a massive yet often overlooked digital market. Iranian developers have built an entirely self-contained tech ecosystem with local alternatives for virtually every major global platform, creating unique opportunities for market intelligence.

Key industries on the Persian web include e-commerce through platforms like Digikala and Basalam, ride-hailing via Snapp and Tapsi, food delivery through SnappFood, and digital payments via Shaparak-connected gateways. The real estate sector is extremely active online, with platforms like Divar and Sheypoor functioning as comprehensive classified marketplaces. Iran's robust manufacturing sector covering automobiles, petrochemicals, steel, and carpets maintains extensive B2B web presences, while the tourism industry operates a parallel digital infrastructure featuring hotels, tour operators, and cultural heritage sites marketed in Farsi.

The Persian web ecosystem is uniquely split between the domestic Iranian intranet and the international diaspora web. Inside Iran, websites operate under specific hosting regulations and connect to the national payment infrastructure, making them identifiable through technical markers. Outside Iran, a vibrant diaspora web thrives in cities like Los Angeles, Toronto, London, and Hamburg, featuring media outlets, professional services, and cultural organizations. Afghan Dari-language websites from Kabul and Herat add another dimension, while Tajik Cyrillic-script Persian content represents a growing frontier, making this one of the most linguistically diverse datasets in our collection.

Strategic Use Cases

Diaspora Marketing

Target the affluent Persian communities in Los Angeles ("Tehrangeles"), Toronto, and London. Filter for Farsi sites hosted outside Iran.

Literature & Arts

Persian culture is literary. Access thousands of blogs, publishers, and cultural portals preserving classical poetry and modern art.

App Store Analytics

Cafe Bazaar is the local Android store. Identify the landing pages of top Iranian mobile apps and games.

Farsi NLP Training

Farsi is a low-resource language for many Western AI models. Use our corpus to train on the specific script, font rendering, and grammar of Persian.

E-Commerce Market Research

Analyze Iran's booming domestic e-commerce ecosystem. Map Digikala competitors, niche vertical marketplaces, and local payment gateway integrations to understand purchasing patterns in this self-contained economy.

Media & Sentiment Analysis

Monitor Persian-language news outlets, opinion blogs, and social commentary platforms. Essential for geopolitical analysts, media researchers, and organizations tracking public discourse across the Persian-speaking world.

Database Coverage by Category

E-Commerce & Marketplaces

Online retailers, classified platforms like Divar, B2B trade portals, and niche vertical stores selling electronics, fashion, and handicrafts.

News & Media

State and independent news agencies, political commentary blogs, diaspora media outlets, and cultural magazines published in Farsi.

Technology & Startups

Iranian tech startups, app developer portfolios, SaaS platforms, and the thriving Cafe Bazaar mobile app ecosystem landing pages.

Literature & Culture

Poetry archives, literary journals, film review sites, art galleries, and platforms preserving Persian calligraphy and classical heritage.

Education & Academia

University portals, online learning platforms, research institute websites, and test preparation services for the Iranian Konkur exam system.

Real Estate & Property

Property listing platforms, construction companies, interior design firms, and real estate agencies operating across Iranian cities.

Regional Coverage: Iran, Afghanistan & Global Diaspora

Our Persian database provides extensive coverage of the Iranian domestic web, spanning content hosted on .ir domains and locally hosted servers across all 31 provinces. The heaviest concentration of websites originates from Tehran, Isfahan, Mashhad, Shiraz, and Tabriz, covering everything from bazaar merchants with e-commerce portals to industrial manufacturers and government service websites. We distinguish between sites connected to the Shaparak national payment system and those operating internationally, giving you precise insight into which businesses operate domestically versus globally.

Beyond Iran, our database captures Dari-language content from Afghanistan, particularly from Kabul, Herat, and Mazar-i-Sharif, as well as Tajik Persian content from Tajikistan. The global diaspora coverage is equally comprehensive, indexing Farsi-language websites from major communities in Los Angeles, Toronto, London, Hamburg, Sydney, and Dubai. This diaspora web includes media outlets like Manoto, professional service providers, cultural organizations, and e-commerce sites targeting Persian speakers abroad, providing a complete 360-degree view of the global Persian digital ecosystem.

Enriched Data Fields

Every domain includes comprehensive metadata

  • Domain name and URL with RTL script rendering validation
  • Language confidence score distinguishing Farsi from Arabic and Urdu
  • Domestic vs diaspora hosting classification flag
  • Shaparak payment gateway integration detection
  • Industry classification using IAB content taxonomy
  • Technology stack detection including local CMS platforms
  • Social media presence across both global and local platforms
  • Contact information including Iranian phone format validation
  • SSL certificate status and server location identification
  • Domain age, registration details, and last verified crawl date

Unlock the Persian Web

Get the most accurate, verified list of Farsi-language websites.

Get the Data