Transparent, accurate, and comprehensive performance monitoring
WPindex.io uses a multi-stage process to identify, validate, and continuously monitor the performance of WordPress websites. Our methodology combines artificial intelligence, proprietary detection algorithms, and industry-standard performance testing to provide accurate, representative data about the state of WordPress performance across the web.
We use OpenAI's API to extract and identify potential WordPress websites from publicly available online sources. This AI-driven approach allows us to discover a diverse, representative sample of WordPress sites across different industries, geographic regions, and website scales.
Key Features: Natural language processing, pattern recognition, source diversity analysis
Each discovered website is verified by our proprietary WordPress detection scanner. The scanner analyzes multiple technical indicators to confirm that a site is genuinely powered by WordPress, which keeps our dataset accurate and relevant.
Detection Methods: HTML structure analysis, WordPress-specific markers, meta tag verification, common WordPress patterns
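The scanner itself is proprietary, so the following is only a minimal sketch of how marker-based detection of this kind could work. The pattern names, the specific markers, and the `min_hits` threshold are all assumptions chosen for illustration, not the rules WPindex.io actually applies.

```python
import re

# Hypothetical indicator patterns; the real scanner's rules are not published.
INDICATORS = {
    # <meta name="generator" content="WordPress x.y">
    "generator_meta": re.compile(
        r'<meta[^>]+name=["\']generator["\'][^>]+content=["\']WordPress', re.I
    ),
    # Asset URLs under the standard WordPress content directory
    "wp_content_path": re.compile(r"/wp-content/(?:themes|plugins|uploads)/", re.I),
    # Core scripts and styles served from wp-includes
    "wp_includes_path": re.compile(r"/wp-includes/", re.I),
    # REST API discovery link that WordPress adds to the page head
    "rest_api_link": re.compile(r'<link[^>]+rel=["\']https://api\.w\.org/["\']', re.I),
}


def detect_wordpress(html: str, min_hits: int = 2) -> tuple[bool, list[str]]:
    """Return (is_wordpress, matched_indicator_names) for a page's HTML.

    Requiring several independent markers (min_hits is an assumed threshold)
    lowers the chance that a single stray string causes a false positive.
    """
    hits = [name for name, pattern in INDICATORS.items() if pattern.search(html)]
    return len(hits) >= min_hits, hits
```

Calling `detect_wordpress(page_html)` on fetched markup returns both the verdict and the list of markers that matched, which makes individual decisions easy to audit.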
Verified WordPress sites are continuously monitored using Google's PageSpeed Insights API v5. We collect comprehensive performance metrics for both mobile and desktop experiences, capturing Core Web Vitals and additional performance indicators that provide a complete picture of each site's performance characteristics.
Metrics Collected: Lighthouse Score, LCP, INP, CLS, TBT, FCP, TTFB, Core Web Vitals Pass Rate
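As an illustration of this step, the sketch below builds a PageSpeed Insights v5 request URL and pulls the lab metrics out of a parsed JSON response. The function names are hypothetical, and the audit ids are those used by recent Lighthouse versions, which may change over time. INP is left out of the sketch because the text above does not describe how it is obtained.

```python
from urllib.parse import urlencode

PSI_ENDPOINT = "https://www.googleapis.com/pagespeedonline/v5/runPagespeed"

# Metric abbreviation -> Lighthouse audit id (may differ between Lighthouse versions).
LAB_AUDITS = {
    "LCP": "largest-contentful-paint",
    "CLS": "cumulative-layout-shift",
    "TBT": "total-blocking-time",
    "FCP": "first-contentful-paint",
    "TTFB": "server-response-time",
}


def build_psi_url(site_url, strategy="mobile", api_key=None):
    """Build the request URL for one site and one strategy ("mobile" or "desktop")."""
    params = {"url": site_url, "strategy": strategy, "category": "performance"}
    if api_key:
        params["key"] = api_key
    return f"{PSI_ENDPOINT}?{urlencode(params)}"


def extract_lab_metrics(response: dict) -> dict:
    """Extract the Lighthouse score (0-100) and raw metric values from a PSI response."""
    lighthouse = response["lighthouseResult"]
    raw_score = lighthouse["categories"]["performance"].get("score")
    metrics = {"score": None if raw_score is None else round(raw_score * 100)}
    for short_name, audit_id in LAB_AUDITS.items():
        # numericValue is in milliseconds for timings and unitless for CLS.
        metrics[short_name] = lighthouse["audits"].get(audit_id, {}).get("numericValue")
    return metrics
```

Fetching `build_psi_url(...)` with any HTTP client and passing the decoded JSON to `extract_lab_metrics` yields one flat record per site and strategy. Audits missing from a response come back as `None` rather than raising.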
Raw performance data undergoes systematic aggregation to produce meaningful insights. We calculate daily averages, 30-day rolling windows, and comprehensive statistics that reveal trends and patterns in WordPress performance across our entire dataset.
Aggregation Methods: Daily rollups, 30-day moving averages, percentile calculations, trend analysis
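To make the rollup and moving-average steps concrete, here is a minimal sketch using only the Python standard library. It assumes samples arrive as `(date, value)` pairs, and it defines the trailing window over the most recent days that have data rather than strict calendar days. Both are simplifying assumptions, not a description of the production pipeline.

```python
from collections import defaultdict
from datetime import date
from statistics import mean


def daily_rollup(samples):
    """Collapse (date, value) samples into one average per day, ordered by date."""
    buckets = defaultdict(list)
    for day, value in samples:
        buckets[day].append(value)
    return {day: mean(values) for day, values in sorted(buckets.items())}


def moving_average(daily, window=30):
    """Trailing average over the last `window` days that have data.

    Early days average over however many days exist so far.
    """
    days = sorted(daily)
    averaged = {}
    for i, day in enumerate(days):
        recent = [daily[d] for d in days[max(0, i - window + 1): i + 1]]
        averaged[day] = mean(recent)
    return averaged


samples = [
    (date(2025, 12, 1), 80),
    (date(2025, 12, 1), 90),
    (date(2025, 12, 2), 70),
]
daily = daily_rollup(samples)
trend = moving_average(daily, window=30)
```

Keeping the daily rollup as a separate step means the rolling window operates on one value per day, so days with more tests do not dominate the trend.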
We use PageSpeed Insights lab data (Lighthouse) rather than field data to ensure consistent, reproducible measurements across all sites.
Sites are measured on a rolling schedule, with one domain tested per minute. This keeps the data fresh while staying within API rate limits.
Both mobile and desktop strategies are recorded. Mobile performance is the primary focus, reflecting modern web usage patterns.
Aggregations include averages, medians, and percentiles, so that reports describe the full distribution of performance rather than a single central value.
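The one-domain-per-minute schedule described above implies a simple relationship between dataset size and data freshness: at 1,440 tests per day, a dataset of N domains is fully revisited every N / 1,440 days. The sketch below works through that arithmetic and shows a plain round-robin ordering. The function names are hypothetical, and round-robin is an assumed scheduling policy, since the text does not say how domains are ordered.

```python
from itertools import cycle, islice

MINUTES_PER_DAY = 24 * 60  # 1,440 one-minute slots per day


def refresh_interval_days(domain_count, per_minute=1):
    """Days needed to test every domain once at the given rate."""
    return domain_count / (per_minute * MINUTES_PER_DAY)


def schedule(domains, slots):
    """Return the domains tested in the first `slots` one-minute slots (round-robin)."""
    return list(islice(cycle(domains), slots))
```

For example, `refresh_interval_days(14400)` gives 10.0, meaning each of 14,400 domains would be retested roughly every ten days at this rate.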
All data collection occurs through publicly accessible APIs and web endpoints. We do not store personally identifiable information, access private areas of websites, or collect any data beyond performance metrics. Our methodology respects robots.txt directives and adheres to responsible web scraping practices.
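As one illustration of respecting robots.txt, the sketch below checks whether a URL may be fetched using Python's standard `urllib.robotparser`. The helper name and the `ExampleBot` user agent are hypothetical, and the sketch assumes the robots.txt body has already been downloaded as text.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if `user_agent` may fetch `url` under the given robots.txt rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Example rules: everything is allowed except paths under /private/.
rules = "User-agent: *\nDisallow: /private/\n"
home_ok = is_allowed(rules, "ExampleBot", "https://example.com/")
private_ok = is_allowed(rules, "ExampleBot", "https://example.com/private/page")
```

Running such a check before each request means a disallowed path is skipped without ever being fetched.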
Site owners can request removal from our monitoring at any time by contacting us at dragos@wpindex.io.
This methodology is continuously refined to ensure accuracy and relevance. Last updated: December 2025