< Wikimedia Product < Data dictionary

This page describes the data set pageviews_hourly that stores on Druid Datasources, which can be accessed via Superset/Turnilo. pageviews_hourly on Druid is generated by aggregating wmf.pageview_hourly on Hive by hour, while wmf.pageview_hourly on Hive is extracted from wmf.pageview_actor.

Schema

Field namedata typedescriptiondata examplesource schemasource field
projectstringProject name from requests hostnameaa.wikipediawmf.pageview_actorpageview_info['project']
agent_typestringAgent accessing the pages, can be spider, user or automated (see BotDetection)userwmf.pageview_actoragent_type
ua_browser_familystringName of web browser (if not using an official Wikipedia mobile app), extracted from the client device's User-AgentFirefoxwmf.pageview_actoruser_agent_map['browser_family']
ua_device_familystringClient device family (e.g. brand of manufacturer, product name), extracted from the client device's User-Agent if providedOtherwmf.pageview_actoruser_agent_map['device_family']
citystringCity iso code of the accessing agents (computed using maxmind GeoIP database)Apple Valleywmf.pageview_actorgeocoded_data['city']
subdivisionstringSubdivision of the accessing agents (computed using maxmind GeoIP database)Californiawmf.pageview_actorgeocoded_data['subdivision']
ua_wmf_app_versionstringVersion of official Wikipedia mobile app (for iOS, Android, and KaiOS), extracted from the client device's User-Agent-wmf.pageview_actoruser_agent_map['wmf_app_version']
countrystringCountry (text) of the accessing agents (computed using maxmind GeoIP database)United Stateswmf.pageview_actorgeocoded_data['country']
country_codestringCountry iso code of the accessing agents (computed using maxmind GeoIP database)USwmf.pageview_actorgeocoded_data['country_code']
ua_os_majorstringMajor version of that Operating System, extracted from the client device's User-Agent10wmf.pageview_actoruser_agent_map['os_major']
continentstringContinent of the accessing agents (computed using maxmind GeoIP database)North Americawmf.pageview_actorgeocoded_data['continent']
ua_os_familystringOperating System family used by the client device, extracted from the User-AgentMac OS Xwmf.pageview_actoruser_agent_map['os_family']
language_variantstringLanguage variant from requests path (not set if present in project name)defaultwmf.pageview_actorpageview_info['language_variant']
ua_os_minorstringMinor version of that Operating System, extracted from the client device's User-Agent14wmf.pageview_actoruser_agent_map['os_minor']
referer_classstringCan be none (null, empty or \'-\'), unknown (domain extraction failed), internal (domain is a wikimedia project), external (search engine) (domain is one of google, yahoo, bing, yandex, baidu, duckduckgo), external (any other)nonewmf.pageview_actorreferer_class
zero_carrierstringNULL as zero program is overNullNULL
access_methodstringMethod used to access the pages, can be desktop, mobile web, or mobile appdesktopwmf.pageview_actoraccess_method
ua_browser_majorstringMajor version of the client browser, extracted from the client device's User-Agent68wmf.pageview_actoruser_agent_map['browser_major']
project_familystringProject familywikipediacanonical_data.wikisdatabase_group
view_countbigintNumber of views1wmf.pageview_actorcount(1) then aggregated by hour

Dashboards which use this table

Past Pageviews (Past 12 Weeks)

Known issues and changes

This article is issued from Mediawiki. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.