Integrating Big Data and Administrative Sources for Estimating Vehicle Mileage and Analyzing Road Traffic Accidents

Marco Broccoli (a), Silvia Bruzzone (b) and Riccardo Giannini (c)

(a) Istat, Italian National Institute of Statistics, Directorate for Methodology and Statistical Process Design
(b) Istat, Italian National Institute of Statistics, Directorate for Social Statistics and Welfare
(c) Istat, Italian National Institute of Statistics, Directorate for Information Technologies

Web Intelligence Network Conference: From Web to Data
Statistics Poland, Arche Dwór Uphagena Hotel, Gdańsk (Poland), 4-5 February 2025

Presentation Outline

Project Target
Identification of the Big Data Source
Procedural Workflow for Massive Web Scraping
The Technology Behind an iMacros-Based Macro
Selection of Vehicle Categories
Software Architecture of the Project
Output Generated by Web Scraping
Methodology Applied for Validation
Volumes of the Comparative Administrative Data Source
Verification of Results
Conclusions

BROCCOLI-BRUZZONE-GIANNINI-ISTAT

Project Target

The goal is to estimate the average mileage covered by vehicles listed for sale, segmented by type, emission class, fuel type, province (or city of sale), and other statistically relevant attributes.

This data will be compared with the variables present in the Public Motor Vehicle Registry (PRA) and the Vehicle Inspection Archive, provided by the Ministry of Infrastructure and Transport (MIT).

Estimating vehicle kilometers traveled (VKT) on the national road network is part of a broader project. The ultimate aim is to estimate traffic flows and the real exposure risk rates for road accidents.

The project also seeks to compare and integrate data from administrative sources and Big Data to test the potential and validity of both sources.

The adde