Vilhuber @ CAED 2015: “Usage and outcomes of the Synthetic Data Server”

23 October 2015 @ 08:30 – 25 October 2015 @ 14:15
Comparative Analysis of Enterprise Data (CAED) 2015 Conference
Şht. Muhtar
taksim istanbul apart, 34435 Beyoğlu/İstanbul
Print Friendly, PDF & Email

“Usage and outcomes of the Synthetic Data Server,” Lars Vilhuber (NCRN, Cornell University) and John Abowd (NCRN, Cornell University) 

The Synthetic Data Server (SDS) at Cornell University was set up to provide early access to new synthetic data products by the U.S. Census Bureau. These datasets are made available to interested researchers in a controlled environment, prior to a more generalized release. Over the past 5 years, 4 synthetic datasets were made available on the server, and over 100 users have accessed the server over that time period. This paper reports on interim outcomes of the activity: results of validation requests from a user perspective, functioning of the feedback loop due to validation and user input, and the role of the SDS as a access gateway to and educational tool for other mechanisms of accessing detailed person, household, establishment, and firm statistics.