Event Calendar

Oct
23
Fri
2015
Vilhuber @ CAED 2015: “Usage and outcomes of the Synthetic Data Server” @ Comparative Analysis of Enterprise Data (CAED) 2015 Conference
Oct 23 @ 08:30 – Oct 25 @ 14:15

“Usage and outcomes of the Synthetic Data Server,” Lars Vilhuber (NCRN, Cornell University) and John Abowd (NCRN, Cornell University) 

The Synthetic Data Server (SDS) at Cornell University was set up to provide early access to new synthetic data products by the U.S. Census Bureau. These datasets are made available to interested researchers in a controlled environment, prior to a more generalized release. Over the past 5 years, 4 synthetic datasets were made available on the server, and over 100 users have accessed the server over that time period. This paper reports on interim outcomes of the activity: results of validation requests from a user perspective, functioning of the feedback loop due to validation and user input, and the role of the SDS as an access gateway to and educational tool for other mechanisms of accessing detailed person, household, establishment, and firm statistics.

Tickets: http://caed2015.sabanciuniv.edu/registration-form.

Dec
1
Tue
2015
FCSM 2015: Total Variability Measures for Selected Quarterly Workforce Indicators and LEHD Origin Destination Employment Statistics in OnTheMap @ FCSM 2015 Research Conference
Dec 1 @ 13:15 – 15:00

“Total Variability Measures for Selected Quarterly Workforce Indicators and LEHD Origin Destination Employment Statistics in OnTheMap”, Kevin McKinney (U.S. Census Bureau), Lars Vilhuber (Cornell University and U.S. Census Bureau), John Abowd (Cornell University and U.S. Census Bureau), Andrew Green (Cornell University)
Abstract

We report results from the first comprehensive total quality evaluation of three major indicators in the U.S. Census Bureau’s Longitudinal Employer-Household Dynamics (LEHD) Program Quarterly Workforce Indicators (QWI): beginning-of-quarter employment, full-quarter employment, and average monthly earnings of full-quarter employees. Beginning-of-quarter employment is also the main tabulation variable in the LEHD Origin-Destination Employment Statistics workplace reports as displayed in OnTheMap (OTM). The evaluation is conducted using the multiple threads generated by the edit and imputation models used in the LEHD Infrastructure File System. These threads conform to the Rubin (1987) multiple imputation model. Each implicate is the output of formal probability models that address coverage, edit and imputation errors. Design-based sampling variability and finite population corrections are also included in the evaluation. We derive special formulas for the Rubin total variability and its components that are consistent with the disclosure avoidance system used for QWI and LODES/OTM workplace reports. These formulas allow us to publish the complete set of detailed total quality measures for QWI and LODES. The analysis reveals that the three publication variables under study are estimated very accurately for tabulations involving at least 10 jobs. Tabulations involving three to nine jobs have acceptable quality. Tabulations involving one or two jobs, which are generally suppressed in the QWI, have substantial total variability but their publication in LODES allows the formation of larger custom aggregations, which will in general have the accuracy estimated for tabulations in the QWI of similar magnitude.

FCSM 2015: Two Perspectives on Commuting and Workplace: A Microdata Comparison of Home to Work Flows Across Linked Survey and Administrative Files @ Federal Committee on Statistical Methodology (FCSM) 2015 Research Conference
Dec 1 @ 15:15 – 17:00

“Two Perspectives on Commuting and Workplace: A Microdata Comparison of Home to Work Flows Across Linked Survey and Administrative Files,” Andrew Green (U.S. Census Bureau, Cornell University), Mark Kutzbach (U.S. Census Bureau), Lars Vilhuber (U.S. Census Bureau, Cornell University)

Dec
2
Wed
2015
Vilhuber presents at FCSM 2015: Crowdsourcing Codebook Enhancements: A DDI-based Approach @ Federal Committee on Statistical Methodology (FCSM) 2015 Research Conference
Dec 2 @ 08:30 – 10:15

“Crowdsourcing Codebook Enhancements: A DDI-based Approach”
Benjamin Perry (Cornell University), Venkata Kambhampaty (Cornell University), Kyle Brumsted (McGill University), Lars Vilhuber (Cornell University), William Block (Cornell University)

Dec
3
Thu
2015
FCSM 2015: “Formal Privacy Protection for Data Products Combining Individual and Employer Frames” @ Federal Committee on Statistical Methodology (FCSM) 2015 Research Conference
Dec 3 @ 10:30 – 12:15

“Formal Privacy Protection for Data Products Combining Individual and Employer Frames”, Ashwin Machanavajjhala (Duke University), Samuel Haney (Duke University), Matthew Graham (U.S. Census Bureau), Mark Kutzbach (U.S. Census Bureau), Lars Vilhuber (Cornell University and U.S. Census Bureau), John Abowd (Cornell University and U.S. Census Bureau)

Nov
30
Wed
2016
Lars Vilhuber: “Disclosure Limitation and Confidentiality Protection in Linked Data” @ Centre interuniversitaire de recherche en analyse des organisations
Nov 30 @ 08:30 – 14:00

Lars Vilhuber speaks about “Disclosure Limitation and Confidentiality Protection in Linked Data” at the Center for Interuniversity Research and Analysis of Organizations’ conference on “Facilitate the access to Quebec data: How and to what ends?” The conference is jointly organized with the Quebec inter-University Centre for Social Statistics (QICSS). The presentation relies on joint work with John M. Abowd and Ian M. Schmutte.
[Presentation]