Bridging The Data Divide: Data In The International Context
The theme of our 2015 conference is Bridging the Data Divide: Data in the International Context. Going hand in hand with the well-known digital divide is a growing inequity in access to data. Increasing budget concerns have placed strains on governments, universities, and other institutions in the provision of data services. From the cancellation of the Statistical Abstract of the United States, to the controversy over the Canadian Census long form, to political barriers in the data collection process in some countries, access to data and the data divide presents organizational, economic and educational challenges to the community of data professionals worldwide.
Benjamin Perry, Venkata Kambhampaty, Kyle Brumsted, Lars Vilhuber, & William C. Block: “Crowdsourcing Codebook Development and Enhancements in CED²AR”
Abstract: Recent years have shown the power of usersourced information evidenced by the success of Wikipedia and its many emulators. This sort of unstructured discussion is currently not feasible as a part of the otherwise successful metadata repositories. Creating and augmenting metadata is a laborintensive endeavor. Harnessing collective knowledge from actual data users can supplement officially generated metadata. As part of our Comprehensive Extensible Data Documentation and Access Repository (CED²AR) infrastructure, we demonstrate a prototype of crowdsourced DDI on actual codebooks. While the system itself is more general, the demonstrated implementation relies on a set of linked deployments of the basic software on web servers. The backend transparently handles changes, and frontend has the ability to separate official edits (by designated curators of the data and the metadata) from crowdsourced content. The implementation allows a data curator, such as a statistical agency, to collect and incorporate improvements suggested by knowledgeable users in a structured way.