Data Feast, Privacy Famine: What Is a Healthy Data Diet?
Chief Privacy Officer & General Manager of Data Systems, Intelius, Inc.
The big data feast is upon us, but are we just gorging on junk food? Is there sufficient awareness, control, context, fairness, and history to keep us from bloating our collective waistlines? Would we know a healthy data diet if we saw one? This talk will draw parallels between food and data, examining how science, business, and societal values shape how we produce, consume, regulate, and think about both. For food, understanding starts with the science of chemistry and biology. From this understanding grow the culinary arts, social rituals, and societal values around how we consume food. Businesses determine how food is produced and marketed, and governments regulate abuses.
Data is certainly more nuanced and abstract than food, but it is following a similar trajectory. For example, social media data has exploded amid disruptive information technology, massively parallel computing, and machine learning. Now the challenge is to fortify social media with our societal values (like discretion, disclosure, fairness, equality) that have governed every media innovation since the invention of parchment. Some believe that data and privacy are inversely related—that is, with more data comes less privacy. That is not necessarily true. Privacy isn't just about data that's breached a security wall. For data that wants to freely flow, privacy is about respecting boundaries and defining appropriate uses. Responsible innovation will mean healthier data use in line with long-held social traditions.
Data is the new medium of social communication and is forcing a healthy debate to define public/private boundaries, fair access, and appropriate use. Like food, social communication (and the data that drives it) is a necessity for humanity's survival. This talk will discuss the key ingredients to avoid the empty calories.
Crowdsourcing Big Data
Chairman and Co-founder, CrowdFlower
In this presentation, Lukas Biewald will discuss how crowdsourcing provides channels for researchers, businesses, or even armchair social scientists to gather large amounts of data overnight rather than waiting years. Traditional means of data collection are often time consuming, tedious, and flawed. Biewald will demonstrate how crowdsourcing, utilizing robust quality-control mechanisms, offers a faster, more accurate, and scalable solution.
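One common family of quality-control mechanisms combines redundant labeling with hidden gold-standard test questions. The sketch below is purely illustrative (the function names and data are hypothetical, not CrowdFlower's actual pipeline): each item is labeled by several workers and aggregated by majority vote, while workers are scored against test questions with known answers.

```python
from collections import Counter

def majority_label(judgments):
    """Aggregate redundant worker judgments for one item by majority vote."""
    counts = Counter(judgments)
    label, _ = counts.most_common(1)[0]
    return label

def worker_accuracy(worker_answers, gold):
    """Score a worker against hidden gold-standard test questions.

    worker_answers and gold map item ids to labels; only items that
    appear in the gold set are scored. Returns None if none were seen.
    """
    scored = [item for item in worker_answers if item in gold]
    if not scored:
        return None
    correct = sum(worker_answers[item] == gold[item] for item in scored)
    return correct / len(scored)
```

In practice a worker whose gold-question accuracy falls below a threshold would have their judgments discarded or down-weighted before the vote.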
Data Science in Education and for Discovery
Professor of Astrophysics and Computational Sciences, George Mason University
I will discuss the rise of data science as a new academic and research discipline. Data-intensive opportunities are growing significantly across the spectrum of academic, government, and business enterprises. In order to respond to this data-driven digital transformation, it is imperative to train the next-generation workforce in the data-science skill areas. Among these skills are knowledge discovery and information extraction from massive data collections. I will describe some of the techniques that we are applying both in research (for scientific discovery) and in the classroom (to engage students in inquiry-driven evidence-based learning). Specific examples of surprise detection in big data will be presented.
IPUMS International—Building a Census Data Time Machine
IT Core Director, Minnesota Population Center, University of Minnesota
The IPUMS-International project has collected nearly four hundred million person-records of census data from around the world, with over thirty thousand unique variables. This data comes from many sources and in many forms, but we make it comparable across time and location. This session will cover how we organize and integrate the data; how metadata are created, organized, and processed; and how our processes have fared as the project scales.
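Making census variables comparable across time and place typically means recoding each source's scheme into one integrated coding scheme. A minimal sketch of the idea, with entirely hypothetical sample names and code maps (IPUMS's real harmonization metadata is far richer):

```python
# Integrated coding scheme for a marital-status variable (hypothetical).
MARST_HARMONIZED = {1: "single", 2: "married", 3: "widowed", 4: "divorced"}

# Each census sample uses its own source-specific codes; harmonization
# maps them onto the integrated scheme. Sample names and codes are made up.
RECODE = {
    "mx1990": {0: 1, 1: 2, 2: 2, 3: 3, 4: 4},   # numeric source codes
    "fr1999": {"C": 1, "M": 2, "V": 3, "D": 4},  # letter source codes
}

def harmonize(sample, source_value):
    """Translate a source-specific code into the integrated coding scheme."""
    return RECODE[sample][source_value]
```

The point of the design is that analysis code only ever sees the integrated codes, regardless of which of the many sources a record came from.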
The Role of Visualization and Citizen Science in Astronomy
Archive Scientist, Space Telescope Science Institute
Like many other scientific disciplines, astronomy has witnessed tremendous growth over the past two decades. As a result, astronomers have become very efficient at creating massive datasets that describe the properties of our nearby universe. Given its primarily visual focus, and its potential to address fundamental questions about humanity, astronomy is in the unique position to be an ideal testbed for algorithms and techniques that address "big data" problems. As a result, the astronomical community, in coordination with all NASA data centers, is trying to cope with "big data" by making use of novel approaches to data mining, data visualization, and data distribution.
Here I will present two applications that showcase how astronomers are tackling the problems associated with the wealth of data at their disposal. First I will describe how GPUs, coupled with clever image processing and Mathematica, are helping the search for extraterrestrial planets. Then I will show how citizen science is changing the social fabric of astronomy and redefining what scientific questions can be addressed by large datasets.
A Rapid-Learning Health System: Using Electronic Health Records and Apps
Director, Rapid Learning Project, George Washington University
This talk will update progress toward a national rapid-learning health system, using tens of billions of dollars of public investment for electronic health records, patient registries, and learning networks. In particular, the talk will highlight a national apps strategy, via public policy and marketplace developments, as a creative new approach to collect many more data modules and to generate maximum benefits for many more users and uses. Specific references will likely be made to leading-edge developments such as the forthcoming Real-Time Oncology Network; the National Cardiovascular Research Infrastructure; an in-the-works National Quality Registry Network; and to selected National Institutes of Health, Centers for Disease Control and Prevention, and Food and Drug Administration databases. The talk will also describe the evolving strategy of "rapid cycle" learning that will use ten billion dollars in the Center for Medicare and Medicaid Innovation funds to test, pilot, and roll out new research findings and best practices into the healthcare system as part of a continuous learning cycle.
ACCRA Cost of Living Index—A Private Data Collection Effort since 1968
COLI Project Manager, C2ER
The Council for Community and Economic Research (C2ER) produces the ACCRA Cost of Living Index (COLI) to provide a useful and reasonably accurate measure for comparing cost-of-living differences among urban areas in the United States. This session will present the COLI methodology, data-collection procedures, and quality-control measures. It will also demonstrate how and where the index data can be used.
A New and Old View of Computing and Data
David Alan Grier
Associate Professor of International Science and Technology Policy, Elliott School of International Affairs
For the past 70+ years, our view of computing and hence our view of data has been locked to the finite discrete automata, the idea behind Alan Turing's abstract machines and John von Neumann's more physical ideas. With crowdsourcing, we are being pushed back into a processing model that flourished during the years that Turing and von Neumann developed their ideas of computation. This model, which was used heavily by the Works Progress Administration, employed large numbers of workers and labor markets to handle computational and data processing problems. It is a natural extension of the classical finite discrete automata. It provides new capabilities and new ways of conceiving data, but it also suggests new limitations to the nature of computation and data gathering.
Managing Technical Talent: How to Find the Right Analyst for Your Problem
In an age of big and complex data and myriad analytical techniques, the governance of technical expertise is a critical issue, yet it is rarely given serious consideration. Competitions generate much-needed objective information about which analysts and techniques work best in specific situations. A single data scientist can do well on a problem, but how can the best one be found? Competition also adds fresh eyes and new ideas and elicits greater effort (the Roger Bannister effect). Kaggle has hosted competitions that have raced to the frontier of what's humanly possible in areas as diverse as prioritizing preventative health care, designing games-rating systems, and predicting traffic flow.
The Need for Data Standards: How the InChI Project Is More than Just a Standard for Chemists
Project Director, InChI Trust
InChI, the IUPAC International Chemical Identifier, was developed to be an open-source, computer-readable standard for representing chemical structures in the modern world (e.g., internet and search engines). InChI is more than a standard, because what we need is not just any standard; what we need is an arbitrary standard that can, in practice, be used by everyone. For practical (i.e., political) reasons we need a standard that does not conflict with any existing structure representation that any person, group, or organization is currently using. InChI is not a replacement for what is currently being used by anyone. Ninety-nine percent of the value of InChI is its unique ability to link information from diverse sources—chemical, physical, biological, environmental, medical, and so on. If everyone adds the arbitrary standard InChI and InChIKey to their computer-readable record of information, this will greatly improve the search for information and knowledge. InChI is designed to maximize access to information and data internally and on the internet (fee or free) in the most effective manner.
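The linking idea is simple: because the InChIKey is a fixed-length identifier derived from the full InChI, records in unrelated databases that carry the same key describe the same structure and can be joined on it. A minimal sketch, using placeholder strings rather than real InChIKeys, and hypothetical record fields:

```python
# Two independent, hypothetical data sources, each annotating records with
# an InChIKey. The key below is a placeholder, not a real InChIKey.
chemical_db = {"AAAAAAAAAAAAAA-BBBBBBBBBB-N": {"name": "compound X", "mw": 194.19}}
toxicity_db = {"AAAAAAAAAAAAAA-BBBBBBBBBB-N": {"ld50_mg_kg": 192}}

def link(key):
    """Join records from diverse sources on a shared InChIKey."""
    merged = {}
    for db in (chemical_db, toxicity_db):
        if key in db:
            merged.update(db[key])
    return merged
```

No source has to change its own internal representation; each simply adds the key alongside whatever it already stores, which is exactly the "arbitrary standard" argument the abstract makes.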
Metadata Standards and XML Technologies for Unlocking Statistical Data
Vice President, Metadata Technology/Open Data Foundation
As demand for socioeconomic data, health data, and official statistics continues to grow, government agencies, international organizations, data producers, and research centers are under increased pressure to make data more widely available to researchers, stakeholders, or the general public. This presents significant challenges, as such information cannot always be easily released. Fundamental statistical principles and national legislation require the custodians to protect the privacy of the underlying respondents and ensure the data is used according to its intended purposes. Data by itself is also of limited usefulness unless it is of high quality and accompanied by comprehensive documentation. Effectively and responsibly providing access to statistical data is not a trivial task.
At the same time, funding agencies around the globe are formulating new policies encouraging or requiring researchers to provide data management plans or strategies as an integrated component of their proposals. This aims to encourage knowledge sharing, collaboration, and open access to publicly funded research data. While sound, these policies raise new challenges for individual data users.
Fortunately, the past decade has seen the emergence of metadata standards, best practices, and technologies that can facilitate such processes. Specifications such as the Statistical Data and Metadata Exchange standard (SDMX) or the Data Documentation Initiative (DDI) have come to maturity and are rapidly being adopted by agencies and individuals around the globe. Unlike in other domains, these standards have been widely endorsed and have the advantage of facing little or no competition. Numerous tools and platforms are also becoming available to facilitate the management, discovery, access, or analysis of data. Combined, these provide powerful instruments to realize effective and secure data preservation, dissemination, exchange, and sharing solutions.
Our presentation will summarize the challenges of providing access to statistical data; outline the standards and technology landscape surrounding socioeconomic data, health data, and official statistics; provide an update on recent achievements; and highlight ongoing initiatives around the globe.
Data Modeling among Non-programmers
Data Architect, Danish Commerce and Companies Agency
Data modeling is in practice an interdisciplinary and group-based activity that leads to a symbolic representation of selected aspects of a domain. Efficient and adequate physical implementations of data models require some understanding of computer programming. Unfortunately this understanding does not come easy and often involves years of practical experience. Based on experiences from within two different domains—functional genomics (genes, diseases, and patients) and government data (citizens, cars, and businesses)—I will try to highlight three concepts that have proven difficult but valuable to introduce to the domain experts.
Commercial Search Engine Developers and Universities: A Critical Time for Collaboration in the Coming Age of Publicly Accessible Research Data
Research Data Management Librarian, Cornell Institute for Social and Economic Research
Driven by new data-sharing requirements from funding agencies, most recently and notably the National Science Foundation, academic researchers are on the verge of making rapidly increasing amounts and varieties of research data available for replication of findings and reuse. Universities are now building or enhancing repositories to help researchers make their data available, and are employing and helping develop domain-specific metadata standards, such as the DDI, to aid in the discoverability and manageability of these datasets. However, with the growing amount of data and number of repositories, the risk of "data silos" increases as well. Providers of commercial search engines must join the current efforts of global, web-scale data discovery—otherwise, the usefulness of the search engines and the research data generated with public funding are both at risk.
From Dollars to Ideas: New Tools for Measuring Influence
Director of Sunlight Labs, Sunlight Foundation
To date, analytic examinations of the problem of political influence have centered on the flow of money through mechanisms like campaign contributions, contracts, and earmarks. But financial transactions are only one signal that can be used to detect when someone has gained an inappropriate amount of control over our political institutions. The Sunlight Foundation's Tom Lee will discuss new tools and datasets for tracking the manipulation of government through the systematic use of language and ideas.
How to Compare One Million Images? Visualizing Patterns in Art, Games, Comics, Photography, Cinema, Animation, Web, and Print Media
Professor, University of California, San Diego (UCSD)
The explosive growth of cultural content on the web, including social media and the digitization work by museums, libraries, and companies, makes possible a fundamentally new paradigm for the study of cultural content. We can use computational data analysis and new interactive visualization techniques to analyze patterns and trends in massive cultural datasets. We call this paradigm cultural analytics. I will show examples of visualizations of patterns in cinema, animation, video games, magazines, and comics created in our lab (softwarestudies.com) at the University of California, San Diego (UCSD) and California Institute for Telecommunications and Information Technology (Calit2). The presentation will highlight new visualization techniques for big data that use next-generation scalable displays such as the HIPerSpace system, which offers a resolution of 35,840 × 8,000 pixels.
Statistical Abstract of the United States: The Value of Data
Branch Chief, U.S. Census Bureau
The presentation discusses the value of the Statistical Abstract to the statistical community, government, researchers, and decision makers. It will also highlight the collaboration between agencies, organizations, and private companies that make up the three hundred sources of data.
Introducing Encyclopedia of Life V2: International, Personal, and Reusable Biodiversity Data
Director, Species Pages Group Encyclopedia of Life, Smithsonian's National Museum of Natural History
EOL connects worldwide audiences with information on the organisms with whom we share our planet. The scope of our task is vast—1.9 million species have already been described over the last few hundred years, and 15 to 20 thousand more are described every year. What we know is constantly changing, and different audiences need information relevant to them, in the languages that they speak. This week marks the launch of a major upgrade at www.eol.org, designed to accelerate and deepen engagement with this unique content curation community. I will present our new features such as virtual collections and communities, data richness scores, and internationalization. EOL V2 addresses the demand for informative contexts, language translation, data quality, content building, and data reuse.
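A data richness score of the kind the abstract mentions can be thought of as a weighted tally of the kinds of content a species page carries. The weights, content categories, and cap below are entirely hypothetical and are not EOL's actual scoring formula; they only illustrate the shape of such a metric:

```python
# Hypothetical weights -- EOL's real richness scoring is more involved.
WEIGHTS = {"text_sections": 2.0, "images": 1.0, "maps": 1.5, "references": 0.5}

def richness_score(page_counts, cap=100.0):
    """Score a species page by weighting the kinds of content it carries.

    page_counts maps a content category to how many items of that kind
    the page has; unknown categories are ignored, and the score is capped
    so one content-rich page cannot dominate the scale.
    """
    raw = sum(WEIGHTS[k] * n for k, n in page_counts.items() if k in WEIGHTS)
    return min(raw, cap)
```

A score like this lets curators rank pages that need attention and lets readers see at a glance how complete a page is.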
Thomson Reuters and Big Data
CTO, Thomson Reuters
Thomson Reuters is the leading source of intelligent information for the world's businesses and professionals. The massive changes in the scale, volatility, and latency requirements caused by big data demand a significant change in the way we build and manage our systems. The sheer volume of information our customers need requires us to think differently about how we build context and apply it to these information sources.
Partner and Head of Visualization, Periscopic
The world is more than what is visible around us. Data visualization is a practice that can generate insight and hasten understanding. By working through two case studies, I will show how data visualization can transform the invisible into rich intelligence.
First, I will illuminate the email traffic sent and received through Yahoo! with a small interactive visualization. I will also demonstrate a forthcoming social media visualization for GE Healthymagination, which looks at conversations about breast cancer.
I will discuss challenges our team has encountered and how we've remedied them.
The Inscrutable Lines of Cause and Effect
Chief Innovation Officer, Demand Media
Why do Academy Award winners live longer than the other nominees, and first basemen outlive other players on the team? Why do children in schools with fluorescent lighting get fewer cavities than those in incandescent-lit schools? The universe is full of non-obvious causal relationships invisible to both the eye and intuition. How might a sophisticated computational answer engine of the future help us find these relationships and thereby cure disease, end poverty, and usher in a new golden age for humanity?
Sports Analytics: Managing and Making Sense of Player-Tracking Data
Associate Vice President, Commercial Products, STATS, LLC
This presentation shares developments in player-tracking technology in sports and the challenge of managing the large volume of new data now available from it. Precise player positioning and movement are now tracked multiple times per second, and this data is used by teams and media to derive never-before-available analysis. This presentation highlights the current and future uses of that data.
Doing Business in the Face of the Information Explosion
Vice President Global Data Strategy, Dun & Bradstreet
The massive rate of change in information availability provides a richness never before available for extracting information about business entities and their related attributes. Sadly, this same tsunami of information also poses a huge challenge to finding and adjudicating the unique identity of a business. When coupled with the increased propensity and sophistication of those who would misrepresent the truth, the problem of business-entity identification becomes increasingly complex. Another trend that compounds the problem is the increased incidence of businesses practicing across borders in different languages and writing systems, thereby forcing them to adopt different personas and nomenclatures. I will discuss these problems and how Dun & Bradstreet is thinking about them in the context of doing business in the face of the information explosion.
Global Health Data Exchange
Director of Data Development, Institute for Health Metrics and Evaluation
The Global Health Data Exchange is IHME's new tool for anyone interested in global and public health data, with a primary objective of increasing discoverability of health-related data and a secondary objective of increasing the amount of data being shared.
Making State Government Data Accessible and Understandable
State Representative, Washington State
The push toward open government data is accelerating, bringing the promise of better public policy and more transparency in government function. This effort shares many of the challenges facing any large enterprise that wants to open up an ocean of information to a vast audience. The audience for this effort is a spectrum from casual observers to policy experts, so a balance between high-level summary and agonizing detail is hard to establish. Information is isolated in departmental silos, with few standards for data presentation. For any given data set, it is difficult to communicate the relevant context or to automatically express its dynamic relationships with other variables. Using examples drawn from Washington State, we will examine some past efforts and consider improvements through approaches such as crowdsourcing, public APIs, and common standards.
Crowdsourced, Collaborative Genealogy
Geni's millions of users have created what may be the largest crowdsourced document in history, a single family tree that connects almost sixty million people. Hear about the technical and cultural challenges that Geni has faced in building its platform and growing its community. Geni also offers free access to its robust dataset through a public API.
Empowering People with Data—Data.gov: What's Now and What's Next
Alan Vander Mallie
Data.gov Program Manager, U.S. General Services Administration
Data discovery, visualization, and exploration are keys to empowerment for people all over the world. Although scientists, researchers, analysts, economists, media, and programmers are all interested in and routinely download government data, the technically untrained citizen or public constituent prefers online interactive exploration and visualization of data rather than downloading. As the nation's front door to U.S. data, Data.gov advances the shared understanding and ingenuity of citizens and keeps government accountable. The first national effort of its kind, Data.gov "democratizes data" and puts it to work in the American people's hands. One operating principle of Data.gov is to meet the public's need for information and knowledge by making data available online using intuitive and familiar web standards for searching, browsing, visualizing, and sharing information. By streamlining publishing, Data.gov is making it easier for agencies to contribute high-value data. By providing more and better data for analysts, journalists, and researchers, Data.gov is leading the development of apps and reports for people to make better-informed decisions.
Drug Efficacy in the Wild
Research Scientist, PatientsLikeMe
Amyotrophic lateral sclerosis (ALS) is a devastating illness that is uniformly fatal, typically within two to four years. I'll describe the online collection and analysis of data from ALS patients who experimented with taking lithium carbonate to slow the progression of their disease. In particular, I'll describe the algorithm we developed to reduce potential bias owing to lack of randomization. Our findings contradicted the results of the research trial that had originally motivated the patients to use this treatment.
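When treatment is self-selected rather than randomized, a common way to reduce bias is to pair each treated patient with an untreated patient whose baseline disease trajectory looks similar, and then compare outcomes within pairs. The sketch below is only an illustration of that general idea (greedy nearest-neighbor matching on a single baseline progression rate); it is not the algorithm PatientsLikeMe published:

```python
def match_controls(treated, controls, tolerance=0.5):
    """Pair each treated patient with the control whose baseline
    progression rate is closest, within a tolerance.

    treated and controls map patient ids to baseline slopes (e.g., points
    lost per month on a functional rating scale). Each control is used at
    most once; patients with no close-enough match are dropped.
    Illustrative only -- not the published PatientsLikeMe algorithm.
    """
    pairs = []
    available = dict(controls)
    for pid, slope in treated.items():
        if not available:
            break
        best = min(available, key=lambda c: abs(available[c] - slope))
        if abs(available[best] - slope) <= tolerance:
            pairs.append((pid, best))
            del available[best]
    return pairs
```

Comparing post-treatment progression within such matched pairs approximates the contrast a randomized trial would provide, at the cost of only controlling for the variables used in the match.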
Financial Data Management
Chief Technology Officer, Morningstar Inc.
The presentation will introduce financial data with a focus on time series data: how they are usually collected, managed, and used in a financial setting. It will include case studies of challenges and solutions on managing the time series data, for example, how the landscape changed when we extended our coverage to the global market.
The Sea Around Us: Seeing Our Past, Present, and Future through Data in Space and Time (Impacts of Fisheries on the World's Marine Ecosystems)
Project Manager & Senior Researcher, Sea Around Us Project, UBC Fisheries Centre; FishBase & SeaLifeBase
The Sea Around Us Project (named after the book by Rachel Carson) at the University of British Columbia develops and uses fully integrated and cross-linked databases on all aspects of global fisheries, both through in-house efforts (e.g., global fisheries catches, fishing effort, water temperature, primary production, marine habitats, and socioeconomic data such as prices, fishing costs, and government subsidies) and through close collaboration with deep-linked datasets on biodiversity from the globally leading resources FishBase and SeaLifeBase. We heavily emphasize and ensure global coverage of all our datasets in time (back to 1950) and space (using 180,000 half-degree latitude-by-longitude cells). This emphasis on complete coverage in time and space, as well as our insistence on comprehensive interconnectivity, has contributed to our globally unique leadership position for assessing, documenting, and communicating the effects of fishing (both ecological and socioeconomic) on societies and marine ecosystems. In contrast to most other agencies dealing with fisheries around the world, ours primarily addresses questions and issues of concern to non-governmental organizations (NGOs). Increasingly, we are being called upon by international agencies (UNEP, FAO, World Bank, WTO, etc.) for input. Interestingly, the entire project, since its inception in 1999 as a global, collaborative effort, has been funded entirely outside both governmental and private-enterprise funding streams, through generous support from the Pew Charitable Trusts, driven by a clear strategic vision. I will illustrate our approach as well as fundamentals of the underlying and derived data streams of both the Sea Around Us Project and FishBase and SeaLifeBase, and place this in data- and content-specific context.