Archive for the 'Data' Category

Page 2 of 24

Big Data Initiative at NIH-OBSSR

From the Connector blog post:

The NIH Big Data to Knowledge (BD2K) initiative is designed to address these issues and facilitate broad use of biomedical big data through new data sharing policies, catalogs of datasets, and training. Behavioral and social scientists should be aware of several recently-issued RFAs. In these RFAs NIH is requesting applications for Centers of Excellence, Data Coordination Centers, training enhancement, and data facilitation. If you are involved in mHealth, this might be a great opportunity for you, or if you are pooling data for the purposes of GxE interaction studies in the behavioral and social sciences this initiative might also fit you well. Critically consider your current research and ways that Big Data may already be part of your portfolio.

Read the full post
NIH Big Data to Knowledge (BD2K) website

UM Now Has Access to IndiaStat

Via Jungwon Yang:

The University of Michigan Library is pleased to announce that we now access to Indiastat which is a database provides key statistics of India, including census, election, trade, education, health data and more.

To access the data, please click on a link called “IP Login” at the top of the main page.
We subscribe a single user option, so please remind users to logout when they finish to explore the data. ( If a current user does not use the database over 15 minutes, then Indiastat will automatically disconnect the accession of data).

Access IndiaStat here: http://www.lib.umich.edu/database/link/31254

NIH adds substantial set of genetic, health information to online database

Researchers will now have access to genetic data linked to medical information on a diverse group of more than 78,000 people, enabling investigations into many diseases and conditions. The data, from one of the nation’s largest and most diverse genomics projects — Genetic Epidemiology Research on Aging (GERA) — have just been made available to qualified researchers through the database of Genotypes and Phenotypes (dbGaP), an online genetics database of the National Institutes of Health.

Details can found here.

Treasure Trove: US Congressional District Shapefiles, 1789-2012

Why should NSF fund political science? Here’s a great reason:

United States Congressional District Shapefiles
Jeffrey B. Lewis, Brandon DeVine, and Lincoln Pritcher with Kenneth C. Martis

This site provides digital boundary definitions for every U.S. Congressional District in use between 1789 and 2012. These were produced as part of NSF grant SBE-SES-0241647 between 2009 and 2013.

The current release of these data is experimental. We have had done a good deal of work to validate all of the shapes. However, it is quite likely that some irregularities remain. Please email jblewis@ucla.edu with questions or suggestions for improvement. We hope to have a ticketing system for bugs and a versioning system up soon. The district definitions currently available should be considered an initial-release version.

BLS: Budget cut casualties

The FY2014 budget for BLS has cut two important data programs: The Quarterly Census of Employment and Wages and the International Price Program. The latter is a principal economic indicator.

2014 Budget Enacted for Bureau of Labor Statistics
BLS Information Press Release | Bureau of Labor Statistics
February 25, 2014

The following document describes the U.S. Import and Export Price indexes, which clarifies what other government agencies and the business community are losing.

Get to Know a Principal Economic Indicator: U.S. Import and Export Price Indexes
Council of Professional Associations on Federal Statistics | www.copafs.org

U.S. Import and Export Price Indexes
This is the data/information link for the February 14, 2014 release
Data tables for the U.S. Import and Export Price Indexes
Downloadable tables | html version | Archival releases

Blexting: it’s not what you think

Blexting is short for “blight texting.” It is an app that a Detroit-based start-up (Loveland Technologies) created, which is being used to map all Detroit structures to fight blight. Here’s a bit of the coverage of the software and the amazing progress the blexters have made in mapping Detroit blight:

map

Watch: Battling Blight with “Blexting”
Hell Yeah Detroit | Your Online Guide to Being a Better Detroiter
January 26, 2014

Loveland’s passion: Battle blight
Amy Haimerl | Crain’s Detroit
February 19, 2014
Map tech – aka ‘blexting’ – charts growth

Battling Blight: Detroit Maps Entire City To Find Bad Buildings
Quinn Klinefelter | National Public Radio
February 18, 2014

A Picture of Detroit Ruin, Street by Forlorn Street
Monica Davey | New York Times
February 17, 2014

Nerd alert: administrative data, paradata, and BYOD

Bob Groves is no longer the Census Bureau director, but the Census Bureau’s plans for the 2020 Census have many of the elements that he wrote about in the Census Bureau’s Director’s blog and presented at professional meetings. He has had a lasting impact at the Census Bureau.

In an historic move, Census Bureau tries electronic outreach
D’Vera Cohn | Pew Research Center
February 18, 2014
Read the post to find out what BYOD means.

A recent memorandum from the White House, encourages the use of administrative data by federal agencies for statistical purposes. This may prove useful to some of the 2020 efforts.

Guidance for Providing and Using Administrative Data for Statistical Purposes
White House | Office of Management and Budget
February 14, 2014

Finally, the reference to “updated guidance” in the Pew piece sounds quite a bit like paradata used in responsive survey design of the NSFG. The Census enumerates all households so it isn’t a survey, but paradata can guide the data collection process – when to enumerate (weekend or not, evening or not) and when to get data from other sources.

Use of Paradata in a Responsive Design Framework to Manage a Field Data Collection
J. Wagner, et.al. | Journal of Official Statistics

Responsive Survey Design, Demographic Data Collection, and Models of Demographic Behavior
W. Axinn, C. Link, and R. Groves | Demography

Inequality: States & Cities

Here are two reports on inequality – one for states, including historical data and one for the 50 largest cities. The state-based analysis uses state-level tax data whereas the city-based analysis uses the American Community Survey. The city-based study is referenced in a story in the New York Times.

The Increasingly Unequal States of America: Income Inequality by state, 1917 to 2011
Estelle Sommeiller and Mark Price | Economic Analysis and Research Network
February 19, 2013

All Cities Are Not Created Unequal
Alan Berube | Brookings
February 20, 2014
Appendix: Income Inequality in America’s 50 Largest Cities, 2007-2012

Study Finds Greater Income Inequality in Nation’s Thriving Cities
Annie Lowrey | New York Times
February 20, 2014

County-to-County Migration Flows

The Census Bureau has released county to county migration flow data from the 2007-2011 ACS. This allows researchers to look at outbound, inbound, and net migration flows by selected characteristics (education, household income, and individual income).

Perhaps, Governor Snyder had advance access to these data before his state-of-the-state address as the only positive net value for Wayne County (and Michigan) is “movers from abroad.” Wayne County has a net loss of 28,000 to other Michigan counties and a net loss of approximately 17,000 to out of state counties. It has about one-third of Michigan’s movers from abroad (7,620 out of 24,715).

Check out this spreadsheet for Michigan counties – click on image:

spreadsheet

[Click here for Michigan Data Table]

Data, Guides, and Flows Mapper Interface
2007-2011 County-to-County Migration Flows
Megan Benetsky | US Census Bureau
Very useful working paper, which shows the sorts of analyses possible with the data.

County-to-County Migration Flows Tables

Census Flows Mapper

Additional Press
Many New Educated Entrants to Big U.S. Cities Came from Overseas
Neil Shah | Wall Street Journal
February 6, 2014
This article quotes Bill Frey who notes that many of the higher educated migrants to big cities are foreign born.

A Detailed Map of the Net Migration Flows for Every U.S. County
Emily Badger | Atlantic Cities
February 11, 2014

New PUMA boundaries for the 2012 release of ACS data

The 2012 ACS releases (2012, 2010-2012, and 2008-2012) use new boundaries for PUMAs. These new definitions are based on new guidelines established by the Census Bureau as well as results from the 2010 Census.

PUMA Guidelines

The upshot of the guidelines is that the building blocks for PUMAs must be census tracts or counties. PUMAs can no longer be comprised of places or multiple places, especially as in the case of Michigan these multi-place PUMAs were sometimes comprised of non contiguous places.

The Census Bureau also encourages that the newly constructed PUMAs map to metropolitan areas.

The definition for the composition of PUMAs from the Census Bureau’s site is not all that informative. It is an Excel spreadsheet with the geographic identifier and Name of the PUMA, e.g., Northwest Detroit for PUMA 263208.

2010 Census Gazetteer Files: PUMAs

To know which census tracts are “Northwest Detroit” one needs to map census tracts to PUMAs. One can do this via the MableGeocorr site [Source: census tract; Target: PUMA2012].

Of course many PUMAs are comprised of multiple counties or a single county, so that sort of detail is not necessary for them. I will update this post later this week with a crosswalk, which includes “census tracts” for multi-PUMA counties and counties for single/combined county PUMAs.