Home > Data Services > Catalog . Restricted Data . Census . ACS

Search Data Services

Meta Search
search across all the following databases:

Data Catalog
Data and documentation

KnowledgeBase
Common questions and answers.

Resources
Entire collection of data resources.


Latest Data News

RSS Feed icon

The Antidote for “Anecdata”: A Little Science Can Separate Data Privacy Facts from Folklore

Big Data: NYC Taxi Cab Trips

Getting A More Accurate Count Of Arab Americans

Measuring Race in the Census: Its Fluid

Expansion of Free Lunch Could Have a Negative Effect on Research Data

Data Services Knowledge Base: Data Management Questions

  1. I'm interested in looking at birth weight as a measure of child well-being. I can only find this in the Child Development Supplement (CDS), but this is only available from 1997 on.
  2. The PSID is not based on a simple random sample. What variables should I use for complex sample survey variance estimation?
  3. How does one handle weights in a longitudinal analysis with differential sample attrition over time, etc? I am using selected years in the NLSY.
  4. I need to know if there is a good source to look up 2000 PUMA geography via maps or lists. I need to get an idea of how different metros and counties line up with PUMAs.
  5. I am unsure how to merge geographic data to a file that I have. I noticed that the census tract numbers do not stand alone in the data file that I downloaded from the census and that the GEO variable is saved as a text variable so I am unable to merge my data sets using that variable. Would you be willing to show me how to link these two data sets if all I have is the census tract number?
  6. Has anyone combined the 1999-2002 data with the 2003-2004 NHANES data? I'm using the 4yr weights for the 99-02 but I'm not sure what to do with the 2yr weights on the 2003-2004 data - there don't appear to be updates with 6yr weights. How do I handle the weights when combining 03-04 with the earlier data?
  7. I am having trouble merging selected files in the Health and Retirement Study (HRS). Some of the IDs are string variables and some are numeric. I am using stata. I can convert the string ID to numeric with the following command: generate HHIDNO = real(HHID); However, when I try to convert the numeric ID back to string, something is not working and thus I cannot merge with the main file where the ID is a string variable. Here is the command I used to convert the numeric ID back to a string ID: generate HHID = " " + string(HHIDNO);
  8. Do you know where I can get a zipcode-to-county crosswalk online database? I have it for the state of Michigan but now I'd like one for the entire U.S. if possible. There are certain sites that have all states, but I would not want to download the files one state at a time.
  9. How do I read SAS export files?
  10. Do you know where I can find a list of the variables that are available in the summary files for the 2000 census down to the block level? I should know the answer to this by now.
  11. Do you know if there is a way to construct a Hispanic measure in the 1960 PUMS? I assume by the 1970 census there is a Hispanic category. Is that right?
  12. What were the MSAs the Census Bureau used in the 1990 census? There is some ambiguity as to what the definitions of MSAs were for publications and tables that came out of the 1990 census.
  13. The PSID changed their sample in 1997 - reduced the number of original families that are included in the survey. How does this affect the weights?
  14. I am having trouble measuring density for zip codes. I have pulled off land area and population size from the national SF1 file, but the results I'm getting are way too small.
  15. I can not find the 2000 version of the 1990 “School District Data book." I know that there is a 2000 version, since the National Center for Education Statistics has it on their website: [http://nces.ed.gov/surveys/sdds/index.aspx](http://nces.ed.gov/surveys/sdds/index.aspx) The NCES interface is extraordinarily cumbersome to use.
  16. I need national data from Summary File 3 from the 2000 Census. I need data at the county, census tract, and block level – summary levels 40, 140, and 150). This looks like a lot of files to download for just these summary levels. Is there a better solution?
  17. I am working with the National Comorbidity Survey Replication (NCS-R) and I want to know the exact sampling method, universe eligibility, etc. for the following items: TB15L: Self report of tobacco as causation of emotional problems SC21, SC22, SC23: Variables related to depression DA31B_101: Religious preference DA40: Age of mother when you were born PEA52: Personality question - “I often feel empty inside"
  18. The 1970 natality detail file does not have FIPS codes to identify states and counties. It has NCHS codes. Is there a crosswalk between the two?
  19. What level of geography do natality detail files go down to?
  20. I am merging two data sets and end up with too few variables. What is going on?
  21. I have a zip code file that has all zip codes in the nation that I am merging to a person file. When I look at the merged file I see that there is no zip code information for the respondents who live in zip code 48103, which is a valid zip code. Why didn’t the merge work?
  22. I am considering using Geolytics Neighborhood Change Database for some longitudinal analyses that I am doing with death certificate data geocoded at the county level. However, I am a little unclear if the NCDB has info for nonmetro counties. The documentation is a bit confusing on this point. Could you advise?
  23. Why are there zero weights in the 1990 public use microdata file for the U.S. census?
  24. I want to create a kml file using zip code data and boundaries. Can you point me in the right direction?
  25. I am working with a zip code file from the Census Bureau. Some of the zip codes have XX or HH in the code, e.g., 298HH. How can I get rid of these zip codes? What does the HH/XX code(s) represent anyway?
  26. I have some census characteristics that I want to put into GIS. However, I am having trouble doing so because the FIPS codes in my data are represented as numbers; e.g., 1 for Alabama instead of 01. Likewise, Autauga county is represented as 1 instead of 001. I need an ID variable for Autauga county, Alabama that looks like 01001. How can I convert numeric data back to text in Excel? Right now, state is in one column and county is in the next column.
  27. How should I define my variables in Stata? Specifically, what are the options for character, integers and real numbers?
  28. I need to know what the boundaries of metropolitan areas were at the time of the 2000 census. How are metropolitan areas defined? As an example, I need to know Albuquerque, New Mexico, Atlantic-Cape May, New Jersey, and Chicago, Illinois.
  29. Why do the counts from SF1 and SF3 differ?
  30. I am looking for some software that can take an address on a single line with perhaps an error in the street name and return a corrected address with a zip code. For instance: 1501 Washtenaw Ann Arbor, MI would be corrected to: 1501 Washtenaw Ann Arbor, MI 48104-3179
  31. I am using the IPUMS data and for 1970 the only file available is the 1970 metro, form 2. Is this a 1% file?
  32. I am looking for census data by gender on age, marital status, education, employment status, income, and race for congressional districts for each Congress from the 103rd to the 110th.
  33. I have looked at census data on commuting patterns, but there is no information on the characteristics of the commuters. How can I get data on commuters? I am interested in the townships surrounding Philadelphia. I also need maps that show townships for Pennsylvania. I'd like the maps to include major roads.
  34. I want to use the NLS97 cohort for a study of religious affiliation of youth. I am having trouble finding religious preference.
  35. I want to combine two years of the March CPS to compare with data from a survey that took place over a two-year period. The survey took place in 1994 and 1995.
  36. Does the American Community Survey (ACS) have data for zip codes?
  37. Is there data for Washtenaw county in the ACS yet? How many counties in Michigan are available?
  38. I am trying to match data for 2006 and 2007 from the March CPS so that I can track respondents across the two years. I downloaded the NBER matching files, but they don't seem relevant to data this recent. I also read the information about matching in the CPS codebook, but it's not clear to me how to operationalize their suggestions. Would it be possible to get some assistance with this issue?
  39. I need an indicator of rurality for zip codes. Are you aware of something I could use?
  40. I am trying to merge two files using SAS and I am getting a message that says "variable stcnty has been defined as both character and numeric." The merge did not work.
  41. I have a neighborhood that I define by census tracts. How can I create characteristics for these neighborhoods in SAS
  42. I need many tables from the summary census data for 1990 at the census tract and block group level. I find American FactFinder difficult to use because I can only get data for a single tract at a time. I need all census tracts and block in Los Angeles county. Is there a better solution?
  43. What’s the difference between a PMSA and an MSA?
  44. Where can I look at a map of census tracts for Manhattan? I am interested in the area above 110th street.
  45. How do I get a chi square?
  46. Can a person use excel data in SAS?
  47. Can I identify the lower 9th ward in New Orleans by zip code or census tracts?
  48. I am having trouble merging person records to the household record. There are around 5,000 person records, but when I finish my merge, I end up with almost 15,000 observations. Here's my code: use "W:\youth.dta" merge hhid using "W:\hhld.dta"
  49. I want to combine the 2005 and 2006 ACS microdata. What do I do to the weights?
  50. I have a student looking to measure the geographic dispersion of families (at least in the U.S., and other places if possible), specifically: 1) how many families with children at home have grandparents who are out of state and 2) how many divorced parents with children at home are not co-located in the same state (e.g., child lives with one parent and the other is out of state or child spends time with each parent in separate states)." There seem to be lots of measures of an individual household's mobility (from the U.S. Census and CPS per se), and the U.S. Census even has a measure of grandparents living in the same household with children. Do you know of any nationally representative data that will fit the bill?
  51. What are the geographic areas available for the 2007 ACS?
  52. I am using data from selected zip codes in California for 1980 to 2000. How can I tell if the boundaries for these zip codes have remained the same?
  53. I am using data from the American Community Survey (ACS) for Monroe and Lenawee counties in Michigan. The tables from American Factfinder have margins of error for all the cells. However, if I am combining the two counties is there a way to calculate new margins of error based on this larger population?
  54. I am trying to merge a file with some restricted use items with the public use version of the data. I'm using stata. When I run the merge, I got a message saying "merge already defined." Is this because some of the variable names are the same?
  55. I have duplicate variable names in two data sets that I would like to merge using stata. Do I have to go through and change variable names?
  56. I downloaded the separate Cape Area Study Panel Waves 1-2-3 Household and Waves 1-2-3 Young Adult data and now I would like to merge them. How do I do that?
  57. I am using a restricted data file that has zip codes for the geocode ID. I need to add some race-specific characteristics to the zip codes. However, zip codes are not iterated by race the way other geographies are (e.g., states, counties, census tracts). Is there a way around this?
  58. I use NHGIS to get summary data from the census - not just historical data, but even the 2000 Census. However, when I try to get data for zip codes, I can only get data for one zip code at a time. Is there a solution for this?
  59. Do you know if there is any advantage for a county to be a member of a metropolitan statistical area (MSA) in terms of either government funding, business opportunities or any other economic/financial advantage?
  60. If I have a list of 1,000 latitude and longitude points can I get a report out of ArcGIS with the census tract associated with this point?
  61. What proportion of American Community Survey (ACS) interviews end up in the ACS microdata samples?
  62. I have run into an ArcMap mxd file that has a cell that I am interested in joining to that is defined as 'string'. This is happening in the counties template within my ArcGIS version 9. The cell has a 5 digit value representing a state and county mixed FIPS code (i.e. 26183 ) How do I go about converting the cell to a numeric one?
  63. I have an excel file that I want to join to an Arcmap template. My file has a 'previous sample' column that has 1 and 0 values in it. Every time I try to do the join the 'previous sample' variable is not listing in my attributes table. What is going on?
  64. What is the difference between OCCSOC and OCCCEN in the 2000 census (and the ACS)?
  65. What is the difference in the standard occupational classification system over time?
  66. How can an intercensal estimate change? I have an estimate from a P-25 report for July 1, 1977 and it does not agree with what is on the Census Bureau estimation web site.
  67. I am trying to determine census tract changes between 1980 and 1990. Can you point me to something?
  68. Where can I find out more information on the quality of the data used in the American Community Survey?
  69. How can I make a map without having special software?
  70. How do I create a map using ArcGIS?
  71. I need more information on IPUMS. Where can I find it?
  72. What are the sizes of the geographic units for the 2006 ACS and how are they determined?
  73. How do you create county level data using microdata files?
  74. What are the sizes of the geographic units for the 2007 ACS and how are they determined?
  75. What types of institutions are considered "group quarters" in the 2006 ACS? How will the inclusion of GQ affect comparisons with previous ACS?
  76. How do I use American FactFinder?
  77. We've had some interest on campus in developing a data archive for data generated by research performed by the college's students and faculty. We're working on developing procedures for migrating disclosure risk for human published guidance on what is an acceptable disclosure risk level. My statistical consultant informs me that it is mathematically impossible to get the disclosure risk (as measured by mu-Argus) down to 0, but nobody seems to be able to tell me what level of disclosure risk other archives consider appropriate , what level the U.S. government uses when preparing its microdata products, etc.
  78. When will new zip code data be made available?
  79. I want to create an annual file with census-type characteristics of counties in California. What is the best source for this? This needs to be current, but I want it updated every year.
  80. How does one read strings in SAS? For instance, if I am reading in a name like Washtenaw County in a pipe delimted file?
  81. Is it possible to define metro areas by zip codes? We have a dataset with zip codes and would like to tie the respondent to the appropriate metropolitan area.
  82. Can you help me convert ICPSR 9619 into SPSS?
  83. I would like to merge voting district shapefiles for counties into a state file. I could do this manually in ArcGIS, but it seems like it would take a long time. Is there a solution?
  84. Can one interpolate between the 2000 and 5-year ACS (2005-2009) to get characteristics of census tracts? In other words, can the 5-year data be thought of as a snapshot for 2010?
  85. I am getting an error in stata “outcome does not vary” when I am trying to run a logistic regression.
  86. I am trying to run a SAS job on my American Community Survey file of 15 million cases. In my log file I am getting an error that reads "Insufficient space is available to run job." How do I remedy this?
  87. What is up with the allocation item for earnings in the March CPS? It drops from a reasonable 22% to around 2% between 1987 and 1988. I am using data from IPUMS-CPS (qincwage).
  88. I am using Social Explorer to get zip code data. I only need zip codes from the state of New Jersey. The only option through Social Explorer is requesting ZCTAs for 3-digit zip codes. How can I get 5-digit zip codes?
  89. I have census tract data from the 5 boroughs of New York City. The data do not include counties. Will this be a problem?
  90. I have funny results for the distribution of education using 1950 IPUMS data. What am I doing wrong?
  91. I am having trouble merging data from 1990 summary data at the census tract level to a crosswalk from Mable/Geocorr.
  92. I am examining the relationship between metropolitan-level foreclosure and racial residential segregation in U.S. American cities between 1990 and 2010. I am including characteristics which can be found in decennial census surveys and ACS estimates. One variable absent from these sources is the age in which the largest city in the metro reached a population size of 50,000. I was curious to know of any source that has already compiled this.