Data Services Questions by Popularity

  1. I have some census characteristics that I want to put into GIS. However, I am having trouble doing so because the FIPS codes in my data are represented as numbers; e.g., 1 for Alabama instead of 01. Likewise, Autauga county is represented as 1 instead of 001. I need an ID variable for Autauga county, Alabama that looks like 01001. How can I convert numeric data back to text in Excel? Right now, state is in one column and county is in the next column. [280 web views]
  2. I am at the University of Minnesota, Economics department. I am looking for the data on households income across sectors in the year 2003 in Russia. To be more precise, I need the data on low-income households employees compensation and high-income households employee compensation across sectors. If you have any idea what would be a good source, I would greatly appreciate it. [249 web views]
  3. How is the residence of prisoners noted on death certificates - the prison or the original home address? What about other institutional populations (dorms, hospitals, etc.)? [242 web views]
  4. I am generating some age-specific counts for various neighborhoods to compare with data in the Research Data Center (RDC). However, I am having trouble getting these counts to agree with what previous workers on this project have generated. Can you take a look at my numbers vs the project counts for our neighborhoods? [241 web views]
  5. I am having trouble opening an IMF government finance statistics data file. [221 web views]
  6. I am trying to some mortality data for the state of Michigan and Montcalm county, Michigan from 1960 to present. We’d prefer age-adjusted figures over age-specific. Do you have any thoughts on how best to obtain these data? This is for a health disparities documentary and the data are needed immediately. [149 web views]
  7. How should I define my variables in Stata? Specifically, what are the options for character, integers and real numbers? [147 web views]
  8. The PSID is not based on a simple random sample. What variables should I use for complex sample survey variance estimation? [146 web views]
  9. How do I read SAS export files? [146 web views]
  10. I need national data from Summary File 3 from the 2000 Census. I need data at the county, census tract, and block level – summary levels 40, 140, and 150). This looks like a lot of files to download for just these summary levels. Is there a better solution? [144 web views]
  11. We would like access to the original Indianapolis Fertility Survey (1941). Does PSC have these data? [144 web views]
  12. I am looking for estimates of the size of the 15-44 year old female population by state for a far back as I can get it. I am trying to construct fertility rates and have births back to 1930. Even if I can not get them back to 1930, getting them back to the late 1940s would be very helpful. [143 web views]
  13. Do you know if the 1960 census asks the same ‘where did you live 5 years ago’ question that was asked in 1970-2000? [143 web views]
  14. I have a zip code file that has all zip codes in the nation that I am merging to a person file. When I look at the merged file I see that there is no zip code information for the respondents who live in zip code 48103, which is a valid zip code. Why didn’t the merge work? [142 web views]
  15. Can you help me find a public-use codebook for Baccalaureate and Beyond (B&B)and the National Education Longitudinal Study (NELS)? [142 web views]
  16. I need an indicator of rurality for zip codes. Are you aware of something I could use? [142 web views]
  17. I am looking for the total number of births (no rates), 1975-present, by state. I need for this to be in electronic form if possible. [140 web views]
  18. I am working with NELS data (base year, first follow-up and second follow-up). When I tried to extract the variables from raw data using the SAS setup cards provided by ICPSR, I do not get the right number of cases – way too few. [140 web views]
  19. I would like to get a copy of the Census 1990 elderly households 8% sample dataset and codebook. Please let me know if it is available and the procedure for obtaining it. [138 web views]
  20. I need data from several years of the March Current Population Survey but notice that these data do not have set-up files associated with them. [138 web views]
  21. I am using microdata from the 2000 Census of Population and Housing. I have found some families with a code of "0" on family income. Is this an error on the part of the Census Bureau? If not, what is the explanation? [138 web views]
  22. How does one handle weights in a longitudinal analysis with differential sample attrition over time, etc? I am using selected years in the NLSY? [137 web views]
  23. I’m interested in putting in two small grant proposals over the next few weeks – and both will be completely secondary data analysis – one of the data sets (MIDUS) is available through ICPSR and the other is the Wisconsin Longitudinal Study, which is publicly available. What is the UM policy on public use data and IRBs? [137 web views]
  24. I need to know if there is a good sources to look up 2000 PUMA geography via maps or lists. I need to get an idea of how different metros and counties line up with PUMAs. [137 web views]
  25. I am having trouble measuring density for zip codes. I have pulled off land area and population size from the national SF1 file, but the results I'm getting are way too small. [137 web views]
  26. How does one get access to the Baccalaureate and Beyond data (public-use)? [137 web views]
  27. I want to compare the mobility of siblings in the PSID. How can I find siblings considering they no longer live in their family of origin? [136 web views]
  28. I downloaded a US Census Extract recently and have been running some descriptive statistics. In terms of race, Brazilian Immigrants mostly identify as white, "some other race" and "two or more races". I would like to find out how they classified in the latter 2 categories. Is there any way I can do this? Also, if it is just a matter of creating an extract with specific variables (e.g. "OTHER", NUMRACE), would it be possible to add these variables to my existing dataset? Or would I need to do a completely new data extract with all of the variables? [135 web views]
  29. Has anyone gotten clearance for the American's Changing Lives study, currently PI'd by Michigan researchers Jim House and Paula Lantz? We use the data because it's in-house, but if I wanted to put it in an R03 for October, would I need to go through IRB for approval? [134 web views]
  30. I would like to deposit data from our research project with ICPSR. Any guidance you can give me would be appreciated. [133 web views]
  31. ICPSR used to have a FastTrack service where some data were released fairly immediately – before normal ICPSR processing. I cannot seem to find this anymore. [133 web views]
  32. I am considering using Geolytics Neighborhood Change Database for some longitudinal analyses that I am doing with death certificate data geocoded at the county level. However, I am a little unclear if the NCDB has info for nonmetro counties. The documentation is a bit confusing on this point. Could you advise? [132 web views]
  33. I am trying to get a sense of the American Community Survey (ACS). It is harder than I would have thought. Am I correct in thinking that annually they survey approximately 3,000K households per year? Are the samples independent of each other - e.g., 2005 and 2006? Finally, they do not seem to ask about country of birth. This seems odd although you might be able to make an estimate with ancestry and citizenship. [131 web views]
  34. Where can I find out the change in the distribution of religious denominations over time? [130 web views]
  35. I was looking at the variables in the Collaborative Psychiatric Epidemiology Surveys (CPES) and I need to know how pregnant women are identified in the surveys. Also, how many pregnant women are in the samples? [130 web views]
  36. What level of geography do natality detail files go down to? [130 web views]
  37. Can I access data from ICPSR from my home PC? My home computer is not part of the university IP pool. [130 web views]
  38. I am looking at the life history calendar for the Chitwan Valley (Nepal) study data. The total number of observations listed is 5,271, but there are only 1,469 observations in the life history data. What is the solution? [130 web views]
  39. I want to create a kml file using zip code data and boundaries. Can you point me in the right direction? [129 web views]
  40. The Pew Hispanic Center put out a report last November with Race, age and citizenship status info from the September 2007 CPS. (See table below from report): [http://pewhispanic.org/files/reports/83.pdf](http://pewhispanic.org/files/reports/83.pdf) I didn’t know you could get race info from anything other than the March CPS. What do you know about race information available for other months? [129 web views]
  41. The codebook for the 1972-2006 GSS does not agree with the data. The weight variables are noted as being in columns 6982-6993 and yet the data only goes to column 6801. [129 web views]
  42. I am looking for some software that can take an address on a single line with perhaps an error in the street name and return a corrected address with a zip code. For instance: 1501 Washtena Ann Arbor, MI would be corrected to: 1501 Washtenaw Ann Arbor, MI 48104-3179 [129 web views]
  43. I am having trouble reading a file into SAS. SAS is complaining about the last variable in each record. If I look at it with an editor, it looks like a ^M. [128 web views]
  44. The PSID changed their sample in 1997 - reduced the number of original families that are included in the survey. How does this affect the weights? [128 web views]
  45. I am interested to public use microdata from the Survey of Consumers. I would like it for as many years as possible. I do not know if the producers readily give this out but I know of researchers who have access to the data. [127 web views]
  46. I am doing an analysis with the PSID and I need a very simple variable – race. Where is the race variable? [124 web views]
  47. I need to know what the boundaries of metropolitan areas were at the time of the 2000 census. How are metropolitan areas defined? As an example, I need to know Albuquerque, New Mexico, Atlantic-Cape May, New Jersey, and Chicago, Illinois. [123 web views]
  48. I'm writing to see if you know how I can obtain a variable that indicates the "urbanicity" (a binary "urban" versus "rural" variable would be fine) of all U.S. counties. This variable should be fairly recent (around 2000 is good). I thought it would be easy to track down on the web but haven't had luck. [116 web views]
  49. Do you know anything about the National Longitudinal Mortality Study? I am interested in whether the cause of death (CAUSE) is available in the public use data. [116 web views]
  50. I am unsure how to merge some geographic data a file that I have. I noticed that the census tract numbers do not stand alone in the data file that I downloaded from the census and that the GEO variable is saved as a text variable so I am unable to merge my data sets using that variable. Would you be willing to show me how to link these two data sets if all I have is the census tract number? [115 web views]
  51. I'm interested in looking at birth weight as a measure of child well-being. I can only find this in the Child Development Supplement (CDS), but this is only available from 1997 on. [114 web views]
  52. The Institute for Social Research (ISR) used to support Multiple Classification Analysis (MCA) through OSIRIS. I liked this program very much. Can you tell me if this is still available? [114 web views]
  53. The 1970 natality detail file does not have FIPS codes to identify states and counties. It has NCHS codes. Is there a crosswalk between the two? [113 web views]
  54. I am interested in creating some tables from the 2006 General Social Survey. It looks like ICPSR has the data. How easy is it to make a 4-way table – Political affiliation * Age * Education * Race/Ethnicity? [113 web views]
  55. I can not find the 2000 version of the 1990 “School District Data book." I know that there is a 2000 version, since the National Center for Education Statistics has it on their website: http://nces.ed.gov/surveys/sdds/selectgoe.asp The NCES interface is extraordinarily cumbersome to use. [112 web views]
  56. I have some raw data without any set-up cards. Can you remind me how to read in data with stata? [112 web views]
  57. Has anyone combined the 1999-2002 data with the 2003-2004 NHANES data? I'm using the 4yr weights for the 99-02 but I'm not sure what to do with the 2yr weights on the 2003-2004 data - there don't appear to be updates with 6yr weights. How do I handle the weights when combining 03-04 with the earlier data? [111 web views]
  58. I am interested in surveys with the question "Have you ever been divorced?" as opposed to the question "What is your current marital status?" I am interested in this at the state and sub-state level. [111 web views]
  59. I am working with the National Comorbidity Survey Replication (NCS-R) and I want to know the exact sampling method, universe eligibility, etc. for the following items: TB15L: Self report of tobacco as causation of emotional problems SC21, SC22, SC23: Variables related to depression DA31B_101: Religious preference DA40: Age of mother when you were born PEA52: Personality question - “I often feel empty inside" [111 web views]
  60. Do you know if the summary files include information on the US protectorates in the Pacific (Guam, Marshall Islands, Samoa, Micronesia, etc.) and the Virgin Islands? I know the Census Bureau conducts a census in these locations. [111 web views]
  61. I am working with a zip code file from the Census Bureau. Some of the zip codes have XX or HH in the code, e.g., 298HH. How can I get rid of these zip codes? What does the HH/XX code(s) represent anyway? [111 web views]
  62. I am merging two data sets and end up with too few variables. What is going on? [110 web views]
  63. Why do the counts from SF1 and SF3 differ? [110 web views]
  64. What were the MSAs the Census Bureau used in the 1990 census? There is some ambiguity as to what the definitions of MSAs were for publications and tables that came out of the 1990 census. [109 web views]
  65. I am confused about the different paths to access data and work areas at PSC. [109 web views]
  66. ICPSR has the Early Head Start Research and Evaluation (EHSRE) study but it only has the public use file. Where is the restricted version of the data and what are the access conditions? [108 web views]
  67. I am having trouble merging selected files in the Health and Retirement Study (HRS). Some of the IDs are string variables and some are numeric. I am using stata. I can convert the string ID to numeric with the following command: generate HHIDNO = real(HHID); However, when I try to convert the numeric ID back to string, something is not working and thus I cannot merge with the main file where the ID is a string variable. Here is the command I used to convert the numeric ID back to a string ID: generate HHID = " " + string(HHIDNO); [107 web views]
  68. Is there a public use version of the ACS, similar to PUMS, with household data? Or just the interface where you get custom tables? [107 web views]
  69. Is there a document or documents that compares occupation codings in the census over time . . . say 1960-2000? [106 web views]
  70. Do you know where I can find a list of the variables that are available in the summary files for the 2000 census down to the block level? I should know the answer to this by now. [106 web views]
  71. How do I map a network drive? A restricted space has been set up for me on Novell, but I cannot see it. [106 web views]
  72. I am using the "Immigrants Admitted to the US" files and would like to compare my results with the statistical yearbooks that the Immigration and Naturalization service produced. I'm not finding these on the INS website. [106 web views]
  73. Can you please give me an estimate of the current size of the baby boomer cohort (born 1946 to 1964)? [105 web views]
  74. Do you know where I can get a zipcode-to-county crosswalk online database? I have it for the state of Michigan but now I'd like one for the entire U.S. if possible. There are certain sites that have all states, but I would not want to download the files one state at a time. [104 web views]
  75. Do you know if there is a way to construct a Hispanic measure in the 1960 PUMS? I assume by the 1970 census there is a Hispanic category. Is that right? [103 web views]
  76. I have found some estimates data on the Census Bureau website, but I have no idea how to look at it. Can you help me? [103 web views]
  77. I need time series data on attitudes in New York City from 1999 to the present. [99 web views]
  78. I know SAS on unix can read compressed data. Can SAS for the PC read compressed data? [99 web views]
  79. Why are there zero weights in the 1990 public use microdata file for the U.S. census? [98 web views]
  80. I am using the IPUMS data and for 1970 the only file available is the 1970 metro, form 2. Is this a 1% file? [98 web views]
  81. I am working on a grant proposal with a colleague in the medical school. Her grants person told her that the restricted Add Health data requires a non-networked computer. Is this true? We would prefer NOT to have to include a computer in the budget if possible. [97 web views]
  82. I want to download the 2006 ACS microdata from the Census Bureau website as a stata file. Can you point me to the location where I can do this? [97 web views]
  83. I need a longitudinal data file that has multiple measures of blood pressure. The sample should include women of all races. [96 web views]
  84. I am looking for the size of the 18-19 year old population in the US over time. This must be available from the Census Bureau but I am not finding anything. [90 web views]
  85. Do you know of a source of vital statistics data for the United Kingdom and Canada? My ideal data would be single years of age and cause of death for many years. [89 web views]
  86. I have looked at census data on commuting patterns, but there is no information on the characteristics of the commuters. How can I get data on commuters? I am interested in the townships surrounding Philadelphia. I also need maps that show townships for Pennsylvania. I'd like the maps to include major roads. [88 web views]
  87. What data allows one to tell the extent of divorce on family structure? What I mean is that lots of children live in a two parent family, but it is often a blended family, not the original set of biological parents. I want a count of the number of kids in the following living arrangements: (a) single parent; (b) married couple, with only one biological parent; (c) married couple, with both biological parents; (d) not living with either biological parent. [87 web views]
  88. I want to use the NLS97 cohort for a study of religious affiliation of youth. I am having trouble finding religious preference. [86 web views]
  89. Can one look at fertility in the American Community Survey (ACS)? Is there a children ever born question? [84 web views]
  90. I want to combine two years of the March CPS to compare with data from a survey that took place over a two-year period. The survey took place in 1994 and 1995. [81 web views]
  91. Does the American Community Survey (ACS) have data for zip codes? [80 web views]
  92. I want to apply for the restricted version of PSID data (geocode). Can you clarify what sort of secure environment PSID requires. [79 web views]
  93. Is there a way to get a count of the number of single women over 40 who moved in the past year? [77 web views]
  94. Is there data for Washtenaw county in the ACS yet? How many counties in Michigan are available? [73 web views]
  95. I am trying to match data for 2006 and 2007 from the March CPS so that I can track respondents across the two years. I downloaded the NBER matching files, but they don't seem relevant to data this recent. I also read the information about matching in the CPS codebook, but it's not clear to me how to operationalize their suggestions. Would it be possible to get some assistance with this issue? [68 web views]
  96. Can you remind me how to uncompress a file with a *.gz extension? [66 web views]

NEW PSC blog

Recent resources, events, news

New Publications

Burgard & Lee-Rife. "Community Characteristics & Sexual Behavior." PSC Research Report.

Walsemann, Geronimus & Gee. "Accumulating Disadvantage"

Next Brown Bag

Seminars will start up again in fall 2008
Check for new schedule


W A R N I N G

If you are reading this, it may be that you are using rather old web browsing software that does not support modern international Web technology standards. For a better experience of the Web and this site in particular, please upgrade your web browser software today. The following are good choices: Firefox 2; Opera 9; Safari 3.