Does anyone have information about using the National Practitioner Data Bank (NPDB) in R? It's public use data available from the U.S. Department of Health and Human Services that can be downloaded as a .dat or .por file. I can't even figure out how to open the file in R. [[alternative HTML version deleted]]
On Dec 27, 2011, at 4:55 PM, alison korte wrote:> Does anyone have information about using the National Practitioner Data Bank (NPDB) in R? It's public use data available from the U.S. Department of Health and Human Services that can be downloaded as a .dat or .por file. I can't even figure out how to open the file in R.The files that are available from: http://www.npdb-hipdb.hrsa.gov/resources/publicData.jsp are either tab delimited ASCII files (.DAT) or SPSS (.POR) files. For R, you would want to use the former and import the files using ?read.delim, which defaults to using a '\t' (TAB) character as the delimiter. See the R Data Import/Export manual for more information: http://cran.r-project.org/manuals.html Technical details about the file structure are available in: http://www.npdb-hipdb.hrsa.gov/resources/reports/PublicUseDataFile-DAT-Format.pdf BTW, that took about two minutes of searching using Google. HTH, Marc Schwartz
Alison, library(foreign) ndpb<-read.spss("NPDB1107.por") generates: Warning message: In `levels<-`(`*tmp*`, value = c("Self Query Processing", "Non-Fed. Malp. Payor", : duplicated levels will not be allowed in factors anymore and summary(ndpb) does generate quite a list. Perhaps you can take it from there. Clint -- Clint Bowman INTERNET: clint at ecy.wa.gov Air Quality Modeler INTERNET: clint at math.utah.edu Department of Ecology VOICE: (360) 407-6815 PO Box 47600 FAX: (360) 407-7534 Olympia, WA 98504-7600 USPS: PO Box 47600, Olympia, WA 98504-7600 Parcels: 300 Desmond Drive, Lacey, WA 98503-1274 On Tue, 27 Dec 2011, alison korte wrote:> Does anyone have information about using the National Practitioner Data Bank (NPDB) in R? It's public use data available from the U.S. Department of Health and Human Services that can be downloaded as a .dat or .por file.?I can't even figure out how to open the file in R. > [[alternative HTML version deleted]] > >