similar to: general question on approaches to getting data from data providers

Displaying 20 results from an estimated 8000 matches similar to: "general question on approaches to getting data from data providers"

2008 Feb 13
3
Generalized nonlinear mixed model function?
I am wondering if there is an R function that could estimate a generalized nonlinear mixed model. >From my reading it seems that nlme from the nlme package can fit nonlinear mixed models, while lmer from the lme4 package can fit generalized linear mixed models. One alternative I?ve found is gnlmix from the repeated package, although this only allows for a single random effect. Is there
2008 Jan 17
1
Any tools for working with US 2000 census data?
I've been given the job of extracting some data from the United States 2000 census (files at http://www2.census.gov/census_2000/datasets/Summary_File_2/Maryland/all_ Maryland.zip 52M). I'm only interested in Census Block Groups (CBGs) located within Baltimore City, Maryland. Additionally, I just have to extract certain data fields. I think I'll be using Summary File 2. This is my first
2016 Apr 22
0
Storage cluster advise, anybody?
On 22/04/16 03:18 PM, Valeri Galtsev wrote: > Dear Experts, > > I would like to ask everybody: what would you advise to use as a storage > cluster, or as a distributed filesystem. > > I made my own research of what I can do, but I hit a snag with my > seemingly best choice, so I decided to stay away from it finally, and ask > clever people what they would use. > >
2006 May 22
2
How to execute time consuming code
Hello all, I have a screen scraping application (go to a lots of sites, extract 10k stuff, integrate the results, put them to DB etc). Now i want to use a Rails application as a frontend to this: The user can push a button which triggers the screen scraping app and view the results (preferably asynchronously, but that does not really matter right now). Questions: - Should the screen scraping app
2024 Dec 06
1
Sum by group
I have population data (?totpopE?) at the census tract level (?GEOID?), which are nested within Precincts (?Precinct?). Please see below my data structure. I used the code to sum population data per precinct: inters <- inters %>% group_by(Precinct) %>% mutate(TotalPop = sum(totpopE) ) However, said code produced too large sums because each census tract (?GEOID?) has multiple
2011 Feb 10
1
"Error in plot.window(...) : invalid 'xlim' value" from plot(...par(new = TRUE))
[New to the community; still in early part of R's learning curve.] Several months ago, I was requested to generate some graphs on a periodic basis. Accordingly, I managed to figure out a way to do so, using a combination of Perl and R (in a FreeBSD environment). While I've needed to adjust a few things here and there, the general approach has been pretty solid , and the R part has had
2016 Apr 22
1
Storage cluster advise, anybody?
Hi Valeri On Fri, Apr 22, 2016 at 10:24 PM, Digimer <lists at alteeve.ca> wrote: > On 22/04/16 03:18 PM, Valeri Galtsev wrote: >> Dear Experts, >> >> I would like to ask everybody: what would you advise to use as a storage >> cluster, or as a distributed filesystem. >> >> I made my own research of what I can do, but I hit a snag with my >>
2011 Dec 03
1
side-by-side map with different geographies using spplot
Hello, I want to create side-by-side maps of similar attribute data in two different cities using a single legend. To simply display side-by-side census block group boundary (non-thematic) maps for Minneapolis & Cleveland I do the following: library(rgdal) library(sp) Minneapolis=readOGR("../Minneapolis/Census/2010/Census_BlockGroup_GEO/","tl_2010_27053_bg10")
2011 Apr 10
1
survival object
Hi All, I am trying to do a survivorship analysis with library(survival)from a data set that looks like this: I followed a bunch of naturally germinated seedlings of an annual plant from germination to death (none made it to reproduce, and died in a period of ~60 days after germination.) I also know the size of the seed of every individual censused. So I am trying to analyze seedling survival as
2006 Jan 27
1
Caching from screen scraping
Hi all, I need to do some screen scraping from my rails app. Given an ethernet (MAC) adress, I scrape results from an internal web page that returns location and hostname. How can I cache the result from that screen scraping as to be polite to the scrapee? I would like to expire the results daily. In perl, I would use Cache::File. Can I use rails caching for this? What''s the best
2017 Jul 11
0
[R-pkgs] acs version 2.1.0 update: download and analyze data from the US Census in R
We are pleased to announce the release of version 2.1.0 of the "acs" package, now available on CRAN <http://cran.r-project.org/web/packages/acs/index.html>. The package allows users to download, manipulate, analyze, and present demographic data from the U.S. Census, with special tools and methods to simplify the tasks of working with estimates and standard errors contained in data
2007 Oct 03
0
can you help me with empirical probability
I found you online....... Can you help with empirical probability? Hi Partha. I really liked your email that you sent me, it really inspired me. I have been breezing through the chapters, and doing quite well, You should be a teacher. After all the time my college instructor spent with the class on the slopes etc.... There were very few of us who really understood it. However, after reading
2009 Feb 18
1
R as a web scraping tool using RCurl
Hi List, I am trying to leverage my knowledge of R in trying to use it for tasks that may not make R the best choice for these tasks. I wish to automate a web scraping task, which requires a multi-step procedure: 1) log in to a website 2) Go to a particular page 3) From the drop down menu, click on a particular link 4) From the tabulated data presented, choose relevant information based on a
2018 Jan 18
0
Web scraping different levels of a website
I am web scraping a page at http://catalog.ihsn.org/index.php/catalog#_r=&collection=&country=&dtype=&from=1890&page=1&ps=100&sid=&sk=&sort_by=nation&sort_order=&to=2017&topic=&view=s&vk= From this url, I have built up a dataframe through the following code: dflist <- map(.x = 1:417, .f = function(x) { Sys.sleep(5) url <-
2007 Mar 23
1
Completely off topic, but amusing?
Folks: Thought that many on this list might find this amusing, perhaps even a bit relevant. Hope it's OK: ************ WASHINGTON - The government's estimate of the number of Americans without health insurance fell by nearly 2 million Friday, but not because anyone got health coverage. The Census Bureau <http://search.news.yahoo.com/search/news/?p=Census+Bureau> said it has
2007 May 20
1
Tips on testing
Hey WWW::Mechanize users, I''ve been using Mechanize for about 3 or 4 months now, and I''m rather enjoying it. I''ve done scraping in PHP & Python, starting with raw text and HTTP requests, moving up to regex and Curl, and finally landed myself here, making me much more productive when scraping pages. At the moment, I''m in the middle of a decent sized
2009 May 11
1
Building US maps in R
Hi, I'm trying to build some maps of the US by county that will have the following characteristics: Feature/Map Map 1 Map2 Both Broken out by county Yes Yes Yes Heatmaps of US Census Data for income by county Yes No Yes Heatmaps of US Census Data for race by county (recoded as white and &non-white, with each county color coded based on the majority) No Yes No Polygon
2010 Jan 26
1
Does Amazon.com block scraping?
Hi there Does anyone know if Amazon.com has any sort of server side script that tries to block scraping activities? I first noticed that if I didn?t change the agent alias, it would fetch a page exactly like the normal one, but without the intial search field(maybe a silly way to prevent scraping). Then after it, I changed to some other alias, and submit a search. I got the result page as
2003 Mar 24
1
negative binomial regression
I would like to know if it is possible to perform negative binomial regression with rate data (incidence density) using the glm.nb (in MASS) function. I used the poisson regression glm call to assess the count of injuries across census tracts. The glm request was adjusted to handle the data as rates using the offset parameter since the population of census tracts can vary by a factor of
2010 Dec 16
0
Revolutions Blog: November Roundup
I write about R every weekday at the Revolutions blog: ?http://blog.revolutionanalytics.com and every month I post a summary of articles from the previous month of particular interest to readers of r-help. In case you missed them, here are some articles related to R from the month of November: Dirk Eddelbuettel and Romain Francois went to Google to talk about integrating R (using Rcpp, for