Displaying 2 results from an estimated 2 matches for "database_download".
2014 May 14
2
Starting work on Perf Test Module
Hello,
I am beginning work on the perf test module. The initial steps that I aim
to accomplish are :-
-> Download the wikipedia dumps for multiple languages .
-> Write python scripts to tokenize the dump (will probably use something
like nltk which has powerful inbuilt tokenizers)
-> Discuss and finalize the design of the search and query expansion perf
tests as I want to complete them
2012 Apr 13
0
Lobbying database
...ot; and maybe also the "Contributions" database, run a number of
checks, screen out the nonsense and create search capabilities similar
to what is offered at this web site but without the garbage?
I downloaded one file from
"http://www.senate.gov/legislative/Public_Disclosure/database_download.htm".
I see that it's "xml" inside. I have not worked with XML much before,
but it doesn't look too difficult just from a casual perusal -- and R
has an "XML" package.
Also, do you have a list publications by others who have done things
with these data?...