Displaying 4 results from an estimated 4 matches for "weixian".
2012 Apr 20
2
Posting list encoding improvements - pfd encoding & var len encoding comparison program
...3. The implemented fixed length encoding uses 4 bytes as fixed length. This
is not optimal and can be further optimized in PFD.
4. The program generates uniform random doc id gap and doc len to make
posting list.
*You can access the code via my github:
https://github.com/zwxxx/pfd_simple_test*
--
Weixian Zhou
Department of Computer Science and Engineering
University at Buffalo, SUNY
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120419/72da0ab2/attachment.html>
-------------- next part -------------...
2012 Mar 31
1
Project: Posting list encoding improvements
Hi Xapianers:
My name is Weixian Zhou, Computer Science student of University at Buffalo,
State University of New York. I am interested in the project of posting
list encoding improvements and weighting schemes. I have some questions
toward them.
1) After read the comments in brass_postlist.cc, I am still not very clear
about the...
2012 Apr 01
2
Project: QueryParser Reimplementation, to Olly Betts and Dan Colish
...e
the query is the most difficult. Here?s some idea.
a. documents contain more consecutive matching words rank higher.
b. different words have different weight (e.g., tf-idf), documents with
bigger weight rank higher.
c. use k-gram to divide long query and limit the query times.
Thank you.
--
Weixian Zhou
Department of Computer Science and Engineering
University at Buffalo, SUNY
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120401/32fdeaff/attachment.html>
2012 Apr 14
1
[xapian] a bug fixed in brass_database.cc
...I fixed a bug in brass_database.cc.
The bug is:
*FIXME: this should be done by checking memory usage, not the number of*
*changes. We could also look at the amount of data the inverter object*
*currently holds.*
I also modified the simpleindex.cc so that it now supports batch files
indexing.
--
Weixian Zhou
Department of Computer Science and Engineering
University at Buffalo, SUNY
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xapian.org/pipermail/xapian-devel/attachments/20120414/689f53a9/attachment.html>
-------------- next part -------------...