I've just checked in the current state of the utf-8 support onto a branch in SVN. For instructions about how to get it (and a summary of what's still to do) see this page on the wiki: http://wiki.xapian.org/Utf8Support The major omission currently is utf-8 stemming, which requires importing a newer version of the Snowball stemmers - I'm currently working on that. Sorry for not creating the branch sooner incidentally. I was hoping to release 0.9.7 first to avoid having to apply patches to two places, but there are some niggling portability issues which are taking a lot of time to sort out (each time I make a tiny change I then have to wait several hours for it to get rebuilt on N different machines to see if it actually fixed the issue...) Cheers, Olly
On Fri, Sep 15, 2006 at 12:16:15AM +0100, Olly Betts wrote:> I've just checked in the current state of the utf-8 support onto a > branch in SVN.I tried the snapshot tarball script on the branch and it worked without any changes (I'm impressed even if you aren't), so if you don't want to have to download code from SVN and make sure you have all the right special tools installed you can now find snapshot tarballs here: http://www.oligarchy.co.uk/xapian/branches/utf8/ Cheers, Olly