Akash
2014-Aug-18 07:42 UTC
Huge difference between the lucene index size created by v2.1 and v2.2
Hi everyone, While examining dovecot versions v2.1 and v2.2 for their lucene search performances, I have noticed a huge difference in the index sizes created by them. Both versions were compiled on same system, against same libclucene, same configure options and were used with same dovecot.conf during run-time. I deleted the lucene-indexes folder and dovecot* files in the Maildir prior to indexing with both versions. The tests were performed on a untouched mail archive folder containing 300000 mails without any dovecot* files in it: root at server:/home/admin/mails/.Archive# ls -l total 78640 drwx------ 2 2500 2500 14098432 Aug 17 22:16 cur drwx------ 2 2500 2500 12435456 Jul 30 09:46 new drwx------ 2 2500 2500 4096 Aug 2 13:02 tmp The command used was: doveadm -v index -u admin Archive After search indexing on v2.1 resulted in: root at server:/home/admin/mails# ls -lh lucene-indexes total 390M -rw------- 1 2500 2500 390M Aug 18 07:03 _25.cfs -rw------- 1 2500 2500 20 Aug 18 07:03 segments.gen -rw------- 1 2500 2500 46 Aug 18 07:03 segments_4d Whereas dovecot v2.2 resulted in: root at server:/home/admin/mails# ls -lh lucene-indexes total 1.5G -rw------- 1 2500 2500 1.5G Aug 18 06:41 _5g.cfs -rw------- 1 2500 2500 20 Aug 18 06:41 segments.gen -rw------- 1 2500 2500 46 Aug 18 06:41 segments_az 390M vs 1.5G. That is a huge difference in size. Why is that? Thanks in advance. -Regards, Akash
Timo Sirainen
2014-Aug-18 08:47 UTC
Huge difference between the lucene index size created by v2.1 and v2.2
On 18 Aug 2014, at 09:42, Akash <akbwiz+dovecot at gmail.com> wrote:> 390M vs 1.5G. That is a huge difference in size. Why is that?Can you test if the attached patch shrinks it back? I had been planning on making that also configurable. There might be something else also causing it. -------------- next part -------------- A non-text attachment was scrubbed... Name: diff Type: application/octet-stream Size: 357 bytes Desc: not available URL: <http://dovecot.org/pipermail/dovecot/attachments/20140818/86aa501a/attachment-0001.obj> -------------- next part --------------