Julius Roberts
2010-Feb-18 03:19 UTC
[zfs-discuss] Problems with sudden zfs capacity loss on snv_79a
Yes snv_79a is old, yes we''re working separately on migrating to
snv_111b or later. But i need to solve this problem ASAP to buy me
some more time for that implementation.
We pull data from a variety of sources onto our zpool called Backups,
then we snapshot them. We keep around 20 or so and then delete them
automatically. We''ve been doing this for around two years on this
system and it''s been absolutely fantastic. Free-space hovers around
300G. But suddenly something has changed:
root at darling(/)$:zfs list | head -1 && zfs list | tail -7
NAME
USED AVAIL REFER MOUNTPOINT
Backups/natoffice/onsite at 20091231_2347_TriggeredBy_20091231_2330
30.1G - 287G -
Backups/natoffice/onsite at 20100131_2349_TriggeredBy_20100131_2330
17.7G - 287G -
Backups/natoffice/onsite at 20100205_0001_TriggeredBy_20100204_2330
15.9G - 287G -
Backups/natoffice/onsite at 20100212_0424_TriggeredBy_20100211_2330
152G - 285G -
Backups/natoffice/onsite at 20100216_0430_TriggeredBy_20100215_2330
154G - 287G -
Backups/natoffice/onsite at 20100217_0431_TriggeredBy_20100216_2330
154G - 287G -
Backups/natoffice/onsite at 20100218_0423_TriggeredBy_20100217_2330
0 - 287G -
Normally a snapshot shows USED around 15G to 30G. But suddenly,
snapshots of the same filesystem are showing USED ~150G. There are no
corresponding increases in any of the machines we copy data from, nor
has any of that data changed significantly. You can see that the
REFER hasn''t changed much at all, this is normal. So we''re
backing up
the same amount of data, but it now occupies so much more on disk.
That of course means we can''t keep nearly as many snapshots, and that
makes us all very nervous.
Any ideas?
Other info:
Normally the snapshots are consecutive days but we''ve had to cull data
as we were running out of space.
root at darling(/)$:zpool status
pool: Backups
state: ONLINE
scrub: none requested
config:
NAME STATE READ WRITE CKSUM
Backups ONLINE 0 0 0
raidz1 ONLINE 0 0 0
c2d0 ONLINE 0 0 0
c5d0 ONLINE 0 0 0
c6d0 ONLINE 0 0 0
errors: No known data errors
root at darling(/)$:zfs list Backups
NAME USED AVAIL REFER MOUNTPOINT
Backups 1.34T 456G 25.3K /Backups
--
Kind regards, Jules
Zen left me, then I remembered, nothing to forget.
Giovanni Tirloni
2010-Feb-18 23:44 UTC
[zfs-discuss] Problems with sudden zfs capacity loss on snv_79a
On Thu, Feb 18, 2010 at 1:19 AM, Julius Roberts <hooliowobbits at gmail.com>wrote:> Yes snv_79a is old, yes we''re working separately on migrating to > snv_111b or later. But i need to solve this problem ASAP to buy me > some more time for that implementation. > > We pull data from a variety of sources onto our zpool called Backups, > then we snapshot them. We keep around 20 or so and then delete them > automatically. We''ve been doing this for around two years on this > system and it''s been absolutely fantastic. Free-space hovers around > 300G. But suddenly something has changed: > > root at darling(/)$:zfs list | head -1 && zfs list | tail -7 > NAME > USED AVAIL REFER MOUNTPOINT > Backups/natoffice/onsite at 20091231_2347_TriggeredBy_20091231_2330 > 30.1G - 287G - > Backups/natoffice/onsite at 20100131_2349_TriggeredBy_20100131_2330 > 17.7G - 287G - > Backups/natoffice/onsite at 20100205_0001_TriggeredBy_20100204_2330 > 15.9G - 287G - > Backups/natoffice/onsite at 20100212_0424_TriggeredBy_20100211_2330 > 152G - 285G - > Backups/natoffice/onsite at 20100216_0430_TriggeredBy_20100215_2330 > 154G - 287G - > Backups/natoffice/onsite at 20100217_0431_TriggeredBy_20100216_2330 > 154G - 287G - > Backups/natoffice/onsite at 20100218_0423_TriggeredBy_20100217_2330 > 0 - 287G - > > Normally a snapshot shows USED around 15G to 30G. But suddenly, > snapshots of the same filesystem are showing USED ~150G. There are no > corresponding increases in any of the machines we copy data from, nor > has any of that data changed significantly. You can see that the > REFER hasn''t changed much at all, this is normal. So we''re backing up > the same amount of data, but it now occupies so much more on disk. > That of course means we can''t keep nearly as many snapshots, and that > makes us all very nervous. > > Any ideas? >Is it possible that your users are now deleting everything before starting to write the backup data ? -- Giovanni Tirloni sysdroid.com -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100218/39011c90/attachment.html>