thr3ads.net - zfs discuss - [zfs-discuss] File level snapshots in ZFS? [Mar 2007]

If this information is useful, please help other people find it:
Share via:

Atul Vidwansa

2007-Mar-29 18:22 UTC

[zfs-discuss] File level snapshots in ZFS?

Hi,
Is it possible to take file level snapshots in ZFS? Suppose I want to
keep a version of the file before writing new data to it, how do I do
that? My goal would be to rollback the file to earlier version (i.e.
discard the new changes) depending upon a policy.  I would like to
keep only 1 version of a file at a time and while writing new data,
earlier version will be discarded and current state of file (before
writing) would be saved in the version.

Thanks,
-Atul

Albert Chin

2007-Mar-29 18:31 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On Thu, Mar 29, 2007 at 11:52:56PM +0530, Atul Vidwansa
wrote:> Is it possible to take file level snapshots in ZFS? Suppose I want to
> keep a version of the file before writing new data to it, how do I do
> that? My goal would be to rollback the file to earlier version (i.e.
> discard the new changes) depending upon a policy.  I would like to
> keep only 1 version of a file at a time and while writing new data,
> earlier version will be discarded and current state of file (before
> writing) would be saved in the version.
Doubt it. Snapshots are essentiall "free" and take no time so might as
well just snapshot the file system.

-- 
albert chin (china at thewrittenword.com)

Richard Elling

2007-Mar-29 18:50 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

Atul Vidwansa wrote:> Hi,
> Is it possible to take file level snapshots in ZFS? Suppose I want to
> keep a version of the file before writing new data to it, how do I do
> that? My goal would be to rollback the file to earlier version (i.e.
> discard the new changes) depending upon a policy.  I would like to
> keep only 1 version of a file at a time and while writing new data,
> earlier version will be discarded and current state of file (before
> writing) would be saved in the version.
Most folks use SCCS, CVS, or some other version control system for this
sort of task.  These will work fine on ZFS.
  -- richard

Atul Vidwansa

2007-Mar-29 19:01 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

Hi Richard,
    I am not talking about source(ASCII) files. How about versioning
production data? I talked about file level snapshots because
snapshotting entire filesystem does not make sense when application is
changing just few files at a time.

Regards,
-atul

On 3/30/07, Richard Elling <Richard.Elling at sun.com>
wrote:> Atul Vidwansa wrote:
> > Hi,
> > Is it possible to take file level snapshots in ZFS? Suppose I want to
> > keep a version of the file before writing new data to it, how do I do
> > that? My goal would be to rollback the file to earlier version (i.e.
> > discard the new changes) depending upon a policy.  I would like to
> > keep only 1 version of a file at a time and while writing new data,
> > earlier version will be discarded and current state of file (before
> > writing) would be saved in the version.
>
> Most folks use SCCS, CVS, or some other version control system for this
> sort of task.  These will work fine on ZFS.
>   -- richard
>

Richard Elling

2007-Mar-29 20:43 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

Atul Vidwansa wrote:> Hi Richard,
>    I am not talking about source(ASCII) files. How about versioning
> production data? I talked about file level snapshots because
> snapshotting entire filesystem does not make sense when application is
> changing just few files at a time.
CVS supports binary files.  The nice thing about version control systems
is that you can annotate the versions.  With ZFS snapshots, you don''t
get
annotations.
  -- richard
> Regards,
> -atul
> 
> On 3/30/07, Richard Elling <Richard.Elling at sun.com> wrote:
>> Atul Vidwansa wrote:
>> > Hi,
>> > Is it possible to take file level snapshots in ZFS? Suppose I want
to
>> > keep a version of the file before writing new data to it, how do I
do
>> > that? My goal would be to rollback the file to earlier version
(i.e.
>> > discard the new changes) depending upon a policy.  I would like to
>> > keep only 1 version of a file at a time and while writing new
data,
>> > earlier version will be discarded and current state of file
(before
>> > writing) would be saved in the version.
>>
>> Most folks use SCCS, CVS, or some other version control system for this
>> sort of task.  These will work fine on ZFS.
>>   -- richard
>>

Shawn Walker

2007-Mar-29 23:04 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 29/03/07, Atul Vidwansa <atulvid at gmail.com>
wrote:> Hi Richard,
>     I am not talking about source(ASCII) files. How about versioning
> production data? I talked about file level snapshots because
> snapshotting entire filesystem does not make sense when application is
> changing just few files at a time.
>
> Regards,
> -atul
It does make sense if you understand how snapshots work. If you only
change a few files, your snapshots aren''t going to use much room.

What you want is version control, not ZFS snapshots. I highly
recommend you look into them instead.

-- 
"Less is only more where more is no good." --Frank Lloyd Wright

Shawn Walker, Software and Systems Analyst
binarycrusader at gmail.com - http://binarycrusader.blogspot.com/

Wee Yeh Tan

2007-Mar-30 00:51 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 3/30/07, Shawn Walker <binarycrusader at gmail.com>
wrote:> On 29/03/07, Atul Vidwansa <atulvid at gmail.com> wrote:
> > Hi Richard,
> >     I am not talking about source(ASCII) files. How about versioning
> > production data? I talked about file level snapshots because
> > snapshotting entire filesystem does not make sense when application is
> > changing just few files at a time.
> >
> > Regards,
> > -atul
>
> It does make sense if you understand how snapshots work. If you only
> change a few files, your snapshots aren''t going to use much room.
This goes back to file system layout.  If the production data is
housed in the file system where many other changes taking place and
the administrator is only interested in backing up a few of those
files, file system layer snaps will not be "cheap".
> What you want is version control, not ZFS snapshots. I highly
> recommend you look into them instead.
He mentioned production data and I imagine this could be "big
production data".  Then, CVS and the like will be too heavy and he
needs to re-layout his file system.  At the very least, isolating the
production data into its own dataset will help.

-- 
Just me,
Wire ...

Shawn Walker

2007-Mar-30 01:20 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 29/03/07, Wee Yeh Tan <weeyeh at gmail.com>
wrote:> On 3/30/07, Shawn Walker <binarycrusader at gmail.com> wrote:
> > On 29/03/07, Atul Vidwansa <atulvid at gmail.com> wrote:
> > > Hi Richard,
> > >     I am not talking about source(ASCII) files. How about
versioning
> > > production data? I talked about file level snapshots because
> > > snapshotting entire filesystem does not make sense when
application is
> > > changing just few files at a time.
> > >
> > > Regards,
> > > -atul
> >
> > It does make sense if you understand how snapshots work. If you only
> > change a few files, your snapshots aren''t going to use much
room.
>
> This goes back to file system layout.  If the production data is
> housed in the file system where many other changes taking place and
> the administrator is only interested in backing up a few of those
> files, file system layer snaps will not be "cheap".
But his example was "changing just few files at a time," so
that''s a
different case. Let''s not try to discuss multiple examples at once :)
> > What you want is version control, not ZFS snapshots. I highly
> > recommend you look into them instead.
>
> He mentioned production data and I imagine this could be "big
> production data".  Then, CVS and the like will be too heavy and he
> needs to re-layout his file system.  At the very least, isolating the
> production data into its own dataset will help.
Actually, recent version control systems can be very efficient at
storing binary files. Careful consideration of the layout of your file
system applies regardless of which type of file system it is (zfs,
ufs, etc.).

-- 
"Less is only more where more is no good." --Frank Lloyd Wright

Shawn Walker, Software and Systems Analyst
binarycrusader at gmail.com - http://binarycrusader.blogspot.com/

Al Hopper

2007-Mar-30 01:28 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On Thu, 29 Mar 2007, Shawn Walker wrote:
> On 29/03/07, Wee Yeh Tan <weeyeh at gmail.com> wrote:
> > On 3/30/07, Shawn Walker <binarycrusader at gmail.com> wrote:
> > > On 29/03/07, Atul Vidwansa <atulvid at gmail.com> wrote:
> > > > Hi Richard,
> > > >     I am not talking about source(ASCII) files. How about
versioning
> > > > production data? I talked about file level snapshots because
> > > > snapshotting entire filesystem does not make sense when
application is
> > > > changing just few files at a time.
> > > >
> > > > Regards,
> > > > -atul
> > >
> > > It does make sense if you understand how snapshots work. If you
only
> > > change a few files, your snapshots aren''t going to use
much room.
> >
> > This goes back to file system layout.  If the production data is
> > housed in the file system where many other changes taking place and
> > the administrator is only interested in backing up a few of those
> > files, file system layer snaps will not be "cheap".
>
> But his example was "changing just few files at a time," so
that''s a
> different case. Let''s not try to discuss multiple examples at once
:)
The OP is long on generalities and short on specifics.  Without more
specifics, it''s hard to provide useful input.

... snip ...

Al Hopper  Logical Approach Inc, Plano, TX.  al at logical-approach.com
           Voice: 972.379.2133 Fax: 972.379.2134  Timezone: US CDT
OpenSolaris.Org Community Advisory Board (CAB) Member - Apr 2005
OpenSolaris Governing Board (OGB) Member - Feb 2006 to Mar 2007

Wee Yeh Tan

2007-Mar-30 02:46 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 3/30/07, Shawn Walker <binarycrusader at gmail.com>
wrote:> Actually, recent version control systems can be very efficient at
> storing binary files.
Still no where as efficient as a ZFS snapshot.
> Careful consideration of the layout of your file
> system applies regardless of which type of file system it is (zfs,
> ufs, etc.).
True.  ZFS does open up a whole new can of worms/flexibility.


-- 
Just me,
Wire ...

Nicholas Lee

2007-Mar-30 03:00 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 3/30/07, Wee Yeh Tan <weeyeh at gmail.com>
wrote:>
> > Careful consideration of the layout of your file
> > system applies regardless of which type of file system it is (zfs,
> > ufs, etc.).
>
> True.  ZFS does open up a whole new can of worms/flexibility.

How do hard-links work across zfs mount/filesystems in the same pool?

Create a /zfs-track filesystem, symlink all the files you want to regularly
snapshot there.

In fact this might be an interesting mechanism to track files that changes
in /etc against system defaults.

Nicholas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20070330/b609d8e5/attachment.html>

Wee Yeh Tan

2007-Mar-30 03:14 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 3/30/07, Nicholas Lee <emptysands at gmail.com>
wrote:> How do hard-links work across zfs mount/filesystems in the same pool?
No.

<http://src.opensolaris.org/source/xref/onnv/onnv-gate/usr/src/uts/common/fs/vnode.c#1322>

My guess is that it should be technically possible in the same pool
though but just not though the syscall.
> Create a /zfs-track filesystem, symlink all the files you want to regularly
> snapshot there.
>
> In fact this might be an interesting mechanism to track files that changes
> in /etc against system defaults.
That is definitely a job for a revision control system.


-- 
Just me,
Wire ...

Shawn Walker

2007-Mar-30 04:10 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 29/03/07, Wee Yeh Tan <weeyeh at gmail.com>
wrote:> On 3/30/07, Shawn Walker <binarycrusader at gmail.com> wrote:
> > Actually, recent version control systems can be very efficient at
> > storing binary files.
>
> Still no where as efficient as a ZFS snapshot.
Maybe, but they''re far better at doing versioning and providing a
history of changes.

Now if someone wants to build a revision control system on top of zfs
somehow, more power to them...

-- 
"Less is only more where more is no good." --Frank Lloyd Wright

Shawn Walker, Software and Systems Analyst
binarycrusader at gmail.com - http://binarycrusader.blogspot.com/

Atul Vidwansa

2007-Mar-30 04:36 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

Lets say I reorganized my zpools. Now there are 2 pools:
Pool1:
Production data, combination of binary and text files. Only few files
change at a time. Average file sizes are around 1MB. Does it make
sense to take zfs snapshots of the pool? Will the snapshot consume as
much space as original zpool?

Pool2:
Again, production data, mostly binary files. File sizes are huge
~10GB. These files change frequently. Does it make sense to snapshot
it? Somehow I am reluctant to think about putting these files under
something like CVS. As I mentioned before, I need only 1 version of
the file at any time.

Any suggestions?
Regards,
_Atul

On 3/30/07, Shawn Walker <binarycrusader at gmail.com>
wrote:> On 29/03/07, Wee Yeh Tan <weeyeh at gmail.com> wrote:
> > On 3/30/07, Shawn Walker <binarycrusader at gmail.com> wrote:
> > > Actually, recent version control systems can be very efficient at
> > > storing binary files.
> >
> > Still no where as efficient as a ZFS snapshot.
>
> Maybe, but they''re far better at doing versioning and providing a
> history of changes.
>
> Now if someone wants to build a revision control system on top of zfs
> somehow, more power to them...
>
> --
> "Less is only more where more is no good." --Frank Lloyd Wright
>
> Shawn Walker, Software and Systems Analyst
> binarycrusader at gmail.com - http://binarycrusader.blogspot.com/
>

Nicholas Lee

2007-Mar-30 05:00 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 3/30/07, Shawn Walker <binarycrusader at gmail.com>
wrote:>
> Maybe, but they''re far better at doing versioning and providing a
> history of changes.

I;d have to agree. I track 6000 blobs (OOo gzip files, pdfs and other stuff)
in svn even with 1300 changesets over 3 years there is a marginal disk cost
on the repository.  The only issue is all the .svn directories, they add to
the disk requirement on the client side.

[nic at base:/export/svn] du -sh nicholas/
712M    nicholas/
[nic at shell:~/tmp/company] du  --exclude="*.svn*" -sh
789M    .
[nic at shell:~/tmp/company] du -sh
1.6G    .

Yes the repository is smaller than the exported data set. :)

[nic at shell:~/tmp/company] find . | grep -v svn | wc -l
6089
[nic at shell:~/tmp/company] find . | wc -l
26151

Now if someone wants to build a revision control system on top of
zfs> somehow, more power to them..

For something like a photo archive, zfs snapshots are likely to be more
useful.  The main advantage being over something like svn being able to
remove snaps in the middle of a set.

At the end of the day disk cost is cheap, it is the management cost you have
to consider.

Nicholas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20070330/80d824ec/attachment.html>

Nicholas Lee

2007-Mar-30 05:11 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 3/30/07, Atul Vidwansa <atulvid at gmail.com>
wrote:>
> Lets say I reorganized my zpools. Now there are 2 pools:
> Pool1:
> Production data, combination of binary and text files. Only few files
> change at a time. Average file sizes are around 1MB. Does it make
> sense to take zfs snapshots of the pool? Will the snapshot consume as
> much space as original zpool?

Depends on what you want to track/do.

If you want file history and being able to do something like "svn
info" to
see what is out of date then vcs is a good choice. If you use want to be
able to access backups and you want to build your own blob management system
then snapshots probably are good enough.

The above is a typical case for source control, and is likely to be easier
this in the long run.

Pool2:> Again, production data, mostly binary files. File sizes are huge
> ~10GB. These files change frequently. Does it make sense to snapshot
> it? Somehow I am reluctant to think about putting these files under
> something like CVS. As I mentioned before, I need only 1 version of
> the file at any time.

10Gb files are likely to much harder with something like subversion. With
files these size I can''t see you being interested in ''file
diffs'' and really
being able to look at point in time shots.

Plus If you say take daily backups, with say a 100Mb daily midnight
changeset. Then want to remove the Mon to Sat''s changesets. I
don''t think
subversion will handle that well.

The other good thing about zfs snapshots is low system load for a snap.  If
you have the disk space to do something like 5min snaps, using a vcs system
will probably generate a reasonable amount of system load and operation
completion time is not going to be constant time.

Nicholas
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
<http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20070330/bdea1221/attachment.html>

Manoj Joseph

2007-Mar-30 13:47 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

Richard Elling wrote:> Atul Vidwansa wrote:
>> Hi Richard,
>>    I am not talking about source(ASCII) files. How about versioning
>> production data? I talked about file level snapshots because
>> snapshotting entire filesystem does not make sense when application is
>> changing just few files at a time.
> 
> CVS supports binary files.  The nice thing about version control systems
> is that you can annotate the versions.  With ZFS snapshots, you
don''t get
> annotations.
Sure version control systems do file versioning. But, ZFS with its COW 
brings a new way of doing this.

I do not see applications like emacs, star office etc using
SCCS/CVS. But I can easily see then using file snapshots if zfs were to 
offer it (am conveniently ignoring portability).

It was suggested that filesystem snapshots be used to achieve the same 
purpose. It would not work, if you have to roll back one file change but 
not others...

Extended attributes could potentially be used to annotate file 
snapshots... ;)

I can also see possibilities with clustered/distributed applications 
(parallel Postgresql perhaps?) needing to commit/revoke across servers 
using this. Layered distributed filesystems could potentially use this 
for recovery.

But I also remember a long thread on this not too long ago going 
nowhere. ;)

Just my $0.02.

-Manoj

Toby Thain

2007-Mar-30 14:01 UTC

head link

[zfs-discuss] File level snapshots in ZFS?

On 29-Mar-07, at 5:43 PM, Richard Elling wrote:
> Atul Vidwansa wrote:
>> Hi Richard,
>>    I am not talking about source(ASCII) files. How about versioning
>> production data? I talked about file level snapshots because
>> snapshotting entire filesystem does not make sense when  
>> application is
>> changing just few files at a time.
>
> CVS supports binary files.
And Subversion supports them even better... But even more relevantly  
to the OP, it has cheap tagging.

--Toby
> The nice thing about version control systems
> is that you can annotate the versions.  With ZFS snapshots, you  
> don''t get
> annotations.
>  -- richard
>
>> Regards,
>> -atul
>> On 3/30/07, Richard Elling <Richard.Elling at sun.com> wrote:
>>> Atul Vidwansa wrote:
>>> > Hi,
>>> > Is it possible to take file level snapshots in ZFS?

Eric Haycraft

2007-Mar-30 14:26 UTC

head link

[zfs-discuss] Re: File level snapshots in ZFS?

> Lets say I reorganized my zpools. Now there are 2
> pools:
> Pool1:
> Production data, combination of binary and text
> files. Only few files
> change at a time. Average file sizes are around 1MB.
> Does it make
> sense to take zfs snapshots of the pool? Will the
> snapshot consume as
> much space as original zpool?
A snapshot takes up almost no space when originally created. Essentially just
enough space to keep a reference pointer to the current state of data. The
snapshot ''grows'' as data in the pool is changed. After a
snapshot is taken, it will  grow in size by updates being made to existing
blocks. The existing blocks are maintained and the snapshot keeps pointers to
the existing blocks while new blocks are created that are pointed to as the
current version. In short, the snapshot only keeps track of changes from the
current state to the snapshot''s state... It does not make a copy of
everything. Depending upon the frequency and quantity of blocks updated, this
could be very little space or a ton of space for the snapshots.

For example:
You have file A at 20 MB and file B at 20 MB files in a directory and you take a
snapshot:
 - at this point, 40 MB is used
You add file C (20 MB) and delete file B
 - now the current file system contains 40 MB (A + C) and the snapshot is using
up 20 MB (B)
You modify 1 MB of file A
 - Now the current file system contains 40 MB (A + C) and the snapshot is using
21 MB (Files B and 1MB from the previous state of A)


Hope that helps :)
> 
> Pool2:
> Again, production data, mostly binary files. File
> sizes are huge
> ~10GB. These files change frequently. Does it make
> sense to snapshot
> it? Somehow I am reluctant to think about putting
> these files under
> something like CVS. As I mentioned before, I need
> only 1 version of
> the file at any time.
> 
The answer is it depends
How often do you want snapshots? (More frequently would indicate zfs over cvs)
Do you want to keep all snapshots? (If you want to remove snapshots at some
point, zfs is your best bet)
Do you need version annotation of some type? (cvs would be better for this, you
can diff with zfs though if the .zfs folder is enabled).
How many files do you care about snapshotting? (A small percentage of files in a
directory would make cvs a better option..all files in a directory would make
zfs a better option)
Are there a lot of other files in the directory that change that you
don''t want snapshots of? (Log/temp files can add a lot of space to
snapshots in zfs. If this is the case and they can''t be moved to a
different filesystem, zfs may not be for you)
Do you want a snapshot of all files at once or single files at a time? (all at
once would be better for zfs, single files at a time would indicate cvs)

>From what I have read, I think your best bet is to use ZFS rather than
version control software. Also note that you can do snapshots by file system
(and thus by directory) and don''t have to snapshot the entire pool at
once. Therefore, you could have zpool ''tank'' with two
filesystems - tank/production/snapshot and tank/production/no_snapshot  - setup
and segregate your files by whether or not to take snapshots.
Eric
 
 
This message posted from opensolaris.org

zfs discuss - Mar 2007 - File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] File level snapshots in ZFS?

[zfs-discuss] Re: File level snapshots in ZFS?