Is there any way to stop a resilver? We gotta stop this thing - at minimum, completion time is 300,000 hours, and maximum is in the millions. Raidz2 array, so it has the redundancy, we just need to get data off. -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/9dbb6cf5/attachment.html>
Has it been running long? Initially the numbers are way off. After a while it settles down into something reasonable. How many disks, and what size, are in your raidz2? -Scott On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com> wrote:> Is there any way to stop a resilver? > > We gotta stop this thing - at minimum, completion time is 300,000 hours, and > maximum is in the millions. > > Raidz2 array, so it has the redundancy, we just need to get data off.-------------------------------------------------------------------------------- We value your opinion! How may we serve you better? Please click the survey link to tell us how we are doing: http://www.craneae.com/ContactUs/VoiceofCustomer.aspx Your feedback is of the utmost importance to us. Thank you for your time. -------------------------------------------------------------------------------- Crane Aerospace & Electronics Confidentiality Statement: The information contained in this email message may be privileged and is confidential information intended only for the use of the recipient, or any employee or agent responsible to deliver it to the intended recipient. Any unauthorized use, distribution or copying of this information is strictly prohibited and may be unlawful. If you have received this communication in error, please notify the sender immediately and destroy the original message and all attachments from your electronic files. -------------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/a79aa958/attachment.html>
It''s always running less than an hour. It usually starts at around 300,000h estimate(at 1m in), goes up to an estimate in the millions(about 30mins in) and restarts. Never gets past 0.00% completion, and K resilvered on any LUN. 64 LUNs, 32x5.44T, 32x10.88T in 8 vdevs. On Wed, Sep 29, 2010 at 11:40 AM, Scott Meilicke < scott.meilicke at craneaerospace.com> wrote:> Has it been running long? Initially the numbers are *way* off. After a > while it settles down into something reasonable. > > How many disks, and what size, are in your raidz2? > > -Scott > > > On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com> wrote: > > Is there any way to stop a resilver? > > We gotta stop this thing - at minimum, completion time is 300,000 hours, > and maximum is in the millions. > > Raidz2 array, so it has the redundancy, we just need to get data off. > > ------------------------------ > We value your opinion! <http://www.craneae.com/surveys/satisfaction.htm>How may we serve you better?Please click the survey link to tell us how we > are doing: <http://www.craneae.com/surveys/satisfaction.htm> > http://www.craneae.com/surveys/satisfaction.htm > > Your feedback is of the utmost importance to us. Thank you for your time. > > Crane Aerospace & Electronics Confidentiality Statement: > The information contained in this email message may be privileged and is > confidential information intended only for the use of the recipient, or any > employee or agent responsible to deliver it to the intended recipient. Any > unauthorized use, distribution or copying of this information is strictly > prohibited and may be unlawful. If you have received this communication in > error, please notify the sender immediately and destroy the original message > and all attachments from your electronic files. >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/ea4fbd01/attachment-0001.html>
What version of OS? Are snapshots running (turn them off). So are there eight disks? On 9/29/10 8:46 AM, "LIC mesh" <licmesh at gmail.com> wrote:> It''s always running less than an hour. > > It usually starts at around 300,000h estimate(at 1m in), goes up to an > estimate in the millions(about 30mins in) and restarts. > > Never gets past 0.00% completion, and K resilvered on any LUN. > > 64 LUNs, 32x5.44T, 32x10.88T in 8 vdevs. > > > > > On Wed, Sep 29, 2010 at 11:40 AM, Scott Meilicke > <scott.meilicke at craneaerospace.com> wrote: >> Has it been running long? Initially the numbers are way off. After a while it >> settles down into something reasonable. >> >> How many disks, and what size, are in your raidz2? ? >> >> -Scott >> >> >> On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com <http://licmesh at gmail.com> >> > wrote: >> >>> Is there any way to stop a resilver? >>> >>> We gotta stop this thing - at minimum, completion time is 300,000 hours, and >>> maximum is in the millions. >>> >>> Raidz2 array, so it has the redundancy, we just need to get data off.-------------------------------------------------------------------------------- We value your opinion! How may we serve you better? Please click the survey link to tell us how we are doing: http://www.craneae.com/ContactUs/VoiceofCustomer.aspx Your feedback is of the utmost importance to us. Thank you for your time. -------------------------------------------------------------------------------- Crane Aerospace & Electronics Confidentiality Statement: The information contained in this email message may be privileged and is confidential information intended only for the use of the recipient, or any employee or agent responsible to deliver it to the intended recipient. Any unauthorized use, distribution or copying of this information is strictly prohibited and may be unlawful. If you have received this communication in error, please notify the sender immediately and destroy the original message and all attachments from your electronic files. -------------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/549cc2b5/attachment.html>
What caused the resilvering to kick off in the first place? Lin On Sep 29, 2010, at 8:46 AM, LIC mesh wrote:> It''s always running less than an hour. > > It usually starts at around 300,000h estimate(at 1m in), goes up to an estimate in the millions(about 30mins in) and restarts. > > Never gets past 0.00% completion, and K resilvered on any LUN. > > 64 LUNs, 32x5.44T, 32x10.88T in 8 vdevs. > > > > > On Wed, Sep 29, 2010 at 11:40 AM, Scott Meilicke <scott.meilicke at craneaerospace.com> wrote: > Has it been running long? Initially the numbers are way off. After a while it settles down into something reasonable. > > How many disks, and what size, are in your raidz2? > > -Scott > > > On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com> wrote: > > Is there any way to stop a resilver? > > We gotta stop this thing - at minimum, completion time is 300,000 hours, and maximum is in the millions. > > Raidz2 array, so it has the redundancy, we just need to get data off. > > > We value your opinion! How may we serve you better?Please click the survey link to tell us how we are doing: http://www.craneae.com/surveys/satisfaction.htm > > Your feedback is of the utmost importance to us. Thank you for your time. > > Crane Aerospace & Electronics Confidentiality Statement: > The information contained in this email message may be privileged and is confidential information intended only for the use of the recipient, or any employee or agent responsible to deliver it to the intended recipient. Any unauthorized use, distribution or copying of this information is strictly prohibited and may be unlawful. If you have received this communication in error, please notify the sender immediately and destroy the original message and all attachments from your electronic files. > > > _______________________________________________ > zfs-discuss mailing list > zfs-discuss at opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/48321c92/attachment.html>
This is an iSCSI/COMSTAR array. The head was running 2009.06 stable with version 14 ZFS, but we updated that to build 134 (kept the old OS drives) - did not, however, update the zpool - it''s still version 14. The targets are all running 2009.06 stable, exporting 4 raidz1 LUNs each of 6 drives - 8 shelves have 1TB drives, the other 8 have 2TB drives. The head sees the filesystem as comprised of 8 vdevs of 8 iSCSI LUNs each, with SSD ZIL and SSD L2ARC. On Wed, Sep 29, 2010 at 11:49 AM, Scott Meilicke < scott.meilicke at craneaerospace.com> wrote:> What version of OS? > Are snapshots running (turn them off). > > So are there eight disks? > > > > On 9/29/10 8:46 AM, "LIC mesh" <licmesh at gmail.com> wrote: > > It''s always running less than an hour. > > It usually starts at around 300,000h estimate(at 1m in), goes up to an > estimate in the millions(about 30mins in) and restarts. > > Never gets past 0.00% completion, and K resilvered on any LUN. > > 64 LUNs, 32x5.44T, 32x10.88T in 8 vdevs. > > > > > On Wed, Sep 29, 2010 at 11:40 AM, Scott Meilicke < > scott.meilicke at craneaerospace.com> wrote: > > Has it been running long? Initially the numbers are *way* off. After a > while it settles down into something reasonable. > > How many disks, and what size, are in your raidz2? > > -Scott > > > On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com < > http://licmesh at gmail.com> > wrote: > > Is there any way to stop a resilver? > > We gotta stop this thing - at minimum, completion time is 300,000 hours, > and maximum is in the millions. > > Raidz2 array, so it has the redundancy, we just need to get data off. > > ------------------------------ > We value your opinion! <http://www.craneae.com/surveys/satisfaction.htm>How may we serve you better?Please click the survey link to tell us how we > are doing: <http://www.craneae.com/surveys/satisfaction.htm> > http://www.craneae.com/surveys/satisfaction.htm > > Your feedback is of the utmost importance to us. Thank you for your time. > > Crane Aerospace & Electronics Confidentiality Statement: > The information contained in this email message may be privileged and is > confidential information intended only for the use of the recipient, or any > employee or agent responsible to deliver it to the intended recipient. Any > unauthorized use, distribution or copying of this information is strictly > prohibited and may be unlawful. If you have received this communication in > error, please notify the sender immediately and destroy the original message > and all attachments from your electronic files. >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/d38ffe00/attachment.html>
Most likely an iSCSI timeout, but that was before my time here. Since then, there have been various individual drives lost along the way on the shelves, but never a whole LUN, so, theoretically, /except/ for iSCSI timeouts, there has been no great reason to resilver. On Wed, Sep 29, 2010 at 11:51 AM, Lin Ling <lin.ling at oracle.com> wrote:> > What caused the resilvering to kick off in the first place? > > Lin > > On Sep 29, 2010, at 8:46 AM, LIC mesh wrote: > > It''s always running less than an hour. > > It usually starts at around 300,000h estimate(at 1m in), goes up to an > estimate in the millions(about 30mins in) and restarts. > > Never gets past 0.00% completion, and K resilvered on any LUN. > > 64 LUNs, 32x5.44T, 32x10.88T in 8 vdevs. > > > > > On Wed, Sep 29, 2010 at 11:40 AM, Scott Meilicke < > scott.meilicke at craneaerospace.com> wrote: > >> Has it been running long? Initially the numbers are *way* off. After a >> while it settles down into something reasonable. >> >> How many disks, and what size, are in your raidz2? >> >> -Scott >> >> >> On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com> wrote: >> >> Is there any way to stop a resilver? >> >> We gotta stop this thing - at minimum, completion time is 300,000 hours, >> and maximum is in the millions. >> >> Raidz2 array, so it has the redundancy, we just need to get data off. >> >> >> >> ------------------------------ >> We value your opinion! <http://www.craneae.com/surveys/satisfaction.htm>How may we serve you better?Please click the survey link to tell us how we >> are doing: <http://www.craneae.com/surveys/satisfaction.htm> >> http://www.craneae.com/surveys/satisfaction.htm >> >> Your feedback is of the utmost importance to us. Thank you for your time. >> >> Crane Aerospace & Electronics Confidentiality Statement: >> The information contained in this email message may be privileged and is >> confidential information intended only for the use of the recipient, or any >> employee or agent responsible to deliver it to the intended recipient. Any >> unauthorized use, distribution or copying of this information is strictly >> prohibited and may be unlawful. If you have received this communication in >> error, please notify the sender immediately and destroy the original message >> and all attachments from your electronic files. >> > > _______________________________________________ > zfs-discuss mailing list > zfs-discuss at opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > >-------------- next part -------------- An HTML attachment was scrubbed... URL: <http://mail.opensolaris.org/pipermail/zfs-discuss/attachments/20100929/16ac324d/attachment-0001.html>
Can you post the output of ''zpool status''? Thanks, George LIC mesh wrote:> Most likely an iSCSI timeout, but that was before my time here. > > Since then, there have been various individual drives lost along the way > on the shelves, but never a whole LUN, so, theoretically, /except/ for > iSCSI timeouts, there has been no great reason to resilver. > > > > On Wed, Sep 29, 2010 at 11:51 AM, Lin Ling <lin.ling at oracle.com > <mailto:lin.ling at oracle.com>> wrote: > > > What caused the resilvering to kick off in the first place? > > Lin > > On Sep 29, 2010, at 8:46 AM, LIC mesh wrote: > >> It''s always running less than an hour. >> >> It usually starts at around 300,000h estimate(at 1m in), goes up >> to an estimate in the millions(about 30mins in) and restarts. >> >> Never gets past 0.00% completion, and K resilvered on any LUN. >> >> 64 LUNs, 32x5.44T, 32x10.88T in 8 vdevs. >> >> >> >> >> On Wed, Sep 29, 2010 at 11:40 AM, Scott Meilicke >> <scott.meilicke at craneaerospace.com >> <mailto:scott.meilicke at craneaerospace.com>> wrote: >> >> Has it been running long? Initially the numbers are *way* off. >> After a while it settles down into something reasonable. >> >> How many disks, and what size, are in your raidz2? >> >> -Scott >> >> >> On 9/29/10 8:36 AM, "LIC mesh" <licmesh at gmail.com >> <http://licmesh at gmail.com/>> wrote: >> >> Is there any way to stop a resilver? >> >> We gotta stop this thing - at minimum, completion time is >> 300,000 hours, and maximum is in the millions. >> >> Raidz2 array, so it has the redundancy, we just need to >> get data off. >> >> >> >> ------------------------------------------------------------------------ >> We value your opinion! >> <http://www.craneae.com/surveys/satisfaction.htm> How may we >> serve you better?Please click the survey link to tell us how >> we are doing: >> <http://www.craneae.com/surveys/satisfaction.htm>http://www.craneae.com/surveys/satisfaction.htm >> >> Your feedback is of the utmost importance to us. Thank you for >> your time. >> >> Crane Aerospace & Electronics Confidentiality Statement: >> The information contained in this email message may be >> privileged and is confidential information intended only for >> the use of the recipient, or any employee or agent responsible >> to deliver it to the intended recipient. Any unauthorized use, >> distribution or copying of this information is strictly >> prohibited and may be unlawful. If you have received this >> communication in error, please notify the sender immediately >> and destroy the original message and all attachments from your >> electronic files. >> >> >> _______________________________________________ >> zfs-discuss mailing list >> zfs-discuss at opensolaris.org <mailto:zfs-discuss at opensolaris.org> >> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > > > ------------------------------------------------------------------------ > > _______________________________________________ > zfs-discuss mailing list > zfs-discuss at opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss