PC Review


Reply
Thread Tools Rate Thread

Recover data from a failed RAID5 array

 
 
Martin Goldmann
Guest
Posts: n/a
 
      4th Mar 2008
Hi!

I have an Adaptec 2410SA controller with four 160GB drives configured in a
RAID5 array for a total capacity 480GB.

The array contains a lot of very important data, which I can't bear to
loose. A lot of homemade Cubase music, pictures of my kids growing up, home
video...

The other day, one of the drives began making a unhealthy clicking noise.
Since I didn't have a spare disk at hand, I ordered a new one online.
Unfortunately, before the the new drive arrived, something strange happened.
For some reason, one of the remaining drives went offline, and the
controller will not recognize the array any longer.
I have since found out, that the first drive failed because of a loose
connection in the SATA power connector. Since the drive also has a legacy
4-pin molex connector, I now have the drive up and running again. However,
the Adaptec controller still lists the drive as failed (SMART error), and
says that the array has two missing members.

I am pretty sure that at least three of the four drives are working
properly, but that the controller will not 'give them a try'.
I can't find any options in the controller BIOS or the Adaptec Storage
Manager software which will retest the drives or try and remount the array.

I have (finally!) gained access to the Command Line Interface, but being a
novice, I'm afraid I will do something wrong and permanently erase all array
data.

I'm hoping that someone here are more familiar with the Adaptec CLI, or know
of some other way of rescuing the data. I have plenty of storage space on
another computer to backup the recovered data to
Product documentation on the controller can be found here:
http://www.adaptec.com/en-US/support...ta/AAR-2410SA/

Also, please no comments on the importance of doing regular backups. I'm in
tears already!

Martin Goldmann,
Denmark

 
Reply With Quote
 
 
 
 
Michael Hawes
Guest
Posts: n/a
 
      4th Mar 2008

"Martin Goldmann" <(E-Mail Removed)> wrote in message
news:47cdb834$0$89167$(E-Mail Removed)...
> Hi!
>
> I have an Adaptec 2410SA controller with four 160GB drives configured in a
> RAID5 array for a total capacity 480GB.
>
> The array contains a lot of very important data, which I can't bear to
> loose. A lot of homemade Cubase music, pictures of my kids growing up,
> home video...
>
> The other day, one of the drives began making a unhealthy clicking noise.
> Since I didn't have a spare disk at hand, I ordered a new one online.
> Unfortunately, before the the new drive arrived, something strange
> happened. For some reason, one of the remaining drives went offline, and
> the controller will not recognize the array any longer.
> I have since found out, that the first drive failed because of a loose
> connection in the SATA power connector. Since the drive also has a legacy
> 4-pin molex connector, I now have the drive up and running again. However,
> the Adaptec controller still lists the drive as failed (SMART error), and
> says that the array has two missing members.
>
> I am pretty sure that at least three of the four drives are working
> properly, but that the controller will not 'give them a try'.
> I can't find any options in the controller BIOS or the Adaptec Storage
> Manager software which will retest the drives or try and remount the
> array.
>
> I have (finally!) gained access to the Command Line Interface, but being a
> novice, I'm afraid I will do something wrong and permanently erase all
> array data.
>
> I'm hoping that someone here are more familiar with the Adaptec CLI, or
> know of some other way of rescuing the data. I have plenty of storage
> space on another computer to backup the recovered data to
> Product documentation on the controller can be found here:
> http://www.adaptec.com/en-US/support...ta/AAR-2410SA/
>
> Also, please no comments on the importance of doing regular backups. I'm
> in tears already!
>
> Martin Goldmann,
> Denmark
>

Chances are not good. When the first drive went offline it was no longer in
the array. Any data written to the other drives after this event was not
written to the failed drive and the other 3 drives became a stripe array. If
you lose a drive from a stripe array (RAID0) then you lose all the data. If
you revive the second faulty software you may find recovery program that can
rebuild the RAID0. If your DATA was THAT important it should be backed up to
external storage device, or you dhould have spare drive onsite for immeduate
replacement.

Mike.


 
Reply With Quote
 
Martin Goldmann
Guest
Posts: n/a
 
      5th Mar 2008

Thank you for your reply, Michael!


"Michael Hawes" <(E-Mail Removed)> wrote in message
news:47cdd5d5$(E-Mail Removed)...
> Chances are not good. When the first drive went offline it was no longer
> in the array. Any data written to the other drives after this event was
> not written to the failed drive and the other 3 drives became a stripe
> array. If you lose a drive from a stripe array (RAID0) then you lose all
> the data.


I don't think that much (if anything) was written to the drives after the
first drive died. Also, the second drive failing, I suspect was caused by by
some error in the controller. The drive seems to be OK, and will complete a
verify command.


> If you revive the second faulty software you may find recovery program
> that
> can rebuild the RAID0.


I'm not sure what you're saying here. Faulty sowftware?


> If your DATA was THAT important it should be backed up to external storage
> device, or you dhould have spare drive onsite for immeduate
> replacement.


Yes, I'm painfully aware of it. I should have made backups, and should have
had a spare drive, and a spare controller too i guess.

Best regards,
Martin Goldmann

 
Reply With Quote
 
Arno Wagner
Guest
Posts: n/a
 
      5th Mar 2008
Previously Martin Goldmann <(E-Mail Removed)> wrote:
> Hi!


> I have an Adaptec 2410SA controller with four 160GB drives configured in a


Aha, that pice of trash. I have one of these controllers that serves
as a paperweight, since it was extremely unreliable and the management
tools were a pain. It had d tendency to kick disks from the array
without good reason and without useful diagnostics as to why it
had kicked the disk.

> RAID5 array for a total capacity 480GB.


> The array contains a lot of very important data, which I can't bear to
> loose. A lot of homemade Cubase music, pictures of my kids growing up, home
> video...


> The other day, one of the drives began making a unhealthy clicking noise.
> Since I didn't have a spare disk at hand, I ordered a new one online.
> Unfortunately, before the the new drive arrived, something strange happened.
> For some reason, one of the remaining drives went offline, and the
> controller will not recognize the array any longer.


That is the reason you should either have a spare on hand or take
the array down until you have one. Sorry.

> I have since found out, that the first drive failed because of a loose
> connection in the SATA power connector. Since the drive also has a legacy
> 4-pin molex connector, I now have the drive up and running again. However,
> the Adaptec controller still lists the drive as failed (SMART error), and
> says that the array has two missing members.


The first drive will be out of sync, unless you mounted the whole
array as read-only during the time it was degraded.

> I am pretty sure that at least three of the four drives are working
> properly, but that the controller will not 'give them a try'.
> I can't find any options in the controller BIOS or the Adaptec Storage
> Manager software which will retest the drives or try and remount the array.


> I have (finally!) gained access to the Command Line Interface, but being a
> novice, I'm afraid I will do something wrong and permanently erase all array
> data.


The right way to do this is to image all the drives before
continuing. And the CLI of this controller is barely usable.

> I'm hoping that someone here are more familiar with the Adaptec CLI, or know
> of some other way of rescuing the data. I have plenty of storage space on
> another computer to backup the recovered data to
> Product documentation on the controller can be found here:
> http://www.adaptec.com/en-US/support...ta/AAR-2410SA/


> Also, please no comments on the importance of doing regular
> backups. I'm in tears already!


Make image copies (sector wise) of all the 4 drives. Then you can
experiment with low risk. What you likely need to do is to get the
second drive that failed back, unless you stopped writing to the
array after the first drive failed. In that case forcing it to use
the first drive again (if possible) may solve the issue with only
minor damage to the data.

Professional data recovery might be a good option. After all,
your data is completely available on the two working drives and
the second one that failed. Repairing that one should solve the issue.
You may also be able for force the controller to accept the second
failed drive again, but from what I remember of the CLI, I
am not sure that is possible.

If you want to risk working on this yourself, the next step
would be to diagnose the second failed drive and after that the
sector-wise backups of all drives.

I would also advise to throw the controller away after this
and get a better one. 3ware, for example, has a good reputation.

Arno

 
Reply With Quote
 
Mike Tomlinson
Guest
Posts: n/a
 
      6th Mar 2008
In article <47cdd5d5$(E-Mail Removed)>, Michael Hawes
<(E-Mail Removed)> writes

>Chances are not good. When the first drive went offline it was no longer in
>the array. Any data written to the other drives after this event was not
>written to the failed drive and the other 3 drives became a stripe array.


Are you sure about that? It should have continued as a RAID5, albeit
running in degraded mode.

If the OP can persuade the array to accept the first failed drive (the
one with the loose power connector) again, he may stand a chance of
recovering his data. It seems to me that is what he is asking.

--
(\__/) Bunny says NO to Windows Vista!
(='.'=) http://www.cs.auckland.ac.nz/~pgut00...ista_cost.html
(")_(")

 
Reply With Quote
 
Arno Wagner
Guest
Posts: n/a
 
      6th Mar 2008
Previously Mike Tomlinson <(E-Mail Removed)> wrote:
> In article <47cdd5d5$(E-Mail Removed)>, Michael Hawes
> <(E-Mail Removed)> writes


>>Chances are not good. When the first drive went offline it was no longer in
>>the array. Any data written to the other drives after this event was not
>>written to the failed drive and the other 3 drives became a stripe array.


> Are you sure about that? It should have continued as a RAID5, albeit
> running in degraded mode.


> If the OP can persuade the array to accept the first failed drive (the
> one with the loose power connector) again, he may stand a chance of
> recovering his data. It seems to me that is what he is asking.


The problem is that if the OP wrote anything at all to the array
after the first drive failed, adding the first drive will corrupt
the areas written to. What the OP really needs is to get the
second failed drive working ahain, as there will have been no
writes to the then broken array.

And np, a degraded RAID5 is not a striping array. Since RAID5
distributes the parity, a degraded RAID5 is a degraded RAID5
and nothing else. With RAID4 you could get something very close
to a stiping array, if the parity disk failed.

Arno

 
Reply With Quote
 
Floyd
Guest
Posts: n/a
 
      6th Mar 2008
Mike Tomlinson wrote
> In article <47cdd5d5$(E-Mail Removed)>, Michael Hawes <(E-Mail Removed)> writes
>
> > Chances are not good. When the first drive went offline it was no longer in
> > the array. Any data written to the other drives after this event was not
> > written to the failed drive and the other 3 drives became a stripe array.


> Are you sure about that?


Corse he is. How dare you question him.
This is Mike Hawes we are talking about.
Expert extraordinaire, *with bells on*.

> It should have continued as a RAID5,


And how is RAID5 not a striped array?

> albeit running in degraded mode.


> If the OP can persuade the array to accept the first failed drive (the
> one with the loose power connector) again, he may stand a chance of
> recovering his data. It seems to me that is what he is asking.


Right, so how come you aren't telling him how to do that.
 
Reply With Quote
 
Michael Hawes
Guest
Posts: n/a
 
      6th Mar 2008

"Arno Wagner" <(E-Mail Removed)> wrote in message
news:(E-Mail Removed)...
> Previously Mike Tomlinson <(E-Mail Removed)> wrote:
>> In article <47cdd5d5$(E-Mail Removed)>, Michael Hawes
>> <(E-Mail Removed)> writes

>
>>>Chances are not good. When the first drive went offline it was no longer
>>>in
>>>the array. Any data written to the other drives after this event was not
>>>written to the failed drive and the other 3 drives became a stripe array.

>
>> Are you sure about that? It should have continued as a RAID5, albeit
>> running in degraded mode.

>
>> If the OP can persuade the array to accept the first failed drive (the
>> one with the loose power connector) again, he may stand a chance of
>> recovering his data. It seems to me that is what he is asking.

>
> The problem is that if the OP wrote anything at all to the array
> after the first drive failed, adding the first drive will corrupt
> the areas written to. What the OP really needs is to get the
> second failed drive working ahain, as there will have been no
> writes to the then broken array.
>
> And np, a degraded RAID5 is not a striping array. Since RAID5
> distributes the parity, a degraded RAID5 is a degraded RAID5
> and nothing else. With RAID4 you could get something very close
> to a stiping array, if the parity disk failed.
>
> Arno
>
>

A degraded RAID5 distributes the data across the remaining drives
without any parity infomation. Please list the differences between that and
a RAID0 stripe set. When the second drive died all the data was lost.

Mike.


 
Reply With Quote
 
Stretch
Guest
Posts: n/a
 
      7th Mar 2008
Michael Hawes wrote in news:47d07533$(E-Mail Removed)
> "Arno Wagner" <(E-Mail Removed)> wrote in message
> news:(E-Mail Removed)...
> > Previously Mike Tomlinson <(E-Mail Removed)> wrote:
> > > In article <47cdd5d5$(E-Mail Removed)>, Michael Hawes
> > > <(E-Mail Removed)> writes

> >
> > > > Chances are not good. When the first drive went offline it was no longer
> > > > in
> > > > the array. Any data written to the other drives after this event was not
> > > > written to the failed drive and the other 3 drives became a stripe
> > > > array.

> >
> > > Are you sure about that? It should have continued as a RAID5, albeit
> > > running in degraded mode.

> >
> > > If the OP can persuade the array to accept the first failed drive (the
> > > one with the loose power connector) again, he may stand a chance of
> > > recovering his data. It seems to me that is what he is asking.

> >
> > The problem is that if the OP wrote anything at all to the array
> > after the first drive failed, adding the first drive will corrupt
> > the areas written to. What the OP really needs is to get the
> > second failed drive working ahain, as there will have been no
> > writes to the then broken array.
> >
> > And np, a degraded RAID5 is not a striping array. Since RAID5
> > distributes the parity, a degraded RAID5 is a degraded RAID5
> > and nothing else. With RAID4 you could get something very close
> > to a stiping array, if the parity disk failed.
> >
> > Arno
> >


> A degraded RAID5 distributes the data across the remaining drives
> without any parity infomation.


Ooh, so much conviction that it almost sounds plausible, doesn't it.

> Please list the differences between that and a RAID0 stripe set.
> When the second drive died all the data was lost.
>
> Mike.

 
Reply With Quote
 
Arno Wagner
Guest
Posts: n/a
 
      7th Mar 2008
Previously Michael Hawes <(E-Mail Removed)> wrote:

> "Arno Wagner" <(E-Mail Removed)> wrote in message
> news:(E-Mail Removed)...
>> Previously Mike Tomlinson <(E-Mail Removed)> wrote:
>>> In article <47cdd5d5$(E-Mail Removed)>, Michael Hawes
>>> <(E-Mail Removed)> writes

>>
>>>>Chances are not good. When the first drive went offline it was no longer
>>>>in
>>>>the array. Any data written to the other drives after this event was not
>>>>written to the failed drive and the other 3 drives became a stripe array.

>>
>>> Are you sure about that? It should have continued as a RAID5, albeit
>>> running in degraded mode.

>>
>>> If the OP can persuade the array to accept the first failed drive (the
>>> one with the loose power connector) again, he may stand a chance of
>>> recovering his data. It seems to me that is what he is asking.

>>
>> The problem is that if the OP wrote anything at all to the array
>> after the first drive failed, adding the first drive will corrupt
>> the areas written to. What the OP really needs is to get the
>> second failed drive working ahain, as there will have been no
>> writes to the then broken array.
>>
>> And np, a degraded RAID5 is not a striping array. Since RAID5
>> distributes the parity, a degraded RAID5 is a degraded RAID5
>> and nothing else. With RAID4 you could get something very close
>> to a stiping array, if the parity disk failed.
>>
>> Arno
>>
>>

> A degraded RAID5 distributes the data across the remaining drives
> without any parity infomation.


This is wrong. There is no data redistribution or movement when
a disk in a RAID5 fails. And thet is well, since such data movement
is a) not needed and b) would potentially cause additional
problems. Putting high load on a degraded array is a bad idea.
In fact, without hot or cold spare added, you should typically
take it down until you have a spare.

> Please list the differences between that and
> a RAID0 stripe set. When the second drive died all the data was lost.


It is quite true that the loss of a second drive takes down a RAID5,
similar to the loss of any drive in a RAID0. But the data is organized
differently before that happens and even afterwards.

Let me show you an example with 3 disks. Data is in layers from top
to bottom. Partity is like follows. '+' stands for xor and x<n>
for stripe n.

RAID5:

disk1 disk2 disk3
s0 s1 s0+s1
s2 s2+s3 s3
s4+s5 s4 s5

RAID5 degraded (here disk 3 has failed):

disk1 disk2
s0 s1
s2 s2+s3 since (s2+s3)+s2 = s3, s3 can be reconstucted
s4+s5 s4 since (s4+s5)+s4 = s5, s5 can be reconstructed

RAID0 with 2 disks:

disk1 disk2
s0 s1
s2 s3
s4 s5


This should make the difference clear.

Arno


 
Reply With Quote
 
 
 
Reply

Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
data recovery of a failed RAID5 array... Brad Pears Microsoft Windows 2000 Hardware 6 31st Jan 2009 10:01 PM
OS drives on own controller vs being on same controller with data raid5 array (performance?) markm75 Storage Devices 3 4th Jul 2007 12:53 AM
How to recover failed spanned array? Mike Busch Windows XP Hardware 5 16th Apr 2004 02:46 AM
Can't expand RAID5 Array Brad Reichert Microsoft Windows 2000 File System 0 10th Feb 2004 01:53 PM
Expanding RAID5 Array Brad Reichert Microsoft Windows 2000 File System 1 19th Jan 2004 10:16 PM


Features
 

Advertising
 

Newsgroups
 


All times are GMT +1. The time now is 08:17 AM.