System freezing up

K

kony

They clearly feel otherwise. And pointless with online gaming anyway.


Not really, it's more a matter of what's available for
purchase. Choices in *PC* systems supporting ECC are quite
limited.


But would not lockup randomly as the OP is seeing.
It would lock up completely reproducibly.

Maybe or maybe not... some software errors aren't so easily
reproduced, except for the common factor that the same
software is what ends up crashing. Take a hex editor and
flip a few bits in excell.exe... it won't crash every time
(depending on the locations), may not ever crash for that
matter, but it may not consistently do so either.


It would still be a completely reproducible lockup once the file is corrupted.


If you could control every tiny aspect of the program, which
isn't so likely with closed source software. Even so it
would be reproducable to the extent that same app was
crashing not something else, except that if there were
memory errors the odds of other files, other apps being
effected too might be quite a bit higher, and/or DLL used by
them that are a part of windows.
 
P

paulmd

Rod said:
Nope, still wont produce a freeze, the system will time out.

I'm recovering data right now from a hard drive that has frozen every
computer it's been attached to. At least in windows.
 
J

James Brown

kony said:
Not really,

Fraid so.
it's more a matter of what's available for purchase.

Nope, if it really was essential, it would be readily available.

He's always had a mindless bee in his bonnet about ECC.
Choices in *PC* systems supporting ECC are quite limited.

Because it isnt as essential as he mindlessly claims.
Maybe or maybe not...

Nope, certainly not.
some software errors aren't so easily reproduced,

They are if its due to a damaged file damaged in the way he claimed.
except for the common factor that the
same software is what ends up crashing.

It will crash completely reproducibly when
that particular op affected is executed.
Take a hex editor and flip a few bits in excell.exe...
it won't crash every time (depending on the locations),

Never said it would. But if it does crash when doing a
particular op due to that sort of corruption, that will be
completely reproducible when that op is invoked etc.
may not ever crash for that matter,
Duh.

but it may not consistently do so either.

It has to with the particular op thats been damaged by the corruption.
If you could control every tiny aspect of the program,
which isn't so likely with closed source software.

You dont need to do that. The op thats got corrupted will
completely reproducibly go flat on its face whenever its executed.
Even so it would be reproducable to the extent that same
app was crashing not something else, except that if there
were memory errors the odds of other files, other apps
being effected too might be quite a bit higher, and/or
DLL used by them that are a part of windows.

Meaningless waffle.
 
C

CBFalconer

James said:
They clearly feel otherwise. And pointless with online gaming anyway.


But would not lockup randomly as the OP is seeing.
It would lock up completely reproducibly.


It would still be a completely reproducible lockup once the file is
corrupted.

Provided the software gets all the same inputs and timing and is
loaded in the same physical addresses. Not very likely.

At any rate, one thing you can do is take a MD5sum of every
executable on the system (which includes scripts, dlls, sos, libs,
configuration files, etc.). (Option: Set them all to be r/o.)
Periodically run MD5sum -c on the output of the earlier run. This
will catch any changes in those files. On windoze don't forget the
hidden or system directories or files.

I have an equivalent program (validate) which extends the files by
two bytes to store a CCIT16 checksum. That way the sum moves with
the file. Some files won't stand the extension. In particular
text files need an added EOF mark before the checksum, thus 3 byte
extension, and that fouls some editors/compilers etc. Validate is
available on my page, download section. You can use both if you
wish.
 
K

kony

Fraid so.


Nope, if it really was essential, it would be readily available.

Nothing is "really essential". Want an optical drive? I
can claim it isn't essential and if nobody will sell you
one, where are you going to get it? Not having one wouldn't
diminish the usefulness of having it.

He's always had a mindless bee in his bonnet about ECC.

.... and he's right, any serious use system should have it,
but unfortunately the industry was too concerned about
buyers not perceiving the significance and only saw they
could produce a system for a couple dozen dollars less w/o
ECC. It became less popular, lower volumem sales and that
also contributed to higher prices... pricing itself outside
of what most people deem it worth, for artificial reasons,
not for the intrinsic value. There are lots of other areas
in systems where someone tried (did) save a dollar here or
there and these things end up problematic for many users
too, like low quality fans or PSU.



Because it isnt as essential as he mindlessly claims.

By a similar token we could claim a CPU's cache doesn't need
parity or ECC protection, but there it is. Perhaps you
don't feel you need it, and discount anyone else having a
use for this extra protection against memory errors.

Could just be you want cheap without consideration of any
possible side effects. Lots of people build systems that
way, buy a cheap case or keyboard or whatever, but in the
end in is an inferior system in total. "Essential"?
Subjective term in this case.


Nope, certainly not.


They are if its due to a damaged file damaged in the way he claimed.

Nope, as I'd already mentioned you can easily take a hex
editor and flip some bits in an application file and NOT
easily reproduce the error as you suggest. In theory there
might be a reproducible way to cause an error as a result,
but ever determining that method is not presumed easy... all
a matter of what in particular was changed.

Really no point in your arguing about this, as I suggested
twice now you can edit a file yourself and see the proof.

It will crash completely reproducibly when
that particular op affected is executed.

In theory, but you're trying to presume only one isolated
operation and nothing more. Lots of codepaths in some apps,
it may not be so clear and you could go years inbetween
errors without being able to realize it was a bug in code,
or corruption (instead of placing blame elsewhere). That's
a long way from any reasonable presumption of reproducing
it, as-in, actually reproducing it.



Never said it would. But if it does crash when doing a
particular op due to that sort of corruption, that will be
completely reproducible when that op is invoked etc.

IF you can reproduce it like this, you can backtrack and
make the claim. That's not the same thing as being able to
always attribute a crash to corruption. You're also
entirely ignoring other problems besides crashes. If a
calculation went awry, or a graphical element is wrong, you
might not notice or assume you entered data wrong or myriad
other scenarios and potential conclusions. It is not at all
presumable that you have such a simple single operation
resulting in a repeatable crash to backtrack.

It has to with the particular op thats been damaged by the corruption.

Not in a way that you'd necessarily notice, and if there is
ongoing corruption do you expect it to magically stop
happening? Isolating one repeatable op failure is not at
all going to be easy if there is (the liklihood of) ongoing
further damage to that app, other windows apps, etc.

If you saw several apps crashing and the OS, would you
presume each and every file was corrupt or some other
problem? It's not just one op in one app then, it "seems"
random.



You dont need to do that. The op thats got corrupted will
completely reproducibly go flat on its face whenever its executed.

Your argument only works when ignoring all the possible
variables. If you could somehow fix all these variable, yes
you would be able to reproduce it, and check some files.
Without fixation of the variables, show us people who have
found this fault. We can assume there are people with
instable memory but what percent actually DO find this
fault? That's the whole point of ECC, you don't have to FIX
a problem like this, don't have to wonder if there's a error
you will find, or will remain hidden. I can assure you I
could edit some files on your system and you would not know
based on using the application, but it might effect the data
you produce, or the reliability of the system when it
matters. Continually finding problems and fixing them is a
far worse alternative than not having any.
 
R

ruthlezz_1

Thanks to all who contributed information/suggestions. It looks like I
have a lot of diagnosis ahead of me.
 
J

James Brown

Nothing is "really essential".

Mindless stuff with what is readily buyable.
Want an optical drive? I can claim it isn't essential and
if nobody will sell you one, where are you going to get it?
Not having one wouldn't diminish the usefulness of having it.

Irrelevant to what gets offered by the market.
... and he's right, any serious use system should have it,

Pity that online games aint anything even remotely
resembling anything like a 'serious use system'
but unfortunately the industry was too concerned about
buyers not perceiving the significance and only saw they
could produce a system for a couple dozen dollars less w/o
ECC.

The industry has actually noticed that it isnt necessary
with personal desktop systems, and the industry is right.
It became less popular, lower volumem sales and that
also contributed to higher prices... pricing itself outside
of what most people deem it worth, for artificial reasons,

Nope, because they noticed that the systems without it work fine.
not for the intrinsic value. There are lots of other areas
in systems where someone tried (did) save a dollar here or
there and these things end up problematic for many users
too, like low quality fans or PSU.

Irrelevant to whether systems without ECC work fine.
By a similar token we could claim a CPU's cache doesn't
need parity or ECC protection, but there it is. Perhaps you
don't feel you need it, and discount anyone else having a
use for this extra protection against memory errors.

You two clowns claim its essential. It aint.
Could just be you want cheap without consideration of any
possible side effects.

Or hordes have noticed that systems without ECC work fine.
Lots of people build systems that way, buy a cheap case
or keyboard or whatever, but in the end in is an inferior
system in total. "Essential"? Subjective term in this case.

Pity about his original claim.
Yep.

as I'd already mentioned you can easily take a hex
editor and flip some bits in an application file and NOT
easily reproduce the error as you suggest.

You've misunderstood what is meant. No one ever claimed
that ever corruption would always produce a visible error,
obviously if that code is never executed, it never will.

HOWEVER, if that corrupted code IS executed, you
will see the same effect every time it IS EXECUTED.
In theory there might be a reproducible way to cause an error
as a result, but ever determining that method is not presumed
easy... all a matter of what in particular was changed.
Really no point in your arguing about this,

Yep, you dont have a clue about the basics
and never do manage to work them out.
as I suggested twice now you can
edit a file yourself and see the proof.

It aint the proof of what is actually being discussed.

Not that you'll ever be able to grasp that.
In theory, but you're trying to presume only
one isolated operation and nothing more.

Nope, assuming nothing of the sort.

JUST stating that if that bit of code gets corrupted,
you'll get a completely reproducible result every
time THAT BIT OF CODE IS EXECUTED.
Lots of codepaths in some apps,
it may not be so clear and you could go years inbetween
errors without being able to realize it was a bug in code,
or corruption (instead of placing blame elsewhere). That's
a long way from any reasonable presumption of reproducing
it, as-in, actually reproducing it.

Irrelevant waffle.
IF you can reproduce it like this, you can backtrack and
make the claim. That's not the same thing as being able to
always attribute a crash to corruption.

No one ever said it was.
You're also entirely ignoring other problems besides crashes.

Nope, that was what was being discussed.
If a calculation went awry, or a graphical element is wrong,
you might not notice or assume you entered data wrong or
myriad other scenarios and potential conclusions.

Those would STILL BE COMPLETELY REPRODUCIBLE,
albeit pass unnoticed much of the time.
It is not at all presumable that you have such a simple single
operation resulting in a repeatable crash to backtrack.

No one ever said it was.
Not in a way that you'd necessarily notice,

Irrelevant whether you notice it or not, it still happens.
and if there is ongoing corruption do you
expect it to magically stop happening?
Nope.

Isolating one repeatable op failure is not at
all going to be easy if there is (the liklihood of) ongoing
further damage to that app, other windows apps, etc.

Irrelevant to whether the corruption that has occured
will produce completely reproducible effects.
If you saw several apps crashing and the OS, would you
presume each and every file was corrupt or some other
problem? It's not just one op in one app then, it "seems"
random.


Irrelevant to whether the corruption that has occured
will produce completely reproducible effects.
Your argument only works when ignoring all the possible variables.
Nope.

If you could somehow fix all these variable, yes
you would be able to reproduce it, and check some files.

I wasnt talking about diagnosing where the corruption is.

I JUST said that once the corruption that has occured
it will produce completely reproducible effects.
Without fixation of the variables, show us people who have
found this fault. We can assume there are people with
instable memory but what percent actually DO find this
fault? That's the whole point of ECC, you don't have to FIX
a problem like this, don't have to wonder if there's a error
you will find, or will remain hidden.

Irrelevant to whether the corruption that has occured
will produce completely reproducible effects.
I can assure you I
could edit some files on your system and you would not know
based on using the application, but it might effect the data
you produce, or the reliability of the system when it
matters. Continually finding problems and fixing them is a
far worse alternative than not having any.

Irrelevant to whether the corruption that has occured
will produce completely reproducible effects.
 
J

James Brown

Provided the software gets all the same inputs and
timing and is loaded in the same physical addresses.

Nope. Once the file is corrupt, the physical addresses are irrelevant.
Not very likely.
At any rate, one thing you can do is take a MD5sum of every
executable on the system (which includes scripts, dlls, sos, libs,
configuration files, etc.). (Option: Set them all to be r/o.)
Periodically run MD5sum -c on the output of the earlier run. This
will catch any changes in those files. On windoze don't forget the
hidden or system directories or files.

So your original claim that ECC is essential for personal desktop systems
is clearly just plain wrong. There are other perfectly viable approaches.
I have an equivalent program (validate) which extends the files by
two bytes to store a CCIT16 checksum. That way the sum moves with
the file. Some files won't stand the extension. In particular
text files need an added EOF mark before the checksum, thus 3 byte
extension, and that fouls some editors/compilers etc. Validate is
available on my page, download section. You can use both if you wish.

So your original claim that ECC is essential for personal desktop systems
is clearly just plain wrong. There are other perfectly viable approaches.

And some antivirus software does it like that, just to check for virus activity.
 
R

Rod Speed

I'm recovering data right now from a hard drive that has frozen every
computer it's been attached to. At least in windows.

Dont believe it froze at all, you just didnt wait long enough.
 
C

CBFalconer

James said:
.... snip ...


The industry has actually noticed that it isnt necessary
with personal desktop systems, and the industry is right.

No, the industry has noticed that they can get away with it due to
the ignorance of the unwashed public. Once someone loses their
data, and/or spends days or weeks looking for the cause and fix,
including multiple reloads and reinstallations, they may become
more appreciative of ECC. But the odds are they don't even know it
exists or what it does.

Similarly for disk backup systems.
 
C

CBFalconer

James said:
Nope. Once the file is corrupt, the physical addresses are irrelevant.

Obviously you have never spent any time diagnosing a fault. The
throw away society is here again.
So your original claim that ECC is essential for personal desktop systems
is clearly just plain wrong. There are other perfectly viable approaches.

And some antivirus software does it like that, just to check for virus
activity.

You want to stop everything every 5 minutes or so and spend 10
minutes to an hour or more running the checks? I would prefer to
have the machine do it for me. Instantaneouly, without fuss or
oversight. You also would have to spend much time setting it up,
after all some files have to be modifiable.
 
P

paulmd

Rod said:
Dont believe it froze at all, you just didnt wait long enough.

Actually, for once you're right. "Long enough" was on the order of 20
minutes. Which is a damn long time to wait with no screen image.
 
J

James Brown


Fraid so.
the industry has noticed that they can get away
with it due to the ignorance of the unwashed public.

Wrong. The industry has noticed that even if you furiously
run the sort of file checking ute you mentioned, you dont
in fact see file corruption happening.
Once someone loses their data, and/or spends days or weeks looking
for the cause and fix, including multiple reloads and reinstallations,

Taint gunna happen with a system that runs a decent memory tester like memtest86 fine.
they may become more appreciative of ECC.

Nope, they'll get a clue from advice like in here, that they might
well have a memory problem and how to test for that possibility.
But the odds are they don't even know it exists or what it does.

And hardly anyone will actually be stupid enough to suggest that
they replace the motherboard with one that supports ECC.
Similarly for disk backup systems.

Completely different, actually. And even you should have noticed that the
industry does offer disk backup mechanisms for personal desktop systems.

Funny that.
 
J

James Brown

Obviously you have never spent any time diagnosing a fault.

Guess who has just got egg all over its silly little face, very spectacularly indeed.

Been doing that like since before you were even born thanks.
The throw away society is here again.

Do the decent thing and set fire to yourself in 'protest' or sumfin.
You want to stop everything every 5 minutes or so and
spend 10 minutes to an hour or more running the checks?

Nope, I've actually got enough of a clue to realise that there is
absolutely no need to do that with a system that doesnt have
ECC and which passes a decent test like memtest86 fine.
I would prefer to have the machine do it for me.

Your problem.
Instantaneouly, without fuss or oversight.

But with a dramatic crippling of your choices with personal desktop motherboards.
You also would have to spend much time setting
it up, after all some files have to be modifiable.

Nope, I know how to fix that problem very quickly if it ever does happen, thanks.
 
K

kony

Completely different, actually. And even you should have noticed that the
industry does offer disk backup mechanisms for personal desktop systems.

Funny that.


Whole lotta good it'd do if the original is already corrupt.
 
J

James Brown

kony said:
On Thu, 4 Jan 2007 17:02:20 +1100, "James Brown"



Whole lotta good it'd do if the original is already corrupt.

Irrelevant to his claim about them not being AWARE of disk backup systems.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top