Constant Raid failure NB/SB to hot?

Athunt

New member
CORPORAL
Joined
Apr 21, 2010
Messages
15
Good evening gents :)

I have 2x OCZ vertex 30gb raid-0 in my system and my raid array fails every 1-2 months.. im getting a little bit tired of recovering backups so im trying to find the source of the problem.
I have a MSI Eclipse SLI motherboard with the buggy NB which sometimes (not often) goes up to 95 degrees (most of the time its around 85 degrees). I already started a topic on the OCZ forums aswell and they say it could be the NB/SB that messes up my raid all the time.

I have 2 Samsung Spinpoints running in raid-0 wihtout any problems. In my opinion it couldn't be the NB/SB cause once every 1-2 months doesnt seem alot for the NB/SB to be the problem, i think it would occur alot more times if it really was the NB/SB. Putting new cooling on the NB/SB would be my extreme solution cause i have my CPU water cooled and it would take me sometime to take the mobo out again and trying to fix the problem.

I have my i7 920 running @ 4Ghz 24/2 at max load 60 degrees and is Linx stable (Ran it for multiple hours)

Do you guys think that the NB/SB is the problem or do you have any suggestions?

Ps. i have no problems with my PC without those SSD's.

Thanks in advance :D

 
Do you guys think that the NB/SB is the problem or do you have any suggestions?
If it is the IOH, it would be the first actual reported case of trouble. As long as it doesn't go over 95C Tcontrol, data loss or corruption shouldn't take place. Some have run a considerable amount over that & haven't reported any failures or problems. As for the ICH, that could be a problem, but temp wise, they seem to run 'very' cool natively. If you want to add a fan for cooling the IOH, an Antec Spot Cool works well, & it can be mounted probably off of the top right mount point & goosenecked over towards the IOH.

The details of your system are a little sketchy, like OS, PSU, bios version, AHCI set prior to OS install, etc. I kept losing HDDs randomly on an Eclipse Plus & it turned out to be a faulty PSU +5V rail causing it. 
 
1st thing I would do is try getting a small fan to blow air directly onto the heatsink like Ron said. SpotCool has been a great solution for many people that have liquid cooling systems and reduced or almost no cooling airflow over that heatsink.
 
I'll give some more system specs:

i7 920D0 running @ 4ghz
MSI Eclipse SLI
OCZ platinum XTC 6 gb
2x samsung spinpoint 750gb raid-0
2x ocz vertex 30 gb raid-0
Coolermaster realpower 850 Watts
Sapphire 5870 Vapor-X
OS: Windows 7 64bit

It's true that the ICH doesnt get very hot but it's connected with heatpipes to the NB so alot of heat is transfered from the NB to the ICH..
Don't exactly know what is to hot for the soutbridge though :p So you guys say that the heat of the NB almost cant be the issue here?
How hot can the NB and SB rly get before errors occur?

I'll take a look for a nice small fan for the IOH :)

ps. indeed the antec spot cool seems a good option... probably have to put it in pull setup close to the IOH and not let it blow on it..
 
Still think it aint gonna help much the fan, cause as u might now alot of ppl had the same issue with the x58 boards.
I took the heatsink off 1 times and noticed their was like no thermal paste on the chips.. only some black rock solid compuond ;)
So i put some mx3 on it instead but no temp difference cause the heatsink is just not tight enough cause of the plastic screws..

Would a tiny fan blowing on it really decrease the temp by that much..?
 
"Would a tiny fan blowing on it really decrease the temp by that much..?"

Yes if your current airflow over it is very low. You should see if you can get some thin nylon washers at a local hobby shop or some such place and use them to increase the tension on the screws holding the heatsink.
 
How hot can the NB and SB rly get before errors occur?
As per Intel, they give 95C Tcontrol as the upper limit for sustained operation before 'possibly' data loss & corruption make take place. Some have run at 110C & not reported a problem, but I would never advise that. The Antec Spot Cool can easily lower the IOH temp by 10C or more. For nylon hardware, U.S. size is #4-40 nuts, bolts, & washers. You would be amazed how much that will reduce the temps. As for the SB, I never even messed with the TIM for it. Unless the contact is 'really' poor, the SB just doesn't seem to get very hot at all.

Added: I'm still thinking along the lines of some kind of power regulation is behind the problem.
 
I will first try it only with the fan to lower it down some, cause nylon washers means pulling out my mobo which is gonna take pretty long cause of the WC setup ;)

Henry do you think the northbirdge is the problem to my raid issue?
To be honest i think that one (or both) of my SSD('s) isnt functioning properly.. but im not an expert ;)

@ HU16E
True the SB normally won't get that hot but is connected through heatpipes with the NB so the NB is probably heating up the SB aswell.. Though i dont know the temp of the SB at the moment and it has no temp sensor on it. Power regulation hmm so that means de psu, kind of hard to test if it is the psu.

I'll give my psu a test with OCCT and see what's the result :)

 
Result of the OCCT PSU test

Image of the 3.3 volt line


Image of 5 volt line


image 12 volt line



Seems ok to me?

PS.
OMG i just put a 120 mm fan aiming for the NB and it decreased from 80 to 54 degrees.. this is insane
 
As you see, you had inadequate airflow over the heatsink to cool the fins, and the extra fan does wonders for the temperatures.
 
True it's amazing didnt had the idea that it would made 30 degrees difference... but would the temp also be the issue for the raid to fail constantly?
PSU seems ok not? NB is fine now so have to see...  if it crashes again it is the SSD's
 
Back
Top