geof152002d6
New member
- Joined
- Jan 29, 2024
- Messages
- 8
Hi,
System:
MSI B450 Tomahawk Max II
BIOS 7C02vHC/2023-10-27
AMD Ryzen 7 3700x
PowerColor Red Devil Radeon 7900 XTX
80 GB ram (32 + 32 + 8 + 8), running at default speeds
NVME: 1TB, 4TB
Windows 11 Pro 22H2 all updates applied
AMD driver 24.1.1
A big Cooler Master CPU cooler
Joined the forum just to post this. For the past month since upgrading to latest bios and adding 2x32GB dimms system has been unstable and crashing randomly under load when using GPU. Sometimes after a few minutes, sometimes after a few hours. The crash was black screen with all fans and lights still on. Not just a graphics crash since machine unreachable over the network after this happened.
The only way to get the system to work again was to hold down power button for 10 seconds and then power back on. Sometimes after this happened it was also necessary to reinstall the AMD drivers to get the GPU working as well.
First I tested the memory with memtest86 and got some errors - must be the new ram right? Nope... did load bios defaults (F6) and retest with the pro edition for 10 cycles/70+ hours - zero errors. Test system in Ubuntu 23.10 bootable USB using s-tui - no errors.
Switch GPU to run on 3x PCI leads since crashes were occuring under GPU load. Not sure if this made any difference
Next up, test GPU since crashes always happened when GPU was running. Used furmark and unigine heaven both at same time no problems.
Finally go to test the CPU in Windows and get errors after a minute or so:
* prime 95: rounding error
* OCCT "CPU error"
Looking on the forums this prime 95 error is normally caused by low CPU voltage when overclocking - but I'm not overclocking I'm using BIOS defaults.
Went to BIOS to see if I could increase voltage, while looking I find option "Precision Boost Overdrive" set to "auto". It messes with CPU speed and power which is suspicious. Set disabled, rebooted and rerun prime95, OCCT, furmark and Heaven all at the same time for over an hour - zero errors.
System has now been stable under load for 24+ hours - fixed!
Looks like this BIOS setting is bugged? My next step would have been to start RMAing hardware. I'm not overclocking this system at all and have no interest in ever doing so. Since this setting was auto after loading defaults, perhaps it should be set disabled since its listed in the overclocking section? Or perhaps its some error with the setting on this board?
In all I spent about 5 days looking at this so hopefully helps someone or the BIOS can be fixed...
Cheers,
Geoff
System:
MSI B450 Tomahawk Max II
BIOS 7C02vHC/2023-10-27
AMD Ryzen 7 3700x
PowerColor Red Devil Radeon 7900 XTX
80 GB ram (32 + 32 + 8 + 8), running at default speeds
NVME: 1TB, 4TB
Windows 11 Pro 22H2 all updates applied
AMD driver 24.1.1
A big Cooler Master CPU cooler
Joined the forum just to post this. For the past month since upgrading to latest bios and adding 2x32GB dimms system has been unstable and crashing randomly under load when using GPU. Sometimes after a few minutes, sometimes after a few hours. The crash was black screen with all fans and lights still on. Not just a graphics crash since machine unreachable over the network after this happened.
The only way to get the system to work again was to hold down power button for 10 seconds and then power back on. Sometimes after this happened it was also necessary to reinstall the AMD drivers to get the GPU working as well.
First I tested the memory with memtest86 and got some errors - must be the new ram right? Nope... did load bios defaults (F6) and retest with the pro edition for 10 cycles/70+ hours - zero errors. Test system in Ubuntu 23.10 bootable USB using s-tui - no errors.
Switch GPU to run on 3x PCI leads since crashes were occuring under GPU load. Not sure if this made any difference
Next up, test GPU since crashes always happened when GPU was running. Used furmark and unigine heaven both at same time no problems.
Finally go to test the CPU in Windows and get errors after a minute or so:
* prime 95: rounding error
* OCCT "CPU error"
Looking on the forums this prime 95 error is normally caused by low CPU voltage when overclocking - but I'm not overclocking I'm using BIOS defaults.
Went to BIOS to see if I could increase voltage, while looking I find option "Precision Boost Overdrive" set to "auto". It messes with CPU speed and power which is suspicious. Set disabled, rebooted and rerun prime95, OCCT, furmark and Heaven all at the same time for over an hour - zero errors.
System has now been stable under load for 24+ hours - fixed!
Looks like this BIOS setting is bugged? My next step would have been to start RMAing hardware. I'm not overclocking this system at all and have no interest in ever doing so. Since this setting was auto after loading defaults, perhaps it should be set disabled since its listed in the overclocking section? Or perhaps its some error with the setting on this board?
In all I spent about 5 days looking at this so hopefully helps someone or the BIOS can be fixed...
Cheers,
Geoff