Level1Techs - Intel has a Pretty Big Problem

bazsy@lemmy.world · 2 months ago

What troubleshooting steps did you take so far? I would try these:

different OS, maybe a live usb running fedora or ubuntu if it is possible to emulate the workload where this appears
bios reset to defaults, no OC not even XMP
memtest, either the memtest86+ boot iso or the runtime memtester can detect obvious errors
long smart self test on OS drive and an fsck or scrub based on FS

Also the logs show a very old nvidia gpu which is not supported by the new driver. I don’t know if this can cause crashes, haven’t used one in ages, maybe someone else has more insight.

bazsy@lemmy.world · 3 months ago

That monitor will hold it back. 1080p wouldn’t be bad if modern games run without TAA blur, but most games require it. Even a cheap 144hz IPS 1440p will give you a better experience.

bazsy@lemmy.world · 11 months ago

That ATX board would be great. The mATX B650M PG is also better than the previous one, it is good enough. If you can find the B650M-HDV/M.2 in stock that is even better if you don’t need 3 m.2 slots.

That monitor was indeed a lucky deal. It looks to be a good combination for this setup.

bazsy@lemmy.world · 11 months ago

I don’t think it’s the scheduler this time with a single CCD, but there is significant difference. These tests focus on compute and productivity with almost no games, so most of the difference could come from this bias. Another possible option is the power profile (EPP balance_performance) holding back the 7700x on linux.

bazsy@lemmy.world · 11 months ago

The draft is pretty good. Only a few points to consider changing:

That is an entry level Motherboard which may limit your upgrades in the future. It overheats with a 16 core ryzen 9.
The ram size is good, but the speed and latencies are just as important nowadays. A 6000 MT/s CL30 Expo ram could improve CPU performance, but it’s a kind of OC so not every combination is fully stable at the highest speeds.
Especially with competitive and indie games it’s easy to run them at high FPS. I would consider getting a 1440p high refresh rate (144+ Hz) monitor if you don’t have one already. It’s a huge upgrade coming from 1080p60Hz.

bazsy@lemmy.world · 1 year ago

That’s more than enough. You can’t do any more.

bazsy@lemmy.world · edit-2 1 year ago

As an Android flavour it should be safe after uninstalling all apps associated with the university. Did any of them need a “device owner” permission? That’s the only way to be more persistent on Android without root access.

bazsy@lemmy.world · 1 year ago

Level1Techs - Intel has a Pretty Big Problem

bazsy@lemmy.world · 1 year ago

The mobile and TV clients are often limited to the codecs with hardware acceleration. Or just selecting a lower bitrate on the client will cause transcoding.

bazsy@lemmy.world · 1 year ago

I think calling it a “cache” is not precise. The primary function of the DRAM is to hold the dictionary for translating logical addresses (e.g. sectors) from the OS to the physical addresses (which NAND chip, which bank etc.). This indirection is needed for the controller to do wear leveling without corrupting the filesystem.

On a SATA SSD without DRAM each read IO could mean 2 actual reads: first the dictionary to find the data and than the actual data being read. As you said HBM helps by eliminating this extra read.

The read and write caching is just a use of the remaining DRAM capacity. Since modern Operating Systems use the general RAM for the same function it is usually just a small increase to the throughput.

bazsy@lemmy.world · edit-2 1 year ago

An Interview with Pat Gelsinger - More Than Moore

bazsy@lemmy.world · 1 year ago

Temporal anti-aliasing: a blessing or a curse?

bazsy@lemmy.world · 1 year ago

NVIDIA GH200 CPU Performance Benchmarks Against AMD EPYC Zen 4 & Intel Xeon Emerald Rapids Review

bazsy@lemmy.world · edit-2 1 year ago

There is an even more relevant video of using external storage trough USB. He recommends using software raid:

Can We Build a Home Server Out of Mini PCs?

bazsy@lemmy.world · 1 year ago

Are both drives fully encrypted with LUKS? Is trim enabled in both crypttab and fstab?

bazsy@lemmy.world · edit-2 1 year ago

Thanks for the links! I updated my config from z3fold to zsmalloc and adjusted the vm.page-cluster to test these out.

Reading a bit more, I think when using large max_pool_percent (>30) with Zswap the two solutions are more similar than not. A crucial difference is what use-case is more acceptable since Zswap can cause unresponsiveness (and potential lockup) under high memory pressure. While Zram could result in an OOM crash in a similar worst-case scenario.

bazsy@lemmy.world · 1 year ago

Btrfs with compression enabled and subvolumes set.

And enable/automate maintenance services for BTRFS. For example: balace should be run on heavily used system disks or scrub could help detect errors even on single disks.

ZRAM (With proper sysctl.conf like PopOS does).

Could you explain the preference of ZRAM over ZSWAP? I thought the latter was the more advanced and better performing solution. Is there some magic in Pop’s config?

bazsy@lemmy.world · edit-2 1 year ago

Happy to help! Tough you are right, this is a rather generic error that doesn’t help much just confirms that the GPU is the issue.

At this point it could be a driver issue since there are similar open bug reports. A hardware problem is still possible since you previously said that it’s unstable on windows too, and power related issues can also lead to this error message.

bazsy@lemmy.world · 1 year ago

Most distros use systemd and its logging solution: journald. You can use journalctl to read the logs around the time of the crash for e.g.:

journalctl -S -5m this shows the last 5 minutes. Use this when a game crashes but the system continues working and did not reboot.
journalctl -b -1 -S -10m this shows the last 10 minutes from the previous boot. Use this if the crash froze the whole system and rebooted.

Look for red lines (errors) and what wrote them. AMD GPU faults usually have the ‘amdgpu’ mentioned, memory errors could appear as ‘protection fault’.

bazsy@lemmy.world · 1 year ago

Did you check the system logs to see what caused it?

Many things can result in seemingliy random crashes. Any overclock (including XMP and Expo) or undervolt or even a bios version can be problematic.

I would check first if it’s stable on windows.

bazsy@lemmy.world · 1 year ago

What filesystem are you using? Is it encrypted?

Could you run a benchmark to verify if reads and writes are both affected? KDiskMark is like crystaldiskmark or Gnome Disks has a built in benchmark.

bazsy@lemmy.world · 2 years ago

Many are already mentioned but there is a lack of SoL and KyoAni shows so these are what I missed:

Shirobako
Hibike! Euphonium
Hyouka
Hanasaku Iroha

bazsy@lemmy.world · 2 years ago

Filesystem permissions

For many apps it is not an issue and provides additional security but in other cases it’s very annoying and not trivial to fix.

Example1: opening a .docx from Thunderbird flatpak with OnlyOffice flatpak does not work out of the box.

Example2: mpv and VLC flatpaks work well for local files, but fail to open network shares from Dolphin.

I think a possible solution would be runtime permission dialogs when denied access.

bazsy@lemmy.world · 2 years ago

I’m not 100% sure, but for me it caused a similar “freezing” or unresponsive experience when the daily cleanups run in the morning. If there was a freeze after every (even short) sleep and resume that might be a different issue.

bazsy@lemmy.world · 2 years ago

AMD EPYC Bergamo is a Fantastically Fresh Take on Cloud Native Compute

bazsy@lemmy.world · 2 years ago

[Gamers Nexus] AMD's Labs - Secrets of a $182 Billion Chip Maker - Full Documentary

bazsy@lemmy.world · 2 years ago

Rebuilding Intel – Foundry vs IDM Decades of Inefficiencies Unraveled

bazsy@lemmy.world · 2 years ago

Level1Techs - Intel has a Pretty Big Problem

Level1Techs - Intel has a Pretty Big Problem

An Interview with Pat Gelsinger - More Than Moore

An Interview with Pat Gelsinger - More Than Moore

Temporal anti-aliasing: a blessing or a curse?

Temporal anti-aliasing: a blessing or a curse?

NVIDIA GH200 CPU Performance Benchmarks Against AMD EPYC Zen 4 & Intel Xeon Emerald Rapids Review

NVIDIA GH200 CPU Performance Benchmarks Against AMD EPYC Zen 4 & Intel Xeon Emerald Rapids Review

AMD EPYC Bergamo is a Fantastically Fresh Take on Cloud Native Compute

AMD EPYC Bergamo is a Fantastically Fresh Take on Cloud Native Compute

[Gamers Nexus] AMD's Labs - Secrets of a $182 Billion Chip Maker - Full Documentary

[Gamers Nexus] AMD's Labs - Secrets of a $182 Billion Chip Maker - Full Documentary

Rebuilding Intel – Foundry vs IDM Decades of Inefficiencies Unraveled

Rebuilding Intel – Foundry vs IDM Decades of Inefficiencies Unraveled

AMD Zen 4c Not an E-core

AMD Zen 4c Not an E-core