Forum: >>> Magnum BBS <<<

Re: graphics cards for serious (science) usage?

From Paul@21:1/5 to David Chmelik on Wed Feb 5 06:30:54 2025

XPost: alt.comp.hardware.pc-homebuilt

On Wed, 2/5/2025 4:01 AM, David Chmelik wrote:

Do any graphics processing unit (GPU) or display/video/graphics card companies admit/market GPUs or cards to be used for serious reasons:
science operating systems (OS, UNIX/GNU/Linux) CUDA & OpenCL (I know they
do for ones more expensive than Nvidia GeForce RTX 5090 OC, AMD Radeon RX 7900 XTX & forthcoming 9000s, but asking about these & low-to-mid-range)?

We have AMD, ASUS, GigaByte, Nvidia hardware. One or more companies won't answer about CUDA & OpenCL nor safe temperature ranges nor configuration software (either ASUS or GigaByte makes their own... the other doesn't nor knows defaults) saying 'graphics cards are for gaming' (or that GPU companies' software shows safe temperatures despite cards differ) which is ridiculous: 'hardcore gamers' probably only use small percentage of (powerful) graphics cards... first of all, they're for graphics (computational geometry/graphics computer programming/science to make
games, (photo)graphics/art/computer-aided-design, video, etc. software)
but they won't say this first. Now most (powerful) graphics cards (other than fair number of average/office/non-gaming or casual/rare gaming usage) may be in (UNIX/GNU/Linux) science (super)computers, whether modelling/ research (like/using BOINC.Berkeley.edu), artificial intelligence (AI) or small-to-medium science (AI) workstation P(S)C usage... and many still cryptocurrency mining. I'll call all this 'number-crunching'.

Seems some these companies outsourced customer support to third-world countries unaware scientific/non-desktop/-gaming usage exists. Some put 'gaming' in card names, despite maybe many/most are for users or number- crunching. I asked one about OpenCL and they referred me to '"our" forum'
on gitlab.freedesktop.org , which isn't their site (it's for graphical
user interfaces (GUI) such as the X Window System (X), etc. for science
OS) let alone a forum, and CUDA & OpenCL don't need desktop: typically
start on command-line/console/shell/terminal (by 'terminal' I mean not graphical user interface (GUI)) to compute in background (screen/tmux).
The customer support agent apparently thought OpenCL misspelled OpenGL.

Lately I asked AMD if GNU/Linux AMDGPU-Pro OpenCL still works on Radeon RX Vega... I tried unofficial close variants (Debian/Devuan, Ubuntu/Neon GNU/ Linuxes) which used to work but now some don't: too altered, so should I
try something listed in amdgpu-install (Debian, Mint, *Ubuntu) rather than variants, or was Vega dropped? I don't think AMDGPU-Pro depends on
systemd (oriented to desktops, not traditional servers) so since Devuan doesn't alter anything else, OpenCL should work, and KDE Neon claims they just add newer X/KDE desktop environment (DE) without altering underlying Kubuntu (an official Ubuntu) though I've found sometimes false (upgrades
to experimental/development/testing KDE graphics libraries too new for AMDGPU-Pro with older cards requiring dkms, despite OpenCL doesn't use
dkms, can completely break display many ways, which we need).

Is it best to ask AMD, ASUS, GigaByte, Nvidia, try other companies (Intel GPUs, who else for cards)? I still like ASUS for having most display-
ports, but lately prefer GigaByte for best price with longer warranty
(than ASUS Tuf).

On AMD, I would start with ROCm. ROCm is available immediately on 7900XTX, 7900XT, and the Radeon VII you mention (and that card is drifting out of support and soon to be deprecated). Apparently there are moves afoot,
to include some of the lesser RDNA cards, so the supported list could
grow in future.

https://en.wikipedia.org/wiki/ROCm

It's hard for me to say whether OpenCL is still a separate development or
not. I don't know if it is packaged separately.

One of the sites that was capable of writing good article was Anandtech.
Future has laid off the staff there. Their server remains running,
there is a user forum, but in terms of writing new technical articles,
there is no one there now to do that. Anand, the original owner of the site, went off to work for Apple, and Ryan Smith took over running the site after that. And he was laid off. They could have written an article explaining
the block diagram of the software product.

But other than that, the marketing efforts are awful, and don't
tell end users anything about why you would buy a product. There is
Quadro with certified drivers for CAD work. The full capabilities of
OpenGL are enabled on a Quadro, whereas if you do OpenGL CAD
work on a gamer card, it "slows down after around 50 objects".
I expect the driver has the ability to consider what the program
name is which just started, and control the level of acceleration
available to that program.

NVidia has TensorCore, but what is that and how is it implemented ?
Is it just a shader program ? Or a logic block ?

(This covers some of it, but is likely a bit game-oriented.)

https://www.techpowerup.com/299130/nvidia-ada-ad102-block-diagram-and-new-architectural-features-detailed

Even readouts like this, do not map to whatever software
stack exists today. The card (ten years old?) on the left,
has no Ray Tracing, and is about to out of support. The device
on the right, is the iGPU in an AMD 5600G processor.

[Picture]

https://i.postimg.cc/59v3Fb8B/Techpowerup-GPUZ-2-62.gif

bullwinkle@TIKTACK:~$ inxi -F
System:
Host: TIKTACK Kernel: 6.8.0-51-generic arch: x86_64 bits: 64
Desktop: Cinnamon v: 6.4.6 Distro: Linux Mint 22.1 Xia
Machine:
Type: Desktop Mobo: Micro-Star model: MPG B550 GAMING PLUS (MS-7C56) v: 1.0
UEFI: AMI date: 07/13/2024
CPU:
Info: 6-core model: AMD Ryzen 5 5600G with Radeon Graphics bits: 64
Graphics:
Device-1: AMD Cezanne [Radeon Vega Series / Radeon Mobile Series]
driver: amdgpu v: kernel
Display: x11 server: X.Org v: 21.1.11 with: Xwayland v: 23.2.6 driver: X:
loaded: amdgpu unloaded: fbdev,modesetting,vesa dri: radeonsi gpu: amdgpu
resolution: 1920x1080~75Hz
API: EGL v: 1.5 drivers: radeonsi,swrast platforms: x11,surfaceless,device
API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.0.9-0ubuntu0.3
renderer: AMD Radeon Graphics (radeonsi renoir LLVM 17.0.6 DRM 3.57
6.8.0-51-generic)

Driver Manager says it "needs no drivers", which means AMDGPU is
already running it after the OS was installed. It was the default driver. (Whereas on NVidia, you have Nouveau versus NVidia from Driver Manager.)

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Paul@21:1/5 to Paul on Thu Feb 6 00:13:45 2025

XPost: alt.comp.hardware.pc-homebuilt

On Wed, 2/5/2025 6:30 AM, Paul wrote:

On Wed, 2/5/2025 4:01 AM, David Chmelik wrote:

Do any graphics processing unit (GPU) or display/video/graphics card
companies admit/market GPUs or cards to be used for serious reasons:
science operating systems (OS, UNIX/GNU/Linux) CUDA & OpenCL (I know they
do for ones more expensive than Nvidia GeForce RTX 5090 OC, AMD Radeon RX
7900 XTX & forthcoming 9000s, but asking about these & low-to-mid-range)?

We have AMD, ASUS, GigaByte, Nvidia hardware. One or more companies won't >> answer about CUDA & OpenCL nor safe temperature ranges nor configuration
software (either ASUS or GigaByte makes their own... the other doesn't nor >> knows defaults) saying 'graphics cards are for gaming' (or that GPU
companies' software shows safe temperatures despite cards differ) which is >> ridiculous: 'hardcore gamers' probably only use small percentage of
(powerful) graphics cards... first of all, they're for graphics
(computational geometry/graphics computer programming/science to make
games, (photo)graphics/art/computer-aided-design, video, etc. software)
but they won't say this first. Now most (powerful) graphics cards (other
than fair number of average/office/non-gaming or casual/rare gaming usage) >> may be in (UNIX/GNU/Linux) science (super)computers, whether modelling/
research (like/using BOINC.Berkeley.edu), artificial intelligence (AI) or
small-to-medium science (AI) workstation P(S)C usage... and many still
cryptocurrency mining. I'll call all this 'number-crunching'.

Seems some these companies outsourced customer support to third-world
countries unaware scientific/non-desktop/-gaming usage exists. Some put
'gaming' in card names, despite maybe many/most are for users or number-
crunching. I asked one about OpenCL and they referred me to '"our" forum' >> on gitlab.freedesktop.org , which isn't their site (it's for graphical
user interfaces (GUI) such as the X Window System (X), etc. for science
OS) let alone a forum, and CUDA & OpenCL don't need desktop: typically
start on command-line/console/shell/terminal (by 'terminal' I mean not
graphical user interface (GUI)) to compute in background (screen/tmux).
The customer support agent apparently thought OpenCL misspelled OpenGL.

Lately I asked AMD if GNU/Linux AMDGPU-Pro OpenCL still works on Radeon RX >> Vega... I tried unofficial close variants (Debian/Devuan, Ubuntu/Neon GNU/ >> Linuxes) which used to work but now some don't: too altered, so should I
try something listed in amdgpu-install (Debian, Mint, *Ubuntu) rather than >> variants, or was Vega dropped? I don't think AMDGPU-Pro depends on
systemd (oriented to desktops, not traditional servers) so since Devuan
doesn't alter anything else, OpenCL should work, and KDE Neon claims they
just add newer X/KDE desktop environment (DE) without altering underlying
Kubuntu (an official Ubuntu) though I've found sometimes false (upgrades
to experimental/development/testing KDE graphics libraries too new for
AMDGPU-Pro with older cards requiring dkms, despite OpenCL doesn't use
dkms, can completely break display many ways, which we need).

Is it best to ask AMD, ASUS, GigaByte, Nvidia, try other companies (Intel
GPUs, who else for cards)? I still like ASUS for having most display-
ports, but lately prefer GigaByte for best price with longer warranty
(than ASUS Tuf).

As for things like FP64, ordinary gamer video cards have low ratios
(1 in 32 cores does FP64). The more expensive cards, the CAD ones,
enable more FP64 units.

The exception to the rule, was a certain NVidia Titan card, which
had a high percentage of FP64, yet it was a consumer card. Those were
prized items at one time, and their used price might have been
higher than their new price.

You can check the build options here, to see what "desirable"
configurations look like.

https://www.pugetsystems.com/workstations/threadripper/t140-xl/

One of the reasons you can't have four of the "top" accelerators,
is the power supply runs out of legs. And it is possible the cooling
becomes an issue (can't vent the hot air fast enough).

I have no idea, how AI launchers treat the accelerator cards,
whether they are clever with the card usage, or, only one card is
used and the other three ignored. In the past, there was an issue
with bus master transfers, where one PCIe card could bus master transfer
right into the VRAM on a second card. Whether anything needs to be
present in the CPU PCIe Hub for that, I don't know. On some old NVidia
designs, a separate switching chip added to the motherboard, seemed
to have the required PCIe properties for that style of operation.

And there exist, even larger power supplies, but they don't
fit in a conventional computer case.

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Lawrence D'Oliveiro@21:1/5 to Paul on Thu Feb 6 21:05:40 2025

XPost: alt.comp.hardware.pc-homebuilt

On Wed, 5 Feb 2025 06:30:54 -0500, Paul wrote:

It's hard for me to say whether OpenCL is still a separate development
or not.

OpenCL seemed like a nice idea for a vendor-independent standard, but
there was never enough competition in the GPU market for it to succeed.

I think it has now been revamped and renamed to “SYCL”, but I don’t think it is achieving much more success.

Market-dominant vendors (*cough* Nvidia *cough*) see no benefit in
allowing competition with their own proprietary products.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Paul@21:1/5 to Lawrence D'Oliveiro on Fri Feb 7 00:21:06 2025

XPost: alt.comp.hardware.pc-homebuilt

On Thu, 2/6/2025 4:05 PM, Lawrence D'Oliveiro wrote:

On Wed, 5 Feb 2025 06:30:54 -0500, Paul wrote:

It's hard for me to say whether OpenCL is still a separate development
or not.

OpenCL seemed like a nice idea for a vendor-independent standard, but
there was never enough competition in the GPU market for it to succeed.

I think it has now been revamped and renamed to “SYCL”, but I don’t think
it is achieving much more success.

Market-dominant vendors (*cough* Nvidia *cough*) see no benefit in
allowing competition with their own proprietary products.

I think OpenCL runs on all three platforms. Now
that Intel makes stuff powerful enough to benefit
from it.

Even NVidia supports OpenCL, so you should be able to
write an application, and have multiple hardware assist
options for it.

But we won't know about the good works of these companies,
unless someone writes technical articles about them.
We need to know what the arch is inside, and why
a certain product is worth buying.

https://www.techpowerup.com/gpu-specs/geforce-rtx-5090.c4216

Shading Units 21760
TMUs 680
ROPs 176
SM Count 170
Tensor Cores 680
RT Cores 170

L1 Cache 128 KB (per SM)
L2 Cache 96 MB

Pixel Rate 423.6 GPixel/s
Texture Rate 1,637 GTexel/s
FP16 (half) 104.8 TFLOPS (1:1)
FP32 (float) 104.8 TFLOPS
FP64 (double) 1.637 TFLOPS (1:64) <=== more enabled, on CAD version of card

As a customer, it's your job to figure out what those
mean, and how those could help you.

For CUDA programs, the dev has to "tune" the resource
settings for their program, so it "runs good" on "your board".
Naturally, some programs offered, are tested on only one board
and it won't be the board you have.

The architecture on the cards, isn't exactly the same, but at
least on this card, it has its first Ray Tracing cores. And as
far as I know, Ray Tracing cores are good for gaming, and not
for BOINC or AI. Tensor Cores might be good for AI.

https://www.techpowerup.com/gpu-specs/radeon-rx-7900-xtx.c3941

Shading Units 6144
TMUs 384
ROPs 192
Compute Units 96
RT Cores 96

L0 Cache 64 KB per WGP
L1 Cache 256 KB per Array
L2 Cache 6 MB
L3 Cache 96 MB

Pixel Rate 479.6 GPixel/s
Texture Rate 959.2 GTexel/s
FP16 (half) 122.8 TFLOPS (2:1)
FP32 (float) 61.39 TFLOPS
FP64 (double) 1.918 TFLOPS (1:32)

and of course we should list that card, because at the
moment, there are none left for sale.

Good luck,

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Paul@21:1/5 to Mr. Man-wai Chang on Fri Feb 7 08:42:23 2025

XPost: alt.comp.hardware.pc-homebuilt

On Fri, 2/7/2025 6:28 AM, Mr. Man-wai Chang wrote:

On 7/2/2025 5:05 am, Lawrence D'Oliveiro wrote:

On Wed, 5 Feb 2025 06:30:54 -0500, Paul wrote:

It's hard for me to say whether OpenCL is still a separate development
or not.

OpenCL seemed like a nice idea for a vendor-independent standard, but
there was never enough competition in the GPU market for it to succeed.

I think OpemCL was about competing against Micro$oft DirectX. But how many games need to be portable these days? :)

OpenCL C=Computing
OpenGL G=Graphics

Paul

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online
Recent Visitors
- Ginger1
  Mon Sep 15 19:33:54 2025
  from London via SSH
- Bob Worm
  Mon Sep 15 15:42:34 2025
  from Wales, Uk via Telnet
- Gretchiie
  Mon Sep 15 05:16:29 2025
  from Derry, Nh via Telnet
- Fred Blogs
  Mon Sep 15 00:03:12 2025
  from Uk via SSH
- Plume
  Sun Sep 14 09:34:52 2025
  from Uk via Raw
- Gretchiie
  Sun Sep 14 06:07:30 2025
  from Derry, Nh via Telnet
- Thlc
  Sat Sep 13 17:11:34 2025
  from Rognac, France via Telnet
- Thlc
  Sat Sep 13 17:04:03 2025
  from Rognac, France via Telnet

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	546
Nodes:	16 (2 / 14)
Uptime:	10:45:03
Calls:	10,389
Calls today:	4
Files:	14,061
Messages:	6,416,855
Posted today:	1

Re: graphics cards for serious (science) usage?

Who's Online

Recent Visitors

System Info