How to Read This Database for BIOS Flashing

The specifications above are organized to serve the GPU hardware enthusiast's primary workflow: identifying cards, locating compatible firmware, and verifying flash tool compatibility. The Device ID column is the most critical for firmware operations — it is the hexadecimal identifier that flash tools match against the BIOS file being applied.

Before any flash operation, run nvflash --version (NVIDIA) or ATIFlash -i / VBFlash -i (AMD) to get the installed card's exact device ID and SubVendor ID. Match both against the BIOS repository entry, not just the product name. A "GeForce RTX 4090" BIOS from ASUS (SubVendor 0x1043) is not the same file as an "RTX 4090" BIOS from MSI (SubVendor 0x1462), even though both have device ID 0x2684.

Architecture Notes — 2026 Generation

NVIDIA Blackwell (RTX 50 Series)

Blackwell is NVIDIA's 4 nm (TSMC N4P) GPU architecture introduced in late 2024/early 2025. The flagship GB202 die achieves 21,760 CUDA cores — the largest NVIDIA consumer die ever manufactured. Key architectural changes from Ada Lovelace: GDDR7 memory across the lineup replacing GDDR6X (previous flagships); 5th-generation RT cores with improved incoherent ray traversal performance; FP4 tensor operations for accelerated AI inference at 4-bit precision; and enhanced Tensor Memory Accelerator (TMA) for improved memory access patterns in AI workloads.

From a firmware modding standpoint, Blackwell's most significant change is the enhanced firmware signing that restricts unsigned BIOS modifications. Standard BIOS replacement between same-model BIOSes (ASUS RTX 5090 stock BIOS to ASUS RTX 5090 updated stock BIOS) functions normally via nvflash 5.82x. Custom BIOS modifications involving power table edits or cross-flash operations face increased friction compared to Ada and earlier generations.

AMD RDNA 4 (RX 9000 Series)

RDNA 4 introduces AMD's 4th-generation ray accelerators (double the RDNA 3 capability per CU), new matrix acceleration units for AI workloads, and a redesigned shader processor that improves performance-per-CU versus RDNA 3. The Navi 48 die, used in RX 9070 XT and RX 9070, represents AMD's high-performance mainstream positioning — competitive with the RTX 5070 tier at launch pricing.

RDNA 4 uses GDDR6 rather than GDDR7. AMD's decision was driven by cost optimization for the targeted price band and by GDDR6's maturity at the 256-bit and 128-bit bus widths used in the Navi 48 and Navi 44 designs. The 576 GB/s bandwidth on the RX 9070 XT's 256-bit GDDR6-18 interface is directly comparable to the RX 6900 XT's bandwidth from three years prior — RDNA 4 delivers its performance gains primarily through improved compute efficiency and ray tracing hardware, not memory bandwidth.

Historical Context: From MVKTech's Era to Blackwell

When this site operated as a GPU modding portal in the early 2000s, the GPU landscape was fundamentally different. The cards of that era — NVIDIA GeForce FX series, ATI Radeon 9000/9700/9800 — had shader pipelines with separate vertex and pixel shader units, measured in the dozens of execution units rather than thousands. Overclocking was done via registry edits and BIOS clock table manipulation. The community that developed around tools like NiBiTor, ATIFlash, and the early GPU-Z grew directly from the modding discussions hosted on sites like mvktech.net, Beyond3D, and Rage3D.

From the GeForce FX 5800 to the RTX 5090 is a factor of roughly 800× in shader throughput, 160× in memory bandwidth, and 1,400× in dedicated ray tracing performance. The BIOS flashing toolchain has evolved from simple clock table editors to complex firmware replacement utilities managing multi-partition UEFI images, signed firmware sections, and architecture-specific power delivery tables. The core practice — modifying the GPU's firmware to access hardware capabilities beyond factory settings — is the same discipline that motivated the early community.

Frequently Asked Questions

What is a GPU device ID and why does it matter for BIOS flashing?
A GPU device ID is a 16-bit hexadecimal identifier that uniquely identifies the GPU die configuration. Flash tools validate the target BIOS file's embedded device ID against the installed card before writing — if they do not match, the flash is rejected. The device ID is displayed in GPU-Z, Windows Device Manager, and by the flash tools themselves.
What is the difference between RTX 50 and RTX 40 architectures?
RTX 50 (Blackwell) uses GDDR7 memory, 5th-gen RT cores, FP4 tensor operations, and enhanced firmware signing that restricts BIOS modification. RTX 40 (Ada) uses GDDR6X on flagships, 3rd-gen RT cores, and less restrictive signing, making BIOS editing more accessible. Performance gains from RTX 40 to RTX 50 are approximately 30–50% at equivalent product tiers.
What is GDDR7 and which GPUs use it?
GDDR7 is the 7th-generation Graphics DDR memory standard offering approximately 2× the bandwidth of GDDR6 at equivalent bus widths. NVIDIA RTX 50 series (Blackwell) uses GDDR7 across the lineup. AMD RX 9000 series uses GDDR6 — AMD targeted cost efficiency at the RX 9000 price points, where GDDR7 would have increased BOM cost disproportionately.
How do I check which GPU die revision my card uses?
GPU-Z shows the die name (AD102, GB203, Navi 48) in the 'GPU' field and silicon stepping in 'Revision.' For BIOS operations, the device ID and SubVendor ID shown in GPU-Z are more directly relevant than the die revision name.
What does TDP mean and is it the same as power consumption?
TDP is the maximum sustained power dissipation the cooling solution is rated for. Peak instantaneous power during boost transients can exceed TDP by 20–30% for short periods. The listed TDP directly corresponds to the power limit ceiling that BIOS modification can raise — nvflash and ATIFlash BIOS modifications typically target the TDP table entries in the firmware.
What is the memory bus width and why does it matter?
Memory bus width (in bits) determines how many data lines exist between the GPU's memory controllers and VRAM chips. Total bandwidth = bus width × effective memory speed. A 256-bit GDDR7 bus at 28 Gbps per pin delivers 896 GB/s. Bus width is fixed by the die design and cannot be changed via BIOS or software.
What is the difference between Navi 48 and Navi 44?
Navi 48 (RX 9070 XT/9070): 32 compute units, 256-bit bus, 16 GB GDDR6, higher TDP. Navi 44 (RX 9060 XT/9060): fewer compute units, 128-bit bus, lower TDP, mainstream targeting. Both share RDNA 4 shader architecture with 4th-gen ray accelerators and matrix acceleration units.
What is the SubVendor ID?
SubVendor ID identifies the AIB manufacturer: ASUS 0x1043, MSI 0x1462, Gigabyte 0x1458, EVGA 0x3842, Sapphire 0x1DA2, PowerColor 0x148C, XFX 0x1682. Flash tools use it as a secondary compatibility check. Cross-flashing between AIB variants requires overriding this check (nvflash -6, ATIFlash -i).
How many CUDA cores does the RTX 5090 have?
21,760 CUDA cores (170 SMs × 128 CUDA cores per SM). This is approximately 33% more than the RTX 4090's 16,384 CUDA cores, achieved through both a larger die (GB202 vs AD102) and incremental SM design improvements.
What is the RDNA 4 performance hierarchy?
RX 9070 XT (flagship consumer, 304 W) → RX 9070 (upper-mid, 220 W) → RX 9060 XT (mainstream, 150 W) → RX 9060 (entry mainstream, 115 W). The RX 9070 XT trades performance with the RTX 5070 and 5070 Ti at 1440p in GPU-limited workloads.
Can I use the database to find a compatible BIOS for my card?
Use the device ID column as the primary filter when searching the BIOS Repository. Confirm the device ID against GPU-Z on the installed card, then match SubVendor ID to the AIB partner. Both must match the BIOS file's embedded identifiers for a valid compatible flash.
What is unified shader architecture?
Introduced in the mid-2000s (NVIDIA Tesla, AMD TeraScale), unified shaders replaced dedicated vertex and pixel shader pipelines with general-purpose ALUs schedulable for any workload type. The same execution units now handle vertex shading, pixel shading, compute, ray traversal, and AI inference — the architectural basis for GPU-accelerated computing.
What is the difference between GDDR6 and GDDR6X?
GDDR6X uses PAM4 signaling (4 voltage levels per symbol vs GDDR6's 2) to double data rate per pin without increasing physical clock speed. Up to 21 Gbps/pin for GDDR6X vs 16–18 Gbps/pin for GDDR6. Trade-off: higher per-die power. NVIDIA used GDDR6X on RTX 40 flagships; AMD used standard GDDR6 on RX 7000 with a wider 384-bit bus for comparable bandwidth.
What is a compute unit (CU) in AMD GPU architecture?
An AMD Compute Unit is the fundamental shader execution block. Each RDNA 2–4 CU contains 64 stream processors (2 × SIMD32). Total stream processors = CU count × 64. NVIDIA's equivalent is the Streaming Multiprocessor (SM), containing 128 CUDA cores per SM in Ada and Blackwell.
Why does the RTX 4090 have a 450 W TDP?
AD102 at NVIDIA's 2520 MHz boost target requires approximately 450 W sustained. NVIDIA implemented the 16-pin PCIe Gen 5 connector (12VHPWR, 600 W rated) to support this power level. AIB variants commonly enable 516–525 W via their BIOS — this is within the connector's rating and the VRM designs used on high-end AIB RTX 4090 cards.
What is the RX 9070 XT device ID?
PCI Device ID 0x7446 under AMD vendor 0x1002. SubVendor ID varies: Sapphire 0x1DA2, PowerColor 0x148C, ASRock 0x1849, Gigabyte 0x1458. Always confirm via GPU-Z before selecting a BIOS — power limits, fan profiles, and PCB-specific parameters differ between AIB implementations even with the same device ID.
What is an AIB partner GPU vs reference design?
AIB (Add-in Board) partners (ASUS, MSI, Gigabyte, Sapphire, PowerColor, XFX, ASRock) design their own PCBs, power delivery, and cooling solutions around GPU dies from NVIDIA or AMD. Founders Edition (NVIDIA) and AMD reference cards are the silicon manufacturer's own board designs. AIB BIOSes differ in power limits, fan curves, factory OC tables, and the SubVendor ID embedded in firmware.