But bigger caches only alleviate the memory bandwidth issue. AMD proved that larger caches were a viable tradeoff with the RDNA 2 architecture and Infinity Cache, and Nvidia is doing something similar with Ada. The bigger caches certainly pay off in effective bandwidth. The L2 cache meanwhile received a big upgrade, up to 16MB per 64 bits of interface width. For the RTX 40-series, rather than sticking with similar widths, Nvidia opted for 384-bit on AD102, 256-bit on AD103, 192-bit on AD104, and 128-bit on AD106 and AD107. There were plenty of other factors to consider, but the ones we're talking about here are simply this: How wide should the memory interface be, and how much L2 cache should there be?įor the RTX 30-series, Nvidia had up to a 384-bit width on GA102 (RTX 3090 Ti down to RTX 3080), up to 256-bit on GA104 (RTX 3070 Ti down to RTX 3060 Ti), up to 192-bit on GA106 (RTX 30), and up to 128-bit on GA107 (mobile RTX 3050 / 3050 Ti and later a desktop 3050) - all with a 1MB L2 cache per 64-bits of interface width. That's interesting, as the 3060 Ti and 3070 two years ago were both targeting 1440p.įundamentally, Nvidia had a design decision to make several years back when the Ada Lovelace architecture and chips were in the planning phase. Nvidia isn't marketing the RTX 4060 Ti as a 1440p or 4K gaming solution, probably precisely due to its lack of VRAM capacity and bandwidth. We talked recently about why 4K gaming requires so much more VRAM, and that applies here. 8GB of VRAM shouldn't be a problem for most games running at 1080p, but 1440p and especially 4K could prove problematic. Memory capacity and bandwidth are going to be major factors in performance. On the other hand, rasterization performance will be a lot closer and a more interesting comparison point. It's a safe bet that Nvidia can match or exceed those cards when it comes to ray tracing performance and AI workloads - winning the latter by default since it's often the only GPU option supported. Looking at the competition based on relatively similar pricing, we have AMD's RX 6750 XT with 12GB and Intel's Arc A770 16GB. The 4060 Ti also supports FP8 mode on its tensor cores, so if/when AI applications add support for that, it can deliver a potential 353 teraflops. As usual, real-world clocks will exceed those values, but in terms of theoretical compute from the shaders and tensor cores, the 4060 Ti delivers 22.1 teraflops versus 16.2 teraflops for FP32, and 177 teraflops versus 130 teraflops for FP16 (with sparsity). The result is fewer GPU cores than the RTX 3060 Ti, but Nvidia makes up for that with significantly higher core clocks - 2535 MHz boost versus 1665 MHz. You can also see the block diagram for the AD106 chip and the 4060 Ti below, which we'll get to in a moment. The RTX 4060 Ti uses Nvidia's new AD106 GPU - the same chip found in the RTX 4070 Laptop GPU, incidentally. That's a crowded table, but the first column is the most pertinent. Nvidia RTX 4060 Ti and Other GPU Specifications Graphics Card Let's dive into the spec sheet to see what the Nvidia RTX 4060 Ti offers. The bad news is that it barely surpasses its predecessor overall, and design decisions made years ago are certainly at play. It also supports new Ada features like DLSS 3 Frame Generation, SER, DMM, and OMM. So the good news is that the RTX 4060 Ti is generally faster than the previous generation RTX 3060 Ti at the same price while using less power. Looking at native performance in our GPU benchmarks hierarchy (which will be updated later today), the RTX 4060 Ti comes in just ahead of the RTX 3070 at 1080p, but falls behind the RTX 3060 Ti at 1440p and 4K. Is the RTX 4060 Ti one of the best graphics cards? That largely depends on how many games you play support DLSS 3 and whether you're willing to trade latency for AI-interpolated FPS. Nvidia has a potential solution for the capacity problem with a 16GB model planned for release in July, but it won't address any concerns with the memory interface. Unfortunately, it also brings a lot of potential compromises into play, chief among them being the 128-bit memory interface and 8GB of VRAM. The Nvidia GeForce RTX 4060 Ti brings true mainstream pricing to the Ada Lovelace architecture and RTX 40-series GPUs, starting at $399 for the Founders Edition and reference-clocked models.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |