Reverse engineering reference for NVIDIA's CUDA compiler toolchain
| Component | Binary | Documentation | Status |
|---|---|---|---|
| cicc | CUDA C→PTX compiler, 60 MB, LLVM 20.0.0 + EDG 6.6 | wiki | Live |
| cudafe++ | CUDA C++ frontend, 8.5 MB, EDG 6.6, 6,483 functions, 69 pages | wiki | Live |
| ptxas | PTX→SASS assembler, 37.7 MB, proprietary (no LLVM), 159-phase pipeline | wiki | Live |
| nvcc | CUDA compilation driver | — | Planned |
| nvlink | CUDA device linker, 37 MB (95% embedded ptxas), 40,532 functions | wiki | Live |
| nvptxcompiler | PTX JIT compilation library | — | Planned |
| Tool | Description | Documentation | Status |
|---|---|---|---|
| fatbin | Fat binary manipulation toolkit — dump, unpack, extract PTX, repack with ZSTD (1–22) | readme | Released |
All analysis is from static reverse engineering of stripped x86-64 ELF binaries using IDA Pro 9.x. No source code or any other restricted or copyrighted material was used — all findings derive solely from analysis of compiled binaries distributed as part of the publicly available CUDA Toolkit.