Skip to content

Meaning of the num_stages variable. #512

Answered by ptillet
DanielHesslow asked this question in Q&A
Discussion options

You must be logged in to vote

Hello! The Triton compiler pipelines some loops using the new asynchronous copy instruction on A100. num_stages refers to the depth of this pipeline. Because data from the next iteration(s) is asynchronously prefetched into shared memory, the shared memory requirements of the kernel increases linearly

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@DanielHesslow
Comment options

@Shoreshen
Comment options

Answer selected by DanielHesslow
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants