About training/inference speed improvements of TiTok #86

Open
qyangcv opened this issue Feb 21, 2025 · 0 comments

Comments


qyangcv commented Feb 21, 2025

Thank you for the great work! I have some doubts about the efficiency of TiTok.

In the TiTok paper, the throughput (samples/s/GPU) of TiTok is generally higher than MaskGIT's. However, taking TiTok-L-32 (32 tokens) as an example, the quantized latent tokens (32) are concatenated with mask tokens (256) so that the decoder has enough tokens to generate an image. The input to the decoder should therefore be 32 + 256 = 288 tokens, which is more than MaskGIT's 256 tokens. Since the time complexity of a transformer is O(N^2), I wonder why TiTok is faster than MaskGIT.
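
To make the arithmetic behind my question concrete, here is a minimal back-of-the-envelope sketch (plain Python, not code from this repo; the token counts are just the ones stated above, and the cost model is only the O(N^2) self-attention term, ignoring constants and other layers):

```python
# Rough per-layer self-attention cost comparison, assuming cost ~ N^2.
# Token counts: TiTok-L-32 decoder input = 32 latent tokens + 256 mask
# tokens; MaskGIT operates directly on 256 patch tokens.

titok_decoder_tokens = 32 + 256   # = 288 tokens fed to the TiTok decoder
maskgit_tokens = 256              # MaskGIT's token sequence length

ratio = titok_decoder_tokens**2 / maskgit_tokens**2
print(f"TiTok decoder attention cost vs. MaskGIT: {ratio:.2f}x")  # ~1.27x
```

Under this naive model, a single TiTok decoder pass would actually cost about 1.27x a MaskGIT pass, which is what makes the reported higher throughput confusing to me.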

I'm new to the area of image generation, so I would be very grateful for your reply. If I have misunderstood anything about TiTok, please point it out.
