Hi @chinoll, thank you for asking. Conceptually, I recall that we tested PLD with GPT-2, so in theory it should also work with GPT-like models in Megatron-LM. However, that test was done long ago and was not based on the Megatron-LM code base, so we do not have any released code for it. The underlying parallelism strategies also matter when considering PLD. Which parallelism strategies are you using in Megatron-LM?
Can the PLD technique be used with Megatron-LM?