Replies: 2 comments
-
it seems the weight are divided. |
Beta Was this translation helpful? Give feedback.
0 replies
-
I suggest you look at the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I try the tensor parallelism by modifying offline_inference.py script
I print the hidden states in model script. I find the worker and driver worker use same input and output same result. So it seems tensor parallelism just executes multiple copys in each device.
Anything is wrong? Wish get some answers.
Beta Was this translation helpful? Give feedback.
All reactions