[WIP] 2d parallelism chapter - tensor parallelism #39
base: main
Conversation
Discussing tensor parallelism here: https://discuss.pytorch.org/t/tp-fsdp-sync-module-states-cpu-offload/211386/2

    sharding_strategy=ShardingStrategy.NO_SHARD,
    device_mesh=mesh["dp"],
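For context on the two quoted lines, a rough sketch of how such a 2D mesh and FSDP wrapper might fit together (the mesh shape, dimension names, and the stand-in model are assumptions, not the chapter's actual code):

```python
import torch
import torch.nn as nn
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, ShardingStrategy

# Assumes a torchrun launch with 8 processes: 2 data-parallel replicas x 4 TP ranks.
mesh = init_device_mesh("cuda", (2, 4), mesh_dim_names=("dp", "tp"))

model = nn.Linear(1024, 1024).cuda()  # stand-in for the TP-parallelized model

# The quoted config: FSDP only replicates (no parameter sharding) across "dp",
# leaving the "tp" dimension to tensor parallelism.
model = FSDP(
    model,
    sharding_strategy=ShardingStrategy.NO_SHARD,
    device_mesh=mesh["dp"],
)
```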
It seems that if you pass sync_module_states=True together with HYBRID_SHARD, FSDP will broadcast from the right process group.
I'd test this out, but it also means we'd need to load the weights on ranks 0-7?
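Untested sketch of that suggestion, assuming 8 GPUs per node so that ranks 0-7 form the first shard group; the checkpoint path and loading branch are hypothetical:

```python
import torch
import torch.nn as nn
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, ShardingStrategy

dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = nn.Linear(1024, 1024)  # stand-in for the real model
if dist.get_rank() < 8:
    # Hypothetical: only the first shard group (ranks 0-7) materializes the real
    # weights; if the broadcast really comes from the right process group, the
    # other replicas receive them via the sync below.
    state = torch.load("checkpoint.pt", map_location="cpu")  # hypothetical path
    model.load_state_dict(state)

model = FSDP(
    model,
    sharding_strategy=ShardingStrategy.HYBRID_SHARD,  # shard within a node, replicate across nodes
    sync_module_states=True,  # broadcast params/buffers before sharding
    device_id=torch.cuda.current_device(),
)
```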
Adds details on tensor parallelism.
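As a rough illustration of the kind of tensor parallelism the chapter covers, a minimal sketch using PyTorch's TP API (the MLP layout and mesh size are assumptions):

```python
import torch
import torch.nn as nn
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor.parallel import (
    ColwiseParallel,
    RowwiseParallel,
    parallelize_module,
)

class MLP(nn.Module):
    def __init__(self, dim: int = 1024):
        super().__init__()
        self.up = nn.Linear(dim, 4 * dim)
        self.down = nn.Linear(4 * dim, dim)

    def forward(self, x):
        return self.down(torch.relu(self.up(x)))

# 1D tensor-parallel mesh over the local GPUs (assumes a torchrun launch).
tp_mesh = init_device_mesh("cuda", (torch.cuda.device_count(),))

model = MLP().cuda()
# Column-shard `up` and row-shard `down` so the pair needs only one all-reduce
# on the output instead of communicating between the two matmuls.
model = parallelize_module(
    model,
    tp_mesh,
    {"up": ColwiseParallel(), "down": RowwiseParallel()},
)
```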