Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add docs for installing TE from source #518

Open
dlwh opened this issue Mar 8, 2024 · 3 comments
Open

Add docs for installing TE from source #518

dlwh opened this issue Mar 8, 2024 · 3 comments

Comments

@dlwh
Copy link
Member

dlwh commented Mar 8, 2024

(We also need to not break when TE isn't installed...)

WIP

Run through this:

https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&Distribution=Ubuntu&target_version=22.04&target_type=deb_local

Then

wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt-get -y install cudnn9-cuda-12
export PATH=$CUDA_HOME/bin/:$PATH
NVTE_FRAMEWORK=jax pip install git+https://github.com/NVIDIA/TransformerEngine.git@stable
@Helw150
Copy link
Collaborator

Helw150 commented Mar 11, 2024

I think the current implementation might be using nightly rather than stable? After installing stable fused attention, I'm still getting errors of cannot import fused_attn from fused_attn.

It seems to be defined in the main branch, but not in the stable branch here https://github.com/NVIDIA/TransformerEngine/blob/stable/transformer_engine/jax/fused_attn.py

@dlwh
Copy link
Member Author

dlwh commented Mar 11, 2024

ah probably. we're mostly using the docker container (which we need docs for still as well cc @vadam5 )

In the meantime, something like this ought to work

docker run -v levanter:/levanter --gpus="$SLURM_STEP_GPUS" -e CUDA_VISIBLE_DEVICES -e WANDB_MODE=offline  --shm-size=16g -it ghcr.io/stanford-crfm/levanter/levanter_jax:radium_test

@sbhavani
Copy link

sbhavani commented Aug 8, 2024

We will be releasing a pip wheel for TE in the coming months that should resolve the installation issues

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants