
setup_model.py and setup_optimizer.py moved to separate files and adde… #264

Open · wants to merge 3 commits into main

Conversation

malinjawi

Description:
This PR addresses issue #225 by refactoring model and optimizer setup functions into setup_model.py and setup_optimizer.py. Key changes include:

Moved setup functions to separate files for better organization.
Added type hints for clarity and improved type checking.

These changes improve code maintainability and readability. Please review and test!

@ktam3 ktam3 linked an issue Oct 10, 2024 that may be closed by this pull request
Member

Good idea moving out the setup logic. We probably don't need a separate file for each function, so I recommend just moving all of the setup functions to a single file. We already have setup_accelerator.py, so maybe we can move these all there & simply rename the file as setup_objects.py or something similar? (trying to avoid using setup.py)
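
As a rough sketch, the consolidated module could look like this; the setup_objects.py name follows the suggestion above, the setup_accelerator signature is assumed, and the bodies are elided:

# setup_objects.py -- hypothetical layout per the review suggestion

def setup_accelerator(args):
    ...  # moved from setup_accelerator.py (signature assumed)

def setup_optimizer(args, model):
    ...  # moved from setup_optimizer.py

def setup_model(args, tokenizer, train_loader, grad_accum):
    ...  # moved from setup_model.py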

from instructlab.training.config import DistributedBackend


def setup_optimizer(args: Any, model: torch.nn.Module) -> torch.optim.Optimizer:
Member

args here is actually not a typing.Any but rather an argparse.Namespace object:

Suggested change
def setup_optimizer(args: Any, model: torch.nn.Module) -> torch.optim.Optimizer:
def setup_optimizer(args: argparse.Namespace, model: torch.nn.Module) -> torch.optim.Optimizer:
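
With that comma in place, a self-contained version might look like the sketch below; the AdamW body and the args.learning_rate attribute are assumptions for illustration, not the PR's actual implementation:

import argparse

import torch


def setup_optimizer(args: argparse.Namespace, model: torch.nn.Module) -> torch.optim.Optimizer:
    # Illustrative body only: the optimizer class and the hyperparameter
    # name (args.learning_rate) are assumptions, not the PR's code.
    return torch.optim.AdamW(model.parameters(), lr=args.learning_rate)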



def setup_model(
args: Any, tokenizer: Any, train_loader: Any, grad_accum: int
Member

Here are the proper types for each arg:

  • args: argparse.Namespace
  • tokenizer: transformers.PreTrainedTokenizer
  • train_loader: torch.utils.data.DataLoader
Suggested change
args: Any, tokenizer: Any, train_loader: Any, grad_accum: int
args: argparse.Namespace, tokenizer: transformers.PreTrainedTokenizer, train_loader: torch.utils.data.DataLoader, grad_accum: int
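
Since that single-line signature is long, an equivalent wrapped form, together with the imports those hints rely on, might look like the following sketch (the return annotation is omitted because the diff context does not show it):

import argparse

import torch.utils.data  # exposes torch.utils.data.DataLoader
import transformers


def setup_model(
    args: argparse.Namespace,
    tokenizer: transformers.PreTrainedTokenizer,
    train_loader: torch.utils.data.DataLoader,
    grad_accum: int,
):
    ...  # body unchanged from the PR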

Member

@RobotSail RobotSail left a comment

Thanks for this PR Mohammad! I left a few comments, but this looks good so far.

@Maxusmusti
Contributor

Hi @malinjawi, could you also rebase on the latest main branch when applying review feedback? Thanks!

Development

Successfully merging this pull request may close these issues.

Model/Optimizer Setup functions need moving and typehints