
The weights of paddle.nn.Embedding() and torch.nn.Embedding() do not match! #68695

openvino-book opened this issue Oct 14, 2024 · 2 comments

@openvino-book
Describe the Bug

How can I initialize paddle.nn.Embedding() and torch.nn.Embedding() so that their weights are identical?

PyTorch code:

import torch
print("PyTorch version:", torch.__version__)
torch.manual_seed(1)

vocab_size = 6
output_dim = 3

embedding_layer = torch.nn.Embedding(vocab_size, output_dim)
print(embedding_layer.weight)
print(embedding_layer(torch.tensor([3])))

Paddle code:

import paddle
print("PaddlePaddle version:", paddle.__version__)
paddle.seed(1)

vocab_size = 6
output_dim = 3

embedding_layer = paddle.nn.Embedding(vocab_size, output_dim)
print(embedding_layer.weight)
print(embedding_layer(paddle.to_tensor([3])))

The two outputs are completely different.

Additional Supplementary Information

No response

@zoooo0820
Contributor

See #44565; the default initialization methods differ.

@ILoveAmy

ILoveAmy commented Oct 15, 2024

> See #44565; the default initialization methods differ.

Thanks! So paddle.nn.Embedding's default parameter initialization uses XavierUniform: weights are drawn from a uniform distribution over [-x, x], where x = sqrt(6.0 / (fan_in + fan_out)).

XavierUniform is a solid choice for parameter initialization: it helps training speed and convergence, and can improve the model's generalization.
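As a quick sanity check (assuming fan_in and fan_out are taken from the weight shape, i.e. vocab_size and output_dim in the snippets above), the bound works out to:

```python
import math

# For a weight of shape (vocab_size, output_dim) = (6, 3):
fan_in, fan_out = 6, 3
x = math.sqrt(6.0 / (fan_in + fan_out))
print(x)  # ≈ 0.8165, so weights are uniform over [-0.8165, 0.8165]
```

This also explains the difference in scale: torch.nn.Embedding draws its weights from a standard normal distribution N(0, 1) by default, so its values routinely exceed this bound.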
