Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to reproduce the results #35

Open
fenghuo-cao opened this issue Feb 4, 2022 · 9 comments
Open

Unable to reproduce the results #35

fenghuo-cao opened this issue Feb 4, 2022 · 9 comments

Comments

@fenghuo-cao
Copy link

Hi, I see that someone has reproduced your experimental results, but I haven't been able to reproduce them. I would like to ask if the version of python, cuda, pytorch has a big impact on the results?
Because I use a 3090 graphics card, the version of cuda and pytorch cannot be consistent with the author.
cuda==11.0 torch==1.7 detectron2==0.3
Only the number of GPUs is modified in the code, and the others are not modified. The base model provided by the author is used in the reproduction process, and the experimental results of the reproduction are quite different from the author. And it fluctuates a lot.

@fenghuo-cao
Copy link
Author

I have tested G_FSOD's split_1 1_shot seed_0 many times. Your paper only mentions the experimental result of taking the average of ten experiments. Your experimental result is 57.03, while mine is between 50-55. What is the cause of this problem? of? Have you encountered similar problems during your experiments?

@RuoyuChen10
Copy link

Hi, I see that someone has reproduced your experimental results, but I haven't been able to reproduce them. I would like to ask if the version of python, cuda, pytorch has a big impact on the results? Because I use a 3090 graphics card, the version of cuda and pytorch cannot be consistent with the author. cuda==11.0 torch==1.7 detectron2==0.3 Only the number of GPUs is modified in the code, and the others are not modified. The base model provided by the author is used in the reproduction process, and the experimental results of the reproduction are quite different from the author. And it fluctuates a lot.

I haven't tested the code yet, but according to my experience, based on the framework of Detectron2 implementation, there is a big difference between the implementation results of 1 3090 graphics card and 2 2080Ti graphics cards. I guess it is because of Detectron2 optimized multi-card training.

2 3090 may work, you can try graphics 2 cards.

@fenghuo-cao
Copy link
Author

@RuoyuChen10 Ok, I see what you mean, but I don't have a machine with two 3090 graphics cards. I'll try running on 2 2080Ti graphics cards in a while. thank you.

@CrapbagMo
Copy link

Hi, I came up with the same issue. Have you reproduced the results yet in 2 cards setting?

@IceHowe
Copy link

IceHowe commented Jun 23, 2022

@RuoyuChen10 Ok, I see what you mean, but I don't have a machine with two 3090 graphics cards. I'll try running on 2 2080Ti graphics cards in a while. thank you.

Can you tell me how much time it takes to train a epoch on COCO using 2 2080TI?

@kangkang189
Copy link

I run your code under 2080 with batchsize=2 per gpu,i got the lowwer Ap ,how can i get the ap like your paper?

@STSTERANDMOMO
Copy link

Hi, I see that someone has reproduced your experimental results, but I haven't been able to reproduce them. I would like to ask if the version of python, cuda, pytorch has a big impact on the results? Because I use a 3090 graphics card, the version of cuda and pytorch cannot be consistent with the author. cuda==11.0 torch==1.7 detectron2==0.3 Only the number of GPUs is modified in the code, and the others are not modified. The base model provided by the author is used in the reproduction process, and the experimental results of the reproduction are quite different from the author. And it fluctuates a lot.

I met a problem。rm: cannot remove 'checkpoints/voc/defrcn/defrcn_fsod_r101_novel1/tfa-like/5shot_seed9/model_final.pth': No such file or directory。can you help me?

@ruee-z
Copy link

ruee-z commented May 26, 2023

Hi, I see that someone has reproduced your experimental results, but I haven't been able to reproduce them. I would like to ask if the version of python, cuda, pytorch has a big impact on the results? Because I use a 3090 graphics card, the version of cuda and pytorch cannot be consistent with the author. cuda==11.0 torch==1.7 detectron2==0.3 Only the number of GPUs is modified in the code, and the others are not modified. The base model provided by the author is used in the reproduction process, and the experimental results of the reproduction are quite different from the author. And it fluctuates a lot.

May I ask if you later reproduced the results of the paper and how did you do it? I also encountered the same problem now, I ran on two 2080 with bs=4

@xsh8757398
Copy link

I have the same problem. There is roughly a 3-7 difference between my reproduced results and those in Table 6 with AP, AP50, AP75, bAP, and nAP. Does the machine used have much to do with the results? I am using two 1080s with a batch size of 4.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants