Some questions about the infer code. #5

Closed
yiboz2001 opened this issue Apr 13, 2024 · 1 comment

Comments

@yiboz2001

Hello!
Thanks for your open-source work! I have some questions after trying the inference code.

  1. What does the "[tdv]" token mean? It is appended to the original caption, but it does not seem to be mentioned in the paper.
  2. Can the model generate results at an elevation of 0 degrees? The default value is 45 degrees, and I get errors when I change it to 0 directly.

Looking forward to your reply!
@yashkant
Owner

yashkant commented Apr 14, 2024

hi, thanks for checking out spad!

  1. we appended the "[tdv]" token to all captions during multiview training. this gives the model an additional signal through text that the new multiview data comes from a slightly different domain than the data used for the earlier single-view training (on laion).

  2. i believe you may be looking for 90-degree elevation, since 0-degree would place all the cameras at the same location (on the z-axis). here are the camera conversion equations that we follow, and here's the corresponding code. you could try setting the elevation to 90 degrees and see if that generates the right results.

hope this helps!
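
To illustrate point 2, here is a minimal sketch of why 0-degree elevation collapses the cameras. It assumes elevation is measured as the polar angle from the +z axis, which is what the reply implies; the linked equations and code are not reproduced in this thread, so the function names `camera_position` and `ring_of_cameras`, the `radius` parameter, and the evenly spaced azimuths are all hypothetical and may not match the repository.

```python
import math


def camera_position(elevation_deg, azimuth_deg, radius=1.0):
    """Convert spherical camera angles to a Cartesian position.

    Assumption (not confirmed by the repository): elevation is the polar
    angle from the +z axis, so 0 degrees is the pole and 90 degrees is
    the horizontal plane.
    """
    theta = math.radians(elevation_deg)
    phi = math.radians(azimuth_deg)
    x = radius * math.sin(theta) * math.cos(phi)
    y = radius * math.sin(theta) * math.sin(phi)
    z = radius * math.cos(theta)
    return (x, y, z)


def ring_of_cameras(elevation_deg, num_views=4, radius=1.0):
    """Place num_views cameras at evenly spaced azimuths (hypothetical layout)."""
    return [
        camera_position(elevation_deg, 360.0 * i / num_views, radius)
        for i in range(num_views)
    ]


if __name__ == "__main__":
    for elevation in (0.0, 45.0, 90.0):
        print(elevation, ring_of_cameras(elevation))
```

Under this convention, every camera at elevation 0 lands on the same point of the z-axis regardless of azimuth, so the views are no longer distinct, while elevation 90 spreads the cameras around the object in the horizontal plane.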
