You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello!
Thanks for your open-source work~ I have some questions after trying the infer code.
What does the "[tdv]" token mean, which is added to the original caption? It seems not to be mentioned in the paper.
Can it generate results of zero elev degree? The default value is 45 degrees, and I meet code errors when I change the value to 0 directly.
Looking for your reply!
The text was updated successfully, but these errors were encountered:
we appended the "[tdv]" token to all captions during multiview training. this provides an additional signal through text that the domain of the new data is slightly different from single view training that was performed previously (on laion).
i believe you maybe looking for 90-degree elevation, since 0-degree would place all the cameras at the same location (on z-axis). here's the camera conversion equations that we follow, and here's the corresponding code. i believe you could try setting the elevation to 90-degrees, and see if that generates the right results.
Hello!
Thanks for your open-source work~ I have some questions after trying the infer code.
Looking for your reply!
The text was updated successfully, but these errors were encountered: