188

WonderWorld: Interactive 3D Scene Generation from a Single Image

If you click on the image of "Link" (I know he is not really) in the "Interactive Viewing" section then you can see that in front of him (out of view) is a bunch of noise. I think it is interesting that it would predict randomness above just predicting nothing being there.

This is awesome tech.

10 months agostephen_cagle

Super impressive, and I can see it being useful in many cases already. Especially making interactive experiences in combination with position tracker of a user in a room. As you move around the room your perspective changes.

In a more creative approach I could imagine creating fake windows using flat-screen TVs in this approach as well. As you move around the room the perspectives would change as well, giving an illusion of the windows being real. Of course this would only work for a single person at a time but it would be quite interesting to experience. It should not be too difficult to hack it together as a solo dev.

10 months agoopdahl

Does anyone know if there are variants of this that output voxels? It feels like a more concrete representation of the space versus Gaussian splats.

10 months agoghayes
[deleted]
10 months ago

It isn’t released yet, but the thing akin to this that Roblox is working on (to be open-weights) most likely is voxel-based.

10 months agoloremaster

Source on Roblox working on something like this?

10 months agobogwog

"This internal AI project will power generative creation on our platform. Our 3D foundational model will be open source and multimodal, and it will power 3D generation through text, video, and 3D prompts. We see a powerful future where Roblox experiences will have extensive generative AI capabilities to power real-time creation integrated with gameplay. We’ll provide these capabilities in a resource-efficient way, so we can make them available to everyone on the platform."

https://corp.roblox.com/newsroom/2024/09/rdc-2024-robloxs-ne...

10 months agoxnx

This is AMAZING!

I hope this is released for public use at some point. I'd love to run it through some of my older photos to see what it does with them.

10 months agojayantbhawal

It feels like "3D" is a stretch given the approach they're using. Obviously the result is pretty cool, but I suspect anything built using this tech is going to have a very distinct feel (almost like sprite based video games).

10 months agorobertclaus

This is incredible. You could build entire games this way.

10 months agotetris11

Would they be any good though, that remains to be seen.

10 months agokeyle

Imagine Google street view data put to use in combination with this. You would essentially have an open world game of any city on earth.

10 months agoowenpalmer

Or, you could confuse the heck out of GeoGuessr players :)

10 months agosloucher

Yeah, I was thinkin' exactly the same thing. Even if not for games (although that would be nifty) just imagine the additional "depth" and "sense of presence" this would bring to Google's Street View. Street View was already pretty slick, but pile things like this on top of what they've already got there? Just ... Wow!

10 months agoblooalien

can wait for the code /api

10 months agoandroid521

Very cool!

10 months agoLarsDu88

This is the future I was promised. Take my money please.

10 months agofnordpiglet

How very supercalifragilisticexpialidocious!