Researchers develop an impressive style-based 3D-aware generator for high-res image synthesis

Researchers at the Max Planck Institute for Informatics and the University of Hong Kong have developed StyleNeRF, a 3D-aware generative model trained on unstructured 2D images that synthesizes high-resolution images with a high level of multi-view consistency.

Compared to existing approaches, which either struggle to synthesize high-resolution images with fine details or produce 3D-inconsistent artifacts, StyleNeRF integrates its neural radiance field (NeRF) into a style-based generator. By employing this approach, StyleNeRF delivers improved render efficiency and better consistency with 3D generation.

A comparison between StyleNeRF (column five) and four competing generative models, including HoloGAN, GRAF, pi-GAN and GIRAFFE. Each image is generated with four different viewpoints. As you can see, StyleNeRF performs exceptionally well here compared to the alternatives. Click to enlarge.

StyleNeRF uses volume rendering to produce a low-resolution feature map and progressively applies 2D upsampling to improve quality and produce high-resolution images with fine detail. As part of the full paper, the team outlines a better upsampler (section 3.2 and 3.3) and a new regularization loss (section 3.3).

In the real-time demo video below, you can see that StyleNeRF works very quickly and offers an array of impressive tools. For example, you can adjust the mixing ratio of a pair of images to generate a new mix and adjust the generated image’s pitch, yaw, and field of view.

Compared to alternative 3D generative models, StyleNeRF’s team believes that its model works best when generating images under direct camera control. While GIRAFFE synthesizes with better quality, it also presents 3D inconsistent artifacts, a problem that StyleNeRF promises to overcome. The research states, ‘Compared to the baselines, StyleNeRF achieves the best visual quality with high 3D consistency across views.’

Measuring the visual quality of image generation by using the Frechet Inception Distance (FID) and Kernel Inception Distance (KID), StyleNeRF performs well across three sets.

Table 1 – Quantitative comparisons at 256^2. The team calculated FID, KID x 10^3 and presented the average rendering time for a single batch. The 2D GAN (StyleGAN2) numbers are for reference. Lower FID and KID numbers are better. Click to enlarge.

Figure 7 from the research paper shows the results of style mixing and interpolation. The paper states, ‘As shown in the style mixing experiments, copying styles before 2D aggregation affects geometry aspects (shape of noses, glasses, etc.), while copying those after 2D aggregation brings changes in appearance (colors of skins, eyes, hairs, etc.), which indicates clear disentangled styles of geometry and appearance. In the style interpolation results, the smooth interpolation between two different styles without visual artifacts further demonstrates that the style space is semantically learned.’

Click to enlarge.

If you’d like to learn more about how StyleNeRF works and dig into the algorithms underpinning its impressive performance, be sure to check out the research paper. StyleNeRF is developed by Jiatao Gu, Lingjie Liu, Peng Wang and Christian Theobalt of the Max Planck Institute for Informatics and the University of Hong Kong.

All figures and tables credit: Jiatao Gu, Lingerie Liu, Peng Wang and Christian Theobalt / Max Planck Institute for Informatics and the University of Hong Kong

Author:
This article comes from DP Review and can be read on the original site.

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
PHPSESSID	session	This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__gads	1 year 24 days	The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_LGX92D8MKV	2 years	This cookie is installed by Google Analytics.
_gat_gtag_UA_213478817_1	1 minute	Set by Google to distinguish users.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.

Cookie	Duration	Description
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Researchers develop an impressive style-based 3D-aware generator for high-res image synthesis

BROKENMOUNT

ABOUT

PARTNERS

Researchers develop an impressive style-based 3D-aware generator for high-res image synthesis

Related Posts

Sony World Photography Awards 2025 reveals its Photographer of the Year

Insta360 is teasing a new camera coming on April 22nd

Blackmagic Design halts US factory plans due to concerns over tariffs

BROKENMOUNT

ABOUT

PARTNERS