Scale Events
+00:00 GMT
Sign in or Join the community to continue

Reverse Engineering of Generative Models

Posted Jan 31, 2023 | Views 1.7K
Share
speaker
avatar
Vishal Asnani
Ph.D. In Computer Science and Engineering @ Michigan State University

Vishal Asnani is pursuing his Ph. D. degree in the Computer Science and Engineering department from Michigan State University since 2021. He received his Bachelor’s degree in Electrical and Instrumentation Engineering from Birla Institute of technology and Science, Pilani, India in 2019. His research interests include computer vision and machine learning with a focus on the studying of generative models and deepfake detection.

Asnani and research peers' work in Deepfake Detection has recently garnered media attention, including that of the Wall Street Journal, CNBC, CNET, Engadget, Fortune, The Mac Observer, MSU Today, New Scientist, SiliconAngle, VentureBeat, and The Verge.

Publications:

  1. Reverse Engineering of Generative Models: Inferring Model Hyperparameters from Generated Images

  2. Proactive Image Manipulation Detection

https://github.com/vishal3477

+ Read More
SUMMARY

State-of-the-art (SOTA) Generative Models (GMs) can synthesize photo-realistic images that are hard for humans to distinguish from genuine photos. Identifying and understanding manipulated media are crucial to mitigate the social concerns on the potential misuse of GMs. We propose to perform reverse engineering of GMs to infer model hyperparameters from the images generated by these models. We define a novel problem, "model parsing", as estimating GM network architectures and training loss functions by examining their generated images - a task seemingly impossible for human beings. To tackle this problem, we propose a framework with two components: a Fingerprint Estimation Network (FEN), which estimates a GM fingerprint from a generated image by training with four constraints to encourage the fingerprint to have desired properties, and a Parsing Network (PN), which predicts network architecture and loss functions from the estimated fingerprints. To evaluate our approach, we collect a fake image dataset with 100K images generated by 116 different GMs. Extensive experiments show encouraging results in parsing the hyperparameters of the unseen models. Finally, our fingerprint estimation can be leveraged for deepfake detection and image attribution, as we show by reporting SOTA results on both the deepfake detection (Celeb-DF) and image attribution benchmarks.

+ Read More

Watch More

0:47
OpenAI's Greg Brockman: The Future of Large Language (LLMs) and Generative Models
Posted Oct 20, 2022 | Views 5.2K
# TransformX 2022
# Fireside Chat
# Natural Language Processing (NLP)
# Foundation Models
# Large Language Models (LLMs)
# Generative Models