[visionlist] SAVE THE DATE, EURASIP JIVP Webinar: Revealing and Leveraging the Visual Information in Diffusion Models (Dr. Deepti Ghadiyaram) (4 June 2025)

Giuseppe Valenzise giuseppe.valenzise at l2s.centralesupelec.fr
Wed May 28 18:15:51 -05 2025


(Apologies if you receive multiple copies of this message)

Our next 1-hour webinar will take place Wednesday, June 4, 2025 at 3:00 PM CEST (Europe/Paris) with Dr. Deepti Ghadiyaram  (Boston University, US).

RSVP here to join: https://cassyni.com/events/KrZokXADhoLa388YBkWx4V?cb=0.blj3  

Title: Revealing and Leveraging the Visual Information in Diffusion Models

Abstract: Generating high-quality photo-realistic and creative visual content using diffusion models is a thriving area of research. In this talk, I will focus not on the generation process, but on understanding and leveraging the rich visual semantic information represented within diffusion models. Specifically, I will present our work that uses mechanistic interpretability tools such as k-sparse autoencoders (k-SAE) to probe various layers and denoising timesteps of different diffusion architectures. Next, I will present how to uncover monosemantic interpretable concepts pertaining to safety and photographic styles and steer the generation process thereby offering more controllability to users.

Bio: Deepti is an Assistant Professor at Boston University in the Department of Computer Science and also a Member of Technical Staff at Runway. Her research interests are on topics pertaining to safe and interpretable computer vision, improving realism in generative video models, and human actions. Prior to joining Boston University, she was a Senior Research Scientist at Fundamental AI Research (FAIR) in Meta AI. She has served as a program chair for NeurIPS 2022 Dataset and Benchmarks track, hosted several tutorials and organized workshops and an area chair for several years at CVPR, ICCV, ECCV, ACCV, AAAI, and NeurIPS.

We look forward to your attendance.

—
__________________________________________
Dr. Giuseppe Valenzise
CNRS Researcher
Laboratoire des Signaux et Systèmes (L2S, UMR 8506)
CNRS - CentraleSupelec - Université Paris-Saclay
3, rue Joliot Curie
91192 Gif-sur-Yvette Cedex, France
https://l2s.centralesupelec.fr/u/valenzise-giuseppe/

General Chair - ICME 2025

Chair of the IEEE SPS Multimedia Signal Processing Technical Committee (MMSP TC)

Editor in Chief EURASIP Journal on Image and Video Processing





~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
“Immersive Video Technologies”
https://www.elsevier.com/books/immersive-video-technologies/valenzise/978-0-323-91755-1


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://visionscience.com/pipermail/visionlist_visionscience.com/attachments/20250529/88fb98a6/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature_banner.png
Type: image/png
Size: 160520 bytes
Desc: not available
URL: <http://visionscience.com/pipermail/visionlist_visionscience.com/attachments/20250529/88fb98a6/attachment-0001.png>


More information about the visionlist mailing list