Notice

Recent Posts

Recent Comments

Tags more

Archives

관리 메뉴

쉬엄쉬엄블로그

(딥러닝) Generative Models - 2 본문

부스트캠프 AI Tech 4기

쉬엄쉬엄블로그 2023. 6. 5. 11:39

728x90

이 색깔은 주석이라 무시하셔도 됩니다.

Given a training set of examples, we can cast the generative model
learning process as finding the best-approximating density model from the model family.
- 학습 데이터 샘플들이 주어졌을 때, 생성 모델 학습 과정을 모델 계열에서 가장 근사적인 밀도 모델을 찾는 것으로 볼 수 있음
Then, how can we evaluate the goodness of the approximation?
- 어떤 기준으로 근사가 잘 되었는지 정의하는 것이 중요

근사적으로 두 분포사이의 거리를 구하는 수식
We can simplify this with
As the first term does not depend on $P_\theta$, minimizing the KL-divergence is equivalent to maximizing the expected log-likelihood.
- 첫 번째 항은 $P_\theta$에 의존하지 않기 때문에 $logP_\theta(x)$를 최대화하는 것이 KL-divergence 최소화하는 것과 동일한 효과가 됨
Approximate the expected log-likelihood with the empirical log-likelihood
Maximum likelihood learning is then:
Variance of Monte Carlo estimate is high

For maximum likelihood learning, empirical risk minimization (ERM) if often used.
- 최대 우도 학습을 위해 경험적 위험 최소화를 종종 사용함
However, ERM often suffers from its overfitting.
- 그러나 ERM은 종종 과적합을 겪음
- Extreme case: The model remembers all training data
  - 극단적인 경우에 모델은 모든 학습 데이터를 기억함
To achieve better generalization, we typically restrict the hypothesis space of distributions that we search over.
- 더 나은 일반화를 위해 일반적으로 검색하는 분포의 가설 공간을 제한함
However, it could deteriorate the performance of the generative model.
- 하지만 생성 모델의 성능이 저하될 수 있음
Usually, MLL is prone to under-fitting as we often use simple parametric distributions such as spherical Gaussians.
- 일반적으로, MLL(Maximum Likelihood Learning)은 구 모양의 가우시안 분포와 같은 단순한 모수 분포를 자주 사용하기 때문에 적합하지 않은 경향이 있음
What about other ways of measuring the similarity?
- 유사성을 측정하는 다른 방법은?
- KL-divergence leads to maximum likelihood learning or Variational Autoencoder (VAE).
- Jensen-Shannon divergence leads to Generative Adversarial Network (GAN).
- Wasserstein distance leads to Wasserstein Autoencoder (WAE) or Adversarial Autoencoder (AAE).

D. Kingma, “Variational Inference and Deep Learning: A New Synthesis,” Ph.D. Thesis

Diffusion models progressively generate images from noise.
- Diffusion model은 노이즈로부터 이미지를 점진적으로 생성함

자세한 설명은 생략
Forward (diffusion) process progressively injects noise to an image.
- Forward (diffusion) process는 이미지에 노이즈를 점진적으로 주입시킨다.
The reverse process is learned in such a way to denoise the perturbed image back to a clean image.
- reverse process는 교란된 이미지를 깨끗한 이미지로 다시 노이즈를 제거하는 방식으로 학습
참고 링크
Diffusion Model은 이미지의 주변 scene에 dependent하고 유사하게 들어갈 수 있는 이미지 편집이 가능함
- GAN 같은 모델은 이미지의 중간만을 편집하는 것이 불가능한 것은 아니지만 Diffusion Model처럼 쉽게 되지는 않음

출처: 부스트캠프 AI Tech 4기(NAVER Connect Foundation)

(Data Viz) Python과 Matplotlib (0)	2023.06.07
(Data Viz) 시각화의 요소 상태 (2)	2023.06.06
(딥러닝) Generative Models - 1 (0)	2023.06.03
(딥러닝) Transformer (0)	2023.06.02
(딥러닝) Recurrent Neural Networks (0)	2023.06.01