* created the inverse GAN
* added an is_cuda check to train.py
* using --split=2 for inference (testing set)
* additional loss terms used by the original code author (adversarial BCE, plus an L2 feature-matching term on discriminator activations and an L1 reconstruction term against the real images):
      g_loss = criterion(outputs, real_labels) \
          + self.l2_coef * l2_loss(activation_fake, activation_real.detach()) \
          + self.l1_coef * l1_loss(fake_images, right_images)
* forward GAN (text -> image):
    * disc_loss = real_loss + fake_loss + wrong_loss (cls term: wrong image paired with the right embedding)
    * gen_loss = g_loss (the combined loss above)
* inverse GAN (image -> embedding):
    * disc_loss = real_loss + fake_loss + wrong_loss (cls term: wrong embedding paired with the right image)
    * gen_loss = criterion(outputs, real_labels)  # just the adversarial term, is that really it?
* cls: what should the cls loss be for disc_loss in the inverse GAN? (one guess, sketched below: the real image paired with a mismatched embedding, labeled fake)
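
A minimal sketch of how both sets of losses could be computed in PyTorch, assuming a discriminator `D(images, embeddings)` that returns a probability score and an intermediate activation map. `fake_images`, `wrong_images`, `fake_embed`, `wrong_embed`, and the coefficient values are placeholders, and the inverse-GAN cls term here is only the guess from the note above, not something confirmed by the original code. Optimizer steps are omitted.

```python
import torch
import torch.nn as nn

criterion = nn.BCELoss()    # adversarial loss on the discriminator score
l1_loss = nn.L1Loss()       # pixel-level reconstruction term
l2_loss = nn.MSELoss()      # feature-matching term on discriminator activations
l1_coef, l2_coef = 50, 100  # placeholder coefficients

def forward_gan_losses(D, right_images, wrong_images, fake_images, right_embed):
    """Text -> image direction; D(image, embed) is assumed to return (score, activations)."""
    batch = right_images.size(0)
    real_labels = torch.ones(batch)
    fake_labels = torch.zeros(batch)

    # discriminator: real pair, fake image, and the cls term (wrong image + right embedding)
    real_score, act_real = D(right_images, right_embed)
    fake_score, _ = D(fake_images.detach(), right_embed)
    wrong_score, _ = D(wrong_images, right_embed)
    disc_loss = (criterion(real_score, real_labels)
                 + criterion(fake_score, fake_labels)
                 + criterion(wrong_score, fake_labels))

    # generator: adversarial + feature-matching + L1 reconstruction (the g_loss above)
    fake_score, act_fake = D(fake_images, right_embed)
    gen_loss = (criterion(fake_score, real_labels)
                + l2_coef * l2_loss(act_fake, act_real.detach())
                + l1_coef * l1_loss(fake_images, right_images))
    return disc_loss, gen_loss

def inverse_gan_losses(D, right_images, right_embed, wrong_embed, fake_embed):
    """Image -> embedding direction; the cls term here is only a proposal."""
    batch = right_images.size(0)
    real_labels = torch.ones(batch)
    fake_labels = torch.zeros(batch)

    # discriminator: real pair, generated embedding, and a proposed cls term
    # (real image + mismatched embedding from another sample, labeled fake)
    real_score, _ = D(right_images, right_embed)
    fake_score, _ = D(right_images, fake_embed.detach())
    wrong_score, _ = D(right_images, wrong_embed)
    disc_loss = (criterion(real_score, real_labels)
                 + criterion(fake_score, fake_labels)
                 + criterion(wrong_score, fake_labels))

    # generator: plain adversarial loss only, as noted above
    fake_score, _ = D(right_images, fake_embed)
    gen_loss = criterion(fake_score, real_labels)
    return disc_loss, gen_loss
```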
- prepare the COCO dataset (dataloader function; see the dataloader sketch below)
- generate embeddings for the captions (skip-thought or gensim? a gensim word-vector-averaging placeholder is sketched below)
- GAN for image generation from the caption embeddings
- GAN for caption generation
- CycleGAN structure tying the two GANs together (cycle-consistency sketch below)
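
A minimal sketch of the COCO dataloader item from the list above, assuming torchvision's `CocoCaptions` dataset (which needs pycocotools) and hypothetical local paths; split selection (the --split=2 test split mentioned earlier) and wrong-image sampling are left out.

```python
import torch
from torch.utils.data import DataLoader
from torchvision import transforms
from torchvision.datasets import CocoCaptions

# Hypothetical paths; point these at wherever COCO and its caption annotations live.
IMAGE_ROOT = "data/coco/train2014"
ANN_FILE = "data/coco/annotations/captions_train2014.json"

transform = transforms.Compose([
    transforms.Resize(76),
    transforms.CenterCrop(64),   # 64x64 inputs for a DCGAN-style generator/discriminator
    transforms.ToTensor(),
])

def collate(batch):
    """Each COCO sample carries ~5 captions; keep one per image and embed it later."""
    images, captions = zip(*batch)
    return torch.stack(images), [caps[0] for caps in captions]

dataset = CocoCaptions(root=IMAGE_ROOT, annFile=ANN_FILE, transform=transform)
loader = DataLoader(dataset, batch_size=64, shuffle=True, collate_fn=collate)
```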
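For the embedding item, a gensim-based placeholder that averages word2vec vectors trained on the captions themselves; this is an assumption standing in for the skip-thought-vs-gensim decision, and a skip-thought encoder would be a drop-in replacement producing one fixed-size vector per caption.

```python
import numpy as np
from gensim.models import Word2Vec

def build_embedder(all_captions, dim=300):
    """all_captions is assumed to be a list of caption strings collected from the dataset above."""
    tokenized = [c.lower().split() for c in all_captions]
    w2v = Word2Vec(sentences=tokenized, vector_size=dim, window=5, min_count=1, workers=4)

    def embed_caption(caption):
        # Average the word vectors of the caption (zeros if nothing is in the vocabulary).
        vecs = [w2v.wv[w] for w in caption.lower().split() if w in w2v.wv]
        return np.mean(vecs, axis=0) if vecs else np.zeros(dim, dtype=np.float32)

    return embed_caption
```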
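Finally, a rough sketch of how cycle-consistency could tie the two generators together, in the spirit of CycleGAN; `G_txt2img`, `G_img2txt`, and `cycle_coef` are placeholders, and this term would be added on top of the adversarial losses sketched earlier.

```python
import torch.nn as nn

l1 = nn.L1Loss()

def cycle_losses(G_txt2img, G_img2txt, real_images, real_embed, cycle_coef=10.0):
    """Cycle-consistency terms for the two directions (sketch only).

    G_txt2img: caption embedding -> image   (forward GAN generator)
    G_img2txt: image -> caption embedding   (inverse GAN generator)
    """
    fake_images = G_txt2img(real_embed)
    fake_embed = G_img2txt(real_images)

    # embedding -> image -> embedding should recover the original embedding
    embed_cycle = l1(G_img2txt(fake_images), real_embed)
    # image -> embedding -> image should recover the original image
    image_cycle = l1(G_txt2img(fake_embed), real_images)

    return cycle_coef * (embed_cycle + image_cycle)
```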