A Flask application that implements neural style transfer, allowing users to blend content and style images seamlessly. The application leverages the power of PyTorch for the style transfer algorithm and uses Tailwind CSS for modern, responsive styling. This combination ensures both high-performance image processing and an intuitive, visually appealing user interface.
Additionally, a camera module lets users capture content images directly through their webcam.
Watch the Demo Video on YouTube
Image style transfer, also known as Neural Style Transfer, refers to a category of software algorithms that modify digital images or videos to emulate the visual style of another image. Essentially, this technique involves combining two images—a content image and a style reference image (such as a famous artwork)—to produce a new image that maintains the content of the first image but adopts the visual style of the second.
In this project, we replicate in PyTorch the style transfer method described in the paper Image Style Transfer Using Convolutional Neural Networks by Gatys et al.
For this style transfer project, we utilize the features extracted from the 19-layer VGG network, which includes a sequence of convolutional and pooling layers along with several fully-connected layers. The convolutional layers are named by their stack and their position within the stack: `conv1_1` is the first layer in the first stack, `conv2_1` is the first layer in the second stack, and the deepest convolutional layer is `conv5_4`.
The process of style transfer involves distinguishing between the content and style of an image. Given a content image and a style image, the goal is to generate a new image that combines the elements of both:
- The arrangement and objects are similar to the content image.
- The style, including colors and textures, resembles the style image.
In this project, we will use a pre-trained VGG19 network to extract the content and style features from an image.
The VGG19 network is divided into two parts:
- `vgg19.features`, which contains all of the convolutional and pooling layers.
- `vgg19.classifier`, which contains the three linear classifier layers at the end.
We will only need the features part, and we will "freeze" the weights of these layers.
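As a rough sketch, the frozen feature extractor can be loaded with torchvision (the `weights` argument assumes a recent torchvision release; older versions use `pretrained=True` instead):

```python
import torch
from torchvision import models

# load only the convolutional "features" portion of the pre-trained VGG19
vgg = models.vgg19(weights=models.VGG19_Weights.DEFAULT).features

# freeze all network parameters -- only the target image will be optimized
for param in vgg.parameters():
    param.requires_grad_(False)

# move the model to the GPU if one is available
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
vgg.to(device)
```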
You can use any images you like. It's beneficial to use smaller images and to resize the content and style images to the same size.
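One possible way to load and resize the images is sketched below; the `load_image` helper and the file names are illustrative, not part of the app's code:

```python
from PIL import Image
from torchvision import transforms

def load_image(img_path, max_size=400, shape=None):
    """Load an image, cap its longest side, and convert it to a normalized tensor."""
    image = Image.open(img_path).convert('RGB')
    # use the requested shape if given, otherwise cap the longest side at max_size
    size = shape if shape is not None else min(max(image.size), max_size)
    in_transform = transforms.Compose([
        transforms.Resize(size),
        transforms.ToTensor(),
        # ImageNet normalization expected by the pre-trained VGG19
        transforms.Normalize((0.485, 0.456, 0.406), (0.229, 0.224, 0.225))])
    # add the batch dimension expected by the network
    return in_transform(image).unsqueeze(0)

content = load_image('content.jpg').to(device)
# force the style image to the same spatial size as the content image
style = load_image('style.jpg', shape=content.shape[-2:]).to(device)
```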
To obtain the content and style representations of an image, we pass the image through the VGG19 network to reach the desired layers and then extract the output from those layers.
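A sketch of such a feature-extraction helper, assuming the layer-index-to-name mapping of torchvision's VGG19 `features` module:

```python
def get_features(image, model, layers=None):
    """Run an image through the model and collect the outputs of selected layers."""
    if layers is None:
        # indices of vgg.features mapped to the layer names used in the paper
        layers = {'0': 'conv1_1', '5': 'conv2_1', '10': 'conv3_1',
                  '19': 'conv4_1', '21': 'conv4_2',  # conv4_2 is the content representation
                  '28': 'conv5_1'}
    features = {}
    x = image
    for name, layer in model._modules.items():
        x = layer(x)
        if name in layers:
            features[layers[name]] = x
    return features
```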
The output of each convolutional layer is a Tensor with dimensions related to the batch size, depth (d), height, and width (h, w). The Gram matrix of a convolutional layer can be computed as follows:
- Obtain the tensor's batch size, depth, height, and width using `batch_size, d, h, w = tensor.size()`.
- Reshape the tensor so that the spatial dimensions are flattened.
- Compute the Gram matrix by multiplying the reshaped tensor by its transpose (see the sketch below).
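Following those steps, a minimal Gram matrix sketch could look like this:

```python
def gram_matrix(tensor):
    """Compute the Gram matrix of a convolutional feature map."""
    batch_size, d, h, w = tensor.size()
    # flatten the spatial dimensions so each row is one feature map
    tensor = tensor.view(batch_size * d, h * w)
    # matrix of dot products between the feature maps
    return torch.mm(tensor, tensor.t())
```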
With functions for feature extraction and Gram matrix computation in place, we can now integrate these components. We'll extract features from our images and calculate the Gram matrices for each layer in our style representation.
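Putting the two pieces together (assuming the `get_features` and `gram_matrix` sketches above):

```python
# extract content and style features once; they stay fixed during optimization
content_features = get_features(content, vgg)
style_features = get_features(style, vgg)

# pre-compute the Gram matrix for every layer of the style representation
style_grams = {layer: gram_matrix(style_features[layer]) for layer in style_features}

# initialize the target as a copy of the content image and optimize its pixels
target = content.clone().requires_grad_(True).to(device)
```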
Below, you have the option to weight the style representation at each relevant layer. Using weights in the range 0–1 is suggested. Emphasizing earlier layers (conv1_1 and conv2_1) will result in more prominent style artifacts in the final image. Emphasizing later layers will highlight smaller features; because each layer has a different size, together they create a multi-scale style representation.
Following the method in the paper, we define alpha (content_weight) and beta (style_weight). This ratio influences the level of stylization in the final image. It is recommended to keep content_weight = 1 and adjust the style_weight to achieve the desired effect.
```python
# weights for each style layer
# weighting earlier layers more will result in *larger* style artifacts
# notice we are excluding `conv4_2`, our content representation
style_weights = {
    'conv1_1': 1.,
    'conv2_1': 0.8,
    'conv3_1': 0.5,
    'conv4_1': 0.3,
    'conv5_1': 0.1
}

content_weight = 1   # alpha
style_weight = 1e6   # beta
```
You will decide on the number of steps for updating your image, similar to a training loop. The number of steps is flexible, but at least 2000 steps are recommended for good results. Fewer steps might be sufficient for testing different weight values or experimenting with images.
During each iteration, calculate the content and style losses and update your target image accordingly.
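A sketch of the update loop follows; the optimizer choice and learning rate here are illustrative, and the loss terms described below go inside the loop body:

```python
from torch import optim

# optimize the pixels of the target image directly; the network weights stay frozen
optimizer = optim.Adam([target], lr=0.003)
steps = 2000  # at least 2000 steps for good results

for step in range(1, steps + 1):
    # recompute the target image's features at every iteration
    target_features = get_features(target, vgg)
    # the content loss, style loss, and update step described below go here
```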
The content loss is the mean squared difference between the target and content features at layer conv4_2, calculated as follows:
```python
content_loss = torch.mean((target_features['conv4_2'] - content_features['conv4_2'])**2)
```
The style loss is calculated similarly but involves iterating through multiple layers specified by the style_weights dictionary. Compute the Gram matrix for the target image (target_gram) and the style image (style_gram) at each layer and compare them to calculate the layer_style_loss. This value is then normalized by the size of the layer.
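Inside the loop body sketched above, the style loss might be accumulated like this:

```python
style_loss = 0
for layer in style_weights:
    # current target Gram matrix and pre-computed style Gram matrix for this layer
    target_feature = target_features[layer]
    target_gram = gram_matrix(target_feature)
    style_gram = style_grams[layer]
    # weighted mean squared difference between the two Gram matrices
    layer_style_loss = style_weights[layer] * torch.mean((target_gram - style_gram)**2)
    # normalize by the size of the layer
    _, d, h, w = target_feature.shape
    style_loss += layer_style_loss / (d * h * w)
```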
Finally, create the total loss by combining the style and content losses, weighted by the specified alpha and beta values. Print this loss periodically; even if it starts large, it should decrease over iterations. Focus on the appearance of the target image rather than the loss value itself.
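Continuing the loop sketch, the combination and the update step could look like the following (these lines also belong inside the `for step ...` loop):

```python
# combine the losses using the alpha and beta weights defined earlier
total_loss = content_weight * content_loss + style_weight * style_loss

# update the target image
optimizer.zero_grad()
total_loss.backward()
optimizer.step()

# print the loss periodically; judge progress by the target image, not the raw number
if step % 400 == 0:
    print(f'Step {step}, total loss: {total_loss.item():.2f}')
```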
You can view the basic implementation of the project in this Kaggle notebook: https://www.kaggle.com/code/nithin1729s/styletransfer
- Set up a virtual environment:

  ```bash
  virtualenv env
  source env/bin/activate
  ```

- Install the required dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Install tkinter:

  ```bash
  sudo apt-get install python3-tk
  ```

- Run the project:

  ```bash
  python wsgi.py
  ```