Data Science

Neural style transfer involves combining two images together:
- One image refers to our content image $C$
- The second image refers to our style image $S$
- Combining these two images creates a genereated image $G$
Essentially, we apply a style from one image to our content image
Then, the generated image will be include the content of our content image with the style from our style image

We must define a new cost function in order to build a neural style transfer system
The cost function measures how well our generated image $G$ fits to our data
The cost function is minimized using gradient descent
The cost function is defined as the following:

J(G) = \alpha J_{c}(C,G) + \beta J_{s}(S,G)

The cost function is made up of two cost sub-functions:
- The cost function $J_{c}(C,G)$ is the cost of the content image
- The cost function $J_{s}(C,G)$ is the cost of the style image
The content cost function measures the similarity between the content image $C$ and generated image $G$
The style cost function measures the similarity between the style image $S$ and generated image $G$
The hyperaparameters $\alpha$ and $\beta$ specify the weighting between the content and style cost functions
For example, we could increase $\alpha$ to place more weight on the content cost function
We could also decrease $\beta$ to place more weight on the content cost function

The general algorithm looks like the following:
1. Initiate some random $G$ with the following dimensions:
$G: 100 \times 100 \times 3$
1. Use gradient descent to minimize $J(G)$ :
$G = G - \frac{\partial J(G)}{\partial G}$
Therefore, gradient descent updates the pixels of our image $G$
An example looks like the following:

costnst

Visualizing a CNN

Neural Style Transfer