# Images
# Binary Image
Translate the screen into pixels, set them to 1 if they are totally filled.
# Gray scale Image
Use 255 bits to represent intensity between white and black for each pixel of the screen.
# Binary Mask
Set pixel to 1 in foreground shape.
# Color Image
Three channels, RGB, each have map of screen pixels.
If a pixel should be black, (0,0,0) tensor.
Adjust each value in the range of 255 bits for each channel.
# Depth Map
Each pixel indicates how close or far from the camera, set up like gray scale.
# Image Functions
A function that maps a location to an intensity value.
# silly examples
f(x, y) = x + y
f(x, y) = {
1 if |x| <= 1
0 otherwise
}# Operations
You can perform operations on the image function.
- flip:
f(x, y) => f(-x, y) - negate:
f(x, y) => 1 - f(x, y) - add:
f(x, y) => f(x, y) + 10
You can combine operations, add two images, etc....
Also set operations like union is called masking.
⨂ indicates a "cross correlation".
# Comparing images
low distance = alike
low theta = alike
high cos(theta) = alike
high similarity = alikeDistance
- Count different pixels, sum the amount of differences
- L1: Calculate the distance between each pixel, sum the distances
- L2: Calculate the distance squared, sum the results, take square root of sum
Similarity
- Dot product all pixels, take sum divided by square root of each pixel in second image squared sum?
# Image Moments
Goal:
M(i,j) 3 x 3 represents some image M(x,y) 20 x 20
Definition:
Given an original image I(x,y), Mij = sum(x)sum(y)x^iy^iI(x,y)
Pixels -> Histograms -> Distributions
Find histograms by counting patterns in pixels, convert to distributions by multiplication in a linear way to unify it 0 to 1.
# Cross-Entropy
Cross-Entropy is a measure of the difference between two probability distributions for a given random variable or set of events.
p
-log2^p
-p*log2^p
sum(for each p)
p and q
-log2^q
-p*log2^q
sum(for each zip(p, q))# Filter
# Cross Correlation
Take a 1d list of pixels and form a matrix with it's left neighbor (shift right
- above and right neighbor (shift left 1) below (both neighbors have padding of one nil value. This matrix can be multiplied by new vectors to achieve filter operations.
Convolution: flip filter vector
Convolution is communicative and associative