- partition image to segments
- group pixels with similar visual characteristics
- the specificity or scale is dependent on the domain
- segmentation in humans
- likely to be different
- subjective
- ill-defined problem
- For gray images
Theory
Approaches
- Top-down
- pixels belong together because they come from the same object
- in line with Gestalt Psychology
- Bottom-up
- pixels belong together because they look similar
- Most techniques are bottom-up
Clustering based
- Cluster pixels based on its visual characteristics
- Each pixel can be seen as a feature vector of
- brightness
- colour (R,G,B channels)
- position
- depth
- motion
- texture
- material
- Map the pixels into a feature space
- Pixel Similarity
- Dissimilarity or distance between features → L2 Norm
- Using pixel similarity/dissimilarity, cluster the features such that similar pixels cluster together
- each cluster is an image segment
- disjoint regions could belong to the same cluster
- using position as a feature could discourage this
- Algorithms
Graph based
- images as graphs → G = (V,E)
- a vertex for each pixel, and its features, like in clustering
- an edge → weighted by the affinity or similarity between vertices
- Algorithms