
[cs231n] 2. Image Classification - NN, KNN, Linear Classification

yeon42 2022. 1. 10. 15:18

Image Classification

A computer does not perceive an image the way a person does; it only sees numbers, i.e., the pixel values!

To classify a wide variety of images, we need an algorithm that is robust to the many variations an object can undergo (viewpoint, illumination, deformation, occlusion, etc.).
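As a minimal illustration (the array below is randomly generated and stands in for a real photo), a 32x32 RGB image is just a 32x32x3 grid of integers in the range 0-255:

import numpy as np

# a stand-in "image": 32x32 pixels, 3 color channels (R, G, B), values 0-255
img = np.random.randint(0, 256, size=(32, 32, 3), dtype=np.uint8)
print(img.shape)   # (32, 32, 3): height, width, channels
print(img[0, 0])   # the three numbers the computer sees for the top-left pixel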

 

 

Data-driven Image Classification

The traditional rule-based approach could not be extended to many different classes.

ex) a cat 'has' ears, eyes, and a nose -> use these rules to decide that the image shows a cat

    but cats look different depending on the breed, so fixed rules break down.

 

 

Instead of writing the rules by hand, build a model from data and let it solve the problem!

Source: https://3months.tistory.com/512

1. Collect a (large) dataset of images and labels

2. Use Machine Learning to train a classifier

-> build a model that can recognize a variety of objects

3. Evaluate the classifier on new images
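These three steps imply a common interface that every classifier in this post follows; a rough sketch (the class and method names are illustrative, not from the lecture code):

class Classifier:
    def train(self, images, labels):
        """ Step 2: fit a model to the collected images and labels. """
        raise NotImplementedError

    def predict(self, images):
        """ Step 3: return predicted labels for new images. """
        raise NotImplementedError

The NearestNeighbor class further down follows exactly this train/predict pattern.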

 

 

 

NN (Nearest Neighbor)

- Store the training data; at prediction time, output the label of the training example that is closest to the input image.

1. memorize all data and labels

2. predict a new image's label by finding the most similar training image

 

In the lecture figure, the training images on the right are sorted by how similar they are to each test image.

 

 

 

* So how do we measure how close (distance) two images are?

 

(+) The distance is computed from the pixel values (RGB) at the same positions in the two images.
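Concretely, the L1 (Manhattan) distance used in the code below sums the absolute pixel-wise differences: d1(I1, I2) = sum over all pixel positions p of |I1[p] - I2[p]|. A tiny worked example (the pixel values are made up):

import numpy as np

I1 = np.array([56, 32, 10, 18])   # four pixel values from image 1 (made up)
I2 = np.array([10, 20, 24, 17])   # the corresponding pixels from image 2
d1 = np.sum(np.abs(I1 - I2))      # 46 + 12 + 14 + 1 = 73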

 

# L1 distance example (NN classifier)
import numpy as np

class NearestNeighbor:
    def __init__(self):
        pass
    
    # training just memorizes the data (simple)
    def train(self, X, y):
        """ X is N x D where each row is an example.
            y is 1-dimensional of size N """
        # the nearest neighbor classifier simply remembers all the training data
        self.Xtr = X
        self.ytr = y

    def predict(self, X):
        """ X is N x D where each row is an example we wish to predict the label for """
        num_test = X.shape[0]
        # let's make sure that the output type matches the input type
        Ypred = np.zeros(num_test, dtype=self.ytr.dtype)
        
        # loop over all test rows
        for i in range(num_test):
            # find the nearest training image to the i'th test image
            # using the L1 distance (sum of absolute value differences)
            distances = np.sum(np.abs(self.Xtr - X[i, :]), axis=1)
            min_index = np.argmin(distances)  # get the index with the smallest distance
            Ypred[i] = self.ytr[min_index]    # predict the label of the nearest example
            
        return Ypred
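A minimal usage sketch with randomly generated stand-in data (the shapes mimic small batches of flattened 32x32x3 images; the real CIFAR-10 loading code is not part of this post):

import numpy as np

Xtr = np.random.randint(0, 256, size=(1000, 3072))   # 1,000 "training images", 32*32*3 pixels each
ytr = np.random.randint(0, 10, size=1000)            # one of 10 class labels per image
Xte = np.random.randint(0, 256, size=(100, 3072))    # 100 "test images"
yte = np.random.randint(0, 10, size=100)

nn = NearestNeighbor()
nn.train(Xtr, ytr)             # just memorizes the training set
yte_pred = nn.predict(Xte)     # L1 nearest-neighbor prediction (slow: compares against every training image)
print('accuracy:', np.mean(yte_pred == yte))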

 

 

KNN (K-nearest neighbor)

- NN bases its prediction on a single nearest example (a single label), so one noisy neighbor can decide the outcome and performance suffers.

At test (prediction) time, find the k training examples closest to the input and predict the label that appears most often among them.

(voting: taking the most frequent label among these several neighbors as the prediction result; see the sketch below)
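A minimal sketch of the k-NN prediction step, reusing the L1 distance from the NN code above; the function name and the default k=5 are illustrative choices, and the labels are assumed to be non-negative integers:

import numpy as np

def knn_predict(Xtr, ytr, X, k=5):
    """ Predict a label for each row of X by majority vote among
        the k nearest training examples under the L1 distance. """
    num_test = X.shape[0]
    ypred = np.zeros(num_test, dtype=ytr.dtype)
    for i in range(num_test):
        distances = np.sum(np.abs(Xtr - X[i, :]), axis=1)   # L1 distance to every training image
        nearest = np.argsort(distances)[:k]                 # indices of the k closest training images
        votes = np.bincount(ytr[nearest])                   # count how often each label appears among them
        ypred[i] = np.argmax(votes)                         # the most frequent label wins the vote
    return ypred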

 

KNN is less sensitive to outliers than NN.

It generalizes better to unseen data!

(the goal of machine learning is to predict correctly on unseen data)

 

 

 

 

* How do we find the best model?

Tune the hyperparameters!

For KNN these are k and the distance function (e.g., L1 vs. L2).

 

- Try many settings and keep the one that works best

but the test set cannot be used to search for hyperparameters

- the test set should be used only for the final performance evaluation!

- split off part of the training set to create a validation set for hyperparameter tuning (see the sketch below).
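A minimal sketch of picking k on such a validation split, reusing the knn_predict function from above; the data here is random stand-in data and the candidate k values are arbitrary illustrative choices:

import numpy as np

# stand-in training data (in practice this is the real, labeled training set)
Xtr = np.random.randint(0, 256, size=(1000, 3072))
ytr = np.random.randint(0, 10, size=1000)

# hold out part of the training set as a validation set; the test set stays untouched
num_val = 200
Xval, yval = Xtr[:num_val], ytr[:num_val]
Xtr_sub, ytr_sub = Xtr[num_val:], ytr[num_val:]

best_k, best_acc = 1, 0.0
for k in [1, 3, 5, 10, 20]:
    yval_pred = knn_predict(Xtr_sub, ytr_sub, Xval, k=k)   # k-NN sketch from the previous section
    acc = np.mean(yval_pred == yval)
    if acc > best_acc:
        best_k, best_acc = k, acc
# best_k is then fixed, and only at the very end is the classifier evaluated once on the test set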

 

- NN tends not to perform well on image classification.

Why?

1. prediction is very slow

2. pixel-wise differences do not capture the perceptual similarity (or difference) between images well

3. Curse of dimensionality

    - for the algorithm to work well, the training data has to cover the input space densely, and the amount of data required grows exponentially with the dimension;

      in practice collecting that much data is impossible, so coverage stays sparse and performance drops sharply

 

 

 

 

Linear Classifier

A somewhat more powerful approach to Image Classification than KNN

 

Parametric Approach

 

f(x, W) = Wx + b

- inputs to f: x and W

- x: the input data (the image)

- W: the parameters (weights)

- b (bias): an offset that helps the classifier, e.g., when some classes appear more often than others

   - it exists independently of the data -> it is added to the scores without interacting with the input

 

 

The simplest way to combine them is to multiply x by W

- a matrix multiplication (dot product of each row of W with x); see the sketch below
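A minimal sketch of the score computation f(x, W) = Wx + b with CIFAR-10-style shapes (10 classes, 32*32*3 = 3072 input values); the random W and b stand in for parameters that would normally be learned:

import numpy as np

x = np.random.randint(0, 256, size=3072).astype(np.float64)   # one flattened 32x32x3 image (stand-in)
W = 0.0001 * np.random.randn(10, 3072)                        # weights: one row of 3072 values per class
b = np.random.randn(10)                                       # bias: one value per class, independent of x

scores = W.dot(x) + b            # f(x, W) = Wx + b -> 10 class scores
pred_class = np.argmax(scores)   # the class with the highest score is the prediction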

 

 

From here on, we will keep improving the values of W (the parameters) to build a classifier with good performance.

 

 

 

 

 

Reference