Kaggle was having an Image Challenge and I decided to try my hand at it. The challenge is to identify ships on the ocean from satellite images.
My first attempt involved blurring the training images. Attempting to filter large parts of the ocean. Then logging parts of the image for kNN usage. And noting if the image piece was contained a piece of the ship. To attempt to reduce training set size, the initial image piece was larger that what was reasonable for reporting back as true or false. My strategy was to take a smaller piece of the identified image and log that part as well.
This first attempt turned out well, and I plan to make another attempt or 2. My code on GitHub.
Steps listed below
- Original Image with Training Data
- Blur and filter image
- Identify larger pieces (32×32 pixels) based on training data. (Thinner lines are False, thicker lines are True)
- Identify smaller pieces (4×4 pixels) based on training data. (Thinner lines are False, thicker lines are True)
- Apply data to knn
- Review test image
- Get result of larger images
- Get result of smaller images
- Generate data to submit
- Comparison of Prediction and Test data