Portfolio

Autoencoders & CLIP models

Implemented Autoencoder model for image compression and representational learning. Moreover, the CLIP model is explored for query tasks as shown below. Press blue link above for more details.

Diffusion Models

Diffusion-based neural networks have gained traction in recent years for tasks such as generative AI. In this project, I implement a diffusion-based network for image generation using CIFAR10 dataset. Press blue link above for more details.

Laplacian Pyramids: Image Blending

Laplacian pyramids are useful to capture information of images at various scales. In this project, I use this to blend images together as seen below. Press blue link above for more details.

Panorama Stitching with ORB/SIFT Features

Panorama stitching involves joining images taken at different angles. This is nontrivial since it’s challenging to line them up. By using ORB features (key landmarks), one can use this to find the affine mapping related to the two images as shown below. Press blue link above for more details.

VGG Network: Scene Recognition

In this project, I perform classification (scene recognition) with a VGG-type neural network using the miniplaces dataset to recognize the scene of images. Press blue link above for more details.