Search Engine

Based on work done freshman summer. Used GCP to train a model that implements product search with Tensorflow API. Used data from online stores as training and testing data. Used embeddings, feature extraction and sentence encoding as input to the Tensorflow model.

Below is an extremely high level view of the model.

Github

https://github.com/rah4927/search

How it Works

Embeddings

  • Used Word2Vec and Doc2Vec to get vector embeddings of products and product metadata
  • Used Universal Sentence Encoding to get embeddings of product raw data and user queries
  • Used VGG 16 to extract features of product images

Model

Here’s a very simplified picture of the mode. Created a neural network N(q, w) that takes a query q and a product w and outputs a score based on the likelihood of them being correlated.

Ranking

For a given query q, sort N(q, w) over all products w. This is our initial search engine. ::: :::::