Skip to content

Commit f3964fb

Browse files
authored
Update README.md
another update to rm
1 parent be6212a commit f3964fb

File tree

1 file changed

+5
-4
lines changed

1 file changed

+5
-4
lines changed

README.md

+5-4
Original file line numberDiff line numberDiff line change
@@ -13,13 +13,14 @@ To provide accurate recipe recommendations from BonAppetit.com based on user ima
1313
* `ingred_app_builder.ipynb` Builds the aforementioned NLP model.
1414
* `ba_scrape.ipynb` notebook shows the scraping workflow. Users can follow the same workflow to scrape for themselves. Note: this project scraped only until Dec. 2, 2020
1515
* `w_urls.csv` and `web_apps/super.csv` are two resulting `.csv` files with scraped BA data. Super is a merged file for use in web apps. **NB** I do not own the rights to these recipes, they are the IP of Bon Appetit and Conde Nast. I provide this as the result of my student project.
16+
* `ba_presentation.pdf` is a pdf of the presentation I gave on this project. It has further visual explanations.
1617

1718

1819
## Methods:
1920
I scraped ~5,000 recipes and images from around an 8 year period as found freely on BonAppetit.com
20-
<br>
21-
*Topic Modeling* Non-Negative Matrix Factorization was used to provide useful topic models in creating the applications. These topics served as filters to allow the user to add specificity to their search. Such unsupervised topic modeling was necessary given the poor quality of BA provided topics which are often subjective, particular to their magazine, and inconsistent.
22-
*Computer Vision Image Retrieval* After retrieving weights/features from my corpus of ~5,000 BA images on a ResNet50 model, I binarized these features to use in a k-Nearest Neighbors model. A user can then submit a photograph which is processed with ResNet50, compared to the weights for distance in k-NN and a result of the top neighbors is produced with images and links to BA.
23-
*NLP Recommender* Using a vectorized dataframe of all BA recipe ingredients, user text input is similarly vectorized and its distance to other recipes is calculated. Within a given cosine similarity score threshold, results are produced for the user with images and links to BA.
21+
<br><br>
22+
*Topic Modeling* Non-Negative Matrix Factorization was used to provide useful topic models in creating the applications. These topics served as filters to allow the user to add specificity to their search. Such unsupervised topic modeling was necessary given the poor quality of BA provided topics which are often subjective, particular to their magazine, and inconsistent. <br><br>
23+
*Computer Vision Image Retrieval* After retrieving weights/features from my corpus of ~5,000 BA images on a ResNet50 model, I binarized these features to use in a k-Nearest Neighbors model. A user can then submit a photograph which is processed with ResNet50, compared to the weights for distance in k-NN and a result of the top neighbors is produced with images and links to BA. <br><br>
24+
*NLP Recommender* Using a vectorized dataframe of all BA recipe ingredients, user text input is similarly vectorized and its distance to other recipes is calculated. Within a given cosine similarity score threshold, results are produced for the user with images and links to BA.<br><br>
2425

2526

0 commit comments

Comments
 (0)