TraderJoesFallAIPartyPlanner PlaywrightLlamaIndexVectorSearch
Overview
In this tutorial we are going to scrape the Trader Joe's website for all their Fall Faves using Playwright, and then create an AI party planner using the LlamaIndex x MongoDB Vector Search integration and a chat engine. This will help us figure out easily which Trader Joe's fall faves would be perfect for our fall festivities.
What's Covered
- Building a Trader Joe’s AI party planner using Playwright, LlamaIndex, and MongoDB Atlas Vector Search
- Scraping Trader Joe’s fall items with Playwright and formatting them for chatbot use
- Setting up and embedding product data in MongoDB Atlas Vector Store for semantic search
- Creating a Retrieval-Augmented Generation (RAG) chatbot to answer party planning questions
- Adding interactive Chat Engine functionality for back-and-forth Q&A about fall party items
Part 1: Scrape the Trader Joes website for their fall items
First, let’s go ahead and install Playwright:
Collecting playwright Downloading playwright-1.48.0-py3-none-manylinux1_x86_64.whl.metadata (3.5 kB) Requirement already satisfied: greenlet==3.1.1 in /usr/local/lib/python3.10/dist-packages (from playwright) (3.1.1) Collecting pyee==12.0.0 (from playwright) Downloading pyee-12.0.0-py3-none-any.whl.metadata (2.8 kB) Requirement already satisfied: typing-extensions in /usr/local/lib/python3.10/dist-packages (from pyee==12.0.0->playwright) (4.12.2) Downloading playwright-1.48.0-py3-none-manylinux1_x86_64.whl (38.2 MB) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 38.2/38.2 MB 45.5 MB/s eta 0:00:00 Downloading pyee-12.0.0-py3-none-any.whl (14 kB) Installing collected packages: pyee, playwright Successfully installed playwright-1.48.0 pyee-12.0.0 Downloading Chromium 130.0.6723.31 (playwright build v1140) from https://playwright.azureedge.net/builds/chromium/1140/chromium-linux.zip 164.5 MiB [] 0% 0.0s164.5 MiB [] 0% 33.9s164.5 MiB [] 0% 29.2s164.5 MiB [] 0% 16.1s164.5 MiB [] 0% 9.0s164.5 MiB [] 1% 6.6s164.5 MiB [] 1% 5.5s164.5 MiB [] 2% 4.6s164.5 MiB [] 2% 5.0s164.5 MiB [] 2% 5.1s164.5 MiB [] 3% 4.9s164.5 MiB [] 4% 4.6s164.5 MiB [] 4% 4.4s164.5 MiB [] 5% 4.1s164.5 MiB [] 6% 3.8s164.5 MiB [] 6% 3.6s164.5 MiB [] 7% 3.5s164.5 MiB [] 8% 3.3s164.5 MiB [] 8% 3.2s164.5 MiB [] 9% 3.1s164.5 MiB [] 10% 3.0s164.5 MiB [] 11% 2.9s164.5 MiB [] 11% 2.8s164.5 MiB [] 12% 2.8s164.5 MiB [] 12% 2.7s164.5 MiB [] 13% 2.7s164.5 MiB [] 14% 2.6s164.5 MiB [] 15% 2.5s164.5 MiB [] 17% 2.4s164.5 MiB [] 18% 2.3s164.5 MiB [] 19% 2.2s164.5 MiB [] 20% 2.1s164.5 MiB [] 21% 2.0s164.5 MiB [] 22% 2.0s164.5 MiB [] 23% 1.9s164.5 MiB [] 24% 1.9s164.5 MiB [] 25% 1.8s164.5 MiB [] 26% 1.9s164.5 MiB [] 26% 1.8s164.5 MiB [] 27% 1.8s164.5 MiB [] 28% 1.8s164.5 MiB [] 29% 1.7s164.5 MiB [] 30% 1.7s164.5 MiB [] 31% 1.7s164.5 MiB [] 32% 1.6s164.5 MiB [] 33% 1.6s164.5 MiB [] 34% 1.6s164.5 MiB [] 35% 1.6s164.5 MiB [] 36% 1.5s164.5 MiB [] 37% 1.5s164.5 MiB [] 38% 1.5s164.5 MiB [] 39% 1.4s164.5 MiB [] 40% 1.4s164.5 MiB [] 41% 1.6s164.5 MiB [] 42% 1.6s164.5 MiB [] 43% 1.5s164.5 MiB [] 44% 1.5s164.5 MiB [] 45% 1.5s164.5 MiB [] 46% 1.4s164.5 MiB [] 47% 1.4s164.5 MiB [] 48% 1.4s164.5 MiB [] 49% 1.3s164.5 MiB [] 50% 1.3s164.5 MiB [] 51% 1.3s164.5 MiB [] 52% 1.2s164.5 MiB [] 53% 1.2s164.5 MiB [] 54% 1.2s164.5 MiB [] 55% 1.1s164.5 MiB [] 56% 1.1s164.5 MiB [] 57% 1.1s164.5 MiB [] 58% 1.0s164.5 MiB [] 60% 1.0s164.5 MiB [] 61% 1.0s164.5 MiB [] 62% 0.9s164.5 MiB [] 63% 0.9s164.5 MiB [] 64% 0.9s164.5 MiB [] 65% 0.8s164.5 MiB [] 66% 0.8s164.5 MiB [] 67% 0.8s164.5 MiB [] 68% 0.7s164.5 MiB [] 69% 0.7s164.5 MiB [] 70% 0.7s164.5 MiB [] 71% 0.6s164.5 MiB [] 73% 0.6s164.5 MiB [] 74% 0.6s164.5 MiB [] 75% 0.6s164.5 MiB [] 76% 0.5s164.5 MiB [] 77% 0.5s164.5 MiB [] 78% 0.5s164.5 MiB [] 79% 0.5s164.5 MiB [] 80% 0.4s164.5 MiB [] 81% 0.4s164.5 MiB [] 82% 0.4s164.5 MiB [] 83% 0.4s164.5 MiB [] 84% 0.3s164.5 MiB [] 85% 0.3s164.5 MiB [] 86% 0.3s164.5 MiB [] 88% 0.3s164.5 MiB [] 89% 0.2s164.5 MiB [] 90% 0.2s164.5 MiB [] 91% 0.2s164.5 MiB [] 92% 0.2s164.5 MiB [] 94% 0.1s164.5 MiB [] 95% 0.1s164.5 MiB [] 96% 0.1s164.5 MiB [] 97% 0.0s164.5 MiB [] 99% 0.0s164.5 MiB [] 100% 0.0s Chromium 130.0.6723.31 (playwright build v1140) downloaded to /root/.cache/ms-playwright/chromium-1140 Downloading FFMPEG playwright build v1010 from https://playwright.azureedge.net/builds/ffmpeg/1010/ffmpeg-linux.zip 2.3 MiB [] 0% 0.0s2.3 MiB [] 4% 0.4s2.3 MiB [] 10% 0.3s2.3 MiB [] 24% 0.1s2.3 MiB [] 67% 0.0s2.3 MiB [] 100% 0.0s FFMPEG playwright build v1010 downloaded to /root/.cache/ms-playwright/ffmpeg-1010 Downloading Firefox 131.0 (playwright build v1465) from https://playwright.azureedge.net/builds/firefox/1465/firefox-ubuntu-22.04.zip 86.7 MiB [] 0% 0.0s86.7 MiB [] 0% 19.0s86.7 MiB [] 0% 16.2s86.7 MiB [] 0% 7.6s86.7 MiB [] 1% 4.2s86.7 MiB [] 2% 3.0s86.7 MiB [] 3% 2.5s86.7 MiB [] 5% 2.3s86.7 MiB [] 5% 2.1s86.7 MiB [] 6% 2.1s86.7 MiB [] 7% 2.1s86.7 MiB [] 8% 2.0s86.7 MiB [] 10% 1.8s86.7 MiB [] 12% 1.5s86.7 MiB [] 13% 1.5s86.7 MiB [] 15% 1.4s86.7 MiB [] 16% 1.3s86.7 MiB [] 18% 1.3s86.7 MiB [] 19% 1.3s86.7 MiB [] 20% 1.2s86.7 MiB [] 22% 1.2s86.7 MiB [] 23% 1.1s86.7 MiB [] 24% 1.1s86.7 MiB [] 26% 1.1s86.7 MiB [] 27% 1.0s86.7 MiB [] 29% 1.0s86.7 MiB [] 31% 1.0s86.7 MiB [] 32% 0.9s86.7 MiB [] 34% 0.9s86.7 MiB [] 36% 0.8s86.7 MiB [] 38% 0.8s86.7 MiB [] 39% 0.8s86.7 MiB [] 41% 0.8s86.7 MiB [] 42% 0.7s86.7 MiB [] 44% 0.7s86.7 MiB [] 46% 0.7s86.7 MiB [] 47% 0.7s86.7 MiB [] 49% 0.6s86.7 MiB [] 51% 0.6s86.7 MiB [] 54% 0.5s86.7 MiB [] 56% 0.5s86.7 MiB [] 58% 0.5s86.7 MiB [] 59% 0.5s86.7 MiB [] 60% 0.5s86.7 MiB [] 61% 0.5s86.7 MiB [] 61% 0.6s86.7 MiB [] 62% 0.6s86.7 MiB [] 64% 0.6s86.7 MiB [] 64% 0.7s86.7 MiB [] 64% 0.8s86.7 MiB [] 64% 0.9s86.7 MiB [] 64% 1.0s86.7 MiB [] 64% 1.1s86.7 MiB [] 64% 1.2s86.7 MiB [] 65% 1.2s86.7 MiB [] 65% 1.3s86.7 MiB [] 66% 1.3s86.7 MiB [] 67% 1.3s86.7 MiB [] 69% 1.2s86.7 MiB [] 70% 1.1s86.7 MiB [] 72% 1.0s86.7 MiB [] 73% 1.0s86.7 MiB [] 74% 0.9s86.7 MiB [] 75% 0.9s86.7 MiB [] 76% 0.9s86.7 MiB [] 77% 0.8s86.7 MiB [] 77% 0.9s86.7 MiB [] 78% 0.9s86.7 MiB [] 78% 0.8s86.7 MiB [] 79% 0.8s86.7 MiB [] 80% 0.8s86.7 MiB [] 81% 0.8s86.7 MiB [] 81% 0.7s86.7 MiB [] 82% 0.7s86.7 MiB [] 83% 0.7s86.7 MiB [] 83% 0.6s86.7 MiB [] 84% 0.6s86.7 MiB [] 85% 0.6s86.7 MiB [] 86% 0.5s86.7 MiB [] 87% 0.5s86.7 MiB [] 88% 0.5s86.7 MiB [] 88% 0.4s86.7 MiB [] 89% 0.4s86.7 MiB [] 90% 0.4s86.7 MiB [] 91% 0.4s86.7 MiB [] 92% 0.3s86.7 MiB [] 93% 0.3s86.7 MiB [] 94% 0.2s86.7 MiB [] 95% 0.2s86.7 MiB [] 96% 0.2s86.7 MiB [] 97% 0.1s86.7 MiB [] 98% 0.1s86.7 MiB [] 100% 0.0s Firefox 131.0 (playwright build v1465) downloaded to /root/.cache/ms-playwright/firefox-1465 Downloading Webkit 18.0 (playwright build v2083) from https://playwright.azureedge.net/builds/webkit/2083/webkit-ubuntu-22.04.zip 90.5 MiB [] 0% 0.0s90.5 MiB [] 0% 19.8s90.5 MiB [] 0% 21.1s90.5 MiB [] 0% 25.5s90.5 MiB [] 0% 25.3s90.5 MiB [] 0% 16.3s90.5 MiB [] 1% 8.9s90.5 MiB [] 1% 6.5s90.5 MiB [] 2% 5.4s90.5 MiB [] 3% 4.7s90.5 MiB [] 4% 4.1s90.5 MiB [] 4% 4.2s90.5 MiB [] 5% 4.0s90.5 MiB [] 5% 3.9s90.5 MiB [] 6% 3.6s90.5 MiB [] 7% 3.5s90.5 MiB [] 7% 3.7s90.5 MiB [] 8% 3.7s90.5 MiB [] 8% 3.5s90.5 MiB [] 9% 3.5s90.5 MiB [] 10% 3.3s90.5 MiB [] 10% 3.2s90.5 MiB [] 11% 3.2s90.5 MiB [] 12% 3.1s90.5 MiB [] 12% 3.0s90.5 MiB [] 13% 2.9s90.5 MiB [] 14% 2.7s90.5 MiB [] 15% 2.6s90.5 MiB [] 16% 2.6s90.5 MiB [] 17% 2.5s90.5 MiB [] 18% 2.4s90.5 MiB [] 19% 2.4s90.5 MiB [] 20% 2.3s90.5 MiB [] 21% 2.2s90.5 MiB [] 22% 2.2s90.5 MiB [] 23% 2.1s90.5 MiB [] 24% 2.1s90.5 MiB [] 25% 2.0s90.5 MiB [] 27% 1.9s90.5 MiB [] 28% 1.9s90.5 MiB [] 29% 1.8s90.5 MiB [] 30% 1.7s90.5 MiB [] 30% 1.8s90.5 MiB [] 31% 1.8s90.5 MiB [] 31% 1.9s90.5 MiB [] 32% 2.0s90.5 MiB [] 32% 2.1s90.5 MiB [] 33% 2.1s90.5 MiB [] 33% 2.2s90.5 MiB [] 34% 2.2s90.5 MiB [] 35% 2.1s90.5 MiB [] 37% 2.0s90.5 MiB [] 39% 1.8s90.5 MiB [] 40% 1.7s90.5 MiB [] 42% 1.6s90.5 MiB [] 44% 1.6s90.5 MiB [] 45% 1.5s90.5 MiB [] 47% 1.4s90.5 MiB [] 48% 1.3s90.5 MiB [] 50% 1.3s90.5 MiB [] 51% 1.2s90.5 MiB [] 52% 1.2s90.5 MiB [] 54% 1.2s90.5 MiB [] 55% 1.1s90.5 MiB [] 57% 1.0s90.5 MiB [] 58% 1.0s90.5 MiB [] 59% 1.0s90.5 MiB [] 60% 0.9s90.5 MiB [] 62% 0.9s90.5 MiB [] 63% 0.9s90.5 MiB [] 64% 0.8s90.5 MiB [] 65% 0.8s90.5 MiB [] 66% 0.8s90.5 MiB [] 68% 0.7s90.5 MiB [] 69% 0.7s90.5 MiB [] 70% 0.7s90.5 MiB [] 72% 0.7s90.5 MiB [] 73% 0.6s90.5 MiB [] 74% 0.6s90.5 MiB [] 76% 0.6s90.5 MiB [] 78% 0.5s90.5 MiB [] 79% 0.5s90.5 MiB [] 80% 0.5s90.5 MiB [] 81% 0.5s90.5 MiB [] 82% 0.5s90.5 MiB [] 84% 0.4s90.5 MiB [] 85% 0.4s90.5 MiB [] 87% 0.3s90.5 MiB [] 89% 0.3s90.5 MiB [] 92% 0.2s90.5 MiB [] 94% 0.1s90.5 MiB [] 95% 0.1s90.5 MiB [] 96% 0.1s90.5 MiB [] 98% 0.0s90.5 MiB [] 100% 0.0s Webkit 18.0 (playwright build v2083) downloaded to /root/.cache/ms-playwright/webkit-2083 Playwright Host validation warning: ╔══════════════════════════════════════════════════════╗ ║ Host system is missing dependencies to run browsers. ║ ║ Missing libraries: ║ ║ libwoff2dec.so.1.0.2 ║ ║ libgstgl-1.0.so.0 ║ ║ libgstcodecparsers-1.0.so.0 ║ ║ libharfbuzz-icu.so.0 ║ ║ libenchant-2.so.2 ║ ║ libsecret-1.so.0 ║ ║ libhyphen.so.0 ║ ║ libmanette-0.2.so.0 ║ ╚══════════════════════════════════════════════════════╝ at validateDependenciesLinux (/usr/local/lib/python3.10/dist-packages/playwright/driver/package/lib/server/registry/dependencies.js:216:9) at process.processTicksAndRejections (node:internal/process/task_queues:95:5) at async Registry._validateHostRequirements (/usr/local/lib/python3.10/dist-packages/playwright/driver/package/lib/server/registry/index.js:707:43) at async Registry._validateHostRequirementsForExecutableIfNeeded (/usr/local/lib/python3.10/dist-packages/playwright/driver/package/lib/server/registry/index.js:805:7) at async Registry.validateHostRequirementsForExecutablesIfNeeded (/usr/local/lib/python3.10/dist-packages/playwright/driver/package/lib/server/registry/index.js:794:43) at async t.<anonymous> (/usr/local/lib/python3.10/dist-packages/playwright/driver/package/lib/cli/program.js:119:7)
Once that’s done installing, we can go ahead and import our necessary packages:
Please keep in mind that we are using async because we are running everything inside of a Google Colab notebook.
Now, let’s start building our traderJoesScraper:
We started off with manually putting in all the links we want to scrape the information off of, please keep in mind that if you’re hoping to turn this into a scalable application it’s recommended to use pagination for this part, but for the sake of simplicity, we can input them manually.
Then we just looped through each of the URL’s listed, waited for our main selector to show up that had all the elements we were hoping to scrape, and then extracted our “name” and “price”.
Once we ran that, we got a list of all our products from the Fall Faves tag!
Name: Teeny Tiny Pecan Pies, Price: 4.99, Category: Food
Name: Hickory Road Smokehouse Uncured Carver Ham, Price: 5.99, Category: Food
Name: Frick's Uncured Carver Ham, Price: 5.99, Category: Food
Name: Organic Fuyu Persimmons, Price: 3.99, Category: Food
Name: Cut Sweet Potatoes, Price: 5.49, Category: Food
Name: Jumbo Pomegranate, Price: 2.99, Category: Food
Name: Cheesy Herby Biscuits, Price: 4.99, Category: Food
Name: Egg Nog Whole Milk Greek Yogurt, Price: 0.99, Category: Food
Name: Triple Creme Brie with Calvados Apple Brandy, Price: 12.99, Category: Food
Name: Chocolatey Caramel Pretzel Drumstick Decorating Kit, Price: 4.99, Category: Food
Name: Hot Chocolate Stirring Spoon, Price: 0.99, Category: Food
Name: Gluten Free Turkey Gravy, Price: 3.99, Category: Food
Name: Dinner Rolls, Price: 2.99, Category: Food
Name: Organic Peeled & Cooked Chestnuts, Price: 4.99, Category: Food
Name: Butter with Brown Sugar & Maple Syrup, Price: 2.99, Category: Food
Name: Double Fold Alcohol Free Bourbon Vanilla Flavoring, Price: 7.99, Category: Food
Name: Delicata Squash, Price: 1.49, Category: Food
Name: Spaghetti Squash, Price: 2.99, Category: Food
Name: Sugar Bee® Apple, Price: 1.29, Category: Food
Name: Savory Squash Pastry Bites, Price: 5.49, Category: Food
Name: All Butter Apple Shortbread Cookies, Price: 3.49, Category: Food
Name: Caramelized Onion Goat's Milk Cheese, Price: 2.99, Category: Food
Name: Pumpkin Loaf, Price: 4.99, Category: Food
Name: Pumpkin Butter, Price: 2.99, Category: Food
Name: Fresh Cranberries, Price: 2.29, Category: Food
Name: Cinnamon Sticks, Price: 2.99, Category: Food
Name: Fully Cooked Spiral Sliced Uncured Half Ham, Price: 5.99, Category: Food
Name: Non-Dairy Cinnamon Bun Oat Creamer, Price: 1.99, Category: Food
Name: Organic Pomegranate, Price: 2.49, Category: Food
Name: Organic Cranberries, Price: 2.99, Category: Food
Name: Thanksgiving Stuffing Seasoned Popcorn, Price: 2.99, Category: Food
Name: Organic Pumpkin, Price: 2.49, Category: Food
Name: Apple Overnight Oats, Price: 1.99, Category: Food
Name: Mashed Sweet Potatoes, Price: 2.99, Category: Food
Name: Halloween Gummies, Price: 4.49, Category: Food
Name: Teeny Tiny Apple Pies, Price: 4.99, Category: Food
Name: Lil' Tiger Stripe Pumpkin, Price: 1.49, Category: Food
Name: Petite Pumpkin Spice Cookies, Price: 3.99, Category: Food
Name: Pumpkin Joe-Joe's Cookies, Price: 2.99, Category: Food
Name: Double Fold Bourbon Vanilla Extract, Price: 7.99, Category: Food
Name: Butternut Squash Italian Lasagna, Price: 4.49, Category: Food
Name: Apple Cinnamon Buns, Price: 4.49, Category: Food
Name: Cut Butternut Squash, Price: 3.99, Category: Food
Name: Pumpkin Pie Spice, Price: 2.99, Category: Food
Name: Mini Maple Flavored Marshmallows, Price: 2.99, Category: Food
Name: Maple Flavored Fudge, Price: 2.99, Category: Food
Name: Double Crème Brie with Truffles, Price: 9.99, Category: Food
Name: Truffle Dip, Price: 5.49, Category: Food
Name: Pumpkin Pie, Price: 6.99, Category: Food
Name: Haricots Verts, Price: 5.99, Category: Food
Name: Herbes de Provence, Price: 4.99, Category: Food
Name: Creamy Toscano Cheese Dusted with Cinnamon, Price: 10.99, Category: Food
Name: Caramel Apple Mochi, Price: 4.99, Category: Food
Name: Cornbread Stuffing, Price: 5.99, Category: Food
Name: Pumpkin Spiced Joe-Joe's Sandwich Cookies, Price: 4.49, Category: Food
Name: Salted Maple Ice Cream, Price: 3.79, Category: Food
Name: Roasted Turkey & Sweet Potato Burrito, Price: 4.49, Category: Food
Name: Cinnamon Roll Blondie Bar Baking Mix, Price: 3.99, Category: Food
Name: Pumpkin Cheesecake Croissants, Price: 4.49, Category: Food
Name: Apple Cider Donuts, Price: 4.49, Category: Food
Name: Pumpkin Overnight Oats, Price: 1.99, Category: Food
Name: Fuyu Persimmons, Price: 0.79, Category: Food
Name: Cut Butternut Squash, Price: 3.99, Category: Food
Name: Brussels Sprouts, Price: 4.99, Category: Food
Name: Thanksgiving Stuffing Seasoned Kettle Chips, Price: 2.99, Category: Food
Name: Nuts About Rosemary Mix, Price: 7.99, Category: Food
Name: Brined Bone-In Half Turkey Breast Fully Cooked, Price: 9.99, Category: Food
Name: Harvest Apple Salad Kit, Price: 3.99, Category: Food
Name: Cornbread Stuffing Mix, Price: 4.99, Category: Food
Name: Condensed Cream of Portabella Mushroom Soup, Price: 1.99, Category: Food
Name: All Butter Puff Pastry Sheets, Price: 4.99, Category: Food
Name: Turkey Gravy, Price: 1.69, Category: Food
Name: Nantucket Style Cranberry Pie, Price: 6.99, Category: Food
Name: White Stilton with Cranberries, Price: 11.99, Category: Food
Name: Truffle Salami, Price: 4.99, Category: Food
Name: Autumn Maple Coffee, Price: 8.99, Category: Beverage
Name: Triple Ginger Brew Sparkling Beverage, Price: 3.99, Category: Beverage
Name: Harvest Blend Herbal Tea, Price: 2.49, Category: Beverage
Name: Non-Dairy Oat Beverage Maple Flavor, Price: 2.99, Category: Beverage
Name: Maple Espresso Black Tea Blend, Price: 2.99, Category: Beverage
Name: Non-Dairy Pumpkin Oat Beverage, Price: 2.99, Category: Beverage
Name: Mum Fleurettes, Price: 4.99, Category: Flowers&Plants
Name: Assorted Mum Plants, Price: 6.99, Category: Flowers&Plants
Name: Eight Candles, Price: 4.49, Category: EverythingElse
Name: Orange & Spice Scented Candle & Room Spritz, Price: 5.99, Category: EverythingElse
Name: Harvest Brunch Dog Treats, Price: 3.49, Category: EverythingElse
Name: Cinnamon Whisk, Price: 1.29, Category: EverythingElse
Name: Cinnamon Broom, Price: 4.99, Category: EverythingElse
Name: Pumpkin Maple Bacon Stuffies Dog Treats, Price: 4.49, Category: EverythingElse
[{'name': 'Teeny Tiny Pecan Pies', 'price': 4.99, 'category': 'Food'}, {'name': 'Hickory Road Smokehouse Uncured Carver Ham', 'price': 5.99, 'category': 'Food'}, {'name': "Frick's Uncured Carver Ham", 'price': 5.99, 'category': 'Food'}, {'name': 'Organic Fuyu Persimmons', 'price': 3.99, 'category': 'Food'}, {'name': 'Cut Sweet Potatoes', 'price': 5.49, 'category': 'Food'}, {'name': 'Jumbo Pomegranate', 'price': 2.99, 'category': 'Food'}, {'name': 'Cheesy Herby Biscuits', 'price': 4.99, 'category': 'Food'}, {'name': 'Egg Nog Whole Milk Greek Yogurt', 'price': 0.99, 'category': 'Food'}, {'name': 'Triple Creme Brie with Calvados Apple Brandy', 'price': 12.99, 'category': 'Food'}, {'name': 'Chocolatey Caramel Pretzel Drumstick Decorating Kit', 'price': 4.99, 'category': 'Food'}, {'name': 'Hot Chocolate Stirring Spoon', 'price': 0.99, 'category': 'Food'}, {'name': 'Gluten Free Turkey Gravy', 'price': 3.99, 'category': 'Food'}, {'name': 'Dinner Rolls', 'price': 2.99, 'category': 'Food'}, {'name': 'Organic Peeled & Cooked Chestnuts', 'price': 4.99, 'category': 'Food'}, {'name': 'Butter with Brown Sugar & Maple Syrup', 'price': 2.99, 'category': 'Food'}, {'name': 'Double Fold Alcohol Free Bourbon Vanilla Flavoring', 'price': 7.99, 'category': 'Food'}, {'name': 'Delicata Squash', 'price': 1.49, 'category': 'Food'}, {'name': 'Spaghetti Squash', 'price': 2.99, 'category': 'Food'}, {'name': 'Sugar Bee® Apple', 'price': 1.29, 'category': 'Food'}, {'name': 'Savory Squash Pastry Bites', 'price': 5.49, 'category': 'Food'}, {'name': 'All Butter Apple Shortbread Cookies', 'price': 3.49, 'category': 'Food'}, {'name': "Caramelized Onion Goat's Milk Cheese", 'price': 2.99, 'category': 'Food'}, {'name': 'Pumpkin Loaf', 'price': 4.99, 'category': 'Food'}, {'name': 'Pumpkin Butter', 'price': 2.99, 'category': 'Food'}, {'name': 'Fresh Cranberries', 'price': 2.29, 'category': 'Food'}, {'name': 'Cinnamon Sticks', 'price': 2.99, 'category': 'Food'}, {'name': 'Fully Cooked Spiral Sliced Uncured Half Ham', 'price': 5.99, 'category': 'Food'}, {'name': 'Non-Dairy Cinnamon Bun Oat Creamer', 'price': 1.99, 'category': 'Food'}, {'name': 'Organic Pomegranate', 'price': 2.49, 'category': 'Food'}, {'name': 'Organic Cranberries', 'price': 2.99, 'category': 'Food'}, {'name': 'Thanksgiving Stuffing Seasoned Popcorn', 'price': 2.99, 'category': 'Food'}, {'name': 'Organic Pumpkin', 'price': 2.49, 'category': 'Food'}, {'name': 'Apple Overnight Oats', 'price': 1.99, 'category': 'Food'}, {'name': 'Mashed Sweet Potatoes', 'price': 2.99, 'category': 'Food'}, {'name': 'Halloween Gummies', 'price': 4.49, 'category': 'Food'}, {'name': 'Teeny Tiny Apple Pies', 'price': 4.99, 'category': 'Food'}, {'name': "Lil' Tiger Stripe Pumpkin", 'price': 1.49, 'category': 'Food'}, {'name': 'Petite Pumpkin Spice Cookies', 'price': 3.99, 'category': 'Food'}, {'name': "Pumpkin Joe-Joe's Cookies", 'price': 2.99, 'category': 'Food'}, {'name': 'Double Fold Bourbon Vanilla Extract', 'price': 7.99, 'category': 'Food'}, {'name': 'Butternut Squash Italian Lasagna', 'price': 4.49, 'category': 'Food'}, {'name': 'Apple Cinnamon Buns', 'price': 4.49, 'category': 'Food'}, {'name': 'Cut Butternut Squash', 'price': 3.99, 'category': 'Food'}, {'name': 'Pumpkin Pie Spice', 'price': 2.99, 'category': 'Food'}, {'name': 'Mini Maple Flavored Marshmallows', 'price': 2.99, 'category': 'Food'}, {'name': 'Maple Flavored Fudge', 'price': 2.99, 'category': 'Food'}, {'name': 'Double Crème Brie with Truffles', 'price': 9.99, 'category': 'Food'}, {'name': 'Truffle Dip', 'price': 5.49, 'category': 'Food'}, {'name': 'Pumpkin Pie', 'price': 6.99, 'category': 'Food'}, {'name': 'Haricots Verts', 'price': 5.99, 'category': 'Food'}, {'name': 'Herbes de Provence', 'price': 4.99, 'category': 'Food'}, {'name': 'Creamy Toscano Cheese Dusted with Cinnamon', 'price': 10.99, 'category': 'Food'}, {'name': 'Caramel Apple Mochi', 'price': 4.99, 'category': 'Food'}, {'name': 'Cornbread Stuffing', 'price': 5.99, 'category': 'Food'}, {'name': "Pumpkin Spiced Joe-Joe's Sandwich Cookies", 'price': 4.49, 'category': 'Food'}, {'name': 'Salted Maple Ice Cream', 'price': 3.79, 'category': 'Food'}, {'name': 'Roasted Turkey & Sweet Potato Burrito', 'price': 4.49, 'category': 'Food'}, {'name': 'Cinnamon Roll Blondie Bar Baking Mix', 'price': 3.99, 'category': 'Food'}, {'name': 'Pumpkin Cheesecake Croissants', 'price': 4.49, 'category': 'Food'}, {'name': 'Apple Cider Donuts', 'price': 4.49, 'category': 'Food'}, {'name': 'Pumpkin Overnight Oats', 'price': 1.99, 'category': 'Food'}, {'name': 'Fuyu Persimmons', 'price': 0.79, 'category': 'Food'}, {'name': 'Cut Butternut Squash', 'price': 3.99, 'category': 'Food'}, {'name': 'Brussels Sprouts', 'price': 4.99, 'category': 'Food'}, {'name': 'Thanksgiving Stuffing Seasoned Kettle Chips', 'price': 2.99, 'category': 'Food'}, {'name': 'Nuts About Rosemary Mix', 'price': 7.99, 'category': 'Food'}, {'name': 'Brined Bone-In Half Turkey Breast Fully Cooked', 'price': 9.99, 'category': 'Food'}, {'name': 'Harvest Apple Salad Kit', 'price': 3.99, 'category': 'Food'}, {'name': 'Cornbread Stuffing Mix', 'price': 4.99, 'category': 'Food'}, {'name': 'Condensed Cream of Portabella Mushroom Soup', 'price': 1.99, 'category': 'Food'}, {'name': 'All Butter Puff Pastry Sheets', 'price': 4.99, 'category': 'Food'}, {'name': 'Turkey Gravy', 'price': 1.69, 'category': 'Food'}, {'name': 'Nantucket Style Cranberry Pie', 'price': 6.99, 'category': 'Food'}, {'name': 'White Stilton with Cranberries', 'price': 11.99, 'category': 'Food'}, {'name': 'Truffle Salami', 'price': 4.99, 'category': 'Food'}, {'name': 'Autumn Maple Coffee', 'price': 8.99, 'category': 'Beverage'}, {'name': 'Triple Ginger Brew Sparkling Beverage', 'price': 3.99, 'category': 'Beverage'}, {'name': 'Harvest Blend Herbal Tea', 'price': 2.49, 'category': 'Beverage'}, {'name': 'Non-Dairy Oat Beverage Maple Flavor', 'price': 2.99, 'category': 'Beverage'}, {'name': 'Maple Espresso Black Tea Blend', 'price': 2.99, 'category': 'Beverage'}, {'name': 'Non-Dairy Pumpkin Oat Beverage', 'price': 2.99, 'category': 'Beverage'}, {'name': 'Mum Fleurettes', 'price': 4.99, 'category': 'Flowers&Plants'}, {'name': 'Assorted Mum Plants', 'price': 6.99, 'category': 'Flowers&Plants'}, {'name': 'Eight Candles', 'price': 4.49, 'category': 'EverythingElse'}, {'name': 'Orange & Spice Scented Candle & Room Spritz', 'price': 5.99, 'category': 'EverythingElse'}, {'name': 'Harvest Brunch Dog Treats', 'price': 3.49, 'category': 'EverythingElse'}, {'name': 'Cinnamon Whisk', 'price': 1.29, 'category': 'EverythingElse'}, {'name': 'Cinnamon Broom', 'price': 4.99, 'category': 'EverythingElse'}, {'name': 'Pumpkin Maple Bacon Stuffies Dog Treats', 'price': 4.49, 'category': 'EverythingElse'}]
To keep track of the items, we can go ahead and quickly count them:
89
As of the date this was scrapped we had 89 products.
Now, let’s go ahead and save our products into a .txt file so we can use it later in our tutorial when we are using our LlamaIndex and Atlas Vector Search integration. Go ahead and name the file whatever you like, for sake of tracking I’m naming mine: tj_fall_faves_oct30.txt.
Since we are using a notebook, please make sure that you download the file locally, since once our runtime is disconnected the .txt file will be lost.
Now that we have all our Trader Joe’s fall products let’s go ahead and build out our AI Party Planner!
Part 2: LlamaIndex and Atlas Vector Search Integration
This is the quickstart we are using in order to be successful with this part of the tutorial: https://www.mongodb.com/docs/atlas/atlas-vector-search/ai-integrations/llamaindex/#:~:text=You%20can%20integrate%20Atlas%20Vector,RAG). We will be going over how to use Atlas Vector Search with LlamaIndex to build a RAG application with chat capabilities!
This section will cover in detail how to set up the environment, store our custom data that we previously scraped on Atlas, create an Atlas Vector Search index on top of our data, and to finish up we will implement RAG and will use Atlas Vector Search to answer questions from our unique data store.
Let’s first use pip to install all our necessary libraries. We will need to include llama-index, llama-index-vector-stores-mongodb, and llama-index-embeddings-openai.
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.4/1.4 MB 20.4 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 313.6/313.6 kB 20.4 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.6/1.6 MB 53.7 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.2/1.2 MB 49.7 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.5/1.5 MB 55.8 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 386.9/386.9 kB 26.8 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 76.4/76.4 kB 6.0 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 78.0/78.0 kB 6.5 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 325.2/325.2 kB 21.2 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 176.8/176.8 kB 13.5 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 295.8/295.8 kB 18.3 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.2/1.2 MB 49.3 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 49.5/49.5 kB 3.9 MB/s eta 0:00:00 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 58.3/58.3 kB 4.6 MB/s eta 0:00:00
Now go ahead and import in your necessary import statements:
Input your OpenAI API Key and your MongoDB Atlas cluster connection string when prompted:
OpenAI API Key:·········· MongoDB Atlas SRV Connection String:··········
Once your keys are in, let’s go ahead and assign our specific models for llama_index so it knows how to properly embed our file. This is just to keep everything consistent!
Now we can go ahead and read in our .txt file with our scraped products. We are doing this using the SimpleDirectoryReader from llama_index. Text files aren’t the only files that can be nicely loaded into LlamaIndex. There are a ton of other supported methods and I recommend checking out some of their supported file types.
So here we are just reading the contents of our file and then returning it as a list of documents; the format LlamaIndex requires.
Document(id_='757e7f69-9104-48f4-8b0b-49a4d77e7350', embedding=None, metadata={'file_path': '/content/tj_fall_faves_oct30.txt', 'file_name': 'tj_fall_faves_oct30.txt', 'file_type': 'text/plain', 'file_size': 5672, 'creation_date': '2024-10-30', 'last_modified_date': '2024-10-30'}, excluded_embed_metadata_keys=['file_name', 'file_type', 'file_size', 'creation_date', 'last_modified_date', 'last_accessed_date'], excluded_llm_metadata_keys=['file_name', 'file_type', 'file_size', 'creation_date', 'last_modified_date', 'last_accessed_date'], relationships={}, text="Name: Teeny Tiny Pecan Pies, Price: $4.99, Category: Food\nName: Hickory Road Smokehouse Uncured Carver Ham, Price: $5.99, Category: Food\nName: Frick's Uncured Carver Ham, Price: $5.99, Category: Food\nName: Organic Fuyu Persimmons, Price: $3.99, Category: Food\nName: Cut Sweet Potatoes, Price: $5.49, Category: Food\nName: Jumbo Pomegranate, Price: $2.99, Category: Food\nName: Cheesy Herby Biscuits, Price: $4.99, Category: Food\nName: Egg Nog Whole Milk Greek Yogurt, Price: $0.99, Category: Food\nName: Triple Creme Brie with Calvados Apple Brandy, Price: $12.99, Category: Food\nName: Chocolatey Caramel Pretzel Drumstick Decorating Kit, Price: $4.99, Category: Food\nName: Hot Chocolate Stirring Spoon, Price: $0.99, Category: Food\nName: Gluten Free Turkey Gravy, Price: $3.99, Category: Food\nName: Dinner Rolls, Price: $2.99, Category: Food\nName: Organic Peeled & Cooked Chestnuts, Price: $4.99, Category: Food\nName: Butter with Brown Sugar & Maple Syrup, Price: $2.99, Category: Food\nName: Double Fold Alcohol Free Bourbon Vanilla Flavoring, Price: $7.99, Category: Food\nName: Delicata Squash, Price: $1.49, Category: Food\nName: Spaghetti Squash, Price: $2.99, Category: Food\nName: Sugar Bee® Apple, Price: $1.29, Category: Food\nName: Savory Squash Pastry Bites, Price: $5.49, Category: Food\nName: All Butter Apple Shortbread Cookies, Price: $3.49, Category: Food\nName: Caramelized Onion Goat's Milk Cheese, Price: $2.99, Category: Food\nName: Pumpkin Loaf, Price: $4.99, Category: Food\nName: Pumpkin Butter, Price: $2.99, Category: Food\nName: Fresh Cranberries, Price: $2.29, Category: Food\nName: Cinnamon Sticks, Price: $2.99, Category: Food\nName: Fully Cooked Spiral Sliced Uncured Half Ham, Price: $5.99, Category: Food\nName: Non-Dairy Cinnamon Bun Oat Creamer, Price: $1.99, Category: Food\nName: Organic Pomegranate, Price: $2.49, Category: Food\nName: Organic Cranberries, Price: $2.99, Category: Food\nName: Thanksgiving Stuffing Seasoned Popcorn, Price: $2.99, Category: Food\nName: Organic Pumpkin, Price: $2.49, Category: Food\nName: Apple Overnight Oats, Price: $1.99, Category: Food\nName: Mashed Sweet Potatoes, Price: $2.99, Category: Food\nName: Halloween Gummies, Price: $4.49, Category: Food\nName: Teeny Tiny Apple Pies, Price: $4.99, Category: Food\nName: Lil' Tiger Stripe Pumpkin, Price: $1.49, Category: Food\nName: Petite Pumpkin Spice Cookies, Price: $3.99, Category: Food\nName: Pumpkin Joe-Joe's Cookies, Price: $2.99, Category: Food\nName: Double Fold Bourbon Vanilla Extract, Price: $7.99, Category: Food\nName: Butternut Squash Italian Lasagna, Price: $4.49, Category: Food\nName: Apple Cinnamon Buns, Price: $4.49, Category: Food\nName: Cut Butternut Squash, Price: $3.99, Category: Food\nName: Pumpkin Pie Spice, Price: $2.99, Category: Food\nName: Mini Maple Flavored Marshmallows, Price: $2.99, Category: Food\nName: Maple Flavored Fudge, Price: $2.99, Category: Food\nName: Double Crème Brie with Truffles, Price: $9.99, Category: Food\nName: Truffle Dip, Price: $5.49, Category: Food\nName: Pumpkin Pie, Price: $6.99, Category: Food\nName: Haricots Verts, Price: $5.99, Category: Food\nName: Herbes de Provence, Price: $4.99, Category: Food\nName: Creamy Toscano Cheese Dusted with Cinnamon, Price: $10.99, Category: Food\nName: Caramel Apple Mochi, Price: $4.99, Category: Food\nName: Cornbread Stuffing, Price: $5.99, Category: Food\nName: Pumpkin Spiced Joe-Joe's Sandwich Cookies, Price: $4.49, Category: Food\nName: Salted Maple Ice Cream, Price: $3.79, Category: Food\nName: Roasted Turkey & Sweet Potato Burrito, Price: $4.49, Category: Food\nName: Cinnamon Roll Blondie Bar Baking Mix, Price: $3.99, Category: Food\nName: Pumpkin Cheesecake Croissants, Price: $4.49, Category: Food\nName: Apple Cider Donuts, Price: $4.49, Category: Food\nName: Pumpkin Overnight Oats, Price: $1.99, Category: Food\nName: Fuyu Persimmons, Price: $0.79, Category: Food\nName: Cut Butternut Squash, Price: $3.99, Category: Food\nName: Brussels Sprouts, Price: $4.99, Category: Food\nName: Thanksgiving Stuffing Seasoned Kettle Chips, Price: $2.99, Category: Food\nName: Nuts About Rosemary Mix, Price: $7.99, Category: Food\nName: Brined Bone-In Half Turkey Breast Fully Cooked, Price: $9.99, Category: Food\nName: Harvest Apple Salad Kit, Price: $3.99, Category: Food\nName: Cornbread Stuffing Mix, Price: $4.99, Category: Food\nName: Condensed Cream of Portabella Mushroom Soup, Price: $1.99, Category: Food\nName: All Butter Puff Pastry Sheets, Price: $4.99, Category: Food\nName: Turkey Gravy, Price: $1.69, Category: Food\nName: Nantucket Style Cranberry Pie, Price: $6.99, Category: Food\nName: White Stilton with Cranberries, Price: $11.99, Category: Food\nName: Truffle Salami, Price: $4.99, Category: Food\nName: Autumn Maple Coffee, Price: $8.99, Category: Beverage\nName: Triple Ginger Brew Sparkling Beverage, Price: $3.99, Category: Beverage\nName: Harvest Blend Herbal Tea, Price: $2.49, Category: Beverage\nName: Non-Dairy Oat Beverage Maple Flavor, Price: $2.99, Category: Beverage\nName: Maple Espresso Black Tea Blend, Price: $2.99, Category: Beverage\nName: Non-Dairy Pumpkin Oat Beverage, Price: $2.99, Category: Beverage\nName: Mum Fleurettes, Price: $4.99, Category: Flowers&Plants\nName: Assorted Mum Plants, Price: $6.99, Category: Flowers&Plants\nName: Eight Candles, Price: $4.49, Category: EverythingElse\nName: Orange & Spice Scented Candle & Room Spritz, Price: $5.99, Category: EverythingElse\nName: Harvest Brunch Dog Treats, Price: $3.49, Category: EverythingElse\nName: Cinnamon Whisk, Price: $1.29, Category: EverythingElse\nName: Cinnamon Broom, Price: $4.99, Category: EverythingElse\nName: Pumpkin Maple Bacon Stuffies Dog Treats, Price: $4.49, Category: EverythingElse\n", mimetype='text/plain', start_char_idx=None, end_char_idx=None, text_template='{metadata_str}\n\n{content}', metadata_template='{key}: {value}', metadata_seperator='\n') Now that our file has been read in, let’s connect to our MongoDB Atlas cluster and set up a vector store! Feel free to name the database and collection anything you like. We are initializing a vector store using MongoAtlasVectorSearch from llama_index which will allow us to work with our embedded documents directly in our cluster.
Since our vector store has been defined (by our vector_store_context) let’s go ahead and create a vector index in MongoDB for our documents in sample_data.
Parsing nodes: 0%| | 0/1 [00:00<?, ?it/s]
Generating embeddings: 0%| | 0/2 [00:00<?, ?it/s]
Once this cell has run you can go ahead and view your data with the embeddings inside of your Atlas cluster.
In order to allow for vector search queries on our created vector store, we need to create an Atlas Vector Search index on our tj_products.fall_faves collection. We can do this either through the Atlas UI or directly from our notebook:
'vector_index'
You’ll be able to see this index once it’s up and running under your “Atlas Search” tab in your Atlas UI. Once it’s done, we can start querying our data and we can do some basic RAG.
Part 3: Basic RAG
With our Atlas Vector Search index up and running we are ready to have some fun and bring our AI Party Planner to life! We are going to continue with this dream team where we will use Atlas Vector Search to get our documents and LlamaIndex’s query engine to actually answer our questions based on our documents.
To do this, we will need to have Atlas Vector Search become a vector index retriever and we will need to initialize a RetrieverQueryEngine to handle queries by passing each question through our vector retrieval system. This combination will allow us to ask any questions we want in natural language, and it will match us with the most accurate documents.
Mum Fleurettes are available for $4.99 and Assorted Mum Plants are available for $6.99.
But what if we want to keep asking questions and get responses with memory? Let’s quickly build a Chat Engine.
Part 4: Chat engine
Instead of having to ask one question at a time about our Trader Joe’s products for our party, we can go ahead and incorporate a back-and-forth conversation to get the most out of our AI Party Planner.
We first need to initialize the chat engine from our vector_store_index and enable a streaming response. Condense question mode is also used to ensure that the engine shortens their questions or rephrases them to make the most sense when used in a back and forth conversation. Streaming is enabled as well so we can see the response:
Ask away! Type 'exit' to quit >>> hi! i am planning a fall party Consider including a variety of fall-themed food and beverages such as pumpkin pie, apple cider donuts, maple-flavored fudge, pumpkin spiced cookies, and harvest blend herbal tea to create a festive atmosphere for your fall party. Additionally, you could incorporate seasonal decorations like cinnamon brooms, scented candles, and mum plants to enhance the autumn ambiance. Ask away! Type 'exit' to quit >>> i want to make a turkey, which three sides with prices and reasonings will be best The best three side dishes to serve with turkey at a fall party would be Cut Butternut Squash, Brussels Sprouts, and Cornbread Stuffing. Cut Butternut Squash and Brussels Sprouts are reasonably priced at $3.99 and $4.99 respectively, offering a balance of flavors and textures that complement the turkey well. Cornbread Stuffing, priced at $5.99, adds a traditional touch to the meal and enhances the overall fall-themed dining experience. Ask away! Type 'exit' to quit >>> which drinks should i serve? i want something caffinated Harvest Blend Herbal Tea and Autumn Maple Coffee would be ideal caffeinated drinks to serve at a fall party to complement the autumn-themed food and create a festive atmosphere. Ask away! Type 'exit' to quit >>> what are the prices of these drinks $2.49 for Harvest Blend Herbal Tea and $8.99 for Autumn Maple Coffee. Ask away! Type 'exit' to quit >>> which decor should i use? i want my home to smell nice Cinnamon Whisk, Cinnamon Broom, Orange & Spice Scented Candle & Room Spritz Ask away! Type 'exit' to quit >>> what are the prices? $5.99, $1.29, $4.99 Ask away! Type 'exit' to quit >>> exit Exiting chat. Have a happy fall!