IMDb Scrape
About
Extracted movie metadata to practice data extraction and cleaning for film databases.
Tech Stack
Discussion & Feedback
Have questions about this project? Built something similar? Share your thoughts!
(Requires GitHub account to comment)
Related Projects
Stardew Valley Wiki Chatbot GPT
Knows everything about Stardew Valley! Ask any Stardew Valley question and get instant answers - temporarily offline after OpenAI took it down. Reached 1K chats and 10 reviews (4.2 star rating) before going down. Built with Puppeteer and Cheerio for scraping the Stardew Valley Wiki, TypeScript for structuring data into markdown files, and sklearn for clustering content into 9 categories to speed up responses.
Hackathon - UoM StudentHack
I got to take part in a Hackathon with UniCSHackathons! It was an awesome experience and my team were amazing to work with - we created a site where you can choose between 3 versions of real news articles, aimed at primary, secondary and adult ages to learn about current space events! The articles were scraped using puppeteer/cheerio from spacenews.com then stored on a PostgreSQL database using a Django backend, before being sent to Google's Gemini LLM to create the new content. The new articles were displayed with a React and typescript frontend!
Scrape Shack Site
Website offering custom web scraping services, deployed to a k3 cluster. Upwork contracts from the same period: [Web Scraping eAIS UK](/projects/web-scraping-eais-uk/), [Web Scraping FAA.gov](/projects/web-scraping-faagov/), [PDF Mapping Project](/projects/pdf-mapping-project/).