For this project, I scrape Glassdoor.ca, a California-based company that provides a database of job postings along with company and salary reviews, and interview tips. Developed in Python and deployed as a web application using R, this program allows a user to upload and match their resume to available job postings on Glassdoor. I use cosine similarity as my distance metric.
The code for this project can be found on my github.
This project demonstrates web scraping and text analysis skills in Python, and deployment in R with R -Shiny.
Python: selenium, sklearn, numpy, pandas, csv, re, pickle, time, collections
R: PythonInR, shiny, dplyr, tidyr, pdftools.
I used Selenium webdriver to obtain the first page of job postings for “Data Analyst” and “Data Scientist” in both Toronto and Vancouver.
The web application, built with R – Shiny, ranks jobs that best match my resume, as shown below.
Developed in Python and deployed as a web application using R, this mini project allows a user to upload and match their resume to available job postings on Glassdoor.ca. You can read more about my future plans with this project in this blog post.