Resume Text Analysis: Find a Job on Glassdoor

Abstract

For this project, I scrape Glassdoor.ca, a California-based company that provides a database of job postings along with company and salary reviews, and interview tips. Developed in Python and deployed as a web application using R, this program allows a user to upload and match their resume to available job postings on Glassdoor. I use cosine similarity as my distance metric.

The code for this project can be found on my github.

Skills

This project demonstrates web scraping and text analysis skills in Python, and deployment in R with R -Shiny.

Packages/Libraries Used

Python: selenium, sklearn, numpy, pandas, csv, re, pickle, time, collections

R: PythonInR, shiny, dplyr, tidyr, pdftools.

Data

I used Selenium webdriver to obtain the first page of job postings for “Data Analyst” and “Data Scientist” in both Toronto and Vancouver.

Results

The web application, built with R – Shiny, ranks jobs that best match my resume, as shown below.

Concluding Remarks

Developed in Python and deployed as a web application using R, this mini project allows a user to upload and match their resume to available job postings on Glassdoor.ca. You can read more about my future plans with this project in this blog post.