WDSS: A Map of Programming Project

A Map of Programming Project

  • WDSS
  • Coventry, UK (Remote)
  • Temporary

Take part in this Summer Research Project run by WDSS, building essential team-working skills and supercharging your CV with real-world experience. More details about the Summer Research Projects scheme can be found on our info website.

The final product will be published on WDSS's research blog.

Difficulty: ★ ★ ★ ☆ ☆

Project Overview

Create a map of programming languages where web developers, software engineers and data scientists live on different continents.

Objectives

  • Scrape data from StackOverflow or another similar platform on which language tags different users contribute to
  • Use dimensionality reduction (PCA, t-SNE, UMAP) to reduce this to a 2D "map" of programming languages with related languages close together
  • Try to turn this into an actual map using Voronoi tessellations (if you can make this look pretty, Reddit/HackerNews will eat it up)
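
As a rough illustration of the first two objectives, the sketch below turns user/tag records into one vector per language and compares languages by cosine similarity. The sample records, the function names and the choice of cosine similarity are assumptions made for illustration only; the project itself does not prescribe them.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical sample of (user_id, language_tag) contribution records.
# Real records would come from the scraping step; these rows are made up.
CONTRIBUTIONS = [
    ("u1", "javascript"), ("u1", "html"), ("u1", "css"),
    ("u2", "javascript"), ("u2", "html"),
    ("u3", "python"), ("u3", "r"),
    ("u4", "python"), ("u4", "r"), ("u4", "sql"),
    ("u5", "c++"), ("u5", "c"),
]


def language_vectors(records):
    """Map each language tag to a sparse {user_id: contribution_count} vector."""
    vectors = defaultdict(lambda: defaultdict(int))
    for user, tag in records:
        vectors[tag][user] += 1
    return {tag: dict(users) for tag, users in vectors.items()}


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    shared_users = a.keys() & b.keys()
    dot = sum(a[u] * b[u] for u in shared_users)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


vectors = language_vectors(CONTRIBUTIONS)

if __name__ == "__main__":
    for first, second in [("python", "r"), ("javascript", "html"), ("python", "javascript")]:
        score = cosine_similarity(vectors[first], vectors[second])
        print(f"{first} vs {second}: {score:.2f}")
```

Languages that share contributors score close to 1 and languages with no shared contributors score 0, which is the notion of "related" that a dimensionality reduction step would then try to preserve in 2D.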

Role Responsibilities

Data Engineer
  • Perform web scraping to collect data from StackOverflow
  • Perform data cleaning and feature engineering
  • Contribute to the relevant sections of the final write-up
Data Scientist
  • Experiment with different dimensionality reduction techniques to obtain the most satisfactory results
  • Contribute to the relevant sections of the final write-up
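
To give a feel for the Data Scientist role, here is a minimal sketch of one of the listed techniques, PCA, written in plain Python using power iteration with deflation. The count matrix, the function names and the iteration settings are hypothetical; in practice an established library such as scikit-learn would normally be used, and t-SNE or UMAP may well give a better-looking map.

```python
import random
from math import dist, sqrt

# Hypothetical language-by-user count matrix (rows: languages, columns: users).
LANGUAGES = ["javascript", "html", "python", "r", "c++"]
COUNTS = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 0, 1, 1, 0],
    [0, 0, 0, 0, 1],
]


def mat_vec(matrix, vector):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def top_eigenpair(matrix, iterations=500, seed=0):
    """Approximate the dominant eigenpair of a symmetric matrix by power iteration."""
    rng = random.Random(seed)
    v = [rng.uniform(-1.0, 1.0) for _ in matrix]
    norm = sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    for _ in range(iterations):
        w = mat_vec(matrix, v)
        norm = sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break
        v = [x / norm for x in w]
    # Rayleigh quotient of the unit vector v gives the eigenvalue estimate.
    value = sum(a * b for a, b in zip(v, mat_vec(matrix, v)))
    return value, v


def pca_2d(rows):
    """Project rows onto their first two principal components."""
    n, d = len(rows), len(rows[0])
    means = [sum(col) / n for col in zip(*rows)]
    centered = [[x - m for x, m in zip(row, means)] for row in rows]
    cov = [[sum(r[i] * r[j] for r in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    components = []
    for _ in range(2):
        value, vec = top_eigenpair(cov)
        components.append(vec)
        # Deflation: remove the component just found before searching for the next.
        cov = [[cov[i][j] - value * vec[i] * vec[j] for j in range(d)]
               for i in range(d)]
    return [[sum(x * c for x, c in zip(row, comp)) for comp in components]
            for row in centered]


coords = pca_2d(COUNTS)

if __name__ == "__main__":
    for name, (x, y) in zip(LANGUAGES, coords):
        print(f"{name:>10}: ({x:+.3f}, {y:+.3f})")
    print("python to r:", round(dist(coords[2], coords[3]), 3))
    print("python to javascript:", round(dist(coords[2], coords[0]), 3))
```

In this toy matrix, languages with identical contributor profiles land on the same point, while unrelated languages end up far apart. The resulting 2D coordinates are the kind of input a Voronoi tessellation could then turn into map regions.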

Candidate Attributes

  • A passion for the project and, preferably, interdisciplinary data science in general
  • Self-starting attitude with willingness to research and problem-solve independently
  • Ability to work well in a team and clearly communicate your ideas
  • Strong time-management and organisational skills
  • Basic Python/R skills are expected
  • Familiarity with other programming languages is a plus

How to Apply

  • Use the application button on this page to fill in the form. Attach your CV and any other sources you think are relevant for the role (e.g. GitHub profile for a coding role), and answer two further questions in which you should demonstrate your motivation and suitability for the project.
  • Results to be announced by the end of term 3 (Saturday 3 July 2021)
  • You can apply for more than one project

Job Overview

  • Location : Coventry, UK (Remote)
  • Start date : 05 July 2021
  • Application start : 15 May 2021
  • Application end : 25 June 2021

You can find the full application details in the job listing.

About WDSS

WDSS is a student-led society focused on spreading the word of interdisciplinary data science through talks, networking, teaching, and research.