With the popularity and demand for data scientists, and the well-documented shortage of skilled labor, more people are interested in data science as a career. Over time, I’ve gotten an increasingly large number of questions regarding how to start out as a data scientist. Like many other roles, landing the first job is typically the hardest, as having some experience under your belt is mandatory for many employers. This can create a vicious catch 22: how do you land your first job if they all require prior experience?
In this post, we’ll try to give you some advice — based on my own experience moving into data science several years back, and my current experience managing a data science department, interviewing dozens of candidates and reviewing hundreds of applications every year.
What’s your background?
People trying to start a career in data science can be split into three relatively distinct groups. It’s important to identify which of these you are most similar to, in order to figure out your best next steps.
The STEM career change — These are people with an advanced academic degree in a technical/scientific field who may already have several years’ work experience in an adjacent field. As the hype around data science has grown, they’ve started considering the option of transitioning. They typically have a strong mathematics and research background and can follow the linear algebra and statistics behind machine learning models. They have experience reading academic papers and aren’t intimidated by the formulas. Their transferable skills can help them become good data scientists relatively quickly.
The data science new grad — While it’s taken a few years, universities have started to address the industry demand and various faculties are now offering MSc programs in data science. Depending on the university, these might include the statistics, electrical engineering or industrial engineering departments. While these degrees can’t cover everything, they’re quickly becoming a gold standard for comprehensive data science training that a 3- or 6-month bootcamp can’t meet. A good program will also include a thesis (and publication/s), which gives the employer an opportunity to discuss your work in greater detail. Whenever interviewing new grads I deep dive into their thesis, making sure they understand alternative approaches, discuss why they made certain decisions and ascertain how they handle feedback. Due to the scope of a thesis, it’s usually a great way to evaluate how someone performs research and how well they really know their material, in a way that a Kaggle project they did a while back can’t achieve.
The optimist — This is someone who hasn’t gone through formal data science training nor do they have an extensive statistics/math background. They may have several years’ experience in data analytics within a specific vertical (finance, healthcare, etc) and want to complement their current skills to gradually move into a data science role. In the past, several people turned to me for consultation about their possibility to be a data scientist in fintech or some other specific vertical. While business acumen and experience in the vertical is important, this is the wrong mental mindset. The commonality between data science roles in various verticals is significant — the tools and algorithms solve generic mathematical problems, not vertical-specific ones. It’s easier to teach a good data scientist about a new domain than it is to train a business analyst with domain knowledge how to program, teach them statistics and machine learning. If you want to be a data scientist — you want to be just that, not a fintech data scientist.
If you’ve read this far, you probably know that there are a lot of online courses teaching everything data science related. While those courses are fundamental and deliver a ton of content, the vast majority try to give the most practical information as fast as possible. This typically means you’re going to learn a lot of machine learning models but only get the 30K foot explanation of how the algorithm actually works. Many courses won’t complicate matters with complex math so they can remain accessible to as big an audience as possible. While it’s definitely possible to train models and ‘do data science’ without understanding the intricacies of the algorithm, your capabilities will be limited.
How to break into data science
There are different ways to gain the minimal experience and knowledge to get your first data science position. When hiring for a junior position, the interviewer is going to look for a few things:
As a candidate, you need to remember that the company’s loss function is asymmetric — hiring a bad candidate can have a much worse outcome than turning down a good hire. This means that companies are going to be cautious about taking risks on someone lacking a track record. You need to help the hiring manager as much as possible to demonstrate that you’re a low-risk and high-potential hire. This also means that your chances may be relatively low and you need to be emotionally prepared for a lot of rejections before getting an offer.
There are 3 main ways to gain the theoretical knowledge and expertise necessary for your first role, and they can be combined in various methods:
Compared to other high income, high demand professions, you don’t have to spend several years in medical school or log a thousand flight hours before you’re allowed to practice data science. While the demand for data scientists is high, most of that demand is for very skilled individuals who can demonstrate their value. You need to keep in mind that despite the lack of regulatory barriers, market forces still exist and companies won’t pay top dollar for someone with limited experience. More so, new data scientists require a lot of attention, training and support from more experienced data scientists. As the first few months are almost all investment by the company, it could take a year until a new data scientist’s contribution is back to zero. Paradoxically, this problem is exacerbated by the lack of experienced data scientists — they are really needed working on problems now and can only spend a certain amount of time training new people.
It’s not an easy path but it’s definitely rewarding. The world needs more great data scientists, so get to it.
Lorem ipsum viverra feugiat. Pellen tesque libero ut justo, ultrices in ligula. Semper at. Lorem ipsum dolor sit amet elit. Non quae, fugiat nihil ad. Lorem ipsum dolor sit amet. Lorem ipsum init dolor sit, amet elit. Dolor ipsum non velit, culpa! elit ut et.
Lorem ipsum dolor sit amet elit. Velit beatae rem ullam dolore nisi esse quasi, sit amet. Lorem ipsum dolor sit amet elit.