The art of hiring data scientists
.jpg)
Originally posted on October 8th, 2015. Updated on May 30th, 2018. New update on December 11th, 2025.
The Data Scientist is one of the most sought after positions in tech â and much of the business world. But the demand is very quickly outstripping the supply.
McKinsey estimates that by 2018, the U.S. economy will have a shortage of 140,000 to 190,000 people with analytical expertise. And if youâre a startup or business thatâs looking to expand your data science operations, that might sound a little scary.
Data science is more important today than ever before. A company that doesnât properly utilize it may end up falling behind the competition. Whole industries are sprouting out of data science, including machine learning and artificial intelligence. Businesses that fail to see the importance of hiring the best data scientists are only doing themselves a disservice. Hiring a data scientist with the right expertise, refined skills, and big ideas to join your team is going to make a big difference in the long run. The big question is how to do it because the competition for data science talent is fierce.
How can you attract top talent in such a competitive market?
Weâre not saying itâs going to be easy. After all, a highly qualified data scientist candidate can become a valuable team member, so itâs no wonder that a company would seek hiring one with the right skills. But with these tips from Data Scientist at Insightly, Sara Vera, itâs certainly going to get a lot easier!
In her keynote at Extract SF 2014, Sara gave us her top three tips for hiring great Data Scientists. And now, sheâs sharing them with you! Watch the quick 10 minute video of her talk and then read on for a more in depth look at her insights.
How to hire data scientists when demand is higher than supply

Because the field of Data Science is so new, a lot of Data Scientists â like myself  â fall into it in a very nonlinear way.
I was originally in a PhD program at the University of Washington, but after I finished my masters degree I decided I didnât want to stay in academia. I wanted something a little more fast paced. When I started browsing job websites, I realized that my quantitative skill set was pretty valuable in the private sector. I decided to move to San Francisco because I liked how cutting edge the tech industry was and it felt like a great place to deepen my skill set.

When I got to San Francisco, the demand for Data Scientists was growing fast â way faster than the supply. As a job seeker itâs great. But once I got a job at Insightly and needed to hire more Data Scientists, it became super challenging. Iâm going to share with you what worked at Insightly and give you some tips that should help you hire some great people.

Define your needs
Every company has very different needs. It might depend on what industry the company is in or what challenges you anticipate will be waiting for you in the future. Because data science is such a broad field it really helps to first define what you need in your company.
Are you looking for a candidate who is going to be producing data analytics for machines or for humans?
The reason you want to do this is because those are pretty different skill sets, even though they are both done by Data Scientist. For example, if youâre looking to build ad-targeting or recommendation engines or do algorithmic training then you are going to want to look for a candidate with a really strong mathematics and computer science background. You donât want to bring on a data scientist to your team with a skill set that doesnât match the needs of your business. That isnât good for your company, and it isnât good for potential data science candidates. Hiring a data scientist with top-notch machine learning skills, for example, when your company doesnât need machine learning skills will not lead to more success.
On the other hand, if you are looking for a data scientist for your company who is going to be reporting to project managers and diving into how the product is doing or how your user growth is going; then you need to find a candidate whoâs really good at storytelling. That type of job requires a data science expert who is able to connect what could be pretty fuzzy data points into an overarching narrative of whatâs going on in your company. Candidates with these skills are more likely to come from a social science background because in the medical field, sociology, economics, or geography theyâre accustomed to doing this with their data.

Hire for talent, train for tech skills
The second thing to do is to hire for data science talent and train for tech skills. This is really important becauseâŠ
Analytical thinking and communication skills are harder to teach than SQL, Python and R.
I came from an academic background, and while I used data for statistical analysis, I had very little experience doing any kind of coding. Luckily the people who hired me for my first tech job gave me a take home test and I was able to Google all of the coding that I needed to write a beautiful report. Of course when I went in for the on site interview, I completely bombed the technical side of the interview. But they thought I was pretty smart and decided to hire me anyway. And I was able to pick up the SQL and Python through their training and mentorship.
The added benefit of training for tech skills, is that you create an environment where mentorship is really strong, which creates a more cohesive team. The best candidates will want to always be learning from each other anyway.

Broaden your pool of candidates
We have some really strong Data Science candidates here already in the tech world and also graduating from Berkeley and Stanford. But if you look just a little bit deeper than that, or a little bit broader than that, there are a lot of would be Data Scientists at State schools and places like Zipfian and other data boot camps that have popped up. In other words, when hiring for data scientist candidates, itâs important not to get tunnel vision. There are lots of great candidates out there who you might not have ever considered before.
Think outside the box when looking for candidates
The other benefit of this approach, is that if youâre looking in a broader pool of candidates, youâre also going to increase diversity in your company. And by diversity I also mean geography and age, not just gender and race.
Diversity in the workplace is really important because research has shown that it increases your revenue by promoting innovation and creative thinking. Diversity is a huge initiative right now at Google where 100% of managers and 25% of their employee base is going through unconscious bias training.
Hopefully these insights and tips will help you build a great Data Science team!
With machine learning, big data analytics, and artificial intelligence becoming such big components of every business. Following tips ensures the data scientist you hire will become a major contributor to the team. Donât pass up this opportunity to bring on the best data science talent!
About the author
Sara Vera is a Data Scientist at Insightly, a customer relationship management tool for a small business. Itâs a simple, yet powerful, CRM system for small businesses with integrations to Google Apps, Office 365, MailChimp, and major social media sites.
â
2025 UPDATE: Hiring Data Scientists in the Age of AI, Automation & LLMs
A lot has changed since this article was last updated in 2018, both in the world of data science and in how companies hire, train, and retain technical talent.
Back then, companies were dealing with a shortage of analytically skilled professionals. In 2025, the situation is more complex: some data science skills are oversupplied, while others, especially AI engineering, applied machine learning, and data governance remain critically scarce.
Below is a modernized view of how companies hire data scientists today, what has changed, and what hasnât.
1. The Data Science Role Has Evolved Dramatically
In 2015â2018, the conversation centered around:
- SQL, Python, R
- statistics & modeling
- analytics vs engineering
- storytelling vs algorithmic thinking

In 2025, the role is shaped by:
AI-native workflows
Data scientists now work alongside:
- Large Language Models (LLMs)
- synthetic data generation
- autonomous agents
- AutoML & automated feature engineering
- AI orchestration tools (LangChain, DSPy, semantic kernels)
Most traditional âdata wranglingâ is now assisted or performed by AI.
Data scientists are more specialized
Modern sub-roles include:
- AI Engineer / LLM Engineer
- ML Platform Engineer
- AI Product Scientist
- Generative AI Researcher
- Data Governance & Responsible AI Lead
- Analytics Translator / Insights PM
Generalists still exist but specialists are in higher demand.
2. The Talent Gap Still Exists But in New Areas
Original McKinsey projections warned of a huge shortage by 2018.
In 2025, shortages persist, but in different skill categories:
Undersupplied
- AI engineers
- LLM optimization experts
- reinforcement learning specialists
- synthetic data engineers
- data governance, privacy & compliance roles
- real-time data infrastructure engineers

Oversupplied
- junior analysts
- bootcamp-trained generalists
- basic SQL/Python candidates
Companies must now differentiate between commodity skills and strategic skills.
3. The Hiring Advice Still Applies But Needs an Update
Sara Veraâs tips remain shockingly relevant, but hereâs how they translate for 2025.
Tip 1: Define Your Needs, Now More Than Ever
In 2025, the key question isnât just analytics vs machine learning.
Itâs also:
- âDo we need a data scientist or an AI engineer?â
- âIs this a human or LLM-assisted analytics workflow?â
- âIs this role more experimentation or more production?â
- âDo we need modeling skills, or orchestration & evaluation skills?â
Examples:
- If you're building personalization, you need ML Ops + AI engineers.
- If youâre doing insights for leadership, you need analytics translators + genAI tooling.
- If you're building a data governance program, you need privacy, quality, lineage experts.
Clear scoping prevents mismatched hires and expensive mistakes.
Tip 2: Hire for Thinking, Train for Tools But Include AI Fluency
Saraâs original advice was timeless:
Analytical thinking and communication skills are harder to teach than SQL, Python or R.

This is still true but now we add:
AI fluency is mandatory.
Data scientists in 2025 must:
- prompt, refine, and evaluate LLMs
- understand hallucination risks
- know how to chain models together
- grasp embeddings, vector databases, and retrieval
- evaluate bias & fairness in AI outputs
These skills can be taught, but you should screen for curiosity, adaptability, and systems thinking not just âcan they code?â
Tools like Lovable, Base44, and Storm platforms
mean you are not hiring people to write boilerplate code, you're hiring them to reason about systems.
Tip 3: Broaden Your Pool - Global, Remote, AI-Assisted Talent
The talent pool is no longer Bay Area + Stanford/Berkeley.
In 2025, the strongest candidates often come from:
- global remote markets
- AI engineering bootcamps
- hybrid roles (PMs with AI fluency, engineers who learned statistics)
- research labs that shifted into industry
- open-source AI communities on GitHub & HuggingFace
Diversity is still essential but now includes:
- geographic diversity
- model transparency perspectives
- cross-disciplinary thinking
- ethical, legal, governance viewpoints
The best data science teams combine:
- technical rigor
- storytelling
- ethical foresight
- operational execution
No single âprofileâ dominates.
4. Mentorship Still Matters But Itâs Now AI-Enhanced
Teams that blend:
- senior ML engineers
- junior analysts
- AI automation tools
- autonomous agents
perform better than teams trying to replace people with AI entirely.
Mentorship now includes:
- teaching how to evaluate LLM outputs
- teaching when not to trust automation
- teaching how to build human-in-the-loop workflows
The strongest teams are human-led, AI-accelerated.
.jpg)
5. The New Hiring Funnel: 2025 Style
A common modern hiring pipeline includes:
1) Problem-solving case (AI-assisted allowed)
Candidates show how they use AI tools intelligently, not blindly.
2) Technical assessment (code + reasoning)
Less boilerplate, more architecture thinking.
3) Practical portfolio review
GitHub, Kaggle, HuggingFace, open-source contributions.
4) AI fluency check
Prompting, evaluation, system design.
5) Communication + storytelling
Still one of the hardest and most important skills.
Final Thoughts for 2025
Data science is no longer defined by coding ability alone, it's defined by:
- the ability to reason
- the ability to evaluate models
- the ability to integrate AI into workflows
- the ability to extract reliable insights
- the ability to understand data ethics & governance
- the ability to collaborate across hybrid humanâAI teams
The companies that win in 2025 hire curious thinkers, not just technicians.
They invest in training, mentor talent, and adopt AI-native tools that supercharge productivity.
And yes, the demand is still higher than supply, but the shape of the talent gap has changed.
If your company updates its hiring strategy to match the modern data landscape, you will not only find great talent, you will build a data team ready for the next decade.
With AI models requiring clean, structured external data more than ever, tools like the Import.io web data extraction platform play a foundational role in enabling modern data science teams to operate efficiently.
Originally posted on October 8th, 2015. Updated on May 30th, 2018. New update on December 11th, 2025.
The Data Scientist is one of the most sought after positions in tech â and much of the business world. But the demand is very quickly outstripping the supply.
McKinsey estimates that by 2018, the U.S. economy will have a shortage of 140,000 to 190,000 people with analytical expertise. And if youâre a startup or business thatâs looking to expand your data science operations, that might sound a little scary.
Data science is more important today than ever before. A company that doesnât properly utilize it may end up falling behind the competition. Whole industries are sprouting out of data science, including machine learning and artificial intelligence. Businesses that fail to see the importance of hiring the best data scientists are only doing themselves a disservice. Hiring a data scientist with the right expertise, refined skills, and big ideas to join your team is going to make a big difference in the long run. The big question is how to do it because the competition for data science talent is fierce.
How can you attract top talent in such a competitive market?
Weâre not saying itâs going to be easy. After all, a highly qualified data scientist candidate can become a valuable team member, so itâs no wonder that a company would seek hiring one with the right skills. But with these tips from Data Scientist at Insightly, Sara Vera, itâs certainly going to get a lot easier!
In her keynote at Extract SF 2014, Sara gave us her top three tips for hiring great Data Scientists. And now, sheâs sharing them with you! Watch the quick 10 minute video of her talk and then read on for a more in depth look at her insights.
How to hire data scientists when demand is higher than supply

Because the field of Data Science is so new, a lot of Data Scientists â like myself  â fall into it in a very nonlinear way.
I was originally in a PhD program at the University of Washington, but after I finished my masters degree I decided I didnât want to stay in academia. I wanted something a little more fast paced. When I started browsing job websites, I realized that my quantitative skill set was pretty valuable in the private sector. I decided to move to San Francisco because I liked how cutting edge the tech industry was and it felt like a great place to deepen my skill set.

When I got to San Francisco, the demand for Data Scientists was growing fast â way faster than the supply. As a job seeker itâs great. But once I got a job at Insightly and needed to hire more Data Scientists, it became super challenging. Iâm going to share with you what worked at Insightly and give you some tips that should help you hire some great people.

Define your needs
Every company has very different needs. It might depend on what industry the company is in or what challenges you anticipate will be waiting for you in the future. Because data science is such a broad field it really helps to first define what you need in your company.
Are you looking for a candidate who is going to be producing data analytics for machines or for humans?
The reason you want to do this is because those are pretty different skill sets, even though they are both done by Data Scientist. For example, if youâre looking to build ad-targeting or recommendation engines or do algorithmic training then you are going to want to look for a candidate with a really strong mathematics and computer science background. You donât want to bring on a data scientist to your team with a skill set that doesnât match the needs of your business. That isnât good for your company, and it isnât good for potential data science candidates. Hiring a data scientist with top-notch machine learning skills, for example, when your company doesnât need machine learning skills will not lead to more success.
On the other hand, if you are looking for a data scientist for your company who is going to be reporting to project managers and diving into how the product is doing or how your user growth is going; then you need to find a candidate whoâs really good at storytelling. That type of job requires a data science expert who is able to connect what could be pretty fuzzy data points into an overarching narrative of whatâs going on in your company. Candidates with these skills are more likely to come from a social science background because in the medical field, sociology, economics, or geography theyâre accustomed to doing this with their data.

Hire for talent, train for tech skills
The second thing to do is to hire for data science talent and train for tech skills. This is really important becauseâŠ
Analytical thinking and communication skills are harder to teach than SQL, Python and R.
I came from an academic background, and while I used data for statistical analysis, I had very little experience doing any kind of coding. Luckily the people who hired me for my first tech job gave me a take home test and I was able to Google all of the coding that I needed to write a beautiful report. Of course when I went in for the on site interview, I completely bombed the technical side of the interview. But they thought I was pretty smart and decided to hire me anyway. And I was able to pick up the SQL and Python through their training and mentorship.
The added benefit of training for tech skills, is that you create an environment where mentorship is really strong, which creates a more cohesive team. The best candidates will want to always be learning from each other anyway.

Broaden your pool of candidates
We have some really strong Data Science candidates here already in the tech world and also graduating from Berkeley and Stanford. But if you look just a little bit deeper than that, or a little bit broader than that, there are a lot of would be Data Scientists at State schools and places like Zipfian and other data boot camps that have popped up. In other words, when hiring for data scientist candidates, itâs important not to get tunnel vision. There are lots of great candidates out there who you might not have ever considered before.
Think outside the box when looking for candidates
The other benefit of this approach, is that if youâre looking in a broader pool of candidates, youâre also going to increase diversity in your company. And by diversity I also mean geography and age, not just gender and race.
Diversity in the workplace is really important because research has shown that it increases your revenue by promoting innovation and creative thinking. Diversity is a huge initiative right now at Google where 100% of managers and 25% of their employee base is going through unconscious bias training.
Hopefully these insights and tips will help you build a great Data Science team!
With machine learning, big data analytics, and artificial intelligence becoming such big components of every business. Following tips ensures the data scientist you hire will become a major contributor to the team. Donât pass up this opportunity to bring on the best data science talent!
About the author
Sara Vera is a Data Scientist at Insightly, a customer relationship management tool for a small business. Itâs a simple, yet powerful, CRM system for small businesses with integrations to Google Apps, Office 365, MailChimp, and major social media sites.
â
2025 UPDATE: Hiring Data Scientists in the Age of AI, Automation & LLMs
A lot has changed since this article was last updated in 2018, both in the world of data science and in how companies hire, train, and retain technical talent.
Back then, companies were dealing with a shortage of analytically skilled professionals. In 2025, the situation is more complex: some data science skills are oversupplied, while others, especially AI engineering, applied machine learning, and data governance remain critically scarce.
Below is a modernized view of how companies hire data scientists today, what has changed, and what hasnât.
1. The Data Science Role Has Evolved Dramatically
In 2015â2018, the conversation centered around:
- SQL, Python, R
- statistics & modeling
- analytics vs engineering
- storytelling vs algorithmic thinking

In 2025, the role is shaped by:
AI-native workflows
Data scientists now work alongside:
- Large Language Models (LLMs)
- synthetic data generation
- autonomous agents
- AutoML & automated feature engineering
- AI orchestration tools (LangChain, DSPy, semantic kernels)
Most traditional âdata wranglingâ is now assisted or performed by AI.
Data scientists are more specialized
Modern sub-roles include:
- AI Engineer / LLM Engineer
- ML Platform Engineer
- AI Product Scientist
- Generative AI Researcher
- Data Governance & Responsible AI Lead
- Analytics Translator / Insights PM
Generalists still exist but specialists are in higher demand.
2. The Talent Gap Still Exists But in New Areas
Original McKinsey projections warned of a huge shortage by 2018.
In 2025, shortages persist, but in different skill categories:
Undersupplied
- AI engineers
- LLM optimization experts
- reinforcement learning specialists
- synthetic data engineers
- data governance, privacy & compliance roles
- real-time data infrastructure engineers

Oversupplied
- junior analysts
- bootcamp-trained generalists
- basic SQL/Python candidates
Companies must now differentiate between commodity skills and strategic skills.
3. The Hiring Advice Still Applies But Needs an Update
Sara Veraâs tips remain shockingly relevant, but hereâs how they translate for 2025.
Tip 1: Define Your Needs, Now More Than Ever
In 2025, the key question isnât just analytics vs machine learning.
Itâs also:
- âDo we need a data scientist or an AI engineer?â
- âIs this a human or LLM-assisted analytics workflow?â
- âIs this role more experimentation or more production?â
- âDo we need modeling skills, or orchestration & evaluation skills?â
Examples:
- If you're building personalization, you need ML Ops + AI engineers.
- If youâre doing insights for leadership, you need analytics translators + genAI tooling.
- If you're building a data governance program, you need privacy, quality, lineage experts.
Clear scoping prevents mismatched hires and expensive mistakes.
Tip 2: Hire for Thinking, Train for Tools But Include AI Fluency
Saraâs original advice was timeless:
Analytical thinking and communication skills are harder to teach than SQL, Python or R.

This is still true but now we add:
AI fluency is mandatory.
Data scientists in 2025 must:
- prompt, refine, and evaluate LLMs
- understand hallucination risks
- know how to chain models together
- grasp embeddings, vector databases, and retrieval
- evaluate bias & fairness in AI outputs
These skills can be taught, but you should screen for curiosity, adaptability, and systems thinking not just âcan they code?â
Tools like Lovable, Base44, and Storm platforms
mean you are not hiring people to write boilerplate code, you're hiring them to reason about systems.
Tip 3: Broaden Your Pool - Global, Remote, AI-Assisted Talent
The talent pool is no longer Bay Area + Stanford/Berkeley.
In 2025, the strongest candidates often come from:
- global remote markets
- AI engineering bootcamps
- hybrid roles (PMs with AI fluency, engineers who learned statistics)
- research labs that shifted into industry
- open-source AI communities on GitHub & HuggingFace
Diversity is still essential but now includes:
- geographic diversity
- model transparency perspectives
- cross-disciplinary thinking
- ethical, legal, governance viewpoints
The best data science teams combine:
- technical rigor
- storytelling
- ethical foresight
- operational execution
No single âprofileâ dominates.
4. Mentorship Still Matters But Itâs Now AI-Enhanced
Teams that blend:
- senior ML engineers
- junior analysts
- AI automation tools
- autonomous agents
perform better than teams trying to replace people with AI entirely.
Mentorship now includes:
- teaching how to evaluate LLM outputs
- teaching when not to trust automation
- teaching how to build human-in-the-loop workflows
The strongest teams are human-led, AI-accelerated.
.jpg)
5. The New Hiring Funnel: 2025 Style
A common modern hiring pipeline includes:
1) Problem-solving case (AI-assisted allowed)
Candidates show how they use AI tools intelligently, not blindly.
2) Technical assessment (code + reasoning)
Less boilerplate, more architecture thinking.
3) Practical portfolio review
GitHub, Kaggle, HuggingFace, open-source contributions.
4) AI fluency check
Prompting, evaluation, system design.
5) Communication + storytelling
Still one of the hardest and most important skills.
Final Thoughts for 2025
Data science is no longer defined by coding ability alone, it's defined by:
- the ability to reason
- the ability to evaluate models
- the ability to integrate AI into workflows
- the ability to extract reliable insights
- the ability to understand data ethics & governance
- the ability to collaborate across hybrid humanâAI teams
The companies that win in 2025 hire curious thinkers, not just technicians.
They invest in training, mentor talent, and adopt AI-native tools that supercharge productivity.
And yes, the demand is still higher than supply, but the shape of the talent gap has changed.
If your company updates its hiring strategy to match the modern data landscape, you will not only find great talent, you will build a data team ready for the next decade.
With AI models requiring clean, structured external data more than ever, tools like the Import.io web data extraction platform play a foundational role in enabling modern data science teams to operate efficiently.