Learn

Platform

For Business

Pricing

Resources

Login

Start for Free

Login

Start for Free

Sign Up

/

/

Business Intelligence Careers

How Much Time Should I Spend Cleaning Data?

How Much Time Should I Spend Cleaning Data?

How Much Time Should I Spend Cleaning Data?

3 min read

Dec 8, 2022

Stacy Giroux

Cohort Learning Lead

Currently Reading

How Much Time Should I Spend Cleaning Data?

What a great question!

This one came up recently during one of the weekly virtual sessions we have with our bootcamp students. It is particularly insightful because there’s no easy answer and it’s one that almost every analyst will face at some point during their data career. The discussion we had as a group was really thoughtful and left me thinking a lot about cars and tires.

I’ve recently spent my weekends watching Formula 1 racing with my son and find the pit stops fascinating. I find it almost mesmerizing to watch the choreography that happens between the car & driver and their pit crew during each pit stop.

If you haven’t seen this, the entire process of changing all four tires can happen in under 2-seconds! In fact, the total time a car is stationary is measured down to one-hundredths of a second. So, even to someone new to watching F1 racing like me, it quickly becomes apparent that these pit stops are critically important to the strategy and success of the team.

In F1 racing, podium positions are decided by fractions of seconds and a clean pit stop really can make a difference. Teams invest significant resources – manpower, tools, and (I’d presume) practice – to make sure each pit stop is as clean as possible, exactly because it is an essential component to their overall success…and that’s why discussing cleaning data during bootcamp made me think about cars and tires!

When I’m faced with the question of how much time to spend cleaning the data, I always try to check in and ask:

What elements of cleaning this data are essential to our success?

How you answer this question will change from project-to-project, and can even change over time within a project.

For example: early in my data career when I worked in a lab setting, ensuring any source data was error-free was of critical importance.

The overall tolerance for errors in a lab needs to be low and, not unlike an F1 pit crew, significant resources are invested and sophisticated processes are developed to obtain the most efficient and effective process. Data is not only checked but often double- or triple-checked before going into analysis and reporting. Definitely an F1-level approach to data cleaning!

And this can apply to other areas, too. Think of any scenario where the stakes are high and/or the tolerance for risk or error is low; in these scenarios, if you uncover significant issues in the source data the next right step may be to advocate for taking the time to clean the data before moving into analysis and reporting. The investment of time, resources, and/or effort in these scenarios may be critical to support the outcomes the business is striving for.

In the example we discussed in our bootcamp, the situation was from a project data set where an error was detected in 3-rows of data out of a total of almost 200,000.

The group talked through whether these 3-rows would change any of the conclusions from subsequent analysis (unlikely), and then went one step further to talk about how much effort it was to make the correction (not much). Ultimately we decided there were many valid approaches to dealing with these particular errors but it was the conversation that was invaluable.

So, when faced with the question of how much time to spend cleaning data, know there is no “one-size-fits-all” answer. But, if you focus first on having a solid understanding of how the analysis will be used and the importance of outcomes it will drive, you’ll be in a much better position to gauge the “right fit” for your particular scenario.

Be passionate. Seek mastery. Learn with humility.

Stacy

BLACK FRIDAY CAME EARLY!

Save up to 50% on Maven Pro plans today!

This week, we're offering up major discounts on Maven Pro plans. Don't wait -- this offer ends soon!

Sign Up & Save

Sign Up & Save

Sign Up & Save

Share this article with your friends

Stacy Giroux

Cohort Learning Lead

Stacy is a former Cohort Learning Lead for Maven Analytics, helping to design, manage, and faciliate immersive bootcamp experiences for aspiring data professionals.

View profile

PREVIOUS

Introducing the Maven Portfolio Showdown

NEXT

Introducing: Maven Showcase

BLACK FRIDAY CAME EARLY!

Save up to 50% on Maven Pro plans today!

This week, we're offering up major discounts on Maven Pro plans. Don't wait -- this offer ends soon!

Sign Up & Save

Sign Up & Save

YOU MAY ALSO LIKE

Maven Analytics Hall of Fame 2025: Call for Nominations

How to Choose the Right Chart for Your Data

A Beginner’s Guide to Data-Driven Decision Making

Analysts You Can Hire: Summer 2025

RECAP: “Ask Me Anything” with Chris & John!

You May Also Like

Maven News
Maven Analytics Hall of Fame 2025: Call for Nominations
John Pauler
2 min read
Business Intelligence Skills
How to Choose the Right Chart for Your Data
Chris Dutton
5 min read
Business Intelligence Skills
A Beginner’s Guide to Data-Driven Decision Making
Chris Dutton
4 min read
Maven News
Analysts You Can Hire: Summer 2025
John Pauler
10 min read
Business Intelligence Careers
RECAP: “Ask Me Anything” with Chris & John!
Dakota Brown
16 min read
Business Intelligence Careers
RECAP: “Ask Me Anything” with John Pauler!
Dakota Brown
11 min read
Data Science
Why Everyone Should Learn a Little Data Science (Even If You Don't Want to Be a Data Scientist)
Chris Bruehl
4 min read
Data Science
What I’ve Learned From 12 Years in NLP
Alice Zhao
5 min read
Data Science
What’s Behind ChatGPT and Modern AI: A Crash Course on Large Language Models
Alice Zhao
5 min read
Team Training
Hidden Costs of Not Upskilling Your Team (And How to Fix It!)
Lauren Rosenthal
3 min read
Data Science
Beyond Automation: Best Practices for Trustworthy AI at Scale
Kristen Kehrer
9 min read
Maven News
Analysts You Can Hire: Spring 2025
John Pauler
10 min read
Business Intelligence Skills
You Built the Report, Now What? A Framework for Gathering Post-Deployment Feedback
Colin Tomb
8 min read
Business Intelligence Skills
How to Get Started as a Data Analyst in 2025
Lauren Rosenthal
1 min read
Business Intelligence Skills
How to Go From Beginner to Advanced in SQL
Matt Mike
6 min read
Business Intelligence Careers
Data Career Advice From Alex The Analyst
John Pauler
7 min read
Business Intelligence Careers
6 Steps to Become an Advanced Excel User
Matt Mike
5 min read
Business Intelligence Careers
Analysts You Can Hire: Winter 2025
John Pauler
11 min read
Business Intelligence Skills
Streamlining Stakeholder Requirements: A Smarter, Automated Approach
Colin Tomb
11 min read
Business Intelligence Careers
How to Use LinkedIn to Get Your Next Job
Ian Klosowicz
4 min read
Business Intelligence Careers
Top Secrets to Building a Standout Data Resume in 2025
Albert Bellamy
5 min read
Business Intelligence Careers
How to Build a Standout Analytics Portfolio Project
Kristen Kehrer
8 min read
Maven News
Hall of Fame Class of 2024
John Pauler
7 min read
Maven News
Welcome Andrew Hubbard! Maven Analytics Hall Of Fame, Class of 2024
John Pauler
2 min read

View all articles

FOR INDIVIDUALS

Master data & AI skills

Build data & AI skills to launch or accelerate your career (start for free, no credit card required).

Start Learning for Free

FOR COMPANIES & TEAMS

Transform your workforce

Assess your team's data & AI skills and follow personalized learning plans to close the gaps.

Explore Business Plans

FOR INDIVIDUALS

Master data & AI skills

Build data & AI skills to launch or accelerate your career (start for free, no credit card required).

Start Learning for Free

FOR COMPANIES & TEAMS

Transform your workforce

Assess your team's data & AI skills and follow personalized learning plans to close the gaps.

Explore Business Plans

FOR INDIVIDUALS

Master data & AI skills

Build data & AI skills to launch or accelerate your career (start for free, no credit card required).

Start Learning for Free

FOR COMPANIES & TEAMS

Transform your workforce

Assess your team's data & AI skills and follow personalized learning plans to close the gaps.

Explore Business Plans

Empowering everyday people with life-changing data skills.

© Maven Analytics, LLC | All Rights Reserved

Terms & Conditions

Community Forum Policy

Empowering everyday people with life-changing data skills.

© Maven Analytics, LLC | All Rights Reserved

Terms & Conditions

Community Forum Policy

Empowering everyday people with life-changing data skills.

© Maven Analytics, LLC | All Rights Reserved

Terms & Conditions

Community Forum Policy