Ensuring ROI on Predictive Analytics Projects

With the fast-paced growth in data professionals: Data scientists, analysts, engineers, and the lines between data roles being blurred, measuring and communicating the ROI of data teams is no easy feat. But given the large investment in this area, understanding this value presents an existential question for the data industry. Holistically, to understand the ROI of the data organization, we have to examine it through the lens of each function in the company.

Our guest today is Keith McCormick, an independent consultant, trainer, speaker and author. He has been designing and conducting analytics projects for over 25 years, and today he will be talking all about the ROI of predictive analytics projects.

You will want to hear this episode if you are interested in:

[00:12] Introduction to the guest speaker, Keith McCormick
[01:25] A fun fact or hobby about Keith
[03:28] Challenges and benefits of overly estimating the ROI for analytics projects
[09:57] A sneak preview of the confusion matrix
[10:40] Keith’s confidence in his ROI for analytics projects
[13:03] Stages in a company when Keith is brought in
[17:47] Costs to be considered from the start
[20:37] When are teams calculating the benefits of a project?
[22:13] How easy is it to bring data scientists, analytics leaders, and executives on the same page?
[25:47] Keith’s upcoming courses

Notable Quotes

The risk around the surprise as we get more into the conversation about the challenges and benefits of overestimating the ROI for analytics projects is so that you can just prioritize the projects available to you.
Maybe we put too much pressure on data scientists that they've always got to come up with some insight that's worth money every time they're looking at data, and we know that it doesn't happen that way.
There has to be a prediction going on because otherwise, machine learning isn't the right tool, and we all know that folks will haul out machine learning algorithms when they don't necessarily need them.
If you go through the discipline of the confusion matrix and work your way down it, and you just sit down with the appropriate team members, you'd be surprised how much of this you can do in an hour or less than an hour.
If you have all these different codes, now you've got something you can tackle with a confusion matrix, which means you can think through the problem. Everybody on the team, including the non-data scientists, can follow what you're talking about.
I always recommend that people do a partial rollout, and that's the best way to do the kind of estimate for the costs that you do.
You can't whiteboard a confusion matrix and know that a human will choose to ignore the prediction, but the reality is that will happen.
What I suggest that folks do is they take that experienced data engineers, data stewards, what have you, and put them part-time on the project, try to guesstimate fairly early on what that role is going to be, and then get somebody possibly even temporarily to cover some of their other duties that are easier to delegate than this.
With management, you have to be very judicious with their time, but you have to get on their calendars. So part of what comes with experience is knowing where in the lifecycle we will likely need them.

About Keith McCormick

Keith McCormick is an independent consultant, trainer, speaker and author. His consulting specializes in helping analytics from all industry leaders to build and manage their data science teams. His training has reached thousands of individuals trying to learn statistics, machine learning and data science. He specializes in predictive models and segmentation analysis, including classification trees, neural nets, general linear models, cluster analysis, and association rules. He has been designing and conducting analytics projects for over 25 years

LinkedIn: https://www.linkedin.com/in/keithmccormick/

“Maybe we put too much pressure on data scientists that they've always got to come up with some insight that's worth money every time they're looking at data, and we know that it doesn't happen that way.”

- Keith McCormick

Resources

Website: https://www.keithmccormick.com
Twitter: KMcCormickBlog

Connect with LightsOnData

Podcast: www.anchor.fm/lightsondata
LinkedIn: www.linkedin.com/company/lightsondata
YouTube: www.youtube.com/c/lightsondata
Instagram: https://instagram.com/lightsondata
George on LinkedIn: www.linkedin.com/in/georgefirican
George on Twitter: www.twitter.com/georgefrican

Human in the Loop AI: Why It’s Often Just a Checkbox

Data Observability vs. Data Quality: A Comprehensive Discussion

Watch and Listen to Your Favorite Episodes!

Watch to the Video Version

Watch on

YouTube*

Watch on

LinkedIn Live**

(during live event)

*Voted as #1 Most Helpful Data Video Channel of 2020 by the audience of DataLiteracy.com

**Voted Top 3 Data Podcasts by Data Community Content Creators Awards of 2021

*** Named "Best 10 Data Science Podcasts You Must Follow" in 2021, 2022, 2023, 2024, 2025

**** Named "Top 3 Data Management Podcasts" in 2023, 2024, 2025

Listen to the Podcast Version

Listen on

Apple Podcast

Listen on

Anchor.fm

Listen on

Spotify

Listen on

Podchaser

Listen on

Castbox

Listen on

Pocket Casts

Listen on

Breaker

Listen on

Amazon Music

Subscribe & Listen to the Podcast Version

Listen on

Apple Podcast

Listen on

Anchor.fm

Listen on

Spotify

Listen on

Google Podcasts

Listen on

Castbox

Listen on

Breaker

about the show

Each episode puts the lights on various data topics with renowned industry experts. We cover the following topics in a fun and informative interview format: data science, data analytics, machine learning, artificial intelligence, data visualization, data storytelling, data governance, data management, data quality, data strategy, and much more.

Learn more about the Lights On Data Show

Do you want to be featured on the show?

Related episodes

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
sp_landing	1 day	The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
sp_t	1 year	The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content.
tve_leads_unique	1 month	This cookie is set by the provider Thrive Themes. This cookie is used to know which optin form the visitor has filled out when subscribing a newsletter.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_1Z635JPV9L	2 years	This cookie is installed by Google Analytics.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
AE_AB_COOKIE	1 year	No description
DEVICE_INFO	5 months 27 days	No description
loglevel	never	No description available.
tl_4829_4830_26	1 month	No description
tl_4829_4840_30	1 month	No description
tl_4829_4941_41	1 month	No description
tve_secret	1 year	No description available.

Ensuring ROI on Predictive Analytics Projects

You will want to hear this episode if you are interested in:

Notable Quotes

About Keith McCormick

Resources

Connect with LightsOnData

You may also like

Human in the Loop AI: Why It’s Often Just a Checkbox

Data Observability vs. Data Quality: A Comprehensive Discussion

Watch and Listen to Your Favorite Episodes!