Jon Krohn: 00:00:00
This is episode number 773 with Dr. Barrett Thomas, Research Professor at the University of Iowa. Today’s episode is brought to you by Ready Tensor, where innovation meets reproducibility.
00:00:14
Welcome to the Super Data Science Podcast, the most listened-to podcast in the data science industry. Each week we bring you inspiring people and ideas to help you build a successful career in data science. I’m your host, Jon Krohn. Thanks for joining me today. And now, let’s make the complex simple.
00:00:45
Welcome back to the Super Data Science Podcast. Today, you’re in for a treat with the eloquent and deeply knowledgeable Professor Barrett Thomas. Barrett is Research Professor in Business Analytics and Senior Associate Dean at the University of Iowa’s College of Business. As will soon be unsurprising to you, when you hear how well he communicates complex concepts, he’s won multiple teaching awards amongst other academic prizes. He holds a PhD in industrial and operations engineering from the University of Michigan. Today’s episode is a technical one that will appeal primarily to hands-on practitioners like data scientists, software developers, and machine learning engineers. In the episode, Barrett details what Markov Decision Processes are and how they relate to deep reinforcement learning, how operations research leverages neural networks to maximize business profits and minimize business costs, how same-day delivery has been made possible by machine learning, and how aerial drones and autonomous vehicles will revolutionize supply chains and transportation. All right, you ready for this fascinating episode? Let’s go.
00:01:47
Nice. Well, welcome to the University of Iowa, where we have an amazing setup. If you’re watching the YouTube version of this, the University of Iowa has been super generous with recording equipment. I’m sure the audio even sounds spectacular, but I can see the video in real time here. And wow, this is some great production value. The reason why I’m at the University of Iowa is because I’m interviewing Professor Barrett Thomas. So welcome to the show.
Barrett Thomas: 00:02:14
Well, thank you for having me. I have had a chance to follow your show and you do great work, so it’s exciting to be a guest.
Jon Krohn: 00:02:22
Nice, thank you. Well, let’s dig right into your research. So you sit at the intersection of delivery logistics and machine learning. Do you want to tell us a bit about why that’s interesting? I also know you’re closely associated with the business analytics program and have an operations research background, so maybe you can tie those different fields together?
Barrett Thomas: 00:02:42
Yeah, sure. So my first job out of college, I was an intern at Schneider Logistics, which was a subsidiary of Schneider Trucking. So if you’ve seen-
Jon Krohn: 00:02:56
Orange trucks.
Barrett Thomas: 00:02:57
The orange trucks on the road, that’s Schneider, and they’re based in my hometown of Green Bay, Wisconsin. So I ended up there in an internship and found my way into the trucking business. One of the things that we were doing at that time, it was the dawn of third-party logistics, was working with a lot of different clients, helping them in many cases move different components to manufacturing facilities. One of the things that happens in that context is that you don’t necessarily fill up a full truckload. And so, we were looking at ways of improving the cost by stopping somewhere else along the way and putting more in the truck to get better utilization, essentially.
00:03:57
And so, that led me into a career in logistics, and particularly vehicle routing, and eventually to a PhD program in operations research, where we were exploring these problems as part of a Sloan Industry Foundation Program at that time. So I come to these as an operations researcher, as somebody trying to optimize. And so, first and foremost, that’s the lens that I take to these problems, and I only came to the machine learning side much later. In a lot of the early work that I did in these applications, we were certainly trying to solve problems, essentially find policies for the decision making, but we were incredibly limited at that time. The class of problems that I was working on is called sequential decision problems. Essentially, we’re going to make a decision now. We’ll incur a cost or receive some sort of reward, and that will have an impact on the future. But at the same time, of course, the future is unknown.
00:05:23
So we’re making this decision, and we’re going to make another decision in an unknown future, but making this current decision will affect perhaps the decisions that are available to us in the future. And so, you model those as Markov Decision Processes. A Markov Decision Process, or an MDP, is well known to have the curses of dimensionality. The traditional method of solving these, at least for finite-horizon problems, is to use what’s called backward dynamic programming. But what that means is that I need to essentially enumerate the different paths in this problem. And so, these curses of dimensionality come into play, and the one that’s most famous, the curse of state space dimensionality, is that I can’t possibly, in most cases, write out every possible future state that I could find myself in based on every possible decision that I could make along the way.
00:06:37
And so, particularly in the late ’90s, if we wanted to solve these exactly, that reduced us to very toy problems, and in logistics that meant maybe I could take a small road network with some very limited stochastic outcomes and we could solve these small problems. So what we spent a lot of our time doing at that time was trying to understand the mathematical structure of the optimal solution. This has been incredibly powerful in things like inventory control. Unfortunately, when you get into vehicle routing problems, the structure isn’t as evident; this dimension of time and space that you have in routing often made that difficult. And so, from there, we turned to a lot of heuristic decision-making. The easiest thing to do is simply to ignore the future when you make your decision right now. We call that myopic decision-making. It’s easy to show example after example where that puts you into really bad positions in terms of the future decision-making.
00:08:10
And so, the goal in a lot of the work that I was doing was: how do we solve something using a heuristic that has at least some accounting for the future? And that became a big part of my research. Then, though, we have the reemergence of machine learning, particularly the advent of deep learning. And you realize now that we can approximate the future through the neural net as that approximation architecture, and we wipe away the problem of the state space. And so, for the last, I don’t know, 10 years, that has been just an amazing advance, I think, for the kind of work that I do.
00:09:09
The challenge though is that there are still other curses of dimensionality. When you’re dealing with a routing problem, that can be the combinatorics of the solutions. So some of you may be familiar with the traveling salesperson problem, where I have, let’s say, a set of cities and I want to build a path that visits all those cities, and maybe technically I want to return to the starting point. As the number of cities grows, the number of different ways in which I can order them explodes. Of course, there has been an incredible amount of work on this. Many of you may know Bill Cook and his Concorde Solver, and they can solve just incredibly large-
Jon Krohn: 00:10:05
Tell us about that. I don’t know about that problem.
Barrett Thomas: 00:10:07
So this goes back to, I don’t even know how long, the Königsberg bridge problem, which I think has similarities to this and to the traveling salesperson. And so, people have been working on various math programming methods to solve this traveling salesperson problem at least since Dantzig, post-war.
Jon Krohn: 00:10:39
Yeah, I suspect that the traveling salesperson problem, which that’s a really great way to call it, I’m so used to it being called a more gender-specific kind of.
Barrett Thomas: 00:10:48
And it traditionally would’ve been, and I wanted to be a little more gender-neutral.
Jon Krohn: 00:10:52
Yeah, absolutely. It’s one of those ones where that’s something that I try to be really mindful about, but that is one, the traveling salesperson problem is one place that I had not stamped-
Barrett Thomas: 00:11:01
Or we can just call it the TSP and avoid that issue altogether.
Jon Krohn: 00:11:06
And with that problem, probably most of our listeners are aware of it, but we can quickly rehash it here: if you have a number of cities, you want to find the optimal route between those cities so that you spend, I guess, as little time and money as possible, like your classic-
Barrett Thomas: 00:11:27
Right. Some measure of cost, whether it’s distance or time, you want to minimize that cost in terms of the order in which you visit those cities.
Jon Krohn: 00:11:36
And when there’s three cities, it’s computationally tractable, but then it very quickly-
Barrett Thomas: 00:11:40
[inaudible 00:11:41].
Jon Krohn: 00:11:41
Even with, I think, 10 cities or something, it starts to become crazy, the number of possible paths between them. And so, I know this is one of the places where there have been completely different kinds of computational approaches, like genetic algorithms, and I don’t mean a genetic machine learning algorithm, I mean literally using biological material.
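To make that combinatorial explosion concrete, here is a minimal brute-force sketch in Python; the city coordinates are made up purely for illustration. Fixing the start city, n cities leave (n - 1)! orderings to check: 120 at n = 6, roughly 3.6 million at n = 11, and hopeless well before n gets large.

```python
import itertools
import math

# Hypothetical city coordinates, purely for illustration.
cities = [(0, 0), (2, 1), (5, 3), (1, 4), (6, 0), (3, 5)]

def tour_length(order):
    """Total distance of a tour that returns to its starting city."""
    return sum(
        math.dist(cities[order[i]], cities[order[(i + 1) % len(order)]])
        for i in range(len(order))
    )

# Fix city 0 as the start and enumerate every ordering of the rest:
# (n - 1)! tours, which is why heuristics and dedicated solvers exist.
best = min(itertools.permutations(range(1, len(cities))),
           key=lambda rest: tour_length((0,) + rest))
print((0,) + best, tour_length((0,) + best))
```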
Barrett Thomas: 00:12:06
I mean, the idea is that each solution is in and of itself like a genome. And so, you sort of mix and match solutions to get new solutions. The idea would be that I’m going to cut one good solution that I found and then add a piece of another good solution. And so, essentially I’m reproducing and hopefully improving the fitness of my populations of solutions, and we continue that reproduction to hopefully find the best solution.
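What Barrett describes, cutting one good tour and splicing in a piece of another, is classically implemented as an ordered crossover. A minimal sketch, with two made-up parent tours:

```python
import random

def ordered_crossover(parent1, parent2):
    """Keep a random slice of parent1, then fill the remaining positions
    with parent2's cities in the order they appear there, so the child
    is still a valid tour (each city visited exactly once)."""
    n = len(parent1)
    i, j = sorted(random.sample(range(n), 2))
    child = [None] * n
    child[i:j] = parent1[i:j]                      # the piece cut from parent 1
    fill = [c for c in parent2 if c not in child]  # parent 2 supplies the rest
    for k in range(n):
        if child[k] is None:
            child[k] = fill.pop(0)
    return child

# Two hypothetical parent tours over six cities.
print(ordered_crossover([0, 1, 2, 3, 4, 5], [5, 3, 1, 0, 4, 2]))
```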
Jon Krohn: 00:12:46
And so, you’re describing a genetic machine learning algorithm there.
Barrett Thomas: 00:12:52
I mean, that’s what’s called a genetic algorithm.
Jon Krohn: 00:12:53
But what I’m talking about is actually using like a liquid, having actual biological genetic material. And so, I read about this years ago, but it’s one of those out-there ideas. It’s because it’s so computationally complex to model the traveling salesman perfectly with the computing methods that we predominantly use today, and there might be quantum approaches to solving the traveling salesman problem out there too.
Barrett Thomas: 00:13:21
There could be, yeah.
Jon Krohn: 00:13:22
But I remember reading years ago about using genetic information floating in solution, and I guess it must be similar to what you just described there with genetic machine learning: you’re using different strains where you’re like, okay, this strain was pretty good, that other strain was pretty good, let’s mate them together and see what the result is. In this case, it’s just random, I guess. I don’t remember all the details, but the point is that it’s such a computationally complex problem that people come up with really far out ideas for ways to try to solve it efficiently.
Barrett Thomas: 00:13:59
Right. Whether it’s the genetic algorithm, either one of these, or ant colony algorithms and all these other things that have sort of emerged around different ways that we can try to solve these more general combinatorial optimization problems. But the TSP itself has been the subject of just a tremendous amount of work, whether it’s the heuristic solution methods that we’re talking about, or more exact integer programming coupled with various dynamic programming types of approaches. And those have evolved to the point, particularly through this Concorde Solver that is available, that we can solve tours that have millions of cities in them.
Jon Krohn: 00:14:50
Oh, really?
Barrett Thomas: 00:14:51
And it solves them to exact solutions, which is really-
Jon Krohn: 00:14:55
Is this Concorde like the grape or the jet?
Barrett Thomas: 00:15:00
I think it’s spelled like the jet. So I don’t know the genesis of that name. I mean, maybe you have to have Bill Cook on your show and you can talk about it. I think he’d be a great interview.
Jon Krohn: 00:15:11
Research projects in machine learning and data science are becoming increasingly complex, and reproducibility is a major concern. Enter Ready Tensor, a groundbreaking platform developed specifically to meet the needs of AI researchers. With Ready Tensor, you gain more than just scalable computing, storage, model and data versioning, and automated experiment tracking, you also get advanced collaboration tools to share your research conveniently and securely with other researchers and the community. See why top AI researchers are joining Ready Tensor, a platform where research innovation meets reproducibility. Discover more at readytensor.ai.
00:15:53
And so, this Concorde Solver, I mean, is there a high-level way of describing that, or basically, that’s the point of this, is that the Concorde… I can’t remember now how I got into this. I interrupted you. You mentioned it briefly.
Barrett Thomas: 00:16:06
Well, I was talking about the fact that the action space of the MDPs, when you get into these logistics problems, is itself combinatorial, because essentially the subproblem is a routing problem. And so, you can’t enumerate all these solutions, which becomes the problem.
Jon Krohn: 00:16:25
But the Concorde Solver can help you out?
Barrett Thomas: 00:16:28
Well, see, that becomes the challenge when we use the neural net as that approximation architecture. Everybody’s probably familiar with Q-learning. So I have a state of the world; that’s everything I know about the world at this time. And so, if you’re thinking about a vehicle routing problem, it might be the current locations of my vehicles, and we at least need to know who the unserved customers are at that point. Maybe you want to hold on to some sort of route plan that maybe will change; this information can all be useful. So that’s our state. And then from that state, at this point, we want to make a decision. And the challenge is that that decision in and of itself can result in a routing problem. You might think that’s okay: if I have this neural net, I could pull it out, and maybe I’ve used all linear activation functions, so I now have a linear model. But a neural net embedded that way could be incredibly large as a math program, and then the training becomes almost impossible in and of itself.
00:18:03
And so, one of the things that people have done is say, instead of neural nets, let’s just use some linear approximation of the future. And now that’s a little bit easier: I have this single kind of linear equation that goes in my objective, I can solve that, and you can train those relatively quickly. But the truth is that that linear function is obviously a particular functional form, and that isn’t necessarily the right form to get a good approximation of the future value of any particular decision, which is why you want to use the neural net. And so, you’re at this sort of crossroads, with on one hand a really powerful technology in terms of its ability to approximate, but on the other hand some really significant challenges in how we go about solving the problem of essentially choosing the action in that case.
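Roughly the contrast being drawn here: a linear value function approximation is a single dot product over state-action features, which is easy to embed inside a math program, while even a small neural approximation is nonlinear. A sketch with hypothetical features and untrained random weights, just to show the two functional forms:

```python
import numpy as np

rng = np.random.default_rng(0)
phi = rng.random(8)     # hypothetical state-action feature vector
theta = rng.random(8)   # weights of the linear approximation

# Linear approximation: one dot product, a fixed functional form that is
# easy to embed inside a math program when choosing the best action.
q_linear = float(theta @ phi)

# A tiny neural approximation (one ReLU hidden layer, untrained random
# weights): far more expressive, but the action's value is now a
# nonlinear function, which is hard to optimize over a combinatorial
# action space.
W1, b1 = rng.random((16, 8)), rng.random(16)
W2, b2 = rng.random(16), rng.random()
q_neural = float(W2 @ np.maximum(W1 @ phi + b1, 0.0) + b2)

print(q_linear, q_neural)
```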
Jon Krohn: 00:19:12
And so, this sounds like we might be getting into cost function approximation, which maybe we can put a pin in.
Barrett Thomas: 00:19:19
So cost prox and function approximation is a way to get around some of these problems. So maybe I should step back and talk a little bit more about the MDP then to tee that up.
Jon Krohn: 00:19:34
Yeah, let’s talk about the MDP, because something that’s interesting for me is, I don’t know MDPs very well, but I am familiar with reinforcement learning, and deep reinforcement learning in particular. And it sounds like when you’re talking about MDPs, Markov Decision Processes, I’m aware of some of the ideas here. So Markov, for example: I believe that’s the property that all of the information from the most recent time step is all that you need.
Barrett Thomas: 00:20:02
Correct.
Jon Krohn: 00:20:03
So the Markov property, for example, applied to stock markets is that you don’t need to know all of this… When you’re assuming the Markov property, you don’t need to know all the historical stock prices. You say, what were the stock prices yesterday? I’ll use those, that one snapshot of data to predict today’s.
Barrett Thomas: 00:20:20
Right, yeah. So technically, what it means is I don’t need to know the history in order to know what actions are available to me and to know the probability distribution on those transitions in the future. In the historical context of MDPs, that’s what it would mean.
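In symbols, the Markov property says the transition distribution conditions only on the current state and action, never on the rest of the history:

```latex
P(s_{t+1} \mid s_t, a_t, s_{t-1}, a_{t-1}, \ldots, s_0, a_0) = P(s_{t+1} \mid s_t, a_t)
```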
Jon Krohn: 00:20:40
Nice, nice. Other than that Markov property, a lot of this seems very familiar to me from my understanding of reinforcement learning. So first of all, you apply reinforcement learning to solve sequential decision-making problems.
Barrett Thomas: 00:20:56
Correct.
Jon Krohn: 00:20:56
Which is also what you’re using an MDP for, sequential decision-making problems. And similarly, with reinforcement learning, we’re often talking about a state space. And so, the state being the current situation that you’re in.
Barrett Thomas: 00:21:12
The state’s the information you need to make that next decision, it helps define what decisions are available to me and also ultimately what the transition would look like if I made that decision.
Jon Krohn: 00:21:29
So you’re in some state. In my textbook, Deep Learning Illustrated, I use the example of video games, because I feel like you can kind of imagine a video game being frozen.
Barrett Thomas: 00:21:42
Exactly.
Jon Krohn: 00:21:43
You’re controlling a joystick, but let’s say you just pause, and so there’s just a certain state on the screen. Let’s say you’re playing pong, where there’s a paddle at the bottom of the screen. And so, you find yourself at some state, you can see where the pong paddle is on the screen, and you learn through experience, or a reinforcement learning algorithm can learn through experience, that pressing left on the joystick will change the state so that the pong paddle moves left. So you have a state, which is represented in the video game example by the pixels on the screen, and then the action you take is the joystick movement. And then, that changes the state on the screen, and you find yourself in a new state. And the state could remain fixed until you move the joystick again, or potentially in something like pong, it could also be a ball moving, and so the state is changing. So in fact, by not moving the joystick, you are making a decision. You’re taking the action of not moving as that state continues to change.
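That loop, observe a state, choose an action (including doing nothing), land in a new state that also changes on its own, is the skeleton of every RL interaction. A toy sketch with a made-up one-dimensional stand-in for pong:

```python
import random

paddle, ball = 5, 3   # a made-up one-dimensional "pong" state

def policy(paddle, ball):
    """Move toward the ball; note that staying put is itself an action."""
    if ball < paddle:
        return -1
    if ball > paddle:
        return +1
    return 0

for t in range(5):
    action = policy(paddle, ball)        # decide based on the current state
    paddle += action                     # our action changes the state...
    ball += random.choice([-1, 0, 1])    # ...and the world changes on its own
    print(f"t={t} action={action:+d} paddle={paddle} ball={ball}")
```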
Barrett Thomas: 00:22:45
Right. Even in pong, well, maybe not in pong, if you had the physics and you could model the physics exactly, you would know what’s going to happen. But in the real world, there’s exogenous things that are happening to you. Demand is happening, and I don’t know what that demand is in the future. So that change from one screen to the next is random in the real world. And so, that then adds another element into our MDP.
Jon Krohn: 00:23:19
It’s random, typically around a probability distribution. If the demand for this car part that needs to be delivered from one location to another is a certain amount in one day or one month, you could have a distribution around how likely it is to be needed the next month: maybe a little bit more, maybe a little bit less. It’s unlikely to be a lot more or a lot less.
Barrett Thomas: 00:23:43
Right. And so, I mean, there’s a probability distribution; it’s just whether or not we know what it is. In the most complex problems, it might be really hard to even specify what that is, which is why, when we solve these problems, or rather when we’re trying to train something like machine learning, we generally turn to a simulation, because we are not modeling that distribution exactly. We’re using the simulation to advance time forward.
Jon Krohn: 00:24:22
So even when you started your career, you would’ve been doing simulations.
Barrett Thomas: 00:24:26
So in the first work I did, no, actually, we were assuming a probability distribution existed, and one that we knew. Ideally, we would derive the structure of the optimal policy without putting restrictions on that distribution, but generally we couldn’t do anything that way. So then, we would assume a particular distribution and try, again, to find optimal structure from that assumption. But again, that’s limiting. That’s really limiting about what the world looks like: I have to make assumptions about this distribution, and I may or may not know what that distribution is. And if you end up in a situation where that distribution is a result of many different actions, take queuing examples, for instance, those are very, very difficult to specify.
Jon Krohn: 00:25:23
Queuing examples.
Barrett Thomas: 00:25:25
Queuing is simply waiting lines, but that could be you at the grocery store or, well, nobody goes to the bank anymore. That used to be the classic example, but nobody does that. So maybe it’s at the drive-through. It also happens in information processing, where you end up with packets queued, things like that. So there are distributions for, say, how long you’re going to wait in line, but those can be very, very difficult to even specify. So a simulation can help us at least generate the data we need to take the step forward in time.
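For instance, a single-server queue’s waiting-time distribution can be awkward to write down analytically but trivial to sample. A minimal sketch with assumed arrival and service rates:

```python
import random

random.seed(42)
arrival_rate, service_rate = 0.9, 1.0   # assumed rates (utilization 0.9)

t, server_free_at, waits = 0.0, 0.0, []
for _ in range(100_000):
    t += random.expovariate(arrival_rate)     # next customer arrives
    wait = max(0.0, server_free_at - t)       # wait if the server is busy
    waits.append(wait)
    server_free_at = t + wait + random.expovariate(service_rate)

print(f"average wait: {sum(waits) / len(waits):.2f}")
```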
Jon Krohn: 00:26:07
Nice. And so, the Markov Decision Process, it allows you to simulate in a way, or it’s just for evaluating a simulation?
Barrett Thomas: 00:26:15
No. So a Markov Decision Process is simply a model of our decision-making. So in my Markov Decision Process, I have my current state, this is what I know about the world, and then that allows me to define the set of actions that are currently available to me. What I want to do, let’s say we’re maximizing, is choose the action that is going to maximize the current reward, the reward that I’m going to receive right now from taking that action. So maybe I’m going to choose to make a sale, and if I make that sale, I’m going to get an immediate reward on that. Somebody’s going to pay me.
00:27:00
Now, added to that, though, is a second term. And that second term is an expectation of the future value that I can earn given what my current state is, as well as the action that I’ve just taken. So it becomes a conditional kind of expectation on that state and action pair. But since I’ve just made this sale, well, that now means in the future I can’t make and sell that exact same item. So that is impacting what that future cost or reward is going to be. And so, if we were going to solve this exactly, then ideally I need to know that future reward for every single state and action pair. That’s the challenge of solving these exactly. So now, I have this model of my decision-making process.
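Written out, the two terms Barrett describes form the Bellman equation of the MDP: the value of a state is the best achievable immediate reward plus the expected value of wherever that state-action pair leads:

```latex
V(s) = \max_{a \in A(s)} \Big( r(s, a) + \mathbb{E}\big[\, V(s') \mid s, a \,\big] \Big)
```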
Jon Krohn: 00:28:08
And all of that stuff that you just said around the MDP, so it sounds… Oh yeah, now the pieces are coming together for me. When I’m doing a reinforcement learning problem, it is a Markov Decision Process.
Barrett Thomas: 00:28:24
It is a Markov, so the Markov, no, no. Well, I mean, the Markov Decision Process is a model of the problem that you are solving with reinforcement learning. So what I’m doing in reinforcement learning is, I have a state, and then, let’s say you’re doing Q-learning, I have a state and I take one of my actions, and that becomes the input into my model, and then the output becomes the value of that state-action pair. So that’s one decision. Then you step forward in time: I have a state and I have an action, and now I’m going to move into some new state. And how I get to that new state is where the simulation comes in, because there’s going to be something that happens randomly between the current state and action I’ve taken and my future state. So now it’s going to be my current state, plus the action I’ve taken, plus what we call that exogenous information, that then leads me into that future state. And we’re simulating that exogenous information.
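A minimal tabular sketch of that loop, in the spirit of Barrett’s sale example: a made-up stock-and-sell toy problem where a dictionary stands in for the neural net and the exogenous replenishment is simulated. All numbers and dynamics here are invented for illustration:

```python
import random

random.seed(0)
gamma, alpha = 0.95, 0.1       # discount factor and step size (assumed)
actions = [0, 1]               # toy actions: hold, or offer one unit for sale
Q = {}                         # tabular stand-in for the neural net

def q(s, a):
    return Q.get((s, a), 0.0)

def step(stock, sell):
    """One simulated transition: immediate reward now, then exogenous
    replenishment moves us to the next state."""
    sold = 1 if (sell and stock > 0) else 0   # can't sell what we don't have
    reward = float(sold)                      # somebody pays us for the sale
    replenish = random.choice([0, 1])         # the exogenous information
    next_stock = min(stock - sold + replenish, 5)
    return reward, next_stock

stock = 2
for _ in range(10_000):
    a = random.choice(actions)                # explore with random actions
    r, nxt = step(stock, a)
    # Q-learning update: sampled reward plus discounted best future value.
    target = r + gamma * max(q(nxt, b) for b in actions)
    Q[(stock, a)] = (1 - alpha) * q(stock, a) + alpha * target
    stock = nxt

print(sorted(Q.items()))
```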
Jon Krohn: 00:29:41
So we could create a program that simulates the environment-
Barrett Thomas: 00:29:45
Exactly.
Jon Krohn: 00:29:45
That our reinforcement learning agent is then making decisions in. And over time, through running this simulation many times, you can start to get a sense of: should I be making this sale now, or making other offers first?
Barrett Thomas: 00:30:02
Well, I’m using the reinforcement learning to learn what I should do, because I do an entire sequence of decisions, and now I can just do a backward pass. I know the value of each of those decisions. And so now, I can do that backward pass; I can add it all up going backwards. Now I know: oh, here, for that particular trajectory, for the actions that I took, this is now a sample of the future value of that particular action in this particular state. And so, now we start running many, many trajectories using our simulation. That’s the information that we are using to update, using the reinforcement learning techniques. Essentially, if we’re using a neural net, we update our neural net based on each of those trajectories, and then hopefully at some point in the future we converge.
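That backward pass is just accumulating discounted reward from the end of a simulated trajectory back to the start, which yields one sampled value per visited state-action pair; those samples become the targets for updating the approximation. A sketch with made-up rewards:

```python
gamma = 0.99                     # discount factor (an assumed value)
rewards = [1.0, 0.0, 2.0, 5.0]  # hypothetical rewards along one trajectory

# Walk the trajectory backwards, accumulating the discounted reward-to-go.
returns, g = [], 0.0
for r in reversed(rewards):
    g = r + gamma * g
    returns.append(g)
returns.reverse()

# returns[t] is now one Monte Carlo sample of the value of the state-action
# pair visited at step t; samples from many trajectories become the
# training targets for the neural net.
print(returns)
```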
00:30:59
But I know you’ve done a lot of work in reinforcement learning in some areas. One of the things that is challenging in the environments that I’ve worked in is that getting to convergence can be really, really challenging. Maybe we’re just bad at the design of our neural nets, but we find that we get a lot of, I think, jumpiness in the values of the policies that we’re returning.
Jon Krohn: 00:31:35
Ready to master some of the most powerful machine learning tools used in business and in industry? Kirill and Hadelin, who have taught millions of students worldwide, bring you their newest course, Machine Learning Level Two. Packed with over six hours of content and hands-on exercises, this course will transform you into an expert in the ultra-popular gradient boosting models: XGBoost, LightGBM, and CatBoost. Tackle real-world challenges and gain expertise in ensemble methods, decision trees, and advanced techniques for solving complex regression and classification problems. Available exclusively at www.superdatascience.com, this course is your key to advancing your machine learning career. Enroll now at www.superdatascience.com/level2. That’s www.superdatascience.com/level2.
00:32:18
So converging meaning nice and smoothly getting to that maximum reward, as opposed to hopping all over the place.
Barrett Thomas: 00:32:28
Right, exactly.
Jon Krohn: 00:32:30
And so, I guess another key term here to note is that, so we’ve been talking about reinforcement learning a lot. We’ve been talking about neural nets. And when those two things are combined like you’ve been describing, that is deep reinforcement learning.
Barrett Thomas: 00:32:41
That’s deep reinforcement learning, and what you’re approximating in that particular case is the value of the future. You could of course also use some sort of policy gradient approach, and then you’re directly learning the distribution on the policy.
Jon Krohn: 00:33:04
A weird thing about, at least in my understanding of, this deep reinforcement learning term is that it doesn’t matter. So typically, when we talk about neural networks, it’s only a deep learning architecture if it has, say, three hidden layers.
Barrett Thomas: 00:33:17
Multiple layers, right.
Jon Krohn: 00:33:18
But we can call it deep reinforcement learning even if we just have a single hidden layer in our neural network. We don’t call it shallow neural network reinforcement learning.
Barrett Thomas: 00:33:28
That’s true. Everything is deep reinforcement learning now regardless of the number of layers that we use. Yeah, that’s true.
Jon Krohn: 00:33:36
Sorry, I jumped in with that as you were starting to talk about policy a bit more. I feel like policy is a term that maybe we should define. I mean, at the onset of the episode, you talked about policy, and we moved on from it so quickly that it could sound like an insurance policy or something.
Barrett Thomas: 00:33:55
So a policy is a mapping from a state to an action or to a decision. That’s all a policy is.
Jon Krohn: 00:34:05
It’s like a term that you could use in normal language. You could be like, “When I see a pedestrian on the road, I have a policy of hitting the brake.”
Barrett Thomas: 00:34:15
Exactly. No, that’s exactly right. It’s a mapping from what you’ve seen. So even going back to the pong example, what you’re seeing is that set of pixels, that’s our state, that’s the input, and now we get an output that’s telling us: hit the brake.
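In code, a tabular policy really is nothing more than that mapping; a two-entry toy table:

```python
# A policy is just a mapping from state to action; a tabular version:
policy = {
    "pedestrian_ahead": "hit_brake",
    "ball_moving_right": "move_paddle_right",
}
print(policy["pedestrian_ahead"])   # -> hit_brake
```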
Jon Krohn: 00:34:32
When I see the pong ball move to the right, I have a policy of moving the pong paddle to the right. Gotcha. Cool. So all right, we’ve covered a lot of terms. So to quickly recap, actually, you could phrase this better than me: you describe this perfectly when you explain the connection between reinforcement learning and a Markov Decision Process.
Barrett Thomas: 00:34:59
So the Markov Decision Process is a model of the problem that you’re seeking to solve. Reinforcement learning is the process we use to learn the, well, it’s not optimal, but to learn the best possible policy we can for the problem that we’ve modeled.
Jon Krohn: 00:35:21
Yeah. And then, within that world, we’ve got state, which is like pixels in the video game. We’ve got actions we can take, which is the joystick in the video game. And we have policies, which is the mapping of that state to some particular action. And so, this scales up from our small pong example to very complex problems like you have in logistics, where a given logistics company like Schneider would have thousands of trucks all over the US, and probably traveling internationally as well, going to a parts factory in Canada and bringing the part down to Michigan. And so, you’re trying to optimize across all of this: how can we minimize driver time or minimize fuel expenditure? And so, this reinforcement learning framework scales up to these very complex problems.
Barrett Thomas: 00:36:29
It does. So what we’re using reinforcement learning and the neural net for is, let’s go back to Q-learning. We want to learn the value of that second term of our Markov Decision Process, that reward that we can earn in the future given this state, given this action.
Jon Krohn: 00:36:51
And that reward is defined. And I guess we’re going to talk a bit more about how cost functions can be approximated, but typically, the way I think about cost functions or reward functions, which are really the same thing: if you are maximizing reward like dollars, that is an objective in machine learning that you’re trying to maximize. Typically, when we think about stochastic gradient descent, which is a very common algorithm for optimizing machine learning models, we’re minimizing cost. But in this case, in an operations research sense, that cost can literally be dollars.
Barrett Thomas: 00:37:36
Yes. In the problems we’re solving, that could literally be dollars.
Jon Krohn: 00:37:39
And so, you’re trying to maximize reward, maximize dollars, or minimize cost, minimize dollars spent.
Barrett Thomas: 00:37:44
Right. And so, this is where we can talk about a cost function approximation. So we’ve been talking about using reinforcement learning and trying to approximate that second term. Well, that can still be very complicated. And certainly before we had something like neural nets, we might’ve had to have some other heuristic that allowed us to do that. So one of the things we might consider is, you know what, we’re just not going to deal with that reward-to-go or that cost-to-go. It’s too complicated. We’re going to try something else. We’re going to say, “All right, instead of that, I’m going to just work with a simple penalty term: I’ll have my current reward, and then I will have some parameterized penalty term.” And so, in the paper where we did this, the one that I think you probably were looking at, we were exploring how you could support decision-making in this new set of meal delivery services that have grown up, whether it be Grubhub or something like Uber Eats.
00:39:15
We have customers who are going online, they’re placing orders from a set of restaurants. You have the platform that then needs to get those out to drivers. So how would you make decisions about what drivers should serve which orders, when, and what goes together? So the complication there is that new orders are always arriving, drivers are moving around to customers in different restaurants, and at the same time, you don’t even know when the food might be ready at a particular restaurant. How much visibility do we even have to what their order queue looks like? So it may be longer or shorter times at that restaurant.
Jon Krohn: 00:40:05
It’s so crazy complicated. I haven’t thought about that.
Barrett Thomas: 00:40:08
And so, there are a lot of different moving pieces, a lot of different stochastic elements to that. I mean, just two alone: which customers are ordering, or they don’t call anymore, they use an app on their phone, and when; and then what’s happening at the restaurants, because there are in-person folks at the restaurants, and there might be other delivery platforms that are also visiting them. So there’s a lot of randomness going on there. And so, we were really interested in how you would build decision support for this. And this did predate deep learning.
00:40:46
And so, we were looking at other approaches, and at least at that particular time, one of the things that the platforms and certainly the restaurants cared about in getting food to their customer was what we were calling freshness. So the longer it takes to deliver the less fresh that food becomes. So we wanted to, in essence, if you will, maximize freshness. So it’s obvious in hindsight, but when you made a decision to assign a meal, even if we just assumed the expected time to prepare that meal, the expected time then to take that meal and deliver it, we could look at that expectation. We could say, “Huh, we’re nearing this soft deadline we have for how long we want to go from order to delivery.”
00:41:49
And so, as that delivery time pushed out toward that deadline, we said, “You know what? Those aren’t the best solutions, because we don’t then have any time to buffer the randomness that’s in the system.” So what we want to do is penalize a decision that is going to lead to that delivery happening close, or closer, to that deadline. And so, we did something simple: you can parameterize that as a linear function and penalize the time as you get closer and closer to the deadline. That’s a simple cost function approximation, and you can use different techniques to learn the right parameterization, in this case, the slope of our linear penalty term. And that is a much simpler decision-making tool than obviously putting this into a neural net.
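A minimal sketch of that kind of cost function approximation. The numbers, the halfway threshold, and the function name are all assumptions for illustration; the slope is the single parameter you would tune, for instance by simulation:

```python
def assignment_cost(expected_delivery_min, deadline_min, slope, base_cost):
    """Immediate cost of assigning an order to a driver, plus a linear
    penalty that grows as the expected delivery time eats into the
    buffer before the soft deadline. Threshold and slope are assumed."""
    buffer_start = 0.5 * deadline_min
    lateness_risk = max(0.0, expected_delivery_min - buffer_start)
    return base_cost + slope * lateness_risk

# Two candidate assignments for the same order (all numbers made up): the
# second looks cheaper right now but leaves almost no buffer for randomness.
print(assignment_cost(expected_delivery_min=25, deadline_min=45,
                      slope=0.6, base_cost=4.0))   # 5.5
print(assignment_cost(expected_delivery_min=42, deadline_min=45,
                      slope=0.6, base_cost=3.5))   # 15.2
```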
00:42:59
Even with all of these technologies, there are a lot of cases where methods like this can be really valuable. As I’m sure you’re aware, we talk a lot now, as we do machine learning, about interpretable models and things like that. Well, this would be a form of an interpretable model, because I could describe this to a driver, I could describe it to a restaurateur as I’m trying to sell my platform to them. People can understand this in ways that they can’t understand a neural net; the neural net’s a black box. And so, there are advantages to using tools like a cost function approximation, because ultimately you do have to deploy these into the real world, and people have to use them, and they want to know how it works, and they want to be able to trust it. When you can describe it, you can build that trust.
Jon Krohn: 00:44:07
Mathematics forms the core of data science and machine learning. Now with my Mathematical Foundations of Machine Learning course, you can get a firm grasp of that math, particularly the essential linear algebra and calculus. You can get all the lectures for free on my YouTube channel, but if you don’t mind paying a typically small amount for the Udemy version, you get everything from YouTube plus fully worked solutions to exercises and an official course completion certificate. As countless guests on the show have emphasized, to be the best data scientist you can be, you’ve got to know the underlying math. So check out the links to my Mathematical Foundations of Machine Learning course in the show notes or at jonkrohn.com/udemy. That’s jonkrohn.com/udemy.
00:44:52
Gotcha. So it’s an explainable linear model, where for each of the parameters in that model, you can say: driver or restaurant, if this thing goes up, then this is the exact effect it’s going to have on cost.
Barrett Thomas: 00:45:11
Yeah, exactly.
Jon Krohn: 00:45:13
So I guess, is there potentially a trade-off in that scenario, whereby in allowing for more explainability, there’s potentially less nuance?
Barrett Thomas: 00:45:22
Oh, yeah. I mean, just the fact that, one, let’s say we’ve chosen that linear penalty term, well, who knows if that’s even a good functional form for what our penalty should look like. So we’ve already made that decision. And then, something like a cost function approximation is pretty crude. And so, like you said, you lose nuance, and really, more likely, you lose robustness. There are going to be cases where it isn’t giving you the best solution. There are going to be particular states where, you know what, the best action right now is for us to actually make a decision that makes it look like this delivery is going to be right up against the deadline. That cost function approximation doesn’t know that particular case, and you’ll have made the wrong decision at that point. You hope that it works in enough of the cases to still give you really good performance. But there’s absolutely that possibility that you’re going to end up making a bad decision in particular circumstances.
Jon Krohn: 00:46:44
Very cool. All right, changing gears here a bit. Let’s talk about drones. So we’ve been talking about same-day delivery, and drones come up as something that could be super helpful in making same-day delivery possible. I mean, I guess it’s conceivable, though I don’t know of this yet, or at least I don’t see it in New York: either a sidewalk drone that just drives itself or a flying drone bringing me a meal or some small Amazon order or something. Where are we in terms of drone deliveries happening? Are there places where that is happening regularly? And what’s the benefit once we get it to work? How does that complement existing systems?
Barrett Thomas: 00:47:32
Companies have been piloting drones: Google had a pilot on this, Amazon has been trying to do this, and you do see JD.com in China using drone deliveries. Where I think it’s most successful at this point is not so much in cities, but when we’re delivering into rural areas: we might send that drone out to a more rural area, and it drops off a set of packages in the backyard of somebody who then delivers them to the individuals. But when we’re doing research, we also want to look into the future, and we want to try to understand what that future could look like. Should I even be trying to invest in these technologies? Could there be any advantage to doing so?
Jon Krohn: 00:48:30
Yeah, you might discover that actually your cost ballooned. It seemed like a cool idea, but for some reason…
Barrett Thomas: 00:48:35
Yeah, that’s exactly right. And so, in one of the papers that I have with a co-author from Germany, Marlin Ulmer, and a former PhD student, Xinwei Chen, we were looking at this question, because we wanted to know: wow, if you’re Amazon and you’re doing your same-day delivery in an urban environment, would I ever want to use a drone? Would I just want to stick with trucks, or maybe use some combination of the two? And in fact, what we found is that, at least at that time, the drone technology could carry one package. So it would take a package, go out from the delivery depot, deliver it, and then it had to come back and pick up another. But if you’re using a vehicle, well, I can put multiple packages onto it. It can go out and do a delivery route, then return and pick up the next set of packages.
00:49:45
So each of those might have an advantage. The drone moves faster and isn’t affected by traffic, but it’s only doing one delivery at a time, versus the truck, which is affected by all those things but can carry multiple packages. And so, maybe the result wasn’t that surprising, but it turns out you do want to use both: things that were further from the depot, you would deliver using the drone, and I would use trucks closer to the depot, because I could put more packages on them and take advantage of the fact that they weren’t doing those back-and-forth trips to the depot.
00:50:23
Now, the question is: could we ever use those in a city? There are tremendous challenges in a place like New York in the fact that you have really tall buildings. Once you have tall buildings, you have the effect they might have on winds moving through the buildings. Do we really want drones, maybe these are small drones, but they still weigh something, that could run into trouble and be knocked out of the sky? So subsequent research is looking at: well, okay, maybe I don’t want drones delivering those packages, particularly in urban areas like that. But maybe what we want to do, instead of having that truck go back and forth, is use a little bit larger drone and resupply the truck closer to where it’s going to do its delivery. So maybe it comes out to a loading zone of some kind, an area that is a little safer, and maybe we do that.
00:51:29
So we didn’t do that work, but there are folks starting to look at that, and I think that’s a really promising idea to think about. Particularly when you think about many of the dense European cities where bringing in delivery vans can be really difficult, they’re already doing things like cargo bikes in many of those cities. Well, those cargo bikes have a pretty limited capacity anyway, and you’re going to have to ride the bike back, and yes, they’re electric bikes, but ride them back and forth to these depots. Maybe we want to do something in between so that it’s just a lot more efficient.
Jon Krohn: 00:52:11
That’s cool.
Barrett Thomas: 00:52:12
And we’re just getting more productivity.
Jon Krohn: 00:52:14
That’s a totally new idea to me. It never occurred to me and makes so much sense. Super cool. In all the years that you’ve been doing this, obviously we talked about the example of deep learning coming into the picture and then being able to use that to approximate some functions. What else has changed? I mean in terms of programming languages you might use or even the way that you solve problems in general, maybe today that we have more compute, we have more storage, maybe you can approach problems in a very different way than you would’ve when you started your career.
Barrett Thomas: 00:52:51
So there’s that, as well as the problems that we work on. I think that what’s really emerged is last-mile delivery. If you go back 25 years, well, what did that mean? Maybe it was the dawn of the internet, but you might’ve still been getting a catalog and calling in to order something; it somehow moved through the system and ended up at a UPS or a FedEx. And so, at the beginning of that day, they could say, “Okay, these are the packages we have to deliver today,” and they would deliver them.
00:53:32
And so, in that sense, you had sort of this beginning of the morning problem where I knew everything I was going to know for the day. And so, we talked about Concorde, you had systems like that that were actually pretty effective if you wanted to solve that particular problem. There were operational challenges to updating your routing every day, but the companies were really good at that. As we fast-forward and you start to get same-day delivery, you get the meal delivery platforms, you get increasing Uber-type technologies. We’ve really seen an explosion in these new models of transportation that have honestly made this a really exciting time. I mean, some of the listeners may not agree that that’s exciting from a research-
Jon Krohn: 00:54:23
I hope some of them did because that’s why I picked you to be on the show.
Barrett Thomas: 00:54:25
Well, I appreciate that.
Jon Krohn: 00:54:27
Because it is interesting when somebody says logistics to me, for some reason at a distance, that word doesn’t excite me like some other words in technology might. But in fact, it is fascinating and it’s such a complex problem, and it impacts probably all of our listeners many times a day.
Barrett Thomas: 00:54:48
It’s almost certainly impacting them many times a day. It’s related as well to supply chain. We know that particularly during the pandemic, that had an incredible impact on many people’s lives as well when we couldn’t get goods and-
Jon Krohn: 00:55:05
Couldn’t get a couch.
Barrett Thomas: 00:55:05
Yeah, you couldn’t get a couch or toilet paper or whatever it was. And with that supply chain, certainly there were manufacturing issues going on, but there were also logistical issues: how am I going to move this through the system? Do you have the capacity or not to move this from one place to another and ultimately to your end consumer? And so, that logistics part does play a role in that. As you mentioned, particularly in this day and age, I don’t know how many times a day the Amazon trucks are coming down my street, but it’s significant. And so, we are seeing it every single day. I appreciate that you found something interesting in there.
Jon Krohn: 00:55:52
And so, what do you think is next? So you’ve given us some exciting ideas, and so I might’ve already exhausted potentially new exciting things in this space, but drones, whether those are aerial or driven, that’s something that’s coming, but maybe what else is expected to change about our behavior? You were talking minutes ago about previously ordering from catalogs, companies knowing what their delivery route would be for the day, now things are moving more quickly. Real-time apps, you watch the cyclist come with the Uber Eats and you’re hoping that it’s hot. And algorithms like you’re describing are helping ensure that my meal is getting to me hot. What could happen next? Is it just faster? I mean, what could be more real-time?
Barrett Thomas: 00:56:41
Yeah, I mean, I think it’s going to be hard to be more real-time. And you have seen companies that tried to do this, and they’ll say, “We’re going to guarantee a five-minute delivery.” I mean, from the work that I’ve done, I am pretty sure that that isn’t ever going to make sense. The faster you say you’re going to deliver something, the more that delivery becomes a one-off. We like to talk about consolidation opportunities, that is, being able to put multiple things onto one delivery vehicle. That leads to economies that you want to take advantage of. If you start doing those one-offs, you’re just not going to be able to make money doing that. But the other thing that’s happening as we have moved into this delivery economy, whether it’s food or goods, is that there’s more and more delivery traffic.
00:57:46
And so, particularly as the environment becomes more and more urban, I think that’s leading to more and more congestion. You’ve certainly, in New York, been impacted by delivery vehicles not having anywhere to park. And so, this driver has to do something. It isn’t that they want to double park, but I know we’ve all seen it. It also means that as we increase congestion, at least as long as we have combustion vehicles, we’re going to increase emissions.
00:58:23
And so, to me, the really interesting next step is the ways that we’re going to rethink these delivery models. Drones would’ve been one example of that. You mentioned delivery robots; that’s another. For more than 10 years, companies have been working with delivery lockers, maybe at a 7-Eleven, where you go and pick up your package, because that’s sort of a one-stop, there’s generally parking there, things like that. My colleague here, Ann Campbell, our former student, Sara Reed, and I have been working on how we would change the delivery model if we had an autonomous vehicle. So one-
Jon Krohn: 00:59:16
I met Ann today. She told me that you were going to quote Tennyson.
Barrett Thomas: 00:59:19
I was going to quote Tennyson. Okay then, now you’re putting me on the spot. So Charge of the Light Brigade, I guess.
Jon Krohn: 00:59:28
I completely derailed you, my apologies.
Barrett Thomas: 00:59:34
What’s that?
Jon Krohn: 00:59:34
I completely derailed you, my apologies.
Barrett Thomas: 00:59:36
Oh, that’s okay. I could have been ready with something for you, but I’m surprised she said that. Anyway, okay. So we’ve been working on what different things you could do if you had this autonomous vehicle, particularly one working with a delivery person. Probably Ann remembers this as well: we were talking about what would be interesting about this, and we tend to go to lunch at this Chinese restaurant where we’ve had, I think, a number of good ideas while eating lunch, and we had this realization that an autonomous vehicle could keep moving. And so, what would that mean? Well, that means that I could get off the vehicle and take a number of packages that need to be delivered. As that delivery person, I could deliver those around the block, and the vehicle could meet me.
Jon Krohn: 01:00:44
Oh, yes.
Barrett Thomas: 01:00:45
And so, what’s the advantage of that? Well, one of the advantages is as a driver, I don’t have to find a place to park necessarily. So that’s saving time. Then, you don’t know where you’re going to end up parking necessarily because you are subject to the availability of parking. That parking spot might be pretty far away. Now, I have what we might call a deadhead. I have to walk to that first location, and at some point I need to walk back distances that I’d rather not have to walk. And so, what if I could get off the vehicle and then it picks me up at the end of when I’ve delivered those packages and we go on to the next spot at which we want to deliver?
01:01:30
And so, that was some work that we did while Sara was working on her dissertation. And it turns out, particularly in urban areas, you can save a really large amount of time. And so, what would that mean? Well, that means I need fewer vehicles to do the delivery work. And so, now you have that possibility that we can reduce the vehicle congestion at least associated with some of these deliveries. But it has other benefits. It could reduce the stress on the delivery person, reducing workplace injury, reducing workplace fatigue. It could help preserve additional parking spaces in urban environments and consumers like that, the stores like that. So there could be a lot of benefits from that.
01:02:24
Of course, there’s all kinds of other models. You have models of the Superblock in Barcelona where you’re trying to put delivery zones outside of the block. And this gets us to some of the things we were talking about in terms of European cities. Now, Ann, Sara and I are continuing to work on a problem where we’re thinking about what’s the value of having dedicated commercial parking? How much would a company be willing to pay to have that available? How much should a city be charging in order to make that available? So I think that this problem of the convenience that we have from these services also means that they’re creating these other challenges. And so, that question becomes how do we improve those delivery services to then mitigate the challenges that it also creates?
Jon Krohn: 01:03:21
Fantastic. Well, this has been an incredible episode. I knew you would be a great speaker and you’ve exceeded expectations. Thank you. Really fascinating conversation. Before I let you go, do you have a book recommendation for us?
Barrett Thomas: 01:03:34
So I do have a book recommendation. It’s a book by my friend Warren Powell, who has also been very interested in this intersection of operations research and machine learning. His book is Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions. If you’re interested in Markov Decision Processes and how they relate to reinforcement learning, I think this is a great book. He talks about cost function approximation as well, and I think some of the listeners would find that interesting.
Jon Krohn: 01:04:11
Yeah, no doubt. Well, thank you very much. If people want to hear more of your thoughts after this episode, is there any particular, do you tweet or?
Barrett Thomas: 01:04:24
I do have Twitter. I mostly am giving kudos to the Department of Business Analytics and the Tippie College of Business, but people are welcome to connect barrett.w.thomas on, I guess, it’s X now.
Jon Krohn: 01:04:39
Yeah, I never say that.
Barrett Thomas: 01:04:41
Yeah. But yeah, please connect or LinkedIn.
Jon Krohn: 01:04:47
Nice. And of course, your Google Scholar profile, lots of fresh papers showing up there regularly. Amazing that you do that on top of all the administrative work I know you have to do today.
Barrett Thomas: 01:04:59
Yeah, but I think that when you get in and you become an academic, that’s the thing that you really enjoy most is doing that research work. And so, I try to make sure that I am always doing a little bit.
Jon Krohn: 01:05:15
Nice. Well, thank you also for taking the time to do this today. Absolutely fantastic. And maybe we can catch up with you again in a few years and see how delivery has evolved in that time.
Barrett Thomas: 01:05:24
I’d love to. Thank you very much.
Jon Krohn: 01:05:31
That was spectacular. What a talented communicator Barrett is. In today’s episode, he filled us in on Markov Decision Processes; deep reinforcement learning being a blend of reinforcement learning with neural networks; cost function approximation allowing for more explainability, but at the expense of nuance and robustness; and how his operations research supports innovations such as large drones resupplying trucks away from their depot, and autonomous vehicles moving to a pickup location while a delivery person drops off packages on foot. As always, you can get all the show notes, including the transcript for this episode, the video recording, any materials mentioned on the show, the URLs for Barrett’s social media profiles, as well as my own, at www.superdatascience.com/773.
01:06:10
And if you’d like to engage with me in person, as opposed to just through social media, I’d love to meet in real life at the Open Data Science Conference East, ODSC East, which will be held in Boston from April 23rd to 25th. I’ll be hosting the keynotes and teaching two half-day tutorials. One will introduce deep learning with hands-on demos in PyTorch and TensorFlow. The other will be on fine-tuning, deploying, and commercializing with open-source large language models, featuring the Hugging Face Transformers and PyTorch Lightning libraries. It’d be awesome to see you at these big events.
01:06:44
All right. Thanks to my colleagues at Nebula for supporting me while I create content like this Super Data Science episode for you. And thanks as always to Ivana, Mario, Natalie, Serg, Sylvia, Zara, and Kirill on the Super Data Science team for producing another fascinating episode for us today. For enabling that super team to create this free podcast for you, we are deeply grateful to our sponsors. You can support this show by checking out our sponsors’ links, which are in the show notes. And if you yourself are interested in sponsoring an episode, you can get the details on how by making your way to jonkrohn.com/podcast. Otherwise, please share, review, subscribe, and all that good stuff, but most importantly, just keep on tuning in. I’m so grateful to have you listening, and I hope I can continue to make episodes you’ll love for years and years to come. Until next time, keep on rockin’ it out there, and I’m looking forward to enjoying another round of the Super Data Science Podcast with you very soon.