SDS 683: Contextual A.I. for Adapting to Adversaries, with Dr. Matar Haller

Podcast Guest: Matar Haller

May 30, 2023

Matar Haller speaks to Jon Krohn about the challenges of identifying, analyzing and flagging malicious information online. In this episode, Matar explains how contextual AI and a “database of evil” can help resolve the multiple challenges of blocking dangerous content across a range of media, even those that are live-streamed.

Thanks to our Sponsors:
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
About Matar Haller
As the VP of Data & AI at ActiveFence, Matar Haller leads the Data Group, whose teams are responsible for the data and algorithms which fuel ActiveFence’s ability to ingest, detect and analyze harmful activity and malicious content at scale in an ever-changing, complex online landscape. Matar holds a Ph.D. in Neuroscience from the University of California at Berkeley, where she recorded and analyzed signals from electrodes surgically implanted in human brains. Matar is passionate about expanding leadership opportunities for women in STEM fields and has three children who surprise and inspire her every day.
Overview
According to the Alan Turing Institute, nearly 90% of people aged 18-34 “have witnessed or received harmful content online at least once” (turing.ac.uk, 2023). As VP of Data and AI at ActiveFence, a company that develops algorithms to detect and remove harmful user-generated posts, Matar Haller has a lot of work ahead of her. The ubiquity of dangerous content and its range across media formats (video, text, audio, image) make such data hard to monitor. As Matar explains, this is only the first step in a long process of weeding out illegal material online.
The first issue is that the divisions between acceptable and unacceptable content may not always be clear. This is why it is necessary to consider content such as “baby’s bath time” in its broader context: Is it an innocent video posted to share a happy moment? Or is it something more sinister? To assess this broader context, Matar explains, contextual AI will look at the user’s history of posts, whether it contains known logos or weapons, the language used to describe the post, and its user-generated tags.
Another obstacle that ActiveFence has to overcome is that users who want to spread misinformation and extreme media won’t give up easily, seeking instead to circumvent AI through a variety of means, from unusual spellings to video and audio played at altered speeds. To address these evasion techniques, ActiveFence hires intelligence analysts with expertise in finding misinformation. These analysts research the hashtags, trends and even emojis that dangerous groups might use, which they can then add to a “database of evil” that helps the ActiveFence team surface and block offending data.
Thanks to the Digital Services Act passed by the EU last year, users in EU countries can benefit from more guardrails against witnessing illegal content. But while this is a step in the right direction, some forms of content sharing, such as live-streaming, can be a challenge to monitor. Matar says that the scale of data, posted simultaneously, can bog down the process of detection. One method to solve this problem is to again use contextual AI, considering the user, their streaming and comments history, and the groups to which they belong, before producing a risk score for how likely that piece of content is to be harmful.
Listen to this episode to hear about the specific technologies ActiveFence uses to run its platform, Matar’s experience with the Insight Fellows Program, and MedTech’s potential capabilities for predicting brain seizures.
In this episode you will learn:
  • How ActiveFence helps its customers to moderate platform content [05:36]
  • How ActiveFence finds extreme social media users trying to evade detection [16:32]
  • How to monitor live-streaming content and analyze it for dangerous material [29:13]
  • The technologies ActiveFence uses to run its platform [35:54]
  • Matar’s experience of the Insight Fellows Program (Data Science Fellowship) [40:28]
  • Leadership opportunities for women in STEM [1:00:41]
  • Israel’s R&D edge for AI [1:13:19] 

Podcast Transcript

Jon Krohn: 00:00:05

This is episode number 683 with Dr. Matar Haller, VP of Data and AI at ActiveFence. Today’s episode is brought to you by Posit, the open-source data science company, by Anaconda, the world’s most popular Python distribution, and by WithFeeling.ai, the company bringing humanity into A.I.
00:00:23
Welcome to the SuperDataScience podcast, the most listened-to podcast in the data science industry. Each week, we bring you inspiring people and ideas to help you build a successful career in data science. I’m your host, Jon Krohn. Thanks for joining me today. And now let’s make the complex simple.
00:00:55
Welcome back to the SuperDataScience podcast. Today I’m joined by the wildly intelligent data scientist and communicator, Matar Haller. Matar is the Vice President of Data and AI at ActiveFence, an Israeli firm that has raised over $100 million in venture capital to protect online platforms and their users from malicious behavior and malicious content. She’s renowned for her top-rated presentations at leading global data science conferences. She previously worked as Director of Algorithmic AI at SparkBeyond, an analytics platform. She holds a PhD in neuroscience from UC Berkeley, and prior to data science, she taught soldiers how to operate tanks. Today’s episode has some technical moments that will resonate particularly well with hands-on data science practitioners, but for the most part, the episode will be interesting to anyone who wants to hear from a brilliant person on cutting-edge AI applications. In this episode, Matar details the “database of evil” that ActiveFence has amassed for identifying malicious content; how contextual AI considers adjacent and potentially multimodal information when classifying data; how to continuously adapt AI systems to real-world adversarial actors; the machine learning model deployment stack she uses; the data she collected directly from human brains using recording electrodes and how this research relates to the brain-computer interfaces of the future; and why being a preschool teacher is a more intense job than the military. All right, you ready for this captivating episode? Let’s go.
00:02:27
Matar, welcome to the SuperDataScience podcast. It’s awesome to have you on the show. Where are you calling in from?
Matar Haller: 00:02:33
I’m calling in from Israel, sunny, sunny Israel. So thanks for having me. 
Jon Krohn: 00:02:37
Sunny, sunny Israel. Is that always true? Always Sunny Israel. 
Matar Haller: 00:02:40
Mhm, most of the time it’s pretty sunny. We have like two seasons. One is really long and it’s really, really hot. And the other one is shorter and beautiful and not as hot. But still, we have a lot of sun and that’s not- 
Jon Krohn: 00:02:54
[crosstalk 00:02:55] beaches. 
Matar Haller: 00:02:56
We have very nice beaches. We, it’s, we have tropic-like areas that are like more green and nice, and forests, wildflowers, mountains. Not all camels and deserts, although we have that too. 
Jon Krohn: 00:03:09
Cool. Well, I guess it isn’t cool, but I, sounds hot, but I will have to visit there sometime. I actually, I have a grandmother who recently visited and said that it was her favorite place she’s ever been. 
Matar Haller: 00:03:22
Oh, wow. Nice. So come visit I’ll introduce you to my chickens. 
Jon Krohn: 00:03:29
There you go. This episode brought to you by the Israel Tourism Board. And, but you do travel a lot as well. So you were recently in New York. You were at MLconf, the Machine Learning Conference, in New York, which I wasn’t able to make it to this year, but you were a speaker at MLconf. And Deborah Williams, who’s a friend of mine and the acquisitions editor at Pearson that I’ve worked with for the books that I’ve created, all the video content I’ve created, she wrote me a long email summarizing how MLconf had gone. And she said that by far the best speaker hands down, and not just her opinion, but the opinion of “everyone that she spoke to”, was that you, Matar, were by far the best speaker at MLconf. So I was like, well, get her on the show.
Matar Haller: 00:04:21
So that’s very, very flattering and now like, take your expectations and lower them. Thank you. Very, very flattering. Thank you. That was a fun, that was a fun conference. There’s lots of interesting ideas and good, good talks. So it, it was a, if she said that, there, it’s, there was a high bar. So thank you. 
Jon Krohn: 00:04:40
And so let’s dig into what you do. So you are the VP of Data and Artificial Intelligence at ActiveFence, which is a platform for content moderation, harmful content detection, and threat intelligence. And so to be clear, ActiveFence is not a company that is doing the content moderating. It’s not like there’s this army of people at ActiveFence that are monitoring for harmful content, but you provide tools, data- and AI-enhanced tools, that allow your customers to be able to do that content moderation themselves more efficiently. And this seems to be quite a good niche. I could see on Crunchbase that ActiveFence has over a hundred million dollars in funding. So yeah, it seems like a very valuable niche to be filling for your customers. So tell us a bit about what this means. How do you use AI to be moderating content? How’s that useful for threat intelligence, that kind of thing?
Matar Haller: 00:05:42
Sure. So, ActiveFence, you’re right. Like we are a platform that basically, our clients are any company that has user-generated content. So whether it’s, you know, comments or chats or uploading videos or audio or any place that you have a user that’s able to upload content, there’s a potential for misuse of that and for uploading malicious content. And our goal, our mission is basically to help platforms ensure that their users are safe, that they have safe online interactions. And so we do, we provide the tools to help them, to help them do that. And really, this is one of the biggest challenges that faces UGC platforms, platforms with user-generated content: basically, how can they detect this malicious behavior, especially since, as we know, items can be in any format, right? So we need to be able to detect whether it’s video, audio, text, images, all of that. And also it can be in any language, and it can also be any number of violations, right?
00:06:44
So you have sort of these, these big ones that, you know, you say absolutely not. Like I do not want child pornography. I do not want terror. I do not want white supremacy. But, but there’s, there’s like many, many, many more, and different, different companies, different platforms have different levels of sensitivity to it, right? Even something that you would say is as blatant as, I do not want child pornography. No one wants child pornography on their platform. But let’s define it, right? What does that mean? Is, you know, baby’s first bath. Is, is that, is that something that we need to be aware of?
00:07:12
And so the tools that we provide need to be sort of contextually aware of, you know, the policy the way that things are being used or presented. And so for me and for all, you know, my teams it’s a super, super interesting space to be in because not only are the algorithms that we use really exciting and sort of interesting, but I think the application right, we’re not, we’re not selling air, like we’re actually making it like impact, like making a real impact on like human interactions in a positive way. 
Jon Krohn: 00:07:45
Right, so to what extent can you tell us about those exciting algorithms? 
Matar Haller: 00:07:51
So, that’s, that’s an excellent question. Thank you for asking. So I think that, so there’s many different levels of things that we can do. So the first thing is that we sort of, we have our, a platform, right? And this, this is a platform that basically enables users, or like moderators, to come in to view the content, to look at sort of where, where it is, and then to make a decision whether or not something should be removed, right? And this is the platform that we provide to our users in order to basically ensure that we’re able to protect the wellbeing of the moderators and to make sure that they’re only seeing things they actually need to see in order to be more efficient. There’s absolutely no need to review everything. Most of the things are benign, and even within the things that are harmful, there isn’t really any need to view everything. In that, in that case, basically you want to make sure that you have some sort of automated content moderation on top. And that’s where sort of we, we come in. Yes.
Jon Krohn: 00:08:49
I guess that that ends up being important for the mental health of the people who are doing the content moderation as well, because I’ve read how for people in those roles, it can be quite a harrowing experience when you’re just watching beheadings and child porn all day.
Matar Haller: 00:09:02
Absolutely. Absolutely. Content, like moderator wellbeing is a huge, huge, huge issue. It’s in the news, like periodically it comes up as like this, this, this huge thing. And, and ActiveFence is like very, very concerned about this, right? We deal with data that is not pleasant, right? And so in the same way that I actively work to protect my data scientists and my engineers from exposure to this, and only when it’s really needed and with a lot of safeguards, we want to make sure that, you know, we’re all human and want to make sure that moderators are also protected in the same way. And so if there’s things that are sort of blatant, you know, a beheading, why do they need to watch that? There really isn’t a need, right? There’s things that are clearly, obviously violations are clearly obviously malicious and should just and should just be removed and banned.
00:09:52
And so the algorithms that we use, basically we use what we, we what, what we call contextual AI. What this means is that we look at sort of the item in the context that it is being used but also within the item, right? We have a, our data model basically enables us to take sort of an item even if it’s just an image, and start breaking it apart into the components that it has, so that then we can build those together into a coherent risk score where this risk score can take into account, you know, what, like, do we see any like weapons? Do we see any known logos? Do we see any known people of interest that we know have, you know, from their history or whatever, we know that they’re, you know, spewing hate speech or misinformation and so forth? 
00:10:36
And then all those components together can combine to basically say, yes, this item is very probably, very probable to be risky. And so that’s sort of how we, we build the full picture. And then, of course, there’s other layers of that, right? Even for example, for chats, right? You can say, well, I can just use keywords, right? Like, if I find the “N” word, then clearly this is very violative. But what if it’s someone saying, please don’t call me that. Or what if it’s a rap song? Or what if it’s you know, someone like, you know like community sort of re-owning a word. And so like, you know, I, you know, I’m proud to be a whatever some, some slur. And so in those cases, clearly I don’t want to ban that. And if I’m just doing keywords which are sort of contextually unaware then I lose that ability. And so in those cases, we do need to use sort of language models that are more contextually aware. And these language models need to be trained and tuned on these specific cases. Because these are the cases that are always interesting. 
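To make the idea of combining contextual components into a single risk score concrete, here is a minimal illustrative sketch in Python. The signal names, weights, and the way they are combined are hypothetical examples for illustration, not ActiveFence’s actual detectors or scoring logic.

```python
# Hypothetical sketch: fuse per-component detector outputs (object detection,
# logo matching, user history, contextual language model) into one risk score.
# Signal names, weights, and values are made up for illustration.

def combine_risk_signals(signals, weights):
    """Weighted average of detector scores, each in [0, 1]."""
    total_weight = sum(weights[name] for name in signals)
    if total_weight == 0:
        return 0.0
    return sum(signals[name] * weights[name] for name in signals) / total_weight

signals = {
    "weapon_detected": 0.85,    # object detector on the image
    "known_hate_logo": 0.90,    # match against an analyst-curated logo list
    "user_history_risk": 0.40,  # prior behavior of the uploading user
    "caption_toxicity": 0.20,   # contextual language model on title/comments
}
weights = {
    "weapon_detected": 1.0,
    "known_hate_logo": 2.0,
    "user_history_risk": 1.0,
    "caption_toxicity": 1.5,
}

risk = combine_risk_signals(signals, weights)
print(f"combined risk score: {risk:.2f}")
```

The point is simply that no single signal decides the outcome; the context around the item moves the score up or down.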
Jon Krohn: 00:11:37
That does sound really interesting. And it sounds like the kind of thing that in this brave new world that we have of these really powerful large language models that this is the kind of thing that they could do really well, that a few years ago it might have been a lot tougher. And so it’s great that you, that you’re presumably able to leverage these kinds of new technologies, especially these kind of multimodal technologies that are emerging. So I don’t think it’s available to the public yet, but GPT-4 has this image component where you can have an image, you can provide an image, a photo of your fridge, and ask GPT-4, “What can I cook?”, based on the ingredients that you see in this image. And so that kind of multimodality, it sounds like it’s something that you’ve been working with for a while. 
Matar Haller: 00:12:30
Yeah, we’ve been looking a lot at multimodality because for you know, if I’m going back to the child pornography example, because that’s, for people something that’s like so obvious, right? Like, you should be able to know whether something is child porn or not. Like we all sort of viscerally know what is bad. And yet sometimes you’ll see, you know, a picture of a child and it looks fine, but, you know, it’s sort of like only the people that are in the know will know that, you know, whether like that’s a face of a victim that’s known, or in the comments, there’s links to off-platform sites or something about the angle or a logo that’s like, the picture itself is benign, but there’s a logo that’s associated with a studio that’s been associated with child porn or the title or the description.
00:13:12
And so sometimes it’s enough to look at the image, sometimes it’s enough to look at the surroundings. But oftentimes it’s the combination. And I mean in terms of like this, the generative AI, right now is this sort of like perfect storm for trust and safety, right? Because we’re gonna be having sort of US elections soon, and so political disinformation is something that’s like very, very pertinent. And is now sort of having these large language models sort of lowers the bar for the entry of bad actors, right? Suddenly, if it used to be that things could either be like really high quality, but low scale or low quality and easy to catch in high scale, now that’s not an issue, right? And so it’s like an enabling technology and, you know, can, it’s obviously, I don’t, like no fear mongering here, I think it’s like, has a lot of good that it can do. But we need to be aware of how it can be used and how we can kind of be prepared for it.
Jon Krohn: 00:14:15
This episode is brought to you by Posit: the open-source data science company. Posit makes the best tools for data scientists who love open source. Period. No matter which language they prefer. Posit’s popular RStudio IDE and enterprise products, like Posit Workbench, Connect, and Package Manager, help individuals, teams, and organizations scale R & Python development easily and securely. Produce higher-quality analysis faster with great data science tools. Visit Posit.co—that’s P-O-S-I-T dot co—to learn more. 
00:14:50
Yeah, no question. So we, you know, famously in the United States, in the 2016 election cycle, there was a lot of, there were foreign actors involved in creating disinformation in Eastern European kind of farms. And you can imagine, exactly like you’re saying, these kinds of tools like GPT-4 make it a lot easier to create a lot more content, because you don’t need to have a human typing out everything, so it’s so much cheaper. It’s probably like several orders of magnitude less expensive to be generating malicious content, misleading content, disinformation. So yeah, it is interesting that yeah, heading into yeah, and I mean, I guess we’re always heading into an election cycle somewhere. [crosstalk 00:15:31] And so it’s something-
Matar Haller: 00:15:33
It never ends. 
Jon Krohn: 00:15:34
Like, yeah, it’s crazy in the US to me that people in the, in the lower house, that they have a two-year election cycle, and so like, you spend a few months legislating and then it’s back to fundraising.
Matar Haller: 00:15:48
Exactly.
Jon Krohn: 00:15:48
It’s wild. 
Matar Haller: 00:15:50
Yeah. But, but, but I think it’s, what’s interesting is that like disinformation is only one aspect of it, right? We’re seeing like computer generated or, you know generative AI-generated child pornography. And then at this point, the question is it still violative? And I think yes, right? Like, we don’t want that stuff out there. I don’t care whether something is real or fake. It’s, it’s still child porn and it should be, it should be banned. And then, and then there’s a second level of like, well, unless I’m trying to find who the victim is, and then I do care, and then there’s like another level of detection that needs to be built on top of that. 
Jon Krohn: 00:16:27
Is it tricky? I mean, it must be tricky. Something that must add an extra level of complexity to this is that presumably the nefarious actors out there are constantly shifting and trying to evade detection by you. So, when many of our listeners and myself, when we’re building machine learning models, we don’t have to worry about somebody trying to outwit the model. You know, like you can build a machine learning classifier to detect images of cats and dogs, and it’s not like the cats are like trying to look like dogs and are gonna come up with ways of like, dressing to look more like dogs. So, …
Matar Haller: 00:17:08
Yeah, no, I mean, I think I may think about it in terms of like, how, how is what I do different from car detection, right? Like that’s sort of, you know, or cat detection or anything. Or, yeah. Anyway, there’s a million examples. And so I think there’s, in addition to the fact that it’s evasive and adversarial, right? So there’s, you know, examples of like QAnon, which is a group that’s banned on some platforms, will, you know, change their text to be cue, like C U E. And then you have to basically catch it by knowing to look for that. And to know to look for that, that’s already subject matter expertise. And that’s one thing [inaudible 00:17:44] we have, you know, you mentioned threat intelligence. And so we have intelligence analysts that this is what they do, right?
00:17:49
They, they’re experts in, you know, misinformation or in hate speech and or in terror and research these groups, they know about sort of the latest hashtags or trends like what emojis they’re using now and so forth. And then basically that is able to, you know, they’re able to sort of surface data that’s relevant. You know, for example, you know, the latest, there’s, you know a hate group that was founded in like this last June or this last October, and they’re already, you know, on, on different social networks with their logos spewing hate. And so to catch those, to know those to put, and then those can feed back into our algorithms, and I can know to look for those logos, to look for those phrases, to look for those actors. And so then I’m able to sort of stay on top of it on the fact that yes, it’s adversarial.
00:18:35
I have the subject matter expertise. It’s extremely non-stationary, right? I have new actors coming up all the time, right? If I’m looking for cats and dogs, like how much have they really changed? They’re, they’re not gonna change, right? Right. They’re gonna have four legs, a tail, and ears, versus here the landscape is very non, because it’s adversarial, it’s extremely non-stationary. And so that’s why I need to have my subject matter experts that are constantly feeding me more information of like, oh, this is a new slang term. Oh, this is a new slur. And so forth.
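As a rough illustration of how analyst-supplied terms and known spelling evasions (like “cue” for Q) might be matched in code, here is a hedged sketch; the substitution map and watchlist are invented examples, and a real system would layer contextual language models on top of anything this simple.

```python
# Hypothetical sketch: normalize common evasion tricks (character substitutions,
# punctuation inserted to break words apart) before matching against an
# analyst-maintained watchlist. The mappings and terms below are made up.
import re

SUBSTITUTIONS = str.maketrans({"0": "o", "1": "i", "3": "e", "4": "a", "5": "s", "@": "a", "$": "s"})

# Canonical terms plus known aliases, as supplied by intelligence analysts.
WATCHLIST = {"qanon", "cue anon"}

def normalize(text):
    text = text.lower().translate(SUBSTITUTIONS)
    return re.sub(r"[^a-z ]", "", text)  # strip punctuation used to split words

def flag(text):
    cleaned = normalize(text)
    collapsed = cleaned.replace(" ", "")  # also catch "q a n o n"-style spacing
    return any(term in cleaned or term.replace(" ", "") in collapsed for term in WATCHLIST)

print(flag("follow the Q-A-N-0-N drops"))  # True: substitutions and punctuation stripped
print(flag("cue anon latest"))             # True: analyst-supplied alias
```

The flywheel Matar describes is what keeps a list like this current: analysts add the new slang, slurs, and logos, and the automated layer immediately starts catching them.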
Jon Krohn: 00:19:04
Is there a flywheel that gives ActiveFence a defensible moat between this content moderation and the threat intelligence? So I’m just kind of, I was, I just kind of had this brainwave here, and you can correct me if I’m thinking about this incorrectly, but, so you have the content moderation aspect of the platform. So machine learning models are detecting “Hey, you know, we think that there’s, this is high-risk content over here” automatically. And then maybe that automation can assist the threat intelligence part of the company. And then the threat intelligence people in turn are keeping tabs on what’s going on through a combination of more manual intelligence work, as well as this automated, automatically assisted intelligence work that your content moderation side is helping with. And then they can feed back into the content moderation. Like, “Hey, like, here’s something else you need to be able to look for. We need to train a machine learning model to be able to check this kind of thing”. Because yeah, like, you know, there’s this new logo that you need to be looking out for.
Matar Haller: 00:20:02
Totally. You nailed it. Yeah. We love flywheels at ActiveFence. We always say like, no strategy deck is complete without a flywheel. And so, and so, absolutely. It’s exactly as you described. So we have our intelligence analysts that are, you know, finding, finding things, right? Those feed into the algorithms, make our algorithms better, and then we have incoming data collection, whether the data is like we go out and proactively collect it, or we get data, you know, clients are sending us data being like, “Hey, is this violative or not?” and then that’s basically can then be fed back into the intelligence analysts. So for things that come from clients, sometimes it’s things that they haven’t seen before that they, that they don’t know. Often they do know it, but because we’re also out there collecting data proactively, then that’s basically able to feed back in.
00:20:46
And one core component of this flywheel is something that is, we’ve, we’ve, it’s our proprietary database. It’s sort of a database of violative content. And what this means is that basically we have data that we’ve already identified, whether it’s images or audio or videos or texts that we’ve already identified as being violative of a policy or malicious, we can hash that. And then new content that comes in can be compared to that, right? And then that also helps us, first of all, to be more efficient. But also we can proactively enlarge it. We don’t need to wait for data to come in, right? We can proactively enlarge this proprietary database by going out there, going to sources that we know are problematic, and that’s what our intelligence analysts can help us with. And then next time that it comes in, basically we’ve, we’ve already seen it before. And so there’s definitely this sort of interaction, the flywheel between the intelligence analysts, the humans, and the AI, one sort of feeding off the other.
Jon Krohn: 00:21:43
Yes. So this is the database of evil that you’ve talked about publicly before, right? And so to give a bit of an analogy, this is kind of like how antivirus solutions have a database of known viruses, and then if that line of code that’s known to be malicious turns up on your own hardware, it can be compared against this database, and you can say, okay, this is the threat. We need to remove this part of your file system. So similarly, you have this database of bad content, of harmful content, and yeah, it’s proprietary. So, okay, that, I think that, yeah, you have more to say about that.
Matar Haller: 00:22:24
So my, the more thing that I have to say about that is only to bring us back to the, to the idea that we’re in an adversarial space. And if we sort of keep this in mind, it’s like we’re in a space that’s adversarial or evasive, that requires subject matter expertise. It’s non-stationary, it’s multi-dimensional. And so once we keep those, and, and it requires context, and once we keep those in mind, then it sort of helps us frame how we want to use this database of evil, or this proprietary base, but also like what it needs to sort of be robust against, right? So if we’re in a place that requires subject matter expertise, sure, then we can keep enlarging our data, like all of our databases, right, with our intelligence analysts. And it’s non-stationary, so we need to make sure that it’s always updated. Like having a snapshot of this database isn’t enough, right? There’s gonna be new things.
00:23:15
But also if we’re, if we’re in a place that’s inherently adversarial, then we need to make sure that this database is also robust to adversarial manipulations. What does that mean? For example, if I have like a very hateful song like, you know, glorifying the Holocaust, for example like a love song glorifying the Holocaust, right? These things exist. Then, and I know that this is banned on platforms, then I can speed it up, right? And then in the comments, or in the title or summary, I can say, listen to this at half speed. And then now, now I’ve basically like made it against all, you know, made it past all kinds of defenses. 
00:23:54
And so we need to make sure, and the same thing with images, right? I can like rotate it, I can grayscale it, I can mirror it, I can do all sorts of things. And so I need to make sure that my hashing any hashing algorithm that I have is robust to these manipulations up to a point, right? Because I’m always gonna, it’s always this idea of like precision versus recall. Like, do I want to now unfairly capture things that shouldn’t be captured, right? And unfairly say that they are violative, probably not. And so it’s a tricky line, but it’s, it’s, that’s the line that is when any content moderation algorithm we’re always trying to figure out what, where things should, where the boundaries should go. 
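For readers who want a feel for how a hash database can stay robust to light manipulations, here is a simplified sketch of a perceptual “difference hash” with a Hamming-distance lookup. This is a generic, well-known technique used purely for illustration; it is not ActiveFence’s hashing algorithm, the file names are placeholders, and audio and video fingerprinting follow the same near-duplicate principle with different features.

```python
# Hypothetical sketch: a perceptual difference hash (dHash) plus Hamming-distance
# matching against hashes of previously confirmed violative images. Unlike a
# cryptographic hash, small manipulations (grayscale, mild resizing, re-encoding)
# change only a few bits, so near-duplicates still match.
from PIL import Image

def dhash(path, hash_size=8):
    img = Image.open(path).convert("L").resize((hash_size + 1, hash_size))
    pixels = list(img.getdata())
    bits = 0
    for row in range(hash_size):
        for col in range(hash_size):
            left = pixels[row * (hash_size + 1) + col]
            right = pixels[row * (hash_size + 1) + col + 1]
            bits = (bits << 1) | (1 if left > right else 0)
    return bits

def hamming(a, b):
    return bin(a ^ b).count("1")

# Placeholder file names standing in for a store of known-violative content.
known_hashes = {dhash("confirmed_violation.jpg")}
candidate = dhash("new_upload.jpg")

if any(hamming(candidate, h) <= 10 for h in known_hashes):
    print("near-duplicate of known violative content -> escalate")
```

The distance threshold is exactly the precision-versus-recall line Matar mentions: loosen it and you catch more manipulated copies, but you also risk matching benign content.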
Jon Krohn: 00:24:31
And I guess that’s key to why having a human in the loop in these kinds of decisions matters, so that people can, if they’re unfairly forced to remove content, there should be some kind of appeals process in the platform, or yeah, or a human reviewer that can make some final decisions. So I guess kind of going back to a point earlier in the conversation, when something is really flagrant and your risk score, which I assume is kind of similar to, in machine learning, having a binary classifier where you have a confidence on, yeah, how likely is this to be malicious content, harmful content? And so if that is very high, if you’re like, okay, this is 0.99999, we’re just like, there’s no point in sending this to a human to review. But if it’s 0.8 or 0.7, then like there might be something here somebody should review before a decision is made. And yeah, same thing on the flip side, when it’s, yeah, even things that are, yeah, so if something does get flagged automatically, because there’s still, everything is probabilistic in machine learning.
00:25:35
So there’s gonna be cases where the algorithm is very confident and there’s still, due to some circumstance you’re describing where like a group has re-owned something that has been a racial slur historically, there should be the opportunity for that person to say “no, like, I have,” like “I should be able to”, yeah, so these, so it kind of works both ways.
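A tiny sketch of the routing logic Jon describes, with hypothetical thresholds; in practice each platform sets these bands according to its own policy and appetite for manual review.

```python
# Hypothetical sketch: route items by model risk score. The thresholds are
# illustrative only; a children's platform might auto-action far more
# aggressively than a news platform, and auto-actions should stay appealable.

def route(risk_score, auto_action=0.99, review=0.70):
    if risk_score >= auto_action:
        return "auto-remove (appealable)"
    if risk_score >= review:
        return "queue for human moderator"
    return "allow"

for score in (0.999, 0.82, 0.15):
    print(score, "->", route(score))
```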
Matar Haller: 00:26:00
So our risk score is exactly as you describe, and like, everything is probabilistic, and also it’s, it’s a business use case where they want to, like, how much manual review they want, right? Like maybe for a child’s, like a platform for children, they say, you know what, just ban everything. Like, so what if the kids like can’t chat about, you know, it’s fine. But maybe for like other platforms, like a news platform-
Jon Krohn: 00:26:23
The kids can’t chat about racial slurs. 
Matar Haller: 00:26:26
Yeah, but fine, I, I’m okay with that. Like, I don’t care for me. My kids can just like type three words. That’s okay. And my, my kids anyway, they’re getting cell phones when they’re 35. Like until then, deal with it. But yeah, but so, so it’s exactly that, right? But other platforms would be like, you know what, even if it’s at, at 0.99, like for the things that are 0.99s, like, we, we want to review them, because we’d, we would rather err on the side of like free speech or whatever. And so I think that also in terms of appeals, that’s a super important point. That’s something that is definitely critical in this, because this is like, you know, people are posting things and they don’t want to be unfairly punished.
00:27:05
And actually right now I think that the world of trust and safety is having its GDPR moment, right? Like GDPR, for those that are not familiar, was like privacy regulation passed in the EU that ended up having a huge sweeping effect, because basically any time that a citizen of the EU is on an online platform, then GDPR is in effect, like, applies to them, in terms of, like, you know, what cookies and what can be stored and, and so forth. And you’ve probably all seen, like, you know, the notifications on your browsers about privacy regulations. And so now trust and safety is having its GDPR moment with the DSA, which is the Digital Services Act. It was passed by the EU last year, and it basically also puts in protections for trust and safety. It codifies them by law in sort of, in a similar way, with fines and so forth. And while it’s still new for smaller tech companies, the very big online companies, it’s like already rolling in and rolling out for them, and there’s like very strict regulations on them that they need to follow. And it’ll, it’ll trickle down probably to everyone. And so like regulation, fines are on the table.
00:28:15
And so these businesses need these tools to be compliant. And part of that is also auditing and understanding. Like, part of the DSA is like auditing and understanding why things were banned and, and explaining it and so forth. And so another thing that we invest in is explainability, right? Like if I’m giving a score, then I want to be able to explain why. Like, because a lot of times these things are, you need that subject matter expertise to understand that, oh, like this particular logo is actually associated with this particular terror group or hate group, or child pornography studio or whatever.
Jon Krohn: 00:28:48
Nice. Yeah. So I can see how the evolving regulatory landscape ends up being important, probably helpful to you as you develop these algorithms. We’ve talked already about harmful content kind of in static, you know, in posted content. But there’s also, there’s something that we hear a lot in the news recently, we, we see increasingly in the news, is not just content that’s been posted, but content that is streaming real-time. So there have been incidents in the US recently of shootings being live-streamed to social media platforms. And so this happening in real-time, that must add an extra layer of complexity to some of the work that you’re doing.
Matar Haller: 00:29:40
Absolutely. There’s been like horrific instances of live streaming in the US and, and elsewhere. And so there’s a couple of ways to approach that. One of which is that you, we can put really, really small content moderation models sort of on the edge device, right? So that, that does sort of something basic to catch sort of the blatant stuff and then, you know, raise it up for, for human review. Cause I think in these, in these cases of, of live streams, it’s, it’s tricky. We’re still learning the space, and we would want someone to just, like, if we flag it, to take a look at it, maybe err on the side of flagging too much, and then having someone take a look at it. Again, it’s, it’s always, it’s a business question of like, what’s the platform? What, what are we looking for?
00:30:21
And so, you know, have some sort of like detector of, I don’t know, gunshots or of something that, you know, a small [inaudible 00:30:29] model on the edge device is able to flag right away. We can also do something where we are, you know, once the content makes it to the servers, sample frames, like there’s a question of like, what do you want to moderate? Like every single frame? Do you want to sample every minute, every second? Like at what, there’s, there’s a huge question. I think what makes this so challenging is just the scale, right? You have like so much data streaming in, and then do the same thing, sample, and then look for maybe more, more complex things, right? So those are sort of like the typical things, and that’s where you’re really focusing on the content itself, right, that’s coming in.
00:30:58
But a lot of these times when, when you’re, when you’re live streaming something like this, then you know the perpetrator may have like, you know, pre-shared this somewhere. There’s people that are, you know, joining the stream that are commenting, and now suddenly you have a much, much richer source of information, right? You can look at who are the other users, who is the user that’s streaming, what else have they streamed in the past? What other groups have they been in? What are people writing in the comments, and so forth. And suddenly now you might be able to catch it, or at least flag it, just from the surrounding information, right? Like there’s, there’s enough indicators of risk from the things that are around it where sure, you want to moderate the content, you want to look at it and so forth. However, you would want to basically look at the other markers of risk around the content itself to make your job easier and faster, and more efficient.
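Here is a hedged sketch of how sampling frames at an interval, running a cheap first pass, and escalating flagged segments along with surrounding signals could look in code. The sampling interval, thresholds, and the `cheap_model`, `heavy_model`, and `context_risk` callables are all placeholders, not real ActiveFence components.

```python
# Hypothetical sketch: sample frames from a live stream at a configurable
# interval, run a cheap first-pass model (the kind that could sit on an edge
# device), and escalate flagged segments together with surrounding signals
# (comments, streamer history) to a heavier model or a human reviewer.

SAMPLE_EVERY_N_FRAMES = 30   # e.g. roughly one frame per second at 30 fps
FIRST_PASS_THRESHOLD = 0.5   # deliberately low: err on the side of flagging
ESCALATE_THRESHOLD = 0.8

def moderate_stream(frames, comments, streamer_profile,
                    cheap_model, heavy_model, context_risk):
    for i, frame in enumerate(frames):
        if i % SAMPLE_EVERY_N_FRAMES:
            continue                                  # skip unsampled frames
        if cheap_model(frame) < FIRST_PASS_THRESHOLD:
            continue                                  # benign at first pass
        combined = max(heavy_model(frame),
                       context_risk(comments, streamer_profile))
        if combined >= ESCALATE_THRESHOLD:
            yield i, combined                         # flag this segment
```

Tightening the sampling interval or lowering the thresholds trades compute and reviewer load against how quickly a violative stream gets caught.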
Jon Krohn: 00:31:44
Did you know that Anaconda is the world’s most popular platform for developing and deploying secure Python solutions faster? Anaconda’s solutions enable practitioners and institutions around the world to securely harness the power of open source. And their cloud platform is a place where you can learn and share within the Python community. Master your Python skills with on-demand courses, cloud-hosted notebooks, webinars and so much more! See why over 35 million users trust Anaconda by heading to www.superdatascience.com/anaconda — you’ll find the page pre-populated with our special code “SDS” so you’ll get your first 30 days free. Yep, that’s 30 days of free Python training at www.superdatascience.com/anaconda
00:32:29
Gotcha. So, yeah, so it obviously is more complex to be moderating harmful content when you are thinking about it in a real-time situation. But as you point out, having smaller threat detection models on the edge device, so on mobile phones, maybe on laptops, being able to detect these issues in real-time and potentially flag those to the social media platform. And then also once the real-time data is reaching the servers of these platforms, you can be sampling at some appropriate interval in order to try to detect harmful content, so yeah, the sound of gunshots, and then it can be reviewed as to whether this is like video game gunshots or not. And then something, so probably in a circumstance like that, whether it’s real-world gunshots or video game gunshots, we’re going to be able to tell more easily because of the contextual information that surrounds that. So the kind of text that people post in response is probably going to be quite different and classifiably different in a video game, where people there might be more like, well, I don’t want to even speculate on what-
Matar Haller: 00:33:41
Yeah, love, yeah. But, but, but I would say that, you know, if we’re thinking, if we’re thinking about it, and like, I’m kind of thinking out loud and like refining what I said earlier, is that, so you have something on the edge device that does like, you know, more basic, like a smaller model that does more basic content moderation. And then instead of it flagging to a human, like remember that everything is making it to the cloud. And so we’re sampling. And so things that have been flagged by the edge device can then either have like more like different, like more tightly spaced samples, or can have deeper analysis on them. Like, it, it’s, it’s basically a funnel, right? And again, it depends what you discover on the edge device. Like maybe you might want to right away flag it too and be like, listen, like this is not something that, that is very likely to be in the gray zone.
00:34:23
And then also you can look at the surrounding content. You don’t need to wait for the content to be uploaded to serve anything, right? Like, you have the surrounding content, it’s like text and who the user is, and like, where, you know, do, do we recognize this user? Where else have they posted before? The users that are commenting, like, where else are they? And this is where actually, like a graphical data model comes in handy, right? Because now you have all these relations between users and you can see like, what have they liked before, what groups are they in who have they interacted with, and so forth. And then if these are people that are known to us, then we can say, well actually, like this is a user that if we see them here, this is, it adds to the probability of risk.
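To illustrate the graphical data model Matar mentions, here is a toy sketch using networkx in which a streamer’s risk is nudged by how risky their known connections are; the graph, relations, and scores are fabricated examples, not real data.

```python
# Hypothetical sketch: a tiny graph of user relations (shared groups, comments,
# likes) used to compute a contribution to a streamer's risk from the users
# around them. All nodes, edges, and prior scores are made up for illustration.
import networkx as nx

g = nx.Graph()
g.add_edge("streamer_42", "user_a", relation="same_group")
g.add_edge("streamer_42", "user_b", relation="frequent_commenter")
g.add_edge("streamer_42", "user_c", relation="liked_posts")

prior_risk = {"user_a": 0.9, "user_b": 0.7, "user_c": 0.1}  # known-user scores

neighbours = list(g.neighbors("streamer_42"))
neighbour_risk = sum(prior_risk.get(u, 0.0) for u in neighbours) / len(neighbours)
print(f"risk contribution from surrounding users: {neighbour_risk:.2f}")
```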
Jon Krohn: 00:35:01
All right. So Matar, you’ve given us an interesting overview of how content moderation works in your automated platform for detecting harmful content. So things like contextual AI needing to be able to adapt to adversarial opponents, the flywheel between content moderation and threat intelligence that’s helpful to you, the “database of evil” and how there’s flexibility in the way that information’s hashed in there so that you can be detecting new variations that are adjacent to existing known harmful content. And then most recently we just talked about the specific circumstances of real-time streaming and how we can be addressing harmful content in those circumstances. So very interesting. And so I’m curious to what extent you can tell us about the kinds of technologies that you use to make the platform happen. So, you know, what kinds of programming languages, obviously we’re not, you can’t get into a level of detail that would allow adversarial actors to be more effective than adversarially-
Matar Haller: 00:36:09
Adversarial actors listen in now. Yeah, so I, and again, I’m giving it with the caveat that like, the parts of that that I deal with are sort of like the, the data, the MLOps, the engineering, the API. The world of front end is beautiful, mysterious to me. So I can like list technologies that they use there, but they don’t mean that much to me. I’m more, I’ve always been sort of like a backend geek. And so in terms of that, we do, obviously data people do Python, duh. We also use Node and TypeScript. We serve our models on Kubernetes. We have, we’ve done a lot of work in-house of selecting, of like writing stuff to basically select the correct instance type for a given model so that you get really good, the best utilization.
00:37:01
We’ve also, are working on like model-in versus model-out, being like, do we bake the model into the image, or when we spin up the pod, like do we bring the model in from outside, basically in order to maximize our, or minimize our, uptime? Because we basically need to be able to deal with really, really high throughput and low latency. And also we don’t want to be just like burning money on machines that are up for no reason. And so we have HPA that we then tune, and then we can basically spin up and spin down our machines as we need, and then be smart about which machines we’re spinning up. And also if we’re able to sometimes sort of put multiple models on the machine, or batch the requests to the machine, and we do all sorts of optimizations to make sure that we’re meeting our high-throughput SLAs.
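As a flavor of the throughput-versus-latency optimizations Matar describes, here is a hedged sketch of micro-batching inference requests: hold each request only briefly, run the model once per batch, and cap the wait so latency targets still hold. The queue shape, limits, and model call are placeholders, not ActiveFence’s serving code.

```python
# Hypothetical sketch: micro-batch incoming requests so the model runs on
# batches (better utilization and throughput) while capping how long any
# single request can wait (latency). Limits below are illustrative only.
import queue
import time

MAX_BATCH_SIZE = 16
MAX_WAIT_SECONDS = 0.02  # never hold a request longer than ~20 ms

def serve_loop(request_queue, model):
    while True:
        batch = [request_queue.get()]            # block for the first request
        deadline = time.monotonic() + MAX_WAIT_SECONDS
        while len(batch) < MAX_BATCH_SIZE:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break
            try:
                batch.append(request_queue.get(timeout=remaining))
            except queue.Empty:
                break
        inputs = [req["input"] for req in batch]
        for req, score in zip(batch, model(inputs)):  # one forward pass per batch
            req["reply"](score)                       # hand the result back
```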
Jon Krohn: 00:37:52
Nice, very interesting, thank you for being able to go into even that level of detail. So clearly you have a really deep understanding of not just data science and modeling, but of backend engineering. So like scaling, being able to meet SLAs, Kubernetes. So, super interesting. I didn’t know from my research beforehand that you had that kind of expertise as well. So let’s dig into your background a little bit to see how this all came about. So you did a neuroscience PhD at UC Berkeley which is I think, a great decision. I also did a neuroscience PhD. So I, for me, that was something that I got into it because I was fascinated as to how chemicals, biology, physics create a conscious experience. And so like everything that you think, everything that you do in some way that we obviously are nowhere near fully elucidating can be reduced down to physical processes.
00:39:03
And so I wanted to dig into that as much as I could. But then as I got started in the PhD, I was like, wow dataset sizes are getting really big. It seems like there’s really interesting things that we could be doing there, detecting patterns in data, identifying causal direction in data. And so I went down this road of focusing on programming and machine learning because I knew that whether I stayed in academia or not, those would be transferrable skills. And I’m not surprised, I guess that that ended up being true. However so in your PhD, I know that you were recording activity from surgically implanted electrodes in human brains. And I made this joke before we started recording about how, you know, I felt, you know, I really feel like I made the right choice sticking to silicon experiments or analyzing data as opposed to doing things in you know, learning how to implant electrodes into a ferret.
00:40:03
And I was making this joke about how, you know, people in my cohort, in my PhD were doing that kind of thing. But then the exact person that I was thinking of ended up with a really nice job at Google DeepMind. So there’s, so it seems like you have insight into that. So I ended up getting here [crosstalk 00:40:21], but we kind of have this. So tell us about your PhD, how that relates to work you’re doing today. There’s also, there’s the Insight Data Science Fellowship program that you used to transition from your PhD, from your academic background, into industrial data science. So it’d be interesting to hear about that. And then to finally, have it all make sense, as to how I started off this entire long transition, is that I mentioned how you have, it’s, you have a rich understanding of the backend of a software platform. And so just kind of how this all came about, your rich depth of knowledge in the field.
Matar Haller: 00:41:04
Yes. The first question, let’s start with, with the first one. So my PhD, so yes, my PhD. I was recording electrodes, recording data from electrodes surgically implanted in human brains. Basically what I wanted was, you know, animal-quality data from humans, right? With animal research you can stick electrodes where you want, get really beautiful data. You could do it in slices. You could do, you know, recordings from monkeys trained for years to do a task, and, and then get just beautiful recordings of just like signals for hours and hours of neurons at work. And with humans, you’re often limited to things that are either slow, so fMRI, and it’s like, you know, you see it many, many, many, many, many degrees removed. You’re actually measuring blood flow. You’re not even measuring like direct brain activity. So you’re like measuring a side effect of thinking. Or you can do EEG, which is then, you know, electrical signals filtered through the scalp. And even with, with technologies like MEG and so forth, which is magnetoencephalography, it’s, it’s not the same. You’re not, you’re not at, you’re not on the brain. And then this really unique opportunity opens up in the laboratory of Dr. Robert Knight at Berkeley, which [inaudible 00:42:12] is basically to work with patients that are undergoing brain surgery, often for epilepsy. So epilepsy that can’t be treated with medicine, right? They keep having recurrent seizures. And so the only solution is to go and to surgically remove the problematic area of the brain. However, before that’s done, you need to map out the brain to ensure that it’s not that, you know, you stop the seizures, but the person is left, you know, aphasic, like they can’t speak, or you stop seizures and suddenly they can’t, you know, they’re blind.
00:42:40
And so what you want to do is you want to basically map out the areas of the cortex around the region of interest to ensure that first of all, you can localize exactly where the seizure is. Because remember, until you really get in there, everything is filtered through the scalp and you have, like, it’s really, you can’t tell. And so to figure out like where the focal point is and also what’s around it. And so these people, basically, what happens is they come in for surgery, they have a craniotomy. The scalp is, the skull is removed, the electrodes are implanted, and then they’re bandaged up, and then they’re in the hospital for a week with electrodes coming out of their brain, hooked up to a pre-amp, and then to an amp. And the best-case scenario for, and then their meds are tapered. And there, there’s, there’s all kinds of things.
00:43:29
All, many, many things are done to try to induce a seizure, because the best-case scenario is basically on the first day they have seizures. You localize it, you figure out exactly what’s around it, and then like the next day they’re, they’re in surgery, they remove it, and you’re done. Oftentimes, that’s not the case. You need to work to induce a seizure, sleep deprivation, strobe lights, all sorts of things to get them, right? And so we could, most of the time these people are like sitting in the hospital just kind of like waiting around, right, watching TV or, you know, whatever. And then we come in and, and if they, if they consent, then, then we can come and we can run all sorts of tasks. Basically, we ensure that the task is matched to the regions of the brain that are mapped, right? There’s no point in doing a memory task if, you know, the parts of the brain are only motor and so forth, or a motor task if they’re only looking at language regions.
00:44:18
And I found this job like very, very meaningful in terms of science outreach, like explain to them the value that they have for science and why I’m doing what I’m doing. And I spent a lot of time just, you know, talking to them about the brain and about my research and so forth. I also found it like emotionally incredibly difficult because you’re meeting these people pretty much at the worst time of their lives, right, like, this is just a terrible situation to be in. And so it’s, it’s challenging, it’s rewarding, it’s basically, you know, in a way that you don’t expect your PhD to be, right? Like, you go and you’re like, gimme data, I’ll analyze data. Great. And then there’s, there’s this.
00:44:57
And so I would come in and I would, the task that I was interested in, I was basically interested in sort of tracking the path of a decision in the brain. So I would basically, in the beginning I just had my one task, but then I realized that basically all the tasks that we recorded were decision tasks. So I entered into like, dipped my foot into like the world of, you know, big data, where I could basically take all the tasks that were run. And anytime that there’s sort of a decision to be made, then you have sort of like a motor decision, right? Like you see a stimulus, right? So that goes, or you hear something, so that goes into like your visual cortex or through auditory cortex, and then it needs to make it to the decision-making area of the brain, right, the prefrontal cortex, so it sort of moves forward in the brain. You need to make a decision.
00:45:37
The decision is made, and then it needs to go back to the motor cortex to execute the decision. And what I wanted to do was, I basically tracked that, that loop in the brain and was basically able to look at the activity in the prefrontal cortex and basically say, aha, look, like the sustained activity that I see in the prefrontal cortex correlates with, you know, the reaction time, right? I’m able to sort of see when they’ll trigger a reaction without looking at the motor cortex. I can look, I can see errors, I can see that like the amplitude is also correlated with errors, and basically tracking a thought through the brain. And because I was on the brain, I had extremely fast like recordings of what was going on. So it wasn’t filtered through anything.
Jon Krohn: 00:46:20
The future of AI shouldn’t be just about productivity. An AI agent with a capacity to grow alongside you long-term could become a companion that supports your emotional well-being. Paradot, an AI companion app developed by WithFeeling AI, reimagines the way humans interact with AI today. Using their proprietary Large Language Models, Paradot A.I. agents store your likes and dislikes in a long-term memory system, enabling them to recall important details about you and incorporate those details into dialog with you without LLMs’ typical context-window limitations. Explore what the future of human-A.I. interactions could be like this very day by downloading the Paradot app via the Apple App Store or Google Play, or by visiting paradot.ai on the web. 
00:47:03
That is so fascinating. I really did just sit at a computer and, like learn the programming languages and machine learning algorithms, which was interesting. But wow. I mean, yeah, you’re really doing real valuable work. And so I think some of that work would go back to like Wilder Penfield. 
Matar Haller: 00:47:25
Yes. So that’s, that’s when you’re actually looking and you’re, yeah, I mean, Wilder Penfield is like the grandfather of, of everything that we did. The electrodes that I was using, and again, like things here moved really, really fast. And my PhD was a while ago. I’m not dating myself, but it, it wasn’t yesterday. And electrodes have gotten since then smaller. There’s also a lot of single-unit recordings where you can actually put in an electrode and record from like smaller populations, right? My electrodes were, were pretty big and kind of far apart. So I’m recording from larger populations, but yes, it all comes back to like classic neuroscience. And as I was doing it, so that’s like the data collection, right? But then you collect the data, and it’s such a rich dataset that like it can sustain you forever.
00:48:11
And like the datasets that I collected are still being used for studies, because it’s like, so it’s like rare data. It’s expensive data, it’s rich data. You can look at it many different ways. And so I took my data and data from other studies, and then, and as I was, as I was working, I said, you know, the brain is amazing, right? There’s like, no, I don’t think anyone in our field can argue about that. However, what I found myself like drawn to were the algorithms, the machine learning, the statistics, the signal processing, the programming languages, like all of the things that maybe you would say, “oh, those are just the methods”, I found that those, that those were the parts of the papers that I was reading, those are like the algorithms. Those were the parts that I was like most drawn to. And so then it was kind of like a natural transition from there to be like, okay, like this is, this is what I, this is what I’m actually more, more interested in most of the time. So that was how I found myself there. I did an, like an NLP class towards the end of my PhD, kind of in secret. And then like the rest is history.
Jon Krohn: 00:49:15
Nice. Yeah. And then the Insight Program. So we’ve had guests on in the past that did this fellowship as well. So it’s intended for, I think primarily people who have already done an academic PhD, have a strong quantitative background like you did. You were doing tons of, like you say machine learning, kind of data science techniques. So things like time series analysis, dimensionality reduction, clustering, regression, data permutation. So you had all this existing experience. And then, so the Insight Data Science Program was that a useful transition after you’ve done all that? You’d done the secret NLP course, and yeah, was that still useful for making the transition to industry? 
Matar Haller: 00:50:03
Oh, absolutely. Insight Data Science is wonderful. They, what they do is they basically, they take people that, you know, we already have all of these skills and we’ve done, we’ve done data science and you know, we’ve done machine learning and we’ve done programming, but we’re totally, totally clueless about the real world because we’re academics, right? We know nothing. And basically they help us sort of frame what we’ve done in the context of industry. So talking about like startups, and funding, and jobs, and like, what is it like to work? And then also things just like best, best practices, right? Like, you know, version control and things like that, which some people do in their PhD, some people don’t do, like, some people’s PhD is like a hundred percent in like a Jupyter in like a single Jupyter notebook. And so basically like they kind of get you up to speed for like, and like some, like for gaps that you have there, but also just frame what you’ve done in the context of, okay, this is industry, like here is why what you have is valuable here is how you can use the things that you’ve done in industry.
00:51:05
So people come in and they show you, “Hey, look, here’s fraud detection at this company.” And you’re like, “Oh, hey, I’ve done that,” and they sort of tie it together for you. Also salary negotiation, all of these things. And then what you end up doing is, they say, okay, in your PhD you had five, six, seven years to work on something. Now you have three weeks to put together a project that actually brings concrete impact and that you can pitch. That puts you in the mindset of, what is a POC? And it also helps employers get around the bias of, “God, I don’t want to hire an academic, I’ll never get anything done.”
Jon Krohn: 00:51:45
Right, right, right. And I think there are a lot of relationships between Insight and future employers. Lots of employers are looking for great data science talent, like the people taken into the Insight Data Science Program, so you end up with this flywheel. And I just want to quickly go back, before we transition away from what you were doing with your PhD and how you got to what you’re doing today. I want to talk a little bit more about Wilder Penfield, who I mentioned was, I guess, the first person to map the human cortex to that level of detail. And I think it was the same situation, many decades ago, like the 1950s or something.
00:52:30
But the same kind of thing: epilepsy patients, open skull, and recording from these individual electrodes over the whole brain. And that gave this map of the whole cortex, so there’s this somatosensory homunculus and this motor homunculus. It’s really cool. I encourage our listeners, and I’ll try to remember to include in the show notes a link to some images of this homunculus. I think homunculus is Latin for little man. And the idea is that as you go over the motor cortex or the sensory cortex in the brain, there’s this map of your body. And it isn’t anatomically correct in terms of scale. So for example, for both the sensory and the motor homunculus, the hands are huge, because you have so much detailed sensory perception and fine motor control in your hands. And I remember the lips are huge-
Matar Haller: 00:53:37
Lips, the tongue. 
Jon Krohn: 00:53:39
But yes, [crosstalk 00:53:40] back is small. 
Matar Haller: 00:53:41
Yeah. Exactly. I think we had the same textbook. So one thing you can do is take a paperclip, right, and bend it so its two points are some distance apart. And you find the closest distance on your back at which you can still tell the points apart, because at some point you’re just not sensitive enough on your back to tell them apart. And then you put it on your lips and immediately you’re like, oh wow, these are super far apart. And that basically reflects the fact that you have less sensory representation of your back than of your lips in your sensory strip.
Jon Krohn: 00:54:13
Cool. So yeah, I wanted to recap on that. And there’s a very specific reason why I brought this back up, not just because it’s really interesting, which I think it is in and of itself. You were talking about recording electrodes, and I was thinking about the recording electrodes that Wilder Penfield would’ve been working with many decades ago, not many centuries, many decades ago.
Matar Haller: 00:54:36
Wow. Thousands of years ago. 
Jon Krohn: 00:54:39
This podcast will be listened to for millennia. It’ll all be confusing. So the recording electrodes he was working with would’ve been much bigger than the ones you were working with, and you were talking about how in recent years they’ve become even smaller. And that got me thinking about how there is this push, with companies like Elon Musk’s Neuralink, toward brain-computer interfaces that eventually aren’t just for people who have serious issues and have their skull opened. There’s this move towards, in our lifetime, potentially having some way of having recording electrodes on our brains without needing invasive surgery. So I don’t know if you have any thoughts on that. And I’m also gonna be asking you after this episode whether you happen to know any amazing guests who could dig deep into that topic here.
Matar Haller: 00:55:37
Ah, yeah, I have some ideas. So actually, even today, not all brain-computer interfaces are super invasive. I mean, it’s invasive in the sense that you have an electrode in the brain, that’s invasive, right? But not all of it requires a craniotomy that opens everything up. So for example, for Parkinson’s patients you have deep brain stimulation, where they basically put in an electrode targeted very, very specifically to the substantia nigra, which is a place in the brainstem where-
Jon Krohn: 00:56:13
The black substance.
Matar Haller: 00:56:15
Yeah, there you go. Someone remembers their neuroanatomy. It’s where dopamine is basically produced, and when cells there start to die, you need to stimulate it in order to get around the fact that it’s not functioning. And again, that’s just an electrode that’s brought in and used to stimulate, and there’s the question of how you decide how much to stimulate; there’s a device that you can calibrate and then decide how much to stimulate. So that doesn’t require a massive craniotomy, and it’s already a feedback loop that’s been around for a really long time.
00:56:54
You also have things that are more invasive, but long-term. So the patients I was discussing basically have the electrodes in temporarily, right? They have the craniotomy, the electrodes are put in, wires come out of the head. Once they have seizures and everything’s localized, in the best case they remove it, the skull goes back in, the electrodes are gone. However, there are also companies, for example NeuroPace, that actually permanently implant an electrode strip in the person’s brain. Then they’re able to record ongoing, and the idea is that they can predict, or give some sort of lead time before a seizure happens, and then stimulate to stop the seizure. And they record, and that’s uploaded to their servers. And that’s also already out there; there are patients walking around with it right now.
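To make that closed-loop idea a bit more concrete, here is a minimal Python sketch of the general record, detect, stimulate pattern Matar describes. It is purely illustrative: the sampling rate, window length, frequency bands, and threshold are invented for the example, and it has nothing to do with NeuroPace’s actual, clinically validated detection algorithms.

```python
import numpy as np
from scipy.signal import welch

FS = 250           # sampling rate in Hz (assumed for this sketch)
WINDOW_SEC = 2     # length of each analysis window, in seconds
THRESHOLD = 5.0    # band-power ratio that triggers stimulation (arbitrary, not clinical)

def band_power(segment, fs, low, high):
    """Average spectral power of `segment` between `low` and `high` Hz."""
    freqs, psd = welch(segment, fs=fs, nperseg=min(len(segment), fs))
    mask = (freqs >= low) & (freqs <= high)
    return psd[mask].mean()

def seizure_risk(segment, fs=FS):
    """Toy risk score: fast activity relative to an alpha-band baseline."""
    fast = band_power(segment, fs, 25, 60)          # fast activity as a stand-in pre-seizure marker
    baseline = band_power(segment, fs, 8, 12) + 1e-12
    return fast / baseline

def closed_loop(recording, stimulate):
    """Slide over an ongoing recording and call `stimulate` when risk crosses the threshold."""
    window = FS * WINDOW_SEC
    for start in range(0, len(recording) - window, window):
        segment = recording[start:start + window]
        if seizure_risk(segment) > THRESHOLD:
            stimulate(start / FS)  # seconds into the recording at which we would stimulate

# Synthetic demo: a quiet 10 Hz background with a burst of 40 Hz activity near the end.
if __name__ == "__main__":
    t = np.arange(0, 60, 1 / FS)
    eeg = np.sin(2 * np.pi * 10 * t) + 0.1 * np.random.randn(len(t))
    eeg[-FS * 4:] += 3 * np.sin(2 * np.pi * 40 * t[:FS * 4])
    closed_loop(eeg, stimulate=lambda secs: print(f"stimulate at {secs:.1f}s"))
```

Real systems are patient-specific and clinically validated; the point here is just the shape of the loop: record continuously, score a short window, and act when the score crosses a threshold.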
00:57:43
And so I think Elon Musk’s Neuralink is like the next step of that, where it’s like, okay, it’s not clinical, and let’s see how we can get it smaller and smaller. So if we think about Moore’s Law, things fitting in more and more and getting smaller and smaller, I think we’ll be there shortly. And then it’s a matter of how you make it minimally invasive, because at some point the question is how long it can be in before cells start to die, before the body starts to reject it. And there’s a difference between just recording versus stimulating, and what does it mean to stimulate, and at what frequency?
00:58:22
And I think there are some really, really interesting questions, because, and here’s another tangent, we already see that brains oscillate, right? So we have oscillations in the brain, where different frequency bands are associated with different processes. So alpha, which is between 8 and 12 Hz, is often associated with visual cortex, and you have beta, which is about 15 to 30 Hz, and that’s for motor movement: when you initiate motor movement, you have beta suppression. So there are things like that. But what we also see is that there’s individual variability in these frequency bands, right? So my beta is not your beta, my alpha is not your alpha, my theta is not your theta, and so forth.
00:59:05
And so now, any time we’re going to go in and start stimulating, you’re going to say, okay, well, I’m going to stimulate at a particular frequency. But what is that frequency, right? How do I determine that frequency? How do I know what frequency is ideal for me versus for you, for whatever the desired result is? Now, again, right now that’s far away. But there are already stimulation protocols, even completely non-invasive ones. There are things like TMS, transcranial magnetic stimulation, where people have been experimenting with that, and there’s also research about using stimulation for psychiatric disorders and so forth. So it’s a huge, huge field. And hopefully that wasn’t too much of a tangent.
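Since the conversation names specific bands (alpha at roughly 8 to 12 Hz, beta at roughly 15 to 30 Hz) and the fact that “my alpha is not your alpha,” here is a small, hypothetical Python sketch of estimating an individual’s peak frequency within a canonical band, which is one reason a one-size-fits-all stimulation frequency is tricky. The band boundaries, sampling rate, and synthetic signals are assumptions for illustration only.

```python
import numpy as np
from scipy.signal import welch

# Canonical band boundaries from the conversation; exact definitions vary across labs.
BANDS = {"alpha": (8, 12), "beta": (15, 30)}

def individual_peak_frequency(recording, fs, band="alpha"):
    """Return the frequency (Hz) where this recording's power peaks within a canonical band."""
    low, high = BANDS[band]
    freqs, psd = welch(recording, fs=fs, nperseg=4 * fs)  # roughly 0.25 Hz resolution
    in_band = (freqs >= low) & (freqs <= high)
    return freqs[in_band][np.argmax(psd[in_band])]

# Two synthetic 'subjects' whose alpha rhythms peak at different frequencies.
fs = 256
t = np.arange(0, 30, 1 / fs)
subject_a = np.sin(2 * np.pi * 9 * t) + 0.5 * np.random.randn(len(t))
subject_b = np.sin(2 * np.pi * 11 * t) + 0.5 * np.random.randn(len(t))

print(individual_peak_frequency(subject_a, fs))  # approximately 9 Hz
print(individual_peak_frequency(subject_b, fs))  # approximately 11 Hz
```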
Jon Krohn: 00:59:46
No, not at all. I obviously found it super fascinating. And I think any of these kinds of discussions, around how we can use technologies to adapt our brains, either to resolve some negative issue, like you’re saying, from seizures to psychiatric issues, all the way through to potentially having enhancements, are worth having. I know some of these brain-computer interface, BCI, technologies are designed not just for resolving issues but also to potentially augment human capabilities in ways that we probably can’t predict yet. So I think it’s super, super interesting, and yes, I will be following up with you to see if you have recommendations for people who could dig into a BCI episode.
01:00:40
So you mentioned, Matar, how your PhD was more intense than some other neuroscience PhDs, and certainly orders of magnitude more intense than my PhD was in terms of being really in the real world and dealing with patients. But that isn’t the only intense job you’ve had. So, am I reading this correctly? You were teaching children how to use tanks? Preschool tank instructor? No, wait, it’s two separate items. So you were a preschool teacher, and you were also a tank instructor. I’m curious as to whether those experiences helped prepare you for your career. And in particular, and this might seem tangential, but it wouldn’t surprise me if it somehow ties into an answer, I know that you’re passionate about expanding leadership opportunities for women in STEM careers, including data science. So I wonder if we can somehow tie those two topics together.
Matar Haller: 01:01:41
Yeah, sure. Why not? So, my military service: in Israel there’s mandatory military service. I actually took kind of a strange route. Normally, when you’re 18, that’s when you start your military service. I actually did my undergrad-
Jon Krohn: 01:02:01
You went to Berkeley.
Matar Haller: 01:02:03
I did.
Jon Krohn: 01:02:04
Right. Yeah. You went to Berkeley for an undergrad and then back to Israel to be a tank instructor and then back to Berkeley to do your PhD. 
Matar Haller: 01:02:13
Yes, correct. True story. And so- 
Jon Krohn: 01:02:15
And then now you’re back in Israel again. 
Matar Haller: 01:02:18
Why can’t I make a decision? Yes. So I did it a bit backwards, and I decided that for my military service, not that there’s a ton of flexibility in what you do, but there’s some, I wanted to try out for something very different from anything I would ever do. I said, I’m probably going to be in an office for the rest of my life, so I want to do something very different. And I also wanted to do something that was kind of scary to me, that I was pretty sure I’d fail at, that I thought would be really difficult, something completely out of my comfort zone, because I think that’s important.
01:02:56
And I said, you know, the risk here isn’t that high. Worst case, I won’t be that great in the military, and that’s fine. So I tried out to be an instructor, and then specifically I saw a tank and I was like, that machine is amazing, I want that. So I tried out specifically to be a tank instructor. The way it works, at least back then, is that they have women who train the soldiers. In a tank you have the gunner, the driver, the loader, and the commander, and my role was to train the gunners. Commanders have to do all of the roles, and officers have to know all of them, so I was also training the commanders and the officers.
01:03:55
And specifically for training the gunners, I was the trainer on the weapons subsystems, so basically all of the computers, the computer systems within the tank for the weapons subsystem. And it was kind of tricky to do my undergraduate degree before my military service, because I would ask my commanders questions like, “so the algorithm that it uses to figure out what angle to open fire at, does it learn through reinforcement learning?”, and they were just like, what? Who are you? What planet did you come from? So that’s what I did my military service in. And it was incredibly physically difficult, because they basically make us go through everything: I did basic training, had to do like a million and one pushups, run around outside a lot, not sleep. I was trained on all of the subsystems of the tank, not only the gunner’s, basically to learn everything that goes on there and then focus in on one.
01:05:03
Because only after you do basic training and learn everything do they say, okay, now you’re going to focus on this. And it basically checked all the boxes of being really, really hard, incredibly challenging. It turns out the physical part isn’t the tough part; it’s mentally very, very difficult. And that kind of set me up to be less afraid of failure, because it was tough. After that I was a preschool teacher, which was by far, by far the most difficult job I’ve ever had. By far.
Jon Krohn: 01:05:38
Oh, really? 
Matar Haller: 01:05:40
Oh, yeah. 
Jon Krohn: 01:05:41
Wow. Even as you were saying that, I thought you were going to say the opposite. I had this idea of cuddles and laughter, and how much nicer that would be.
Matar Haller: 01:05:50
Yes. Tons of cuddles, tons of laughter, but so physically draining, and so emotionally draining. You know, I would dream about my kids; it never leaves you. You dream about these kids and you’re thinking about them. And I was also more sick than I’ve ever been, constantly sick, always on some sort of antibiotics. No, but it’s very, very challenging. But it also gives you a different perspective; it’s another way to do something that’s really tough. And both the military and being a preschool teacher are incredibly, incredibly humbling. Very, very humbling. And I think that’s the biggest takeaway for me from those things that I did.
01:06:35
And now wait, wait. I need to tie it back in to women in STEM. So, now I’m a mom. I have three kids: a two-year-old, an almost-five-year-old, less than a month from five, and an eight-year-old. So first of all, if I’m tying it all into content moderation and why I do what I do, I think it’s extremely obvious. Online harm can turn into offline harm, and I do want to make all interactions safer. And then I see my daughter, she’s eight, and I see what it means to be a woman in this world and a leader in this world.
01:07:19
And I want to make sure that she has role models, so that she isn’t the only woman in her computer science class, been there, so that she isn’t the only woman in meetings, been there. I want to make sure that she has a much more welcoming environment for whatever she wants to do. And what’s really sad to me is that even now I’m hearing from her things like, oh, well, boys are better at that than me. No, not true, very not true, and here’s why it’s not true. So these are the kinds of things that, first of all, I want to make sure aren’t out there online, speaking of disinformation, but I also want to make sure that the environment she’s growing up into is much more welcoming.
Jon Krohn: 01:08:05
Nice. Well, it’s cool to hear how your passions come through across all aspects of your life, and that you’re tying together the personal things that you’d like to see in the world with what you’re doing professionally with respect to things like disinformation. So we were talking about you being in Israel; obviously that’s come up a number of times in this episode, along with the military service. Another thing that is unique about Israel is that it has very high R&D expenditure per capita, markedly higher than any other nation on the planet. And that probably creates an interesting flywheel with the strong tech startup ecosystem in Israel, which, you know, helps generate more things that R&D can be spent on.
01:09:00
But another interesting piece related to this, and I can’t remember if this was a podcast conversation I’ve had in the past, I don’t think it was, so I think this is the first time we’ve talked about it on air, is that my understanding is that another thing fueling tech startups in Israel is this mandatory military service. So you went and did tank instruction, but a lot of people, particularly, I suspect, a lot of people who already had undergraduate degrees like you did, end up doing things where they’re not training to be on the front lines; instead they’re training to do threat intelligence, they’re training to do signal detection, they’re using machine learning and data analysis in the field. And then, having developed that skill set over several years, when you finish you’re like, well, what could I do? And one idea that I guess a lot of these people have is, well, I could be making a startup, I could be using these technology skills in industry.
01:10:05
So we have these flywheels; I guess there are two flywheels here. There’s one where the mandatory military training leads people to become tech entrepreneurs, and that probably in turn is also helpful for military capabilities in general. And then you have this separate flywheel of R&D, where this strong tech ecosystem is a self-fulfilling prophecy of, “oh, great, we should be investing more in this,” and so then more people go into that. And yeah, I’ve now talked a lot, a lengthy transition. The floor is yours.
Matar Haller: 01:10:47
So yes, yes, and yes. So yeah, we have mandatory military service. It’s currently set, roughly speaking, at two years for women and three years for men, again with lots and lots of caveats. First of all, there’s definitely a big investment by the military in technology, whether it’s signal processing or AI or whatever. So you have people who are trained in that, like you said, and then they can go out with this skill set, and everyone’s hiring those people; we hire them too. But even for people who aren’t going into these sorts of fields, the fact that there’s mandatory military service means that from a young age you’re in a place where you’re picking up skills that are necessary to succeed in these companies, right?
01:11:39
So, for example, leadership skills, right? In most cases, in order to become an officer in our military, you have to start at the bottom. It’s not like in the US, where you have West Point and the Naval Academy or whatever, and that’s how you become an officer. Basically you start when you’re 18, and then, based on different parameters, you can elect or be chosen to do officer’s training. So then you have these people leaving the military with a skill set of being very focused, with leadership skills and managerial skills and time management skills, all these things where you go, oh, that’s what makes a successful entrepreneur or a successful CEO.
01:12:17
So yes, one of them is on-the-job training, and the other is just, in general, these other skills you need to have. And another thing that I think is really positive about the mandatory military service is that it’s this sort of equalizing force, right? Everyone goes into the military, almost, huge caveat, which is causing a lot of social unrest here right now, but we’ll leave that for a different time. But you go into the military and you’re mixed with different people, right? So that’s also a way of meeting people you wouldn’t necessarily otherwise meet, out of your echo chamber, out of your specific place. And that can also be an incubator for new relationships that can then go off and start new companies. And then yes, I think the fact that we have a very, very strong investment in R&D is also, like you said, a self-fulfilling prophecy. What do people end up doing? They go into this field, right? That’s what we know, that’s what we see, and it’s also a very good path to upward mobility for people.
Jon Krohn: 01:13:17
And so, with our field in particular, data science, do you think all of this R&D in Israel will give Israel an edge in AI technology specifically?
Matar Haller: 01:13:26
Yeah, so yes, absolutely. I think we’re already seeing that. I have people I work with, one of whom used to be very, very senior in the military in AI; I work with her very closely, and she’s now our VP of Product. She was very, very senior in the military, building AI infrastructure capability, so there’s already this sort of cross-pollination. We also have people that, like I said, we hire right out of the military, or in some rare cases we have people who start doing their studies first, right? The military says, okay, you can take this time, we pay for your studies, and then you sign on for a certain amount of time later with the military.
01:14:14
And in some cases we can also hire these people while they’re doing their studies, and then the skills they learn with us they can go and use in the military. So there is definite cross-pollination that we’re seeing. And I think it also makes AI a very, very strong and core component of the industry here, because it’s so useful, not only in the military but in general across all of the companies. And so there’s this very, very rich community here of researchers, practitioners, and so forth.
Jon Krohn: 01:14:49
Great answer, crystal clear and exciting to see what ActiveFence and other AI companies will be doing out of Israel in the coming years and the coming decades. This has been an awesome episode, Matar. So I was promised that you were this extraordinary speaker and you have proved to be an amazing communicator. It’s been a real joy to speak to you. 
Matar Haller: 01:15:11
Thank you. 
Jon Krohn: 01:15:11
And so I’m sure our audience loved this conversation as well. Thank you. And so we covered a lot of interesting topics, automated harmful content detection, neuroscience, military service, preschoolers, so I’m sure our listeners will want to hear more from you. So first my penultimate question that I always ask guests is whether you have a book recommendation for our audience? 
Matar Haller: 01:15:44
Of course. So this has nothing to do with anything that we talked about, but I really like the book Under the Banner of Heaven. It’s by Jon Krakauer, who wrote Into the Wild, I think.
Jon Krohn: 01:15:56
Oh yeah, I’ve heard that. 
Matar Haller: 01:15:59
I love reading books about sort of like other, other lives or other, other places. And so Under the Banner of Heaven is, is a good one. 
Jon Krohn: 01:16:09
Nice. Yeah, Jon Krakauer is an outstanding author, based on Into the Wild, so I’m sure that’s a great recommendation. He’s also an annoying person for me when I start typing my name into Google; he’s the one who comes up until I get to the “o” in my last name. So I’m always reminded of him.
Matar Haller: 01:16:29
Maybe that’s what primed me to think of that book, of all the books I could have recommended.
Jon Krohn: 01:16:33
There you go. And yeah, and then my final question for you is how should people follow you and glean more insights from you after the program? 
Matar Haller: 01:16:42
So, I’m on LinkedIn, like everyone. We also have an R&D tech blog for ActiveFence at engineering.activefence.com, and that’s where you can read more about the things we do and dive into some more details. And please feel free to shoot me an email or reach out to me on LinkedIn. I’m always happy to chat.
Jon Krohn: 01:17:06
Nice. Thank you for making that offer to our listeners, Matar, and thank you so much for being on the program, especially on such short notice; we booked you just days before recording this episode.
Matar Haller: 01:17:18
Oh, don’t say that. It makes it seem like I have no life. 
Jon Krohn: 01:17:22
Yeah, right. I mean, it actually just shows how kind you were to make this time, because you’ve got three kids and you’re the VP of Data and AI at a very fast-growing, highly valued company. So thank you for making the time, despite all that, to fit our SuperDataScience listeners in.
Matar Haller: 01:17:45
Happy to. 
Jon Krohn: 01:17:46
Nice. Well, yeah, so you mentioned potentially being on the show again in the future, and that sounds great to me. We can hear about how ActiveFence continues to shape this harmful content reduction space in the years to come. Thanks, Matar.
Matar Haller: 01:18:04
Thank you for having me. This was fascinating and a lot of fun. 
Jon Krohn: 01:18:12
I loved this conversation today. I hope you did too. In today’s episode, Matar filled us in on how an ML model such as a binary classifier can become contextual by taking into account additional context. For example, we can pull out a logo from an image, identify the individual in an image and compare it with a database, examine natural-language comments, and consider the content poster’s history and graph-network affiliations. She also talked about how real-time streaming of harmful content presents unique challenges that can be addressed by smaller models on edge devices like phones, by sampling on servers, and, again, by taking context into account. She talked about how we can create a flywheel of defensible commercial AI systems by amassing proprietary data curated by internal experts. And she talked about how she uses Python, Node.js, TypeScript, and Kubernetes for developing ML models, deploying them into production, and scaling them up for ActiveFence’s users.
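To make the “contextual” part concrete, here is a minimal, hypothetical Python sketch of the general pattern described above: start from a base classifier’s score on the media itself, then fold in contextual signals such as logo matches, comment language, the poster’s history, and graph-network affiliations. The field names and weights are invented for illustration; they are not ActiveFence’s actual system, which relies on learned models and expert-curated data rather than hand-tuned weights.

```python
from dataclasses import dataclass

@dataclass
class ContentSignals:
    """Hypothetical feature bundle for one piece of user-generated content."""
    classifier_score: float       # base model's probability that the media itself is harmful
    logo_match: bool              # a known harmful-group logo was detected in the image/video
    comment_toxicity: float       # NLP score over the natural-language comments (0-1)
    poster_prior_violations: int  # how many of this user's past posts were removed
    risky_group_memberships: int  # graph-network affiliations with previously flagged communities

def contextual_risk(s: ContentSignals) -> float:
    """Blend the base classifier with contextual signals into a single 0-1 risk score.

    The weights below are arbitrary placeholders; a production system would learn
    them (or use a second-stage model) rather than hand-tune them.
    """
    score = 0.5 * s.classifier_score
    score += 0.2 if s.logo_match else 0.0
    score += 0.15 * s.comment_toxicity
    score += min(0.1, 0.02 * s.poster_prior_violations)
    score += min(0.05, 0.025 * s.risky_group_memberships)
    return min(score, 1.0)

# A borderline piece of content can look very different once its context is considered.
ambiguous_post = ContentSignals(
    classifier_score=0.45,
    logo_match=True,
    comment_toxicity=0.8,
    poster_prior_violations=3,
    risky_group_memberships=2,
)
print(contextual_risk(ambiguous_post))  # about 0.66, up from a base score of 0.45 -> escalate for review
```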
01:19:07
As always, you can get all the show notes, including the transcript for this episode, the video recording, any materials mentioned on the show, and the URLs for Matar’s social media profiles, as well as my own, at www.superdatascience.com/683. That’s www.superdatascience.com/683. Your feedback is invaluable, both for spreading the word about this show and for helping me shape future episodes more to your liking. So please rate the show on whichever platform you listen to it through, and feel free to converse with me directly through public posts or comments on LinkedIn, Twitter, and YouTube. All right, thanks to my colleagues at Nebula for supporting me while I create content like this SuperDataScience episode for you, and thanks of course to Ivana, Mario, Natalie, Serg, Sylvia, Zara, and Kirill on the SuperDataScience team for producing another captivating episode for us today.
01:19:54
For enabling that super team to create this free podcast for you, we are deeply grateful to our sponsors whom I hand-selected as partners because I expect their products to be genuinely of interest to you. Please consider supporting this show by checking out our sponsors’ links, which you can find in the show notes. And if you yourself are interested in sponsoring an episode, you can get the details on how by making your way to jonkrohn.com/podcast. Finally, thanks of course to you for listening. It’s because you listen that I’m here. Until next time, my friend, keep on rocking it out there and I’m looking forward to enjoying another round of the SuperDataScience podcast with you very soon. 