“The most important thing from a development perspective is actually people start writing their evals. That is, I was on this tour for a very long time because the problem, why does agenda coding work so well, Sarah, is of course, you can verify the outcome, right? You can either say, hey, is the program compiling, or are you unit tests, right?”
Agent mining captures tribal knowledge from decision traces
“Now we call it agent mining because we record all these decision traces, these contexts, what the users are entering into the system. And then you can either use it to say like, hey, wait a minute, this is actually an anomaly. The folks in, I don't know, in UK from our company or the folks in Australia shouldn't do this because the standard operating procedure is this. Or you say like, oh, that's actually a very good improvement.”
LLMs are insufficient for predictive tabular data analysis
“Now, the problem is, of course, still today, if we look at these predictive questions, right? ... the challenge is large language models are not made for this, right? In a way, how they generate just one token after another essentially in a sequence to sequence modeling, I mean, they're language models, right? And they do this phenomenally well. But if you still want to do these predictors where you have to go back to these classical machine learning approaches...”
“What we are focusing on is the optimization domains, obviously, and then if you go into things like logistics, traveling salesman problems, knapsack problems, like all these kind of usual hard problems in computer science, these are interesting problems where we believe that could be interesting for the future, for maybe a different kind of computing paradigm to solve for.”