The Neon Show

Why Coding is the Fastest Path to AGI | Turing CEO Jonathan Siddharth

Siddhartha Ahluwalia Season 1 Episode 375

Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.

0:00 | 1:10:48

Who is teaching the world's most powerful AI models to think?

Turing is one of the largest data partners to OpenAI, Anthropic, Google, Meta, Microsoft, and Nvidia. At a $2.2 billion valuation it has become one of the most important infrastructure layers in the AGI race.

Jonathan Siddharth started Turing in 2018 with a thesis that talent matching is a trillion-dollar problem. Turing reached unicorn status in 2021. Then, in 2022, as the foundation model race accelerated, OpenAI approached Turing to provide coding data for ChatGPT.

Jonathan recognised that frontier AI labs faced an enormous bottleneck: high-quality training data and human intelligence at scale. Instead of remaining just a talent marketplace, he made a bet that most unicorn CEOs never make. He built a second business on top of the first and leaned back into his AI research roots.

Jonathan has a clear view of what needs to happen before we get to super intelligence. The four keys to unlocking AGI: coding, reasoning, tool use, and multimodality. He believes we solve for those four, and AI can do almost anything a human can do in front of a computer. 

If you are excited about where the AGI race is heading this episode is for you

00:00 - Trailer
01:06 - What Turing does
05:55 - Why OpenAI reached out to Turing
8:28 - How GPT-3 became ChatGPT
17:54 - How ImageNet breakthrough changed the world
21:12 - The largest provider of coding data to AI labs
24:34 - Four keys to super intelligence
28:45 - Every human will run multiple companies in 10 years
32:27 - Can agents have self-improvement loops?
34:36 - The future of software engineering
36:26 - Agents should create, humans should steer
39:46 - Is the line between products and services companies blurring?
40:42 - How an agent can handle hiring end-to-end
43:36 - Every human can now write software
45:22 - Will workflow SaaS disappear?
47:46 - No fine-tuning vs fine-tuning camps
51:49 - A case study in compute constraints
57:06 - Why the world needs so much compute
1:01:26 - Where Jonathan would invest today
1:03:16 - Where cybersecurity is heading
1:08:31 - How the world will look in 10 years
-------------
India’s talent has built the world’s tech—now it’s time to lead it.
This mission goes beyond startups. It’s about shifting the center of gravity in global tech to include the brilliance rising from India.
What is Neon Fund?
We invest in seed and early-stage founders from India and the diaspora building world-class Enterprise AI companies. We bring capital, conviction, and a community that’s done it before.
Subscribe for real founder stories, investor perspectives, economist breakdowns, and a behind-the-scenes look at how we’re doing it all at Neon.
-------------
Check us out on:
Website: https://neon.fund/
Instagram: https://www.instagram.com/theneonshoww/
LinkedIn: https://www.linkedin.com/company/beneon/
Twitter: https://x.com/TheNeonShoww

Connect with Siddhartha on:
LinkedIn: https://www.linkedin.com/in/siddharthaahluwalia/
Twitter: https://x.com/siddharthaa7
-------------
This video is for informational purposes only. The views expressed are those of the individuals quoted and do not constitute professional advice.

Send us Fan Mail

SPEAKER_01

OpenAI came to us when they were training 253 and they wanted to teach 253 to COVID. So we collaborated with them on teaching the models to code, and then after we launched, it was the fourth of the API.

SPEAKER_00

Turing AI enterprises have secured significant funding, including a $111 million investment and raising a valuation of $2.2 billion.

SPEAKER_01

So Turing is the one company that works with the Frontier AI labs and Fortune 500 enterprises. A lot of problems in life are math problems, and code is a way to express and solve math problems. If you can solve coding, tool use, reasoning, and multimodality, you have the keys to superintelligence. I used to think about why do we need so much compute? Now it's clear to me. I think humanity is going to be reasoning bound and compute bound to solve some of our biggest problems.

SPEAKER_02

Which are the kind of companies that you would invest?

SPEAKER_01

I would bet on companies that are an input to I would also bet on companies that are betting on a future where.

SPEAKER_02

Hi, this is Sadhhat Alwaliya. Welcome to the Neon Show. I'm your host and managing partner of Neon Fund, a fund that has invested at seed stages in some of the best enterprise AI companies coming from India, building for the US market and for the globe, like Atomic Work, Spot Draft, CloudSec. Today I have with me Jonathan Sidhat, co-founder and CEO of Turing. Turing is accelerating superintelligence. Jonathan, Turing is one of the most defining AI companies of our era. It's such a proud thing to see that, you know, the founders who are from, you know, you're from Chennai, you know, and achieving so much today. So first of all, congratulations on all the success.

SPEAKER_01

Thank you, Suddart, and thank you for having me. It's great to be here. And it's impressive how large the neon show is in terms of subscribers.

SPEAKER_02

No, thank you. So, Jonathan, uh, want to understand from you. Turing started in the hiring business when you initially started. You became a unicorn in that business. And then you you started the business of being the provider to the Frontier AI labs on the data side, right? How did this transition happen? If you can walk us through the journey.

SPEAKER_01

Great. Maybe for the audience, I'll give a little bit of an overview of what Turing does, and then I'll walk through the journey for how we got here. So our mission is to accelerate superintelligence to drive real economic progress. And we do that in two specific ways. First, we work with all the Frontier AI labs to advance the models in SUE, Enterprise, and Frontier STEM by providing high-quality data, evals, and oral environments. In doing that, we discover the jagged intelligence of these Frontier models and agents. We discover what different models and agents are good at. We leverage those insights as a differentiated deployment partner in the enterprise. And the enterprise, we've gone deep in financial services. We work with some of the largest asset management firms in the world and build end-to-end AI systems for them. And when we build these systems, we discover where these models and agents break. We leverage those, uh, our knowledge of those capability gaps to create even more realistic data, evals, oral environments for the frontier AI labs. So we so we can solve even bigger problems in enterprise, see where things break, generate even more realistic data, solve even bigger problems in enterprise, and we execute this loop. So most data companies don't see deployment, and most deployment companies don't see data. So Turing is the one company that works with uh the Frontier AI labs and Fortune 500 enterprises. And we are a trusted bridge between research and deployment. Um, on one hand, it's like humans making AI smarter, and on the other hand, it's AI making humans smarter. Um, and these both uh compound. That's what we do. And you're right that the journey here um is super interesting. So Turing started uh as this uh a platform that used AI to source and vet software engineers from all over the world. We have the world's largest uh software engineering talent platform, uh, Silicon Valley Caliber software engineers from all over the planet. So in um, and we were growing really fast. Uh in um it the world kind of changed when uh OpenAI came to us uh when they were training GPT-3 and they wanted to teach GPT-3 to code. Um, so we collaborated with them on teaching the models to code and to do tool use, function calling. And then when ChatGPT launched and it took off, it was the birth of uh the AGI race. Right. And it became clear to us that this is this is going to be a huge shift in the way software engineering is done and how all knowledge work uh in front of a computer will be done. So we leaned into it uh and we built a data platform on top of our talent platform and expanded beyond coding to enterprise knowledge work, frontier STEM. And uh it's been incredible working with all the Frontier AI labs to advance the models in coding, knowledge work, and STEM.

SPEAKER_02

And and how did OpenAI reach out to you? Like what was your mode or positioning in the data space?

SPEAKER_01

So we were not a data company when they reached out. Like we, I think OpenAI at that time was looking for. I mean, I don't know for sure, like, but they were looking for who has the best coders on the planet, and it is Turing. We have the world's best platform for software engineers, and now we've expanded that to experts in every industry you can think of financial services, retail, healthcare, life sciences, every um function, software engineering, sales, marketing, finance, every role in the org chart in that function. Um, we have, I mean, Turing's platform has the smartest humans in the world in every domain. I think OpenAI at that time was just looking for um somebody who can generate really high quality data for coding models really quickly. And the fact that we used AI to source talent, vet talent, match talent, manage talent meant we could scale up really fast while keeping quality high. That was the key.

SPEAKER_02

So they used your this engine uh uh to help them, or they they they used uh they said, Jonathan, uh this is what we need on the data side uh for coding. Come and help us with whatever you have.

SPEAKER_01

So I wouldn't be able to get into the specifics, but we but it it it was about generating coding data to improve the models. Um for um at that time the models were at the level of writing, um like relatively simple Python code, like relatively the kind of um coding tasks that a human would do when interviewing for a role. Now we've built up scaled up our engine for agentic coding, where I mean you're seeing what the coding models today are capable of, building end-to-end systems, like coding for days, yeah, uh sometimes a week. Um, and we've uh we've uh evolved it. Um but it started with just I think like they needed a partner that could help generate really high-quality coding data that could um improve the models, you're evaluating weaknesses in the models, figuring out failure modes, and generating data to uh address those, uh address those gaps.

SPEAKER_02

And if you have to explain, let's say, coding data to our audience who many of them might not come from the AI background, how would you explain it?

SPEAKER_01

Uh so there's I think it's um uh one way to think about data is just look at the LLM training pipeline today. So there's pre-training data, uh SFT data, and reinforcement learning data. All of this is slightly different. Uh in pre-training, you're feeding the model um the entire internet.

SPEAKER_03

Yeah.

SPEAKER_01

Um, and you're also giving the model access to books, other sources of knowledge. So pre-training is um largely unsupervised. Uh, so you're just feeding the model data. The model is building, I mean, you're you're basically building a base model that learns how to autocomplete. Yeah. In post-training, uh think of pre-training as creating like uh it's almost like brain mass. There's this raw brain mass, but you haven't quite shaped it yet. Um in that pre-training corpus, from a coding perspective, there's also data from GitHub, right? Like and other sources of code. If you've acquired code assets, you might also feed that into pre-training. Um, but the base model is only doing um uh autocompletion of tokens, right? It doesn't yet know how to follow instructions. In supervised fine-tuning, there's this paper from OpenAI called Instruct GPT, which talks about the recipe for how to do this, where uh you're teaching the model to be an assistant. So in supervised fine tuning, uh that's where um GPT-3 becomes Chat GPT, right? So you're teaching the model to be helpful, honest, harmless, and you're teaching the model how to have a conversation. So you the so humans are teaching the model to um you're having conversations with the model, the model knows um how to uh communicate with with a user. Now then there is uh reinforcement learning. And in reinforcement learning, you uh create uh oral environments uh where uh you're creating these environments where an agent is getting uh a reward when it does something correctly, and when it doesn't do something correctly, it doesn't get a reward. And usually you also have to give the agent intermediate rewards as it's um uh as it's uh trying to get to a to a solution. So in the context of coding, there is like a there is a benchmark called Suibench uh for your readers and encourage them to read it. Sweetbench is a really good benchmark, which is uh a it's open, it's largely open source GitHub repos where you are um the models are being tested on their ability to merge pull requests correctly with the unit tests passing. So one type of coding data that's helpful to people building models is, and Turing does this, where we are automatically crawling um open source GitHub repos at massive scale. We are also acquiring code assets from startups and other companies. And we are running those repos and figuring out um pull requests where the test cases uh don't pass for today's frontier models and agents. Right. So we have to uh and in some cases where the test cases don't exist, we would create test cases. Um, but you'd want uh data sets like this that are realistic, like reflecting real-world software engineering, long horizon where the agent has to work for hours or days to complete a task, and calibrated at the right level of complexity. Uh, if you have tasks that are too easy for the model, there is no learning signal. If it is too difficult, there is no learning signal. So there is a sweet spot of complexity that you'd want the RL environments to be at, usually like 20 to 40 percent, something in that zip code, like a Goldilocks zone. And you'd have to spin these up at massive scale. And uh and Turing does that. So that's an example of like one type of coding data set where you're evaluating the agent's ability to merge uh pull requests with test cases that pass without false positives, false negatives. There are other constructs like this. Um there is a benchmark called terminal bench, uh, where you're evaluating um terminal bench is the kind of benchmark you'd need for like a command line coding agent that has access to a terminal, like ThinkClot Code, Codecs, Gemini CLI, Grok Build agents like that. There it's an interesting problem where you give the agent access to a terminal and access to the internet, and it has you're evaluating its ability to do complex tasks in software engineering, machine learning, cybersecurity, scientific coding, etc. In theory, if you have access to the terminal on your computer and the internet, you could do anything on your computer. And there again, it's you're evaluating the uh agent's ability to this to do this type of knowledge work. Where with Terminal Bench, you don't just evaluate the output, you evaluate the state of the system. Maybe you might be uh you might be you might have to install something, you might have to write code, you might have to create some documents or PowerPoints or spreadsheets. Um yeah, so that's another example. Tunnel bench is like another benchmark. Um, there are many others in coding. There's MLE bench, which evaluates an agent's ability to write um machine learning code that satisfies some objective. The way to think about reinforcement learning is um if you've um I I think the the example that everybody uh remembers well is um Alpha Go and Alpha Zero by DeepMind.

SPEAKER_03

Okay.

SPEAKER_01

Um, so Alpha Go was DeepMind's go-playing program that was originally trained by learning from expert humans. Um Alpha Zero was their program where it was not trained from humans, it just played against itself. And obviously, in a game, you get a reward based on whether you won the game or you lost the game. And depending on whether you won the game or lost the game, you can um you can figure out like which moves led to you winning the game or losing the game. You can have intermediate rewards. Uh you can imagine a similar construct for coding or for knowledge work, where the reward can be whether you did the task well or not. And but you have to be smart about intermediate process rewards. How do you reward partial completion to get to the task? And the task could be writing a program that can automatically vet a finance analyst. Maybe that's a task. Or it could be create a board deck. Uh, or it could be evaluate this investment opportunity and write an investment memo. It becomes a little more complex when the task is not binary, like with code or with math, it's relatively more binary, easy to verify. But how do you verify the quality of a board deck?

SPEAKER_03

Yeah.

SPEAKER_01

It's uh you have to be, you have to have like a good rubric-based evaluator. Um, so yeah, lots of interesting problems. But reinforcement learning has been one of those fascinating areas that uh, like deep learning, has been like a very rich vein. Uh the thing I find fascinating, Siddharth, is um Ilya Sutskiever, the chief scientist at OpenAI, like I saw in a podcast that apparently his research program for OpenAI in 2014 was um train a generative model on the world's knowledge and do RL on it. Right. And this was in 2024, and more than a decade later, it's still the formula that like everybody's largely following. Um I remember this Sam Altman quote of um do more of what works. And I feel like deep learning works and reinforcement learning works. And I think all the labs are scaling this up, and we are constrained by algorithmic research, compute, and data, and companies like Turing advanced the data pillar, NVIDIA advances the compute pillar, and of course the labs advanced the algorithmic research pillar.

SPEAKER_02

And uh from 2014, let's say OpenAI started roughly during that time frame. So 2014 to 21, 22, uh the world never knew about how fast AI accelerating or uh but suddenly 22, 23, until now, every year, the acceleration is 10x more since then. Why why is uh so much development happening at this space?

SPEAKER_01

Um, so I think like I I feel like um up until 2012, we were in the pre-deep learning era. We were in the machine learning era, right? Where I mean I did machine learning at Stanford, like the where you would, there were all these different algorithms you would use. Neural networks was one such algorithm. We hadn't really discovered that deep neural networks trained with lots of data and lots of compute do magical things. Yeah, we hadn't yet discovered that. 2012 was a milestone when ImageNet came out, the ImageNet breakthrough came out, where these convolutional neural nets um exceeded human-level performance in certain tasks. And after 2012, the world just doubled down on deep neural networks, lots of compute, lots of data. And we started using GPUs uh for training. Um and I think there was a small group of people that knew what was happening, and the there was that group that bet on deep neural networks. I think the OpenAI team was one of them. And I think it's one of those things where uh exponential growth is like exponential improvements are like hard to fathom. And between GPT 2 to GPT-3 to GPT-4, it's like as you kept scaling up, these models just keep getting keep kept improving. Like it is it was reported that the sorry the current series of frontier models are largely in the trillion parameter realm.

SPEAKER_03

Yeah.

SPEAKER_01

Now we are starting to see 10 trillion parameter models. Um it's and I think when these models come out, I think we'll see it'll feel even more magical. Um some people say like intelligence is log of compute, like you're just um and the scaling laws are continuing to hold, meaning bigger model with more data, with more compute means that the models smoothly keep getting better. Um, it's like it's it's like the when the pre-training loss keeps coming down, it's like the model's performance on also all these a variety of different intelligence benchmarks seems to keep going up. Um so I I think it's just people discovered the power of I think OpenAI like led the way, and uh everybody like has discovered that the scaling laws are holding, and I think it's the world's energy being more focused um on a recipe that works. I think so we're seeing faster progress.

SPEAKER_02

God, but but since now, uh, you know, and anthropics started in 21, 22, roughly, but in three years it's very astonishing how much they have been able to accomplish. And today, across enterprises uh or even uh mid-market startups, they have become the default uh medium for coding. How has such acceleration happened in in only three years?

SPEAKER_01

I think with coding, um coding is one of those areas where because it's verifiable, I think that there is a good path to using reinforcement learning to improve coding models um quickly. And I think like The labs, many of the labs see that coding is really important. It's important for a few reasons. One is many believe that the fastest way to get to AGI is to automate AI research itself. And to automate AI research, you need a really strong coding model. Since AI researchers are implementing research papers, trying out new ideas, you have to write a lot of code. So if you can accelerate your ability to write high-quality code, ideally figure out what experiments to run, you'll of course accelerate. So there was a lot of energy that went into improving coding. Second, when the models improve in coding, they have out-of-domain performance. They also improve in other tasks that have nothing to do with coding for reasons we don't fully understand. My hunch is that when the models improve in coding, they also improve in tool use. And that helps these generalized computer use agents like Codecs, Cloud Code, Gemini's CLI, etc. So I think a lot of people prioritized coding. And as a result, the coding models improved. Like that's uh, I mean, uh we are probably the largest provider of coding data to all the labs. And uh like before we expanded into other types of knowledge work, like we did, we really focused on coding. Um, so part of it is the focus and the importance to the lab, so just improved more quickly.

SPEAKER_03

Yeah.

SPEAKER_01

And um, and I think like we some of these um benchmarks that we had, like I feel like there were a lot of benchmarks that were somewhat more academic, where you're testing intelligence rather than the ability to do real work. Um, and it was needed at the time, like in the in the last few years, we needed to know how smart is AI. So we were we were happy when AI wins like a gold medal in a math Olympiad or a physics Olympiad or wins a coding competition, stuff like that. But I think there were also some really good benchmarks for coding. I do think Sweetch was a good start. Uh now Sweetbench is saturated, now you need more complex coding benchmarks. Turing is creating many complex coding benchmarks. Uh, and those also helped the labs hill climb on something that was more economically valuable.

SPEAKER_02

So, one of the interesting things is that you mentioned everything that you give to a model is almost like a like a uh which you shared offline, is almost like a coding problem. Can you give that example that you gave me offline?

SPEAKER_01

Yes, I was saying like one reason coding is so important in AGI is a lot of problems are reducible to coding.

SPEAKER_03

Yeah.

SPEAKER_01

For example, uh, if you're asking an AI a question like um uh what's the difference between AI investing in Silicon Valley versus AI investing in uh uh India? Um a good model would come up with a plan. The plan might involve querying crunch base, querying pitchbook, downloading some data, figuring out, okay, let me break down AI investing by sector or by uh category. Let me write some Python code to analyze the data, to visualize the data, and present it to you. So it wrote code as part of its um execution. So I we've we've uh we all know uh that you know a decade ago, Mark Andreessen spoke about um software is eating the world. What he really meant was software is eating all types of knowledge work, and now AI is eating software. So AI is eating all types of knowledge work. It most tasks people do involve data analysis, involve some type of writing code. For example, like I'm coming off of a leadership offsite with uh with my team. And one of the things we do in the off-site is figure out exactly where to bet on. And it's actually a data analysis problem. Like you're looking at which segments are growing fastest, which segments are uh not growing as fast. You might want to analyze data from your own internal systems. And that's an example of um again where coding is kind of embedded in the loop. If I ask my head of FPNA, what are some areas in Turing we should invest in, what are some areas we should reduce our investment in, that's again a coding problem, right? Um a lot of problems in life are math problems framed the right way. And code is a way to express and solve math problems. And I feel like if you can solve um coding, tool use, reasoning, and multimodality, you have the keys to superintelligence. Uh, those are the building blocks. If you can solve those four, you can automate anything that a human does in front of a computer. And it'll give humans superpowers. For example, um, as an investor, uh, you might be interested in analyzing what are some trends in investing among top investors. That's a coding problem. You might be curious how is my portfolio at NEON different from Sequoia's? That's a coding problem. Coding and data analysis. Now every human has the benefit of a giant software engineering team in the cloud that'll envision whatever they want. It's actually even more than that. Like when you think of you having access to a giant team of software engineers in the cloud, the onus is on you to also ask the right questions.

SPEAKER_03

Yeah.

SPEAKER_01

But now you, I mean, given we've also baked in um humanity's best knowledge from strategy consultants, analysts, data scientists, software engineers, product managers, CEOs, CTOs. It can, I mean, you could literally ask your AI agent to come up with a strategy for how to build a portfolio that maximizes returns. Yeah. Of course, it may be wrong today, and it'll probably be wrong for a while, but that's where like you could exercise your human judgment to correct it. Um basically, coding and coding is the key to superintelligence. You solve coding, you've solved almost everything.

SPEAKER_02

So then with this thesis, uh, the edge in most fields would the human edge in most fields or knowledge worker working fields, for example, investing is one of the fields, right? Where judgment counts. Uh right. And in public markets, the data is more visible. In private markets, uh you know, you don't discover or data is not visible mostly, uh right, especially at the earlier stages. So according to that thesis, the the human judgment becomes less and less uh uh important because if AI superintelligence can make those judgments where to invest, then everybody has access to that information.

SPEAKER_01

So I actually disagree. I think human judgment can is gonna become even more important. I think what's happening is the floor is going up. The for example, if a um if a software engineer is working with a coding agent that has now written code for a week, um, you do, I mean, reviewing the code that get that is that that got written needs work, you have to know where to look, what to check. Of course, AI will also help with that. I um and in the in your point about investing too, I think I don't think these models are still at a point where um human input is not needed. I think of these as productivity accelerators. Like today there are things that you're using tools for to accelerate your ability to do to do work. Um I think um these frontier models are the ultimate knowledge work accelerator. So humans can solve problems at higher and higher levels of obstruction. Um uh I do uh I think like, I mean, we used to think about humans as um having um in the Andy Grove high output management sense, maybe your span should be seven to eight people that you're managing, right? Now I think in the limit over the next decade, every human might run seven to eight companies as these as the loop from idea to a I mean, as you could go from prompt to company in hopefully like a single step over the next decade.

SPEAKER_03

Yeah.

SPEAKER_01

Um I think it'll it's we still uh I think humans will ultimately still be key to knowing what questions to ask, knowing what problems to solve. Um, it's just that now um going from idea to execution has been democratized. Like in the past, if you were a plumber or a doctor or a lawyer, you might have an idea for a cool startup, but you're bottlenecked on I have to raise some venture capital, I have to hire some engineers, I have to hire a salesperson, a marketing person. There was high friction. Um I now it's like much lower friction. I'm very bullish on what this means for humanity. I think we'll see more entrepreneurs, lots and lots of companies getting started. I think larger companies today will be, companies in the future will be a lot smaller. Sure. Um I I think humans, I think humans are just gonna be, I mean, we are more productive today after personal computers, after the World Wide Web, after Google. And now I think this is the next step. AGI is gonna just make humans 100x more productive.

SPEAKER_02

So one limitation that I see right now is let's say if I have a high agency teammate in the in my investment team, yeah, on a daily basis, they'll think on what problems to solve. Uh, I I think models are not there yet.

SPEAKER_01

Let's say not yet, but they will be.

SPEAKER_02

And you believe that that that is near term future.

SPEAKER_01

Absolutely. Absolutely. It is even now, in theory, you could get that behavior by looping the model as an agent. Um, that there are ways to get that from the models today with the right scaffolding, with the right harness. Absolutely. For example, I I uh have a project at Turing that I run. I have a few agents running around to help me run Tiering. The way I improve that code base is to also have an agent suggest to me how should I improve the code base and to do it. So it's in a self-improvement loop by itself. You can imagine applying a self-improvement loop to yourself, to your company, and letting it run. Yeah, right. Obviously, it may not be at a point where um you trust it enough to keep running it constantly. But even that is huge. I mean, even your human analyst might come to you for approval. I have this new idea, I have this new business plan. I think we should do this. This will require this much investment, but it will give us this much in revenues. You will you will do that, right? So I would argue that it's already there with the right harness, and it's gonna get even better, like with especially with these bigger and bigger models. Um I think of I think of what we have right now is intelligence as an API. Um, and it's a very unique type of intelligence where it's already superhuman in many things, it's subhuman in certain things, and as long as you build around its jagged intelligence, like you're good.

SPEAKER_02

Understood. So today uh uh software engineering is one of the the works that is getting replaced. Globally, companies, if we observe, uh right, people are replacing uh uh or reducing their team size in software engineering.

SPEAKER_01

So I I think it's it's there's a few things happening. I think so um I think these models have democratized coding to the point where every software engineer is now insanely more productive. Now, if you're if it's a company that is that has product market fit that wants to scale, suddenly your team is now 10x more productive and you can grow faster than ever. If you're a company that doesn't have product market fit, or you're operating, you're treating RD almost like a cost center, then yes, you need fewer engineers, and you can you can probably do that. But it's also the case that engineers are now able to create a lot more than ever before. Um it'll create some challenges with distribution. Like if you can create a lot of apps, like how do you get users, how do you make money? Those are still important problems. Um but I feel like, like at Turing, for example, like we are scaling our software engineering team. Like we want really strong engineers because we see this massive opportunity. And in the past, like we would have been um like now for this, we are significantly more capital efficient in terms of how we can scale. Um so I think the impact of it is dependent on uh the company and where it is in their life cycle. You you are correct that if you're a junior and if you're an engineer that is not using AI, uh, not even an engineer, if you're in any type of knowledge work, if you're not using AI highly, like I'm I'm talking, you should actually be in the mindset of agent first, human second. Turing operates this way.

SPEAKER_02

Um what do you mean by that? Can you share examples?

SPEAKER_01

So um every knowledge worker at Turing, either in coding or any other domain, um, I want them to use AI to do the work first. So agents should create, humans should steer, humans should create prompts, skill files, give the agent access to the right tools, access to the right knowledge sources. Humans should create the scaffold, and AI should create. For example, if you're creating a board deck, yeah, don't the human should not, the investor, let's say that uh my head of investor relations, he shouldn't create the board deck. He should give the model access to the right sources of knowledge, access to the right tools, write a skills file to apply consistent formatting, um, give give the right instructions on how to create the board deck. Uh, use codecs or cowork or Gemini or Grog to build it. Um, and then when there are mistakes in it, prompt the model um to improve it.

SPEAKER_03

Sure.

SPEAKER_01

And then keep improving the skills file so that next time it's even faster. Basically, um the agents create v1 of the work. Humans are verifying and iterating, humans never create v1. Um there's an interesting post from OpenAI on um how they used how they used codecs to uh to the for one particular software project, which was agent first, human second. Uh humans were not allowed to write code directly. They can only like prompt agent. Yes. Um so I think like knowledge work is that's what I meant when I said um agent first, human second in how you do knowledge work. Um and uh I think it's a huge productivity unlock across every function and every workflow in an organization. Think of every function software engineering, sales, marketing, finance, people, recruiting. Think of every role in the org chart in that function.

SPEAKER_02

Yeah.

SPEAKER_01

And every role is a composite of workflows, right? And every workflow has an artifact that you create. There's a clear definition of done. At Turing, we are in this unique position where we generate data to help the models improve in every workflow, in every role, in every function, in every industry. We create data and oral environments to improve the models, and we also go build with it in the enterprise. So we know exactly where these things break. So that's why like I have the somewhat unique vantage point into seeing where this is going. Um and it's super exciting.

SPEAKER_02

And do you think the difference between today services and product company is blurring out?

SPEAKER_01

Yes.

SPEAKER_02

Can you share your perspective on that?

SPEAKER_01

So I think what's happening is there's two trends that are happening right now. One is the models are becoming agentic, where the models can accomplish long horizon workflows for every type of knowledge work, for every type of coding project. The models are able to do it end-to-end, right? And if you can control your computer, they can do everything you can in front of your computer, right? With the humans verifying. So, on one hand, the models are becoming agentic. And in theory, if the models have access to the right sources of data, you kind of don't need software in the middle. The models can do it end-to-end. For example, imagine you want to create here is a fictitious conversation that I may have with an agent. It's not quite fictitious because at Turing I'll soon be able to have this conversation. But my conversation to the agent might be hey, I want to create a hiring rec for a researcher in reinforcement learning for code. The agent will tell me, Oh, that's great. Uh, here I picked up, I created V1 of a JD. Does this sound good to you? And I'd say, uh, you know, uh, this sounds good. Um uh no prior experience needed.

SPEAKER_03

Yeah.

SPEAKER_01

So let's say, okay, done. I made that change. And then it says, the agent says, Hey, by the way, I looked up um comp benchmarking data and I came up with these compounds to the for the role. Uh, I also checked with the finance agent, and this is in budget. Uh, does it sound good to you? I said, Yeah, that sounds good. Let's go for it. And add a geoconstraint for the Bay Area and Bangalore. Um and then the agent asks me, hey, uh, can I update the website with this new role? Yeah, go for it. And by the way, in the future, it may not even ask me. It remembers this once. Next time it'll do it itself, right? And it could say, um, I've also created, would you like me to create a LinkedIn ad for this role? I say, yeah, uh, do it with a with, I don't know, $1,000 budget. Um and then it could, it could then say, uh, I see that your calendar is uh, you have time on your calendar, you have like 10 hours a week in your calendar for recruiting. Can I use those slots? I say yes. So now in, and then it uh when there when somebody emails me for this job, it could set up the meeting, it could even do the first assessment, it could assess talent on certain things. Now, in this workflow that I mentioned, I did not talk to anybody in finance, anybody in HR, anybody in recruiting, anybody in the website, anybody in our engineering team. I did not talk to our performance marketing team to run ads. It was just the agent. And no, there was no SaaS software involved. There was no greenhouse, there was no Asana, there was no uh there was no separate vetting software. The agent kind of whipped up whatever it needed on the fly. There was literally no human involved, right? It was so there's one there's one trend where the models are becoming agentic. So legacy SaaS has is is going to have some challenges like as the models become more agentic. I'll share uh think of this as a dual pincer. It's a pincer movement. That's the top-down pincer. Yeah, the bottoms-up pincer is every human can now write software. Yeah, my EA wrote software because I have a crazy schedule, like I work a lot, I travel a lot, like I'm always on, so it creates a lot of work. So my EA wrote uh software to manage her EA team, and that software also automatically keeps track of all demands on my time. Prioritizes it based on what's most important for me. So nothing gets dropped. Now, in the in in she could have used Trello or Jira or Asana, but she didn't. She built her own task management system. And she built something that was more custom for her than any of that software. It's the era of infinite software, custom software. And there is an alternate universe. I've heard pitches for these types of software to help EAs serve their execs better, but probably wasn't venture backable, or for whatever reason it didn't, it didn't take hold. But she just built it and you and used it. So we're not buying it. So you can see that superintelligence is eating SaaS and it's eating services, right? It's as the models become agentic in this dual pincer motion. Like one, every human now creates software that they wouldn't pay for, but it's just a fun weekend project now.

SPEAKER_03

Yeah.

SPEAKER_01

And the models are becoming agentic. I think these two pose some unique challenges where I think the key is going to be who has access to the data layer. I think the data is going to be key. And in the enterprise, I think that'll be particularly important.

SPEAKER_02

So according to this theory, then all workflow SaaS would, let's say, next five years would not be required. Then only the system of records will be required.

SPEAKER_01

I think it is a possibility.

SPEAKER_02

So help me explain. A company which is starting new and scales quickly, how does it build its own system of recording?

SPEAKER_01

I'm not saying they may they'll build so they could uh I mean it's all of it is just software, right? And if in the past, like writing enterprise grade secure software was hard and it would have taken a while, now it's become easier. So they would just use the models to write what they need to write. And the benefit for them is custom software, software that is uh fine-tuned for their specific workflows. Like we are used to, because the cost of software RD was quite high, especially for ML AI software, it made sense to go outside and use something. Now you can build it. I mean, uh, there was a whole industry around um uh you know uh implement uh you know, like the I don't want to name specific software, but you're like implementing uh your own customization to some legacy SaaS software, right?

SPEAKER_02

Now you don't have to the entire GSI industry is based on that.

SPEAKER_01

Yeah. There's an interesting uh thesis that's gonna be validated. Like it's kind of unclear how it's gonna uh how it's gonna play out, which is on one hand, we could be in a world of, I'm gonna call it the no fine-tuning world, where you have a frontier model, like a trillion parameter or 10 trillion parameter from OpenAI, Anthropic, DeepMind, Meta, XAI, Amazon, Microsoft, somebody, a frontier model. And all you need to do is manage context and memory, give it access to the right tools, and let the model cook. So that's one way, the no-fine-tuning camp. Uh, all you have to do is be smart about how you're managing context and memory. You don't have to touch the weights. That's one world. Let's call it the no-fine-tuning camp. There is another camp, which is hey, you don't need these giant models. You could, with a half a billion to 10 billion parameter model, fine-tune it on your proprietary data, distill your proprietary human knowledge into the LLMs, automate your proprietary tool calls. It'll be faster, cheaper, uh, more accurate. Um that's one school of thought. Let's call it the fine-tuning camp. I think it'll be interesting to see how it plays out. What I'm seeing more and more is there's a lot more people going into the no fine-tuning camp than a couple of years ago for very high value enterprise use cases. Um, can you give examples? The um, I mean, I have a system at Turing that um helps me run Turing. And this is a system that I built on top of one of the frontier models. And I didn't have to fine-tune it. Like all I had to do was intelligent management of uh context, memory, access to the right tools, access to the right sources of knowledge. Um and you can imagine, like in um in asset management, in financial services, there are workflows in um in uh in the in the day-to-day of a fund controller, uh, of um uh an asset manager, where the the the frontier models are actually doing very, very well, like without any fine-tuning. Um I I I would start more specifically with what's the workflow that you're trying to automate, and um what's a verifier for the quality of the artifact that that workflow is creating. Um and step one should be to like just do it with a frontier model and see how well it does. And most times you'll see that you don't have to touch the weights. But there are certain workflows, let's like customer support, um, like an invoice-to-pay system, some of these more basic stuff. Where I think you know, a smaller open source model that's fine-tuned on proprietary data might be faster, cheaper, more accurate. But for really high-value use cases like um discovering alpha in investment opportunities, you want the most powerful model. And you'll probably see that that powerful model with good context and memory management will do very, very well. And I I personally don't fall into any specific camp. I think it's much more specific to the workflow that you're trying to automate. Some require giant models that you don't have to bother fine-tuning, um, but have good context management. Some require smaller models that you can um some you can make do with smaller models. Yeah.

SPEAKER_02

Uh and for our audience, if if you can share, let's say, in very simple terms, uh training and inference has been the key. And and like say the last few years, there was huge investment in training part, but now inference is also coming up as well. So probably if you can share what has been happening and where is it going in both these areas.

SPEAKER_01

Yeah, yeah. So um there is like um there is scale up in compute for a few different things. Um there's pre-training compute. So I would break training into two parts, uh, pre-training and post-training. In the past, it was a lot of the compute went into pre-training. Yeah, now a lot of compute goes into reinforcement learning in post-training as well. Um, especially after O1 came out and Deep Seek came out. So there's a lot of compute that's needed for still for training, right? But what we're seeing is as these models have gotten so much more capable, uh, people need so much more. Like I'm compute constrained, like for how much I want to create agents, run agents. So it's um uh so so inference compute is skyrocketing because demand is skyrocketing.

SPEAKER_02

And what are the use cases for that?

SPEAKER_01

All types of knowledge work, all types of coding. The um, for example, like um what's something in your workflow that you do most frequently in your workflow as uh I I think search search for company data.

SPEAKER_02

Let's say, for example, if I'm evaluating a cybersecurity company.

SPEAKER_01

Yeah.

SPEAKER_02

Uh I I would try to find uh uh details about that company mostly. Uh details about the competitors and where industry is heading. Yeah. And I would then try to find customer usage uh data to to verify uh how our customers are using. Is are is the customer usage growing every month or not? Uh because because for us, I'll give you a simple example. We come at pre-seed at zero revenue. So it's very important that after investment, is it two, three years, a company can reach 10 million ARR. Yeah. So all our effort is to validate and validate or disvalidate in that direction.

SPEAKER_01

Yeah. Can I I'm just brainstorming on the fly with you. But I would can I give you a use case for you to be compute constrained, inference constraint? It would be to run a deep research agent in a loop that is looking for cybersecurity companies all the time. While we're talking, that agent is running, right? Maybe on a Mac Mini somewhere safe. It's running, it's pulling uh interesting companies that it thinks are good. But then you've given it a budget. So it goes and spends some money on GLG or like some expert network, it recruits some humans, the agent spins up Zoom and does a conversation with that human, does interviews with them, gives you a summary of the interview results. And in some cases, it may play with the software itself. Cybersecurity, I don't know if it is possible to do it well, but it might be. It might test the software by itself, or it might recruit some other humans, give them access to the software, collect their feedback, maybe it recruits some CISOs. Um, as long as you give it enough money to go hire people, research the web, go hire people, talk to people, run experiments, and come back. This agent may come back to you uh every week and give you a summary. Hey, Siddharth, like here are the top three companies in cybersecurity that I think you should pay attention to. These are the most hyped, these are the most high, these are the most underrated, but where maybe there is more investment alpha. How much would you pay for something like that? It's literally helping you discover your next investment opportunity. It's like your associate or your.

SPEAKER_02

Yeah, absolutely. I'll I'll pay the cost of an associate.

SPEAKER_01

Yes. And that's that's that's why you'd be inference constrained, right? Because to do that, it's making a ton of LLM calls, it's like running in a loop, right? Like that, I feel like if we kept talking, we can come up with even more. Where and that's why like inference is skyrocketing. It's because there's so much model capability overhang. The models are capable of so much more, but humans are still not extracting the fullest value from these models, especially in enterprise. Uh, in enterprise, I think first mile and last mile is still a problem. Like the average enterprise is still not getting the best value. And at Turing, that's one of the reasons why we focus on enterprise deployments. Even if the models did not get any smarter, there's so much more value to be unlocked. Uh, but of course the models are gonna get a lot smarter.

SPEAKER_02

So it's gonna be uh I think all the money in the world is getting concentrated on the models.

SPEAKER_01

Yes. I used to, I used to like um a few years back, I used to think about why why do we need um so much compute? Yeah. Now it's clear to me. Like, I think humanity is gonna be reasoning bound and compute bound to solve some of our biggest problems. And especially if coding, enterprise, and STEM are so important. If you accelerate coding, you'll accelerate AI research. If you accelerate knowledge work, like we're gonna create so much more value in the economy, like we'll move the GDP of the world. And if you accelerate scientific discovery to discover new materials, new um new sources of energy, um, more efficient uh robotics, there's just so much more um innovation that it'll unlock. Uh so I think like it's it's gonna be an exciting decade or two ahead, more.

SPEAKER_02

The the biggest fear right now among founders and and the community is that are models going to eat up all the work? Then we discuss about cybersecurity coming. If models eat up the domain of cybersecurity, then what is left for entrepreneurs to create?

SPEAKER_01

I think we'll keep building on top of the models. Like uh like uh so Sam Altman spoke at a Turing event uh a short while ago. And one thing he would say is uh there are two schools of thought, there are two camps. One camp that's betting on a world where the models are getting better. There's another camp that's betting on a world where the models are not getting better. You create all these other structures to fill in the gaps. I think like I personally would bet on a world where the models keep getting better. There are certain things you may do today thinking that the models are not going to get good at this. I would not do that. I would just think of intelligence as continuing to improve. I think like humans, the best founders will discover big markets. Like my advice to founders is uh pick a big market. Um, markets that don't exist don't care how smart you are or how hard you work, pick a big market. Pick a market with weak competition, like ideally not competition that's uh other big tech companies or other AI labs. Like, don't don't pick a market with weak competition or analog competition. For example, Uber competed with taxi companies, analog competition, not a fair fight. Um, companies like um Airbnb competed with like hotels again, uh analog competition. So pick analog competition or relatively weak competition. Don't don't compete with the frontier labs or big tech companies, that's just not a good strategy. Uh and third is um the um I think the basics of hire really smart people who are highly driven. Um I think that always holds. Um I would think about I I would think about coming up with ideas that are betting on a world where intelligence will be abundant and the models keep getting smarter. Um I think the hard problems to solve will still be how do you uh get distribution, uh, how do you generate revenue, not short-term revenue, but long-term revenue. I think those problems will still will still remain. Um and I think I would pick on problems that require inherently human um a human touch, like maybe problems more in the physical world rather than in the purely digital world. Um maybe problems where human trust, human connection um are important.

SPEAKER_02

Um can can you give examples of some of the areas where let's say if you have to invest personally, which was which are the kind of companies that you would invest?

SPEAKER_01

So uh I'll give you a set that I think is uh necessary but not sufficient. It's a subset. I haven't thought deeply about expanding it. Um I would bet on companies that are an input to superintelligence. Like uh Turing's an input to superintelligence, like date the models need data. They're always gonna need data for a long, a very, very long horizon. Uh I would bet on companies that are either an input, like I mean, what are the key inputs? Compute, energy, data. You need high-quality research talent, but the labs would hire that themselves. Those are inputs, they are fuel to superintelligence. I would bet on companies there. Um, I would also bet on companies that are betting on a future where intelligence is widely distributed. Turing's enterprise business is a bet on deployment that intelligence is already there. Let's just build the right systems around it so enterprises can unlock value. Um, for example, if Neon were a Turing customer, we'd build that agent for you so that you have like a um an associate GPT or an associate cloud that can do these. So I'd bet on companies that are inputs to superintelligence or an outcome of superintelligence. In a world where superintelligence exists, you'd need it. Um I might also like bet on robotics clearly will be huge. And it's I think it's a matter of time.

SPEAKER_02

Uh would you invest in domains like cybersecurity, or would you think models would take up that domain?

SPEAKER_01

I would invest in domains in cybersecurity. Turing is investing in cybersecurity to make sure that um the um yeah, to make sure that agentic coding is safe, both for offense and defense.

SPEAKER_02

But like with anthropic launching models like Mythos, there's a huge debate that would they try to take up whatever is going on on vulnerability management, and then slowly start getting into endpoint management and everything else.

SPEAKER_01

Yeah, yeah. I think it's it uh uh I would not build a I I I when I said I would invest in cybersecurity more in, I mean, we are investing in cybersecurity at Turin to um help these models keep our systems safe, help these frontier models detect um vulnerabilities in a safe way, um, and help the coding models write code that is more secure. I meant in that sense. Uh, but you're you're right in this the same reason I gave against uh legacy software, yeah. Uh legacy SaaS software also applies to legacy cybersecurity products too.

SPEAKER_02

Or I I would say uh entrepreneurs not stop thinking in terms of products, but start thinking in terms of problems. Yeah.

SPEAKER_01

Yeah.

SPEAKER_02

What that's what you are hinting at, right? Uh what whatever uh problem horizon that feeds into model or becomes an outcome of model. But to to contest your, let's say the work that you are doing for an enterprise, yeah, the model are coming or will come in that kind of a work. So, how how do you make that defensible and growing?

SPEAKER_01

Yeah, yeah. We are uh we are building on top of the models. Oftentimes with intelligent context management, memory management, we for the enterprises oftentimes they want a system that is relatively model agnostic. Uh, you'd want to make sure that you have the right auditability, governance, traceability, verifiability. You want a human in the loop. You need a workflow that's constructed on top of the models. We would we would not build our own model. We don't want to compete with our customers, but we are building around the models so that enterprises can unlock the fullest value. And in the real world enterprise, there is still a lot of last mile messiness where it is there is a gap between what the model is capable of. And what an enterprise is actually able to extract uh value out of. And it's it's non-trivial. Like the and even at Turing, it it requires work to make sure that you're operating in the right agent first, human, second way. Um yeah.

SPEAKER_02

So you you so your bet is the the model will not come into the last bit of stitching together everything.

SPEAKER_01

Yes, and part of that bet is you're seeing like uh I mean Turing is hiring an insane amount of forward-deployed engineers. You can see OpenAI, Anthropic, and all the Frontier Labs also doing that. Yeah. Because it's hard. It's hard, and that last mile is also not fully digital. In some cases, you have to you have to um be in the right rooms, talk to the right people, observe the right workflows to make sure that you're doing something that helps helps people. You have to manage change inside those organizations, you have to train people in new workflows. Um it is complex. And you wanna, you're you're also not vibe coding in the enterprise, right? Like you have to make sure that systems are secure, maintainable. Um, they are built with the same coding standards as the rest of the code base in that enterprise. Uh, so that is that is quite a bit of last mile engineering to do. So, what we are building on the enterprise side is a palantir for AGI. We've built our platform that makes it very easy for us to build and deploy agents, uh, which is highly model and agent agnostic. It's actually agnostic at every layer of the stack. It's agnostic at the layer of the cloud, the model provider, agent tick orchestration layer, memory. It's very, very modular, which is which is helpful for the enterprises so that they can use the right um models and agents for the right task.

SPEAKER_02

Well, no, thank you so much, Jonathan. You know, it's been an amazing conversation. I learned a lot in this process of uh discussing brainstorming with you. Uh my last uh question is uh if you see the world different in 10 years, uh right, what would your prediction be for 2035 or 2036?

SPEAKER_01

The name companies will look very different, jobs will look very different, education will look very different. I think it'll look almost unrecognizable from what exists today. Um and I think like we are it's gonna be a golden age of productivity. I feel like most of humanity's most challenging problems are intelligence constrained in terms of us being able to throw more intelligence at it to solve the problem. But imagine a world where intelligence is so abundantly available so that we find cures to diseases faster. We've mastered, maybe, maybe we've mastered some crazy interstellar travel. We've uh we've discovered new materials. Um I think it's I think it's gonna be, I think we'll it'll it'll be an exciting future. Like when we look back 100 years ago, 200 years ago, sometimes you may wonder, like, how did people live without computers uh and all of these tools and gadgets we take for granted? I think people will look at this this this will look archaic. Um so I think the so those three things will look very different. Enterprises will look different, jobs will look different, and the education system will look totally different. And I hope we've uh we've also maybe we've also extended human lifespan, right? Like with uh with better knowledge of human biology and better drugs and better elimination of um sort of common reasons people uh people die.

SPEAKER_02

Thank you so much, Jonathan. It's been an amazing conversation with you.

SPEAKER_03

Great. Thank you for having me.