Dear friends,

The competitive landscape of large language models (LLMs) is evolving quickly. The ultimate winners are yet to be determined, but the current dynamics are already exciting. Let me share a few observations, focusing on direct-to-consumer chat interfaces and the LLM infrastructure and application layers.

First, ChatGPT is a new category of product. It’s not just a better search engine, auto-complete, or something else we already knew. It overlaps with other categories, but people also use it for entirely different purposes such as writing and brainstorming. Companies like Google and Microsoft that are integrating LLMs into existing products may find that the complexity of switching not only technologies but also product categories raises unique challenges.

OpenAI is clearly in the lead in offering this new product category, and ChatGPT is a compelling direct-to-consumer product. While competitors are emerging, OpenAI’s recent move to have ChatGPT support third-party plugins, if widely adopted, could make its business much more defensible, much like the app stores for iOS and Android helped make those platforms very defensible businesses.

Second, the LLM infrastructure layer, which enables developers to interact with LLMs via an API, looks extremely competitive. OpenAI/Microsoft leads in this area as well, but Google and Amazon have announced their own offerings, and players such as Hugging Face, Meta, Stability AI, and many academic institutions are busy training and releasing open source models. It remains to be seen how many applications will need the power of the largest models, such as GPT-4, versus smaller (and cheaper) models offered by cloud providers or even hosted locally, like gpt4all, which runs on a desktop.

Finally, the application layer, in which teams build on top of LLMs, looks less competitive and full of creativity. While many teams are piling onto “obvious” ideas — say, building question-answering bots or summarizers on top of online content — the sheer diversity of potential LLM-powered applications leaves many ideas relatively unexplored in verticals including specialized coaching and robotic process automation. AI Fund, the venture studio I lead, is working with entrepreneurs to build applications like this. Competition feels less intense when you can identify a meaningful use case and go deep to solve it.

LLMs are a general-purpose technology that’s making many new applications possible. Consider a lesson from an earlier era of tech: After the iPhone came out, I paid $1.99 for an app that turned my phone into a flashlight. It was a good idea, but that business didn’t last: The app was easy for others to replicate and sell for less, and eventually Apple integrated a flashlight into iOS. In contrast, other entrepreneurs built highly valuable and hard-to-build businesses such as Airbnb, Snapchat, Tinder, and Uber, and those apps are still with us. We may already have seen this phenomenon in generative AI: Lensa grew rapidly through last December, but its revenue run appears to have collapsed.

Today, in a weekend hackathon, you can build a shallow app that does amazing things by taking advantage of powerful APIs. But over the long term, what excites me are the valuable solutions to hard problems that LLMs make possible. Who will build generative AI’s lasting successes? Maybe you!

One challenge is that the know-how for building LLM products is still evolving. While academic studies are important, current research offers a limited view of how to use LLMs. As the InstructGPT paper says, “Public NLP datasets are not reflective of how our language models are used. . . . [They] are designed to capture tasks that are easy to evaluate with automatic metrics.”

In light of this, community is more important than ever. Talking to friends who are working on LLM products often teaches me non-intuitive tricks for improving how I use them. I will continue trying to help others wherever I can.
