Topic of the Month
Chatbot Explosion
Once a rarity, top-shelf AI chatbots are now almost commonplace.
Google launched its most powerful model, Gemini Ultra, in February, followed by Anthropic’s Claude 3 Opus in March and Meta’s Llama 3 last week. While OpenAI’s GPT-4 was unambiguously the top AI model in 2023, Google and Anthropic have since claimed Gemini and Claude rival GPT-4. In addition, Yann LeCun, chief AI scientist at Meta, recently said a model even bigger than Llama 3 is in training—the company has so far launched only small and midsized versions—and it too could be a match for GPT-4.
This race to the top has been fueled by a rapid shift to generative AI overall. According to the Stanford Institute for Human-Centered AI’s (HAI) 2024 AI Index report, the number of foundation models—these are the complex, data-hungry algorithms like GPT-4, Claude, Gemini, and Llama—built per year has grown by a factor of almost 38 between 2019 and 2023. Last year, industry and academia combined to release 149 foundation models in total.
There’s little to indicate the pace will slow this year.
Clearly, models from the likes of OpenAI, Google, Meta, and Anthropic are converging. Mistral, the French AI startup behind the Mixtral AI models, is in the mix too. But it’s Meta's Llama 3 and the company's upcoming 400-billion-parameter algorithm alluded to by LeCun that may unleash a flood.
Llama is a so-called “open-weights” model. It isn’t fully open source—Meta, for example, withholds some training details and restricts commercial use by companies above a set number of users—but it can be tweaked and modified. Developers made over 30,000 new variants based on Llama 2, Hugging Face CEO Clément Delangue told Wired recently. We can likely expect the same for Llama 3 and its larger sibling, only the new models will be more capable.
To date, according to the HAI report, closed algorithms like those from Google and OpenAI outperform open algorithms by a median 24.2 percent. Meta's new releases may help narrow the gap. What can we expect when hundreds or thousands of GPT-4-caliber algorithms arrive?
The open-source AI community is nothing if not experimental. Just weeks after Llama leaked last year, some developers had created versions that could run on laptops and phones. Others later fine-tuned Llama to improve performance, expand context windows, and add new languages. Despite worries the risk of openly releasing advanced models is too great, Meta says this experimentation is what motivates it to go open.
“AI is better when more people look at the code,” LeCun said at an MIT conference this month. “Infrastructure needs to be open source—it just progresses faster.”
For all this, it’s worth remembering the industry has been chasing GPT-4 for over a year.
Viewed from that angle, GPT-4-like performance is already a bit quaint, not least because so many models have attained it. Yes, there’s a lot we can still do with these tools—AI as a whole has reached or surpassed human performance in an impressive range of tasks. But it still struggles in areas like common sense, reasoning, and planning, and it’s still prone to bias and hit-or-miss factuality.
In recognition of both significant progress and the challenges ahead, the HAI report retired a number of benchmarks this year that have become “saturated”—either because models’ scores have plateaued or because researchers have shifted their focus to harder problems. A host of newly introduced benchmarks measuring the likes of agent-based behavior, causal, moral, and mathematical reasoning, coding, and factuality hint at where we’re headed.
These challenges are likely already driving work on future AI models.
OpenAI’s next big update could come as soon as this summer. It seems likely the algorithm will improve on GPT-4, but whether it’s able to make significant inroads on these benchmarks—or run laps around Gemini, Claude, and Llama—remains to be seen. Some problems may require fundamental breakthroughs. Still, OpenAI is no doubt working furiously to level up. And, for that matter, Google, Meta, and others are too.
“Look, I don’t want to downplay the accomplishment of GPT-4, but I don’t want to overstate it either,” OpenAI CEO Sam Altman told Lex Fridman in a March podcast interview. “And I think at this point that we are on an exponential curve, we’ll look back relatively soon at GPT-4 like we look back at GPT-3 now.”
Nothing comes automatically. But the industry has the talent, cash, and motivation to keep pushing the envelope. How far and how fast that push will pay off is still up for debate.