
The AI Rundown by Lightscape Partners – 04/08/24

Apple published a paper on a new model that understands spoken and on-screen context, Princeton researchers released SWE-agent to turn language models into software engineers, and Microsoft may be building the most expensive computer ever.

Image generated by Ted Wagner using DALL-E 3

Good morning and welcome to this week’s AI Rundown.

Last week in AI,

  • Apple released a paper announcing a new model called ReALM, claiming that its largest variants understand spoken and on-screen context better than any other model and outperform GPT-4.

  • Researchers at Princeton developed a framework to convert any language model into a software engineer capable of debugging real issues in a codebase.

  • Microsoft is rumored to be planning a $100 billion data center to house OpenAI’s next AI supercomputer.

Check out the AI conferences happening this month here.

If you’d like a quick refresher on industry terms and the current landscape of AI, skip to the bottom.

Top AI Stories of the Week

Apple reveals new context-aware model that outperforms GPT-4. Link.

  • The new model resolves ambiguous references in spoken requests and interprets visual context on the screen, with the goal of improving interactions with Siri (a rough sketch of the idea follows this story).

  • The study claims that ReALM-3B substantially outperforms GPT-4.

  • The paper was released ahead of WWDC 2024, further building anticipation for Apple’s annual event, which many believe will center on the company’s upcoming AI strategy.
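As a rough illustration of what treating reference resolution as a language-modeling problem can look like, the sketch below serializes on-screen entities into plain text so a language model can resolve a vague request like "call that number." The entity fields, prompt wording, and helper names are illustrative assumptions, not Apple's actual format.

```python
# Illustrative only: a hypothetical way to flatten on-screen entities into a
# text prompt so an LLM can resolve references such as "call that number."
from dataclasses import dataclass

@dataclass
class ScreenEntity:
    index: int   # position in the serialized list
    kind: str    # e.g. "business", "phone_number", "address"
    text: str    # the text shown on screen

def build_prompt(entities: list[ScreenEntity], user_request: str) -> str:
    # Flatten the visual context into a numbered list the model can cite.
    lines = [f"{e.index}. [{e.kind}] {e.text}" for e in entities]
    return (
        "On-screen entities:\n" + "\n".join(lines)
        + f"\n\nUser: {user_request}\n"
        + "Answer with the indices of the entities the user refers to."
    )

entities = [
    ScreenEntity(1, "business", "Rossi's Pharmacy"),
    ScreenEntity(2, "phone_number", "(555) 010-2233"),
    ScreenEntity(3, "address", "12 Main St"),
]
print(build_prompt(entities, "Call that number"))  # a resolver should answer: 2
```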

Researchers at Princeton NLP developed SWE-agent, an open-source project that converts GPT-4 into a software engineer. Link.

  • The framework converts language models like GPT-4 into software engineering agents that “can fix bugs and issues in real GitHub repositories.”

  • SWE-agent achieved state-of-the-art performance on the SWE-bench benchmark, resolving 12.29% of issues; a sketch of the kind of agent loop involved follows this story.
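The sketch below shows the general shape of such an agent loop under stated assumptions: `query_llm` is a placeholder for any chat-completion call, and the command whitelist is invented for illustration. It is not the SWE-agent codebase itself.

```python
# A minimal sketch of an LLM-driven software-engineering agent loop:
# the model reads an issue, issues shell-like commands through a small,
# constrained interface, observes the output, and eventually submits a patch.
import subprocess

def query_llm(history: list[dict]) -> str:
    """Placeholder: plug in any chat-completion API here."""
    raise NotImplementedError

# A deliberately small command interface keeps the agent's actions auditable.
ALLOWED = ("ls", "cat", "grep", "sed", "python", "git diff")

def run(command: str) -> str:
    if not command.startswith(ALLOWED):
        return f"error: command not allowed: {command}"
    proc = subprocess.run(command, shell=True, capture_output=True, text=True, timeout=60)
    return (proc.stdout + proc.stderr)[-4000:]       # truncate long observations

def solve(issue_text: str, max_steps: int = 20) -> str:
    history = [
        {"role": "system", "content": "You fix GitHub issues. Reply with exactly one "
                                      "shell command per turn, or 'submit' when done."},
        {"role": "user", "content": issue_text},
    ]
    for _ in range(max_steps):
        action = query_llm(history).strip()
        if action == "submit":
            return run("git diff")                   # the candidate patch
        history.append({"role": "assistant", "content": action})
        history.append({"role": "user", "content": run(action)})
    return ""                                        # gave up within the step budget
```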

Microsoft and OpenAI begin work on a $100 billion data center project called “Stargate.” Link.

  • The project would have Microsoft spend $100 billion building a supercomputing cluster to support OpenAI’s future models.

  • Microsoft has already spent several hundred million dollars building the clusters to train GPT-4.

Hardware + Software

Researchers at Stanford announced Octopus v2, an advanced open-source LLM engineered to operate on Android devices. Link.

  • The model allows AI agents to run more efficiently directly on devices.

  • The framework is currently only available for Android devices.

  • It is reportedly 35x faster and more accurate than a RAG-based Llama-7B setup, with latency under one second.

Stability AI announced Stable Audio 2.0. Link.

  • Stable Audio 2.0 is Stability’s AI audio generation product, capable of producing high-quality full tracks with coherent musical structure.

  • The tracks are up to 3 minutes in length.

  • The model is capable of audio-to-audio generation, allowing users to upload their own samples.

  • The training data was licensed through the AudioSparx music library, ensuring fair compensation.

Google DeepMind researchers introduced Mixture-of-Depths, a method to optimize compute in transformer-based models. Link.

  • Transformer-based language models are the primary model architecture for almost all modern generative AI models.

  • The paper claims they can train transformers to “dynamically allocate FLOPS (or compute) to specific positions in a sequence, optimizing the allocation.”

  • The technique makes models up to 50% faster per step during post-training sampling; a toy routing sketch follows this story.
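A minimal sketch of the routing idea, assuming a top-k router inside an otherwise standard PyTorch transformer block (names such as `MoDBlock` and `capacity` are illustrative, not from the paper’s code): a learned router scores every token, only the highest-scoring fraction receives attention and MLP compute, and the remaining tokens ride the residual stream unchanged.

```python
import torch
import torch.nn as nn

class MoDBlock(nn.Module):
    """Illustrative Mixture-of-Depths-style block: only the top-k tokens
    (by router score) get attention + MLP compute; the rest skip the block."""

    def __init__(self, d_model: int, n_heads: int, capacity: float = 0.5):
        super().__init__()
        self.capacity = capacity                     # fraction of tokens processed per block
        self.router = nn.Linear(d_model, 1)          # per-token routing score
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.mlp = nn.Sequential(
            nn.Linear(d_model, 4 * d_model), nn.GELU(), nn.Linear(4 * d_model, d_model)
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        batch, seq_len, d_model = x.shape
        k = max(1, int(self.capacity * seq_len))     # compute budget for this block
        scores = self.router(x).squeeze(-1)          # (batch, seq_len)
        top_idx = scores.topk(k, dim=-1).indices     # tokens that get full compute
        gather_idx = top_idx.unsqueeze(-1).expand(-1, -1, d_model)

        selected = x.gather(1, gather_idx)           # (batch, k, d_model)
        h = self.norm1(selected)
        attn_out, _ = self.attn(h, h, h)
        delta = attn_out + self.mlp(self.norm2(selected + attn_out))

        # Gate the update by the router score so routing stays differentiable,
        # then scatter processed tokens back; unselected tokens pass through unchanged.
        gate = torch.sigmoid(scores.gather(1, top_idx)).unsqueeze(-1)
        return x.scatter(1, gather_idx, selected + gate * delta)

x = torch.randn(2, 16, 64)
print(MoDBlock(d_model=64, n_heads=4)(x).shape)      # torch.Size([2, 16, 64])
```

The paper also addresses how to keep the top-k choice compatible with autoregressive sampling, a detail this simplified sketch omits.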

Ethics

Anthropic discovered “many-shot jailbreaking,” a new technique that circumvents the safety measures built into LLMs. Link.

  • Many-shot jailbreaking evades the safety guardrails in LLMs and works on virtually all available models.

  • The attack exploits recent growth in LLM context windows, the amount of information an LLM can process at once.

  • Anthropic published the research paper to shed light on the issue and ensure it receives proper attention.

Venture

SiMa.ai announced a $70 million funding round with participation from Dell Technologies Capital. Link.

  • The startup is developing a software-focused edge AI solution built around a purpose-built chip and accompanying ML software.

  • The investment is Dell Technologies Capital’s first in hardware, marking a shift in investment focus in the AI race.

Upcoming AI Conferences

  • San Jose, CA, April 16–17, 2024

Get up to speed on the current landscape

Image credit: Sonya Huang, Pat Grady, Sequoia Capital

Thank you for reading the AI Rundown by Lightscape Partners. Please send any questions, comments, or suggestions to [email protected].