Science Archives

Test-Time Compute: The Next Frontier in AI Scaling

12. November 202412. November 2024Martin Treiber 563 views 13 minutes

Major AI labs, including OpenAI, are shifting their focus away from building ever-larger language models (LLMs). Instead, they are exploring "test-time compute", where models receive extra processing time during execution to produce better results. This change stems from the limitations of traditional pre-training methods, which have reached a plateau in performance and are becoming too...

Ferret-UI 2: Towards Universal UI Understanding for LLMs

28. October 202428. October 2024Martin Treiber 596 views 5 minutes

Ferret-UI 2 is a multimodal large language model (MLLM) designed to interpret, navigate, and interact with UIs on iPhone, Android, iPad, Web, and AppleTV. It enhances UI comprehension, supports high-resolution perception, and tackles complex, user-centered tasks across these diverse platforms. Core Architecture: Multimodal Integration The foundational architecture of Ferret-UI 2 integrates a CLIP ViT-L/14 visual...

LLMs in Your Pocket: An Overview of CAMPHOR by Apple

21. October 202421. October 2024Martin Treiber 576 views 4 minutes

While Large Language Models (LLMs) have demonstrated remarkable capabilities in understanding and responding to complex queries, their reliance on server-side processing poses significant challenges for mobile assistants. These challenges primarily revolve around two key issues: Privacy: Mobile assistants frequently require access to sensitive personal information to provide accurate and relevant responses. Storing and processing this...

Unmasking the Mathematical Minds of LLMs: Are They Really Reasoning?

13. October 202413. October 2024Martin Treiber 498 views 4 minutes

Large language models (LLMs) have stormed onto the scene, dazzling us with their linguistic prowess and seeming intelligence. From crafting creative text formats to tackling complex coding challenges, they've left many wondering: are these machines truly thinking? The spotlight, in particular, has fallen on their mathematical reasoning abilities, with many claiming these models are on...

Can AI talk us out of conspiracy theory rabbit holes?

18. September 202418. September 2024Dana McKay 717 views 5 minutes

New research published in Science shows that for some people who believe in conspiracy theories, a fact-based conversation with an artificial intelligence (AI) chatbot can “pull them out of the rabbit hole”. Better yet, it seems to keep them out for at least two months. This research, carried out by Thomas Costello at the Massachusetts...

Top Prompting Strategies Unveiled by Anthropic Experts

10. September 202410. September 2024Martin Treiber 505 views 11 minutes

Prompt engineering is the bridge between human intention and AI output, and its impact on industries from healthcare to research is profound. As AI systems become more powerful, the ability to craft precise, effective prompts has emerged as a key skill in making these systems work for us. Anthropic’s prompt engineering experts have spent years...

AI was born at a US summer camp 68 years ago. Here’s why that event still matters today.

3. September 20243. September 2024Sandra Peter 1057 views 6 minutes

Imagine a group of young men gathered at a picturesque college campus in New England, in the United States, during the northern summer of 1956. It’s a small casual gathering. But the men are not here for campfires and nature hikes in the surrounding mountains and woods. Instead, these pioneers are about to embark on...

AI Simulates Classic DOOM

31. August 202431. August 2024Martin Treiber 818 views 4 minutes

Imagine a world where you could play DOOM—yes, the iconic 1993 first-person shooter—powered not by a traditional game engine but by a neural network. Thanks to a groundbreaking new AI system called GameNGen, developed by researchers at Google Research, Google DeepMind, and Tel Aviv University, this is no longer a futuristic dream but a reality....

What is In-Context Learning of LLMs?

19. August 202419. August 2024Martin Treiber 721 views 11 minutes

In-context learning (ICL) refers to a remarkable capability of large language models (LLMs) that allows these models to perform new tasks without any additional parameter fine-tuning. This learning approach leverages the pre-existing knowledge embedded within the model, which is activated through the use of task-specific prompts consisting of input-output pairs. Unlike traditional supervised learning that...

Do Emergent Abilities in AI Models Boil Down to In-Context Learning?

18. August 202419. August 2024Martin Treiber 552 views 13 minutes

Emergent abilities in large language models (LLMs) represent a fascinating area of artificial intelligence, where models display unexpected and novel behaviors as they increase in size and complexity. These abilities, such as performing arithmetic or understanding complex instructions, often emerge without explicit programming or training for specific tasks, sparking significant interest and debate in the...