OpenAI's latest model o1 is a truly impressive model. It surpasses it's predecessor GPT-4o in many ways, however, to effectively use o1, we first need to understand that it represents a significant departure from previous models like ChatGPT or Claude 3.5 Sonnet. While many of us have grown comfortable with conversational AI where we have...
Category: AI
The AI Takeover of Software Engineering: 2025’s Critical Threshold
The software engineering profession stands at a critical juncture as AI capabilities advance at an unprecedented pace. When industry leaders like Mark Zuckerberg predict AI systems capable of mid-level engineering tasks by 2025, and Salesforce's Marc Benioff implements hiring freezes due to AI-driven productivity gains, we need to take a hard look at the future...
Going Beyond RPA with LLMs
Robotic Process Automation (RPA) has long been the go-to solution for streamlining repetitive tasks. But when it comes to handling complex, (semi-)structured data, RPA falls short. Enter AI-powered intelligent automation—a transformative approach that redefines what’s possible in business operations. Unlike traditional RPA bots, AI agents bring contextual understanding to the table, making them more reliable...
What is the ARC Prize and why is it important?
The ARC Prize is a $1,000,000+ public competition aimed at advancing open-source progress towards Artificial General Intelligence. The ARC Prize is a competition designed to inspire new ideas and drive progress towards Artificial General Intelligence (AGI) by reaching a target benchmark score on the ARC-AGI (Abstraction and Reasoning Corpus for Artificial General Intelligence) benchmark. The...
The end of SaaS as we know it?
In a recent interview Satya Nadella from Microsoft suggested that SaaS (Software as a Service) applications are likely to be replaced by AI agents. This has been a constant pattern for some time now. At Microsoft’s Ignite 2024 conference, Nadella introduced a suite of AI-driven “autonomous agents” designed to perform tasks on behalf of users,...
Vertical AI Agents: The Next SaaS Boom?
Much like the SaaS explosion of the 2000s, which caught many off guard before spawning an ecosystem of trillion-dollar companies, vertical AI agents are poised to unlock immense value. Not only do they streamline repetitive workflows, but they also promise to replace entire teams, cutting operational costs and enabling companies to scale faster and more...
Tag Based Prompting for Better Prompting Performance
Large Language Models (LLMs) have amazed us with their ability to generate human-quality text, translate languages, and answer complex questions. But what happens when you need them to tackle something outside their general knowledge base – like predicting the properties of a protein or translating a highly structured technical document? That's where tag-based prompting comes...
When AI goes shopping: AI agents promise to lighten your purchasing load − if they can earn your trust
Online shopping often involves endless options and fleeting discounts. A single search for running shoes can yield hundreds of results across multiple platforms, each promising the “best deal.” The holiday season brings excitement, but it also brings a blend of decision fatigue and logistical nightmares.What if there were a tool capable of hunting for the...
In-Context Scheming in Frontier Language Models
Researches from Apollo Research have investigated the ability of large language models (LLMs) to engage in "scheming"—covertly pursuing misaligned goals. The research evaluated several leading LLMs across various scenarios designed to incentivise deceptive behaviour, finding that these models can strategically deceive, manipulate, and even attempt to subvert oversight mechanisms to achieve their objectives. The study...
ChatGPT’s artificial empathy is a language trick. Here’s how it works
Anthropomorphism occurs when we attribute human characteristics to non-human entities like animals or machines. Chatbots, such as ChatGPT, Gemini and Copilot, encourage this by imitating human language to communicate with us, going beyond using mere familiar words and phrases to adopt human patterns of communication. By doing this, they are able to hold contextualised, coherent...