Friday, December 19, 2025

OpenAI’s Latest Model Is Scarily Good at These Important Work Functions

If you thought 2025 had a lot of AI-related job displacement, just wait until next year. OpenAI’s latest AI model, GPT-5.2, achieved a new record in GDPval, an evaluation created by the company in order to track how well AI models perform on economically valuable, real-world tasks. An AI model being evaluated through GDPval is directed to complete 1,320 tasks traditionally done by humans across 44 occupations in eight sectors: real estate, government, manufacturing, professional services, healthcare, finance, trade, and information. A panel of human judges then decide if the model’s work matches or exceeds the output of a skilled human worker. With thinking mode enabled, GPT-5.2 matched or exceeded “top industry professionals” on about 71 percent of the tasks, a huge leap from GPT-5’s roughly 40 percent score. The new model took the top spot from Claude Opus 4.5, the current most advanced AI model from Anthropic, which scored about 60 percent, and Google’s Gemini 3 Pro, which scored about 54 percent. OpenAI says GPT-5.2 is “our first model that performs at or above a human expert level.” GPT-5.2 Pro, a larger and more expensive version of the model, fared even better with a 74.1 percent GDPVal score. OpenAI wrote that GPT‑5.2 completed the GDPval tasks 11 times faster than expert humans at just 1 percent of the cost, “suggesting that when paired with human oversight, GPT‑5.2 can help with professional work.” But the model hasn’t crushed all business-focused evaluations. It placed third on Vending-Bench 2, a benchmark that measures AI models’ ability to run a vending machine for a simulated year and scores them based on how much they can grow their cash balance from an initial $500. GPT-5.2 ended five simulated years with an average balance of $3,952, far below Claude Opus 4.5’s $4,967 average, and leader Gemini 3 Pro’s $5,478. Still, the model was a marked improvement over GPT-5.1, which sits in fifth place with an average balance of $1,473. BY BEN SHERRY @BENLUCASSHERRY

Wednesday, December 17, 2025

4 Things Marc Andreessen Says All Founders Should Be Doing With AI to Beat the Competition

You’d be hard pressed to find a bigger evangelist of artificial intelligence than Marc Andreessen. The venture capitalist’s a16z has invested tens of billions of dollars in AI companies and continues to look for opportunities. But his enthusiasm for the technology goes beyond financial interests. He’s also a proponent of business owners taking better advantage of AI’s offerings. On a recent episode of the a16z podcast, Andreessen discussed how founders, business owners, and anyone with entrepreneurial instincts should be using AI to gain an advantage over the competition. While hundreds of millions of people have access to AI in the palm of their hands, the majority aren’t using it as a tool (other than punching up their emails). AI mastery, he said, is a skill. And just as people who plunk out “Chopsticks” on a piano can’t play Chopin on the first try, casual AI users won’t have the ability to utilize the technology to its peak potential. To do that, you’ll need to study some and use AI regularly, learning how to ask good prompts. “There are a slice of people who just use these new systems all the time, like literally all day for everything,” he said. “In a lot of cases, they’re reporting that they’re getting enormous benefits from that.” 1. Ask what you should be asking One of the biggest hurdles in learning AI is the intimidation factor. New technology can be overwhelming, even for people in the tech space—doubly so when it’s regularly referred to as revolutionary and world-changing. What Andreessen suggests, though, is using AI to learn about AI. “You can ask it: ‘What question should I be asking?'” he said “[AI] is actually a thought partner helping you figure out what questions to ask … You can say, teach me how to use you in the best way … Teach me how to use you for my business.” AI systems, he noted, love to talk—and the more they tell you about themselves and how to find the information you want, the more adept you’ll become at operating them. 2. Think of AI as a business coach The best coaches in sports are patient with their players, helping them run plays again and again until those become second nature. Business coaches possess a similar trait, helping business owners (and sometimes teams) improve their performance and hone leadership skills. AI, Andreessen said, is like that, but on steroids. It is, quite literally, impossible to frustrate, so no matter what pace you need to proceed at and no matter how many follow-up questions you have, it will never get cross with you. “It’s like having the world’s best coach, mentor, therapist, right?” he said. “It’s infinitely patient. It’s happy to have the conversation. It’s happy to have the conversation 50 times. It’s happy if you admit your insecurities—it’ll coach you through them. It’s happy if you run wild speculations that don’t make any sense. It’s happy to do all that at 4:00 in the morning.” 3. Use AI to find problems with your thinking AI can not only make suggestions about directions for your business, but also look at how you’re running the company now and point out possible mistakes. Whether it’s staffing, marketing, customer feedback, expansion plans, strategy, or performance assessments, if you share information that is accurate and unbiased with the AI and ask for feedback, you’ll get candid advice and criticism that can help you correct mistakes you might not realize you were making. (Just be sure your data is protected.) 4. Use it to draw on lessons from other founders Not every founder is an expert when it comes to scaling. But if you’re looking to grow your business, it’s an essential skill. If the thought of an expansion roadmap is paralyzing, your AI companion has a wealth of knowledge to draw from. “Because it’s been trained on some large percentage of the total amount of human knowledge, it has all the information on how Ray Kroc turned McDonald’s from a single restaurant and how all these other entrepreneurs before actually did this,” he said. “So it can explain [to] you and help you figure out how to do this for your own business.” BY CHRIS MORRIS @MORRISATLARGE

Friday, December 12, 2025

Warren Buffett and Michael Burry Don’t See Eye to Eye on AI

One is the Oracle of Omaha. The other pulled off the Big Short. Warren Buffett and Michael Burry are two of the biggest names in American investing, but they can’t seem to agree on one of the biggest investment questions of the 21st century: Is artificial intelligence an overinflated bubble set to pop, or is all the investor hype actually warranted? Buffett last month revealed a major new stake in Google parent company Alphabet, making the tech giant one of Berkshire Hathaway’s top 10 largest holdings. The investment is being interpreted as a bet on AI, which Alphabet is invested heavily in; markets are now treating the company like the front-runner of the AI race. Burry—famous for making a bet against the American housing market that would prove very lucrative during the 2008 financial crisis—recently took two more short positions, this time on the automation and data company Palantir and chip-maker Nvidia, both darlings of the AI boom. Burry has been particularly critical of accounting policies used by Nvidia’s Big Tech customer base, which he says “have been systematically increasing the useful lives of chips and servers, for depreciation purposes, as they invest hundreds of billions of dollars in graphics chips with accelerating planned obsolescence.” Their diverging investment strategies come as chatter of an AI bubble has entered the mainstream—even OpenAI CEO Sam Altman is voicing concerns—while, nevertheless, investors continue to pump money into the sector. Both Buffett and Burry have quite a bit of credibility, making their contradictory tactics all the more notable. The former is responsible for making Berkshire Hathaway one of the most recognizable names in American investing, with what was once a Nebraska textile company now a massive conglomerate with tendrils across the U.S. economy. The latter inspired the Michael Lewis book The Big Short and the movie of the same name, in which he was portrayed by Christian Bale. Each is also going through a period of major transition. Buffett announced in May his plans to step down as CEO at the end of this year (though he will hold onto his stock). Vice chairman Greg Abel is set to replace him. Meanwhile, Burry’s hedge fund Scion Asset Management will close by the end of this year, with Burry writing in a recent investor letter that his “estimation of value in securities is not now, and has not been for some time, in sync with the markets.” He’s since launched a financial newsletter called Cassandra Unchained on which he’s expressed further skepticism of the AI boom. BY BRIAN CONTRERAS @_B_CONTRERAS_

Wednesday, December 10, 2025

You’ll Be Managing Digital Employees in 2026, a New Forecast Says

In March this year Salesforce’s CEO Marc Benioff, an AI hawk, landed himself in the spotlight when he predicted that this is the last time company leaders will manage only humans during a call with investors, he predicted that this is the last time company leaders will manage only humans. Given that his company had just launched a system that sold AI agents to Salesforce customers it was easy to brush off the prediction as an enthusiastic sales pitch. Now global market research outfit Forrester has predicted more or less the same thing, and says that next year is when things will begin to change. And the changes aren’t going to be small. In a new report first published on the Forbes website, Forrester researchers explain that AI agents are poised to “move beyond” helping workers boost their efficiency—the main selling point for AI at the moment—and instead will join the workforce. This means leadership will have to think about “orchestrating workflows independent of human workers.” They’ll have to think about “technology as part of the workforce” and that means changing planning as well as day-to-day business. HR teams in particular will play a big role, Forrester thinks. This is because sophisticated AI agents will be able to independently execute “complex tasks or end-to-end processes, acting as a virtual member of a team.” Strip away the cloak of anodyne corporate speak and this means Forrester’s predicting that AiIagents will be able to act almost at the level of a human. That means one way to manage them is to treat them as if they are almost people, with HR teams working to align agents alongside human workers on projects and tasks, tracking and optimizing a new type of “hybrid workforce.” One way to do this is to deploy human capital management (HCM) techniques, Forrester suggests. HCM is a system of rules and software that approaches employees as valuable assets, and the report notes that while mainly large enterprises use HCM now, due to sheer numbers of staff, smaller businesses may find the trick useful for a hybrid AI/human workforce. “Facing immediate pressures of productivity and resource optimization,” driven by the fact that you can employ numerous AI agents at once, and they can work 24-7, smaller outfits may actually “benefit from this technology sooner,” the report suggests. This job may sound daunting, particularly if you’ve never experimented with this tech or you’re feeling far from thinking of AI tools as equivalent to your human workers. But Forrester thinks that around three in 10 companies that already sell enterprise software will get in on the game, offering their own HCM solutions to help you manage AI tools. Meanwhile the research company also thinks that business software companies like Oracle, Microsoft and their ilk will offer “autonomous governance” software, which will help companies deploy AI on business tasks while also ensuring there are audit trails and real-time monitoring so you stay within any compliance limits you need to follow. And if you’re concerned this sounds all too automated for you, don’t worry — Forrester says that even though these trends are shifting fast, we’re “till a few years away from a system that can independently manage an entire business unit without human involvement and adaptability.” Your leadership and management skills are still needed! Though you may be tempted to dismiss this research as not relevant to your smaller company, with its family-like feel and reliance on person-to-person collaboration, that might be a mistake. The AI revolution really is rolling on, and if even some of Forrester’s predictions prove true, then inside a year you may be in a position where you can “hire” an AI agent system that can work alongside your staff and help them achieve goals as if it were another employee. That shift will take a lot of leadership, discussing issues with your (probably quite wary) human workers, deciding how to integrate the tech into your workflows and planning and so on. It goes far beyond downloading some software and pressing a button. BY KIT EATON @KITEATON

Monday, December 8, 2025

This Small Startup’s AI Video Model Just Put Sora 2 to Shame

The battle to win the burgeoning AI-generated-video market is heating up, thanks to a new model from a small but mighty player. Runway, a startup that develops AI models for video generation, has released its new flagship model, named Gen-4.5. The company said in a blog post this new model is a major step up for AI-generated video, especially when it comes to realistic physics and exact instruction following. The model claimed the top spot on independent benchmarking organization Artificial Analysis’s text-to-video leaderboard. Founded in 2018 by students of New York University’s Tisch School for the Arts, Runway has been laser-focused on AI video and has been steadily growing since releasing its first model in 2023. According to The Information, this strategy has paid off; the company hit $80 million in annualized recurring revenue in December 2024, and hopes to hit $300 million in ARR by the end of 2025. But Runway is going up against some of the biggest tech companies in the world, most notably Google and OpenAI, which have developed and commercialized their own AI video models. Runway’s plan to beat these mega-funded foes seems pretty simple: make better models. Runway wrote that Gen-4.5 represents “a new frontier for video generation.” Objects in Gen-4.5 videos “move with realistic weight, momentum, and force,” the company says, with better water and surface rendering. The company also says that details like hair will remain more consistent, and that the model will be able to generate more varied art styles. Altogether, Runway says, these upgrades enable the platform’s users to be much more exacting and detailed about their video generations. The new model is already being used commercially by enterprises, Runway says. Video game distributor Ubisoft, ad agency Wieden + Kennedy, Allstate Insurance, and Target were given early access to the tool. The model is available to paid subscribers and through Runway’s API. Gen-4.5 was both built on Nvidia GPUs and uses that company’s hardware to run, according to Runway. The company wrote that it “collaborated extensively” with Nvidia on the model’s creation. Runway creative principal Nicolas Neubert celebrated the model’s release on X, posting that “Gen-4.5 was built by a team that fits onto two school buses and decided to take on the largest companies in the world. We are David and we’ve brought one hell of a slingshot.” BY BEN SHERRY @BENLUCASSHERRY

Friday, December 5, 2025

Black Friday Broke Records. The Real Story Is How AI Changed the Way We Shop

If you only looked at the numbers, you’d think Black Friday was business as usual—just bigger. And, to be clear, it was definitely bigger. Adobe, which tracks more than a trillion retail site visits across 18 categories, says consumers spent a record $11.8 billion online yesterday, up 9.1 percent from last year and even above the company’s own forecast. Between 10 a.m. and 2 p.m., Adobe says shoppers spent $12.5 million every minute. By any metric, that’s a massive number of people shopping for deals. It’s a record for Black Friday sales online, but if you look a little closer, you realize it’s also a massive number of people shopping in very different ways than they used to. Black Friday has already changed quite a bit in the past few years. What was once a single day defined by incredible deals and lines outside big-box stores has stretched into a weeks-long digital shopping season. And, let’s be honest, people aren’t camping outside a Target anymore; they’re sitting on their couch, scrolling their phones. The AI holiday The most interesting part of the story is how things have shifted even more this year. Adobe’s data shows that AI-generated traffic to retail sites jumped 805 percent year-over-year. Not only are people using AI tools to find deals and compare products, but also shoppers who landed on a site from an AI assistant were 38 percent more likely to convert than everyone else. That’s surprising, and yet it makes perfect sense. One of the things AI chatbots like ChatGPT, Claude, and Gemini are good at is instantly surfacing the best price across half a dozen retailers. This year, there were plenty of headline features: Electronics, toys, apparel, TVs, and appliances were discounted between 24 and 30 percent. AI tools just made it easier to find them. And those deals didn’t just convince people to buy more. Adobe says that people spent more on higher-end items. The share of units sold from the most expensive tier of products spiked: 64 percent in electronics, 55 percent in sporting goods, 48 percent in appliances. With the right combination of discounts and AI-assisted shopping comparison, people weren’t just looking for deals—they were looking for the best value. Mobile continued to dominate Depending on the hour, around 55 percent of online Black Friday sales happened on a phone—$6.5 billion worth. That’s up 10 percent from last year and represents billions of dollars processed through screens smaller than a wallet. Mobile phones reward frictionless experiences. And it turns out, AI is very good at removing friction. When the easiest way to shop is to ask ChatGPT for a recommendation and the best deal, it changes the way retailers have to think about Black Friday. Not only that, but the timeline seems to have shifted. Adobe says one of the biggest spikes happened from 10 a.m. to 2 p.m. Shopping habits shifted toward the times when people are already using their phones. You don’t need to wait for a sale to “start” when an AI assistant can surface the best price the moment it exists. AI shopping is here to stay Adobe expects U.S. consumers to spend more than $250 billion online this holiday season, with Cyber Monday alone projected to hit $14.2 billion. But the part worth paying attention to isn’t the total—it’s how we got there. Shoppers are trusting AI to do the busywork and find them the best value. For a shopping event that used to be all about physical stores, that’s a significant shift that retailers have to pay attention to. The challenge is that they no longer control the narrative—the AI assistant does. The lesson here may not seem obvious, but the reality is that retailers need to redefine what loyalty means when more shoppers start their journey with an AI prompt instead of walking into a store or pulling up your website. When an assistant compares every retailer at once, being “top of mind” matters far less than being the lowest-friction, highest-confidence option in that moment. That means loyalty isn’t something you win with flashy ads or homepage banners—it’s something you earn through the operational details an AI actually cares about. Black Friday broke spending records. But the more interesting record is the one you might overlook: how many of those purchases started with someone typing a question into an AI instead of typing a URL into a browser. EXPERT OPINION BY JASON ATEN, TECH COLUMNIST @JASONATEN

Wednesday, December 3, 2025

Gen-Zers Are Using AI to Boost Their Side Hustles and Grow Them Into Full-Time Businesses

As more Gen-Zers embrace side hustles, they’re increasingly leaning on artificial intelligence to help them get ahead. A new survey by Canva finds that 80 percent of the people who have side hustles have used AI to fuel the growth of those enterprises, with 74 percent calling it their secret weapon. The tools, including ChatGPT and Canva’s own online graphic design offerings, are being used for everything from video creation and logo/brand design to data analysis and copywriting. And some of those side hustles are becoming full-time businesses. Side hustles, on the whole, have never been hotter. Data from the U.S. Bureau of Labor Statistics shows that 8.9 million Americans are currently working multiple jobs. That’s 5.4 percent of the country’s workforce. And a SurveyMonkey study published earlier this month found “72 percent of workers either already have or are considering a side gig—37 percent already have a side hustle, and 35 percent are considering pursuing one.” Some 22 percent of the people surveyed by Canva said they were inspired to start their own company after launching a side gig and 17 percent said the work led to a consulting or freelancing job. Additionally, 33 percent said they had gained new clients or customers, while 29 percent said their side gig had helped build their professional brand. Gen-Z was the generation most likely to start passive income side hustles: Of those with side hustles, 48 percent of Gen-Zers are currently earning passive income. All totaled, two-thirds of the 300 “5-9 influencers,” as Canva calls them, said they would consider quitting their full-time jobs if they believed their side projects could sustain them. They wouldn’t be the first. Some very familiar tech companies got their start as side hustles or side projects, including Groupon, Twitter, Craigslist, and Instagram (which began as Burbn, a location-based app for whiskey lovers). And thousands of other, smaller businesses began as a part-time side gig for the founder, eventually growing to multimillion-dollar businesses. Today’s side hustle community is made up of a mix of generations. Canva’s survey found that just under half of Gen-Zers, Millennials, Gen-Xers, and Baby Boomers were making money from side gigs today, with the actual percentages ranging from 40 to 48. Increasingly, the side hustles they’re choosing are digitally focused. The most popular jobs were social media creator (35 percent), e-commerce (27 percent), gaming and streaming (24 percent), and graphic design (14 percent). Extra income is the biggest motivator for people who have side gigs, Canva found, but it wasn’t the only one. Some 36 percent of the respondents said they were running their side hustle because they enjoyed the creative expression it gave them. And just under one-third said they wanted to turn a passion into a business. Even people with side hustles who aren’t looking to launch a business of their own are seeing advantages from the work. The skills they’ve learned as part of that work, including the AI expertise they’re building, are helping people advance. Some 14 percent of the people surveyed said their side hustle had helped them get a promotion at their day job. BY CHRIS MORRIS @MORRISATLARGE

Monday, December 1, 2025

The hottest new AI company is…Google?

Google just threw another twist in the fast-changing AI race. And its biggest competitors are taking notice. “We’re delighted by Google’s success — they’ve made great advances in AI and we continue to supply to Google,” Nvidia wrote in a November 25 post on X, before adding that “NVIDIA offers greater performance, versatility, and fungibility than ASICs,” (the application-specific integrated circuits) like those made by Google. “Congrats to Google on Gemini 3! Looks like a great model,” OpenAI CEO Sam Altman also wrote on X. The posts came just days after mounting buzz about Google’s Gemini 3 model — and the Google-made chips that help to power it. Salesforce CEO Marc Benioff wrote on X that he’s not going back to ChatGPT after trying Google’s new model. “The leap is insane — reasoning, speed, images, video… everything is sharper and faster. It feels like the world just changed, again,” he wrote. Now Meta is said to be in talks with Google about buying its Tensor chips, according to The Information, coming after Anthropic said in October that it plans to significantly expand its own use of Google’s technology. Shares of Google were up nearly 8% last week, while Nvidia’s were down a little over 2%. At stake is more than just bragging rights or a few sales contracts. As the tech industry claims AI will reshape the world — including investment portfolios belonging to everyone from billionaires to 401k-holding retirees — what company and what vision comes out on top could affect nearly every American. At face value, Nvidia’s post says the company isn’t worried about Google encroaching on its territory. And for good reason — Google’s chips are fundamentally different from Nvidia’s offerings, meaning they aren’t a match-for-match alternative. But that OpenAI and Nvidia felt the need to acknowledge Google at all is telling. “They’re in the lead for now, let’s call it, until somebody else comes up with the next model,” Angelo Zino, senior vice president and technology lead at CFRA, told CNN. Google and Meta did not immediately respond to a request for comment. Nvidia declined to comment. The leader for now Google is hardly an AI underdog. Along with ChatGPT, Gemini is one of the world’s most popular AI chatbots, and Google is one of the few cloud providers large enough to be known as a “hyperscaler,” a term for the handful of tech giants that rent out cloud-based computing resources to other companies on a large scale. Google services like Search and Translate have used AI as far back as the early 2000s. Even so, Google was largely caught flat-footed by OpenAI’s ChatGPT when it arrived in 2022. Google management reportedly issued a “code red” in December 2022 following ChatGPT’s seemingly overnight success, according to The New York Times. ChatGPT now has at least 800 million weekly active users, according to its maker, OpenAI, while Google’s Gemini app has 650 million monthly active users. But Gemini 3, which debuted on November 18, now sits at the top of benchmark leaderboards for tasks like text generation, image editing, image processing and turning text into images, putting it ahead of rivals like ChatGPT, xAI’s Grok and Anthropic’s Claude in those categories. Google said over one million users tried Gemini 3 in its first 24 hours through both the company’s AI coding program and the tools that allow digital services to connect to other apps. But people tend to use different AI models for different purposes, says Ben Barringer, the global head of technology research at investment firm Quilter Cheviot. For example, models from xAI and Perplexity are ranked higher than Gemini 3 search performance in benchmark tests. “It doesn’t necessarily mean (Google parent) Alphabet is going to be … the end-all when it comes to AI,” said Zino. “They’re just kind of another piece to this AI ecosystem that continues to get bigger.” More chip competition Google began making its Tensor chips long before the recent AI boom. But Nvidia still dominates in AI chips with the company reporting 62% year-over-year sales growth in the October quarter and profits up 65% compared to a year ago. That’s largely because Nvidia’s chips are powerful and can be used more broadly. Nvidia and its chief rival, AMD, specialize in chips known as graphics processing units, or GPUs, which can perform vast amounts of complex calculations quickly. Google’s Tensor chips are ASICs, or chips that are custom-made for specific purposes. While GPUs and Google’s chips can both be used for training and running AI models, ASICs are usually designed for “narrower workloads” than GPUs are designed for, Jacob Feldgoise, senior data research analyst at Georgetown’s Center for Security and Emerging Technology, told CNN in an email. Beyond the differences in the types of chips themselves, Nvidia provides full technology packages to be used in data centers that include not just GPUs, but other critical components like networking chips. It also offers a software platform that allows developers to tailor their code so that their apps can make better use of Nvidia’s chips, a key selling point for hooking in long-term customers. Even Google is an Nvidia client. “If you look at the magnitude of Nvidia’s offerings, nobody really can touch them,” said Ted Mortonson, technology desk sector strategist at Baird. Chips like Google’s won’t replace Nvidia anytime soon. But increased adoption of ASICs, combined with more competition from AMD, could suggest companies are looking to reduce their reliance on Nvidia. And Google won’t be the only AI chip competitor, said Barringer of Quilter Cheviot, and it’s doubtful it will achieve Nvidia’s dominance. “I think it’s a part of a balance,” he said. Analysis by Lisa Eadicicco