Image: made using DALL·E (custom-edited by me)
RSN012
Published 02.06.2025.09.00.TUE
On-Device AI Everywhere: From Mobile Games to Pocket Supercomputers
Hey everyone, Ritik here! Welcome back to RSN, where we unpack the tech shifts reshaping our daily lives. This week is all about AI moving closer to the edge—your phone, your spreadsheets, even your handheld game sessions. Apple is eyeing a bigger slice of mobile gaming, Google wants local models on every device, and a distilled Chinese model squeezes frontier reasoning onto a single GPU. Let’s dive in!
1. Apple’s iOS 19 Gaming Push and First Studio Acquisition
“Cupertino doubles down on play before WWDC.”
What’s Happening: Apple is going beyond Apple Arcade to carve out a serious mobile-gaming lane. Ahead of WWDC, it quietly bought two-person studio RAC7, creators of Arcade hit Sneaky Sasquatch, and is cooking up a dedicated Gaming app for iOS 19 that will replace Game Center. The new launcher will centralize leaderboards, recommendations, multiplayer lobbies, and in-game social features, integrating with iMessage and FaceTime for quick “Play Now” sessions. On Mac, the same app will index titles installed outside the App Store, signaling looser gatekeeping. The timing matters: Nintendo’s Switch 2 lands June 5, cloud gaming keeps rising, and two-thirds of App Store revenue already comes from games. Apple wants to keep high-end titles—Resident Evil, Assassin’s Creed, Death Stranding—flowing to A-series and M-series chips while unifying its gaming identity across iPhone, iPad, Mac, and Apple TV.
Why It Matters to You: If you game on iPhone or Mac, expect smoother matchmaking, shared achievements, and richer cross-device play without hunting through menus—or paying console prices.
Why It Matters to the World: Apple’s deeper gaming focus could redraw mobile-gaming economics, challenging console makers and cloud services while raising App Store antitrust questions yet again.
2. Perplexity Labs Turns Prompts into Dashboards and Apps
“Search startup graduates to full-service AI project builder.”
What’s Happening: Perplexity, the AI-powered search challenger, just launched Perplexity Labs for its $20/month Pro subscribers. Labs spends up to 10 minutes orchestrating web search, code execution, charting, and mini-app creation to spit out formatted reports, spreadsheets, and interactive dashboards. Files—CSV, PNG charts, HTML apps—are auto-organized for download. It’s available on web, iOS, and Android, with Mac and Windows apps coming soon. Think ChatGPT’s Advanced Data Analysis meets Google Sheets plus a lightweight Streamlit. The launch dovetails with Perplexity’s broader expansion—previewing its own browser (Comet), acquiring professional network Read.cv, and reportedly courting investors at an $18 billion valuation. As enterprise appetites grow, Labs positions Perplexity less as a search toy and more as a workflow engine that can replace junior analysts—and maybe Canva or Airtable—in a single click.
Why It Matters to You: Need a market-size chart or budget spreadsheet? Type a prompt and get a polished doc without juggling Excel, Google Search, and PowerPoint.
Why It Matters to the World: Tools like Labs compress white-collar tasks, raising productivity—and ethical debates—about AI replacing entry-level knowledge jobs sooner than expected.
3. DeepSeek Shrinks Frontier Reasoning to a Single GPU
“R1 distilled: 685B-parameter brains on a developer’s budget.”
What’s Happening: Chinese lab DeepSeek followed its headline-grabbing R1 model with DeepSeek-R1-0528-Qwen3-8B, a distilled 8-billion-parameter version fine-tuned on outputs from the 685B-parameter R1. Despite its small size, the model beats Google Gemini 2.5 Flash on AIME 2025 math problems and nearly matches Microsoft Phi 4 on HMMT reasoning—yet runs on a single 40–80 GB GPU (e.g., an Nvidia H100) instead of a dozen. Licensed under MIT and already available through platforms such as NodeShift and LM Studio, the miniature R1 aims at researchers and startups that can’t afford vast GPU clusters. DeepSeek continues to iterate despite past controversies over training data and U.S. export controls, demonstrating how rapid distillation keeps pushing advanced reasoning into ever-cheaper hardware footprints.
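If you want to kick the tires yourself, here is a minimal sketch of running the distilled model locally with Hugging Face transformers. It assumes the checkpoint is published on the Hub as deepseek-ai/DeepSeek-R1-0528-Qwen3-8B and that your GPU has room for roughly 16 GB of bf16 weights plus generation overhead; the repo name, memory math, and sampling settings are illustrative, not official guidance.

```python
# Minimal local-inference sketch (assumed Hub repo name and settings, not official docs).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "deepseek-ai/DeepSeek-R1-0528-Qwen3-8B"  # assumption: public Hugging Face checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # ~16 GB of weights for 8B parameters, well within one H100
    device_map="auto",           # let accelerate place the model on the available GPU
)

# Reasoning checkpoints expect chat-formatted prompts; the tokenizer's template handles that.
messages = [{"role": "user", "content": "Prove that the sum of two odd integers is even."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=512, do_sample=True, temperature=0.6)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

If even that footprint is too much, 4-bit quantization via bitsandbytes or a GGUF build loaded in LM Studio should bring the model down to consumer-GPU territory.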
Why It Matters to You: If you’re building an AI feature on a tight budget, you can now run near-frontier reasoning locally or in a modest cloud instance—no seven-figure GPU bill required.
Why It Matters to the World: Democratized access to strong reasoning models accelerates innovation but also complicates governance, copyright, and safety oversight across borders.
4. Google AI Edge Gallery Lets Phones Run Hugging Face Models Offline
“Pocket-sized generative power—no data center needed.”
What’s Happening: Google quietly dropped an AI Edge Gallery alpha on GitHub, letting Android users (iOS coming) browse, download, and run compatible Hugging Face models entirely on-device. Tasks span AI chat, image generation, and code writing. The app features a “Prompt Lab” with one-shot templates and parameter sliders, all under Apache 2.0 so devs can fork it freely. Performance depends on handset silicon: flagship Tensor, Snapdragon, or Apple A-series chips can crank larger Gemma-based models; mid-tiers may favor 3B-parameter LLMs. By avoiding cloud calls, Edge Gallery offers privacy, zero-latency demos, and resilience in low-connectivity zones—plus a subtle nudge for OEMs to ship beefier NPUs.
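The Gallery app itself runs models on-device on Android, but the “download once, run offline” idea is easy to picture with a rough desktop analogue in Python. The Gemma checkpoint name and sampling values below are illustrative assumptions (Gemma repos are gated, so you would need to accept the license on Hugging Face first); the point is simply that after one download, inference needs no network at all.

```python
# Rough desktop analogue of Edge Gallery's flow: fetch a small Hub model once,
# then generate with no further network access. Names and knobs are illustrative.
from huggingface_hub import snapshot_download
from transformers import pipeline

MODEL_ID = "google/gemma-2-2b-it"         # assumed small instruction-tuned checkpoint; swap to fit your hardware
local_path = snapshot_download(MODEL_ID)  # one-time download while you still have connectivity

# Everything below loads from the local snapshot: no cloud calls, nothing leaves the machine.
generator = pipeline("text-generation", model=local_path, device_map="auto")

result = generator(
    "Summarize in two sentences why on-device inference helps privacy.",
    max_new_tokens=80,
    do_sample=True,
    temperature=0.7,  # the kind of parameter Prompt Lab exposes as a slider
    top_p=0.9,
)
print(result[0]["generated_text"])
```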
Why It Matters to You: Generate images or run private chats on the subway with no signal—and keep sensitive docs off remote servers.
Why It Matters to the World: Local-first AI shifts power from cloud giants to end users, sparking a new race for efficient on-device models and raising the bar for mobile chip design.
5. Android 16’s Material 3 Expressive Makeover and Gemini Everywhere
“Google’s OS gets splashy—and smarter—across phones, cars, and watches.”
What’s Happening: Google’s Android 16 QPR1 beta introduces Material 3 Expressive, the most researched UI revamp since 2020. Expect brighter color palettes, bouncier animations, and a redesigned quick-settings panel that looks, well, a bit like iOS 17. Dynamic color themes extend to Wear OS 6 for Pixel Watch 3, while Live-Activity-style persistent notifications pin food deliveries or Lyft ETAs on the lock screen. Under the hood, Google Assistant bows out for Gemini, now the default voice agent in Wear OS, Android Auto, Google TV, and soon XR. Safety upgrades block sideloading during scam calls, hide OTPs in risky contexts, and rename Find My Device to Find Hub with satellite-enabled tracking. Developers with Pixel 6 or newer can opt into the beta now; the stable Android 16 launch lands later this summer before QPR1 follows in the fall.
Why It Matters to You: Your Pixel will feel fresher, safer, and more personal—while Gemini promises richer, hands-free help whether you’re driving, cooking, or jogging.
Why It Matters to the World: Google’s design overhaul and Gemini expansion aim to stem the drift of younger users toward the iPhone and cement Android’s role as the most customizable, AI-forward mobile platform.
My Creative Conclusion
From Apple gaming hubs and Perplexity’s auto-dashboards to DeepSeek’s single-GPU reasoning and Google’s on-device model gallery, AI is marching out of monolithic data centers and into everyday screens—even reshaping OS aesthetics along the way. The common thread? Empowerment. Whether you’re whipping up a spreadsheet, chatting offline, or questing in Sneaky Sasquatch, intelligence is becoming an integral—yet invisible—layer of our digital lives. Our task is to harness that power responsibly: demanding transparency, safeguarding privacy, and ensuring that innovation lifts everyone, not just those with the biggest clouds. Stick with RSN as we track each leap forward and hold the tech to its promise.
— Ritik Sharma
Founder, RSN (Ritik Sharma Newsletter)
