Welcome to AI&Business (Beta)

An open manual for the collaboration of AI and business.


Generated with Midjourney by aiandbusiness.com

By mid-2025, a wave of agentic AI browsers has arrived. Startups led the way: The Browser Company’s Dia lets you chat with your tabs through AI built into the browser’s foundation, and Fellou goes further by executing multi-step, cross-site workflows on your behalf. Now the tech giants are racing to follow. Perplexity Comet (Chromium-based, for Max subscribers) auto-summarizes content, autofills forms, and orchestrates research or errands across pages. Microsoft Edge’s experimental Copilot Mode sees across all open tabs, accepts voice or chat commands, and acts as a proactive in-browser assistant. Google Chrome is embedding Gemini to deliver contextual tab-level AI help, with a fuller “Agent Mode” due soon.


Industry News & Updates:

Google Gemini 2.5 Flash Image (Nano-Banana)

Google DeepMind introduces Gemini 2.5 Flash Image (Nano-Banana), a state-of-the-art image generation and editing model that blends multiple images, maintains character consistency, allows natural-language transformations, and leverages world knowledge for precision editing.

GPT‑5

OpenAI has officially released GPT-5, which it describes as its smartest, fastest, and most capable language model yet. It features integrated reasoning via an automatic router, “vibe-coding” for generating software from plain English, multimodal processing, higher accuracy with fewer hallucinations, and personalized experiences across multiple variants (Standard, Mini, Nano, Pro). The release marks a major step toward agentic AI and AGI.

ElevenLabs Music

ElevenLabs has launched Eleven Music, a new AI music-generation platform that lets users, from creators to businesses, instantly generate studio-grade tracks with vocals or instrumentals in multiple languages from natural-language prompts. It is backed by licensing deals with Merlin Network and Kobalt to ensure broad commercial use.

ChatGPT Study Mode

OpenAI has launched ChatGPT Study Mode, a learning-oriented mode that guides users through step-by-step reasoning using Socratic-style questions, scaffolded lessons, personalized feedback, and interactive quizzes. It is designed to foster critical thinking and deeper comprehension rather than simply providing answers, and is available to all logged-in users on Free, Plus, Pro, and Team plans (with Edu availability soon).

ChatGPT Agent

OpenAI has launched ChatGPT Agent, a powerful AI assistant that goes beyond chat by autonomously managing complex, multi-step tasks. It comes with its own “virtual computer” that can browse websites, run code, book reservations, shop online, and edit spreadsheets and slide decks, while keeping users in control through permission prompts and safety features like Watch Mode.

America’s AI Action Plan

On July 23, 2025, the Trump Administration released America’s AI Action Plan, a sweeping policy roadmap of over 90 federal actions across three pillars (Accelerating Innovation, Building American AI Infrastructure, and Leading International AI Diplomacy & Security) to establish U.S. global dominance in AI through deregulation, export incentives, energy expansion, and workforce development.

Grok 4

Midjourney Video

Midjourney has unveiled Video V1, its first image-to-video generation model, now available via Web and Discord. It lets users animate static or uploaded images into short 5- to 21-second clips, with adjustable motion levels and both automatic and custom prompts, at approximately eight times the GPU cost of image creation. It is offered under existing subscription tiers (from $10/month) and marks a big step toward real-time interactive worlds, despite ongoing copyright lawsuits from Disney and Universal.

Google A2A

https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/

Google introduced the Agent2Agent (A2A) protocol, an open standard enabling seamless communication and collaboration between AI agents from different frameworks or vendors, aimed at solving interoperability challenges in enterprise AI systems.

https://github.com/google/A2A

Source: Google Cloud

Runway Gen-4

Runway Gen-4 is a cutting-edge AI model designed for generating consistent and controllable media, enabling creators to produce coherent characters, locations, and objects across scenes while maintaining stylistic and cinematic continuity.

https://runwayml.com/research/introducing-runway-gen-4

Top 100 GenAI Apps (4th edition)

Official link: https://a16z.com/100-gen-ai-apps-4/
