Home

Shubham’s AI Playbook

A personal reference for navigating the modern AI ecosystem — tools, workflows, models, and principles.

Explore the Playbook

Tools — Conversational AI, coding assistants, automation platforms and more.
Productivity — Microsoft Copilot, Notion AI, Grammarly, Otter.ai and more.
Hardware — Chips, cloud ML, and today’s most capable frontier models.
Creative & Hubs — Image, video, voice generation tools and model marketplaces.
Workflows — Five AI workflows plus industry verticals — healthcare, finance, legal, and more.
Glossary — 30+ key terms explained in plain English.
Principles — Six guiding rules plus AI safety, ethics, and practical guardrails.
Confusions — 20+ misconceptions that trip up early adopters — debunked.
History — Key milestones from 1950s to the present.
Open Source — Open-weight models you can run, fine-tune, or build on.
Follow — People, newsletters, YouTube channels and podcasts worth your time.

How the Major AI Companies Are Structured

Each major lab has a full vertical stack — a research org, a model family, products built on those models, and distinct tiers you choose between. Understanding this helps you know exactly what you’re paying for and why the same company can have five different products.

Anthropic

Founded 2021 · Safety-focused · San Francisco

Model Family: Claude

Products:

claude.ai — Chat interface (web & mobile)
Anthropic API — Direct model access for developers
Claude Code — Agentic CLI for coding tasks in terminal
Claude for Teams / Enterprise — Business plans with data privacy
Powering other apps — Cursor, Windsurf use Claude under the hood

Model Tiers (cheapest → most capable):

Haiku — Fast & cheap
Sonnet — Balanced (default)
Opus — Most capable

All tiers share the same 200K token context window.

OpenAI

Founded 2015 · Microsoft-backed · San Francisco

Model Families:

GPT series — General purpose
o-series — Reasoning models

Products:

ChatGPT — Chat interface (web, mobile, desktop)
OpenAI API — Direct model access for developers
Codex — Code-specialised model powering GitHub Copilot
DALL-E — Image generation (inside ChatGPT Plus)
Sora — Video generation (ChatGPT Pro)
ChatGPT Enterprise — Business plan, data privacy

Model Tiers (cheapest → most capable):

GPT-4o mini — Fast & cheap
GPT-4o — Balanced (default)
o1 / o3 / o4 — Deep reasoning

Note: o-series “thinks” before answering — slower but significantly smarter on complex problems.

Google DeepMind

Founded 1998 (Google) · Alphabet subsidiary · Mountain View

Model Family: Gemini + Imagen

Products:

Gemini.google.com — Chat interface (web & mobile)
Google AI Studio — Free API access for developers
Vertex AI — Enterprise API with Google Cloud integration
Gemini in Workspace — AI inside Docs, Gmail, Sheets, Slides
Imagen / ImageFX — Photorealistic image generation
Gemini on Android — On-device assistant replacing Google Assistant

Model Tiers (cheapest → most capable):

Flash — Fast & free
Pro — Balanced (1M context)
Ultra — Most capable

Gemini 1.5 Pro has a 1M-token context window — the largest of any major model.

DeepSeek (High-Flyer)

Founded 2023 · Hangzhou, China · Open weights

Model Family: DeepSeek R1 & V3

Products:

chat.deepseek.com — Chat interface
DeepSeek API — Very affordable pricing
Open weights — Run via Ollama or Hugging Face

Model Tiers (cheapest → most capable):

V3 — General purpose
R1 — Reasoning (matches o1)

MIT license — free for commercial use. Trained at a fraction of Western model costs.

How Other Tools Fit Into This Picture

Not every AI tool builds its own model. Most sit on top of the big three, or rely on the open-source community. Here’s where everything else fits:

Open Weights — No product, just models

Meta (Llama), Mistral, Google (Gemma), Microsoft (Phi-4), Yi (01.AI)

These companies release model weights publicly but don’t operate a major consumer product. They build the engine and let the community use it.

How you access them:

Ollama (run locally)
Hugging Face
Replicate
Together AI
LM Studio

Sits on top of the big three

Perplexity, Microsoft Copilot, GitHub Copilot, Cursor, Windsurf

These products don’t build their own foundation models. They call the OpenAI, Anthropic or Google APIs and wrap them in a specialised experience.

Perplexity → uses Claude, GPT-4o + its own search stack
Microsoft Copilot → powered by OpenAI GPT-4o
GitHub Copilot → powered by OpenAI Codex / GPT-4o
Cursor → uses Claude Sonnet or GPT-4o (you choose)
Windsurf → uses Claude or its own Codeium models

Infrastructure & Routing

LangChain, LlamaIndex, OpenRouter, CrewAI

These tools don’t provide AI themselves — they help you build systems that connect to and orchestrate multiple models and data sources.

OpenRouter → single API to route between 200+ models
LangChain → framework to chain LLM calls and tools
LlamaIndex → connects your data to any LLM
CrewAI → orchestrates multiple AI agents working together

Quick Comparison: Top Conversational AI

Feature	ChatGPT (GPT-4o)	Claude (Sonnet 4)	Gemini (1.5 Pro)	Perplexity	Ollama (Local)
Context window	128K tokens	200K tokens	1M tokens	Varies by model	Varies by model
Image input	✅	✅	✅	✅	✅ (llava, llama3.2-vision)
Web browsing	✅ (plugin)	✅ (claude.ai)	✅	✅ Core feature — always on	❌
Code execution	✅ (Code Interpreter)	✅ (Artifacts)	✅	❌	❌ (model only)
Free daily limit	~10–15 GPT-4o msgs, then falls back to GPT-4o mini (unlimited, less capable)	~20–40 msgs/day on Sonnet; throttled or blocked after	Generous — Flash mostly free; Pro is rate-limited	~5 AI Pro searches/day; basic search unlimited	Unlimited — runs on your hardware
Paid plan	Plus $20/mo — higher GPT-4o limits	Pro $20/mo — ~5× more usage	Google One AI $20/mo — full 1.5 Pro access	Pro $20/mo — unlimited Pro searches	Free forever
Privacy	Data sent to OpenAI	Data sent to Anthropic	Data sent to Google	Data sent to Perplexity	✅ 100% local
API access	✅ OpenAI API	✅ Anthropic API	✅ Google AI Studio	✅ Perplexity API	✅ Local REST API
Best for	General tasks, plugins, broad use	Writing, long docs, safety-critical tasks	Research, very long context	Real-time research with cited sources	Privacy, offline, cost-free automation

Tip: If you hit free limits often, the API is often cheaper for heavy use — you pay per token with no daily cap. For regular personal use, a $20/mo Pro plan on your most-used tool pays for itself quickly.

Complete Beginner? Here’s Your Path.

If you’re new to AI, don’t try to absorb everything at once. Follow this sequence over 4–6 weeks. Each step builds on the last.

1. Days 1–7 — Pick one AI assistant and use it every day

Don’t compare them yet. Use it for real tasks — drafting emails, summarising articles, explaining things you don’t understand. Volume over perfection.

ChatGPT — most popular, widest ecosystem
Gemini — best if you use Google Docs / Gmail
Claude — excellent for writing & long documents

All three are free. Pick one and stick with it for at least a week.

2. Week 2 — Learn what you’re actually working with

Before adding more tools, spend 30 minutes understanding the basics. It changes how you use everything.

Read the Glossary — token, prompt, context window, hallucination
Read Common Confusions — debunks myths everyone believes early on
Watch 3Blue1Brown’s “But what is a GPT?” on YouTube — 20 min, best visual explanation available

3. Weeks 3–4 — Add one specialised tool for your biggest pain point

Don’t sign up for everything. Pick the one that removes real friction from your day:

Too many meetings → Otter.ai or Fathom
Research-heavy work → NotebookLM
Lots of writing → Grammarly
Need web answers with sources → Perplexity

4. Month 2 — Stay current without getting overwhelmed

AI moves fast. You don’t need to read everything — just stay loosely informed.

One newsletter: The Rundown AI (daily, 5 min) or The Batch by Andrew Ng (weekly, deeper)
Follow Ethan Mollick on X — best practical takes on using AI at work

5. Month 2+ — Pick your lane

Go deeper based on what you actually care about. Once you have the basics, use this playbook as your reference. Pick the path that fits your interest:

Creative work — Midjourney → Runway → Suno → Creative page
Building products — Cursor → Lovable → n8n → Tools page
Business & automation — Microsoft Copilot → Zapier → Workflows page
How it works — fast.ai → Karpathy’s YouTube → Hardware & Models
Privacy / self-hosting — Ollama → LM Studio → run Llama locally → Open Source
Ethics & Safety — Read the Principles page — guardrails, alignment, responsible use

The biggest mistake beginners make: signing up for 10 tools in week one. Master one thing first.