Skip to content

Home

Shubham’s AI Playbook

A personal reference for navigating the modern AI ecosystem — tools, workflows, models, and principles.


Explore the Playbook

  • Tools — Conversational AI, coding assistants, automation platforms and more.
  • Productivity — Microsoft Copilot, Notion AI, Grammarly, Otter.ai and more.
  • Hardware — Chips, cloud ML, and today’s most capable frontier models.
  • Creative & Hubs — Image, video, voice generation tools and model marketplaces.
  • Workflows — Five AI workflows plus industry verticals — healthcare, finance, legal, and more.
  • Glossary — 30+ key terms explained in plain English.
  • Principles — Six guiding rules plus AI safety, ethics, and practical guardrails.
  • Confusions — 20+ misconceptions that trip up early adopters — debunked.
  • History — Key milestones from 1950s to the present.
  • Open Source — Open-weight models you can run, fine-tune, or build on.
  • Follow — People, newsletters, YouTube channels and podcasts worth your time.

How the Major AI Companies Are Structured

Each major lab has a full vertical stack — a research org, a model family, products built on those models, and distinct tiers you choose between. Understanding this helps you know exactly what you’re paying for and why the same company can have five different products.

Anthropic

Founded 2021 · Safety-focused · San Francisco

Model Family: Claude

Products:

  • claude.ai — Chat interface (web & mobile)
  • Anthropic API — Direct model access for developers
  • Claude Code — Agentic CLI for coding tasks in terminal
  • Claude for Teams / Enterprise — Business plans with data privacy
  • Powering other apps — Cursor, Windsurf use Claude under the hood

Model Tiers (cheapest → most capable):

  • Haiku — Fast & cheap
  • Sonnet — Balanced (default)
  • Opus — Most capable

All tiers share the same 200K token context window.


OpenAI

Founded 2015 · Microsoft-backed · San Francisco

Model Families:

  • GPT series — General purpose
  • o-series — Reasoning models

Products:

  • ChatGPT — Chat interface (web, mobile, desktop)
  • OpenAI API — Direct model access for developers
  • Codex — Code-specialised model powering GitHub Copilot
  • DALL-E — Image generation (inside ChatGPT Plus)
  • Sora — Video generation (ChatGPT Pro)
  • ChatGPT Enterprise — Business plan, data privacy

Model Tiers (cheapest → most capable):

  • GPT-4o mini — Fast & cheap
  • GPT-4o — Balanced (default)
  • o1 / o3 / o4 — Deep reasoning

Note: o-series “thinks” before answering — slower but significantly smarter on complex problems.


Google DeepMind

Founded 1998 (Google) · Alphabet subsidiary · Mountain View

Model Family: Gemini + Imagen

Products:

  • Gemini.google.com — Chat interface (web & mobile)
  • Google AI Studio — Free API access for developers
  • Vertex AI — Enterprise API with Google Cloud integration
  • Gemini in Workspace — AI inside Docs, Gmail, Sheets, Slides
  • Imagen / ImageFX — Photorealistic image generation
  • Gemini on Android — On-device assistant replacing Google Assistant

Model Tiers (cheapest → most capable):

  • Flash — Fast & free
  • Pro — Balanced (1M context)
  • Ultra — Most capable

Gemini 1.5 Pro has a 1M-token context window — the largest of any major model.


DeepSeek (High-Flyer)

Founded 2023 · Hangzhou, China · Open weights

Model Family: DeepSeek R1 & V3

Products:

  • chat.deepseek.com — Chat interface
  • DeepSeek API — Very affordable pricing
  • Open weights — Run via Ollama or Hugging Face

Model Tiers (cheapest → most capable):

  • V3 — General purpose
  • R1 — Reasoning (matches o1)

MIT license — free for commercial use. Trained at a fraction of Western model costs.


How Other Tools Fit Into This Picture

Not every AI tool builds its own model. Most sit on top of the big three, or rely on the open-source community. Here’s where everything else fits:

Open Weights — No product, just models

Meta (Llama), Mistral, Google (Gemma), Microsoft (Phi-4), Yi (01.AI)

These companies release model weights publicly but don’t operate a major consumer product. They build the engine and let the community use it.

How you access them:

  • Ollama (run locally)
  • Hugging Face
  • Replicate
  • Together AI
  • LM Studio

Sits on top of the big three

Perplexity, Microsoft Copilot, GitHub Copilot, Cursor, Windsurf

These products don’t build their own foundation models. They call the OpenAI, Anthropic or Google APIs and wrap them in a specialised experience.

  • Perplexity → uses Claude, GPT-4o + its own search stack
  • Microsoft Copilot → powered by OpenAI GPT-4o
  • GitHub Copilot → powered by OpenAI Codex / GPT-4o
  • Cursor → uses Claude Sonnet or GPT-4o (you choose)
  • Windsurf → uses Claude or its own Codeium models

Infrastructure & Routing

LangChain, LlamaIndex, OpenRouter, CrewAI

These tools don’t provide AI themselves — they help you build systems that connect to and orchestrate multiple models and data sources.

  • OpenRouter → single API to route between 200+ models
  • LangChain → framework to chain LLM calls and tools
  • LlamaIndex → connects your data to any LLM
  • CrewAI → orchestrates multiple AI agents working together

Quick Comparison: Top Conversational AI

FeatureChatGPT (GPT-4o)Claude (Sonnet 4)Gemini (1.5 Pro)PerplexityOllama (Local)
Context window128K tokens200K tokens1M tokensVaries by modelVaries by model
Image input✅ (llava, llama3.2-vision)
Web browsing✅ (plugin)✅ (claude.ai)✅ Core feature — always on
Code execution✅ (Code Interpreter)✅ (Artifacts)❌ (model only)
Free daily limit~10–15 GPT-4o msgs, then falls back to GPT-4o mini (unlimited, less capable)~20–40 msgs/day on Sonnet; throttled or blocked afterGenerous — Flash mostly free; Pro is rate-limited~5 AI Pro searches/day; basic search unlimitedUnlimited — runs on your hardware
Paid planPlus $20/mo — higher GPT-4o limitsPro $20/mo — ~5× more usageGoogle One AI $20/mo — full 1.5 Pro accessPro $20/mo — unlimited Pro searchesFree forever
PrivacyData sent to OpenAIData sent to AnthropicData sent to GoogleData sent to Perplexity✅ 100% local
API access✅ OpenAI API✅ Anthropic API✅ Google AI Studio✅ Perplexity API✅ Local REST API
Best forGeneral tasks, plugins, broad useWriting, long docs, safety-critical tasksResearch, very long contextReal-time research with cited sourcesPrivacy, offline, cost-free automation

Tip: If you hit free limits often, the API is often cheaper for heavy use — you pay per token with no daily cap. For regular personal use, a $20/mo Pro plan on your most-used tool pays for itself quickly.


Complete Beginner? Here’s Your Path.

If you’re new to AI, don’t try to absorb everything at once. Follow this sequence over 4–6 weeks. Each step builds on the last.

1. Days 1–7 — Pick one AI assistant and use it every day

Don’t compare them yet. Use it for real tasks — drafting emails, summarising articles, explaining things you don’t understand. Volume over perfection.

  • ChatGPT — most popular, widest ecosystem
  • Gemini — best if you use Google Docs / Gmail
  • Claude — excellent for writing & long documents

All three are free. Pick one and stick with it for at least a week.

2. Week 2 — Learn what you’re actually working with

Before adding more tools, spend 30 minutes understanding the basics. It changes how you use everything.

  • Read the Glossary — token, prompt, context window, hallucination
  • Read Common Confusions — debunks myths everyone believes early on
  • Watch 3Blue1Brown’s “But what is a GPT?” on YouTube — 20 min, best visual explanation available

3. Weeks 3–4 — Add one specialised tool for your biggest pain point

Don’t sign up for everything. Pick the one that removes real friction from your day:

  • Too many meetings → Otter.ai or Fathom
  • Research-heavy work → NotebookLM
  • Lots of writing → Grammarly
  • Need web answers with sources → Perplexity

4. Month 2 — Stay current without getting overwhelmed

AI moves fast. You don’t need to read everything — just stay loosely informed.

  • One newsletter: The Rundown AI (daily, 5 min) or The Batch by Andrew Ng (weekly, deeper)
  • Follow Ethan Mollick on X — best practical takes on using AI at work

5. Month 2+ — Pick your lane

Go deeper based on what you actually care about. Once you have the basics, use this playbook as your reference. Pick the path that fits your interest:

  • Creative work — Midjourney → Runway → Suno → Creative page
  • Building products — Cursor → Lovable → n8n → Tools page
  • Business & automation — Microsoft Copilot → Zapier → Workflows page
  • How it works — fast.ai → Karpathy’s YouTube → Hardware & Models
  • Privacy / self-hosting — Ollama → LM Studio → run Llama locally → Open Source
  • Ethics & Safety — Read the Principles page — guardrails, alignment, responsible use

The biggest mistake beginners make: signing up for 10 tools in week one. Master one thing first.