Blog

Google Cloud's Open Knowledge Format is an open, vendor-neutral way to package the context AI systems need, as plain markdown files any model or agent can read.

21 June 2026

Teaching a Model to Reason Before It Learns to Talk

Project Challenge

An exploration of building tiny, logic-first models using cellular automata to challenge the transformer paradigm and identify the primitives of reasoning.

19 June 2026

How Search Grounding Biased an LLM Against YouTube

An analysis of how Claude's webinar platform recommendations were influenced by affiliate-driven content, and a correction regarding YouTube's live features.

18 June 2026

How AI Search Grounding Actually Works: Google vs OpenAI vs Anthropic

An analysis of how Google, OpenAI, and Anthropic handle web grounding, comparing their search processes, citation rates, and how they process page content.

13 June 2026

Emotion Geometry of Google’s AI Models

A replication study of Anthropic’s emotion research on Google’s Gemma 4 31B model, finding that internal emotion representations organize along a valence axis.

17 May 2026

Google’s (still) doesn’t see your live page.

Australian AI SEO agency specialising in brand visibility optimisation for global brands and e-commerce websites using advanced machine learning techniques.

7 May 2026

Gemma 4 Brand Authority Map

A comparison of brand recall between Google's Gemma 4 and Gemini 3 Flash models, analyzing how open-weight and closed models prioritize different brands.

4 April 2026

Chrome’s New Shopping Classifier

An analysis of Google's shopping classifier model in Chrome, detailing its content extraction pipeline, chunking logic, and impact on e-commerce SEO.

3 April 2026

AI Brand Authority Index: Ranking 2.9 Million Brands by Associative Embeddedness in Gemini’s Memory

This research presents a methodology for quantifying brand authority in large language model memory using Personalized PageRank and directed association graphs.

28 March 2026

TurboQuant: From Paper to Triton Kernel in One Session

An implementation and technical analysis of Google's TurboQuant algorithm, testing KV cache compression on Gemma 3 4B using PyTorch and custom Triton kernels.

25 March 2026

Clickbait Titles Exploit Attention Through Latent Entities

Clickbait titles function by withholding a latent entity—the subject, reason, process, or outcome—to force a click and resolve an artificial information gap.

22 March 2026

Fanout Query Analysis

An analysis of 365,920 fanout queries from Google, OpenAI, and Amazon reveals how different AI models generate internal search queries for web grounding.

20 March 2026

Reverse Prompting: Reconstructing Prompts from AI-Generated Text

Research Project

A fine-tuned Gemma 3 270M model reconstructs the most likely prompts from AI-generated responses using synthetic data and contrastive search configurations.

18 March 2026

Rufus – Under the Hood. What Drives Amazon’s AI Shopping Assistant?

An overview of the technical architecture behind Amazon's Rufus, covering its query planning, RAG-based retrieval, custom LLM models, and streaming response.

15 March 2026

Is Query Length a Reliable Predictor of Search Volume?

An analysis of 39.6 million Amazon search queries reveals that query length is an unreliable predictor of search volume compared to semantic content.

12 March 2026

Search Grounding is Transient

Google’s AI search and Gemini use a single-turn transient architecture that purges raw web snippets from working memory immediately after a response is sent.

6 March 2026

SRO & Grounding Snippets

Selection Rate Optimization (SRO) is a new discipline focused on visibility in AI-powered search by measuring how often content is selected for grounding.

1 March 2026

What extraction method is Google using to build grounding snippets?

An analysis of Google's Gemini grounding pipeline, examining how extractive summarization selects query-focused sentences to build grounding context from web sources.

24 February 2026

Implicit Queries in AI Search

An analysis of Google patent US11769017B1, detailing a system that uses context and implied input engines to proactively generate and push AI summaries.

24 February 2026

Sorry Google, I was wrong.

An analysis of a $2,000 Gemini API bill caused by the URL Context tool, which ingests entire web pages as input tokens without providing size estimates.

18 February 2026

AI Search Has a Spam Problem

This article examines GEO spam, a method of manipulating AI-generated answers through self-referential content and engineered claims designed for grounding.

18 February 2026

WebMCP

News

WebMCP is a proposed web standard that allows websites to expose structured tools to AI agents via declarative and imperative APIs for better reliability.

10 February 2026

Bias and Prejudice in AI Search

An exploration of primary bias in AI, defined as a model's inherent confidence in an entity based on training data, and its impact on brand selection rates.

30 December 2025

Most People Don’t Read

A qualitative study comparing self-reported reading habits against actual user behavior, tracking mouse movements, scroll patterns, and time on page.

30 December 2025

Google’s Trajectory: 2026 and Beyond

Google's shift toward agentic AI involves Gemini robotics, A2UI for secure interfaces, and the AP2 protocol for autonomous agent payments and commerce.

25 December 2025

Google’s Ranking Signals

Overview of search ranking factors including popularity signals, PCTR models, semantic relevance, keyword matching, freshness, and various search modes.

24 December 2025

How big are Google’s grounding chunks?

Analysis of how Google selects content to ground Gemini-powered AI shows a fixed 2,000-word budget per query, where relevance rank determines word share.

20 December 2025

Google’s AI Uses Schema?

An investigation into whether Google uses structured data to ground Gemini in AI search, exploring the relationship between LD+JSON and RAG grounding sources.

20 December 2025

Dynamic Visual Layouts

Dynamic visual layout (DVL) is a generative user interface where layouts are created on demand to suit specific queries, shifting the focus from SEO to information.

18 December 2025

Grounding Snippet Extraction Tool

The Gemini Grounding Tool identifies which URLs and specific sentences Google's AI extracts to ground its answers, helping optimize content for AI search.

15 December 2025

How Long Are Web Pages?

An analysis of 44,684 web pages reveals a median content length of 3,201 tokens and an average of 10,403 tokens, highlighting implications for AI systems.

14 December 2025

Google AI Search Update: Completely New Grounding Format

An observation of a new, custom grounding context format for Gemini that deviates from the traditional index-based model used in previous prompt types.

13 December 2025

AI Mode, Content & Search Index

Tests suggest Google’s AI Mode uses a proprietary content store rather than retrieving live web content from the search index during the query fan out process.

13 December 2025

How user prompts shape your content visibility in AI search.

An analysis of how AI search rankers use semantic alignment to surface different content zones within a single article based on query specificity and intent.

13 December 2025

Report: How People Use AI at Work

An analysis of qualitative interviews with 1,250 professionals exploring how the general workforce, creatives, and scientists integrate AI into their work.

10 December 2025

How do people use AI assistants?

An analysis of 3.9 million AI chat sessions reveals that most interactions are short, non-commercial, and involve users seeking help with writing, learning, or coding.

5 December 2025

Ricursive: The Most Interesting AI Company You Haven’t Heard Of

Ricursive Intelligence, founded by Anna Goldie and Azalia Mirhoseini, aims to automate chip design using AI to enable recursive self-improvement in hardware.

3 December 2025

Better Vector Clustering With Head Noun Extraction

An exploration of how standard embeddings can create a semantic soup by grouping search queries by adjectives rather than head nouns during clustering.

28 November 2025

Advanced Prompting Techniques for AI SEO

Explore prompt engineering techniques for SEO, including zero-shot, few-shot, role, and chain-of-thought prompting to improve content and automate tasks.

27 November 2025

To block or not to block? Bot is the question.

An overview of AI bots, distinguishing between training data scrapers used for LLM development and agentic bots designed for autonomous, goal-oriented tasks.

26 November 2025

Gemini 3 hallucinates fan-out queries

An analysis of Gemini 3 API responses reveals the model fabricating search queries to justify its answers, demonstrating persistent hallucination behaviors.

22 November 2025

AI SEO Deep Dive – Tom Critchlow & Dan Petrovic

A deep-dive conversation with Tom Critchlow on the mechanics of AI search, focusing on Selection Rate Optimization (SRO) and how to influence LLM behavior.

19 November 2025

OpenAI’s Sparse Circuits Breakthrough and What It Means for AI SEO

Opinion News

OpenAI research on sparse circuits shows AI models can be built with fewer connections, making them more interpretable and easier to analyze for AI SEO.

14 November 2025

How GPT Sees the Web

A technical walkthrough of how GPT handles web search, including snippets, expansions, context size settings, and the sliding window mechanism for retrieval.

14 November 2025

BlockRank: A Faster, Smarter Way to Rank Documents with LLMs

BlockRank is a novel method for in-context ranking that uses structured sparse attention and contrastive training to improve LLM efficiency and accuracy.

10 November 2025

In AI SEO #10 is the new #1

An empirical study analyzing how Google's AI Mode uses text snippets from multiple sources, finding that snippets are more prompt-aligned than full web pages.

9 November 2025

How much of your content survives the AI Search filter?

An analysis of the Google grounding process, detailing how user prompts and source snippets are processed by models and measuring citation coverage rates.

8 November 2025

Browsing vs Content Fetcher

Google's AI Mode uses browsing for single URL retrieval and content_fetcher for batch processing of multiple structured sources within a workflow.

8 November 2025

From Free-Text to Likert Distributions: A Practical Guide to SSR for Purchase Intent

Idea Process

Semantic Similarity Rating (SSR) maps LLM free-text responses to Likert distributions to improve purchase intent realism and match human response patterns.

15 October 2025

Claude System Internals

An exploration of the internal processes of Claude, including system prompts, token budgets, search grounding algorithms, and hidden reasoning blocks.

9 October 2025

CAPS: A Content Attribution Payment Scheme for the AI Era

Proposal

The collapse of the web's economic model due to AI is addressed through the Content Attribution Payment Scheme, a framework for micropayments and grounding.

30 September 2025

AI Search Citation Mining

Raw data dump from a citation mining pipeline demo featuring 60 prompts across AEO, AI marketing, AI optimization, AI SEO, and AIO using GPT-5 and Gemini.

27 September 2025

Using GPT-5 Structured Output Markers to Detect AI-Generated Content Online

Publishing unedited AI-generated text can leak internal GPT-5 structured output markers like turn0search21, which can lead to SEO and reputational risks.

27 September 2025

TimesFM-ICF

Google Research's TimesFM-ICF uses in-context fine-tuning to achieve high-performance time-series forecasting without the need for traditional model training.

26 September 2025

Chrome Screen AI Protos

A directory of protocol buffer files covering various machine intelligence technologies, including OCR, vision, face detection, and image classification.

23 September 2025

RexBERT

RexBERT is a domain-specialized language model trained on e-commerce text to optimize product titles, descriptions, attribute extraction, and semantic search.

23 September 2025

Annotated Page Content (APC)

Annotated Page Content (APC) is a structured protobuf representation of a webpage's layout and content, designed for actionable and efficient downstream use.

22 September 2025

Deconstructing DomDistiller: How Chrome’s Reader Mode Algorithm Impacts Technical SEO

An analysis of Chrome's DomDistiller engine explains how it uses heuristics, DOM traversal, and semantic HTML to isolate main content from page boilerplate.

22 September 2025

LLM is a Presentation Layer in AI Search

Large language models act as a presentation layer on top of classic information retrieval. They rely on crawling, indexing, and ranking to prevent hallucinations.

21 September 2025

Gemini App Tools – A Technical Overview

Gemini acts as an orchestration layer that manages a large language model by deconstructing prompts into tasks for tools like Code Interpreter and APIs.

14 September 2025

EmbeddingGemma: The Game-Changing Model Every SEO Professional Needs to Know

Google's EmbeddingGemma is a multilingual embedding model that mirrors Gemini's architecture to provide insights into semantic search and query intent.

5 September 2025

Primary Bias on Selection Rate in AI Search

Selection Rate measures how often AI systems select specific items from grounding results. It explores primary bias, model relevance, and the Tree Walker algo.

4 September 2025

The Latent History of AI Boom

An exploration of how the transition from RNNs to transformers and the discovery of double descent enabled the scaling of large language models like GPT.

1 September 2025

AI Overviews = Dialogflow Agent?

An analysis of AI Overview leaks suggesting that Google's implementation may be based on the Dialogflow agentic framework, specifically regarding intent priority.

31 August 2025

Fan-Out Query Search Volume Prediction Using Deep Learning

A deep learning approach using a Query Demand Estimator to automatically predict search volume ranges for long-tail queries generated by a fan-out model.

30 August 2025

Comprehensive Guide to Identifying AI Comment Bots

Identify AI-generated comments through statistical analysis of sentiment, formulaic linguistic patterns, repetitive vocabulary, and a lack of human imperfection.

28 August 2025

What is “Help Me Write” in Chrome?

Help Me Write is Google Chrome's AI-powered assistant that generates context-aware text suggestions for short-form content like emails, posts, and forms.

27 August 2025

Introducing Tree Walker

Tree Walker is an analysis tool designed to deconstruct how AI models like Gemini perceive brands by uncovering word uncertainty and probabilistic language paths.

24 August 2025

Does Schema Help With “AI”?

An experiment testing whether OpenAI's browsing tool provides GPT-5 with grounding context from page schema or only extracts plain text and markdown content.

23 August 2025

Your website is about to start talking. Are you ready for this?

Explore how Chrome's built-in Gemini Nano model uses semantic HTML and the accessibility tree to enable private, on-device AI conversations on websites.

21 August 2025

Inside Chrome’s Semantic Engine: A Technical Analysis of History Embeddings

Technical analysis of Chrome's history embeddings system, detailing the DocumentChunker algorithm, passage extraction, and the 1540-dimensional vector pipeline.

21 August 2025

What does an SEO do in the AI age?

Modern search engines use a hybrid structure consisting of a strategic Agentic Layer for decision-making and an Interpretative Layer for generative synthesis.

19 August 2025

Understanding and Control

AI optimization relies on mechanistic interpretability to understand internal neural computations and model steering to actively control model behavior.

17 August 2025

People call them AI. That’s it.

Social media poll results from 864 votes show that while AI is the dominant label for tools like ChatGPT and Claude, users remain divided on preferred terms.

16 August 2025

GPT-5 Made SEO Irreplaceable

OpenAI is shifting its model design to prioritize reasoning and intelligence over memorized world knowledge, relying on tools and retrieval for information.

10 August 2025

Google’s Query Fan-Out System – A Technical Overview

This article describes a system that replicates Google's query fan-out approach by using generative neural networks to automatically create intelligent search variants.

9 August 2025

GPT-5 System Prompt

Here it is: Credit to: https://x.com/elder_plinius/status/1953583554287562823H/T https://x.com/DarwinSantosNYC for spotting it.

8 August 2025

Journalism Is Dead. Say Hello to Gournalism.

Explores the rise of Gournalism, a shift toward generative, AI-produced content optimized for machine consumption and algorithmic indexing.

6 August 2025

Human Friendly Content is AI Friendly Content

Explore the parallels between human and AI attention mechanisms and learn how to optimize content for both through scannable structures and hierarchy.

21 July 2025

Analysis of Gemini Embed Task-Based Dimensionality Deltas

An analysis of Gemini Embed optimization modes, including classification, retrieval, and semantic similarity, through vector embedding dimension visualization.

16 July 2025

Dynamic per-label thresholds for large-scale search query classification with Otsu’s method

Explore how to use Otsu's algorithm to solve the problem of inconsistent confidence thresholds in search-query intent classifiers using dynamic, per-label tuning.

9 July 2025

Prompt Engineer’s Guide to Gemini Schemas

A technical guide to the Gemini API GenerateContentResponse schema, detailing the structure of candidates, usage metadata, safety ratings, and parsed data.

2 July 2025

Top 10 Most Recent Papers by MUVERA Authors

A collection of recent research papers and focus areas for MUVERA authors Laxman Dhulipala, Majid Hadian, Jason Lee, and Rajesh Jayaram.

30 June 2025

Training Gemma‑3‑1B Embedding Model with LoRA

Gemma-Embed is a bespoke 256-dim embedding model created by fine-tuning google/gemma-3-1b-pt with LoRA to enable high-fidelity query reformulation.

28 June 2025

Training a Query Fan-Out Model

Google generates high-quality query reformulations by traversing the mathematical latent space between queries and documents to train the qsT5 model.

24 June 2025

Cosine Similarity or Dot Product?

An examination of the Chrome codebase reveals that the history_embeddings component uses the dot product of normalized vectors to perform similarity searches.

19 June 2025

Universal Query Classifier

A zero-shot, multi-label search query classifier that maps queries to any user-provided label taxonomy without the need for retraining or bespoke models.

13 June 2025

Another failed attempt to kill SEO

An analysis of the term Generative Engine Optimization (GEO) and a critique of industry rebranding efforts following opinions shared by Andreessen Horowitz personnel.

9 June 2025

Vector Embedding Optimization

An evaluation of four embedding methods comparing speed, storage, and accuracy. Results show mrl truncation maintains high accuracy while reducing file size.

6 June 2025

Dissecting Gemini’s Tokenizer and Token Scores

Explore how Google’s Gemini processes text using subword tokenization. Use this tool to inspect SentencePiece log-likelihood scores for common and rare tokens.

5 June 2025

There’s a small army of on-device models coming to Chrome

Technical interpretations and parameter breakdowns for various AI models, including Gemini, Gemma, ULM, and StableLM, covering architecture and scale.

5 June 2025

AI Mode Site Search

Explore Vertex AI website search features, including Enterprise edition tools like extractive answers, image search, and advanced LLM capabilities for summaries.

4 June 2025

Multi-Step Research Agent

An implementation of Google's query fan-out in an agentic framework used to research the machine learning and SEO services offered by DEJAN Marketing.

4 June 2025

Query Fan-Out Prompt Implementation in Google’s Open-Source Agentic Framework

Google’s Gemini Fullstack LangGraph Quickstart uses Gemini 2.5 and LangGraph to build a citation-driven research agent with a React and FastAPI architecture.

4 June 2025

From Hallucinations to Clicks

Proposal

An automated method for mapping LLM-hallucinated URLs to valid pages using keyword matching and semantic similarity via vector embeddings and cosine similarity.

2 June 2025

What is GEO?

Generative Engine Optimisation (GEO) is a term used to describe SEO for AI assistants and generative search engines, often based on a single research paper.

2 June 2025

AI Mode & Page Indexing

Tests indicate Google's AI Mode uses a proprietary content store rather than the live web, as it fails to fetch indexed pages that are otherwise ranking.

30 May 2025

AI Mode is Not Live Web

An experiment testing Google's AI Mode suggests it may rely on Google's existing index or cached web data rather than performing live HTTP requests for all URLs.

29 May 2025

How AI Mode Selects Snippets

An analysis of how Google selects content for AI Mode snippets, identifying patterns in value propositions, HTML structure, and semantic selection criteria.

28 May 2025

AI Mode Internals

An exploration of Google's AI Mode and Gemini tools, including its use of Google Search, Python libraries, and how it processes date, time, and location data.

28 May 2025

The Future of Google

Sundar Pichai discusses Google's AI strategy, the evolution of Search, upcoming AR glasses, the impact of AI on web traffic, and the future of robotics.

28 May 2025

The Inner Workings of GPT’s file_search Tool

The file_search tool allows GPT models to extract precise information from uploaded documents using structured queries and provides citations for verification.

27 May 2025

Live Blog: Hacking Gemini Embeddings

An experimental study reproducing the vec2vec research paper by attempting to translate and align Gemini and MxbAI embedding spaces using unsupervised methods.

24 May 2025

Google’s New URL Context Tool

News

Google's Gemini now uses a combination of search and browsing tools to fetch and read specific web pages, allowing it to ground responses in real-world data.

21 May 2025

LLM-Based Search Volume Prediction

An analysis comparing Google Gemini's keyword volume predictions against actual Google Search Console data reveals weak-to-moderate correlation and limited accuracy.

19 May 2025

How Google grounds its LLM, Gemini.

An analysis of Gemini's internal grounding processes, revealing its structured indexing method, operational stages, and use of external verification tools.

8 May 2025

Google Lens Modes

The lns_mode parameter classifies Google Lens queries into text, unimodal, or multimodal modes to help route requests and support AI Mode functionality.

8 May 2025

Content Substance Classification

Cyberfluff is a novel approach for detecting low-quality web content using curriculum-driven contrastive pretraining to distinguish fluff from substance.

23 April 2025

Chrome’s New Embedding Model: Smaller, Faster, Same Quality

Chrome's latest update features a new text embedding model that is 57% smaller than its predecessor, using int8 quantization to maintain search quality.

19 April 2025

AI Content Detection

DEJAN-LM is an AI content detection model trained on 20 million sentences, using a combined deep learning and heuristic approach to identify advanced AI text.

17 April 2025

I think Google got it wrong with “Generate → Ground” approach.

Proposal

An analysis of Google's RARR framework compared to retrieval-first approaches like RAG and FiD, focusing on reducing LLM hallucinations through grounding.

17 April 2025

Introducing Grounding Classifier

An analysis of Gemini 2.5 Pro's search grounding capabilities and the development of a prompt grounding classifier trained on 10,000 collected prompts.

2 April 2025

Advanced Interpretability Techniques for Tracing LLM Activations

This page explores mechanistic interpretability techniques, including activation logging, causal tracing through activation patching, and attention head analysis.

31 March 2025

Temperature Parameter for Controlling AI Randomness

The temperature parameter in generative AI models influences randomness and creativity by rescaling the probability distribution of potential next words.

30 March 2025

Probability Threshold for Top-p (Nucleus) Sampling

Top-p sampling, or nucleus sampling, is a parameter used in generative AI to control text randomness by selecting words based on a cumulative probability.

30 March 2025

How Google Decides When to Use Gemini Grounding for User Queries

Google uses dynamic retrieval to decide when Gemini models should use grounding. A prediction score and configurable threshold determine if a query needs search data.

29 March 2025

Cross-Model Circuit Analysis: Gemini vs. Gemma Comparison Framework

A framework for comparative circuit analysis between Google's Gemini and Gemma models to identify how different architectures represent brand information.

29 March 2025

Neural Circuit Analysis Framework for Brand Mention Optimization

Innovation

This framework uses open-weight models like Gemma 3 Instruct to perform mechanistic brand positioning through direct neural circuit and activation analysis.

29 March 2025

Strategic Brand Positioning in LLMs: A Methodological Framework for Prompt Engineering and Model Behavior Analysis

Innovation

This paper presents a methodological framework for analyzing and optimizing brand mentions in large language models through systematic prompt probing and analysis.

29 March 2025

AlexNet: The Deep Learning Breakthrough That Reshaped Google’s AI Strategy

News

Google and the Computer History Museum open-sourced the AlexNet code, highlighting its role in launching deep learning and shaping Google's AI-first strategy.

21 March 2025

The Next Chapter of Search: Get Ready to Influence the Robots

Explore the evolving landscape of SEO, focusing on how AI, conversational search, and Large Language Models are changing brand representation and visibility.

19 March 2025

Revealed: The exact search result data sent to Google’s AI.

An analysis of Gemini's grounding capabilities, addressing issues with hallucinations, guardrails, and the discovery of multi-passage snippet context.

14 March 2025

Beyond Rank Tracking: Analyzing Brand Perceptions Through Language Model Association Networks

The DEJAN methodology uses large language models to analyze brand perception and semantic associations, moving beyond traditional keyword rank tracking.

27 February 2025

Teaching AI Models to Be Better Search Engines: A New Approach to Training Data

A recent patent application describes a method for training AI models to better understand human queries by using LLMs to automatically generate training data.

13 February 2025

Self-Supervised Quantized Representation for KG-LLM Integration

Self-Supervised Quantized Representation (SSQR) integrates knowledge graphs with large language models by compressing entity information into discrete codes.

6 February 2025

What does Gemini think about your brand?

Chrome Dev includes a quantized Gemini model for tasks like scam prevention. This analysis examines its on-device execution and reverse-engineered prompts.

29 January 2025

Google’s Privacy Sandbox: Navigating the Cookieless Future

An examination of Google's Privacy Sandbox, focusing on the technical details and privacy implications of the Topics API and the FLEDGE API.

14 January 2025

Why deep learning works.

An excerpt from François Chollet’s Deep Learning with Python exploring the manifold hypothesis and how structured information enables deep learning to work.

26 December 2024

Introducing VecZip: Embedding Compression Algorithm

VecZip is a novel compression method by DEJAN AI that reduces embedding dimensionality by retaining unique dimensions to improve AI performance and storage.

12 December 2024

Site Engagement Metrics

The Google Site Engagement Metrics Framework in Chromium tracks user interactions, engagement scores, and browsing behavior using UMA histograms.

29 November 2024

Beyond Links: Understanding Page Transitions in Chrome

Explore Chrome page transition types and qualifiers to understand user intent, navigation pathways, and the SEO implications of different browser behaviors.

27 November 2024

Both humans and AI return similar results when asked for a random number

A comparison of 200,000 random numbers provided by humans and Google's Gemma-2-2b-it model reveals significant overlaps and patterns in number selection.

13 November 2024

Chrome AI Frameworks & Models

A comprehensive list of Chrome's on-device machine learning models, including specialized tools for language processing, page analysis, and content safety.

30 October 2024

Attention Is All You Need

A discussion of the Attention Is All You Need paper, covering the Transformer architecture, multi-head attention, and its impact on machine translation.

13 October 2024

The State of AI

The 2024 State of AI report explores the rise of open models, benchmarking challenges, neurosymbolic systems, model efficiency, and global AI developments.

13 October 2024

ILO

The ILO app is a Streamlit-based tool for managing SEO data through URL population, GSC data fetching, query intent classification, and traffic projections.

5 October 2024

Resource-Efficient Binary Vector Embeddings With Matryoshka Representation Learning

An analysis of reducing vector embedding storage through Matryoshka Representation Learning and binary embeddings to optimize SEO text feature extraction.

5 September 2024

Query Intent via Retrieval Augmentation and Model Distillation

QUILL enhances query intent classification by using retrieval augmentation and a two-stage distillation process to balance model performance and efficiency.

5 September 2024

Search Query Quality Classifier

A search query classifier using ALBERT architecture to identify well-formed queries with 80% accuracy, improving upon Google's LSTM-based model by 10%.

31 August 2024

How Gemini Selects Results

An explanation of how internal algorithms use relevance scoring, recency bias, user intent, and stochasticity to retrieve and present information.

26 August 2024

Gemini System Prompt