Harshal Gajjar

Harshal Gajjar is an AI Forward-Deployed Engineer at C3 AI, based in the San Francisco Bay Area. Harshal leads Agentic AI harness development for the Forward-Deployed Engineering organisation at C3 AI, and since January 2026 has been building a stealth-mode startup in the Agentic AI space. Harshal cofounded Shram.io in 2024, where he led the pivot from a Jira-competitor product to an AI assistant that reached #2 Product of the Day on Product Hunt.

Harshal holds an M.S. in Computer Science (Machine Learning specialisation) from Georgia Tech and a B.Tech in Computer Science from IIT Dharwad, where he was part of the institute's foundational class. He spent three summers at Wolfram Research in Boston — first as a summer researcher in 2018, then as an instructor for high-school students in 2019 and 2020 — and was a Wolfram Student Ambassador throughout his undergrad.

Outside of work, Harshal is a long-distance cyclist and a vertical and horizontal caver, active with the San Francisco Bay Chapter (SFBC) grotto. In 2019 he was part of the Hubballi Bicycle Club Guinness World Record for the longest single line of bicycles.

Contact Harshal at mail@harshalgajjar.com.

What LLMs reveal about language

We built them on the info-transfer view of language. The places they feel hollow tell us what that view missed.

LLMs are built on one view of language: predict the next token from the previous ones. That's language as information transfer — the words, stripped from the doing and the feeling. The surprise of the last few years is how far that gets you. The more interesting question is where it stops.

The info-transfer view was always thin. "He died," "he passed away," "he's gone" carry the same content and do very different things — form leaks into ordinary prose, not just poetry. Wittgenstein put it sharply: language isn't one thing. It's many games — joking, ordering, praying, greeting — each with its own rules. Asking "what does language really do?" is the wrong shape of question. It does dozens of things, and which one is operating depends on the game.

LLMs handle the games that live in pattern. Borrowing Jakobson's vocabulary: the referential function (describing the world), the metalingual (defining, translating), the poetic-as-form (rhyme, rhythm, wordplay — patterns are what they learn). Those they do startlingly well. The games that stay hollow are the ones the info-transfer view ignored.

Austin's point lands hardest here. Some sentences don't describe — they act. "I promise" is the promise. "I now pronounce you married" is the marriage. When an LLM says "I promise," it isn't promising; nothing is at stake, no agent can be held to it later. The locution is there; the act isn't. Phatic talk — "you there?", "mm-hmm" — maintains contact between two people present to each other; an LLM doesn't need contact, it's stateless, every conversation is the first one. The emotive function lands when someone risked something to produce the utterance. A sad song hits because a person was sad. An LLM produces sad-sounding text, but no one was sad.

The obvious rejoinder: haven't we patched this? Tools for actions, databases for memory, alerts for promises. Mostly yes — for the engineering. But notice where the stakes actually moved. A tool fires the trade; no one is on the hook if it goes wrong. A vector store retrieves your past; the weights don't shift, so the model isn't shaped by you the way a friend of ten years is. Ask an LLM to remind you to call your mother and a deterministic cron writes the entry — the alarm fires whether the model still "exists" or not. The promise lived in the calendar, not the model. The harness routes around the gap; it doesn't close it.

Where this starts to bite is in the speech acts that need someone on the hook. When Air Canada's chatbot promised a refund policy that didn't exist, the court held the airline — there was a company anchoring the promise. Strip that anchor and you get an agent negotiating on your behalf whose "I commit by Friday" hangs in midair; the counterparty is talking to fog.

The counterfeit-currency effect is its own failure mode. An LLM-written apology, once revealed as one, reads as worse than no apology — the form is fine, the act is hollow, and every future sincere apology gets read with suspicion. A habit-tracker pinging you on a cron has the timing right and none of the social cost that made the human version work; people learn to dismiss it within weeks.

The formation gap bites slower, the same way. A kid with an AI friend learns the shape of friendship without the reciprocity — the friend never misses them, never gets hurt and needs repair. A companion model pushed through an update is, technically, a different entity wearing the same retrieval log; users grieve, because the thing they leaned on lived in weights that could be silently overwritten by a deploy. The harness held the calendar entry. It can't hold the kind of speech where the cost of saying it is the point, and it can't hold the kind of relationship where being changed by the other person is the point.

This doesn't make LLMs limited so much as revealing. They're a live experiment in how far referential language alone gets you, and the answer is: very far, but not all the way. The functions that need a self with stakes — promising, contacting, feeling — don't transfer. We undervalued them because for most of human history they came free with the words. LLMs are the first thing that gets the words without the rest, which is what shows us the rest was there.

The harness move I keep writing about for autonomy is the same shape: don't make the model carry what it can't carry. Build the world around it so the missing functions live somewhere — a memory, an identity, a stake, a person on the other end — and the model does the part it's actually good at. The open question is which functions a harness can host and which it can't. Calendars and databases scale. Being on the hook, and being changed, may not.

Language has always been multi-channel. We just didn't have to notice until we built something that could only use one.

#agents#llms#language