All articles

The Grade That Changed Everything

How a failed exam, a wise professor, and an obscure mathematical inequality quietly built the foundation of modern AI


Bucharest, 1992. I had just earned my place at the Faculty of Electronics — one of the hardest universities to get into in Romania at the time. My grades were strong. I had a merit scholarship. I was, by any objective measure, a student who knew how to study.

Which makes what happened at my Mathematical Analysis exam all the more embarrassing to admit.

I walked in with confidence. I reached into the pile of exam topics and pulled one out. I read the words and felt a rush of genuine joy: Cauchy-Bunyakovsky-Schwarz inequality. I knew this one. I had studied it carefully. I pressed pen to paper and started writing the demonstration, fast and clean, the kind of writing that happens when you actually understand something.

The professor walked by. He looked at my paper. Then he said, quietly but firmly: "Move on to subject two, please."

Subject two was a limit problem. A wall of symbols, zeros, and mathematical expressions that, to me, might as well have been written in ancient Sumerian. I had nothing. Not a clue. Not even a direction to start guessing.


Here is where I did something I am simultaneously proud of and embarrassed by.

I stalled. I looked at the problem. I looked at my previous work. And then, somewhere between panic and inspiration, an idea arrived. I raised my hand and told the professor: "Sir, I believe this limit can be solved using the Cauchy-Bunyakovsky-Schwarz inequality."

I had absolutely no idea if that was true. I said it because it was the only thing I knew, and I needed to buy time.

The professor looked at me for a long moment. Then he said: "Return to your seat and think about it."

Time passed. The clock on the wall did not cooperate. When he finally called me back to the front, I had nothing new to offer except conviction. I told him again that this inequality was fundamental to advanced mathematics, that it mattered deeply, that it was load-bearing in ways that weren't always obvious. I held on to that statement with everything I had, because it was all I had.

He reached for my grade book. He turned through the pages slowly, reading. Then he stopped. "Tens, tens, tens... but you have a five here, in physics?" I corrected him quickly: "No, sir — that was here, in your course, last semester." He looked at me. He looked at the page. He looked at me again.

Then he wrote in my grade book and handed it back.

A five. A passing grade, barely. Enough to continue.


Years passed. I started a doctoral program in Neural Networks. I worked. I built things. I moved through the world of computing and systems. Years became decades.

Then, last year, I took a machine learning course at Stanford. The kind of course where you expect to see things you already know — gradient descent, weights, bias, activation functions. Familiar territory. And it was, mostly.

Until one moment that I can only describe as a lightning strike to the back of the skull.

I was looking at the mathematics of how neural networks measure similarity between things — how they decide whether two pieces of information are related, how they figure out which words belong together, how attention mechanisms in modern AI actually work. And something clicked open inside me.

Vectors in multidimensional space. Dot products. Similarity. Alignment. Projection.

Wait.

Oh.

The inequality I had half-bluffed my way through in 1992. The one I had clung to in desperation in front of a professor who could see right through me. The thing I had described as "fundamental to advanced mathematics" without really understanding why — it turns out I was accidentally, unknowingly, completely right. It is fundamental. It is load-bearing. It is sitting at the very center of how modern artificial intelligence understands the world.


Let me try to explain this in a way that requires no math degree, because the idea is actually beautiful and simple.

Imagine you are trying to describe whether two people have similar taste in music. You could list every song they've ever liked and compare the lists. But that's unwieldy. Instead, imagine you could represent each person as an arrow in space — where the arrow points encodes everything about their preferences. If two people point in exactly the same direction, they are perfectly aligned. They love the same things. If they point in opposite directions, they disagree on everything. If their arrows are perpendicular — pointing at right angles to each other — they share nothing in common.

This is not a metaphor. This is literally how modern AI works. Every word, every sentence, every idea in a language model exists as an arrow — a vector — in a space with hundreds or thousands of dimensions. When ChatGPT decides that "king" is related to "queen," it's measuring the angle between two arrows. When a recommendation algorithm decides you might like a film you've never seen, it's measuring alignment between your preference arrow and that film's arrow.

The Cauchy-Bunyakovsky-Schwarz inequality is the mathematical rule that makes all of this coherent. It says, in essence: the similarity between two arrows can never be greater than perfect alignment. It gives you a guaranteed boundary. It tells you that similarity is always a number between -1 and 1, never outside that range, never unbounded, always interpretable. Without it, the entire geometry collapses. Similarity scores become meaningless. The comparison between vectors — which is the beating heart of modern AI — stops working.

It is why Google can find what you're looking for even when you don't use the exact right words. It is why AI translation works. It is why every time a language model produces a coherent sentence, it is implicitly, invisibly doing something that relies on this principle.

The thing I bluffed about in an exam in Bucharest in 1992 turned out to be, quite literally, one of the structural pillars of the technology that defines our era.


I don't know that professor's name. I wish I did. I've thought about him many times over the years, but never with the weight I feel now.

He knew. He had to have known. He was looking at a grade book full of tens — a student who clearly understood the material, who had clearly put in the work — and he was also looking at a young man who was having a very bad exam day and trying to hide it with audacity. He made a choice. He gave me a five.

Not a ten. Not a failure. A five.

I have wondered for years why. Was it mercy? Was it a signal — you can do better than this? Was it a quiet lesson in the difference between knowing something and performing knowing? I don't know. What I know is that a failure at that moment would have meant repeating the exam, potentially losing my scholarship, possibly a detour that changes everything that came after.

He didn't give me a ten I didn't earn. He didn't fail me when he could have. He gave me exactly enough to keep going.


I am an AI engineer now. I understand how neural networks work. I understand how information becomes vectors, how vectors become similarity scores, how similarity scores become the answers your phone gives you when you ask it something. I understand, at a level I couldn't have imagined in 1992, why that inequality is fundamental.

And I find myself thinking: that five is the grade I treasure most. More than any ten I ever received, in any subject, at any stage of my life. A ten tells you what you already know. A five, given with wisdom by someone who understood more than they let on, can tell you who you might become.

If I could find that professor today, I would tell him: I became what I became, in part, because you let me pass.

Mathematics is not cold. It is not dry. It is not a collection of symbols that only experts can understand. It is the language underneath everything — underneath the AI assistants, the search engines, the recommendation systems, the translation tools that are changing how humanity communicates. It is beautiful in the way that a load-bearing wall is beautiful: you don't always see it, but remove it, and everything falls.

I almost failed an exam on one small piece of that language. Instead, I spent thirty years walking toward it, not knowing it was waiting for me.

That is what mathematics does. It waits. And when you finally arrive, it welcomes you like an old friend you never knew you had.


Published on OmniTechnicus.ai

Continue Reading

A Correction, in Public: Revisiting AI Oversight in the United States

A follow-up to "The Global AI Risk Assessment Convergence." The original argued that three jurisdictions—the EU, South Korea, and the United States—were converging on human oversight for high-risk AI. Two of those pillars hold. The third does not: the US section rested on Executive Order 14110, which had already been revoked at the time of publication. This piece corrects the record in the open, reframes the US not as converging but as contested, and draws the durable engineering lesson underneath a changing regulatory landscape.

Agentic AI Is Distributed Systems. Part 2: Building on Uncertain Ground — Practical Semantic Correctness for Agentic Systems

You cannot mathematically prove an LLM correct. But you can architect around it. Part 2 surveys the three research communities approaching semantic correctness for LLM outputs — formal verification, runtime guardrails, and post-condition verification — maps their results honestly, and delivers a concrete engineering playbook: four patterns engineers can implement today to build agentic systems that fail visibly, recover gracefully, and escalate correctly.

Agentic AI Is Distributed Systems. Part 1: The Map Every AI Engineer Should Have Memorized

Every major agentic AI pattern has a named distributed systems ancestor going back decades. MoA is ensemble computing. ReAct is a control loop. AutoGen is the actor model. LangGraph is a workflow engine. Multi-agent coordination is a blackboard system. This article maps each AI concept to its CS lineage, explains what that means for engineers building today, and names the three problems that genuinely have no distributed systems precedent — the real frontier where new thinking is required.