This essay is the second of six in a series discussing Transformers and Post-Transformers. It establishes what "observability" means in the context of software, alongside human and LLM thinking, and why the first meets the definition while the second an...
This essay is the first of six in a series discussing Transformers and Post-Transformers. It establishes the meaning of "Intelligence" and "Functional General Intelligence," and provides a reference point for those terms in future essays.
I recently completed a content moderation system for an enterprise social media platform. It would be understatement to say some of its users have incompatible worldviews.
It's an understatement to say "AI" (really, Transformer-based LLMs) are actively remaking the world. Yes, other forms of AI exist. Diffusion models and RL still matter a lot. And if you're not paying attention to BDH, you should be: as we're witnessing ...
Over the last two years, I built either an "industry-killer" or "cool toy": an AI-powered platform for high-quality article generation. The "Eureka!" moment wasn't discovering that Generative AI can be used to do this. It was realizing that the REAL pro...
In November 2024, the CEO of a global digital advertising company called me about a problem potentially costing his firm millions: lost Requests For Proposal (RFP).
Complex prompts often produce good results up to a point, after which additional refinements can reduce output quality. This happens because additional instructions create competing attention patterns in the transformer's processing.
For the past two years, I've served as the Lead Architect (and frequent bench engineer) on several applied projects integrating various Large Language Models (LLMs).
Most of what folks call "AI" now runs on Transformer-based architectures. GPTs (Generative Pre-trained Transformers) are the most familiar variant; the family also includes encoder-only models like BERT and encoder-decoder models like T5. A key concept ...
Clarity in verbal and written communication is among the most critical skills we learn as human beings. But using complex vocabulary, while efficient in expert audiences with shared knowledge, is often the wrong approach.