ChatGPT

Stephen Wolfram, What Is ChatGPT Doing … and Why Does It Work? (Illinois: Wolfram Media Inc., March 2023)

Reviewed by Shaza Arif

Artificial Intelligence (AI) models such as ChatGPT have become increasingly popular owing to their impressive human-like responses generated in an unexpectedly short time. Individuals from all age groups continue to explore such models with great excitement, given their captivating potential in different fields from content writing to coding. While it is interesting to use and explore ChatGPT in particular, it is equally important to examine its internal workings to have a better understanding of the fascinating Chatbot. Authored by Stephen Wolfram, the book What Is ChatGPT Doing … and Why Does It Work? is a recent take on the subject. Wolfram is well-known for his contributions in Mathematics, Physics and Computer Sciences and is the founder of Wolfram Alpha (a search engine used to perform advanced computational tasks) and Mathematica (an algebraic software system). In approximately 100-odd pages, he deconstructs how and why ChatGPT can provide us with all the answers we ask in such ‘human-like’ writing style.

The book is structured into two parts. The initial part provides an overview of relevant Machine Learning (ML) concepts behind ChatGPT. The focus is on the explanation of neural nets (p.17), neural net training (p.35), loss function (p.31), embedding (p.46) and the process of training models. The subsequent section of the book sheds light on the content-generating capability of ChatGPT. In short, the book explains three simple stages to explain the working of ChatGPT. In the first stage, the human prompt is broken down into smaller parts (referred to as token) and converted into a series of digits through a process called ‘embedding’ (the process by which the computer starts to understand the text). In the next step, the series of numbers is then passed on to a series of layers of Artificial Neural Networks (ANN), where they are modified and form a new kind of embedding. The last part of the embedding is then assessed by the neural network to predict the next part by using thousands of available values. In the final stage, the complex structure of the neural network – the exact working of which is still not understood generates the final output that captures the nuance of human-like thinking in written text. The training data used by ChatGPT in the process comprises abundant text from online sources such as digital books and websites.

The author skillfully employs straightforward language, making intricate concepts more approachable for the reader. Significantly, the book highlights the remarkable progress in computer capabilities, particularly in areas such as human-like writing, which were once considered challenging for machines but are now achievable with relative ease. In this context, ChatGPT operates on a model with an underlying well-defined structure and approximately 175 billion perimeters (p.13). These parameters are sufficient to generate human-like text. He concludes that the computational speed and process used in ChatGPT are more straightforward than they were generally assumed – what he calls a ‘computationally shallower process’.

Wolfram also offers a compelling exploration of the intersection between Technology, Science, and Philosophy, delving into the intricacies and philosophical underpinnings of these fields. He thoughtfully presents the paradox of human achievement in technology and exploration -highlighting our significant strides in innovation and discovery, yet acknowledging the perpetual mysteries that elude our understanding, particularly in the realm of Artificial Neural Networks (ANN). The narrative underscores that, despite our advancements, some aspects of technology, especially as complex as ANN, may always remain beyond our understanding like the exact processing of the text by ANN. Similarly, syntax is not fed into the ChatGPT algorithms –  it is something that the system has implicitly learnt, and it remains for us to find out the exact process which enables it to do so. At its core, the book poignantly captures the essence of our human experience, reflecting on how our deep-seated awe and curiosity about our own intellect finds expression in our fascination with innovations like ChatGPT (dare we call them ‘creations’?). It looks into the profound realisation that our amazement is not merely with the technology itself but with what it reveals about us – our capacity for thought, innovation, and the creation of entities like AI and chatbots that mirror our intellectual prowess. This reflection on ChatGPT becomes a mirror, allowing us to marvel at the complexities and capabilities of the human mind, a journey that continually surprises and inspires us as we see our intellectual reflections in these advanced technologies.

The book culminates with a comparison between ChatGPT and Wolfram Alpha, shedding light on their distinct functionalities and applications. This section is particularly enlightening, offering a clear perspective on how these technologies differ and complement each other. Additionally, the latter part of the book is enhanced by illustrative examples that not only elucidate the inner workings of ChatGPT but also serve as thoughtful interludes, giving readers respite from the more technical discussion.

Despite its compact size, the book presents a rich and intellectually demanding exploration of advanced topics. It is for readers who already possess a foundational understanding of very technical concepts and training data. This is both its strength and may be considered a relative weakness since the text assumes familiarity with these principles, as they are not extensively elaborated upon but rather discussed at a more advanced level. This necessitates multiple readings for a complete grasp of the content, making it a challenging yet rewarding experience for those seeking deeper insights into these complex subjects.

While the book may not necessarily be an interesting read for many, it is highly recommended for AI and ML professionals, as well as students aspiring to explore these fields. It provides an insightful deep dive into the intricate world of AI and ML, enhancing understanding, especially of ChatGPT and its broader implications. Additionally, it offers valuable insights for policymakers, clarifying the capabilities and limitations of ChatGPT, which is crucial for informed decision-making on future technology regulations. For anyone with a basic grasp of AI and ML looking to expand their knowledge or explore the potential impacts of these technologies, this book is a valuable resource. Its depth and clarity make it a worthwhile read for those intrigued by the evolving AI landscape.

Shaza Arif is a Research Assistant at the Centre for Aerospace & Security Studies (CASS), Islamabad, Pakistan. She can be reached at [email protected].


Share this article

Facebook
Twitter
LinkedIn

Recent Publications

Browse through the list of recent publications.

The Cover-up: IAF Narrative of the May 2025 Air Battle

Even after one year since the India-Pakistan May war of 2025, the Indian discourse regarding Operation Sindoor remains uncertain under its pretence of restraint. The Pahalgam attack on 22 April, which killed 26 people, triggered an escalatory spiral. New Delhi quickly accused Pakistan-linked elements, while Islamabad refuted the allegation and demanded an independent investigation. On 7 May, India launched attacks deep inside Pakistan under what it later termed as Operation Sindoor. The political motive was intended to turn the crisis into coercive signalling by shifting the blame onto the enemy and projecting a sense of military superiority.
This episode, however, began to fray immediately as war seldom follows the intended script. Within minutes PAF shot down 7 IAF aircraft including 4 Rafales. On 8 May, Reuters reported that at least two Indian aircraft were shot down by a Pakistani J-10C, while the local government sources reported other aircraft crashes in Indian-occupied Jammu and Kashmir

Read More »

Why the IAF’s Post-Sindoor Spending Surge is a Sign of Panic

After Operation Sindoor, India is spending billions of dollars on new weapons. This is being taken by many people as an indication of military prowess. It is not. This rush to procure weapons is in fact an acknowledgement that the Air Force in India had failed to do what it was meant to do. The costly jets and missiles that India had purchased over the years failed to yield the promised results.

Sindoor was soon followed by India in sealing the gaps which the operation had exposed. It was reported that Indian Air Force (IAF) is looking to speed up its purchases of more than 7 billion USD. This will involve other Rafale fighter jets with India already ordering 26 more Rafales to the Navy in 2024 at an estimated cost of about 3.9 billion USD. India is also seeking long-range standoff missiles, Israeli loitering munitions and increased drone capabilities. Special financial powers of the Indian military were activated to issue emergency procurement orders. The magnitude and rate of these purchases speak volumes.

Indian media and defence analysts have over the years considered the Rafale as a game changer. When India purchased 36 Rafales aircrafts at an approximate cost of 8.7 billion USD, analysts vowed that the aircraft would provide India with air superiority over Pakistan. Operation Sindoor disproved all those allegations. Indian aircraft did not even fly in Pakistani airspace when the fighting started. India solely depended on standoff weapons that were launched at a safe distance. The air defence system of Pakistan, comprising of the HQ-9 surface-to-air missile system and its own fighters, stood its ground.

Read More »

May 2025: Mosaic Warfare and the Myth of Centralised Air Power

Visualise a modern-day Air Force commander sitting in the operations room, miles away from the combat zone, overseeing every friendly and enemy aircraft and all assets involved in the campaign. In a split second, he can task a fighter, reposition a drone, and authorise a strike. In today’s promising technological era, he does not even need an operations room; a laptop on his desktop will suffice. The situation looks promising as it offers efficiency, precision, and control. The term used for such operational control is ‘centralisation’, which has been made possible with advanced networking, integrating space, cyber, surveillance, artificial intelligence, and seamless communication, enabling a single commander to manage an entire campaign from a single node. Centralised command and control, championed by the Western air forces and then adopted by many others, has thus been seen as a pinnacle of modern military power.
The concept of centralisation, enabled by state-of-the-art networking, may seem promising, but it is nothing more than a myth.

Read More »