
Artificial Intelligence
Michael Negnevitsky
What's inside?
Explore the fascinating world of artificial intelligence and understand how intelligent systems work, shaping our present and future.
You'll learn
Key points
01How Do Machines Actually Learn to Think?
Stepping into the world of artificial intelligence often feels like trying to learn a completely foreign language, but the basic concepts are surprisingly rooted in human common sense. Let us break down the fundamental question of what it really means for a piece of metal and silicon to exhibit something resembling intelligence. For decades, the public perception of artificial intelligence has been heavily skewed by movies featuring sentient robots and dramatic futuristic landscapes. However, the reality presented in Michael Negnevitsky’s work is far more grounded, practical, and arguably more fascinating. The journey begins with a simple premise: intelligence is not just about crunching numbers at lightning speed, but rather about processing knowledge, recognizing patterns, and making informed decisions in less-than-perfect conditions. To truly appreciate modern intelligent systems, we must first look at the historical shift from traditional computing to knowledge-based processing. A standard calculator is incredibly fast, but it is entirely devoid of intelligence. If you ask a calculator to divide a number by zero, it simply returns an error. It does not understand the context of the numbers, nor does it care. Traditional computer programs operate on a strict diet of data and rigid algorithms, following step-by-step instructions to achieve a predictable outcome. But human intelligence does not work this way. When a doctor diagnoses a patient, they are not merely running a mathematical formula; they are drawing upon years of experience, recognizing subtle symptoms, weighing probabilities, and applying generalized knowledge to a specific, unique situation. The monumental challenge for early computer scientists was figuring out how to transfer this messy, complex human knowledge into a format that a machine could utilize. Negnevitsky introduces us to the concept of the knowledge representation bottleneck. This is the crux of the artificial intelligence dilemma. How do you take the intuitive, often subconscious expertise of a human professional and translate it into code? In the early days, researchers believed that if they could just write enough logical rules, they could simulate human thought. This led to the era of Good Old-Fashioned AI, which relied heavily on formal logic and symbol manipulation. While these early systems achieved remarkable success in closed environments—like solving algebraic equations or playing checkers—they failed spectacularly when exposed to the chaotic, unpredictable nature of the real world. A machine that could play flawless chess might be completely incapable of recognizing a picture of a cat, simply because the rules of "cat-ness" are too ambiguous to be hard-coded into traditional logic. This realization prompted a massive paradigm shift in the field. Instead of trying to painstakingly program every single rule of the universe into a computer, scientists began to ask themselves a different question: What if we could build systems that learn from their environment, adapt to new information, and handle uncertainty just like we do? This question serves as the launching pad for the entire book. We are introduced to the idea that artificial intelligence is not a single, monolithic technology, but rather a diverse toolbox of different methodologies. Each tool in this box has been specifically designed to tackle a different aspect of human cognition. Consider the sheer variety of tasks your brain performs on a daily basis. You use strict logical reasoning when balancing your checkbook. You use fuzzy, approximate judgments when deciding if the bathwater is too hot. You use pattern recognition when picking out a friend's face in a crowded room. You even use a form of evolutionary trial-and-error when trying to optimize your morning commute through heavy traffic. Negnevitsky expertly structures his exploration of artificial intelligence around these distinct cognitive functions. By breaking down intelligence into these manageable components, the seemingly impossible task of building an artificial mind becomes an understandable engineering process. One of the most engaging aspects of this introductory exploration is the demystification of the term "expert." We often view experts as individuals possessing some unattainable genius, but in the realm of intelligent systems, expertise is simply a large collection of specialized knowledge combined with the ability to apply it effectively. If we can extract this knowledge and formalize it, we can create machines that replicate the decision-making processes of our brightest minds. This brings us to the Turing Test, the famous benchmark proposed by Alan Turing to determine if a machine can exhibit intelligent behavior indistinguishable from that of a human. While passing the Turing Test remains a fascinating philosophical goal, the practical application of artificial intelligence has moved toward building systems that assist, augment, and enhance human capabilities rather than simply mimicking conversational behavior. As we prepare to delve deeper into the specific technologies that make up this fascinating field, it is crucial to keep this foundational perspective in mind. The goal of building intelligent systems is not to create a mechanical human, but to capture specific fragments of human cognitive power and amplify them. Whether it is preserving the disappearing knowledge of a retiring specialist, automating complex industrial processes, or finding hidden patterns in massive datasets, the practical value of artificial intelligence lies in its ability to solve real-world problems. We are standing at the threshold of a technological revolution, and understanding the basic building blocks of machine thought is the key to navigating the future. Let us move forward and examine the very first major triumph in this field: the ability to capture human expertise in the form of simple, logical rules.
02Capturing Human Expertise in Simple Rules
We often rely on highly trained specialists to solve our most pressing problems, from diagnosing a strange engine noise in our car to identifying a rare medical condition. What if we could bottle that hard-earned expertise, preserve it forever, and hand it to anyone who needs it? This is exactly the promise of rule-based expert systems, one of the earliest and most commercially successful branches of artificial intelligence. In this chapter, we explore how computer scientists learned to interview human specialists, extract their knowledge, and translate it into a language that machines can execute flawlessly. At the incredibly beating heart of every expert system lies a remarkably simple concept: the IF-THEN rule. While human decision-making feels incredibly complex and nuanced, Negnevitsky reveals that much of professional expertise can actually be distilled into these straightforward conditional statements. Consider a seasoned mechanic examining a car that refuses to start. The mechanic does not consciously run complex calculus equations in their head; instead, they rely on a chain of logical deductions built over years of experience. They might think, "If the engine does not turn over, and the headlights are dim, then the battery is likely dead." This is a classic IF-THEN rule. The "IF" part represents the condition or the evidence we observe, while the "THEN" part represents the conclusion or the action to be taken. Building an expert system requires two main components: the knowledge base and the inference engine. You can think of the knowledge base as a massive library containing hundreds or even thousands of these IF-THEN rules. It is the repository of human expertise. However, a library is entirely useless without a librarian to search through the books and find the right answers. This is where the inference engine comes into play. The inference engine is the "brain" of the expert system. It is a computer program that actively searches the knowledge base, matches the rules against the current data, and logically infers new information to reach a final conclusion. To understand how the inference engine navigates this vast sea of rules, we must look at the two primary methods of reasoning: forward chaining and backward chaining. These two approaches mirror the different ways humans tackle problem-solving in everyday life. Forward chaining is entirely data-driven. It starts with the known facts and works forward to see what conclusions can be drawn. Picture an investigator walking into a chaotic crime scene. They gather fingerprints, secure the weapon, and interview witnesses. They take all this scattered data and push it forward through their internal rule base to eventually identify a suspect. In a computer system, forward chaining looks at the available data, fires any rules whose "IF" conditions are met, generates new facts, and repeats the process until no more rules can be fired. This method is incredibly useful for tasks like system monitoring, where data is constantly flowing in, and the system needs to trigger an alarm if a dangerous combination of events occurs. On the flip side, backward chaining is goal-driven. It starts with a specific hypothesis or a goal and works backward to see if the available evidence supports it. Returning to our crime scene analogy, backward chaining is like a detective who already has a prime suspect in mind. The detective's goal is to prove the suspect's guilt, so they work backward, looking specifically for the evidence—financial motives, lack of an alibi, or physical clues—that would validate their hypothesis. In the medical field, a doctor often uses backward chaining. They might suspect a patient has a specific infection based on a quick glance. The doctor will then ask targeted questions and order specific lab tests to confirm or deny that initial hypothesis. Expert systems designed for diagnosis almost always rely on backward chaining because it is much more efficient to ask the user a few targeted questions rather than demanding they input every single fact about their life before reaching a conclusion. One of the most famous historical examples highlighted in the study of expert systems is MYCIN, a program developed in the 1970s to diagnose blood infections and recommend antibiotic therapies. MYCIN was a revolutionary achievement because it proved that a machine could perform at a level equal to or better than human specialists in a highly complex domain. The system worked by asking the attending physician a series of questions about the patient's symptoms and lab results. Using backward chaining, MYCIN navigated through hundreds of rules regarding bacterial infections. However, MYCIN also introduced a critical innovation: the ability to handle uncertainty. Human experts rarely deal in absolute certainties. A doctor might say, "Given these symptoms, I am 80% sure this is a streptococcus infection." To capture this reality, developers introduced certainty factors into expert systems. A certainty factor is a number that represents the level of belief in a particular fact or rule. It allows the system to accumulate evidence from multiple, sometimes conflicting, sources and calculate a final confidence level for its recommendation. This addition made expert systems significantly more robust and human-like in their reasoning. Despite their incredible success in specialized fields, rule-based expert systems also possess notable limitations. They are notoriously "brittle." If a situation arises that falls even slightly outside their programmed rules, they cannot adapt or use common sense; they simply fail. Furthermore, the process of extracting rules from human experts—a process known as knowledge engineering—is incredibly time-consuming and difficult. Experts often struggle to articulate exactly why they make certain decisions, relying heavily on intuition that defies simple IF-THEN logic. This bottleneck in knowledge acquisition signaled to researchers that while rigid rules are excellent for capturing structured expertise, they are insufficient for replicating the fluid, adaptable nature of true human intelligence. To build smarter systems, machines needed to learn how to operate not just in black and white, but in the infinite shades of gray that define the real world.

Continue reading with LeapAhead app
Full summary is waiting for you in the app
03Embracing the Gray Areas with Fuzzy Logic
04Mimicking the Human Brain with Neural Networks
05Evolution in Action Through Genetic Algorithms
06Combining Forces for Smarter Hybrid Systems
07Conclusion
About Michael Negnevitsky
Michael Negnevitsky is a Professor in Electrical Engineering and Computer Science at the University of Tasmania, Australia. He is an expert in artificial intelligence, specializing in knowledge and data engineering, intelligent systems, and power system engineering.