While ML offers extensive benefits, it also presents significant challenges; among the most prominent is bias in ML models. Bias in ML refers to systematic errors or influences in a model's predictions that lead to unequal treatment of different groups. These biases are problematic because they can reinforce existing inequalities and unfair practices, translating into real-world consequences such as discriminatory hiring or unequal law enforcement, thereby entrenching injustice and inequality.
The AI alignment problem sits at the core of every prediction about AI's safety. It describes the complex challenge of ensuring that AI systems act in ways that are beneficial rather than harmful to humans, keeping AI goals and decision-making processes aligned with human ones no matter how sophisticated or powerful the AI system becomes. Our trust in the future of AI rests on whether we believe alignment can be guaranteed.
In the summer of 1956, a small gathering of researchers and scientists at Dartmouth College, a prestigious Ivy League school in Hanover, New Hampshire, ignited a spark that would forever change the course of human history. This historic event, known as the Dartmouth Workshop, is widely regarded as the birthplace of artificial intelligence (AI) and marked the inception of a new field of study that has since begun revolutionizing countless aspects of our lives.
The rapid development of AI brings both extraordinary potential and unprecedented risks. AI systems are increasingly demonstrating emergent behaviors, and in some cases, are...
Label-flipping attacks refer to a class of adversarial attacks that specifically target the labeled data used to train supervised machine learning models. In a typical label-flipping attack, the attacker changes the labels associated with the training data points, essentially turning "cats" into "dogs" or benign network packets into malicious ones, thereby aiming to train the model on incorrect or misleading associations. Unlike traditional adversarial attacks that often focus on manipulating the input features or creating adversarial samples to deceive an already trained model, label-flipping attacks strike at the root of the learning process itself, compromising the integrity of the training data.
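As a rough illustration of how simple such an attack can be, the sketch below flips a fraction of labels in a synthetic binary dataset before training and compares the resulting accuracy against a cleanly trained model. The scikit-learn models, the 20% flip budget, and the flip_labels helper are assumptions made for the example, not a prescribed attack recipe.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Illustrative assumption: a synthetic binary task and a 20% flip budget.
rng = np.random.default_rng(0)
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

def flip_labels(y, flip_fraction, rng):
    """Return a copy of y with a random fraction of labels inverted."""
    y_poisoned = y.copy()
    n_flip = int(flip_fraction * len(y))
    idx = rng.choice(len(y), size=n_flip, replace=False)
    y_poisoned[idx] = 1 - y_poisoned[idx]   # binary labels: 0 <-> 1
    return y_poisoned

clean_model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
poisoned_model = LogisticRegression(max_iter=1000).fit(
    X_train, flip_labels(y_train, 0.20, rng))

print("clean accuracy:   ", clean_model.score(X_test, y_test))
print("poisoned accuracy:", poisoned_model.score(X_test, y_test))
```

Even this crude, untargeted flipping typically degrades test accuracy; real attacks choose which labels to flip to cause targeted misclassifications with a much smaller footprint.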
Because it demands so much manpower, cybersecurity has already benefited from AI and automation to improve threat prevention, detection, and response; spam filtering and malware identification are common examples. However, AI is also being used, and will be used increasingly, by cybercriminals to circumvent cyberdefenses and bypass security algorithms. AI-driven cyberattacks have the potential to be faster, more widespread, and less costly to implement, and they can be scaled up in ways that have not been possible in even the most well-coordinated hacking campaigns. These attacks can also evolve in real time, achieving high rates of impact.
Model stealing, also known as model extraction, is the practice of reverse engineering a machine learning model owned by a third party without explicit authorization. Attackers don't need direct access to the model's parameters or training data to accomplish this. Instead, they often interact with the model via its API or any public interface, making queries (i.e., sending input data) and receiving predictions (i.e., output data). By systematically making numerous queries and meticulously studying the outputs, attackers can build a new model that closely approximates the target model's behavior.
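A minimal sketch of that workflow is shown below, with a locally trained random forest standing in for the remote model behind an API; the query_api helper, the query budget, and the uniform sampling strategy are illustrative assumptions rather than an optimized extraction method.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier

# Illustrative stand-in for a remote prediction API: a fitted model the
# attacker can only call, not inspect.
X, y = make_classification(n_samples=5000, n_features=10, random_state=1)
victim = RandomForestClassifier(random_state=1).fit(X, y)

def query_api(batch):
    """Black-box access: inputs in, predicted labels out (no parameters leak)."""
    return victim.predict(batch)

# The attacker samples query points from a plausible input range, harvests
# the API's answers, and fits a local surrogate on the (query, answer) pairs.
rng = np.random.default_rng(1)
queries = rng.uniform(low=X.min(axis=0), high=X.max(axis=0),
                      size=(3000, X.shape[1]))
answers = query_api(queries)
surrogate = DecisionTreeClassifier(random_state=1).fit(queries, answers)

# Agreement between surrogate and victim on fresh probe points approximates
# how faithfully the behavior was copied.
probe = rng.uniform(low=X.min(axis=0), high=X.max(axis=0),
                    size=(1000, X.shape[1]))
print("agreement:", (surrogate.predict(probe) == query_api(probe)).mean())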
A model inversion attack aims to reverse-engineer a target machine learning model to infer sensitive information about its training data. Specifically, these attacks exploit the model's internal representations and decision boundaries to reveal sensitive attributes of the training data. Take, for example, a machine learning model that leverages a Recurrent Neural Network (RNN) architecture to conduct sentiment analysis on encrypted messages. An attacker using model inversion techniques can strategically query the model and, by dissecting the softmax output probabilities or even hidden layer activations, approximate the semantic and syntactic structures used in the training set.
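The sketch below captures the core idea against a much simpler target than the RNN in the example: a small feed-forward classifier whose gradients we assume are directly accessible (in a true black-box setting the attacker would estimate them from repeated queries). The invert_class helper, the 64-dimensional input, and all hyperparameters are assumptions made for illustration.

```python
import torch
import torch.nn as nn

# Simplified stand-in for the target: a small feed-forward classifier over
# 64-dimensional feature vectors with 4 classes (illustrative assumption).
torch.manual_seed(0)
target_model = nn.Sequential(nn.Linear(64, 32), nn.ReLU(), nn.Linear(32, 4))
target_model.eval()

def invert_class(model, class_idx, steps=200, lr=0.1):
    """Gradient-ascend on an input so the model's softmax probability for
    class_idx is maximized; the result approximates a class-representative
    input, which is the core idea behind model inversion."""
    x = torch.zeros(1, 64, requires_grad=True)
    opt = torch.optim.Adam([x], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        prob = torch.softmax(model(x), dim=1)[0, class_idx]
        loss = -torch.log(prob + 1e-9)   # maximize target-class probability
        loss.backward()
        opt.step()
    return x.detach()

# A synthetic input the model maps strongly to class 2; against a model
# trained on private data, such reconstructions can leak training attributes.
reconstruction = invert_class(target_model, class_idx=2)
print(reconstruction.shape)
```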
Gradient-based attacks refer to a suite of methods employed by adversaries to exploit vulnerabilities inherent in ML models, focusing particularly on the optimization processes these models use to learn and make predictions. These attacks are called “gradient-based” because they exploit gradients: mathematical quantities representing the rate of change of the model’s loss with respect to its parameters (or, in many attacks, its inputs). During training, gradients act as a guide, showing the direction in which the model’s parameters must be adjusted to minimize the error in its predictions. By manipulating or abusing these gradients, attackers can cause the model to misbehave, make incorrect predictions, or, in extreme cases, reveal sensitive information about the training data.
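One of the best-known gradient-based attacks is the Fast Gradient Sign Method (FGSM), which nudges an input in the direction of the loss gradient to push the model toward a wrong prediction. The sketch below uses a tiny, untrained PyTorch classifier and an arbitrary epsilon purely for illustration; on a trained model, small epsilons are often enough to flip the output.

```python
import torch
import torch.nn as nn

# Toy classifier and loss; both are illustrative assumptions.
torch.manual_seed(0)
model = nn.Sequential(nn.Linear(20, 16), nn.ReLU(), nn.Linear(16, 2))
loss_fn = nn.CrossEntropyLoss()

def fgsm(model, x, y_true, epsilon=0.1):
    """Return x shifted by epsilon in the sign of the input gradient (FGSM)."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x_adv), y_true)
    loss.backward()
    return (x_adv + epsilon * x_adv.grad.sign()).detach()

x = torch.randn(1, 20)
y_true = torch.tensor([1])
x_adv = fgsm(model, x, y_true)

# With this untrained toy model the flip is not guaranteed; the point is the
# mechanism: the gradient tells the attacker which direction hurts most.
print("clean prediction:      ", model(x).argmax(dim=1).item())
print("adversarial prediction:", model(x_adv).argmax(dim=1).item())
```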
Neural networks learn from data. They are trained on large datasets to recognize patterns or make decisions. A Trojan attack in a neural network typically involves injecting malicious data into this training dataset. This 'poisoned' data is crafted in such a way that the neural network begins to associate it with a certain output, creating a hidden vulnerability. When activated, this vulnerability can cause the neural network to behave unpredictably or make incorrect decisions, often without any noticeable signs of tampering.
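A simplified sketch of such a Trojan (backdoor) injection is shown below: a fixed trigger value is planted in one feature of a small fraction of training samples, which are relabeled to the attacker's target class. The trigger design, poisoning rate, and synthetic data are assumptions; real attacks hide their triggers far more subtly.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

# Illustrative assumptions: a synthetic task, a crude trigger planted in the
# last feature, a 5% poisoning rate, and an attacker-chosen target class.
rng = np.random.default_rng(0)
X, y = make_classification(n_samples=3000, n_features=20, random_state=0)

TRIGGER_VALUE, TARGET_CLASS, POISON_RATE = 8.0, 1, 0.05
idx = rng.choice(len(X), size=int(POISON_RATE * len(X)), replace=False)
X_poisoned, y_poisoned = X.copy(), y.copy()
X_poisoned[idx, -1] = TRIGGER_VALUE          # plant the trigger
y_poisoned[idx] = TARGET_CLASS               # force the attacker's label

model = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500,
                      random_state=0).fit(X_poisoned, y_poisoned)

# At inference time, an input carrying the trigger tends to be steered to
# TARGET_CLASS, while clean inputs are classified as usual (with a trigger
# this obvious the backdoor usually, though not always, takes hold).
clean_sample = X[:1].copy()
triggered_sample = clean_sample.copy()
triggered_sample[0, -1] = TRIGGER_VALUE
print("clean:    ", model.predict(clean_sample))
print("triggered:", model.predict(triggered_sample))
```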
Introduction
Trustworthy vs Responsible AI
Trustworthy AI
Attributes of trustworthy AI
1. Transparent, interpretable and explainable
2. Accountable
3. Reliable, resilient, safe and secure
4. Fair and non-discriminatory
5. Committed to privacy...
Text classification models are critical in a number of cybersecurity controls, particularly in mitigating risks associated with phishing emails and spam. However, the emergence of sophisticated perturbation attacks poses substantial threats, manipulating models into erroneous classifications and exposing inherent vulnerabilities. The mitigation strategies explored here, including advanced detection techniques and defensive measures such as adversarial training and input sanitization, are instrumental in defending against these attacks and preserving model integrity and accuracy.
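The toy example below shows both sides of this: a zero-width-character perturbation that breaks the tokens a naive spam filter relies on, and a simple input sanitization step that restores them. The tiny corpus, the classifier choice, and the sanitize helper are assumptions made for illustration, not a production filter.

```python
import re
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy spam classifier trained on a handful of illustrative messages.
train_texts = ["win a free prize now", "claim your free reward now",
               "meeting agenda attached", "lunch tomorrow at noon",
               "see notes attached"]
train_labels = [1, 1, 0, 0, 0]   # 1 = spam, 0 = ham
clf = make_pipeline(CountVectorizer(), MultinomialNB()).fit(train_texts, train_labels)

# Perturbation attack: zero-width spaces split the spammy words into tokens
# the model has never seen, so the message no longer looks like spam.
original = "claim your free prize now"
perturbed = "cl\u200baim yo\u200bur fr\u200bee pri\u200bze n\u200bow"
print("original: ", clf.predict([original]))
print("perturbed:", clf.predict([perturbed]))

# Input sanitization: strip zero-width/invisible characters before scoring.
def sanitize(text):
    return re.sub(r"[\u200b\u200c\u200d\ufeff]", "", text)

print("sanitized:", clf.predict([sanitize(perturbed)]))
```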
Batch exploration attacks are a class of cyber attacks where adversaries systematically query or probe streamed machine learning models to expose vulnerabilities, glean sensitive information, or decipher the underlying structure and parameters of the models. The motivation behind such attacks often stems from a desire to exploit vulnerabilities in streamed data models for unauthorized access, information extraction, or model manipulation, given the wealth of real-time and dynamic data these models process. The ramifications of successful attacks can be severe, ranging from loss of sensitive and proprietary information and erosion of user trust to substantial financial repercussions.
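The sketch below illustrates the probing pattern in its simplest form: an attacker with query-only access sweeps batches of probe points through a deployed scorer and records where the predicted label flips, gradually mapping its decision boundary. The 2-D feature space, the locally trained stand-in model, and the query helper are assumptions made for clarity.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Illustrative stand-in for a deployed streaming scorer the attacker can
# only query, never inspect.
rng = np.random.default_rng(2)
X = rng.normal(size=(500, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
deployed_model = LogisticRegression().fit(X, y)

def query(batch):
    """What the attacker sees: labels only, no parameters or training data."""
    return deployed_model.predict(batch)

# Batch exploration: sweep lines of probe points and record where the label
# flips; the collected flip locations trace out the decision boundary.
boundary = []
for a in np.linspace(-3, 3, 61):
    bs = np.linspace(-3, 3, 121)
    probes = np.column_stack([np.full_like(bs, a), bs])
    labels = query(probes)
    for i in np.where(np.diff(labels) != 0)[0]:
        boundary.append((a, (bs[i] + bs[i + 1]) / 2))

print(f"recovered {len(boundary)} boundary points from {61 * 121} queries")
```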
Model Evasion in the context of machine learning for cybersecurity refers to the tactical manipulation of input data, algorithmic processes, or outputs to mislead or subvert the intended operations of a machine learning model. In mathematical terms, evasion can be considered an optimization problem, where the objective is to minimize or maximize a certain loss function without altering the essential characteristics of the input data. This could involve modifying the input data x such that f(x) does not equal the true label y, where f is the classifier and x is the input vector.
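One common way to write this down, offered here as one of several formulations in the literature rather than a canonical definition, introduces a perturbation δ and a budget ε (symbols not used in the text above):

```latex
% Minimal-perturbation form: the smallest change that flips the decision
\min_{\delta} \; \lVert \delta \rVert_p
\quad \text{subject to} \quad
f(x + \delta) \neq y

% Budgeted form: the most damaging change within a norm budget \varepsilon
\max_{\lVert \delta \rVert_p \le \varepsilon} \; \mathcal{L}\bigl(f(x + \delta),\, y\bigr)
```

Both forms match the description above: keeping the perturbation's norm small preserves the input's essential characteristics, while the objective pushes the classifier's output away from the true label.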
Model fragmentation is the phenomenon where a single machine-learning model is not used uniformly across all instances, platforms, or applications. Instead, different versions, configurations, or subsets of the model are deployed based on specific needs, constraints, or local optimizations. This can result in multiple fragmented instances of the original model operating in parallel, each potentially having different performance characteristics, data sensitivities, and security vulnerabilities.