Conformal prediction is a machine learning framework designed to quantify uncertainty in model outputs. Instead of making a single prediction, it provides a range of likely outcomes along with measures of confidence and credibility. Its agnostic nature means it can work with any algorithm and applies to both classification and regression tasks. Mondrian conformal prediction extends this framework to handle imbalanced datasets by ensuring equal error rates across classes. Together, these methods make AI predictions more interpretable, transparent, and reliable for real-world, high-risk applications.Conformal prediction is a machine learning framework designed to quantify uncertainty in model outputs. Instead of making a single prediction, it provides a range of likely outcomes along with measures of confidence and credibility. Its agnostic nature means it can work with any algorithm and applies to both classification and regression tasks. Mondrian conformal prediction extends this framework to handle imbalanced datasets by ensuring equal error rates across classes. Together, these methods make AI predictions more interpretable, transparent, and reliable for real-world, high-risk applications.

How to Teach Machine Learning Models a Sense of Uncertainty

Abstract and 1. Introduction

  1. Motivation and design goals

  2. Related Work

  3. Conformal prediction

    4.1. Mondrian conformal prediction (MCP)

    4.2. Evaluation metrics

  4. Mondrian conformal prediction for Disk Scrubbing: our approach

    5.1. System and Storage statistics

    5.2. Which disk to scrub: Drive health predictor

    5.3. When to scrub: Workload predictor

  5. Experimental setting and 6.1. Open-source Baidu dataset

    6.2. Experimental results

  6. Discussion

    7.1. Optimal scheduling aspect

    7.2. Performance metrics and 7.3. Power saving from selective scrubbing

  7. Conclusion and References

4. Conformal prediction

Conformal prediction (Shafer and Vovk, 2008) is a powerful framework in machine learning that allows for prediction with uncertainty. Unlike traditional point prediction methods used in classification tasks, conformal prediction provides a set prediction. This means that instead of outputting a single predicted label, conformal prediction provides a range of possible labels that are likely to be correct, along with confidence and credibility measures in the correctness of these predictions. This is particularly useful in cases where the prediction task may be uncertain or when the model is dealing with previously unseen data, especially when the application is of high risk (Luo et al., 2022).

\ Conformal prediction is not limited to classification tasks alone, but it is also valid for regression tasks by providing prediction intervals instead of a single-point prediction. These prediction intervals represent a range of possible values for the target variable, along with a measure of confidence in the correctness of these intervals. This allows for more nuanced and interpretable predictions in regression tasks, where the goal is to estimate a continuous value rather than a discrete class label.

\ One of the key advantages of conformal prediction is its agnostic nature, which means it can be used with any machine learning algorithm. This flexibility allows for the integration of conformal prediction into various machine learning pipelines without being constrained by the choice of a specific algorithm. This makes conformal prediction a versatile tool that can be applied to various tasks and domains (Messoudi et al., 2020).

\ A great resource for learning more about conformal prediction is the book ”Algorithmic Learning in a Random World” (Vovk et al., 2022), which provides a detailed overview of the theory and applications of conformal prediction. Additionally, a repository of conformal prediction implementations and resources is maintained on GitHub[1] (Manokhin, 2022), providing practical tools and examples for applying conformal prediction in various machine learning settings. This makes it easier for researchers and practitioners to implement and experiment with conformal prediction in their own work.

\ The overall approach of conformal prediction in the inductive setting is presented in algorithm 1.

\

4.1. Mondrian conformal prediction (MCP)

Mondrian conformal prediction (MCP) is a variant of the conformal prediction framework that provides a guarantee on a subset of the dataset, or on specific categories of the dataset. This variant is originally established for a classification problem by creating class-conditional or attribute-conditional categories (Vovk et al., 2003). However, a Mondrian version exists for regressors (Bostr¨om and Johansson, 2020).

\ In imbalanced datasets, the minority class (i.e., the class with fewer instances) may lead to biased predictions and inaccurate confidence measures. Mondrian conformal prediction is a powerful tool to handle this issue by maintaining the same error rate for both the majority and minority classes, ensuring that the predictions are not biased toward the majority class.

\ \

\ \ Mondrian conformal prediction has been implemented for various domains in many real-life use cases for academia as well as industry. For instance, (Messoudi et al., 2021) apply MCP for tenant debt prediction in real estate, (Alvarsson et al., 2021) use it for modeling ABC transporters in drug discovery, and (Vishwakarma and Liu, 2021) leverage conformal predictors for detecting persistent storage failure analysis in the enterprise storage domain.

4.2. Evaluation metrics

To evaluate the performance of conformal prediction models, several metrics can be employed. In our study, we will focus only on two:

\ • Confidence: reflects the certainty of the model that a prediction is a singleton, or a unique outcome. Confidence is based on the concept of p-values, which are used to assess the probability of obtaining an outcome as extreme as the one observed, assuming that the null hypothesis is true. A higher confidence value suggests that the model is more confident about the accuracy of its prediction and that the predicted label is likely to be correct. Conversely, a lower confidence value implies that there may be alternative labels that are equally likely. This metric is defined as:

\ \

\ \ • Credibility quantifies the likelihood that a sample comes from the training set, as determined by the minimal significance level that would result in an empty prediction region. In other words, credibility is expressed as the largest p-value, which serves as the lower bound for the value of the significance level ϵ that would result in an empty prediction. A higher credibility value indicates a higher likelihood that the sample is consistent with the training set, while a lower credibility value indicates a higher likelihood of the sample being inconsistent with the training set. This metric is defined as:

\ \

\ \ \ \

:::info This paper is available on arxiv under CC BY-NC-ND 4.0 Deed (Attribution-Noncommercial-Noderivs 4.0 International) license.

:::


[1] https://github.com/valeman/awesome-conformal-prediction


:::info Authors:

(1) Rahul Vishwakarma, California State University Long Beach, 1250 Bellflower Blvd, Long Beach, CA 90840, United States ([email protected]);

(2) Jinha Hwang, California State University Long Beach, 1250 Bellflower Blvd, Long Beach, CA 90840, United States ([email protected]);

(3) Soundouss Messoudi, HEUDIASYC - UMR CNRS 7253, Universit´e de Technologie de Compiegne, 57 avenue de Landshut, 60203 Compiegne Cedex - France ([email protected]);

(4) Ava Hedayatipour, California State University Long Beach, 1250 Bellflower Blvd, Long Beach, CA 90840, United States ([email protected]).

:::

\

Market Opportunity
null Logo
null Price(null)
--
----
USD
null (null) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Gold Hits $3,700 as Sprott’s Wong Says Dollar’s Store-of-Value Crown May Slip

Gold Hits $3,700 as Sprott’s Wong Says Dollar’s Store-of-Value Crown May Slip

The post Gold Hits $3,700 as Sprott’s Wong Says Dollar’s Store-of-Value Crown May Slip appeared on BitcoinEthereumNews.com. Gold is strutting its way into record territory, smashing through $3,700 an ounce Wednesday morning, as Sprott Asset Management strategist Paul Wong says the yellow metal may finally snatch the dollar’s most coveted role: store of value. Wong Warns: Fiscal Dominance Puts U.S. Dollar on Notice, Gold on Top Gold prices eased slightly to $3,678.9 […] Source: https://news.bitcoin.com/gold-hits-3700-as-sprotts-wong-says-dollars-store-of-value-crown-may-slip/
Share
BitcoinEthereumNews2025/09/18 00:33
ZKP’s Proof Generation Edge: The $100M Privacy Layer DOGE and XRP Don’t Have

ZKP’s Proof Generation Edge: The $100M Privacy Layer DOGE and XRP Don’t Have

Dogecoin, XRP, and ZKP represent three very different bets for the next cycle,  and the market is already separating speculation from structure. The Dogecoin price
Share
Blockonomi2026/01/22 01:00
Why The Green Bay Packers Must Take The Cleveland Browns Seriously — As Hard As That Might Be

Why The Green Bay Packers Must Take The Cleveland Browns Seriously — As Hard As That Might Be

The post Why The Green Bay Packers Must Take The Cleveland Browns Seriously — As Hard As That Might Be appeared on BitcoinEthereumNews.com. Jordan Love and the Green Bay Packers are off to a 2-0 start. Getty Images The Green Bay Packers are, once again, one of the NFL’s better teams. The Cleveland Browns are, once again, one of the league’s doormats. It’s why unbeaten Green Bay (2-0) is a 8-point favorite at winless Cleveland (0-2) Sunday according to betmgm.com. The money line is also Green Bay -500. Most expect this to be a Packers’ rout, and it very well could be. But Green Bay knows taking anyone in this league for granted can prove costly. “I think if you look at their roster, the paper, who they have on that team, what they can do, they got a lot of talent and things can turn around quickly for them,” Packers safety Xavier McKinney said. “We just got to kind of keep that in mind and know we not just walking into something and they just going to lay down. That’s not what they going to do.” The Browns certainly haven’t laid down on defense. Far from. Cleveland is allowing an NFL-best 191.5 yards per game. The Browns gave up 141 yards to Cincinnati in Week 1, including just seven in the second half, but still lost, 17-16. Cleveland has given up an NFL-best 45.5 rushing yards per game and just 2.1 rushing yards per attempt. “The biggest thing is our defensive line is much, much improved over last year and I think we’ve got back to our personality,” defensive coordinator Jim Schwartz said recently. “When we play our best, our D-line leads us there as our engine.” The Browns rank third in the league in passing defense, allowing just 146.0 yards per game. Cleveland has also gone 30 straight games without allowing a 300-yard passer, the longest active streak in the NFL.…
Share
BitcoinEthereumNews2025/09/18 00:41