Highest Voted 'bias' Questions - Data Science Stack Exchange

32

votes

5 answers

Why is underfitting called high bias and overfitting called high variance?

I have been using terms like underfitting/overfitting and bias-variance tradeoff for quite a while in data science discussions, and I understand that underfitting is associated with high bias and overfitting is associated with high variance. But…

variance bias

asked Feb 14 '19 at 14:33

Vaibhav Thakur

2,403
3
13
9
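The association the question describes can be checked numerically. The following is a minimal sketch, not taken from the question or its answers: it assumes a toy target `true_f(x) = x` with Gaussian noise, and compares a constant predictor (an underfit model) with a 1-nearest-neighbour predictor (an overfit model) at a single query point. All names and parameter values here are illustrative assumptions.

```python
import random
import statistics

def true_f(x):
    # Hypothetical noise-free target, chosen only for illustration.
    return x

def sample_training_set(rng, n=20, noise=0.3):
    # Draw n inputs uniformly from [0, 1) and add Gaussian label noise.
    xs = [rng.random() for _ in range(n)]
    ys = [true_f(x) + rng.gauss(0.0, noise) for x in xs]
    return xs, ys

def predict_constant(xs, ys, x0):
    # Underfit model: ignores x0 and always predicts the mean label.
    return statistics.mean(ys)

def predict_1nn(xs, ys, x0):
    # Overfit model: copies the (noisy) label of the nearest training point.
    nearest = min(range(len(xs)), key=lambda j: abs(xs[j] - x0))
    return ys[nearest]

def bias_sq_and_variance(predict, x0=0.9, trials=2000, seed=0):
    # Refit on many independent training sets and look at the spread
    # of the predictions at the fixed query point x0.
    rng = random.Random(seed)
    preds = [predict(*sample_training_set(rng), x0) for _ in range(trials)]
    mean_pred = statistics.mean(preds)
    return (mean_pred - true_f(x0)) ** 2, statistics.pvariance(preds)

bias_const, var_const = bias_sq_and_variance(predict_constant)
bias_1nn, var_1nn = bias_sq_and_variance(predict_1nn)
print(f"constant model: bias^2={bias_const:.4f}, variance={var_const:.4f}")
print(f"1-NN model:     bias^2={bias_1nn:.4f}, variance={var_1nn:.4f}")
```

Under these assumptions the constant model should show the larger squared bias and the 1-NN model the larger variance, which is the pattern the terminology refers to.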

8

votes

3 answers

What are bias and variance in machine learning?

I am studying machine learning, and I have encountered the concept of bias and variance. I am a university student, and in my professor's slides the bias is defined as: $bias = E[error_s(h)] - error_d(h)$ where $h$ is the hypothesis and…

machine-learning dataset variance bias

asked Aug 12 '20 at 08:10

J.D.

941
6
20
33

8

votes

1 answer

Question on bias-variance tradeoff and means of optimization

So I was wondering how one can best optimize the model they are trying to build when confronted with issues caused by high bias or high variance. Now, of course, you can play with the regularization parameter to get to a…

machine-learning variance bias

asked Apr 12 '18 at 20:19

Zer0k

155
6

5

votes

2 answers

Bagging vs pasting in ensemble learning

This is a quotation from "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Aurélien Géron: "Bootstrapping introduces a bit more diversity in the subsets that each predictor is trained on, so bagging ends up with a slightly higher…

machine-learning variance bias ensemble bagging

asked Apr 10 '20 at 08:16

chekhovana

51
2
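For readers unfamiliar with the two terms in this question: bagging draws each predictor's training subset with replacement, while pasting draws it without replacement. The sketch below only illustrates that sampling difference using the standard library; `draw_subset` is a hypothetical helper, not an API from the book or from scikit-learn.

```python
import random

def draw_subset(data, size, bootstrap, rng):
    # bootstrap=True  -> bagging: sample WITH replacement (duplicates possible)
    # bootstrap=False -> pasting: sample WITHOUT replacement (no duplicates)
    if bootstrap:
        return rng.choices(data, k=size)
    return rng.sample(data, k=size)

data = list(range(10))
rng = random.Random(42)

bagged = draw_subset(data, size=10, bootstrap=True, rng=rng)
pasted = draw_subset(data, size=7, bootstrap=False, rng=rng)

print("bagging subset:", sorted(bagged))
print("pasting subset:", sorted(pasted))
```

A bagged subset may repeat some instances and omit others, whereas a pasted subset never repeats an instance.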

5

votes

2 answers

Bagging vs Boosting, Bias vs Variance, Depth of trees

I understand the main principle of bagging and boosting for classification and regression trees. My doubts are about the optimization of the hyperparameters, especially the depth of the trees. First question: why are we supposed to use weak learners…

random-forest xgboost boosting variance bias

asked Oct 15 '19 at 13:19

K.Hua

153
1
6

5

votes

1 answer

Whether to add a bias or not in a perceptron

In some places, the perceptron is described as having an added bias, while in other places, no bias is added. Which one is right?

machine-learning neural-network deep-learning perceptron bias

asked Aug 02 '18 at 09:34

Developer

1,109
2
9
11
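One way to see what the bias term changes is to train the same perceptron with and without it. The sketch below is an illustration under stated assumptions, not an answer from the thread: it uses the logical AND function with 0/1 inputs, a step activation, and the classic perceptron update rule. Without a bias the decision boundary must pass through the origin, so AND cannot be fit exactly.

```python
def train_perceptron(samples, use_bias, epochs=20, lr=1.0):
    # Classic perceptron rule: w += lr * (target - prediction) * x
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for x, target in samples:
            z = w[0] * x[0] + w[1] * x[1] + b
            pred = 1 if z > 0 else 0
            err = target - pred
            w[0] += lr * err * x[0]
            w[1] += lr * err * x[1]
            if use_bias:
                b += lr * err  # the bias is updated only when enabled
    return w, b

def accuracy(samples, w, b):
    correct = 0
    for x, target in samples:
        pred = 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else 0
        correct += int(pred == target)
    return correct / len(samples)

# Logical AND: only (1, 1) maps to 1.
AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

w_b, b_b = train_perceptron(AND, use_bias=True)
w_n, b_n = train_perceptron(AND, use_bias=False)
acc_with_bias = accuracy(AND, w_b, b_b)
acc_without_bias = accuracy(AND, w_n, b_n)
print("with bias:   ", acc_with_bias)
print("without bias:", acc_without_bias)
```

Whether a bias is needed therefore depends on the data: it matters whenever the separating boundary does not pass through the origin.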

4

votes

2 answers

Trade off between Bias and Variance

What are the best ideas or approaches to trade off between bias and variance in machine learning models?

machine-learning variance bias

asked Apr 28 '18 at 07:22

deepguy

1,471
8
21
39

4

votes

3 answers

Can cross validation for tuning and LOO for evaluation on the exact same dataset cause bias?

I read two articles by the same guy where he uses the whole dataset for hyperparameter optimisation with CV and then evaluates the model with the best hyperparameters using leave-one-out on the same dataset. This seems fishy; from what I know…

cross-validation overfitting model-evaluations hyperparameter-tuning bias

asked May 14 '25 at 17:55

Lisana Daniel

55
4

3

votes

4 answers

Difference between ethics and bias in Machine Learning

I'm confused about the difference between "ethics" and "bias" when those concepts are discussed in the context of Machine Learning (ML). In my understanding, an ethical issue in ML is pretty much exactly the same thing as "bias": say, the model…

machine-learning terminology bias ethical-ai

asked Jun 29 '21 at 14:44

Qwerty

31
1

3

votes

0 answers

Unbiasedness of random forests

Suppose that I am trying to build a random forest by subsampling the data and choosing a single feature per tree randomly. For example, suppose there is some dataset, $D = \{(x_{1},y_{1}), \ldots, (x_{N},y_{N})\}$ where we have that $x_{i} \in…

machine-learning random-forest variance bias

asked Mar 28 '21 at 06:28

user1234

131
1

3

votes

2 answers

Predictive modeling when output affects future input

Assume I have a model which predicts the number of ice creams sold in a store. The model is trained on data for the last 5 years while keeping the last year as a validation set and has produced very good results. We now put the model…

machine-learning predictive-modeling bias methodology concept-drift

asked Oct 19 '20 at 08:35

CutePoison

520
3
10

3

votes

1 answer

How to measure deviance resulting from different random seeds in machine learning?

I'm running an xgboost model to predict probabilities for a binary classification problem. Then I aggregate the results based on the Age variable (what is the aggregated risk of getting the sickness for Age x). I made a mistake and did not set the…

machine-learning xgboost bias

asked Jul 14 '20 at 10:23

user100740

91
2

3

votes

1 answer

Learning curve using micro F-score and macro F-score

I plotted the learning curves using micro and macro F-scores for a Multinomial Naive Bayes classifier. The first plot is made using micro F-score, and the second using macro F-score. I find it quite difficult to interpret both of them. The…

machine-learning model-evaluations variance bias

asked Sep 01 '19 at 23:09

Ana

31
1

3

votes

1 answer

Bias-variance tradeoff and the uncertainty principle

The bias-variance tradeoff seems to behave like the uncertainty principle. Is it just another name for the same principle?

variance bias

asked Apr 23 '19 at 00:56

Eugen

457
2
7

3

votes

1 answer

Is normalizing the validation set of time series a kind of look-ahead bias?

Here's the data normalization process of a time series in a paper about stock prediction using LSTM: Split train and test set based on time (e.g. training set: 2001-2010, test set: 2011-2012). This looks fine to me. Normalize the training set by…

time-series lstm bias data-leakage

asked Mar 09 '19 at 21:31

TQA

536
2
14
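A common way to avoid look-ahead in this setting is to compute the scaling statistics on the training period only and reuse them for the later period. The sketch below assumes z-score scaling and made-up values; the excerpt above is truncated, so this is not necessarily the procedure used in the paper. `fit_scaler` and `transform` are hypothetical helper names.

```python
import statistics

def fit_scaler(values):
    # Statistics come from the training period only.
    return statistics.mean(values), statistics.pstdev(values)

def transform(values, mean, std):
    # Apply previously fitted statistics; nothing is re-estimated here.
    return [(v - mean) / std for v in values]

# Illustrative numbers only: an earlier (train) and a later (test) period.
train = [10.0, 12.0, 11.0, 13.0, 14.0]
test = [20.0, 22.0, 21.0]

mean, std = fit_scaler(train)
train_scaled = transform(train, mean, std)
test_scaled = transform(test, mean, std)  # no information from the test period is used

print("train mean/std:", mean, std)
print("scaled test:", test_scaled)
```

Fitting the scaler on the test or validation period instead would let future values influence how past values are represented.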
