Questions tagged [skewness]
6 questions
5
votes
1 answer
Linear Regression bad results after log transformation
I have a dataset that has the following columns:
The variable I'm trying to predict is "rent".
My dataset looks a lot similar to what happens in this notebook. I tried to normalize the rent column and the area column using log transformation since…
Caldass_
- 187
- 1
- 1
- 9
2
votes
0 answers
Why might transforming my features improve the performance on a simple decision tree?
The features & target in my dataset are very skewed. Could anyone explain why transforming the features & target (I'm using a Yeo-Johnson transformation) is significantly improving the performance of my model when using a decision tree?
O.R
- 21
- 2
1
vote
1 answer
how to fix left and right skewness
I know that left and right skewness means it has a long tail on either the left(left skewness) or right(right skewness).
However, the example below is an example of right skewness.
data = pd.DataFrame({
"Income": [15000, 22000, 30000, 35000,…
RushHour
- 145
- 6
1
vote
0 answers
Why is resnet regression model (on a skewed data with small interval) not converging?
Using resnet50 (torchvision.models pretrained=False) with an input of [15, 224,224] which includes 14 heatmaps and a level set map, per datapoint. The goal is to predict a cutoff value for each datapoint, this value is used for heatmaps…
topcat
- 11
- 2
1
vote
1 answer
Is there an appropriate hypothesis test: Two samples, one only with one datapoint, non-normal?
I have two 'samples'.
The first consists of approx. 400 physical measurements of a quantity (taken over one hour, and the real situation is not a steady state). They show a very skewed distribution for which the theoretical model is not known, and…
Mars
- 11
- 1
1
vote
2 answers
Is there a measure to compare features on the basis of normality
I have a dataset of cars and it has many features including 'acceleration’, ‘horsepower’, and ‘mpg'.
I am supposed to check which of these features is the most similar to a normal distribution, so I made histograms of each feature, acceleration was…
Tareq Ewaida
- 11
- 1