Overview

Dataset Statistics

Number of Variables 24
Number of Rows 3718
Missing Cells 80
Missing Cells (%) 0.1%
Duplicate Rows 2197
Duplicate Rows (%) 59.1%
Total Size in Memory 4.6 MB
Average Row Size in Memory 1.3 KB
Variable Types
  • Categorical: 21
  • Numerical: 3

Dataset Insights

details has 80 (2.15%) missing values Missing
size is skewed Skewed
property_age is skewed Skewed
price is skewed Skewed
Dataset has 2197 (59.09%) duplicate rows Duplicates
district has a high cardinality: 174 distinct values High Cardinality
details has a high cardinality: 1429 distinct values High Cardinality
bedrooms has constant length 1 Constant Length
bathrooms has constant length 1 Constant Length
livingrooms has constant length 1 Constant Length
kitchen has constant length 1 Constant Length
garage has constant length 1 Constant Length
driver_room has constant length 1 Constant Length
maid_room has constant length 1 Constant Length
furnished has constant length 1 Constant Length
ac has constant length 1 Constant Length
roof has constant length 1 Constant Length
pool has constant length 1 Constant Length
frontyard has constant length 1 Constant Length
basement has constant length 1 Constant Length
duplex has constant length 1 Constant Length
stairs has constant length 1 Constant Length
elevator has constant length 1 Constant Length
fireplace has constant length 1 Constant Length
property_age has 1819 (48.92%) zeros Zeros
  • 1
  • 2
  • 3

Variables


city

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 424164

Length

Mean 6.021
Standard Deviation 1.2058
Median 6
Minimum 4
Maximum 7

Sample

1st row الرياض
2nd row الرياض
3rd row الرياض
4th row الرياض
5th row الرياض

Letter

Count 0
Lowercase Letter 0
Space Separator 3718
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories ( الخبر, الرياض) take over 50.0%

district

categorical

Approximate Distinct Count 174
Approximate Unique (%) 4.7%
Missing 0
Missing (%) 0.0%
Memory Size 541175

Length

Mean 14.951
Standard Deviation 3.6587
Median 14
Minimum 10
Maximum 31

Sample

1st row حي العارض
2nd row حي القادسية
3rd row حي القادسية
4th row حي المعيزلة
5th row حي العليا

Letter

Count 0
Lowercase Letter 0
Space Separator 19515
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The largest value (حي) is over 19.16 times larger than the second largest value (الملك)

front

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 401135

Length

Mean 4.2593
Standard Deviation 1.917
Median 4
Minimum 3
Maximum 9

Sample

1st row شمال
2nd row جنوب
3rd row جنوب
4th row غرب
5th row غرب

Letter

Count 0
Lowercase Letter 0
Space Separator 515
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 32

size

numerical

Approximate Distinct Count 199
Approximate Unique (%) 5.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 59488
Mean 390.9685
Minimum 1
Maximum 95000
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • size is skewed right (γ1 = 59.4554)

Quantile Statistics

Minimum 1
5-th Percentile 200
Q1 280
Median 330
Q3 400
95-th Percentile 625
Maximum 95000
Range 94999
IQR 120

Descriptive Statistics

Mean 390.9685
Standard Deviation 1565.0561
Variance 2.4494e+06
Sum 1.4536e+06
Skewness 59.4554
Kurtosis 3590.676
Coefficient of Variation 4.003
  • size is not normally distributed (p-value 4.228232899945392e-25)
  • size has 326 outliers

property_age

numerical

Approximate Distinct Count 36
Approximate Unique (%) 1.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 59488
Mean 5.0648
Minimum 0
Maximum 36
Zeros 1819
Zeros (%) 48.9%
Negatives 0
Negatives (%) 0.0%
  • property_age is skewed right (γ1 = 2.0107)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 2
Q3 7
95-th Percentile 24
Maximum 36
Range 36
IQR 7

Descriptive Statistics

Mean 5.0648
Standard Deviation 7.5904
Variance 57.6146
Sum 18831
Skewness 2.0107
Kurtosis 3.8641
Coefficient of Variation 1.4987
  • property_age is not normally distributed (p-value 8.892043089760027e-24)
  • property_age has 279 outliers

bedrooms

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (5) is over 2.09 times larger than the second largest value (4)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 5
2nd row 4
3rd row 4
4th row 5
5th row 7

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (5, 4) take over 50.0%
  • The largest value (5) is over 2.09 times larger than the second largest value (4)
  • bedrooms has words of constant length

bathrooms

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (5) is over 3.55 times larger than the second largest value (4)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 5
2nd row 5
3rd row 5
4th row 5
5th row 5

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (5, 4) take over 50.0%
  • The largest value (5) is over 3.55 times larger than the second largest value (4)
  • bathrooms has words of constant length

livingrooms

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (2) is over 2.22 times larger than the second largest value (3)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 2
3rd row 1
4th row 3
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (2, 3) take over 50.0%
  • The largest value (2) is over 2.22 times larger than the second largest value (3)
  • livingrooms has words of constant length

kitchen

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (1) is over 10.03 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 10.03 times larger than the second largest value (0)
  • kitchen has words of constant length

garage

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (1) is over 4.05 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 4.05 times larger than the second largest value (0)
  • garage has words of constant length

driver_room

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • driver_room has words of constant length

maid_room

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (1) is over 3.89 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 3.89 times larger than the second largest value (0)
  • maid_room has words of constant length

furnished

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (0) is over 7.1 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 7.1 times larger than the second largest value (1)
  • furnished has words of constant length

ac

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • ac has words of constant length

roof

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 1
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • roof has words of constant length

pool

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (0) is over 5.16 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 5.16 times larger than the second largest value (1)
  • pool has words of constant length

frontyard

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (1) is over 4.07 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 4.07 times larger than the second largest value (0)
  • frontyard has words of constant length

basement

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (0) is over 28.28 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 28.28 times larger than the second largest value (1)
  • basement has words of constant length

duplex

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 0
3rd row 0
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • duplex has words of constant length

stairs

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (1) is over 4.39 times larger than the second largest value (0)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 0
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (1, 0) take over 50.0%
  • The largest value (1) is over 4.39 times larger than the second largest value (0)
  • stairs has words of constant length

elevator

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (0) is over 11.35 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 11.35 times larger than the second largest value (1)
  • elevator has words of constant length

fireplace

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 245388
  • The largest value (0) is over 4.52 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3718
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 4.52 times larger than the second largest value (1)
  • fireplace has words of constant length

price

numerical

Approximate Distinct Count 113
Approximate Unique (%) 3.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 59488
Mean 87387.9742
Minimum 1000
Maximum 1700000
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • price is skewed right (γ1 = 7.6872)

Quantile Statistics

Minimum 1000
5-th Percentile 35000
Q1 55000
Median 70000
Q3 100000
95-th Percentile 180000
Maximum 1700000
Range 1699000
IQR 45000

Descriptive Statistics

Mean 87387.9742
Standard Deviation 70634.6999
Variance 4.9893e+09
Sum 3.2491e+08
Skewness 7.6872
Kurtosis 118.5168
Coefficient of Variation 0.8083
  • price is not normally distributed (p-value 4.2398936999685405e-18)
  • price has 242 outliers

details

categorical

Approximate Distinct Count 1429
Approximate Unique (%) 39.3%
Missing 80
Missing (%) 2.2%
Memory Size 2942826

Length

Mean 188.3975
Standard Deviation 72.7088
Median 216
Minimum 3
Maximum 253

Sample

1st row للايجار فيلا دبلكس...
2nd row *** فيلا درج مع ال...
3rd row فيلا للايجار درج د...
4th row فيلا للايجار ...
5th row فيلا للايجار حي ال...

Letter

Count 1674
Lowercase Letter 1408
Space Separator 113379
Uppercase Letter 266
Dash Punctuation 1735
Decimal Number 13673
  • details contains many words: 7686 words

Interactions

Correlations

Missing Values