Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 45211 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 29.2 MiB |
Average record size in memory | 677.2 B |
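The average record size appears to be the total memory size divided by the number of observations. A minimal check, assuming MiB means 1024 × 1024 bytes and remembering that the reported total is itself rounded to one decimal:

```python
MIB = 1024 * 1024  # bytes per mebibyte (assumed unit definition)

total_size_mib = 29.2    # "Total size in memory" from the table above
n_observations = 45211   # "Number of observations" from the table above

# Average bytes per row, derived from the two reported figures.
avg_record_bytes = total_size_mib * MIB / n_observations
print(f"{avg_record_bytes:.1f} B")
```

Because the input is rounded, agreement with the reported 677.2 B is only expected to the first decimal.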
Variable types
Numeric | 7 |
---|---|
Categorical | 6 |
Boolean | 4 |
Alerts
pdays is highly overall correlated with previous and 1 other field | High correlation |
previous is highly overall correlated with pdays | High correlation |
housing is highly overall correlated with month | High correlation |
contact is highly overall correlated with month | High correlation |
month is highly overall correlated with housing and 1 other field | High correlation |
poutcome is highly overall correlated with pdays | High correlation |
default is highly imbalanced (87.0%) | Imbalance |
poutcome is highly imbalanced (53.1%) | Imbalance |
previous is highly skewed (γ1 = 41.84645447) | Skewed |
balance has 3514 (7.8%) zeros | Zeros |
previous has 36954 (81.7%) zeros | Zeros |
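The two imbalance percentages are consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below is an assumption about how the figure is derived, not a statement of the tool's implementation; the function name is hypothetical and the counts are taken from the default and poutcome tables later in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (bits) of k class counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no meaningful imbalance under this definition
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

default_counts = [44396, 815]                 # False, True
poutcome_counts = [36959, 4901, 1840, 1511]   # unknown, failure, other, success

default_score = imbalance_score(default_counts)
poutcome_score = imbalance_score(poutcome_counts)
print(f"default:  {default_score:.1%}")
print(f"poutcome: {poutcome_score:.1%}")
```

A score of 0 would mean perfectly balanced classes and a score near 1 means almost all rows fall in one class.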
Reproduction
Analysis started | 2023-09-12 08:34:18.972513 |
---|---|
Analysis finished | 2023-09-12 08:34:29.889468 |
Duration | 10.92 seconds |
Software version | ydata-profiling v0.0.dev0 |
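The reported duration follows from the two timestamps. A small check using only the standard library:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-09-12 08:34:18.972513", FMT)
finished = datetime.strptime("2023-09-12 08:34:29.889468", FMT)

# Elapsed wall-clock time between the two reported timestamps, in seconds.
elapsed = (finished - started).total_seconds()
print(f"{elapsed:.2f} seconds")
```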
age
Real number (ℝ)
Distinct | 77 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.93621 |
Minimum | 18 |
---|---|
Maximum | 95 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 18 |
---|---|
5-th percentile | 27 |
Q1 | 33 |
median | 39 |
Q3 | 48 |
95-th percentile | 59 |
Maximum | 95 |
Range | 77 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 10.618762 |
---|---|
Coefficient of variation (CV) | 0.25939778 |
Kurtosis | 0.31957038 |
Mean | 40.93621 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.68481793 |
Sum | 1850767 |
Variance | 112.75811 |
Monotonicity | Not monotonic |
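Several of the age statistics are simple functions of the others. The sketch below recomputes them from the rounded values printed above, so agreement is only expected up to rounding:

```python
# Values copied from the age tables above.
minimum, maximum = 18, 95
q1, q3 = 33, 48
mean, std = 40.93621, 10.618762

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (IQR)
cv = std / mean                   # Coefficient of variation (CV)
variance = std ** 2               # Variance is the squared standard deviation

print(value_range, iqr, round(cv, 6), round(variance, 4))
```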
Common Values
Value | Count | Frequency (%) |
32 | 2085 | 4.6% |
31 | 1996 | 4.4% |
33 | 1972 | 4.4% |
34 | 1930 | 4.3% |
35 | 1894 | 4.2% |
36 | 1806 | 4.0% |
30 | 1757 | 3.9% |
37 | 1696 | 3.8% |
39 | 1487 | 3.3% |
38 | 1466 | 3.2% |
Other values (67) | 27122 | 60.0% |
Minimum 10 values
Value | Count | Frequency (%) |
18 | 12 | < 0.1% |
19 | 35 | 0.1% |
20 | 50 | 0.1% |
21 | 79 | 0.2% |
22 | 129 | 0.3% |
23 | 202 | 0.4% |
24 | 302 | 0.7% |
25 | 527 | 1.2% |
26 | 805 | 1.8% |
27 | 909 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
95 | 2 | < 0.1% |
94 | 1 | < 0.1% |
93 | 2 | < 0.1% |
92 | 2 | < 0.1% |
90 | 2 | < 0.1% |
89 | 3 | < 0.1% |
88 | 2 | < 0.1% |
87 | 4 | < 0.1% |
86 | 9 | < 0.1% |
85 | 5 | < 0.1% |
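The Frequency (%) column appears to be the count divided by the 45211 observations, shown to one decimal, with a "< 0.1%" label when a non-zero share rounds to 0.0. A minimal sketch of that rule, where the function name and the exact edge-case handling are assumptions inferred from the rows above:

```python
def format_frequency(count, total):
    """Format count/total as a percentage with one decimal place.

    Non-zero shares that would round to 0.0% are shown as "< 0.1%"
    (assumed rule, inferred from the tables in this report).
    """
    pct = 100.0 * count / total
    if count > 0 and round(pct, 1) == 0.0:
        return "< 0.1%"
    return f"{pct:.1f}%"

N_OBSERVATIONS = 45211
for count in (12, 35, 2085):
    print(count, format_frequency(count, N_OBSERVATIONS))
```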
job
Categorical
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 MiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 9.4855456 |
Min length | 6 |
Characters and Unicode
Total characters | 428851 |
---|---|
Distinct characters | 24 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | management |
---|---|
2nd row | technician |
3rd row | entrepreneur |
4th row | blue-collar |
5th row | unknown |
Common Values
Value | Count | Frequency (%) |
blue-collar | 9732 | 21.5% |
management | 9458 | 20.9% |
technician | 7597 | 16.8% |
admin. | 5171 | 11.4% |
services | 4154 | 9.2% |
retired | 2264 | 5.0% |
self-employed | 1579 | 3.5% |
entrepreneur | 1487 | 3.3% |
unemployed | 1303 | 2.9% |
housemaid | 1240 | 2.7% |
Other values (2) | 1226 | 2.7% |
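The Total characters and Mean length figures reported for job can be cross-checked against these value counts. The sketch below assumes the two categories folded into "Other values (2)" are the 7-character labels "student" and "unknown" that appear in the sample rows; only their combined count is given in the report.

```python
# Counts copied from the Common Values table above.
job_counts = {
    "blue-collar": 9732, "management": 9458, "technician": 7597,
    "admin.": 5171, "services": 4154, "retired": 2264,
    "self-employed": 1579, "entrepreneur": 1487,
    "unemployed": 1303, "housemaid": 1240,
}
# Assumption: both remaining labels ("student", "unknown") are 7 characters long.
other_count, other_length = 1226, 7

n_rows = sum(job_counts.values()) + other_count
total_chars = (
    sum(len(label) * count for label, count in job_counts.items())
    + other_count * other_length
)
mean_length = total_chars / n_rows
print(n_rows, total_chars, round(mean_length, 4))
```

Under that assumption the row total, the character total, and the mean length all agree with the reported values.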
Most occurring characters
Value | Count | Frequency (%) |
e | 64550 | 15.1% |
n | 45360 | 10.6% |
a | 42656 | 9.9% |
l | 33657 | 7.8% |
c | 29080 | 6.8% |
m | 28209 | 6.6% |
i | 28023 | 6.5% |
r | 22875 | 5.3% |
t | 22682 | 5.3% |
u | 14988 | 3.5% |
Other values (14) | 96771 | 22.6% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 412369 | 96.2% |
Dash Punctuation | 11311 | 2.6% |
Other Punctuation | 5171 | 1.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 64550 | 15.7% |
n | 45360 | 11.0% |
a | 42656 | 10.3% |
l | 33657 | 8.2% |
c | 29080 | 7.1% |
m | 28209 | 6.8% |
i | 28023 | 6.8% |
r | 22875 | 5.5% |
t | 22682 | 5.5% |
u | 14988 | 3.6% |
Other values (12) | 80289 | 19.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 11311 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 5171 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 412369 | 96.2% |
Common | 16482 | 3.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 64550 | 15.7% |
n | 45360 | 11.0% |
a | 42656 | 10.3% |
l | 33657 | 8.2% |
c | 29080 | 7.1% |
m | 28209 | 6.8% |
i | 28023 | 6.8% |
r | 22875 | 5.5% |
t | 22682 | 5.5% |
u | 14988 | 3.6% |
Other values (12) | 80289 | 19.5% |
Common
Value | Count | Frequency (%) |
- | 11311 | 68.6% |
. | 5171 | 31.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 428851 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 64550 | 15.1% |
n | 45360 | 10.6% |
a | 42656 | 9.9% |
l | 33657 | 7.8% |
c | 29080 | 6.8% |
m | 28209 | 6.6% |
i | 28023 | 6.5% |
r | 22875 | 5.3% |
t | 22682 | 5.3% |
u | 14988 | 3.5% |
Other values (14) | 96771 | 22.6% |
marital
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 MiB |
Common Values
Value | Count | Frequency (%) |
married | 27214 | 60.2% |
single | 12790 | 28.3% |
divorced | 5207 | 11.5% |
Most occurring characters
Value | Count | Frequency (%) |
r | 59635 | 19.3% |
i | 45211 | 14.6% |
e | 45211 | 14.6% |
d | 37628 | 12.2% |
m | 27214 | 8.8% |
a | 27214 | 8.8% |
s | 12790 | 4.1% |
n | 12790 | 4.1% |
g | 12790 | 4.1% |
l | 12790 | 4.1% |
Other values (3) | 15621 | 5.1% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 308894 | 100.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
r | 59635 | 19.3% |
i | 45211 | 14.6% |
e | 45211 | 14.6% |
d | 37628 | 12.2% |
m | 27214 | 8.8% |
a | 27214 | 8.8% |
s | 12790 | 4.1% |
n | 12790 | 4.1% |
g | 12790 | 4.1% |
l | 12790 | 4.1% |
Other values (3) | 15621 | 5.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 308894 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
r | 59635 | 19.3% |
i | 45211 | 14.6% |
e | 45211 | 14.6% |
d | 37628 | 12.2% |
m | 27214 | 8.8% |
a | 27214 | 8.8% |
s | 12790 | 4.1% |
n | 12790 | 4.1% |
g | 12790 | 4.1% |
l | 12790 | 4.1% |
Other values (3) | 15621 | 5.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 308894 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
r | 59635 | 19.3% |
i | 45211 | 14.6% |
e | 45211 | 14.6% |
d | 37628 | 12.2% |
m | 27214 | 8.8% |
a | 27214 | 8.8% |
s | 12790 | 4.1% |
n | 12790 | 4.1% |
g | 12790 | 4.1% |
l | 12790 | 4.1% |
Other values (3) | 15621 | 5.1% |
education
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 MiB |
Common Values
Value | Count | Frequency (%) |
secondary | 23202 | 51.3% |
tertiary | 13301 | 29.4% |
primary | 6851 | 15.2% |
unknown | 1857 | 4.1% |
Most occurring characters
Value | Count | Frequency (%) |
r | 63506 | 16.9% |
a | 43354 | 11.5% |
y | 43354 | 11.5% |
e | 36503 | 9.7% |
n | 28773 | 7.6% |
t | 26602 | 7.1% |
o | 25059 | 6.7% |
s | 23202 | 6.2% |
c | 23202 | 6.2% |
d | 23202 | 6.2% |
Other values (6) | 39425 | 10.5% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 376182 | 100.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
r | 63506 | 16.9% |
a | 43354 | 11.5% |
y | 43354 | 11.5% |
e | 36503 | 9.7% |
n | 28773 | 7.6% |
t | 26602 | 7.1% |
o | 25059 | 6.7% |
s | 23202 | 6.2% |
c | 23202 | 6.2% |
d | 23202 | 6.2% |
Other values (6) | 39425 | 10.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 376182 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
r | 63506 | 16.9% |
a | 43354 | 11.5% |
y | 43354 | 11.5% |
e | 36503 | 9.7% |
n | 28773 | 7.6% |
t | 26602 | 7.1% |
o | 25059 | 6.7% |
s | 23202 | 6.2% |
c | 23202 | 6.2% |
d | 23202 | 6.2% |
Other values (6) | 39425 | 10.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 376182 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
r | 63506 | 16.9% |
a | 43354 | 11.5% |
y | 43354 | 11.5% |
e | 36503 | 9.7% |
n | 28773 | 7.6% |
t | 26602 | 7.1% |
o | 25059 | 6.7% |
s | 23202 | 6.2% |
c | 23202 | 6.2% |
d | 23202 | 6.2% |
Other values (6) | 39425 | 10.5% |
default
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 44.3 KiB |
Common Values
Value | Count | Frequency (%) |
False | 44396 | 98.2% |
True | 815 | 1.8% |
balance
Real number (ℝ)
ZEROS
Distinct | 7168 |
---|---|
Distinct (%) | 15.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1362.2721 |
Minimum | -8019 |
---|---|
Maximum | 102127 |
Zeros | 3514 |
Zeros (%) | 7.8% |
Negative | 3766 |
Negative (%) | 8.3% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | -8019 |
---|---|
5-th percentile | -172 |
Q1 | 72 |
median | 448 |
Q3 | 1428 |
95-th percentile | 5768 |
Maximum | 102127 |
Range | 110146 |
Interquartile range (IQR) | 1356 |
Descriptive statistics
Standard deviation | 3044.7658 |
---|---|
Coefficient of variation (CV) | 2.2350644 |
Kurtosis | 140.75155 |
Mean | 1362.2721 |
Median Absolute Deviation (MAD) | 448 |
Skewness | 8.3603083 |
Sum | 61589682 |
Variance | 9270599 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 3514 | 7.8% |
1 | 195 | 0.4% |
2 | 156 | 0.3% |
4 | 139 | 0.3% |
3 | 134 | 0.3% |
5 | 113 | 0.2% |
6 | 88 | 0.2% |
8 | 81 | 0.2% |
23 | 75 | 0.2% |
7 | 69 | 0.2% |
Other values (7158) | 40647 | 89.9% |
Minimum 10 values
Value | Count | Frequency (%) |
-8019 | 1 | < 0.1% |
-6847 | 1 | < 0.1% |
-4057 | 1 | < 0.1% |
-3372 | 1 | < 0.1% |
-3313 | 1 | < 0.1% |
-3058 | 1 | < 0.1% |
-2827 | 1 | < 0.1% |
-2712 | 1 | < 0.1% |
-2604 | 1 | < 0.1% |
-2282 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
102127 | 1 | < 0.1% |
98417 | 1 | < 0.1% |
81204 | 2 | < 0.1% |
71188 | 1 | < 0.1% |
66721 | 1 | < 0.1% |
66653 | 1 | < 0.1% |
64343 | 1 | < 0.1% |
59649 | 1 | < 0.1% |
58932 | 1 | < 0.1% |
58544 | 1 | < 0.1% |
housing
Boolean
HIGH CORRELATION
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 44.3 KiB |
Common Values
Value | Count | Frequency (%) |
True | 25130 | 55.6% |
False | 20081 | 44.4% |
loan
Boolean
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 44.3 KiB |
Common Values
Value | Count | Frequency (%) |
False | 37967 | 84.0% |
True | 7244 | 16.0% |
contact
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 MiB |
Common Values
Value | Count | Frequency (%) |
cellular | 29285 | 64.8% |
unknown | 13020 | 28.8% |
telephone | 2906 | 6.4% |
Most occurring characters
Value | Count | Frequency (%) |
l | 90761 | 25.8% |
u | 42305 | 12.0% |
n | 41966 | 11.9% |
e | 38003 | 10.8% |
c | 29285 | 8.3% |
a | 29285 | 8.3% |
r | 29285 | 8.3% |
o | 15926 | 4.5% |
k | 13020 | 3.7% |
w | 13020 | 3.7% |
Other values (3) | 8718 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 351574 | 100.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
l | 90761 | 25.8% |
u | 42305 | 12.0% |
n | 41966 | 11.9% |
e | 38003 | 10.8% |
c | 29285 | 8.3% |
a | 29285 | 8.3% |
r | 29285 | 8.3% |
o | 15926 | 4.5% |
k | 13020 | 3.7% |
w | 13020 | 3.7% |
Other values (3) | 8718 | 2.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 351574 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
l | 90761 | 25.8% |
u | 42305 | 12.0% |
n | 41966 | 11.9% |
e | 38003 | 10.8% |
c | 29285 | 8.3% |
a | 29285 | 8.3% |
r | 29285 | 8.3% |
o | 15926 | 4.5% |
k | 13020 | 3.7% |
w | 13020 | 3.7% |
Other values (3) | 8718 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 351574 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
l | 90761 | 25.8% |
u | 42305 | 12.0% |
n | 41966 | 11.9% |
e | 38003 | 10.8% |
c | 29285 | 8.3% |
a | 29285 | 8.3% |
r | 29285 | 8.3% |
o | 15926 | 4.5% |
k | 13020 | 3.7% |
w | 13020 | 3.7% |
Other values (3) | 8718 | 2.5% |
day
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.806419 |
Minimum | 1 |
---|---|
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 8 |
median | 16 |
Q3 | 21 |
95-th percentile | 29 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 8.3224762 |
---|---|
Coefficient of variation (CV) | 0.52652509 |
Kurtosis | -1.0598974 |
Mean | 15.806419 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 0.093079014 |
Sum | 714624 |
Variance | 69.263609 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
20 | 2752 | 6.1% |
18 | 2308 | 5.1% |
21 | 2026 | 4.5% |
17 | 1939 | 4.3% |
6 | 1932 | 4.3% |
5 | 1910 | 4.2% |
14 | 1848 | 4.1% |
8 | 1842 | 4.1% |
28 | 1830 | 4.0% |
7 | 1817 | 4.0% |
Other values (21) | 25007 | 55.3% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 322 | 0.7% |
2 | 1293 | 2.9% |
3 | 1079 | 2.4% |
4 | 1445 | 3.2% |
5 | 1910 | 4.2% |
6 | 1932 | 4.3% |
7 | 1817 | 4.0% |
8 | 1842 | 4.1% |
9 | 1561 | 3.5% |
10 | 524 | 1.2% |
Maximum 10 values
Value | Count | Frequency (%) |
31 | 643 | 1.4% |
30 | 1566 | 3.5% |
29 | 1745 | 3.9% |
28 | 1830 | 4.0% |
27 | 1121 | 2.5% |
26 | 1035 | 2.3% |
25 | 840 | 1.9% |
24 | 447 | 1.0% |
23 | 939 | 2.1% |
22 | 905 | 2.0% |
month
Categorical
HIGH CORRELATION
Distinct | 12 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.6 MiB |
Common Values
Value | Count | Frequency (%) |
may | 13766 | 30.4% |
jul | 6895 | 15.3% |
aug | 6247 | 13.8% |
jun | 5341 | 11.8% |
nov | 3970 | 8.8% |
apr | 2932 | 6.5% |
feb | 2649 | 5.9% |
jan | 1403 | 3.1% |
oct | 738 | 1.6% |
sep | 579 | 1.3% |
Other values (2) | 691 | 1.5% |
Most occurring characters
Value | Count | Frequency (%) |
a | 24825 | 18.3% |
u | 18483 | 13.6% |
m | 14243 | 10.5% |
y | 13766 | 10.1% |
j | 13639 | 10.1% |
n | 10714 | 7.9% |
l | 6895 | 5.1% |
g | 6247 | 4.6% |
o | 4708 | 3.5% |
v | 3970 | 2.9% |
Other values (9) | 18143 | 13.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 135633 | 100.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 24825 | 18.3% |
u | 18483 | 13.6% |
m | 14243 | 10.5% |
y | 13766 | 10.1% |
j | 13639 | 10.1% |
n | 10714 | 7.9% |
l | 6895 | 5.1% |
g | 6247 | 4.6% |
o | 4708 | 3.5% |
v | 3970 | 2.9% |
Other values (9) | 18143 | 13.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 135633 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 24825 | 18.3% |
u | 18483 | 13.6% |
m | 14243 | 10.5% |
y | 13766 | 10.1% |
j | 13639 | 10.1% |
n | 10714 | 7.9% |
l | 6895 | 5.1% |
g | 6247 | 4.6% |
o | 4708 | 3.5% |
v | 3970 | 2.9% |
Other values (9) | 18143 | 13.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 135633 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 24825 | 18.3% |
u | 18483 | 13.6% |
m | 14243 | 10.5% |
y | 13766 | 10.1% |
j | 13639 | 10.1% |
n | 10714 | 7.9% |
l | 6895 | 5.1% |
g | 6247 | 4.6% |
o | 4708 | 3.5% |
v | 3970 | 2.9% |
Other values (9) | 18143 | 13.4% |
duration
Real number (ℝ)
Distinct | 1573 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 258.16308 |
Minimum | 0 |
---|---|
Maximum | 4918 |
Zeros | 3 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 35 |
Q1 | 103 |
median | 180 |
Q3 | 319 |
95-th percentile | 751 |
Maximum | 4918 |
Range | 4918 |
Interquartile range (IQR) | 216 |
Descriptive statistics
Standard deviation | 257.52781 |
---|---|
Coefficient of variation (CV) | 0.99753928 |
Kurtosis | 18.153915 |
Mean | 258.16308 |
Median Absolute Deviation (MAD) | 93 |
Skewness | 3.1443181 |
Sum | 11671811 |
Variance | 66320.574 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
124 | 188 | 0.4% |
90 | 184 | 0.4% |
89 | 177 | 0.4% |
104 | 175 | 0.4% |
122 | 175 | 0.4% |
114 | 175 | 0.4% |
136 | 174 | 0.4% |
139 | 174 | 0.4% |
112 | 174 | 0.4% |
121 | 173 | 0.4% |
Other values (1563) | 43442 | 96.1% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 3 | < 0.1% |
1 | 2 | < 0.1% |
2 | 3 | < 0.1% |
3 | 4 | < 0.1% |
4 | 15 | < 0.1% |
5 | 35 | 0.1% |
6 | 45 | 0.1% |
7 | 73 | 0.2% |
8 | 85 | 0.2% |
9 | 77 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
4918 | 1 | < 0.1% |
3881 | 1 | < 0.1% |
3785 | 1 | < 0.1% |
3422 | 1 | < 0.1% |
3366 | 1 | < 0.1% |
3322 | 1 | < 0.1% |
3284 | 1 | < 0.1% |
3253 | 1 | < 0.1% |
3183 | 1 | < 0.1% |
3102 | 1 | < 0.1% |
campaign
Real number (ℝ)
Distinct | 48 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.7638407 |
Minimum | 1 |
---|---|
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 8 |
Maximum | 63 |
Range | 62 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 3.0980209 |
---|---|
Coefficient of variation (CV) | 1.1209115 |
Kurtosis | 39.249651 |
Mean | 2.7638407 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 4.8986502 |
Sum | 124956 |
Variance | 9.5977334 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
1 | 17544 | 38.8% |
2 | 12505 | 27.7% |
3 | 5521 | 12.2% |
4 | 3522 | 7.8% |
5 | 1764 | 3.9% |
6 | 1291 | 2.9% |
7 | 735 | 1.6% |
8 | 540 | 1.2% |
9 | 327 | 0.7% |
10 | 266 | 0.6% |
Other values (38) | 1196 | 2.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 17544 | 38.8% |
2 | 12505 | 27.7% |
3 | 5521 | 12.2% |
4 | 3522 | 7.8% |
5 | 1764 | 3.9% |
6 | 1291 | 2.9% |
7 | 735 | 1.6% |
8 | 540 | 1.2% |
9 | 327 | 0.7% |
10 | 266 | 0.6% |
Maximum 10 values
Value | Count | Frequency (%) |
63 | 1 | < 0.1% |
58 | 1 | < 0.1% |
55 | 1 | < 0.1% |
51 | 1 | < 0.1% |
50 | 2 | < 0.1% |
46 | 1 | < 0.1% |
44 | 1 | < 0.1% |
43 | 3 | < 0.1% |
41 | 2 | < 0.1% |
39 | 1 | < 0.1% |
pdays
Real number (ℝ)
HIGH CORRELATION
Distinct | 559 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.197828 |
Minimum | -1 |
---|---|
Maximum | 871 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 36954 |
Negative (%) | 81.7% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | -1 |
---|---|
5-th percentile | -1 |
Q1 | -1 |
median | -1 |
Q3 | -1 |
95-th percentile | 317 |
Maximum | 871 |
Range | 872 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 100.12875 |
---|---|
Coefficient of variation (CV) | 2.4908994 |
Kurtosis | 6.9351952 |
Mean | 40.197828 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.6157155 |
Sum | 1817384 |
Variance | 10025.766 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
-1 | 36954 | 81.7% |
182 | 167 | 0.4% |
92 | 147 | 0.3% |
91 | 126 | 0.3% |
183 | 126 | 0.3% |
181 | 117 | 0.3% |
370 | 99 | 0.2% |
184 | 85 | 0.2% |
364 | 77 | 0.2% |
95 | 74 | 0.2% |
Other values (549) | 7239 | 16.0% |
Minimum 10 values
Value | Count | Frequency (%) |
-1 | 36954 | 81.7% |
1 | 15 | < 0.1% |
2 | 37 | 0.1% |
3 | 1 | < 0.1% |
4 | 2 | < 0.1% |
5 | 11 | < 0.1% |
6 | 10 | < 0.1% |
7 | 7 | < 0.1% |
8 | 25 | 0.1% |
9 | 12 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
871 | 1 | < 0.1% |
854 | 1 | < 0.1% |
850 | 1 | < 0.1% |
842 | 1 | < 0.1% |
838 | 1 | < 0.1% |
831 | 1 | < 0.1% |
828 | 1 | < 0.1% |
826 | 1 | < 0.1% |
808 | 1 | < 0.1% |
805 | 1 | < 0.1% |
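The value -1 occurs in 36954 rows of pdays, the same count as the zeros in previous, and it is the only negative value. That pattern suggests -1 is a sentinel rather than a measured number of days, although the report itself does not say so. A minimal sketch of separating the marker from the real values, with a hypothetical helper name and the sentinel meaning treated as an assumption:

```python
from typing import Optional, Tuple

SENTINEL = -1  # assumed marker for "no earlier contact"; not stated in the report

def split_pdays(pdays: int) -> Tuple[bool, Optional[int]]:
    """Return (previously_contacted, days_since_contact or None)."""
    if pdays == SENTINEL:
        return False, None
    return True, pdays

# pdays values from the last ten sample rows of the dataset.
sample_pdays = [184, -1, -1, 40, -1, -1, -1, 184, -1, 188]
flags = [split_pdays(v) for v in sample_pdays]
n_contacted = sum(1 for contacted, _ in flags if contacted)
print(n_contacted, "of", len(sample_pdays), "sample rows have a real pdays value")
```

Keeping the indicator separate avoids treating -1 as a small number of days in statistics such as the mean (40.197828 above), which mixes the marker with real values.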
previous
Real number (ℝ)
HIGH CORRELATION, SKEWED, ZEROS
Distinct | 41 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.58032337 |
Minimum | 0 |
---|---|
Maximum | 275 |
Zeros | 36954 |
Zeros (%) | 81.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 353.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 3 |
Maximum | 275 |
Range | 275 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.303441 |
---|---|
Coefficient of variation (CV) | 3.9692371 |
Kurtosis | 4506.8607 |
Mean | 0.58032337 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 41.846454 |
Sum | 26237 |
Variance | 5.3058406 |
Monotonicity | Not monotonic |
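The skewness of 41.846454 reflects a distribution where most values are 0 and a few are very large (the maximum is 275). The sketch below shows one common bias-corrected skewness estimator on toy data; whether the report uses exactly this estimator is an assumption, and the full column would be needed to reproduce the reported figure.

```python
import math

def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness: sqrt(n(n-1))/(n-2) * m3 / m2**1.5."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    if m2 == 0:
        return 0.0  # constant data: treat skewness as zero
    return math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5

symmetric = [1, 2, 3, 4, 5]
zero_heavy = [0] * 9 + [10]   # toy stand-in for a mostly-zero column
print(sample_skewness(symmetric), sample_skewness(zero_heavy))
```

Symmetric data gives a value of zero, while a mostly-zero column with one large value gives a clearly positive result.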
Common Values
Value | Count | Frequency (%) |
0 | 36954 | 81.7% |
1 | 2772 | 6.1% |
2 | 2106 | 4.7% |
3 | 1142 | 2.5% |
4 | 714 | 1.6% |
5 | 459 | 1.0% |
6 | 277 | 0.6% |
7 | 205 | 0.5% |
8 | 129 | 0.3% |
9 | 92 | 0.2% |
Other values (31) | 361 | 0.8% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 36954 | 81.7% |
1 | 2772 | 6.1% |
2 | 2106 | 4.7% |
3 | 1142 | 2.5% |
4 | 714 | 1.6% |
5 | 459 | 1.0% |
6 | 277 | 0.6% |
7 | 205 | 0.5% |
8 | 129 | 0.3% |
9 | 92 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
275 | 1 | < 0.1% |
58 | 1 | < 0.1% |
55 | 1 | < 0.1% |
51 | 1 | < 0.1% |
41 | 1 | < 0.1% |
40 | 1 | < 0.1% |
38 | 2 | < 0.1% |
37 | 2 | < 0.1% |
35 | 1 | < 0.1% |
32 | 1 | < 0.1% |
poutcome
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.8 MiB |
Common Values
Value | Count | Frequency (%) |
unknown | 36959 | 81.7% |
failure | 4901 | 10.8% |
other | 1840 | 4.1% |
success | 1511 | 3.3% |
Most occurring characters
Value | Count | Frequency (%) |
n | 110877 | 35.4% |
u | 43371 | 13.9% |
o | 38799 | 12.4% |
k | 36959 | 11.8% |
w | 36959 | 11.8% |
e | 8252 | 2.6% |
r | 6741 | 2.2% |
f | 4901 | 1.6% |
a | 4901 | 1.6% |
i | 4901 | 1.6% |
Other values (5) | 16136 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 312797 | 100.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
n | 110877 | 35.4% |
u | 43371 | 13.9% |
o | 38799 | 12.4% |
k | 36959 | 11.8% |
w | 36959 | 11.8% |
e | 8252 | 2.6% |
r | 6741 | 2.2% |
f | 4901 | 1.6% |
a | 4901 | 1.6% |
i | 4901 | 1.6% |
Other values (5) | 16136 | 5.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 312797 | 100.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
n | 110877 | 35.4% |
u | 43371 | 13.9% |
o | 38799 | 12.4% |
k | 36959 | 11.8% |
w | 36959 | 11.8% |
e | 8252 | 2.6% |
r | 6741 | 2.2% |
f | 4901 | 1.6% |
a | 4901 | 1.6% |
i | 4901 | 1.6% |
Other values (5) | 16136 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 312797 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
n | 110877 | 35.4% |
u | 43371 | 13.9% |
o | 38799 | 12.4% |
k | 36959 | 11.8% |
w | 36959 | 11.8% |
e | 8252 | 2.6% |
r | 6741 | 2.2% |
f | 4901 | 1.6% |
a | 4901 | 1.6% |
i | 4901 | 1.6% |
Other values (5) | 16136 | 5.2% |
y
Boolean
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 44.3 KiB |
Common Values
Value | Count | Frequency (%) |
False | 39922 | 88.3% |
True | 5289 | 11.7% |
Correlations
| | age | balance | day | duration | campaign | pdays | previous | job | marital | education | default | housing | loan | contact | month | poutcome | y |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
age | 1.000 | 0.096 | -0.009 | -0.033 | 0.037 | -0.017 | -0.012 | 0.242 | 0.329 | 0.135 | 0.021 | 0.225 | 0.064 | 0.164 | 0.095 | 0.068 | 0.155 |
balance | 0.096 | 1.000 | 0.001 | 0.043 | -0.031 | 0.070 | 0.080 | 0.033 | 0.017 | 0.046 | 0.048 | 0.060 | 0.075 | 0.034 | 0.057 | 0.022 | 0.058 |
day | -0.009 | 0.001 | 1.000 | -0.058 | 0.140 | -0.092 | -0.088 | 0.041 | 0.032 | 0.039 | 0.013 | 0.107 | 0.048 | 0.091 | 0.284 | 0.079 | 0.073 |
duration | -0.033 | 0.043 | -0.058 | 1.000 | -0.108 | 0.029 | 0.031 | 0.012 | 0.011 | 0.000 | 0.000 | 0.000 | 0.016 | 0.025 | 0.019 | 0.014 | 0.364 |
campaign | 0.037 | -0.031 | 0.140 | -0.108 | 1.000 | -0.112 | -0.108 | 0.012 | 0.006 | 0.011 | 0.015 | 0.027 | 0.010 | 0.032 | 0.058 | 0.047 | 0.048 |
pdays | -0.017 | 0.070 | -0.092 | 0.029 | -0.112 | 1.000 | 0.986 | 0.043 | 0.026 | 0.046 | 0.035 | 0.168 | 0.030 | 0.200 | 0.177 | 0.571 | 0.192 |
previous | -0.012 | 0.080 | -0.088 | 0.031 | -0.108 | 0.986 | 1.000 | 0.000 | 0.000 | 0.006 | 0.000 | 0.000 | 0.008 | 0.008 | 0.014 | 0.032 | 0.011 |
job | 0.242 | 0.033 | 0.041 | 0.012 | 0.012 | 0.043 | 0.000 | 1.000 | 0.205 | 0.458 | 0.033 | 0.281 | 0.105 | 0.150 | 0.109 | 0.062 | 0.135 |
marital | 0.329 | 0.017 | 0.032 | 0.011 | 0.006 | 0.026 | 0.000 | 0.205 | 1.000 | 0.121 | 0.018 | 0.020 | 0.052 | 0.045 | 0.071 | 0.028 | 0.066 |
education | 0.135 | 0.046 | 0.039 | 0.000 | 0.011 | 0.046 | 0.006 | 0.458 | 0.121 | 1.000 | 0.014 | 0.119 | 0.080 | 0.123 | 0.109 | 0.035 | 0.072 |
default | 0.021 | 0.048 | 0.013 | 0.000 | 0.015 | 0.035 | 0.000 | 0.033 | 0.018 | 0.014 | 1.000 | 0.003 | 0.077 | 0.023 | 0.057 | 0.040 | 0.022 |
housing | 0.225 | 0.060 | 0.107 | 0.000 | 0.027 | 0.168 | 0.000 | 0.281 | 0.020 | 0.119 | 0.003 | 1.000 | 0.041 | 0.213 | 0.504 | 0.143 | 0.139 |
loan | 0.064 | 0.075 | 0.048 | 0.016 | 0.010 | 0.030 | 0.008 | 0.105 | 0.052 | 0.080 | 0.077 | 0.041 | 1.000 | 0.015 | 0.182 | 0.055 | 0.068 |
contact | 0.164 | 0.034 | 0.091 | 0.025 | 0.032 | 0.200 | 0.008 | 0.150 | 0.045 | 0.123 | 0.023 | 0.213 | 0.015 | 1.000 | 0.512 | 0.207 | 0.151 |
month | 0.095 | 0.057 | 0.284 | 0.019 | 0.058 | 0.177 | 0.014 | 0.109 | 0.071 | 0.109 | 0.057 | 0.504 | 0.182 | 0.512 | 1.000 | 0.214 | 0.260 |
poutcome | 0.068 | 0.022 | 0.079 | 0.014 | 0.047 | 0.571 | 0.032 | 0.062 | 0.028 | 0.035 | 0.040 | 0.143 | 0.055 | 0.207 | 0.214 | 1.000 | 0.312 |
y | 0.155 | 0.058 | 0.073 | 0.364 | 0.048 | 0.192 | 0.011 | 0.135 | 0.066 | 0.072 | 0.022 | 0.139 | 0.068 | 0.151 | 0.260 | 0.312 | 1.000 |
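The high-correlation alerts near the top of the report line up with the matrix entries whose absolute value is at least 0.5. The sketch below applies that cut-off to a handful of entries copied from the matrix; the 0.5 threshold is inferred from the alerts rather than stated in the report, and the dictionary is only a subset of the full matrix.

```python
THRESHOLD = 0.5  # assumed cut-off, inferred from which pairs raised alerts

# Selected off-diagonal entries copied from the matrix above.
entries = {
    ("pdays", "previous"): 0.986,
    ("pdays", "poutcome"): 0.571,
    ("housing", "month"): 0.504,
    ("contact", "month"): 0.512,
    ("job", "education"): 0.458,
    ("duration", "y"): 0.364,
    ("previous", "poutcome"): 0.032,
}

# Keep only the pairs at or above the threshold, sorted for stable output.
flagged = sorted(pair for pair, r in entries.items() if abs(r) >= THRESHOLD)
for left, right in flagged:
    print(f"{left} ~ {right}: {entries[(left, right)]:.3f}")
```

The four pairs that pass the cut-off are the same pairs named in the alerts, while job and education (0.458) fall just below it.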
First rows
| | age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 58 | management | married | tertiary | no | 2143 | yes | no | unknown | 5 | may | 261 | 1 | -1 | 0 | unknown | no |
1 | 44 | technician | single | secondary | no | 29 | yes | no | unknown | 5 | may | 151 | 1 | -1 | 0 | unknown | no |
2 | 33 | entrepreneur | married | secondary | no | 2 | yes | yes | unknown | 5 | may | 76 | 1 | -1 | 0 | unknown | no |
3 | 47 | blue-collar | married | unknown | no | 1506 | yes | no | unknown | 5 | may | 92 | 1 | -1 | 0 | unknown | no |
4 | 33 | unknown | single | unknown | no | 1 | no | no | unknown | 5 | may | 198 | 1 | -1 | 0 | unknown | no |
5 | 35 | management | married | tertiary | no | 231 | yes | no | unknown | 5 | may | 139 | 1 | -1 | 0 | unknown | no |
6 | 28 | management | single | tertiary | no | 447 | yes | yes | unknown | 5 | may | 217 | 1 | -1 | 0 | unknown | no |
7 | 42 | entrepreneur | divorced | tertiary | yes | 2 | yes | no | unknown | 5 | may | 380 | 1 | -1 | 0 | unknown | no |
8 | 58 | retired | married | primary | no | 121 | yes | no | unknown | 5 | may | 50 | 1 | -1 | 0 | unknown | no |
9 | 43 | technician | single | secondary | no | 593 | yes | no | unknown | 5 | may | 55 | 1 | -1 | 0 | unknown | no |
Last rows
| | age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
45201 | 53 | management | married | tertiary | no | 583 | no | no | cellular | 17 | nov | 226 | 1 | 184 | 4 | success | yes |
45202 | 34 | admin. | single | secondary | no | 557 | no | no | cellular | 17 | nov | 224 | 1 | -1 | 0 | unknown | yes |
45203 | 23 | student | single | tertiary | no | 113 | no | no | cellular | 17 | nov | 266 | 1 | -1 | 0 | unknown | yes |
45204 | 73 | retired | married | secondary | no | 2850 | no | no | cellular | 17 | nov | 300 | 1 | 40 | 8 | failure | yes |
45205 | 25 | technician | single | secondary | no | 505 | no | yes | cellular | 17 | nov | 386 | 2 | -1 | 0 | unknown | yes |
45206 | 51 | technician | married | tertiary | no | 825 | no | no | cellular | 17 | nov | 977 | 3 | -1 | 0 | unknown | yes |
45207 | 71 | retired | divorced | primary | no | 1729 | no | no | cellular | 17 | nov | 456 | 2 | -1 | 0 | unknown | yes |
45208 | 72 | retired | married | secondary | no | 5715 | no | no | cellular | 17 | nov | 1127 | 5 | 184 | 3 | success | yes |
45209 | 57 | blue-collar | married | secondary | no | 668 | no | no | telephone | 17 | nov | 508 | 4 | -1 | 0 | unknown | no |
45210 | 37 | entrepreneur | married | secondary | no | 2971 | no | no | cellular | 17 | nov | 361 | 2 | 188 | 11 | other | no |