Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 865 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 190.2 KiB |
Average record size in memory | 225.2 B |
Variable types
Text | 3 |
---|---|
Numeric | 3 |
Reproduction
Analysis started | 2023-09-12 08:37:09.740731 |
---|---|
Analysis finished | 2023-09-12 08:37:12.289722 |
Duration | 2.55 seconds |
Software version | ydata-profiling v0.0.dev0 |
Download configuration | config.json |
Code
Text
UNIQUE
 
Distinct | 865 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 57.9 KiB |
Length
Max length | 39 |
---|---|
Median length | 26 |
Mean length | 11.375723 |
Min length | 3 |
Characters and Unicode
Total characters | 9840 |
---|---|
Distinct characters | 31 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 865 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | air_force_blue_raf |
---|---|
2nd row | air_force_blue_usaf |
3rd row | air_superiority_blue |
4th row | alabama_crimson |
5th row | alice_blue |
Value | Count | Frequency (%) |
air_force_blue_raf | 1 | 0.1% |
amethyst | 1 | 0.1% |
arsenic | 1 | 0.1% |
air_superiority_blue | 1 | 0.1% |
alabama_crimson | 1 | 0.1% |
alice_blue | 1 | 0.1% |
alizarin_crimson | 1 | 0.1% |
alloy_orange | 1 | 0.1% |
almond | 1 | 0.1% |
amaranth | 1 | 0.1% |
Other values (855) | 855 |
Most occurring characters
Value | Count | Frequency (%) |
e | 1201 | 12.2% |
_ | 799 | 8.1% |
r | 796 | 8.1% |
a | 788 | 8.0% |
l | 695 | 7.1% |
n | 626 | 6.4% |
i | 558 | 5.7% |
o | 519 | 5.3% |
t | 396 | 4.0% |
u | 373 | 3.8% |
Other values (21) | 3089 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 9025 | |
Connector Punctuation | 799 | 8.1% |
Decimal Number | 16 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 1201 | |
r | 796 | 8.8% |
a | 788 | 8.7% |
l | 695 | 7.7% |
n | 626 | 6.9% |
i | 558 | 6.2% |
o | 519 | 5.8% |
t | 396 | 4.4% |
u | 373 | 4.1% |
s | 343 | 3.8% |
Other values (16) | 2730 |
Decimal Number
Value | Count | Frequency (%) |
1 | 13 | |
7 | 1 | 6.2% |
3 | 1 | 6.2% |
9 | 1 | 6.2% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 799 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 9025 | |
Common | 815 | 8.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 1201 | |
r | 796 | 8.8% |
a | 788 | 8.7% |
l | 695 | 7.7% |
n | 626 | 6.9% |
i | 558 | 6.2% |
o | 519 | 5.8% |
t | 396 | 4.4% |
u | 373 | 4.1% |
s | 343 | 3.8% |
Other values (16) | 2730 |
Common
Value | Count | Frequency (%) |
_ | 799 | |
1 | 13 | 1.6% |
7 | 1 | 0.1% |
3 | 1 | 0.1% |
9 | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9840 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 1201 | 12.2% |
_ | 799 | 8.1% |
r | 796 | 8.1% |
a | 788 | 8.0% |
l | 695 | 7.1% |
n | 626 | 6.4% |
i | 558 | 5.7% |
o | 519 | 5.3% |
t | 396 | 4.0% |
u | 373 | 3.8% |
Other values (21) | 3089 |
Name
Text
UNIQUE
 
Distinct | 865 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 58.3 KiB |
Length
Max length | 41 |
---|---|
Median length | 28 |
Mean length | 11.591908 |
Min length | 3 |
Characters and Unicode
Total characters | 10027 |
---|---|
Distinct characters | 69 |
Distinct categories | 9 ? |
Distinct scripts | 2 ? |
Distinct blocks | 3 ? |
Unique
Unique | 865 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | Air Force Blue (Raf) |
---|---|
2nd row | Air Force Blue (Usaf) |
3rd row | Air Superiority Blue |
4th row | Alabama Crimson |
5th row | Alice Blue |
Value | Count | Frequency (%) |
blue | 98 | 6.0% |
green | 78 | 4.8% |
pink | 47 | 2.9% |
dark | 45 | 2.8% |
red | 42 | 2.6% |
yellow | 31 | 1.9% |
rose | 28 | 1.7% |
light | 25 | 1.5% |
lavender | 23 | 1.4% |
orange | 23 | 1.4% |
Other values (606) | 1190 |
Most occurring characters
Value | Count | Frequency (%) |
e | 1168 | 11.6% |
765 | 7.6% | |
a | 737 | 7.4% |
r | 661 | 6.6% |
l | 611 | 6.1% |
n | 609 | 6.1% |
i | 536 | 5.3% |
o | 463 | 4.6% |
u | 345 | 3.4% |
t | 328 | 3.3% |
Other values (59) | 3804 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 7369 | |
Uppercase Letter | 1661 | 16.6% |
Space Separator | 765 | 7.6% |
Open Punctuation | 89 | 0.9% |
Close Punctuation | 89 | 0.9% |
Dash Punctuation | 20 | 0.2% |
Other Punctuation | 17 | 0.2% |
Decimal Number | 16 | 0.2% |
Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 1168 | |
a | 737 | |
r | 661 | 9.0% |
l | 611 | 8.3% |
n | 609 | 8.3% |
i | 536 | 7.3% |
o | 463 | 6.3% |
u | 345 | 4.7% |
t | 328 | 4.5% |
d | 251 | 3.4% |
Other values (19) | 1660 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 206 | |
P | 174 | |
C | 158 | 9.5% |
G | 140 | 8.4% |
R | 135 | 8.1% |
M | 95 | 5.7% |
S | 93 | 5.6% |
D | 90 | 5.4% |
L | 84 | 5.1% |
T | 68 | 4.1% |
Other values (16) | 418 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 7 | |
' | 6 | |
# | 2 | 11.8% |
. | 1 | 5.9% |
& | 1 | 5.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 13 | |
3 | 1 | 6.2% |
7 | 1 | 6.2% |
9 | 1 | 6.2% |
Space Separator
Value | Count | Frequency (%) |
765 |
Open Punctuation
Value | Count | Frequency (%) |
( | 89 |
Close Punctuation
Value | Count | Frequency (%) |
) | 89 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 20 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 9030 | |
Common | 997 | 9.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 1168 | 12.9% |
a | 737 | 8.2% |
r | 661 | 7.3% |
l | 611 | 6.8% |
n | 609 | 6.7% |
i | 536 | 5.9% |
o | 463 | 5.1% |
u | 345 | 3.8% |
t | 328 | 3.6% |
d | 251 | 2.8% |
Other values (45) | 3321 |
Common
Value | Count | Frequency (%) |
765 | ||
( | 89 | 8.9% |
) | 89 | 8.9% |
- | 20 | 2.0% |
1 | 13 | 1.3% |
/ | 7 | 0.7% |
' | 6 | 0.6% |
# | 2 | 0.2% |
3 | 1 | 0.1% |
7 | 1 | 0.1% |
Other values (4) | 4 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10021 | |
None | 5 | < 0.1% |
Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 1168 | 11.7% |
765 | 7.6% | |
a | 737 | 7.4% |
r | 661 | 6.6% |
l | 611 | 6.1% |
n | 609 | 6.1% |
i | 536 | 5.3% |
o | 463 | 4.6% |
u | 345 | 3.4% |
t | 328 | 3.3% |
Other values (55) | 3798 |
None
Value | Count | Frequency (%) |
é | 3 | |
à | 1 | 20.0% |
ú | 1 | 20.0% |
Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Hex
Text
Distinct | 765 |
---|---|
Distinct (%) | 88.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 54.0 KiB |
Value | Count | Frequency (%) |
c19a6b | 5 | 0.6% |
967117 | 4 | 0.5% |
fada5e | 4 | 0.5% |
808080 | 3 | 0.3% |
0f0 | 3 | 0.3% |
a52a2a | 3 | 0.3% |
483c32 | 3 | 0.3% |
d2691e | 3 | 0.3% |
fad6a5 | 3 | 0.3% |
dda0dd | 3 | 0.3% |
Other values (755) | 831 |
Most occurring characters
Value | Count | Frequency (%) |
# | 865 | |
0 | 665 | 11.3% |
f | 625 | 10.6% |
8 | 317 | 5.4% |
c | 300 | 5.1% |
a | 292 | 5.0% |
e | 269 | 4.6% |
4 | 268 | 4.6% |
b | 268 | 4.6% |
3 | 267 | 4.5% |
Other values (7) | 1745 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2997 | |
Lowercase Letter | 2019 | |
Other Punctuation | 865 | 14.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 665 | |
8 | 317 | |
4 | 268 | |
3 | 267 | |
6 | 265 | 8.8% |
7 | 252 | 8.4% |
9 | 250 | 8.3% |
5 | 248 | 8.3% |
2 | 243 | 8.1% |
1 | 222 | 7.4% |
Lowercase Letter
Value | Count | Frequency (%) |
f | 625 | |
c | 300 | |
a | 292 | |
e | 269 | |
b | 268 | |
d | 265 |
Other Punctuation
Value | Count | Frequency (%) |
# | 865 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 3862 | |
Latin | 2019 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
# | 865 | |
0 | 665 | |
8 | 317 | 8.2% |
4 | 268 | 6.9% |
3 | 267 | 6.9% |
6 | 265 | 6.9% |
7 | 252 | 6.5% |
9 | 250 | 6.5% |
5 | 248 | 6.4% |
2 | 243 | 6.3% |
Latin
Value | Count | Frequency (%) |
f | 625 | |
c | 300 | |
a | 292 | |
e | 269 | |
b | 268 | |
d | 265 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 5881 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
# | 865 | |
0 | 665 | 11.3% |
f | 625 | 10.6% |
8 | 317 | 5.4% |
c | 300 | 5.1% |
a | 292 | 5.0% |
e | 269 | 4.6% |
4 | 268 | 4.6% |
b | 268 | 4.6% |
3 | 267 | 4.5% |
Other values (7) | 1745 |
R
Real number (ℝ)
ZEROS
 
Distinct | 221 |
---|---|
Distinct (%) | 25.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 158.59884 |
Minimum | 0 |
---|---|
Maximum | 255 |
Zeros | 81 |
Zeros (%) | 9.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 101 |
median | 178 |
Q3 | 236 |
95-th percentile | 255 |
Maximum | 255 |
Range | 255 |
Interquartile range (IQR) | 135 |
Descriptive statistics
Standard deviation | 85.338432 |
---|---|
Coefficient of variation (CV) | 0.53807726 |
Kurtosis | -0.92645087 |
Mean | 158.59884 |
Median Absolute Deviation (MAD) | 66 |
Skewness | -0.59367921 |
Sum | 137188 |
Variance | 7282.6479 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
255 | 110 | 12.7% |
0 | 81 | 9.4% |
250 | 15 | 1.7% |
204 | 13 | 1.5% |
150 | 11 | 1.3% |
128 | 11 | 1.3% |
227 | 10 | 1.2% |
153 | 10 | 1.2% |
244 | 10 | 1.2% |
240 | 9 | 1.0% |
Other values (211) | 585 |
Value | Count | Frequency (%) |
0 | 81 | |
1 | 4 | 0.5% |
2 | 1 | 0.1% |
3 | 2 | 0.2% |
5 | 1 | 0.1% |
6 | 1 | 0.1% |
8 | 4 | 0.5% |
10 | 1 | 0.1% |
11 | 1 | 0.1% |
13 | 1 | 0.1% |
Value | Count | Frequency (%) |
255 | 110 | |
254 | 7 | 0.8% |
253 | 8 | 0.9% |
252 | 6 | 0.7% |
251 | 9 | 1.0% |
250 | 15 | 1.7% |
249 | 4 | 0.5% |
248 | 8 | 0.9% |
247 | 3 | 0.3% |
246 | 2 | 0.2% |
G
Real number (ℝ)
ZEROS
 
Distinct | 234 |
---|---|
Distinct (%) | 27.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 124.68324 |
Minimum | 0 |
---|---|
Maximum | 255 |
Zeros | 58 |
Zeros (%) | 6.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 64 |
median | 123 |
Q3 | 190 |
95-th percentile | 250 |
Maximum | 255 |
Range | 255 |
Interquartile range (IQR) | 126 |
Descriptive statistics
Standard deviation | 76.270225 |
---|---|
Coefficient of variation (CV) | 0.61171194 |
Kurtosis | -1.0978467 |
Mean | 124.68324 |
Median Absolute Deviation (MAD) | 63 |
Skewness | 0.052233472 |
Sum | 107851 |
Variance | 5817.1472 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 58 | 6.7% |
255 | 35 | 4.0% |
128 | 13 | 1.5% |
105 | 12 | 1.4% |
51 | 11 | 1.3% |
204 | 11 | 1.3% |
66 | 9 | 1.0% |
160 | 9 | 1.0% |
218 | 9 | 1.0% |
102 | 9 | 1.0% |
Other values (224) | 689 |
Value | Count | Frequency (%) |
0 | 58 | |
1 | 2 | 0.2% |
2 | 2 | 0.2% |
3 | 2 | 0.2% |
6 | 2 | 0.2% |
8 | 2 | 0.2% |
10 | 3 | 0.3% |
11 | 2 | 0.2% |
12 | 3 | 0.3% |
14 | 2 | 0.2% |
Value | Count | Frequency (%) |
255 | 35 | |
254 | 3 | 0.3% |
253 | 2 | 0.2% |
252 | 2 | 0.2% |
251 | 1 | 0.1% |
250 | 5 | 0.6% |
249 | 1 | 0.1% |
248 | 4 | 0.5% |
247 | 2 | 0.2% |
246 | 1 | 0.1% |
B
Real number (ℝ)
ZEROS
 
Distinct | 230 |
---|---|
Distinct (%) | 26.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 119.08786 |
Minimum | 0 |
---|---|
Maximum | 255 |
Zeros | 80 |
Zeros (%) | 9.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 53 |
median | 119 |
Q3 | 186 |
95-th percentile | 253.6 |
Maximum | 255 |
Range | 255 |
Interquartile range (IQR) | 133 |
Descriptive statistics
Standard deviation | 78.343862 |
---|---|
Coefficient of variation (CV) | 0.65786606 |
Kurtosis | -1.13796 |
Mean | 119.08786 |
Median Absolute Deviation (MAD) | 66 |
Skewness | 0.10728769 |
Sum | 103011 |
Variance | 6137.7608 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 80 | 9.2% |
255 | 41 | 4.7% |
107 | 15 | 1.7% |
128 | 14 | 1.6% |
204 | 10 | 1.2% |
94 | 9 | 1.0% |
120 | 9 | 1.0% |
50 | 8 | 0.9% |
51 | 8 | 0.9% |
153 | 8 | 0.9% |
Other values (220) | 663 |
Value | Count | Frequency (%) |
0 | 80 | |
2 | 3 | 0.3% |
3 | 1 | 0.1% |
5 | 2 | 0.2% |
7 | 2 | 0.2% |
8 | 3 | 0.3% |
9 | 1 | 0.1% |
10 | 2 | 0.2% |
11 | 3 | 0.3% |
12 | 3 | 0.3% |
Value | Count | Frequency (%) |
255 | 41 | |
254 | 3 | 0.3% |
252 | 1 | 0.1% |
251 | 1 | 0.1% |
250 | 7 | 0.8% |
249 | 1 | 0.1% |
245 | 3 | 0.3% |
244 | 2 | 0.2% |
241 | 1 | 0.1% |
240 | 6 | 0.7% |
R | G | B | |
---|---|---|---|
R | 1.000 | 0.256 | 0.010 |
G | 0.256 | 1.000 | 0.289 |
B | 0.010 | 0.289 | 1.000 |
Code | Name | Hex | R | G | B | |
---|---|---|---|---|---|---|
0 | air_force_blue_raf | Air Force Blue (Raf) | #5d8aa8 | 93 | 138 | 168 |
1 | air_force_blue_usaf | Air Force Blue (Usaf) | #00308f | 0 | 48 | 143 |
2 | air_superiority_blue | Air Superiority Blue | #72a0c1 | 114 | 160 | 193 |
3 | alabama_crimson | Alabama Crimson | #a32638 | 163 | 38 | 56 |
4 | alice_blue | Alice Blue | #f0f8ff | 240 | 248 | 255 |
5 | alizarin_crimson | Alizarin Crimson | #e32636 | 227 | 38 | 54 |
6 | alloy_orange | Alloy Orange | #c46210 | 196 | 98 | 16 |
7 | almond | Almond | #efdecd | 239 | 222 | 205 |
8 | amaranth | Amaranth | #e52b50 | 229 | 43 | 80 |
9 | amber | Amber | #ffbf00 | 255 | 191 | 0 |
Code | Name | Hex | R | G | B | |
---|---|---|---|---|---|---|
855 | yale_blue | Yale Blue | #0f4d92 | 15 | 77 | 146 |
856 | yellow | Yellow | #ff0 | 255 | 255 | 0 |
857 | yellow_green | Yellow-Green | #9acd32 | 154 | 205 | 50 |
858 | yellow_munsell | Yellow (Munsell) | #efcc00 | 239 | 204 | 0 |
859 | yellow_ncs | Yellow (Ncs) | #ffd300 | 255 | 211 | 0 |
860 | yellow_orange | Yellow Orange | #ffae42 | 255 | 174 | 66 |
861 | yellow_process | Yellow (Process) | #ffef00 | 255 | 239 | 0 |
862 | yellow_ryb | Yellow (Ryb) | #fefe33 | 254 | 254 | 51 |
863 | zaffre | Zaffre | #0014a8 | 0 | 20 | 168 |
864 | zinnwaldite_brown | Zinnwaldite Brown | #2c1608 | 44 | 22 | 8 |