Overview
Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 189 |
Missing cells | 188 |
Missing cells (%) | 19.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.5 KiB |
Average record size in memory | 40.7 B |
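The missing-cell percentage above follows directly from the counts: missing cells divided by the total number of cells (observations × variables). A minimal Python sketch of that arithmetic, using only the figures in this table; the variable names are illustrative and not part of the report:

```python
# Figures copied from the Dataset statistics table.
n_variables = 5
n_observations = 189
missing_cells = 188

total_cells = n_variables * n_observations        # every row/column intersection
missing_pct = 100 * missing_cells / total_cells   # share of cells that are empty

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
```

Nearly all of the missing cells come from the `notes` column, which is empty in 188 of the 189 rows.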
Variable types
URL | 1 |
---|---|
Categorical | 2 |
DateTime | 1 |
Text | 1 |
Reproduction
Analysis started | 2024-10-29 15:28:30.647105 |
---|---|
Analysis finished | 2024-10-29 15:28:30.935559 |
Duration | 0.29 seconds |
Software version | ydata-profiling v0.0.dev0 |
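The reported duration can be checked against the two timestamps. A small sketch using Python's standard `datetime` module, with the values copied from the table above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-10-29 15:28:30.647105")
finished = datetime.fromisoformat("2024-10-29 15:28:30.935559")

# Subtracting two datetimes yields a timedelta; convert it to seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```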
Variables
url
URL
Unique 
Distinct | 189 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
http://abrahadesta.wordpress.com/ | 1 |
---|---|
http://aljazeera.net/ | 1 |
http://am.wikipedia.org/ | 1 |
http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | 1 |
http://amharic.voanews.com/ | 1 |
Other values (184) | 184 |
Value | Count | Frequency (%) |
http://abrahadesta.wordpress.com/ | 1 | 0.5% |
http://aljazeera.net/ | 1 | 0.5% |
http://am.wikipedia.org/ | 1 | 0.5% |
http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | 1 | 0.5% |
http://amharic.voanews.com/ | 1 | 0.5% |
http://ancientgebts.org/ | 1 | 0.5% |
http://carpediemethiopia.blogspot.com/ | 1 | 0.5% |
http://citizenlab.org/ | 1 | 0.5% |
http://cpj.org/ | 1 | 0.5% |
http://egoportal.blogspot.com/ | 1 | 0.5% |
Other values (179) | 179 | 94.7% |
Scheme
Value | Count | Frequency (%) |
http | 173 | 91.5% |
https | 16 | 8.5% |
Netloc
Value | Count | Frequency (%) |
nazret.com | 8 | 4.2% |
www.cafpde.org | 3 | 1.6% |
www.hrw.org | 3 | 1.6% |
am.wikipedia.org | 2 | 1.1% |
www.awate.com | 2 | 1.1% |
citizenlab.org | 2 | 1.1% |
facebook.com | 2 | 1.1% |
www.ethiopiafirst.com | 2 | 1.1% |
portal.unesco.org | 2 | 1.1% |
www.aigaforum.com | 2 | 1.1% |
Other values (134) | 161 | 85.2% |
Path
Value | Count | Frequency (%) |
/ | 127 | 67.2% |
/blog/index.php | 7 | 3.7% |
/index.html | 2 | 1.1% |
/index.htm | 2 | 1.1% |
/story/201306250132-0022854 | 1 | 0.5% |
/geography/en/ev.php-URL_ID=3559&URL_DO=DO_TOPIC&URL_SECTION=201.html | 1 | 0.5% |
/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | 1 | 0.5% |
/library/eng-eth/index | 1 | 0.5% |
/~ena/ | 1 | 0.5% |
/new/index.asp | 1 | 0.5% |
Other values (45) | 45 | 23.8% |
Query
Value | Count | Frequency (%) |
(empty) | 174 | 92.1% |
blog=12 | 1 | 0.5% |
blog=13 | 1 | 0.5% |
blog=14 | 1 | 0.5% |
blog=15 | 1 | 0.5% |
blog=16 | 1 | 0.5% |
blog=7 | 1 | 0.5% |
blog=9 | 1 | 0.5% |
c=ethiop&t=africa | 1 | 0.5% |
feed=5&how=paged&what=all | 1 | 0.5% |
Other values (6) | 6 | 3.2% |
Fragment
Value | Count | Frequency (%) |
(empty) | 188 | 99.5% |
ethiopia | 1 | 0.5% |
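The component tables above break each URL into its scheme, network location, path, query, and fragment. A minimal sketch of how such a breakdown could be reproduced with Python's standard `urllib.parse.urlsplit`; whether the report itself uses this function is an assumption, and the URLs are a handful copied from the sample rows of this report rather than the full dataset:

```python
from collections import Counter
from urllib.parse import urlsplit

# A few URLs copied from the sample rows of this report.
urls = [
    "http://am.wikipedia.org/",
    "http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD",
    "http://cpj.org/",
    "https://www.hrw.org/",
    "https://www.facebook.com/Jawarmd",
]

parts = [urlsplit(u) for u in urls]

# One frequency table per URL component.
scheme_counts = Counter(p.scheme for p in parts)
netloc_counts = Counter(p.netloc for p in parts)
path_counts = Counter(p.path for p in parts)
query_counts = Counter(p.query for p in parts)        # "" when the URL has no query
fragment_counts = Counter(p.fragment for p in parts)  # "" when the URL has no fragment

for name, counts in [("scheme", scheme_counts),
                     ("netloc", netloc_counts),
                     ("path", path_counts)]:
    print(name, counts.most_common(3))
```

URLs without a query or fragment yield an empty string for that component, which is consistent with the blank-valued rows (counts 174 and 188) in the last two tables.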
category_code
Categorical
Distinct | 15 |
---|---|
Distinct (%) | 7.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
NEWS | 65 |
---|---|
HUMR | 45 |
POLR | 32 |
ECON | 13 |
ANON | 8 |
Other values (10) | 26 |
Common Values
Value | Count | Frequency (%) |
NEWS | 65 | 34.4% |
HUMR | 45 | 23.8% |
POLR | 32 | 16.9% |
ECON | 13 | 6.9% |
ANON | 8 | 4.2% |
CULTR | 7 | 3.7% |
XED | 5 | 2.6% |
MISC | 3 | 1.6% |
HOST | 3 | 1.6% |
PUBH | 2 | 1.1% |
Other values (5) | 6 | 3.2% |
Words
Value | Count | Frequency (%) |
news | 65 | 34.4% |
humr | 45 | 23.8% |
polr | 32 | 16.9% |
econ | 13 | 6.9% |
anon | 8 | 4.2% |
cultr | 7 | 3.7% |
xed | 5 | 2.6% |
misc | 3 | 1.6% |
host | 3 | 1.6% |
pubh | 2 | 1.1% |
Other values (5) | 6 | 3.2% |
Most occurring characters
Value | Count | Frequency (%) |
N | 95 | 12.6% |
R | 86 | 11.4% |
E | 85 | 11.2% |
S | 72 | 9.5% |
W | 65 | 8.6% |
O | 56 | 7.4% |
U | 54 | 7.1% |
H | 51 | 6.7% |
M | 50 | 6.6% |
L | 42 | 5.6% |
Other values (11) | 100 | 13.2% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 756 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 756 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 756 | 100.0% |
date_added
Date
Distinct | 6 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
Minimum | 2014-04-15 00:00:00 |
---|---|
Maximum | 2018-04-10 00:00:00 |
source
Categorical
Imbalance 
Distinct | 5 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.6 KiB |
citizenlab | 178 |
---|---|
OONI | 4 |
CIPIT | 4 |
BBC | 2 |
defenddefenders | 1 |
Length
Max length | 15 |
---|---|
Median length | 10 |
Mean length | 9.7195767 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.5% |
Sample
1st row | citizenlab |
---|---|
2nd row | citizenlab |
3rd row | citizenlab |
4th row | citizenlab |
5th row | citizenlab |
Common Values
Value | Count | Frequency (%) |
citizenlab | 178 | 94.2% |
OONI | 4 | 2.1% |
CIPIT | 4 | 2.1% |
BBC | 2 | 1.1% |
defenddefenders | 1 | 0.5% |
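The Length statistics reported for `source` can be recomputed from the value counts in this table, since every value's string length is known. A short sketch using only the standard library; the dictionary is typed in from the table above, not loaded from the original dataset:

```python
from statistics import mean, median

# Value counts copied from the Common Values table for `source`.
source_counts = {
    "citizenlab": 178,
    "OONI": 4,
    "CIPIT": 4,
    "BBC": 2,
    "defenddefenders": 1,
}

# Expand to one string length per observation (189 in total).
lengths = [len(value)
           for value, count in source_counts.items()
           for _ in range(count)]

print("Max length:", max(lengths))
print("Median length:", median(lengths))
print("Mean length:", round(mean(lengths), 7))
print("Min length:", min(lengths))
```

Because 178 of the 189 values are the 10-character string `citizenlab`, the median is 10 and the mean sits just below it.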
Words
Value | Count | Frequency (%) |
citizenlab | 178 | 94.2% |
ooni | 4 | 2.1% |
cipit | 4 | 2.1% |
bbc | 2 | 1.1% |
defenddefenders | 1 | 0.5% |
Most occurring characters
Value | Count | Frequency (%) |
i | 356 | 19.4% |
e | 183 | 10.0% |
n | 180 | 9.8% |
c | 178 | 9.7% |
z | 178 | 9.7% |
t | 178 | 9.7% |
l | 178 | 9.7% |
a | 178 | 9.7% |
b | 178 | 9.7% |
I | 12 | 0.7% |
Other values (10) | 38 | 2.1% |
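The character counts above can be derived from the value counts of `source`: each character of a value is counted once per row in which that value occurs. A minimal sketch with `collections.Counter`, again using figures typed in from this report rather than the original data:

```python
from collections import Counter

# Value counts copied from the Common Values table for `source`.
source_counts = {
    "citizenlab": 178,
    "OONI": 4,
    "CIPIT": 4,
    "BBC": 2,
    "defenddefenders": 1,
}

char_counts = Counter()
for value, count in source_counts.items():
    for ch in value:
        # Each row holding this value contributes one occurrence of ch.
        char_counts[ch] += count

total_chars = sum(char_counts.values())
for ch, n in char_counts.most_common(3):
    print(ch, n, f"{100 * n / total_chars:.1f}%")
```

Counting is case-sensitive, which is why `i` and `I` appear as separate rows in the table.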
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 1837 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 1837 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 1837 | 100.0% |
notes
Text
Constant  Missing 
Distinct | 1 |
---|---|
Distinct (%) | 100.0% |
Missing | 188 |
Missing (%) | 99.5% |
Memory size | 1.6 KiB |
Words
Value | Count | Frequency (%) |
reportedly | 1 | 50.0% |
blocked | 1 | 50.0% |
Most occurring characters
Value | Count | Frequency (%) |
e | 3 | 16.7% |
o | 2 | 11.1% |
d | 2 | 11.1% |
l | 2 | 11.1% |
R | 1 | 5.6% |
r | 1 | 5.6% |
p | 1 | 5.6% |
t | 1 | 5.6% |
y | 1 | 5.6% |
(space) | 1 | 5.6% |
Other values (3) | 3 | 16.7% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 18 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 18 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 18 | 100.0% |
Correlations
 | category_code | source |
---|---|---|
category_code | 1.000 | 0.100 |
source | 0.100 | 1.000 |
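The report does not state which association measure produces this matrix. For two categorical columns, one common choice is Cramér's V, which is derived from the chi-squared statistic of the contingency table and ranges from 0 (no association) to 1 (perfect association). The sketch below is an assumption about how such a coefficient could be computed, not the report's confirmed method; it implements the plain, uncorrected form, and the function name `cramers_v` is illustrative:

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))

    # Chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return sqrt(chi2 / (n * k))

# Toy inputs (hypothetical, not taken from the dataset).
print(cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"]))  # perfectly associated
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))  # independent
```

A low value such as the 0.100 shown above would indicate that knowing the `source` of an entry says little about its `category_code`, which is plausible given that 178 of 189 rows share the same source.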
Missing values
Sample
First rows
 | url | category_code | date_added | source | notes |
---|---|---|---|---|---|
0 | http://abrahadesta.wordpress.com/ | CULTR | 2014-04-15 | citizenlab | NaN |
1 | http://aljazeera.net/ | NEWS | 2014-04-15 | citizenlab | NaN |
2 | http://am.wikipedia.org/ | MISC | 2014-04-15 | citizenlab | NaN |
3 | http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | MISC | 2014-04-15 | citizenlab | NaN |
4 | http://amharic.voanews.com/ | NEWS | 2014-04-15 | citizenlab | NaN |
5 | http://ancientgebts.org/ | HUMR | 2014-04-15 | citizenlab | NaN |
6 | http://carpediemethiopia.blogspot.com/ | POLR | 2014-04-15 | citizenlab | NaN |
7 | http://citizenlab.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
8 | http://cpj.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
9 | http://egoportal.blogspot.com/ | POLR | 2014-04-15 | citizenlab | NaN |
Last rows
 | url | category_code | date_added | source | notes |
---|---|---|---|---|---|
179 | https://www.citizenlab.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
180 | https://www.dropbox.com/s/n65b3d67f82asn2/Leaked%20National%20Entrance%20Exam_English.pdf?dl=0 | FILE | 2016-05-30 | OONI | NaN |
181 | https://www.facebook.com/Jawarmd | NEWS | 2016-05-30 | OONI | NaN |
182 | https://www.facebook.com/pages/Addis-Neger/49967100821 | NEWS | 2014-04-15 | citizenlab | NaN |
183 | https://www.hrw.org/ | HUMR | 2014-04-15 | citizenlab | NaN |
184 | https://www.mereja.com/ | NEWS | 2016-09-09 | CIPIT | NaN |
185 | https://www.oromiamedia.org/ | NEWS | 2016-05-30 | OONI | NaN |
186 | https://www.privacyinternational.org/ | HUMR | 2014-04-15 | citizenlab | NaN |
187 | https://www.torproject.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
188 | https://www.twitter.com/ | HOST | 2014-04-15 | citizenlab | NaN |