Overview
Brought to you by YData
Dataset statistics
| Number of variables | 5 |
|---|---|
| Number of observations | 189 |
| Missing cells | 188 |
| Missing cells (%) | 19.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.5 KiB |
| Average record size in memory | 40.7 B |
Variable types
| URL | 1 |
|---|---|
| Categorical | 2 |
| DateTime | 1 |
| Text | 1 |
Reproduction
| Analysis started | 2025-03-11 15:19:35.139405 |
|---|---|
| Analysis finished | 2025-03-11 15:19:35.360465 |
| Duration | 0.22 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
url
URL
Unique 
| Distinct | 189 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| http://abrahadesta.wordpress.com/ | 1 |
|---|---|
| http://aljazeera.net/ | 1 |
| http://am.wikipedia.org/ | 1 |
| http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | 1 |
| http://amharic.voanews.com/ | 1 |
| Other values (184) |
| Value | Count | Frequency (%) |
| http://abrahadesta.wordpress.com/ | 1 | 0.5% |
| http://aljazeera.net/ | 1 | 0.5% |
| http://am.wikipedia.org/ | 1 | 0.5% |
| http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | 1 | 0.5% |
| http://amharic.voanews.com/ | 1 | 0.5% |
| http://ancientgebts.org/ | 1 | 0.5% |
| http://carpediemethiopia.blogspot.com/ | 1 | 0.5% |
| http://citizenlab.org/ | 1 | 0.5% |
| http://cpj.org/ | 1 | 0.5% |
| http://egoportal.blogspot.com/ | 1 | 0.5% |
| Other values (179) | 179 |
| Value | Count | Frequency (%) |
| http | 173 | |
| https | 16 | 8.5% |
| Value | Count | Frequency (%) |
| nazret.com | 8 | 4.2% |
| www.cafpde.org | 3 | 1.6% |
| www.hrw.org | 3 | 1.6% |
| am.wikipedia.org | 2 | 1.1% |
| www.awate.com | 2 | 1.1% |
| citizenlab.org | 2 | 1.1% |
| facebook.com | 2 | 1.1% |
| www.ethiopiafirst.com | 2 | 1.1% |
| portal.unesco.org | 2 | 1.1% |
| www.aigaforum.com | 2 | 1.1% |
| Other values (134) | 161 |
| Value | Count | Frequency (%) |
| / | 127 | |
| /blog/index.php | 7 | 3.7% |
| /index.html | 2 | 1.1% |
| /index.htm | 2 | 1.1% |
| /story/201306250132-0022854 | 1 | 0.5% |
| /geography/en/ev.php-URL_ID=3559&URL_DO=DO_TOPIC&URL_SECTION=201.html | 1 | 0.5% |
| /wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | 1 | 0.5% |
| /library/eng-eth/index | 1 | 0.5% |
| /~ena/ | 1 | 0.5% |
| /new/index.asp | 1 | 0.5% |
| Other values (45) | 45 | 23.8% |
| Value | Count | Frequency (%) |
| 174 | ||
| blog=12 | 1 | 0.5% |
| blog=13 | 1 | 0.5% |
| blog=14 | 1 | 0.5% |
| blog=15 | 1 | 0.5% |
| blog=16 | 1 | 0.5% |
| blog=7 | 1 | 0.5% |
| blog=9 | 1 | 0.5% |
| c=ethiop&t=africa | 1 | 0.5% |
| feed=5&how=paged&what=all | 1 | 0.5% |
| Other values (6) | 6 | 3.2% |
| Value | Count | Frequency (%) |
| 188 | ||
| ethiopia | 1 | 0.5% |
category_code
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| NEWS | |
|---|---|
| HUMR | |
| POLR | |
| ECON | |
| ANON | |
| Other values (10) |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 3 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | CULTR |
|---|---|
| 2nd row | NEWS |
| 3rd row | MISC |
| 4th row | MISC |
| 5th row | NEWS |
Common Values
| Value | Count | Frequency (%) |
| NEWS | 65 | |
| HUMR | 45 | |
| POLR | 32 | |
| ECON | 13 | 6.9% |
| ANON | 8 | 4.2% |
| CULTR | 7 | 3.7% |
| XED | 5 | 2.6% |
| MISC | 3 | 1.6% |
| HOST | 3 | 1.6% |
| PUBH | 2 | 1.1% |
| Other values (5) | 6 | 3.2% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| news | 65 | |
| humr | 45 | |
| polr | 32 | |
| econ | 13 | 6.9% |
| anon | 8 | 4.2% |
| cultr | 7 | 3.7% |
| xed | 5 | 2.6% |
| misc | 3 | 1.6% |
| host | 3 | 1.6% |
| pubh | 2 | 1.1% |
| Other values (5) | 6 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 95 | |
| R | 86 | |
| E | 85 | |
| S | 72 | |
| W | 65 | |
| O | 56 | |
| U | 54 | |
| H | 51 | |
| M | 50 | |
| L | 42 | |
| Other values (11) | 100 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 756 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 95 | |
| R | 86 | |
| E | 85 | |
| S | 72 | |
| W | 65 | |
| O | 56 | |
| U | 54 | |
| H | 51 | |
| M | 50 | |
| L | 42 | |
| Other values (11) | 100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 756 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 95 | |
| R | 86 | |
| E | 85 | |
| S | 72 | |
| W | 65 | |
| O | 56 | |
| U | 54 | |
| H | 51 | |
| M | 50 | |
| L | 42 | |
| Other values (11) | 100 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 756 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 95 | |
| R | 86 | |
| E | 85 | |
| S | 72 | |
| W | 65 | |
| O | 56 | |
| U | 54 | |
| H | 51 | |
| M | 50 | |
| L | 42 | |
| Other values (11) | 100 |
date_added
Date
| Distinct | 6 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| Minimum | 2014-04-15 00:00:00 |
|---|---|
| Maximum | 2018-04-10 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Histogram with fixed size bins (bins=6)
source
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 KiB |
| citizenlab | |
|---|---|
| OONI | 4 |
| CIPIT | 4 |
| BBC | 2 |
| defenddefenders | 1 |
Length
| Max length | 15 |
|---|---|
| Median length | 10 |
| Mean length | 9.7195767 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | citizenlab |
|---|---|
| 2nd row | citizenlab |
| 3rd row | citizenlab |
| 4th row | citizenlab |
| 5th row | citizenlab |
Common Values
| Value | Count | Frequency (%) |
| citizenlab | 178 | |
| OONI | 4 | 2.1% |
| CIPIT | 4 | 2.1% |
| BBC | 2 | 1.1% |
| defenddefenders | 1 | 0.5% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| citizenlab | 178 | |
| ooni | 4 | 2.1% |
| cipit | 4 | 2.1% |
| bbc | 2 | 1.1% |
| defenddefenders | 1 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 356 | |
| e | 183 | |
| n | 180 | |
| c | 178 | |
| z | 178 | |
| t | 178 | |
| l | 178 | |
| a | 178 | |
| b | 178 | |
| I | 12 | 0.7% |
| Other values (10) | 38 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1837 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 356 | |
| e | 183 | |
| n | 180 | |
| c | 178 | |
| z | 178 | |
| t | 178 | |
| l | 178 | |
| a | 178 | |
| b | 178 | |
| I | 12 | 0.7% |
| Other values (10) | 38 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1837 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 356 | |
| e | 183 | |
| n | 180 | |
| c | 178 | |
| z | 178 | |
| t | 178 | |
| l | 178 | |
| a | 178 | |
| b | 178 | |
| I | 12 | 0.7% |
| Other values (10) | 38 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1837 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 356 | |
| e | 183 | |
| n | 180 | |
| c | 178 | |
| z | 178 | |
| t | 178 | |
| l | 178 | |
| a | 178 | |
| b | 178 | |
| I | 12 | 0.7% |
| Other values (10) | 38 | 2.1% |
notes
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 188 |
| Missing (%) | 99.5% |
| Memory size | 1.6 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Reportedly blocked |
|---|
| Value | Count | Frequency (%) |
| reportedly | 1 | |
| blocked | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| o | 2 | |
| d | 2 | |
| l | 2 | |
| R | 1 | 5.6% |
| r | 1 | 5.6% |
| p | 1 | 5.6% |
| t | 1 | 5.6% |
| y | 1 | 5.6% |
| 1 | 5.6% | |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 3 | |
| o | 2 | |
| d | 2 | |
| l | 2 | |
| R | 1 | 5.6% |
| r | 1 | 5.6% |
| p | 1 | 5.6% |
| t | 1 | 5.6% |
| y | 1 | 5.6% |
| 1 | 5.6% | |
| Other values (3) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 3 | |
| o | 2 | |
| d | 2 | |
| l | 2 | |
| R | 1 | 5.6% |
| r | 1 | 5.6% |
| p | 1 | 5.6% |
| t | 1 | 5.6% |
| y | 1 | 5.6% |
| 1 | 5.6% | |
| Other values (3) | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 3 | |
| o | 2 | |
| d | 2 | |
| l | 2 | |
| R | 1 | 5.6% |
| r | 1 | 5.6% |
| p | 1 | 5.6% |
| t | 1 | 5.6% |
| y | 1 | 5.6% |
| 1 | 5.6% | |
| Other values (3) | 3 |
Correlations
| category_code | source | |
|---|---|---|
| category_code | 1.000 | 0.100 |
| source | 0.100 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| url | category_code | date_added | source | notes | |
|---|---|---|---|---|---|
| 0 | http://abrahadesta.wordpress.com/ | CULTR | 2014-04-15 | citizenlab | NaN |
| 1 | http://aljazeera.net/ | NEWS | 2014-04-15 | citizenlab | NaN |
| 2 | http://am.wikipedia.org/ | MISC | 2014-04-15 | citizenlab | NaN |
| 3 | http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD | MISC | 2014-04-15 | citizenlab | NaN |
| 4 | http://amharic.voanews.com/ | NEWS | 2014-04-15 | citizenlab | NaN |
| 5 | http://ancientgebts.org/ | HUMR | 2014-04-15 | citizenlab | NaN |
| 6 | http://carpediemethiopia.blogspot.com/ | POLR | 2014-04-15 | citizenlab | NaN |
| 7 | http://citizenlab.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
| 8 | http://cpj.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
| 9 | http://egoportal.blogspot.com/ | POLR | 2014-04-15 | citizenlab | NaN |
| url | category_code | date_added | source | notes | |
|---|---|---|---|---|---|
| 179 | https://www.citizenlab.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
| 180 | https://www.dropbox.com/s/n65b3d67f82asn2/Leaked%20National%20Entrance%20Exam_English.pdf?dl=0 | FILE | 2016-05-30 | OONI | NaN |
| 181 | https://www.facebook.com/Jawarmd | NEWS | 2016-05-30 | OONI | NaN |
| 182 | https://www.facebook.com/pages/Addis-Neger/49967100821 | NEWS | 2014-04-15 | citizenlab | NaN |
| 183 | https://www.hrw.org/ | HUMR | 2014-04-15 | citizenlab | NaN |
| 184 | https://www.mereja.com/ | NEWS | 2016-09-09 | CIPIT | NaN |
| 185 | https://www.oromiamedia.org/ | NEWS | 2016-05-30 | OONI | NaN |
| 186 | https://www.privacyinternational.org/ | HUMR | 2014-04-15 | citizenlab | NaN |
| 187 | https://www.torproject.org/ | NEWS | 2014-04-15 | citizenlab | NaN |
| 188 | https://www.twitter.com/ | HOST | 2014-04-15 | citizenlab | NaN |