Table filled in from the top 10M most popular websites.
- List Quality
- This list is ordered by page rank rather than traffic
- May include inactive or redirected sites
- Does not reflect actual traffic or views
- Other lists that might be interesting:
- https://s3-us-west-1.amazonaws.com/umbrella-static/index.html
- https://www.commoncrawl.org/
- https://tranco-list.eu/
- Not that concerned if this list is exact
- Should provide good sampling of top sites
- Order really isn't important in this case
- This domain is decades old and not in the list!
- It doesn't get much traffic, but not even in the top 10M??
- Good performance test of a database table with 10M rows
- Cold load times take several seconds when paged out
- Well indexed but not performing as well as expected
- Much larger tables have performed much better
- Appears to be doing a full table scan
- This database is running in SQL Server under Docker.
- Will load the same dataset into PostgreSQL for comparison.
- Started crawling the home page for the first 1M domains.
- Interested in stats on use of HTML5, proper HTML, etc.
- Started with the first 100K sites and expanding to first 1M.
- Observations
- Surprising number of domains without a proper HTML lang attribute
- Surprising number of domains not using a proper HTML 5 doctype
- The domains in the list are naked, not fully qualified with a hostname.
- Many domains don't have a DNS entry for the naked domain.
- domain.com vs www.domain.com: the naked domain should redirect.
- Surprising number of SSL errors on the naked domain
- Clients can't connect without dropping SSL verification
- With standard security checks, clients will never get the redirect
- Some domains redirect from behind an invalid SSL cert
- This is super easy to fix: park the naked domain with a valid cert to handle redirects.
- That's lost revenue, given how much linking it takes to get on this list
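The naked-domain checks described above can be sketched roughly as follows. This is a minimal illustration, not the crawler actually used here: the function names (`candidate_urls`, `classify_failure`, `probe`), the failure buckets, and the User-Agent string are all assumptions. It uses only the Python standard library and keeps certificate verification on, so a bad cert on the naked domain shows up as an `ssl` failure instead of a redirect.

```python
import socket
import ssl
import urllib.error
import urllib.request


def candidate_urls(domain):
    """Return the naked and www HTTPS URLs to try for one list entry."""
    domain = domain.strip().lower().rstrip(".")
    hosts = [domain]
    if not domain.startswith("www."):
        hosts.append("www." + domain)
    return ["https://" + host + "/" for host in hosts]


def classify_failure(exc):
    """Map an exception from urlopen to a coarse bucket (hypothetical names)."""
    if isinstance(exc, urllib.error.HTTPError):
        return "http"
    # URLError wraps the underlying socket/TLS error in its .reason attribute.
    cause = exc.reason if isinstance(exc, urllib.error.URLError) else exc
    if isinstance(cause, ssl.SSLError):
        return "ssl"
    if isinstance(cause, socket.gaierror):
        return "dns"
    return "other"


def probe(url, timeout=10.0):
    """Fetch url with default certificate verification left enabled."""
    context = ssl.create_default_context()
    request = urllib.request.Request(
        url, headers={"User-Agent": "domain-survey-sketch/0.1"}  # assumed UA
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout, context=context) as resp:
            # geturl() is the final URL after any redirects were followed.
            return {"url": url, "status": resp.status, "final_url": resp.geturl()}
    except OSError as exc:  # URLError, HTTPError and ssl.SSLError all derive from OSError
        return {"url": url, "failure": classify_failure(exc)}


if __name__ == "__main__":
    for url in candidate_urls("example.com"):
        print(probe(url))
```

A domain where the naked URL lands in the `ssl` or `dns` bucket while the `www.` URL returns 200 is the pattern noted above. For entries that are already subdomains, prepending `www.` may not be meaningful.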
Selected Top Domains
# | Domain | Sort | Rank | Status | Flags |
---|---|---|---|---|---|
3201 | shell.com | 3856 | 5.48 | 200 | HTML 5, English |
3202 | iban.com | 3857 | 5.48 | 200 | HTML 5, English |
3203 | pubs.usgs.gov | 3858 | 5.48 | 200 | HTML 5, English |
3204 | courtlistener.com | 3860 | 5.48 | 200 | HTML 5, English |
3205 | archive.nytimes.com | 3861 | 5.48 | 200 | HTML 5, English |
3206 | openreview.net | 3862 | 5.48 | 200 | HTML 5, English |
3207 | pubchem.ncbi.nlm.nih.gov | 3863 | 5.48 | 200 | HTML 5, English |
3208 | jcpenney.com | 3864 | 5.48 | 200 | HTML 5, English |
3209 | chromium.googlesource.com | 3865 | 5.48 | 200 | HTML 5, English |
3210 | heyzine.com | 3866 | 5.48 | 200 | HTML 5, English |
3211 | earthobservatory.nasa.gov | 3867 | 5.48 | 200 | HTML 5, English |
3212 | freemusicarchive.org | 3868 | 5.48 | 200 | HTML 5, English |
3213 | realtyna.com | 3869 | 5.48 | 200 | HTML 5, English |
3214 | health.ny.gov | 3871 | 5.48 | 200 | English, Strict |
3215 | media.defense.gov | 3872 | 5.48 | 200 | HTML 5, English |
3216 | rescuetime.com | 3873 | 5.48 | 200 | HTML 5, English |
3217 | education.minecraft.net | 3874 | 5.48 | 200 | HTML 5, English |
3218 | google.org | 3875 | 5.48 | 200 | HTML 5, English |
3219 | regex101.com | 3876 | 5.48 | 200 | HTML 5, English |
3220 | googledevelopers.blogspot.com | 3877 | 5.48 | 200 | HTML 5, English |
3221 | webfoundation.org | 3878 | 5.48 | 200 | HTML 5, English |
3222 | deloitte.com | 3879 | 5.48 | 200 | HTML 5, English |
3223 | vulture.com | 3880 | 5.48 | 200 | HTML 5, English |
3224 | siliconangle.com | 3881 | 5.48 | 200 | HTML 5, English |
3225 | authorstream.com | 3882 | 5.48 | 200 | HTML 5, No Lang |
3226 | support.lenovo.com | 3883 | 5.47 | 200 | HTML 5, No Lang |
3227 | triblive.com | 3884 | 5.47 | 200 | HTML 5, English |
3228 | teacherspayteachers.com | 3885 | 5.47 | 200 | HTML 5, English |
3229 | liveinternet.ru | 3886 | 5.47 | 200 | No Lang, Transitional |
3230 | bbcamerica.com | 3887 | 5.47 | 200 | HTML 5, English |
3231 | european-union.europa.eu | 3888 | 5.47 | 200 | HTML 5, English |
3232 | deutschlandfunk.de | 3890 | 5.47 | 200 | HTML 5 |
3233 | keys.openpgp.org | 3891 | 5.47 | 200 | HTML 5, English |
3234 | contentmarketinginstitute.com | 3892 | 5.47 | 200 | HTML 5, English |
3235 | coub.com | 3893 | 5.47 | 200 | HTML 5, No Lang |
3236 | a11yproject.com | 3897 | 5.47 | 200 | HTML 5, English |
3237 | nhc.noaa.gov | 3899 | 5.47 | 200 | No Lang, Transitional |
3238 | subway.com | 3901 | 5.47 | 200 | HTML 5, English |
3239 | indiana.edu | 3902 | 5.47 | 200 | HTML 5, English |
3240 | affinity.serif.com | 3903 | 5.47 | 200 | HTML 5, English |
3241 | translate.yandex.com | 3904 | 5.47 | 200 | HTML 5, English |
3242 | veed.io | 3905 | 5.47 | 200 | HTML 5, English |
3243 | tidal.com | 3906 | 5.47 | 200 | HTML 5, English |
3244 | figshare.com | 3907 | 5.47 | 200 | HTML 5, English |
3245 | paulgraham.com | 3908 | 5.47 | 200 | No Lang |
3246 | ncei.noaa.gov | 3909 | 5.47 | 200 | HTML 5, English |
3247 | marca.com | 3910 | 5.47 | 200 | HTML 5 |
3248 | subito.it | 3912 | 5.47 | 200 | HTML 5 |
3249 | sheknows.com | 3913 | 5.47 | 200 | HTML 5, English |
3250 | bgr.com | 3914 | 5.47 | 200 | HTML 5, English |
3251 | bugs.webkit.org | 3916 | 5.47 | 200 | HTML 5, English |
3252 | 3cx.com | 3917 | 5.47 | 200 | English |
3253 | creativereview.co.uk | 3918 | 5.47 | 200 | HTML 5, English |
3254 | fws.gov | 3919 | 5.47 | 200 | HTML 5, English |
3255 | leanpub.com | 3920 | 5.47 | 200 | HTML 5, No Lang |
3256 | superpages.com | 3921 | 5.47 | 200 | HTML 5, English |
3257 | secondlife.com | 3922 | 5.47 | 200 | HTML 5, English |
3258 | computerhistory.org | 3923 | 5.47 | 200 | HTML 5, English |
3259 | wallpaper.com | 3924 | 5.47 | 200 | HTML 5, English |
3260 | thestar.com.my | 3925 | 5.47 | 200 | English |
3261 | later.com | 3927 | 5.47 | 200 | HTML 5, English |
3262 | stylecaster.com | 3928 | 5.47 | 200 | HTML 5, English |
3263 | akc.org | 3929 | 5.47 | 200 | HTML 5, English |
3264 | teachable.com | 3930 | 5.47 | 200 | HTML 5, English |
3265 | brevo.com | 3931 | 5.47 | 200 | HTML 5, English |
3266 | skyscanner.net | 3932 | 5.47 | 200 | HTML 5, English |
3267 | wto.org | 3933 | 5.47 | 200 | HTML 5, English |
3268 | rstudio.com | 3934 | 5.47 | 200 | HTML 5, English |
3269 | worldline.com | 3935 | 5.47 | 200 | HTML 5, English |
3270 | themify.me | 3936 | 5.47 | 200 | HTML 5, English |
3271 | snapwidget.com | 3937 | 5.47 | 200 | HTML 5, English |
3272 | overstock.com | 3938 | 5.47 | 200 | HTML 5, English |
3273 | us.macmillan.com | 3939 | 5.47 | 200 | HTML 5, English |
3274 | accessify.com | 3940 | 5.47 | 200 | HTML 5, English |
3275 | keep.google.com | 3941 | 5.47 | 200 | HTML 5, English |
3276 | newsroom.ibm.com | 3942 | 5.47 | 200 | HTML 5, English |
3277 | datadoghq.com | 3943 | 5.47 | 200 | HTML 5, English |
3278 | spreadsheets.google.com | 3944 | 5.47 | 200 | HTML 5, English |
3279 | cgtrader.com | 3945 | 5.47 | 200 | HTML 5, English |
3280 | uqtr.ca | 3947 | 5.47 | 200 | |
3281 | explore.zoom.us | 3948 | 5.47 | 200 | HTML 5, English |
3282 | carnival.com | 3949 | 5.47 | 200 | HTML 5, English |
3283 | thunderbird.net | 3950 | 5.47 | 200 | HTML 5, English |
3284 | morningconsult.com | 3951 | 5.47 | 200 | HTML 5, English |
3285 | cisecurity.org | 3952 | 5.47 | 200 | HTML 5, English |
3286 | eset.com | 3953 | 5.47 | 200 | HTML 5, English |
3287 | overcast.fm | 3954 | 5.47 | 200 | HTML 5, English |
3288 | avira.com | 3955 | 5.47 | 200 | HTML 5, English |
3289 | benzinga.com | 3956 | 5.47 | 200 | HTML 5, English |
3290 | bangordailynews.com | 3957 | 5.47 | 200 | HTML 5, English |
3291 | analyticsindiamag.com | 3958 | 5.47 | 200 | HTML 5, English |
3292 | claris.com | 3960 | 5.47 | 200 | HTML 5, English |
3293 | marykay.com | 3961 | 5.47 | 200 | HTML 5, English |
3294 | themarkup.org | 3962 | 5.47 | 200 | HTML 5, English |
3295 | spdx.org | 3963 | 5.47 | 200 | HTML 5, English |
3296 | oasis-open.org | 3964 | 5.47 | 200 | HTML 5, English |
3297 | umass.edu | 3965 | 5.47 | 200 | HTML 5, English |
3298 | trendhunter.com | 3966 | 5.47 | 200 | HTML 5, No Lang |
3299 | revolve.com | 3967 | 5.47 | 200 | HTML 5, English |
3300 | validator.schema.org | 3969 | 5.47 | 200 | HTML 5, No Lang |
Data from: Open PageRank
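For illustration, the Flags column above could be derived from a fetched home page along these lines. This is a sketch under assumptions, not the code that produced the table: the regexes, the function names, and the flag ordering (HTML 5 first, then the language flag, then any legacy doctype) are inferred from the rows shown, and non-English `lang` values are assumed to produce no language flag.

```python
import re

# Assumed patterns: only the doctype declaration and the <html> tag are inspected.
_DOCTYPE_RE = re.compile(r"<!doctype\s+([^>]*)>", re.IGNORECASE)
_LANG_RE = re.compile(
    r"<html\b[^>]*?\slang\s*=\s*[\"']?([A-Za-z-]+)", re.IGNORECASE
)


def classify_doctype(html):
    """Return 'HTML 5', 'Transitional', 'Strict', or None for the doctype."""
    match = _DOCTYPE_RE.search(html[:2048])  # the doctype should be near the top
    if not match:
        return None
    decl = match.group(1).strip().lower()
    if decl == "html":
        return "HTML 5"
    if "transitional" in decl or "loose.dtd" in decl:
        return "Transitional"
    if "strict" in decl:
        return "Strict"
    return None


def page_flags(html):
    """Build a Flags string such as 'HTML 5, English' for one page."""
    flags = []
    doctype = classify_doctype(html)
    if doctype == "HTML 5":
        flags.append(doctype)

    lang = _LANG_RE.search(html)
    if lang is None:
        flags.append("No Lang")
    elif lang.group(1).lower().startswith("en"):
        flags.append("English")

    if doctype in ("Transitional", "Strict"):
        flags.append(doctype)
    return ", ".join(flags)


if __name__ == "__main__":
    print(page_flags('<!DOCTYPE html><html lang="en-US"><head></head></html>'))
```

A regex check like this only looks at the raw markup; a real crawl would also need to handle character encodings and pages that emit the doctype late or not at all.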