Wikidata Import 2023-05-05
Download ~6h:30
Download Options
https://dumps.wikimedia.org/wikidatawiki/entities
dcatap.rdf               04-May-2023 18:19         84753
latest-all.json.bz2      03-May-2023 21:06   81640390615
latest-all.json.gz       03-May-2023 12:47  123885468527
latest-all.nt.bz2        04-May-2023 16:07  158382342866
latest-all.nt.gz         03-May-2023 22:23  205171447838
latest-all.ttl.bz2       04-May-2023 03:24  101606862077
latest-all.ttl.gz        03-May-2023 17:08  124093922794
latest-lexemes.json.bz2  03-May-2023 03:53     305234182
latest-lexemes.json.gz   03-May-2023 03:51     416121890
latest-lexemes.nt.bz2    28-Apr-2023 23:34     778797047
latest-lexemes.nt.gz     28-Apr-2023 23:29    1019519966
latest-lexemes.ttl.bz2   28-Apr-2023 23:30     440519100
latest-lexemes.ttl.gz    28-Apr-2023 23:26     548481488
latest-truthy.nt.bz2     28-Apr-2023 22:23   36023954950
latest-truthy.nt.gz      28-Apr-2023 19:07   59758277315
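The sizes in the listing are plain byte counts, which are awkward to compare with free disk space. As a minimal sketch (the helper name `bytes_to_gib` is an assumption and not part of the original scripts), awk can convert them to GiB:

```shell
#!/bin/bash
# bytes_to_gib is a hypothetical helper name, not part of download.sh
# convert a byte count to GiB (2^30 bytes) with one decimal place
bytes_to_gib() {
  awk -v bytes="$1" 'BEGIN { printf "%.1f\n", bytes / (1024 * 1024 * 1024) }'
}

# size of latest-all.ttl.bz2 as given in the listing
bytes_to_gib 101606862077
```

For latest-all.ttl.bz2 this gives about 94.6 GiB, which wget rounds to 95G in the download log.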
Download script
cat download.sh
#!/bin/bash
# WF 2023-04-26
# download wikidata dumps
baseurl=https://dumps.wikimedia.org/wikidatawiki/entities/
for file in latest-all latest-lexemes
do
  for ext in ttl.bz2
  do
    url="$baseurl/$file.$ext"
    log="$file-$ext.log"
    # run in the background; wget reports progress on stderr,
    # so both streams are appended to the log
    nohup wget "$url" >> "$log" 2>&1 &
  done
done
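The base URL in the script already ends with a slash and the loop adds another one, which is why the log shows `entities//latest-all.ttl.bz2`. The server accepted that request, but the join can be normalized. A minimal sketch, assuming a hypothetical helper `dump_url` that is not part of the original script; it only prints the URLs (dry run) and downloads nothing:

```shell
#!/bin/bash
# dump_url is a hypothetical helper name, not part of download.sh
# join base URL, file name and extension with exactly one slash
dump_url() {
  local base="${1%/}"   # drop one trailing slash, if present
  echo "$base/$2.$3"
}

baseurl=https://dumps.wikimedia.org/wikidatawiki/entities/
for file in latest-all latest-lexemes
do
  # dry run: print the URL instead of calling wget
  dump_url "$baseurl" "$file" ttl.bz2
done
```

Printing the URLs first is a cheap way to check the loop before starting a multi-hour transfer.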
Download logs
--2023-05-05 08:09:14-- https://dumps.wikimedia.org/wikidatawiki/entities//latest-all.ttl.bz2
Resolving dumps.wikimedia.org (dumps.wikimedia.org)... 208.80.154.142, 2620:0:861:2:208:80:154:142
Connecting to dumps.wikimedia.org (dumps.wikimedia.org)|208.80.154.142|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 101606862077 (95G) [application/octet-stream]
Saving to: ‘latest-all.ttl.bz2’
0K .......... .......... .......... .......... .......... 0% 400K 2d20h
50K .......... .......... .......... .......... .......... 0% 222K 4d0h
100K .......... .......... .......... .......... .......... 0% 399K 3d15h
99225450K . 100% 2.32T=6h50m
2023-05-05 15:00:04 (3.93 MB/s) - ‘latest-all.ttl.bz2’ saved [101606862077/101606862077]
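The timestamps and byte count in the log are enough to cross-check the reported average rate. A minimal sketch, assuming both timestamps fall on the same day; the helper names `elapsed_seconds` and `avg_mib_per_s` are hypothetical:

```shell
#!/bin/bash
# elapsed_seconds and avg_mib_per_s are hypothetical helper names

# seconds between two HH:MM:SS timestamps on the same day
elapsed_seconds() {
  awk -v s="$1" -v e="$2" 'BEGIN {
    split(s, a, ":"); split(e, b, ":")
    print (b[1] * 3600 + b[2] * 60 + b[3]) - (a[1] * 3600 + a[2] * 60 + a[3])
  }'
}

# average throughput in MiB/s for a byte count and a duration in seconds
avg_mib_per_s() {
  awk -v bytes="$1" -v secs="$2" 'BEGIN { printf "%.2f\n", bytes / secs / (1024 * 1024) }'
}

# start and end times and the file size are taken from the log
secs=$(elapsed_seconds 08:09:14 15:00:04)
echo "elapsed: ${secs}s"
echo "average: $(avg_mib_per_s 101606862077 "$secs") MiB/s"
```

With the values from the log, the elapsed time is 24650 s (6h50m50s) and the average comes out at 3.93 MiB/s, matching the rate wget reports.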
Munging
Preparation
see Wikidata_Import_2023-04-26#Preparation_.7E20-30_min