Difference between revisions of "WikiData Import 2020-07-30"
Revision as of 08:22, 31 July 2020
Download and unpack
This import uses the "latest-all.nt" dataset, downloaded as the bzip2-compressed file latest-all.nt.bz2.
wget https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.nt.bz2
--2020-07-30 06:40:18-- https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.nt.bz2
Resolving dumps.wikimedia.org (dumps.wikimedia.org)... 2620::861:1:208:80:154:7, 208.80.154.7
Connecting to dumps.wikimedia.org (dumps.wikimedia.org)|2620::861:1:208:80:154:7|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 118776910150 (111G) [application/octet-stream]
Saving to: ‘latest-all.nt.bz2’
latest-all.nt.bz2 0%[ ] 635.10M 5.02MB/s eta 6h 19m
...
latest-all.nt.bz2 100%[===================>] 110.62G 4.88MB/s in 6h 31m
2020-07-30 13:11:49 (4.82 MB/s) - ‘latest-all.nt.bz2’ saved [118776910150/118776910150]
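Before unpacking, it may be worth confirming that the file on disk has the length wget reported (118776910150 bytes). The following is a minimal sketch; `check_size` is a hypothetical helper name, not part of wget or bzip2, and it assumes `wc -c` reports the byte count of a regular file.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if FILE is exactly EXPECTED bytes long.
check_size() {
  file=$1
  expected=$2
  # wc -c prints the byte count; tr strips the padding some wc versions add
  actual=$(wc -c < "$file" | tr -d ' ')
  if [ "$actual" = "$expected" ]; then
    echo "OK: $file has $actual bytes"
  else
    echo "MISMATCH: $file has $actual bytes, expected $expected" >&2
    return 1
  fi
}

# Usage with the length reported by wget above:
# check_size latest-all.nt.bz2 118776910150
```

A matching size only shows that the transfer was complete. `bzip2 -t latest-all.nt.bz2` additionally tests the integrity of the archive without writing any output, although on a file of this size it is likely to run for a long time.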
# unpack: -d decompresses, -k keeps the compressed original
bzip2 -dk latest-all.nt.bz2
ls -l latest-all.nt
-rw------- 1 wf admin 1980899328 Jul 30 17:56 latest-all.nt
# a few hours later
ls -l latest-all.nt.bz2
-rw-r--r-- 1 wf admin 118776910150 Jul 23 19:36 latest-all.nt.bz2
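The byte counts in the listings above are easier to compare in GiB. The sketch below converts them; `to_gib` and `size_gib` are hypothetical helper names introduced here for illustration, and the conversion assumes 1 GiB = 1024^3 bytes, which is the unit wget uses for its "110.62G" figure.

```shell
#!/bin/sh
# Hypothetical helper: convert a byte count to GiB with two decimals.
to_gib() {
  # LC_ALL=C keeps the decimal point independent of the user's locale
  LC_ALL=C awk -v b="$1" 'BEGIN { printf "%.2f\n", b / (1024 * 1024 * 1024) }'
}

# Hypothetical helper: print the current size of FILE in GiB.
size_gib() {
  to_gib "$(wc -c < "$1" | tr -d ' ')"
}

to_gib 1980899328     # latest-all.nt as listed above, prints 1.84
to_gib 118776910150   # latest-all.nt.bz2, prints 110.62
```

To watch the unpacked file grow while bzip2 is still running, a loop such as `while sleep 60; do size_gib latest-all.nt; done` could be used. The final uncompressed size is not stated here, so this shows growth rather than a percentage.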