WikiData Import 2022-05-22

From BITPlan Wiki
Revision as of 08:21, 23 May 2022 by Wf (talk | contribs)
Jump to navigation Jump to search

see QLever/script as discussed in QLever Issue #562 for the script which makes reproducing this attempt easier.

since the https://github.com/ad-freiburg/qlever-control now has an official "qlever" script we have rename the script that has the purpose to make the import attempts reproducible to qleverauto.

Preparations

WikiData_Import_2022-05-21#Build_code steps still apply for this attempt using the native/compiled version of qlever.


qleverauto environment checks

./qleverauto -v
qleverauto version : 1.28 $ : 2022/05/23 05:59:46 $
# some changes were done during this attempt
./qleverauto -v
qleverauto version : 1.29 $ : 2022/05/23 06:15:28 $
./qleverauto -e
needed software
docker → /usr/bin/docker ✅
top → /usr/bin/top ✅
df → /usr/bin/df ✅
jq → /usr/bin/jq ✅
lsb_release → /usr/bin/lsb_release ✅
free → /usr/bin/free ✅
operating system
No LSB modules are available.
Distributor ID:	Ubuntu
Description:	Ubuntu 20.04.4 LTS
Release:	20.04
Codename:	focal
docker version
Docker version 20.10.16, build aa7e414
memory
              total        used        free      shared  buff/cache   available
Mem:          125Gi       5,3Gi        22Gi        43Mi        98Gi       119Gi
Swap:         2,0Gi       8,0Mi       2,0Gi
diskspace
/dev/sdb5       116G   25G   86G  23% /
tmpfs            63G   16K   63G   1% /dev/shm
/dev/sda1       3,6T  2,3T  1,2T  66% /hd/seel
/dev/sdb1       511M  4,0K  511M   1% /boot/efi
soft ulimit for files
1048576

Wikidata dump download

Dump is still quite recent from latest attempt

./qleverauto -wd
wikidata lexemes:latest-lexemes.ttl.bz2 already downloaded
wikidata dump:latest-all.ttl.bz2 already downloaded

Wikidata indexing

nohup ./qleverauto -wi&