Difference between revisions of "Wikidata Import 2024-02-18"

From BITPlan Wiki
Jump to navigation Jump to search
Line 1: Line 1:
 
{{PageSequence|prev=Wikidata Import 2024-01-20|next=|category=Wikidata|categoryIcon=cloud-download}}
 
{{PageSequence|prev=Wikidata Import 2024-01-20|next=|category=Wikidata|categoryIcon=cloud-download}}
 +
=Import=
 +
{{Import
 +
|state=✅
 +
|url=https://wiki.bitplan.com/index.php/Wikidata_Import_2024-02-18
 +
|target=QLever
 +
|start=2024-02-18
 +
|end=2024-02-18
 +
|days=0.5
 +
|os=Ubuntu 22.04.3 LTS
 +
|cpu=AMD Ryzen 9 5900X 12-Core Processor @ 4.95GHz
 +
|ram=128
 +
|triples=14.4
 +
|storemode=property
 +
}}
 
= Docker =
 
= Docker =
 
<source lang='bash' highlight='1,14'>
 
<source lang='bash' highlight='1,14'>
Line 91: Line 105:
 
2024-02-18 16:00:12.052 - INFO: Input triples processed: 200,000,000
 
2024-02-18 16:00:12.052 - INFO: Input triples processed: 200,000,000
 
2024-02-18 16:01:17.385 - INFO: Input triples processed: 300,000,000
 
2024-02-18 16:01:17.385 - INFO: Input triples processed: 300,000,000
2024-02-18 16:02:17.663 - INFO: Input triples processed: 400,000,000
+
...
2024-02-18 16:03:21.529 - INFO: Input triples processed: 500,000,000
+
2024-02-18 23:47:16.498 - INFO: Triples processed: 27,800,000,000
2024-02-18 16:04:28.097 - INFO: Input triples processed: 600,000,000
+
2024-02-18 23:47:26.341 - INFO: Triples processed: 27,900,000,000
2024-02-18 16:05:36.281 - INFO: Input triples processed: 700,000,000
+
2024-02-18 23:47:38.089 - INFO: Triples processed: 28,000,000,000
2024-02-18 16:06:43.362 - INFO: Input triples processed: 800,000,000
+
2024-02-18 23:47:48.185 - INFO: Triples processed: 28,100,000,000
2024-02-18 16:07:49.418 - INFO: Input triples processed: 900,000,000
+
2024-02-18 23:47:59.993 - INFO: Triples processed: 28,200,000,000
2024-02-18 16:08:59.587 - INFO: Input triples processed: 1,000,000,000
+
2024-02-18 23:48:11.605 - INFO: Triples processed: 28,300,000,000
2024-02-18 16:10:09.476 - INFO: Input triples processed: 1,100,000,000
+
2024-02-18 23:48:15.254 - INFO: Statistics for PSO: #relations = 70,167, #blocks = 912,255, #triples = 28,339,760,365
2024-02-18 16:11:18.556 - INFO: Input triples processed: 1,200,000,000
+
2024-02-18 23:48:15.254 - INFO: Statistics for POS: #relations = 70,167, #blocks = 912,255, #triples = 28,339,760,365
2024-02-18 16:12:28.070 - INFO: Input triples processed: 1,300,000,000
+
2024-02-18 23:48:15.254 - INFO: Writing meta data for PSO and POS ...
2024-02-18 16:13:39.084 - INFO: Input triples processed: 1,400,000,000
+
2024-02-18 23:48:19.327 - INFO: Index build completed
2024-02-18 16:14:46.751 - INFO: Input triples processed: 1,500,000,000
 
2024-02-18 16:15:59.349 - INFO: Input triples processed: 1,600,000,000
 
2024-02-18 16:17:08.768 - INFO: Input triples processed: 1,700,000,000
 
2024-02-18 16:18:20.043 - INFO: Input triples processed: 1,800,000,000
 
2024-02-18 16:19:30.357 - INFO: Input triples processed: 1,900,000,000
 
2024-02-18 16:20:37.163 - INFO: Input triples processed: 2,000,000,000
 
2024-02-18 16:21:48.621 - INFO: Input triples processed: 2,100,000,000
 
2024-02-18 16:22:56.149 - INFO: Input triples processed: 2,200,000,000
 
2024-02-18 16:23:49.829 - INFO: Input triples processed: 2,300,000,000
 
2024-02-18 16:24:56.267 - INFO: Input triples processed: 2,400,000,000
 
2024-02-18 16:26:04.897 - INFO: Input triples processed: 2,500,000,000
 
2024-02-18 16:27:13.611 - INFO: Input triples processed: 2,600,000,000
 
2024-02-18 16:28:21.187 - INFO: Input triples processed: 2,700,000,000
 
2024-02-18 16:29:21.408 - INFO: Input triples processed: 2,800,000,000
 
2024-02-18 16:30:26.128 - INFO: Input triples processed: 2,900,000,000
 
2024-02-18 16:31:31.781 - INFO: Input triples processed: 3,000,000,000
 
2024-02-18 16:32:38.742 - INFO: Input triples processed: 3,100,000,000
 
2024-02-18 16:33:48.502 - INFO: Input triples processed: 3,200,000,000
 
2024-02-18 16:34:51.945 - INFO: Input triples processed: 3,300,000,000
 
2024-02-18 16:36:01.858 - INFO: Input triples processed: 3,400,000,000
 
2024-02-18 16:37:12.226 - INFO: Input triples processed: 3,500,000,000
 
2024-02-18 16:38:21.741 - INFO: Input triples processed: 3,600,000,000
 
2024-02-18 16:39:31.795 - INFO: Input triples processed: 3,700,000,000
 
2024-02-18 16:40:42.177 - INFO: Input triples processed: 3,800,000,000
 
2024-02-18 16:41:50.624 - INFO: Input triples processed: 3,900,000,000
 
2024-02-18 16:43:01.851 - INFO: Input triples processed: 4,000,000,000
 
2024-02-18 16:44:13.660 - INFO: Input triples processed: 4,100,000,000
 
2024-02-18 16:45:23.053 - INFO: Input triples processed: 4,200,000,000
 
2024-02-18 16:46:34.143 - INFO: Input triples processed: 4,300,000,000
 
2024-02-18 16:47:40.993 - INFO: Input triples processed: 4,400,000,000
 
2024-02-18 16:48:52.191 - INFO: Input triples processed: 4,500,000,000
 
2024-02-18 16:50:01.070 - INFO: Input triples processed: 4,600,000,000
 
2024-02-18 16:51:06.287 - INFO: Input triples processed: 4,700,000,000
 
2024-02-18 16:52:02.124 - INFO: Input triples processed: 4,800,000,000
 
2024-02-18 16:53:09.321 - INFO: Input triples processed: 4,900,000,000
 
2024-02-18 16:54:18.144 - INFO: Input triples processed: 5,000,000,000
 
2024-02-18 16:55:27.092 - INFO: Input triples processed: 5,100,000,000
 
2024-02-18 16:56:32.032 - INFO: Input triples processed: 5,200,000,000
 
2024-02-18 16:57:32.926 - INFO: Input triples processed: 5,300,000,000
 
2024-02-18 16:58:36.694 - INFO: Input triples processed: 5,400,000,000
 
2024-02-18 16:59:43.481 - INFO: Input triples processed: 5,500,000,000
 
2024-02-18 17:00:52.058 - INFO: Input triples processed: 5,600,000,000
 
2024-02-18 17:01:58.487 - INFO: Input triples processed: 5,700,000,000
 
2024-02-18 17:03:04.997 - INFO: Input triples processed: 5,800,000,000
 
2024-02-18 17:04:14.760 - INFO: Input triples processed: 5,900,000,000
 
2024-02-18 17:05:24.624 - INFO: Input triples processed: 6,000,000,000
 
2024-02-18 17:06:34.002 - INFO: Input triples processed: 6,100,000,000
 
2024-02-18 17:07:43.831 - INFO: Input triples processed: 6,200,000,000
 
2024-02-18 17:08:54.722 - INFO: Input triples processed: 6,300,000,000
 
2024-02-18 17:10:02.911 - INFO: Input triples processed: 6,400,000,000
 
2024-02-18 17:11:16.436 - INFO: Input triples processed: 6,500,000,000
 
2024-02-18 17:12:25.464 - INFO: Input triples processed: 6,600,000,000
 
2024-02-18 17:13:37.228 - INFO: Input triples processed: 6,700,000,000
 
2024-02-18 17:14:47.685 - INFO: Input triples processed: 6,800,000,000
 
2024-02-18 17:15:54.522 - INFO: Input triples processed: 6,900,000,000
 
2024-02-18 17:17:05.453 - INFO: Input triples processed: 7,000,000,000
 
2024-02-18 17:18:13.934 - INFO: Input triples processed: 7,100,000,000
 
2024-02-18 17:19:07.212 - INFO: Input triples processed: 7,200,000,000
 
2024-02-18 17:20:14.138 - INFO: Input triples processed: 7,300,000,000
 
2024-02-18 17:21:23.144 - INFO: Input triples processed: 7,400,000,000
 
2024-02-18 17:22:32.146 - INFO: Input triples processed: 7,500,000,000
 
 
</source>
 
</source>

Revision as of 07:48, 19 February 2024

Import

Import
edit
state  ✅
url  https://wiki.bitplan.com/index.php/Wikidata_Import_2024-02-18
target  QLever
start  2024-02-18
end  2024-02-18
days  0.5
os  Ubuntu 22.04.3 LTS
cpu  AMD Ryzen 9 5900X 12-Core Processor @ 4.95GHz
ram  128
triples  14.4
comment  

Docker

docker pull adfreiburg/qlever
Using default tag: latest
latest: Pulling from adfreiburg/qlever
01007420e9b0: Pull complete 
460c63749ea2: Pull complete 
91b2277608b5: Pull complete 
c1a82dc7696f: Pull complete 
4593d1466d3e: Pull complete 
84b5c44e1220: Pull complete 
46cc3c2a5eaf: Pull complete 
Digest: sha256:80bc5f65dc9fe7cf5cd4c7ce326cdf97773a218d53534f3262c858a03b0e6d40
Status: Downloaded newer image for adfreiburg/qlever:latest
docker.io/adfreiburg/qlever:latest
docker pull adfreiburg/qlever-ui
Using default tag: latest
latest: Pulling from adfreiburg/qlever-ui
59bf1c3509f3: Pull complete 
07a400e93df3: Pull complete 
64052ee245ef: Pull complete 
a44d093ad4a5: Pull complete 
0381087ee065: Pull complete 
91c88323734b: Pull complete 
fdcee6d0309d: Pull complete 
e6b2715c1d5d: Pull complete 
b9c9f00cb678: Pull complete 
3f12ea50b177: Pull complete 
Digest: sha256:7f4b358d6a127e512979074de0c6e84f250a37bca46c494d8e04a62844716e48
Status: Downloaded newer image for adfreiburg/qlever-ui:latest
docker.io/adfreiburg/qlever-ui:latest

QLever control

https://github.com/ad-freiburg/qlever-control

git clone https://github.com/ad-freiburg/qlever-control.git
Cloning into 'qlever-control'...
remote: Enumerating objects: 1118, done.
remote: Counting objects: 100% (876/876), done.
remote: Compressing objects: 100% (432/432), done.
remote: Total 1118 (delta 399), reused 788 (delta 375), pack-reused 242
Receiving objects: 100% (1118/1118), 247.70 KiB | 949.00 KiB/s, done.
Resolving deltas: 100% (513/513), done.
cd qlever-control/
git checkout python-qlever
Branch 'python-qlever' set up to track remote branch 'python-qlever' from 'origin'.
Switched to a new branch 'python-qlever'
pip install .
Defaulting to user installation because normal site-packages is not writeable
Processing /home/wf/source/python/qlever-control
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Preparing metadata (pyproject.toml) ... done
Building wheels for collected packages: UNKNOWN
  Building wheel for UNKNOWN (pyproject.toml) ... done
  Created wheel for UNKNOWN: filename=UNKNOWN-0.0.0-py3-none-any.whl size=5111 sha256=83e57ed4efe8c8115d3d266f05a0cb97388cfc42f0f17bba900da02ba9c31bef
  Stored in directory: /home/wf/.cache/pip/wheels/07/95/58/79d49197785a6e837569fd3f894d646428d2e272f53582c762
Successfully built UNKNOWN
Installing collected packages: UNKNOWN
Successfully installed UNKNOWN-0.0.0

dblp warmup test

wf@fur:/hd/tepig/dblp$ qlever setup-config dblp
wf@fur:/hd/tepig/dblp$ qlever get-data index restart test-query ui 

Action "get-data"

curl -LO -C - https://dblp.org/rdf/dblp.ttl.gz

  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
 24 1663M   24  401M    0     0  7092k      0  0:04:00  0:00:57  0:03:03 7126k
...

wikidata

qlever setup-config wikidata
qlever get-data index
...
2024-02-18 15:57:53.958 - INFO: QLever IndexBuilder, compiled on Fri Feb 16 16:15:27 UTC 2024 using git hash a70652
2024-02-18 15:57:53.958 - INFO: You specified the input format: TTL
2024-02-18 15:57:53.958 - INFO: Processing input triples from /dev/stdin ...
2024-02-18 15:57:53.958 - INFO: You specified "locale = en_US" and "ignore-punctuation = 1"
2024-02-18 15:57:53.958 - INFO: You specified "parallel-parsing = true", which enables faster parsing for TTL files that don't include multiline literals with unescaped newline characters and that have newline characters after the end of triples.
2024-02-18 15:57:53.958 - INFO: You specified "num-triples-per-batch = 5,000,000", choose a lower value if the index builder runs out of memory
2024-02-18 15:57:53.958 - INFO: Integers that cannot be represented by QLever will throw an exception (this is the default behavior)
2024-02-18 15:59:03.263 - INFO: Input triples processed: 100,000,000
2024-02-18 16:00:12.052 - INFO: Input triples processed: 200,000,000
2024-02-18 16:01:17.385 - INFO: Input triples processed: 300,000,000
...
2024-02-18 23:47:16.498 - INFO: Triples processed: 27,800,000,000
2024-02-18 23:47:26.341 - INFO: Triples processed: 27,900,000,000
2024-02-18 23:47:38.089 - INFO: Triples processed: 28,000,000,000
2024-02-18 23:47:48.185 - INFO: Triples processed: 28,100,000,000
2024-02-18 23:47:59.993 - INFO: Triples processed: 28,200,000,000
2024-02-18 23:48:11.605 - INFO: Triples processed: 28,300,000,000
2024-02-18 23:48:15.254 - INFO: Statistics for PSO: #relations = 70,167, #blocks = 912,255, #triples = 28,339,760,365
2024-02-18 23:48:15.254 - INFO: Statistics for POS: #relations = 70,167, #blocks = 912,255, #triples = 28,339,760,365
2024-02-18 23:48:15.254 - INFO: Writing meta data for PSO and POS ...
2024-02-18 23:48:19.327 - INFO: Index build completed