Difference between revisions of "Wikidata Import 2025-06-06"
Jump to navigation
Jump to search
Line 31: | Line 31: | ||
</source> | </source> | ||
== setup wdqs environment == | == setup wdqs environment == | ||
+ | # see also [https://github.com/scatter-llc/private-wikidata-query/blob/main/README.md README.md] at https://github.com/scatter-llc/private-wikidata-query/blob/main/README.md | ||
<source lang='bash' highlight='1'> | <source lang='bash' highlight='1'> | ||
− | https://github.com/scatter-llc/private-wikidata-query | + | https://github.com/scatter-llc/private-wikidata-query wdqs |
+ | mkdir wdqs/data | ||
+ | mv data.jnl wdqs/data | ||
</source> | </source> |
Revision as of 06:56, 7 June 2025
Import
Import | |
---|---|
state | ✅ |
url | https://wiki.bitplan.com/index.php/Wikidata_Import_2025-06-06 |
target | blazegraph |
start | 2025-06-06 |
end | |
days | |
os | Ubuntu 22.04.3 LTS |
cpu | |
ram | 128 |
triples | |
comment | seeded with 1.3 TB data.jnl file originally provided by James Hare |
This "import" is not using a dump and indexing approach but directly copying a blazegraph journal file.
Steps
Copy journal file
md5sum data.jnl
6ebe0cced1a22c6cf3fecb56afcf1c10 data.jnl
blockdownload --name wikidata --blocksize 512 --boost 8 --progress https://wikidata-dump.wikidata.dbis.rwth-aachen.de/data.jnl .
Blocks ∅: 100%|████████████████████████████████████████████| 1.39T/1.39T [7:31:03<00:00, 51.5MB/s]
blockdownload --name wikidata --output data.jnl https://wikidata-dump.wikidata.dbis.rwth-aachen.de/data.jnl 2025-06-05 --progress
setup wdqs environment
https://github.com/scatter-llc/private-wikidata-query wdqs
mkdir wdqs/data
mv data.jnl wdqs/data