Difference between revisions of "Wikidata Import 2025-06-06"
Revision as of 06:47, 7 June 2025
| Import | |
|---|---|
| state | ✅ |
| url | https://wiki.bitplan.com/index.php/Wikidata_Import_2025-06-06 |
| target | blazegraph |
| start | 2025-06-06 |
| end | |
| days | |
| os | Ubuntu 22.04.3 LTS |
| cpu | |
| ram | 128 |
| triples | |
| comment | seeded with 1.3 TB data.jnl file originally provided by James Hare |
This "import" does not use the usual dump-and-index approach. Instead, it directly copies an existing Blazegraph journal file.
Steps
Copy journal file
md5sum data.jnl
6ebe0cced1a22c6cf3fecb56afcf1c10 data.jnl
blockdownload --name wikidata --blocksize 512 --boost 8 --progress https://wikidata-dump.wikidata.dbis.rwth-aachen.de/data.jnl .
Blocks ∅: 100%|████████████████████████████████████████████| 1.39T/1.39T [7:31:03<00:00, 51.5MB/s]
blockdownload --name wikidata --output data.jnl https://wikidata-dump.wikidata.dbis.rwth-aachen.de/data.jnl 2025-06-05 --progress
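Since the step lists an MD5 digest for data.jnl, the downloaded copy can be checked against it before the journal is used. The following is a minimal sketch, assuming the listed digest refers to the same file that was downloaded and that GNU `md5sum` is available. The helper name `verify_md5` is hypothetical and not part of blockdownload.

```bash
#!/usr/bin/env bash
# Sketch: compare a file's MD5 digest with an expected value.
# verify_md5 is a hypothetical helper name, not part of blockdownload.
verify_md5() {
  local file="$1" expected="$2" actual
  # md5sum prints "<digest>  <filename>"; keep only the digest
  actual="$(md5sum "$file" | awk '{print $1}')"
  if [ "$actual" = "$expected" ]; then
    echo "OK: $file"
  else
    echo "MISMATCH: $file (got '$actual')" >&2
    return 1
  fi
}

# Digest copied from the md5sum output listed in this step.
EXPECTED_MD5="6ebe0cced1a22c6cf3fecb56afcf1c10"

# Only run the check when the journal file is present.
if [ -f data.jnl ]; then
  verify_md5 data.jnl "$EXPECTED_MD5"
fi
```

Hashing a journal file of this size reads the whole file once, so the check may take a long time depending on disk throughput.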
Set up WDQS environment
https://github.com/scatter-llc/private-wikidata-query