Get your own copy of WikiData
Why would you want your own Wikidata copy?
The resources behind https://query.wikidata.org/ are scarce and used by a lot of people. You might hit the https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/query_limits quite quickly.
See SPARQL for some examples that work online (mostly) without hitting these limits.
What are alternative endpoints ?
Prerequisites
Getting a copy of Wikidata is not for the faint of heart.
You need quite a bit of patience and some hardware resources to get your own WikiData copy working. The resources you need are a moving target since Wikidata is growing all the time.
On the other hand solutions such as QLever by Hannah Bast are making progress so that you can run your own copy of Wikidata on commodity hardware with an AMD Ryzen 9 16 core processor, 32 GB of RAM a 2 TTB SSD and the indexing will take less than 5 h. Together with the download time of some 6 hours that's less than half a day for getting a current copy so a daily update is feasible these days.
Papers
Successes
Please contact Wolfgang Fahl if you'd like to see your own success report added to the "Reports" table below. The imports table is generated from the documentation of our own Wikidata import trials in this semantic mediawiki.
Reports
# Date | Source | Target | Triples | days | RAM GB | CPU Cores | Speed | Link |
---|---|---|---|---|---|---|---|---|
2024-01-29 | latest-all.nt (2024-01-29) | QLever | 19.1 billion | 4.5 hours | 32 | 16 | AMD Ryzen 9 | Hannah Bast - QLever |
2023-01 | James Hare - Blazegraph & QLever | Blazegraph & QLever | ? | ? | ? | ? | 384 GB | James Hare Scatter LLC |
2022-07 | latest-all.ttl (2022-07-12) | stardog | 17.2 billion | 1 d 19 h | 253 | Tim Holzheim - BITPlan Wiki | ||
2022-06 | latest-all.nt (2022-06-25) | QLever | 17.2 billion | 1 d 2 h | 128 | 8 | 1.8 GHz | Wolfgang Fahl - BITPlan Wiki |
2022-05 | latest-all.ttl.bz2 (2022-05-29) | QLever | ~17 billion | 14h | 128 | 12/24 | 4.8 GHz boost | Hannah Bast - QLever |
2022-02 | latest-all.nt (2022-02) | stardog | 16.7 billion | 9h | Evren Sirin - stardog | |||
2022-02 | latest-all.nt (2022-01-29) | QLever | 16.9 billion | 4 d 2 h | 127 | 8 | 1.8 GHz | Wolfgang Fahl - BITPlan Wiki |
2020-08 | latest-all.nt (2020-08-15) | Apache Jena | 13.8 billion | 9 d 21 h | 64 | Wolfgang Fahl BITPlan Wiki | ||
2020-07 | latest-truthy.nt (2020-07-15) | Apache Jena | 5.2 billion | 4 d 14 h | 64 | Wolfgang Fahl BITPlan Wiki | ||
2020-06 | latest-all.ttl (2020-04-28) | Apache Jena | 12.9 billion | 6 d 16 h | ? | Jonas Sourlier - Jena Issue 1909 | ||
2020-03 | latest-all.nt.bz2 (2020-03-01 | Virtuoso | ~11.8 billion | 10 hours + 1day prep | 248 | Hugh Williams - Virtuoso | ||
2019-10 | blazegraph | ~10 billion | 5.5 d | 104 | 16 | Adam Shoreland Wikimedia Foundation | ||
2019-09 | latest-all.ttl (2019-09) | Virtuoso | 9.5 billion | 9.1 hours | ? | Adam Sanchez - WikiData mailing list | ||
2019-05 | wikidata-20190513-all-BETA.ttl | Virtuoso | ? | 43 hours | ? | - | ||
2019-05 | wikidata-20190513-all-BETA.ttl | Blazegraph | ? | 10.2 days | Adam Sanchez WikiData mailing list | |||
2019-02 | latest-all.ttl.gz | Apache Jena | ? | > 2 days | ? | corsin - muncca blog | ||
2018-01 | wikidata-20180101-all-BETA.ttl | Blazegraph | 3 billion | 4 days | 32 | 4 | 2.2 GHz | Wolfgang Fahl - BITPlan wiki |
2017-12 | latest-truthy.nt.gz | Apache Jena | ? | 8 hours | ? | Andy Seaborne Apache Jena Mailinglist |
Imports on our own hardware
Import | state | url | target | start | end | days | os | cpu | ram | triples |
---|---|---|---|---|---|---|---|---|---|---|
Wikidata Import 2025-08-19 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-08-19 | QLever | 19 August 2025 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | |||
Wikidata Import 2025-07-30 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-07-30 | QLever | 30 July 2025 | 1 August 2025 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | ||
Wikidata Import 2025-07-13 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-07-13 | QLever | 13 July 2025 | 13 July 2025 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | ||
Wikidata Import 2025-06-07 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-06-07 | QLever | 7 June 2025 | 8 June 2025 | 1 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | |
Wikidata Import 2025-06-06 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-06-06 | blazegraph | 6 June 2025 | 7 June 2025 | 1 | Ubuntu 22.04.5 LTS | AMD Ryzen 9 5900X 12-Core Processor | 128 | |
Wikidata Import 2025-06-02 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-06-02 | QLever | 2 June 2025 | 3 June 2025 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 20,484,725,007 | |
Wikidata Import 2025-05-03 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-05-03 | QLever | 3 May 2025 | 4 May 2025 | 0.6 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 20,096,524,609 |
Wikidata Import 2025-05-02 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-05-02 | blazegraph | 2 May 2025 | 5 June 2025 | 34 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16,836,266,114 |
Wikidata Import 2025-03-31 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-03-31 | QLever | 31 March 2025 | 1 April 2025 | 0.6 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | |
Wikidata Import 2025-02-13 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2025-02-13 | QLever | 13 February 2025 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | |||
Wikidata Import 2024-11-23 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-11-23 | QLever | 23 November 2024 | 24 November 2024 | 0.7 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16.3 |
Wikidata Import 2024-11-16 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-11-16 | QLever | 16 November 2024 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16.3 | ||
Wikidata Import 2024-11-15 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-11-15 | QLever | 15 November 2024 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16.3 | ||
Wikidata import 2024-10-28 Virtuoso | ✅ | https://wiki.bitplan.com/index.php?title=Wikidata import 2024-10-28 Virtuoso | Virtuoso | 29 October 2024 | 3 November 2024 | 4 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16.3 |
Wikidata Import 2024-10-26 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-10-26 | QLever | 26 October 2024 | 27 October 2024 | 0.9 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16.3 |
Wikidata Import 2024-10-24 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-10-24 | QLever | 24 October 2024 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | |||
Wikidata Import 2024-10-17 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-10-17 | QLever | 17 October 2024 | 18 October 2024 | 0.9 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | 16.3 |
Wikidata Import 2024-04-13 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-04-13 | QLever | 13 April 2024 | 13 April 2024 | Ubuntu 22.04.3 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz (16 cores) | 512 | ||
Wikidata Import 2024-02-18 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-02-18 | QLever | 18 February 2024 | 18 February 2024 | 0.5 | Ubuntu 22.04.3 LTS | AMD Ryzen 9 5900X 12-Core Processor @ 4.95GHz | 128 | 15.5 |
Wikidata Import 2024-01-20 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2024-01-20 | QLever | 20 January 2024 | 21 January 2024 | 0.5 | Ubuntu 22.04.2 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz | 256 | 19.1 |
Wikidata Import 2023-05-15 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-05-15 | QLever | 15 May 2023 | Ubuntu 22.04.2 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz | 256 | |||
Wikidata Import 2023-05-14 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-05-14 | blazegraph | 14 May 2023 | Ubuntu 22.04.2 LTS | Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz | 32 | |||
Wikidata Import 2023-05-10 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-05-10 | blazegraph | 10 May 2023 | Ubuntu 22.04.2 LTS | Intel(R) Xeon(R) CPU X5690@3.47GHz | 64 | 14.7 | ||
Wikidata Import 2023-05-05 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-05-05 | blazegraph | 5 May 2023 | Ubuntu 22.04.2 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz | 256 | 14.7 | ||
Wikidata Import 2023-05-03 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-05-03 | blazegraph | 3 May 2023 | 26 June 2023 | 23 | Ubuntu 20.04.6 LTS | Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz | 128 | 14.7 |
Wikidata Import 2023-04-26 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-04-26 | blazegraph | 26 April 2023 | Ubuntu 22.04.2 LTS | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz | 256 | 14.7 | ||
Wikidata Import 2023-04-18 | ❌ | https://wiki.bitplan.com/index.php/Wikidata Import 2023-04-18 | blazegraph | 18 April 2023 | 18 April 2023 | Intel(R) Xeon(R) Gold 6326 CPU @ 2.90GHz | 256 | 14.6 | ||
Wikidata Import 2023-01-24 | https://wiki.bitplan.com/index.php/Wikidata Import 2023-01-24 | QLever | 24 January 2023 | |||||||
WikiData Import 2022-07-20 | https://wiki.bitplan.com/index.php/WikiData Import 2022-07-20 | virtuoso | 20 July 2022 | |||||||
WikiData Import 2022-07-12 | ✅ | https://wiki.bitplan.com/index.php/Wikidata On Stardog | Stardog | 11 July 2022 | 14 July 2022 | 3 | 256 | |||
Wikidata On Stardog | ✅ | https://wiki.bitplan.com/index.php/Wikidata On Stardog | Stardog | 11 July 2022 | 14 July 2022 | 3 | 256 | |||
WikiData Import 2022-06-25 | ✅ | https://wiki.bitplan.com/index.php/WikiData Import 2022-06-25 | QLever | 25 June 2022 | 27 June 2022 | 1.1 | Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz | 128 | ||
Wikidata on Allegrograph | ✅ | https://wiki.bitplan.com/index.php/Wikidata on Allegrograph | Allegrograph | 15 April 2022 | 0.3 | CentOS Linux release 7.9.2009 | AMD EPYC 7302, 3.0GHz | 256 | 16.8 | |
WikiData Import 2022-03-11 | ❌ | https://wiki.bitplan.com/index.php/WikiData Import 2022-03-11 | QLever | 12 March 2022 | 12 March 2022 | |||||
WikiData Import 2022-01-29 | ✅ | https://wiki.bitplan.com/index.php/WikiData Import 2022-01-29 | QLever | 2 February 2022 | 6 February 2022 | 4 | Ubuntu 20.04.3 LTS | Quad-Core AMD Opteron(tm) Processor 2374 HE | 64 | 16.9 |
Wikidata Import 2018-01-05 | ✅ | https://wiki.bitplan.com/index.php/Wikidata Import 2018-01-05 | blazegraph | 5 January 2018 | Quad-Core AMD Opteron(tm) Processor 2374 HE |
Links
- https://github.com/mmayers12/wikidata
- https://www.wikidata.org/wiki/Wikidata:Database_download#RDF_dumps
- https://www.mediawiki.org/wiki/Wikidata_Query_Service/User_Manual
- https://muncca.com/2019/02/14/wikidata-import-in-apache-jena/
- https://users.jena.apache.narkive.com/J1gsFHRk/tdb2-tdbloader-performance
- https://stackoverflow.com/questions/61813248/jena-tdbloader2-performance-and-limits
- https://github.com/maxlath/import-wikidata-dump-to-couchdb
- https://bugs.java.com/bugdatabase/view_bug.do?bug_id=8169477
- https://lists.wikimedia.org/pipermail/wikidata/2019-December/013716.html
- https://akbaritabar.netlify.app/how_to_use_a_wikidata_dump
- https://addshore.com/2019/10/your-own-wikidata-query-service-with-no-limits-part-1/
- https://topicseed.com/blog/importing-wikidata-dumps
Questions
- https://opendata.stackexchange.com/questions/107/how-can-i-download-the-complete-wikidata-database
- How to keep your copy of WikiData up to data without going thru the whole import process again? See https://phabricator.wikimedia.org/T244590
- https://stackoverflow.com/questions/56769098/bulk-loading-of-wikidata-dump-in-virtuoso
- https://stackoverflow.com/questions/47885637/failed-to-install-wikidata-query-rdf-blazegraph
- https://stackoverflow.com/questions/48020506/wikidata-on-local-blazegraph-expected-an-rdf-value-here-found-line-1/48110100
- https://stackoverflow.com/questions/56768463/wikidata-import-into-virtuoso
- https://stackoverflow.com/questions/14494449/virtuoso-system-requirements