WikiData Import 2020-09-11

From BITPlan Wiki
Revision as of 16:19, 25 September 2020 by Wf

Goal

Replicate the original Blazegraph environment; see https://github.com/wikimedia/wikidata-query-rdf/blob/master/docs/getting-started.md

Git clone

git clone --recurse-submodules https://gerrit.wikimedia.org/r/wikidata/query/rdf wikidata-query-rdf
Cloning into 'wikidata-query-rdf'...
remote: Counting objects: 195, done
remote: Finding sources: 100% (161/161)
remote: Getting sizes: 100% (146/146)
remote: Compressing objects: 100% (131311/131311)
remote: Total 17957 (delta 17), reused 17930 (delta 9)
Receiving objects: 100% (17957/17957), 2.98 MiB | 3.37 MiB/s, done.
Resolving deltas: 100% (8953/8953), done.

mvn package

# Mac OS X 
export JAVA_HOME=$(/usr/libexec/java_home -v 1.8.0_191)
java -version
java version "1.8.0_191"
Java(TM) SE Runtime Environment (build 1.8.0_191-b12)
Java HotSpot(TM) 64-Bit Server VM (build 25.191-b12, mixed mode)
cd wikidata-query-rdf
mvn -version
Apache Maven 3.6.0 (97c98ec64a1fdfee7767ce5ffb20918da4f719f3; 2018-10-24T20:41:47+02:00)
Maven home: /opt/local/share/java/maven3
Java version: 1.8.0_191, vendor: Oracle Corporation, runtime: /Library/Java/JavaVirtualMachines/jdk1.8.0_191.jdk/Contents/Home/jre
Default locale: de_DE, platform encoding: UTF-8
OS name: "mac os x", version: "10.13.6", arch: "x86_64", family: "mac"
mvn package
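
The transcript above selects JDK 1.8.0_191 before building. A minimal sketch of a pre-build check, assuming (the page only shows that Java 8 was used, not that it is required) that other Java versions are worth a warning. The function names java_version_of and is_java8 are made up for this example.

```shell
#!/bin/sh
# java_version_of LINE: extract the quoted version from a line such as
#   java version "1.8.0_191"
java_version_of() {
    printf '%s\n' "$1" | sed -n 's/.*version "\([^"]*\)".*/\1/p'
}

# is_java8 VERSION: succeeds when the version string starts with "1.8."
is_java8() {
    case "$1" in
        1.8.*) return 0 ;;
        *)     return 1 ;;
    esac
}

# Only run the check when a java binary is on the PATH.
if command -v java >/dev/null 2>&1; then
    # java -version writes to stderr; the first line carries the version.
    first_line="$(java -version 2>&1 | head -n 1)"
    version="$(java_version_of "$first_line")"
    if ! is_java8 "$version"; then
        echo "warning: java reports '$version', the transcript used 1.8.0_191" >&2
    fi
fi
```

Run it in the same shell session before mvn package, after JAVA_HOME has been exported.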

issue

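mvn package stops with the dyld error below: the bundled RocksDB native library was built for Mac OS X 10.15, while this machine runs 10.13.6. See https://phabricator.wikimedia.org/T263855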

2020-09-11 13:45:39 INFO  RocksDBStateBackend:899 - Attempting to load RocksDB native library and store it under '/var/folders/2t/2g54bjr10830rv00508_y13w0000gn/T/flink-io-6ec39247-613e-410a-a83f-712f841ce3a8'
dyld: lazy symbol binding failed: Symbol not found: ____chkstk_darwin
  Referenced from: /private/var/folders/2t/2g54bjr10830rv00508_y13w0000gn/T/flink-io-6ec39247-613e-410a-a83f-712f841ce3a8/rocksdb-lib-396871de50f5fa7595c1071b59c34498/librocksdbjni-osx.jnilib (which was built for Mac OS X 10.15)
  Expected in: /usr/lib/libSystem.B.dylib

dyld: Symbol not found: ____chkstk_darwin
  Referenced from: /private/var/folders/2t/2g54bjr10830rv00508_y13w0000gn/T/flink-io-6ec39247-613e-410a-a83f-712f841ce3a8/rocksdb-lib-396871de50f5fa7595c1071b59c34498/librocksdbjni-osx.jnilib (which was built for Mac OS X 10.15)
  Expected in: /usr/lib/libSystem.B.dylib
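
The dyld message says the library was built for Mac OS X 10.15, and the mvn -version output reports 10.13.6. A minimal sketch of how that mismatch could be detected before building. The helper version_lt is hypothetical, and the threshold 10.15 is simply copied from the message above.

```shell
#!/bin/sh
# version_lt A B: succeeds (exit 0) when dotted version A is strictly older than B.
# Missing components count as 0, so 10.15 is treated like 10.15.0.
version_lt() {
    awk -v a="$1" -v b="$2" 'BEGIN {
        n = split(a, x, "."); m = split(b, y, ".")
        max = (n > m) ? n : m
        for (i = 1; i <= max; i++) {
            xi = (i <= n) ? x[i] + 0 : 0
            yi = (i <= m) ? y[i] + 0 : 0
            if (xi < yi) exit 0
            if (xi > yi) exit 1
        }
        exit 1
    }'
}

# Taken from the dyld message ("which was built for Mac OS X 10.15").
required="10.15"

# sw_vers only exists on macOS; skip the check everywhere else.
if command -v sw_vers >/dev/null 2>&1; then
    host="$(sw_vers -productVersion)"
    if version_lt "$host" "$required"; then
        echo "macOS $host is older than $required: librocksdbjni-osx.jnilib may fail to load"
    fi
fi
```

The comparison is done numerically per component, so 10.9 is correctly treated as older than 10.13.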

Try workaround

mvn package -D skipTests

service unzip

Note the use of tar x instead of unzip: the distribution is a .tar.gz archive.

cd dist/target/
tar xvfz service-0.3.50-SNAPSHOT-dist.tar.gz
cd service-0.3.50-SNAPSHOT
ls
RWStore.properties                     loadRestAPI.sh
blazegraph-service-0.3.50-SNAPSHOT.war munge.sh
createNamespace.sh                     mw-oauth-proxy-0.3.50-SNAPSHOT.war
default.properties                     mwservices.json
docs                                   prefixes-sdc.conf
forAllCategoryWikis.sh                 prefixes.conf
jetty-runner-9.4.12.v20180830.jar      runBlazegraph.sh
ldf-config.json                        runStreamingUpdater.sh
lib                                    runUpdate.sh
loadCategoryDaily.sh                   summarizeEvents.sh
loadCategoryDump.sh                    wcqs-data-reload.sh
loadData.sh
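
The version number 0.3.50-SNAPSHOT in the commands above changes with every release. A minimal sketch that unpacks whichever archive the build produced, assuming it is run from the wikidata-query-rdf checkout and that the archive unpacks into a directory named like the archive without its -dist.tar.gz suffix (as in the listing above). The helper dist_dir_name is hypothetical.

```shell
#!/bin/sh
# dist_dir_name ARCHIVE: print the directory the archive is expected to unpack
# into, i.e. the file name minus the "-dist.tar.gz" suffix.
dist_dir_name() {
    name="$(basename "$1")"
    printf '%s\n' "${name%-dist.tar.gz}"
}

# Unpack every service-*-dist.tar.gz found under dist/target.
for archive in dist/target/service-*-dist.tar.gz; do
    [ -f "$archive" ] || continue   # the glob did not match anything
    tar -xzf "$archive" -C dist/target
    echo "unpacked to dist/target/$(dist_dir_name "$archive")"
done
```

If no archive exists yet, the loop does nothing, so the script is safe to run before the build has finished.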