Weaviate: Difference between revisions
Revision as of 08:56, 10 June 2023
see
- https://www.semi.technology/news/fosdem-2020.html
- https://www.semi.technology/documentation/weaviate/current/about/philosophy.html#about-the-contextionary
- Word2vec
- BERT
- GloVe
Fill with a dictionary, e.g. English.
- https://stackoverflow.com/questions/27652335/semantic-search-engines/60361878#60361878
- https://en.wikipedia.org/wiki/Bob_van_Luijt
- Weaviate playground
- https://nlp.stanford.edu/pubs/glove.pdf
- http://wiki.bitplan.com/index.php/Weaviate
- https://hackernoon.com/how-weaviates-graphql-api-was-designed-t93932tl
- https://medium.com/semi-technologies/finding-answers-in-complex-standardizations-documents-using-weaviates-semantic-search-modules-4dcdb83527fb
Versions
Presentations
Installation
see https://weaviate.io/developers/weaviate/installation
Setup, schema, importing data and GraphQL
Docker
Prerequisites
sudo apt-get install curl docker.io docker-compose
Install weaviate
git clone https://github.com/weaviate/weaviate.git
Cloning into 'weaviate'...
remote: Enumerating objects: 124394, done.
remote: Counting objects: 100% (14150/14150), done.
remote: Compressing objects: 100% (1626/1626), done.
remote: Total 124394 (delta 12845), reused 13458 (delta 12510), pack-reused 110244
Receiving objects: 100% (124394/124394), 945.83 MiB | 33.55 MiB/s, done.
Resolving deltas: 100% (86422/86422), done.
cd weaviate
docker build --target weaviate -t weaviate .
docker-compose.yml
version: '3.4'
services:
  weaviate:
    image: weaviate
    ports:
      - 8080:8080
    environment:
      CONTEXTIONARY_URL: contextionary:9999
      QUERY_DEFAULTS_LIMIT: 25
      AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'
      PERSISTENCE_DATA_PATH: './data'
      ENABLE_MODULES: 'text2vec-contextionary'
      DEFAULT_VECTORIZER_MODULE: 'text2vec-contextionary'
      AUTOSCHEMA_ENABLED: 'false'
  contextionary:
    environment:
      OCCURRENCE_WEIGHT_LINEAR_FACTOR: 0.75
      EXTENSIONS_STORAGE_MODE: weaviate
      EXTENSIONS_STORAGE_ORIGIN: http://weaviate:8080
      NEIGHBOR_OCCURRENCE_IGNORE_PERCENTILE: 5
      ENABLE_COMPOUND_SPLITTING: 'false'
    image: semitechnologies/contextionary:en0.16.0-v1.2.1
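Once the two services are started (typically with docker-compose up -d in the directory containing this file), it can take a while before weaviate answers on the mapped port 8080. The following is a minimal sketch of a readiness check against the /v1/.well-known/ready endpoint, assuming the instance is reachable on localhost. The helper names ready_url, http_probe and wait_until_ready are hypothetical and only serve this illustration.

```python
import http.client
import time
import urllib.error
import urllib.request


def ready_url(host="localhost", port=8080):
    """Build the readiness URL; 8080 is the port mapped in docker-compose.yml."""
    return f"http://{host}:{port}/v1/.well-known/ready"


def http_probe(url, timeout=2.0):
    """Return True if the endpoint answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout or a non-2xx answer: not ready yet.
        return False


def wait_until_ready(url, probe=http_probe, attempts=30, delay=1.0):
    """Call probe(url) up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if probe(url):
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False
```

Calling wait_until_ready(ready_url()) returns True as soon as the instance reports ready and False if it never does within the given number of attempts. The probe parameter is injectable so the polling logic can be exercised without a running container.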
Installation Issue
Python Library
sample code
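As a minimal sketch of how the instance could be addressed from Python, the following uses only the standard library against the REST endpoints /v1/schema, /v1/objects and /v1/graphql rather than the weaviate client package. It assumes the docker-compose setup above is running on localhost:8080 with anonymous access. The class name Article, the property title and all helper names are hypothetical. Because AUTOSCHEMA_ENABLED is 'false' in that setup, the class has to be created before objects are imported.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8080"  # assumption: port mapping from docker-compose.yml


def class_definition(name="Article"):
    """Schema class with one text property, vectorized by text2vec-contextionary."""
    return {
        "class": name,
        "vectorizer": "text2vec-contextionary",
        "properties": [{"name": "title", "dataType": ["text"]}],
    }


def near_text_query(class_name, concepts, limit=5):
    """Build a GraphQL Get query that uses the nearText operator."""
    # json.dumps renders the concepts as a double-quoted list, which is valid GraphQL.
    return (
        "{ Get { %s(nearText: {concepts: %s}, limit: %d) { title } } }"
        % (class_name, json.dumps(concepts), limit)
    )


def post_json(path, payload):
    """POST `payload` as JSON to the instance and return the decoded response."""
    request = urllib.request.Request(
        BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Possible call sequence against a running instance (hypothetical data):
# post_json("/v1/schema", class_definition())
# post_json("/v1/objects", {"class": "Article", "properties": {"title": "Semantic search"}})
# post_json("/v1/graphql", {"query": near_text_query("Article", ["search engine"])})
```

The payload builders are kept separate from the HTTP call so they can be inspected or tested without a server.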
Integration tests
Videos