Difference between revisions of "Weaviate"
(10 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
+ | * https://github.com/semi-technologies/weaviate | ||
+ | |||
<youtube>https://youtu.be/3NfcAF4qm2k</youtube> | <youtube>https://youtu.be/3NfcAF4qm2k</youtube> | ||
see | see | ||
Line 8: | Line 10: | ||
Fill with dictionary "e.g. english". | Fill with dictionary "e.g. english". | ||
− | + | <youtube>SDOl9fRObVg</youtube> | |
* https://stackoverflow.com/questions/27652335/semantic-search-engines/60361878#60361878 | * https://stackoverflow.com/questions/27652335/semantic-search-engines/60361878#60361878 | ||
* https://www.semi.technology/news/fosdem-2020.html | * https://www.semi.technology/news/fosdem-2020.html | ||
Line 18: | Line 20: | ||
* http://wiki.bitplan.com/index.php/Weaviate | * http://wiki.bitplan.com/index.php/Weaviate | ||
* https://hackernoon.com/how-weaviates-graphql-api-was-designed-t93932tl | * https://hackernoon.com/how-weaviates-graphql-api-was-designed-t93932tl | ||
+ | * https://medium.com/semi-technologies/finding-answers-in-complex-standardizations-documents-using-weaviates-semantic-search-modules-4dcdb83527fb | ||
+ | = Versions = | ||
+ | * https://medium.com/semi-technologies/weaviate-version-1-2-x-now-supports-transformer-models-4a12d858cce3 | ||
+ | |||
+ | = Presentations = | ||
+ | <youtube>DCQWqMecdlA</youtube> | ||
+ | <youtube>Ufs0cU-gaeM</youtube> | ||
+ | |||
= Installation = | = Installation = | ||
− | see https:// | + | see https://weaviate.io/developers/weaviate/installation |
+ | == Setup, schema, importing data and GraphQL == | ||
+ | <youtube>L0Y_zZR8FRI</youtube> | ||
== Docker == | == Docker == | ||
=== Prerequisites === | === Prerequisites === | ||
Line 27: | Line 39: | ||
=== Install Weaviate ===
− | <source lang='bash'> | + | <source lang='bash' highlight='1,9,10'> |
− | + | git clone https://github.com/weaviate/weaviate.git | |
− | + | Cloning into 'weaviate'... | |
− | + | remote: Enumerating objects: 124394, done. | |
− | + | remote: Counting objects: 100% (14150/14150), done. | |
− | docker-compose | + | remote: Compressing objects: 100% (1626/1626), done. |
+ | remote: Total 124394 (delta 12845), reused 13458 (delta 12510), pack-reused 110244 | ||
+ | Receiving objects: 100% (124394/124394), 945.83 MiB | 33.55 MiB/s, done. | ||
+ | Resolving deltas: 100% (86422/86422), done. | ||
+ | cd weaviate | ||
+ | docker build --target weaviate -t weaviate . | ||
+ | docker compose up | ||
+ | </source> | ||
+ | |||
+ | == docker-compose.yml == | ||
+ | <source lang='yaml'> | ||
+ | version: '3.4' | ||
+ | services: | ||
+ | weaviate: | ||
+ | image: weaviate | ||
+ | ports: | ||
+ | - 8080:8080 | ||
+ | environment: | ||
+ | CONTEXTIONARY_URL: contextionary:9999 | ||
+ | QUERY_DEFAULTS_LIMIT: 25 | ||
+ | AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true' | ||
+ | PERSISTENCE_DATA_PATH: './data' | ||
+ | ENABLE_MODULES: 'text2vec-contextionary' | ||
+ | DEFAULT_VECTORIZER_MODULE: 'text2vec-contextionary' | ||
+ | AUTOSCHEMA_ENABLED: 'false' | ||
+ | contextionary: | ||
+ | environment: | ||
+ | OCCURRENCE_WEIGHT_LINEAR_FACTOR: 0.75 | ||
+ | EXTENSIONS_STORAGE_MODE: weaviate | ||
+ | EXTENSIONS_STORAGE_ORIGIN: http://localhost:8080 | ||
+ | NEIGHBOR_OCCURRENCE_IGNORE_PERCENTILE: 5 | ||
+ | ENABLE_COMPOUND_SPLITTING: 'false' | ||
+ | image: semitechnologies/contextionary:en0.16.0-v1.2.1 | ||
</source> | </source> | ||
Line 43: | Line 87: | ||
=== Integration tests === | === Integration tests === | ||
* https://github.com/semi-technologies/weaviate-python-client/blob/master/integration/client_functions.py | * https://github.com/semi-technologies/weaviate-python-client/blob/master/integration/client_functions.py | ||
+ | |||
= Videos = | = Videos = | ||
<youtube>Gd5Fx0fOdfA</youtube> | <youtube>Gd5Fx0fOdfA</youtube> | ||
<youtube>tKGRyzelDjU</youtube> | <youtube>tKGRyzelDjU</youtube> |
Latest revision as of 10:19, 10 June 2023
see
- https://www.semi.technology/news/fosdem-2020.html
- https://www.semi.technology/documentation/weaviate/current/about/philosophy.html#about-the-contextionary
- Word2vec
- BERT
- GloVe
Fill with a dictionary, e.g. English.
- https://stackoverflow.com/questions/27652335/semantic-search-engines/60361878#60361878
- https://www.semi.technology/news/fosdem-2020.html
- https://en.wikipedia.org/wiki/Bob_van_Luijt
- Weaviate playground
- https://www.semi.technology/documentation/weaviate/current/about/philosophy.html#about-the-contextionary
- https://nlp.stanford.edu/pubs/glove.pdf
- http://wiki.bitplan.com/index.php/Weaviate
- https://hackernoon.com/how-weaviates-graphql-api-was-designed-t93932tl
- https://medium.com/semi-technologies/finding-answers-in-complex-standardizations-documents-using-weaviates-semantic-search-modules-4dcdb83527fb
Versions
Presentations
Installation
see https://weaviate.io/developers/weaviate/installation
Setup, schema, importing data and GraphQL
Docker
Prerequisites
sudo apt-get install curl docker.io docker-compose
Install Weaviate
git clone https://github.com/weaviate/weaviate.git
Cloning into 'weaviate'...
remote: Enumerating objects: 124394, done.
remote: Counting objects: 100% (14150/14150), done.
remote: Compressing objects: 100% (1626/1626), done.
remote: Total 124394 (delta 12845), reused 13458 (delta 12510), pack-reused 110244
Receiving objects: 100% (124394/124394), 945.83 MiB | 33.55 MiB/s, done.
Resolving deltas: 100% (86422/86422), done.
cd weaviate
docker build --target weaviate -t weaviate .
docker compose up
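Once the containers are up, it can help to confirm that the instance is actually answering before importing anything. The following is a minimal sketch, assuming the port mapping 8080:8080 used on this page and Weaviate's readiness endpoint /v1/.well-known/ready; the helper names ready_url and is_ready are made up for illustration.

```python
import urllib.error
import urllib.request


def ready_url(base_url: str) -> str:
    """Build the readiness-probe URL for a Weaviate base URL."""
    return base_url.rstrip("/") + "/v1/.well-known/ready"


def is_ready(base_url: str = "http://localhost:8080", timeout: float = 2.0) -> bool:
    """Return True if the instance answers the readiness probe with HTTP 200."""
    try:
        with urllib.request.urlopen(ready_url(base_url), timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status: not ready (yet).
        return False
```

Calling is_ready() right after docker compose up may return False for a while, since the contextionary container has to load its data first.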
docker-compose.yml
version: '3.4'
services:
  weaviate:
    image: weaviate
    ports:
      - 8080:8080
    environment:
      CONTEXTIONARY_URL: contextionary:9999
      QUERY_DEFAULTS_LIMIT: 25
      AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'
      PERSISTENCE_DATA_PATH: './data'
      ENABLE_MODULES: 'text2vec-contextionary'
      DEFAULT_VECTORIZER_MODULE: 'text2vec-contextionary'
      AUTOSCHEMA_ENABLED: 'false'
  contextionary:
    environment:
      OCCURRENCE_WEIGHT_LINEAR_FACTOR: 0.75
      EXTENSIONS_STORAGE_MODE: weaviate
      EXTENSIONS_STORAGE_ORIGIN: http://localhost:8080
      NEIGHBOR_OCCURRENCE_IGNORE_PERCENTILE: 5
      ENABLE_COMPOUND_SPLITTING: 'false'
    image: semitechnologies/contextionary:en0.16.0-v1.2.1
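Because AUTOSCHEMA_ENABLED is 'false' in this configuration, a class has to exist in the schema before objects can be imported. A minimal sketch of preparing such a class via the REST endpoint POST /v1/schema follows. The class name "Article", its properties, and the helper names are hypothetical and only serve as illustration; the vectorizer value is the one from the compose file above.

```python
import json
import urllib.request


def article_class() -> dict:
    """Class definition for a hypothetical 'Article' class."""
    return {
        "class": "Article",
        "description": "Hypothetical example class",
        "vectorizer": "text2vec-contextionary",
        "properties": [
            {"name": "title", "dataType": ["text"]},
            {"name": "body", "dataType": ["text"]},
        ],
    }


def schema_request(base_url: str, class_def: dict) -> urllib.request.Request:
    """Prepare (but do not send) the POST /v1/schema request."""
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/schema",
        data=json.dumps(class_def).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

To actually create the class, the prepared request would be sent with urllib.request.urlopen(schema_request("http://localhost:8080", article_class())) against a running instance.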
Installation Issue
Python Library
sample code
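As a rough illustration of what a semantic query with the Python library can look like, here is a minimal sketch. It assumes the v3 API of the weaviate-client package (weaviate.Client and the chained query builder); newer client versions use a different API. The function names and the class "Article" with property "title" are hypothetical.

```python
def near_text_search(client, class_name, properties, concepts, limit=5):
    """Run a nearText GraphQL Get query via a v3-style weaviate client."""
    return (
        client.query
        .get(class_name, properties)
        .with_near_text({"concepts": concepts})
        .with_limit(limit)
        .do()
    )


def connect(url="http://localhost:8080"):
    """Create a client; imported lazily so this module loads without the package."""
    import weaviate  # assumption: installed via pip install "weaviate-client<4"
    return weaviate.Client(url)
```

A call such as near_text_search(connect(), "Article", ["title"], ["semantic search"]) would then return the raw GraphQL response as a dictionary, assuming the class exists and contains data.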
Integration tests
Videos