GROBID
GlossaryEntry | |
---|---|
responsible | |
state | |
since | 2009 |
description | GROBID (or Grobid, but not GroBid nor GroBiD) means GeneRation Of BIbliographic Data. |
references | |
lang | en |
master | GROBID |
Links
- https://github.com/kermitt2/grobid
- https://github.com/kermitt2/grobid-client-python
- https://www.confident-conference.org/r/grobid/
Installation
git clone https://github.com/kermitt2/grobid.git
cd grobid
./gradlew clean install test
Running service
./gradlew run
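Once the service is running, a quick liveness check confirms it is reachable. The following is a minimal Python sketch, assuming the service listens on port 8070 (the port used in the proxy configuration on this page) and exposes the `/api/isalive` endpoint described in the GROBID service documentation. The helper names `grobid_url` and `is_alive` are illustrative and not part of GROBID or its Python client.

```python
import urllib.request

# Assumption: default local address, matching the ProxyPass target on this page.
DEFAULT_BASE = "http://localhost:8070"


def grobid_url(base: str, endpoint: str) -> str:
    """Join a base URL and a GROBID REST endpoint name under /api/."""
    return f"{base.rstrip('/')}/api/{endpoint.lstrip('/')}"


def is_alive(base: str = DEFAULT_BASE, timeout: float = 5.0, fetch=None) -> bool:
    """Return True if the service answers 'true' on /api/isalive.

    `fetch` is an optional callable(url) -> str, injectable for testing;
    by default the URL is fetched with urllib.
    """
    def _default_fetch(url: str) -> str:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.read().decode("utf-8")

    fetch = fetch or _default_fetch
    try:
        return fetch(grobid_url(base, "isalive")).strip().lower() == "true"
    except OSError:
        # urllib.error.URLError is a subclass of OSError, so this also
        # covers refused connections and timeouts.
        return False
```

Calling `is_alive()` checks the local instance. Passing another base URL, for example `is_alive("https://grobid.bitplan.com")`, checks an instance behind a proxy.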
Apache Configuration
a2ensite grobid
systemctl reload apache2
With domain names
See https://grobid.bitplan.com
#
# grobid.bitplan.com Server configuration
# 2020-07-04 see https://grobid.readthedocs.io/en/latest/Grobid-service/
#
# see http://stackoverflow.com/a/13089668/1497139
<VirtualHost *:80>
ServerAlias www.grobid.bitplan.com
ProxyPreserveHost On
ProxyRequests Off
ServerName grobid.bitplan.com
ProxyPass / http://localhost:8070/
ProxyPassReverse / http://localhost:8070/
Header add "Access-Control-Allow-Origin" "*"
</VirtualHost>
<VirtualHost *:443>
ServerAlias secure.grobid.bitplan.com
ProxyPreserveHost On
ProxyRequests Off
ServerName grobid.bitplan.com
ProxyPass / http://localhost:8070/
ProxyPassReverse / http://localhost:8070/
Header add "Access-Control-Allow-Origin" "*"
Include ssl.conf
</VirtualHost>
Install as service with Ansible playbook
https://github.com/TIBHannover/confiDent-Ansible-VMs/tree/master/playbooks/grobid
Service file
/etc/systemd/system/grobid.service
[Unit]
Description=Grobid
After=network.target
Wants=network.target
[Service]
WorkingDirectory=/usr/local/bin/
ExecStart=/usr/local/bin/grobid-installation/grobid-service/bin/grobid-service server /usr/local/bin/grobid-installation/grobid-service/config/config.yaml
Restart=on-abort
[Install]
WantedBy=multi-user.target
Apache2 proxy pass
Redirect /r/grobid /r/grobid/
ProxyPass /r/grobid/ http://localhost:8070/
ProxyPassReverse /r/grobid/ http://localhost:8070/
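To make the prefix mapping concrete, here is a simplified Python model of what the `ProxyPass` line does to a request path. It is an illustration only: `proxy_target` is a hypothetical helper, not part of Apache or GROBID, and it ignores query strings as well as the `Redirect` and `ProxyPassReverse` directives.

```python
from typing import Optional

# Values copied from the ProxyPass line above.
PREFIX = "/r/grobid/"
BACKEND = "http://localhost:8070/"


def proxy_target(path: str, prefix: str = PREFIX, backend: str = BACKEND) -> Optional[str]:
    """Return the backend URL a prefix-style ProxyPass mapping would produce.

    Returns None when the request path lies outside the proxied prefix.
    """
    if not path.startswith(prefix):
        return None
    # Strip the public prefix and append the remainder to the backend URL.
    return backend + path[len(prefix):]
```

Under this model, a request for `/r/grobid/api/isalive` on the public host is forwarded to `http://localhost:8070/api/isalive`, while `/r/grobid` (no trailing slash) does not match the prefix, which is why the configuration first redirects it to `/r/grobid/`.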
References
- Patrice Lopez (2009). "GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications". ECDL 2009: Research and Advanced Technology for Digital Libraries, pp. 473-474. doi: 10.1007/978-3-642-04346-8_62. url: https://core.ac.uk/download/pdf/38300913.pdf