Difference between revisions of "ConferenceCorpus"
(→Issues) |
|||
(6 intermediate revisions by the same user not shown) | |||
Line 7: | Line 7: | ||
|title=Scientific Event Corpus | |title=Scientific Event Corpus | ||
|url=https://github.com/WolfgangFahl/ConferenceCorpus | |url=https://github.com/WolfgangFahl/ConferenceCorpus | ||
− | |version=0.0 | + | |version=0.1.0 |
− | |date= | + | |date=2022-11-20 |
|since=2021-07-26 | |since=2021-07-26 | ||
|storemode=property | |storemode=property | ||
|name=ConferenceCorpus | |name=ConferenceCorpus | ||
}} | }} | ||
− | = | + | = What Links Here = |
+ | {{WhatLinksHere}} | ||
= Installation = | = Installation = | ||
== via pip == | == via pip == | ||
Line 34: | Line 35: | ||
* https://conferencecorpus.bitplan.com/eventseries/ISWC?format=json | * https://conferencecorpus.bitplan.com/eventseries/ISWC?format=json | ||
* https://conferencecorpus.bitplan.com/eventseries/ISWC?format=html | * https://conferencecorpus.bitplan.com/eventseries/ISWC?format=html | ||
+ | == Database View with Sqlite == | ||
+ | The EventCorpus.db is in Sqlite format. | ||
+ | === using sqlite-web === | ||
+ | <source lang='bash'> | ||
+ | pip install sqlite-web | ||
+ | sqlite_web $HOME/.conferencecorpus/EventCorpus.db | ||
+ | </source> | ||
+ | There is convenience script [https://github.com/WolfgangFahl/ConferenceCorpus/tree/main/scripts/ccsqliteweb ccsqliteweb] available in the [https://github.com/WolfgangFahl/ConferenceCorpus/tree/main/scripts scripts] directory which will also kill an existing sqlite_web EventCorpus.db process and run the server in background using nohup. | ||
== Command Line == | == Command Line == | ||
Line 73: | Line 82: | ||
title | title | ||
ConfIDent Event | ConfIDent Event | ||
− | + | 2022-05-11 | |
− | [[https://projects.tib.eu/en/confident/ © 2019- | + | [[https://projects.tib.eu/en/confident/ © 2019-2022 ConfIDent project and Wolfgang Fahl]] |
− | see also [[http:// | + | see also [[http://cc.bitplan.com Conference Corpus]] |
end title | end title | ||
Line 84: | Line 93: | ||
country : TEXT | country : TEXT | ||
eventId : TEXT | eventId : TEXT | ||
+ | lookupAcronym : TEXT | ||
+ | ordinal : INTEGER | ||
+ | region : TEXT | ||
source : TEXT | source : TEXT | ||
title : TEXT | title : TEXT | ||
− | |||
year : INTEGER | year : INTEGER | ||
} | } | ||
class event_confref << Entity >> { | class event_confref << Entity >> { | ||
area : TEXT | area : TEXT | ||
+ | cityWikidataid : TEXT | ||
+ | countryIso : TEXT | ||
+ | countryWikidataid : TEXT | ||
dblpSeriesId : TEXT | dblpSeriesId : TEXT | ||
endDate : TEXT | endDate : TEXT | ||
keywords : TEXT | keywords : TEXT | ||
+ | location : TEXT | ||
ranks : TEXT | ranks : TEXT | ||
+ | regionIso : TEXT | ||
+ | regionWikidataid : TEXT | ||
seriesId : TEXT | seriesId : TEXT | ||
seriesTitle : TEXT | seriesTitle : TEXT | ||
startDate : TEXT | startDate : TEXT | ||
− | submissionExtended : | + | submissionExtended : INTEGER |
+ | url : TEXT | ||
} | } | ||
− | + | class event_gnd << Entity >> { | |
− | + | acronymCount : INTEGER | |
− | + | acronyms : TEXT | |
− | |||
− | class | ||
cityWikidataid : TEXT | cityWikidataid : TEXT | ||
+ | countryIso : TEXT | ||
countryWikidataid : TEXT | countryWikidataid : TEXT | ||
− | + | date : TEXT | |
+ | dateCount : INTEGER | ||
endDate : DATE | endDate : DATE | ||
+ | event : TEXT | ||
+ | fulltitle : TEXT | ||
+ | homepage : TEXT | ||
location : TEXT | location : TEXT | ||
− | + | organization : TEXT | |
− | + | place : TEXT | |
− | + | placeCount : INTEGER | |
− | + | places : TEXT | |
+ | regionIso : TEXT | ||
regionWikidataid : TEXT | regionWikidataid : TEXT | ||
− | |||
startDate : DATE | startDate : DATE | ||
− | + | variant : TEXT | |
− | + | variantCount : INTEGER | |
− | + | variants : TEXT | |
} | } | ||
class event_wikicfp << Entity >> { | class event_wikicfp << Entity >> { | ||
Final_Version_Due : TEXT | Final_Version_Due : TEXT | ||
Line 191: | Line 147: | ||
Submission_Deadline : TIMESTAMP | Submission_Deadline : TIMESTAMP | ||
cityWikidataid : TEXT | cityWikidataid : TEXT | ||
+ | countryIso : TEXT | ||
countryWikidataid : TEXT | countryWikidataid : TEXT | ||
deleted : INTEGER | deleted : INTEGER | ||
Line 196: | Line 153: | ||
eventType : TEXT | eventType : TEXT | ||
locality : TEXT | locality : TEXT | ||
− | + | regionIso : TEXT | |
regionWikidataid : TEXT | regionWikidataid : TEXT | ||
series : TEXT | series : TEXT | ||
seriesId : TEXT | seriesId : TEXT | ||
startDate : TIMESTAMP | startDate : TIMESTAMP | ||
+ | url : TEXT | ||
wikiCfpId : INTEGER | wikiCfpId : INTEGER | ||
} | } | ||
− | + | class event_orclone << Entity >> { | |
− | + | DblpConferenceId : TEXT | |
− | + | ISBN : TEXT | |
− | + | TibKatId : TEXT | |
+ | acceptedPapers : INTEGER | ||
+ | creationDate : TIMESTAMP | ||
+ | endDate : TIMESTAMP | ||
+ | eventType : TEXT | ||
+ | homepage : TEXT | ||
+ | inEventSeries : TEXT | ||
+ | lastEditor : TEXT | ||
+ | modificationDate : TIMESTAMP | ||
+ | pageTitle : TEXT <<PK>> | ||
+ | startDate : TIMESTAMP | ||
+ | submittedPapers : INTEGER | ||
+ | url : TEXT | ||
+ | wikidataId : TEXT | ||
+ | yearStr : TEXT | ||
+ | } | ||
+ | class event_tibkat << Entity >> { | ||
+ | alternativeTitles : TEXT | ||
+ | authorGndId : TEXT | ||
+ | bk : TEXT | ||
+ | changeDate : TEXT | ||
+ | cityWikidataid : TEXT | ||
+ | corporateCreatorNames : TEXT | ||
+ | corporateCreatorTypes : TEXT | ||
+ | countryIso : TEXT | ||
+ | countryWikidataid : TEXT | ||
+ | databaseDate : TEXT | ||
+ | dates : TEXT | ||
+ | ddc : TEXT | ||
+ | description : TEXT | ||
+ | documentGenreCode : TEXT | ||
+ | documentId : TEXT | ||
+ | documentTypeCode : TEXT | ||
+ | doi : TEXT | ||
+ | ean : TEXT | ||
+ | endDate : DATE | ||
+ | event : TEXT | ||
+ | firstid : TEXT | ||
+ | ftxCreationDate : TEXT | ||
+ | gndIds : TEXT | ||
+ | isbn : TEXT | ||
+ | isbn13 : TEXT | ||
+ | journalTitle : TEXT | ||
+ | journalVolumeNumber : TEXT | ||
+ | location : TEXT | ||
+ | ppn : TEXT | ||
+ | publisher : TEXT | ||
+ | pubplace : TEXT | ||
+ | pubyear : TEXT | ||
+ | regionIso : TEXT | ||
+ | regionWikidataid : TEXT | ||
+ | sponsorGndId : TEXT | ||
+ | startDate : DATE | ||
+ | } | ||
class event_dblp << Entity >> { | class event_dblp << Entity >> { | ||
booktitle : TEXT | booktitle : TEXT | ||
cityWikidataid : TEXT | cityWikidataid : TEXT | ||
+ | countryIso : TEXT | ||
countryWikidataid : TEXT | countryWikidataid : TEXT | ||
doi : TEXT | doi : TEXT | ||
ee : TEXT | ee : TEXT | ||
+ | endDate : TIMESTAMP | ||
isbn : TEXT | isbn : TEXT | ||
location : TEXT | location : TEXT | ||
mdate : TEXT | mdate : TEXT | ||
publicationSeries : TEXT | publicationSeries : TEXT | ||
− | + | regionIso : TEXT | |
regionWikidataid : TEXT | regionWikidataid : TEXT | ||
series : TEXT | series : TEXT | ||
+ | startDate : TIMESTAMP | ||
+ | url : TEXT | ||
} | } | ||
− | + | class event_crossref << Entity >> { | |
− | + | cityWikidataid : TEXT | |
− | + | countryIso : TEXT | |
− | + | countryWikidataid : TEXT | |
− | class | + | doi : TEXT |
− | + | endDate : DATE | |
− | + | location : TEXT | |
+ | month : INTEGER | ||
+ | name : TEXT | ||
+ | number : TEXT | ||
+ | regionIso : TEXT | ||
+ | regionWikidataid : TEXT | ||
+ | sponsor : TEXT | ||
+ | startDate : DATE | ||
+ | theme : TEXT | ||
+ | url : TEXT | ||
+ | } | ||
+ | class event_wikidata << Entity >> { | ||
+ | cityWikidataid : TEXT | ||
+ | countryId : TEXT | ||
+ | countryIso : TEXT | ||
+ | countryWikidataid : TEXT | ||
+ | dblpId : TEXT | ||
+ | describedAtUrl : TEXT | ||
+ | doi : TEXT | ||
endDate : TIMESTAMP | endDate : TIMESTAMP | ||
− | + | eventInSeries : TEXT | |
+ | eventInSeriesId : TEXT | ||
+ | eventTitle : TEXT | ||
+ | followedById : TEXT | ||
+ | gndId : TEXT | ||
homepage : TEXT | homepage : TEXT | ||
− | + | language : TEXT | |
− | + | location : TEXT | |
− | + | locationId : TEXT | |
− | + | mainSubject : TEXT | |
− | + | ppn : TEXT | |
− | + | proceedings : TEXT | |
+ | proceedingsLabel : TEXT | ||
+ | regionIso : TEXT | ||
+ | regionWikidataid : TEXT | ||
startDate : TIMESTAMP | startDate : TIMESTAMP | ||
− | + | url : TEXT | |
− | + | wikiCfpId : TEXT | |
} | } | ||
+ | Event <|-- event_confref | ||
+ | Event <|-- event_gnd | ||
+ | Event <|-- event_wikicfp | ||
Event <|-- event_orclone | Event <|-- event_orclone | ||
− | Event <|-- | + | Event <|-- event_tibkat |
− | Event <|-- | + | Event <|-- event_dblp |
Event <|-- event_crossref | Event <|-- event_crossref | ||
Event <|-- event_wikidata | Event <|-- event_wikidata | ||
} | } | ||
− | |||
' BITPlan Corporate identity skin params | ' BITPlan Corporate identity skin params | ||
' Copyright (c) 2015-2020 BITPlan GmbH | ' Copyright (c) 2015-2020 BITPlan GmbH | ||
Line 581: | Line 602: | ||
' end of skinparams ' | ' end of skinparams ' | ||
</uml> | </uml> | ||
+ | = Updating the database = | ||
+ | == Openresearch == | ||
+ | <source lang='bash'> | ||
+ | scripts/getbackup | ||
+ | </source> | ||
+ | gets a copy of the nightly OpenResearch backups | ||
+ | |||
= Issues = | = Issues = | ||
# {{Ticket | # {{Ticket |
Latest revision as of 08:17, 18 November 2023
OsProject
OsProject | |
---|---|
id | ConferenceCorpus |
state | active |
owner | WolfgangFahl |
title | Scientific Event Corpus |
url | https://github.com/WolfgangFahl/ConferenceCorpus |
version | 0.1.0 |
description | |
date | 2022-11-20 |
since | 2021-07-26 |
until |
What Links Here
Installation
via pip
pip install ConferenceCorpus
# alternatively if your pip is not a python3 pip
pip3 install ConferenceCorpus
upgrade
pip install ConferenceCorpus -U
# alternatively if your pip is not a python3 pip
pip3 install ConferenceCorpus -U
Usage
RESTful API
Examples
- https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=json
- https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=html
- https://conferencecorpus.bitplan.com/eventseries/ISWC?format=json
- https://conferencecorpus.bitplan.com/eventseries/ISWC?format=html
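The example URLs above share one pattern, so a client could build them from a series acronym. The following is a minimal Python sketch and not part of ConferenceCorpus itself: the helper names `event_series_url` and `fetch_event_series` are hypothetical, and since the structure of the JSON answer is not described here, the parsed result is returned unchanged.

```python
import json
import urllib.parse
import urllib.request

# Base URL taken from the examples above.
BASE_URL = "https://conferencecorpus.bitplan.com/eventseries"


def event_series_url(acronym: str, fmt: str = "json") -> str:
    """Build an event series URL following the pattern of the examples above."""
    query = urllib.parse.urlencode({"format": fmt})
    return f"{BASE_URL}/{urllib.parse.quote(acronym)}?{query}"


def fetch_event_series(acronym: str, timeout: float = 30.0):
    """Fetch and parse the JSON answer for a series acronym (needs network access)."""
    with urllib.request.urlopen(event_series_url(acronym), timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    print(event_series_url("ISWC"))
```

Calling `fetch_event_series("ISWC")` would request the third example URL above; whether the answer is a list or a dictionary has to be checked against the live service.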
Database View with SQLite
The EventCorpus.db database is in SQLite format.
using sqlite-web
pip install sqlite-web
sqlite_web $HOME/.conferencecorpus/EventCorpus.db
There is a convenience script, ccsqliteweb, available in the scripts directory. It also kills any existing sqlite_web EventCorpus.db process and runs the server in the background using nohup.
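As an alternative to the web view, the database file could also be inspected from Python with the standard `sqlite3` module. The sketch below only assumes the file location used in the `sqlite_web` call above; the helper name `list_tables` is hypothetical, and no table or column names are assumed because they are read from the file itself.

```python
import sqlite3
from pathlib import Path

# Default location taken from the sqlite_web call above.
DEFAULT_DB = Path.home() / ".conferencecorpus" / "EventCorpus.db"


def list_tables(db_path) -> dict:
    """Map each table name in the SQLite file to its list of column names."""
    connection = sqlite3.connect(str(db_path))
    try:
        names = [
            row[0]
            for row in connection.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            )
        ]
        tables = {}
        for name in names:
            # PRAGMA table_info yields one row per column; index 1 is the column name.
            # Double quotes inside the identifier are escaped by doubling them.
            quoted = name.replace('"', '""')
            columns = connection.execute(f'PRAGMA table_info("{quoted}")').fetchall()
            tables[name] = [column[1] for column in columns]
        return tables
    finally:
        connection.close()


if __name__ == "__main__":
    if DEFAULT_DB.exists():
        for table, columns in list_tables(DEFAULT_DB).items():
            print(table, columns)
    else:
        print(f"{DEFAULT_DB} not found")
```

The existence check in the main block matters because `sqlite3.connect` would otherwise create an empty file at a missing path.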
Command Line
aelookup -h
usage: aelookup [-h] [-d] [-e ENDPOINT] [-v] [-u] [-f]
                [--datasources DATASOURCES]

Scientific Event Corpus and Lookup

  Created by Wolfgang Fahl on 2020-06-22.
  Copyright 2020-2021 Wolfgang Fahl. All rights reserved.

  Licensed under the Apache License 2.0
  http://www.apache.org/licenses/LICENSE-2.0

  Distributed on an "AS IS" basis without warranties
  or conditions of any kind, either express or implied.

USAGE

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           show debug info
  -e ENDPOINT, --endpoint ENDPOINT
                        SPARQL endpoint to use for wikidata queries
  -v, --version         show program's version number and exit
  -u, --uml             output plantuml diagram markup
  -f, --force           force Update - may take quite a time
  --datasources DATASOURCES
                        , delimited list of datasource lookup ids
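The options listed in the help text could also be driven from a script. The following Python sketch assembles an `aelookup` call using only flags shown above; the helper name `build_aelookup_args` is hypothetical, and because the help text does not list the valid datasource lookup ids, any ids passed in are the caller's responsibility.

```python
import shutil
import subprocess


def build_aelookup_args(datasources=None, uml=False, force=False, debug=False):
    """Assemble an aelookup command line from the options in the help text above."""
    args = ["aelookup"]
    if debug:
        args.append("--debug")
    if uml:
        args.append("--uml")
    if force:
        args.append("--force")
    if datasources:
        # The help text describes a ","-delimited list of datasource lookup ids.
        args += ["--datasources", ",".join(datasources)]
    return args


if __name__ == "__main__":
    command = build_aelookup_args(uml=True)
    if shutil.which("aelookup"):
        # Prints whatever aelookup writes to standard output for the --uml option.
        result = subprocess.run(command, capture_output=True, text=True, check=False)
        print(result.stdout)
    else:
        print("aelookup is not installed; would run:", " ".join(command))
```

Passing the arguments as a list avoids shell quoting issues; `--force` is left off by default since the help text warns that a forced update may take quite a while.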
Overview
Datasources
You might want to open the diagrams in a new tab to be able to click the links depicted.
Event
EventSeries
Updating the database
OpenResearch
scripts/getbackup
This gets a copy of the nightly OpenResearch backups.
Issues
- Issue 33 - Event series completion
- Issue 32 - regression TemplateNotFound: fb4common/base.html
- Issue 31 - Provide RDF export of the data
- Issue 30 - add ordinal distribution query ✓
- Issue 29 - add scholar RESTFul API
- Issue 28 - add generic search for scholarly items
- Issue 27 - openresearch results missing in multiquery
- Issue 26 - add bib file import
- Issue 25 - make multiquery result available via webapi with content negotiation ✓
- Issue 24 - allow updating the database via webserver
- Issue 23 - dictOfLod Lookup result via commandline
- Issue 22 - add multi query option ✓
- Issue 21 - add Webserver ✓
- Issue 20 - Work around upstream Nominatim OSM Pythontools issue ✓
- Issue 19 - Update Openresearch Samples ✓
- Issue 18 - Update requirements.txt ✓
- Issue 17 - include ACM digital library as a source
- Issue 16 - Steps towards csv upload ✓
- Issue 15 - Filter obviously invalid Series and Event entries ✓
- Issue 14 - wikiCFP 500 Internal Server and TimeOut Error Handling ✓
- Issue 12 - Relevant FTX fields
- Issue 11 - Locality fixes
- Issue 10 - OpenResearch export option
- Issue 9 - offline access to EventCorpus.db
- Issue 8 - migrate confref data from Proceedings Title Parser here ✓
- Issue 7 - migrate crossref data from proceedings title parser here ✓
- Issue 6 - migrate dblp data source here from ptp and dblpconf ✓
- Issue 5 - dblp xml parser skips some proceedings titles
- Issue 4 - add commandline interface to CorpusLookup ✓
- Issue 3 - add python api doc ✓
- Issue 2 - Cache all SQL tables in the same SQLite database in a ".conferencecorpus" directory ✓
- Issue 1 - There should be a common set of attributes for Event and EventSeries from different datasources