Difference between revisions of "ConferenceCorpus"

From BITPlan Wiki
Jump to navigation Jump to search
(Created page with "{{OsProject |name=ConferenceCorpus |storemode=property }}")
 
 
(35 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
=OsProject=
 +
 
{{OsProject
 
{{OsProject
 +
|id=ConferenceCorpus
 +
|state=active
 +
|owner=WolfgangFahl
 +
|title=Scientific Event Corpus
 +
|url=https://github.com/WolfgangFahl/ConferenceCorpus
 +
|version=0.1.0
 +
|date=2022-11-20
 +
|since=2021-07-26
 +
|storemode=property
 
|name=ConferenceCorpus
 
|name=ConferenceCorpus
|storemode=property
+
}}
 +
= What Links Here =
 +
{{WhatLinksHere}}
 +
= Installation =
 +
== via pip ==
 +
<source lang='bash'>
 +
pip install ConferenceCorpus
 +
# alternatively if your pip is not a python3 pip
 +
pip3 install ConferenceCorpus
 +
</source>
 +
=== upgrade ===
 +
<source lang='bash'>
 +
pip install ConferenceCorpus -U
 +
# alternatively if your pip is not a python3 pip
 +
pip3 install ConferenceCorpus -U
 +
</source>
 +
= Usage =
 +
== RESTFul API ==
 +
=== Examples ===
 +
* https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=json
 +
* https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=html
 +
* https://conferencecorpus.bitplan.com/eventseries/ISWC?format=json
 +
* https://conferencecorpus.bitplan.com/eventseries/ISWC?format=html
 +
== Database View with Sqlite ==
 +
The EventCorpus.db is in Sqlite format.
 +
=== using sqlite-web ===
 +
<source lang='bash'>
 +
pip install sqlite-web
 +
sqlite_web $HOME/.conferencecorpus/EventCorpus.db
 +
</source>
 +
There is convenience script [https://github.com/WolfgangFahl/ConferenceCorpus/tree/main/scripts/ccsqliteweb ccsqliteweb] available in the [https://github.com/WolfgangFahl/ConferenceCorpus/tree/main/scripts scripts] directory which will also kill an existing sqlite_web EventCorpus.db process and run the server in background using nohup.
 +
 
 +
== Command Line ==
 +
<source lang='bash'>
 +
aelookup -h
 +
usage: aelookup [-h] [-d] [-e ENDPOINT] [-v] [-u] [-f]
 +
                [--datasources DATASOURCES]
 +
 
 +
Scientific Event Corpus and Lookup
 +
 
 +
  Created by Wolfgang Fahl on 2020-06-22.
 +
  Copyright 2020-2021 Wolfgang Fahl. All rights reserved.
 +
 
 +
  Licensed under the Apache License 2.0
 +
  http://www.apache.org/licenses/LICENSE-2.0
 +
 
 +
  Distributed on an "AS IS" basis without warranties
 +
  or conditions of any kind, either express or implied.
 +
 
 +
USAGE
 +
 
 +
optional arguments:
 +
  -h, --help            show this help message and exit
 +
  -d, --debug          show debug info
 +
  -e ENDPOINT, --endpoint ENDPOINT
 +
                        SPARQL endpoint to use for wikidata queries
 +
  -v, --version        show program's version number and exit
 +
  -u, --uml            output plantuml diagram markup
 +
  -f, --force          force Update - may take quite a time
 +
  --datasources DATASOURCES
 +
                        , delimited list of datasource lookup ids
 +
</source>
 +
 
 +
= Overview =
 +
== Datasources ==
 +
You might want to open the diagrams in a new tab to be able to click the links depicted.
 +
=== Event ===
 +
<uml format='svg'>
 +
title
 +
ConfIDent  Event
 +
2022-05-11
 +
[[https://projects.tib.eu/en/confident/ © 2019-2022 ConfIDent project and Wolfgang Fahl]]
 +
see also [[http://cc.bitplan.com Conference Corpus]]
 +
 
 +
end title
 +
package DataSources {
 +
  class Event << Entity >> {
 +
  acronym : TEXT
 +
  city : TEXT
 +
  country : TEXT
 +
  eventId : TEXT
 +
  lookupAcronym : TEXT
 +
  ordinal : INTEGER
 +
  region : TEXT
 +
  source : TEXT
 +
  title : TEXT
 +
  year : INTEGER
 +
  }
 +
  class event_confref << Entity >> {
 +
  area : TEXT
 +
  cityWikidataid : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  dblpSeriesId : TEXT
 +
  endDate : TEXT
 +
  keywords : TEXT
 +
  location : TEXT
 +
  ranks : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  seriesId : TEXT
 +
  seriesTitle : TEXT
 +
  startDate : TEXT
 +
  submissionExtended : INTEGER
 +
  url : TEXT
 +
  }
 +
  class event_gnd << Entity >> {
 +
  acronymCount : INTEGER
 +
  acronyms : TEXT
 +
  cityWikidataid : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  date : TEXT
 +
  dateCount : INTEGER
 +
  endDate : DATE
 +
  event : TEXT
 +
  fulltitle : TEXT
 +
  homepage : TEXT
 +
  location : TEXT
 +
  organization : TEXT
 +
  place : TEXT
 +
  placeCount : INTEGER
 +
  places : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  startDate : DATE
 +
  variant : TEXT
 +
  variantCount : INTEGER
 +
  variants : TEXT
 +
  }
 +
  class event_wikicfp << Entity >> {
 +
  Final_Version_Due : TEXT
 +
  Notification_Due : TIMESTAMP
 +
  Submission_Deadline : TIMESTAMP
 +
  cityWikidataid : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  deleted : INTEGER
 +
  endDate : TIMESTAMP
 +
  eventType : TEXT
 +
  locality : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  series : TEXT
 +
  seriesId : TEXT
 +
  startDate : TIMESTAMP
 +
  url : TEXT
 +
  wikiCfpId : INTEGER
 +
  }
 +
  class event_orclone << Entity >> {
 +
  DblpConferenceId : TEXT
 +
  ISBN : TEXT
 +
  TibKatId : TEXT
 +
  acceptedPapers : INTEGER
 +
  creationDate : TIMESTAMP
 +
  endDate : TIMESTAMP
 +
  eventType : TEXT
 +
  homepage : TEXT
 +
  inEventSeries : TEXT
 +
  lastEditor : TEXT
 +
  modificationDate : TIMESTAMP
 +
  pageTitle : TEXT <<PK>>
 +
  startDate : TIMESTAMP
 +
  submittedPapers : INTEGER
 +
  url : TEXT
 +
  wikidataId : TEXT
 +
  yearStr : TEXT
 +
  }
 +
  class event_tibkat << Entity >> {
 +
  alternativeTitles : TEXT
 +
  authorGndId : TEXT
 +
  bk : TEXT
 +
  changeDate : TEXT
 +
  cityWikidataid : TEXT
 +
  corporateCreatorNames : TEXT
 +
  corporateCreatorTypes : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  databaseDate : TEXT
 +
  dates : TEXT
 +
  ddc : TEXT
 +
  description : TEXT
 +
  documentGenreCode : TEXT
 +
  documentId : TEXT
 +
  documentTypeCode : TEXT
 +
  doi : TEXT
 +
  ean : TEXT
 +
  endDate : DATE
 +
  event : TEXT
 +
  firstid : TEXT
 +
  ftxCreationDate : TEXT
 +
  gndIds : TEXT
 +
  isbn : TEXT
 +
  isbn13 : TEXT
 +
  journalTitle : TEXT
 +
  journalVolumeNumber : TEXT
 +
  location : TEXT
 +
  ppn : TEXT
 +
  publisher : TEXT
 +
  pubplace : TEXT
 +
  pubyear : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  sponsorGndId : TEXT
 +
  startDate : DATE
 +
  }
 +
  class event_dblp << Entity >> {
 +
  booktitle : TEXT
 +
  cityWikidataid : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  doi : TEXT
 +
  ee : TEXT
 +
  endDate : TIMESTAMP
 +
  isbn : TEXT
 +
  location : TEXT
 +
  mdate : TEXT
 +
  publicationSeries : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  series : TEXT
 +
  startDate : TIMESTAMP
 +
  url : TEXT
 +
  }
 +
  class event_crossref << Entity >> {
 +
  cityWikidataid : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  doi : TEXT
 +
  endDate : DATE
 +
  location : TEXT
 +
  month : INTEGER
 +
  name : TEXT
 +
  number : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  sponsor : TEXT
 +
  startDate : DATE
 +
  theme : TEXT
 +
  url : TEXT
 +
  }
 +
  class event_wikidata << Entity >> {
 +
  cityWikidataid : TEXT
 +
  countryId : TEXT
 +
  countryIso : TEXT
 +
  countryWikidataid : TEXT
 +
  dblpId : TEXT
 +
  describedAtUrl : TEXT
 +
  doi : TEXT
 +
  endDate : TIMESTAMP
 +
  eventInSeries : TEXT
 +
  eventInSeriesId : TEXT
 +
  eventTitle : TEXT
 +
  followedById : TEXT
 +
  gndId : TEXT
 +
  homepage : TEXT
 +
  language : TEXT
 +
  location : TEXT
 +
  locationId : TEXT
 +
  mainSubject : TEXT
 +
  ppn : TEXT
 +
  proceedings : TEXT
 +
  proceedingsLabel : TEXT
 +
  regionIso : TEXT
 +
  regionWikidataid : TEXT
 +
  startDate : TIMESTAMP
 +
  url : TEXT
 +
  wikiCfpId : TEXT
 +
  }
 +
  Event <|-- event_confref
 +
  Event <|-- event_gnd
 +
  Event <|-- event_wikicfp
 +
  Event <|-- event_orclone
 +
  Event <|-- event_tibkat
 +
  Event <|-- event_dblp
 +
  Event <|-- event_crossref
 +
  Event <|-- event_wikidata
 +
}
 +
' BITPlan Corporate identity skin params
 +
' Copyright (c) 2015-2020 BITPlan GmbH
 +
' see http://wiki.bitplan.com/PlantUmlSkinParams#BITPlanCI
 +
' skinparams generated by com.bitplan.restmodelmanager
 +
skinparam note {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam component {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam package {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam usecase {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam activity {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam classAttribute {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam interface {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam class {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam object {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
hide Circle
 +
' end of skinparams '
 +
</uml>
 +
 
 +
=== EventSeries ===
 +
<uml format='svg'>
 +
title
 +
ConfIDent  EventSeries
 +
2021-08-21
 +
[[https://projects.tib.eu/en/confident/ © 2019-2021 ConfIDent project]]
 +
see also [[http://ptp.bitplan.com/settings Proceedings Title Parser]]
 +
 
 +
end title
 +
package DataSources {
 +
  class EventSeries << Entity >> {
 +
  source : TEXT
 +
  }
 +
Note top of eventseries_dblp
 +
[[https://dblp.org/ dblp computer science bibliography]]
 +
5256 instances
 +
End note
 +
  class eventseries_dblp << Entity >> {
 +
  acronym : TEXT
 +
  count : INTEGER
 +
  eventSeriesId : TEXT
 +
  maxYear : TEXT
 +
  minYear : TEXT
 +
  }
 +
Note top of eventseries_orclone
 +
[[https://confident.dbis.rwth-aachen.de/or OPENRESEARCH (orclone-api)]]
 +
1083 instances
 +
End note
 +
  class eventseries_orclone << Entity >> {
 +
  acronym : TEXT
 +
  core2018Rank : TEXT
 +
  creationDate : TIMESTAMP
 +
  dblpSeries : TEXT
 +
  homepage : TEXT
 +
  lastEditor : TEXT
 +
  modificationDate : TIMESTAMP
 +
  pageTitle : TEXT <<PK>>
 +
  period : INTEGER
 +
  title : TEXT
 +
  unit : TEXT
 +
  wikiCfpSeries : TEXT
 +
  wikidataId : TEXT
 +
  }
 +
Note top of eventseries_orclonebackup
 +
[[https://confident.dbis.rwth-aachen.de/or OPENRESEARCH (orclone-backup)]]
 +
1056 instances
 +
End note
 +
  class eventseries_orclonebackup << Entity >> {
 +
  acronym : TEXT
 +
  core2018Rank : TEXT
 +
  dblpSeries : TEXT
 +
  homepage : TEXT
 +
  logo : TEXT
 +
  pageTitle : TEXT <<PK>>
 +
  period : TEXT
 +
  title : TEXT
 +
  unit : TEXT
 +
  wikiCfpSeries : TEXT
 +
  wikidataId : TEXT
 +
  }
 +
Note top of eventseries_wikicfp
 +
[[http://www.wikicfp.com WikiCFP]]
 +
6019 instances
 +
End note
 +
  class eventseries_wikicfp << Entity >> {
 +
  acronym : TEXT
 +
  dblpSeriesId : TEXT
 +
  deleted : BOOLEAN
 +
  seriesId : TEXT
 +
  title : TEXT
 +
  url : TEXT
 +
  wikiCfpId : INTEGER
 +
  }
 +
Note top of eventseries_confref
 +
[[http://portal.confref.org ConfRef]]
 +
4857 instances
 +
End note
 +
  class eventseries_confref << Entity >> {
 +
  acronym : TEXT
 +
  count : INTEGER
 +
  eventSeriesId : TEXT
 +
  maxYear : INTEGER
 +
  minYear : INTEGER
 +
  title : TEXT
 +
  }
 +
Note top of eventseries_wikidata
 +
[[https://www.wikidata.org/wiki/Wikidata:Main_Page Wikidata]]
 +
4254 instances
 +
End note
 +
  class eventseries_wikidata << Entity >> {
 +
  ACM_pid : TEXT
 +
  DBLP_pid : TEXT
 +
  FreeBase_pid : TEXT
 +
  GND_pid : TEXT
 +
  Microsoft_Academic_pid : TEXT
 +
  Publons_pid : TEXT
 +
  WikiCFP_pid : TEXT
 +
  acronym : TEXT
 +
  eventSeriesId : TEXT
 +
  homepage : TEXT
 +
  title : TEXT
 +
  url : TEXT
 +
  }
 +
Note top of eventseries_crossref
 +
[[https://www.crossref.org/ CrossRef]]
 +
1 instances
 +
End note
 +
  class eventseries_crossref << Entity >> {
 +
  eventSeriesId : TEXT
 +
  }
 +
Note top of eventseries_or
 +
[[https://www.openresearch.org/mediawiki/ OPENRESEARCH (or-api)]]
 +
1058 instances
 +
End note
 +
  class eventseries_or << Entity >> {
 +
  acronym : TEXT
 +
  core2018Rank : TEXT
 +
  creationDate : TIMESTAMP
 +
  dblpSeries : TEXT
 +
  homepage : TEXT
 +
  lastEditor : TEXT
 +
  modificationDate : TIMESTAMP
 +
  pageTitle : TEXT <<PK>>
 +
  title : TEXT
 +
  wikidataId : TEXT
 +
  }
 +
Note top of eventseries_orbackup
 +
[[https://www.openresearch.org/mediawiki/ OPENRESEARCH (or-backup)]]
 +
1028 instances
 +
End note
 +
  class eventseries_orbackup << Entity >> {
 +
  acronym : TEXT
 +
  core2018Rank : TEXT
 +
  dblpSeries : TEXT
 +
  homepage : TEXT
 +
  logo : TEXT
 +
  pageTitle : TEXT <<PK>>
 +
  period : TEXT
 +
  title : TEXT
 +
  unit : TEXT
 +
  wikidataId : TEXT
 +
  }
 +
  EventSeries <|-- eventseries_dblp
 +
  EventSeries <|-- eventseries_orclone
 +
  EventSeries <|-- eventseries_orclonebackup
 +
  EventSeries <|-- eventseries_wikicfp
 +
  EventSeries <|-- eventseries_confref
 +
  EventSeries <|-- eventseries_wikidata
 +
  EventSeries <|-- eventseries_crossref
 +
  EventSeries <|-- eventseries_or
 +
  EventSeries <|-- eventseries_orbackup
 +
}
 +
 
 +
' BITPlan Corporate identity skin params
 +
' Copyright (c) 2015-2020 BITPlan GmbH
 +
' see http://wiki.bitplan.com/PlantUmlSkinParams#BITPlanCI
 +
' skinparams generated by com.bitplan.restmodelmanager
 +
skinparam note {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam component {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam package {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam usecase {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam activity {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam classAttribute {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam interface {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam class {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
skinparam object {
 +
  BackGroundColor #FFFFFF
 +
  FontSize 12
 +
  ArrowColor #FF8000
 +
  BorderColor #FF8000
 +
  FontColor black
 +
  FontName Technical
 +
}
 +
hide Circle
 +
' end of skinparams '
 +
</uml>
 +
= Updating the database =
 +
== Openresearch ==
 +
<source lang='bash'>
 +
scripts/getbackup
 +
</source>
 +
gets a copy of the nightly OpenResearch backups
 +
 
 +
= Issues =
 +
# {{Ticket
 +
|number=33
 +
|title=Event series completion
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-28 15:10:53+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=32
 +
|title=regression TemplateNotFound: fb4common/base.html
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-22 08:04:01+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=31
 +
|title=Provide RDF export of the data
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-21 15:04:52+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=30
 +
|title=add ordinal distribution query
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-14 14:27:00+00:00
 +
|closedAt=2022-01-14 14:27:38+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=29
 +
|title=add scholar RESTFul API
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-14 06:55:23+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=28
 +
|title=add generic search for scholarly items
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-14 06:54:39+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=27
 +
|title=openresearch results missing in multiquery
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-13 10:22:50+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=26
 +
|title=add bib file import
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-12 13:22:01+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=25
 +
|title=make multiquery result available via webapi with content negotiation
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-11 08:16:01+00:00
 +
|closedAt=2022-01-11 10:41:51+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=24
 +
|title=allow updating the database via webserver
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-10 08:38:57+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=23
 +
|title=dictOfLod Lookup result via commandline
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-06 15:34:46+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=22
 +
|title=add multi query option
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-03 17:03:25+00:00
 +
|closedAt=2022-01-11 08:14:31+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=21
 +
|title=add Webserver
 +
|project=ConferenceCorpus
 +
|createdAt=2022-01-01 11:10:30+00:00
 +
|closedAt=2022-01-03 16:58:11+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=20
 +
|title=Work around upstream Nominatim OSM Pythontools issue
 +
|project=ConferenceCorpus
 +
|createdAt=2021-12-13 06:11:42+00:00
 +
|closedAt=2021-12-15 13:19:46+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=19
 +
|title=Update Openresearch Samples
 +
|project=ConferenceCorpus
 +
|createdAt=2021-12-02 13:31:46+00:00
 +
|closedAt=2021-12-05 23:05:16+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=18
 +
|title=Update requirements.txt
 +
|project=ConferenceCorpus
 +
|createdAt=2021-11-09 22:41:47+00:00
 +
|closedAt=2021-12-12 16:43:29+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=17
 +
|title=include ACM digital library as a source
 +
|project=ConferenceCorpus
 +
|createdAt=2021-11-04 08:15:21+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=16
 +
|title=Steps towards csv upload
 +
|project=ConferenceCorpus
 +
|createdAt=2021-09-29 22:56:51+00:00
 +
|closedAt=2021-10-08 11:51:32+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=15
 +
|title=Filter obviously invalid Series and Event entries
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-10 12:12:27+00:00
 +
|closedAt=2021-08-10 12:15:42+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=14
 +
|title=wikiCFP 500 Internal Server and TimeOut Error Handling
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-07 05:52:42+00:00
 +
|closedAt=2021-08-07 06:53:26+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=12
 +
|title=Relevant FTX fields
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-04 12:18:12+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=11
 +
|title=Locality fixes
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-04 08:08:10+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=10
 +
|title=OpenResearch export option
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-04 07:51:24+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=9
 +
|title=offline access to EventCorpus.db
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-04 07:48:55+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=8
 +
|title=migrate confref data from Proceedings Title Parser here
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-04 07:17:09+00:00
 +
|closedAt=2021-08-04 07:17:13+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=7
 +
|title=migrate crossref data from proceedings title parser here
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-02 13:23:24+00:00
 +
|closedAt=2021-08-02 13:24:54+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=6
 +
|title=migrate dblp data source here from ptp and dblpconf
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-02 13:17:12+00:00
 +
|closedAt=2021-08-02 13:17:15+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=5
 +
|title=dblp xml parser skips some proceedings titles
 +
|project=ConferenceCorpus
 +
|createdAt=2021-08-01 04:16:18+00:00
 +
|closedAt=
 +
|state=open
 +
}}
 +
# {{Ticket
 +
|number=4
 +
|title=add commandline interface to CorpusLookup
 +
|project=ConferenceCorpus
 +
|createdAt=2021-07-31 18:51:20+00:00
 +
|closedAt=2021-08-01 04:06:46+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=3
 +
|title=add python api doc
 +
|project=ConferenceCorpus
 +
|createdAt=2021-07-31 06:04:06+00:00
 +
|closedAt=2021-07-31 06:50:18+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=2
 +
|title=Cache all SQL tables in the same SQLite database in a ".conferencecorpus" directory 
 +
|project=ConferenceCorpus
 +
|createdAt=2021-07-30 08:57:37+00:00
 +
|closedAt=2021-07-30 12:44:44+00:00
 +
|state=closed
 +
}}
 +
# {{Ticket
 +
|number=1
 +
|title=There should  be a common set of attributes for Event and EventSeries from different datasources
 +
|project=ConferenceCorpus
 +
|createdAt=2021-07-30 08:51:29+00:00
 +
|closedAt=
 +
|state=open
 
}}
 
}}

Latest revision as of 08:17, 18 November 2023

OsProject

OsProject
edit
id  ConferenceCorpus
state  active
owner  WolfgangFahl
title  Scientific Event Corpus
url  https://github.com/WolfgangFahl/ConferenceCorpus
version  0.1.0
description  
date  2022-11-20
since  2021-07-26
until  

What Links Here

Installation

via pip

pip install ConferenceCorpus
# alternatively if your pip is not a python3 pip
pip3 install ConferenceCorpus

upgrade

pip install ConferenceCorpus -U
# alternatively if your pip is not a python3 pip
pip3 install ConferenceCorpus -U

Usage

RESTFul API

Examples

Database View with Sqlite

The EventCorpus.db is in Sqlite format.

using sqlite-web

pip install sqlite-web
sqlite_web $HOME/.conferencecorpus/EventCorpus.db

There is convenience script ccsqliteweb available in the scripts directory which will also kill an existing sqlite_web EventCorpus.db process and run the server in background using nohup.

Command Line

aelookup -h
usage: aelookup [-h] [-d] [-e ENDPOINT] [-v] [-u] [-f]
                [--datasources DATASOURCES]

Scientific Event Corpus and Lookup

  Created by Wolfgang Fahl on 2020-06-22.
  Copyright 2020-2021 Wolfgang Fahl. All rights reserved.

  Licensed under the Apache License 2.0
  http://www.apache.org/licenses/LICENSE-2.0

  Distributed on an "AS IS" basis without warranties
  or conditions of any kind, either express or implied.

USAGE

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           show debug info
  -e ENDPOINT, --endpoint ENDPOINT
                        SPARQL endpoint to use for wikidata queries
  -v, --version         show program's version number and exit
  -u, --uml             output plantuml diagram markup
  -f, --force           force Update - may take quite a time
  --datasources DATASOURCES
                        , delimited list of datasource lookup ids

Overview

Datasources

You might want to open the diagrams in a new tab to be able to click the links depicted.

Event

EventSeries

Updating the database

Openresearch

scripts/getbackup

gets a copy of the nightly OpenResearch backups

Issues

  1. Issue 33 - Event series completion
  2. Issue 32 - regression TemplateNotFound: fb4common/base.html
  3. Issue 31 - Provide RDF export of the data
  4. Issue 30 - add ordinal distribution query
  5. Issue 29 - add scholar RESTFul API
  6. Issue 28 - add generic search for scholarly items
  7. Issue 27 - openresearch results missing in multiquery
  8. Issue 26 - add bib file import
  9. Issue 25 - make multiquery result available via webapi with content negotiation
  10. Issue 24 - allow updating the database via webserver
  11. Issue 23 - dictOfLod Lookup result via commandline
  12. Issue 22 - add multi query option
  13. Issue 21 - add Webserver
  14. Issue 20 - Work around upstream Nominatim OSM Pythontools issue
  15. Issue 19 - Update Openresearch Samples
  16. Issue 18 - Update requirements.txt
  17. Issue 17 - include ACM digital library as a source
  18. Issue 16 - Steps towards csv upload
  19. Issue 15 - Filter obviously invalid Series and Event entries
  20. Issue 14 - wikiCFP 500 Internal Server and TimeOut Error Handling
  21. Issue 12 - Relevant FTX fields
  22. Issue 11 - Locality fixes
  23. Issue 10 - OpenResearch export option
  24. Issue 9 - offline access to EventCorpus.db
  25. Issue 8 - migrate confref data from Proceedings Title Parser here
  26. Issue 7 - migrate crossref data from proceedings title parser here
  27. Issue 6 - migrate dblp data source here from ptp and dblpconf
  28. Issue 5 - dblp xml parser skips some proceedings titles
  29. Issue 4 - add commandline interface to CorpusLookup
  30. Issue 3 - add python api doc
  31. Issue 2 - Cache all SQL tables in the same SQLite database in a ".conferencecorpus" directory
  32. Issue 1 - There should be a common set of attributes for Event and EventSeries from different datasources