ConferenceCorpus: Difference between revisions

From BITPlan Wiki
 
(15 intermediate revisions by the same user not shown)
Line 3: Line 3:
{{OsProject
{{OsProject
|id=ConferenceCorpus
|id=ConferenceCorpus
|state=active
|owner=WolfgangFahl
|owner=WolfgangFahl
|title=Scientific Event Corpus
|title=Scientific Event Corpus
|url=https://github.com/WolfgangFahl/ConferenceCorpus
|url=https://github.com/WolfgangFahl/ConferenceCorpus
|version=0.0.10
|version=0.1.0
|date=2021-08-03
|date=2022-11-20
|since=2021-07-26
|storemode=property
|storemode=property
|name=ConferenceCorpus
|name=ConferenceCorpus
}}
}}
 
= What Links Here =
{{WhatLinksHere}}
= Installation =
= Installation =
== via pip ==
== via pip ==
Line 26: Line 29:
</source>
</source>
= Usage =
= Usage =
== RESTful API ==
=== Examples ===
* https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=json
* https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=html
* https://conferencecorpus.bitplan.com/eventseries/ISWC?format=json
* https://conferencecorpus.bitplan.com/eventseries/ISWC?format=html
== Database View with SQLite ==
EventCorpus.db is an SQLite database.
=== Using sqlite-web ===
<source lang='bash'>
pip install sqlite-web
sqlite_web $HOME/.conferencecorpus/EventCorpus.db
</source>
There is a convenience script [https://github.com/WolfgangFahl/ConferenceCorpus/tree/main/scripts/ccsqliteweb ccsqliteweb] available in the [https://github.com/WolfgangFahl/ConferenceCorpus/tree/main/scripts scripts] directory, which also kills any existing sqlite_web EventCorpus.db process and runs the server in the background using nohup.
== Command Line ==
== Command Line ==
<source lang='bash'>
<source lang='bash'>
Line 64: Line 82:
title
title
ConfIDent  Event
ConfIDent  Event
2021-08-03
2022-05-11
[[https://projects.tib.eu/en/confident/ © 2019-2021 ConfIDent project]]
[[https://projects.tib.eu/en/confident/ © 2019-2022 ConfIDent project and Wolfgang Fahl]]
see also [[http://ptp.bitplan.com/settings Proceedings Title Parser]]
see also [[http://cc.bitplan.com Conference Corpus]]


end title
end title
Line 72: Line 90:
   class Event << Entity >> {
   class Event << Entity >> {
   acronym : TEXT  
   acronym : TEXT  
  city : TEXT
  country : TEXT
   eventId : TEXT  
   eventId : TEXT  
  lookupAcronym : TEXT
  ordinal : INTEGER
  region : TEXT
   source : TEXT  
   source : TEXT  
   title : TEXT  
   title : TEXT  
  url : TEXT
   year : INTEGER  
   year : INTEGER  
   }
   }
Note top of confref_Event
   class event_confref << Entity >> {
[[http://portal.confref.org ConfRef]]
   area : TEXT
37945 instances
  cityWikidataid : TEXT
End note
  countryIso : TEXT
   class confref_Event << Entity >> {
  countryWikidataid : TEXT  
   city : TEXT  
   dblpSeriesId : TEXT  
   country : TEXT  
   endDate : TEXT  
   endDate : TEXT  
   keywords : TEXT  
   keywords : TEXT  
  location : TEXT
   ranks : TEXT  
   ranks : TEXT  
  regionIso : TEXT
  regionWikidataid : TEXT
  seriesId : TEXT
  seriesTitle : TEXT
   startDate : TEXT  
   startDate : TEXT  
   submissionExtended : BOOLEAN
   submissionExtended : INTEGER
  url : TEXT
   }
   }
Note top of crossref_Event
   class event_gnd << Entity >> {
[[https://www.crossref.org/ CrossRef]]
   acronymCount : INTEGER
49280 instances
  acronyms : TEXT  
End note
  cityWikidataid : TEXT
   class crossref_Event << Entity >> {
  countryIso : TEXT
   doi : TEXT  
  countryWikidataid : TEXT
  date : TEXT
  dateCount : INTEGER
   endDate : DATE  
   endDate : DATE  
  event : TEXT
  fulltitle : TEXT
  homepage : TEXT
   location : TEXT  
   location : TEXT  
   month : INTEGER  
   organization : TEXT
   name : TEXT  
  place : TEXT
   number : TEXT  
  placeCount : INTEGER  
   sponsor : TEXT  
   places : TEXT  
   regionIso : TEXT  
   regionWikidataid : TEXT  
   startDate : DATE  
   startDate : DATE  
   theme : TEXT  
   variant : TEXT
  variantCount : INTEGER
  variants : TEXT  
   }
   }
Note top of dblp_Event
   class event_wikicfp << Entity >> {
[[https://dblp.org/ dblp computer science bibliography]]
   Final_Version_Due : TEXT  
47891 instances
   Notification_Due : TIMESTAMP
End note
   Submission_Deadline : TIMESTAMP
   class dblp_Event << Entity >> {
   cityWikidataid : TEXT  
   booktitle : TEXT  
   countryIso : TEXT  
   doi : TEXT
   countryWikidataid : TEXT  
   ee : TEXT
   deleted : INTEGER
   isbn : TEXT  
   mdate : TEXT  
   publicationSeries : TEXT  
   series : TEXT
  }
Note top of wikidata_Event
[[https://www.wikidata.org/wiki/Wikidata:Main_Page Wikidata]]
7443 instances
End note
  class wikidata_Event << Entity >> {
  country : TEXT
  countryId : TEXT
  dblpConferenceId : TEXT
   endDate : TIMESTAMP  
   endDate : TIMESTAMP  
  eventInSeries : TEXT
  eventInSeriesId : TEXT
  gndId : TEXT
  homepage : TEXT
  language : TEXT
  mainSubject : TEXT
  ordinal : TEXT
  startDate : TIMESTAMP
  wikiCfpId : TEXT
  }
Note top of wikicfp_Event
[[http://www.wikicfp.com WikiCFP]]
86307 instances
End note
  class wikicfp_Event << Entity >> {
  Final_Version_Due : DATE
  Notification_Due : DATE
  Submission_Deadline : DATE
  deleted : INTEGER
  endDate : DATE
   eventType : TEXT  
   eventType : TEXT  
   locality : TEXT  
   locality : TEXT  
   lookupAcronym : TEXT  
   regionIso : TEXT
  regionWikidataid : TEXT  
   series : TEXT  
   series : TEXT  
   seriesId : TEXT  
   seriesId : TEXT  
   startDate : DATE
   startDate : TIMESTAMP
   wikiCFPId : INTEGER  
   url : TEXT
  wikiCfpId : INTEGER  
   }
   }
Note top of orwiki_Event
   class event_orclone << Entity >> {
[[https://www.openresearch.org/wiki/Main_Page OPENRESEARCH-wiki]]
   DblpConferenceId : TEXT  
9231 instances
   ISBN : TEXT  
End note
   TibKatId : TEXT  
   class orwiki_Event << Entity >> {
   acceptedPapers : TEXT  
   city : TEXT  
   country : TEXT  
  endDate : TEXT
  eventType : TEXT
  homepage : TEXT
  inEventSeries : TEXT
  ordinal : TEXT
  pageTitle : TEXT <<PK>>
  region : TEXT
  startDate : TEXT
  submittedPapers : TEXT
  yearStr : TEXT
  }
Note top of orcapi_Event
[[https://confident.dbis.rwth-aachen.de/or/index.php?title=Main_Page OPENRESEARCH-clone-api]]
9452 instances
End note
  class orcapi_Event << Entity >> {
   acceptedPapers : INTEGER  
   acceptedPapers : INTEGER  
  city : TEXT
  country : TEXT
   creationDate : TIMESTAMP  
   creationDate : TIMESTAMP  
   endDate : TIMESTAMP  
   endDate : TIMESTAMP  
Line 190: Line 173:
   lastEditor : TEXT  
   lastEditor : TEXT  
   modificationDate : TIMESTAMP  
   modificationDate : TIMESTAMP  
  ordinal : INTEGER
   pageTitle : TEXT <<PK>>
   pageTitle : TEXT <<PK>>
  region : TEXT
   startDate : TIMESTAMP  
   startDate : TIMESTAMP  
   submittedPapers : INTEGER  
   submittedPapers : INTEGER  
  url : TEXT
  wikidataId : TEXT
   yearStr : TEXT  
   yearStr : TEXT  
   }
   }
Note top of orcwiki_Event
  class event_tibkat << Entity >> {
[[https://confident.dbis.rwth-aachen.de/or/index.php?title=Main_Page OPENRESEARCH-clone-wiki]]
  alternativeTitles : TEXT
9325 instances
  authorGndId : TEXT
End note
  bk : TEXT
   class orcwiki_Event << Entity >> {
  changeDate : TEXT
   acceptedPapers : TEXT  
  cityWikidataid : TEXT
   city : TEXT  
  corporateCreatorNames : TEXT
   country : TEXT  
  corporateCreatorTypes : TEXT
   endDate : TEXT  
  countryIso : TEXT
   eventType : TEXT  
  countryWikidataid : TEXT
   homepage : TEXT  
  databaseDate : TEXT
   inEventSeries : TEXT  
  dates : TEXT
   ordinal : TEXT  
  ddc : TEXT
   pageTitle : TEXT <<PK>>
  description : TEXT
   presence : TEXT  
  documentGenreCode : TEXT
   region : TEXT  
  documentId : TEXT
   startDate : TEXT
  documentTypeCode : TEXT
   submittedPapers : TEXT  
  doi : TEXT
   yearStr : TEXT  
  ean : TEXT
  endDate : DATE
  event : TEXT
  firstid : TEXT
  ftxCreationDate : TEXT
  gndIds : TEXT
  isbn : TEXT
  isbn13 : TEXT
  journalTitle : TEXT
  journalVolumeNumber : TEXT
  location : TEXT
  ppn : TEXT
  publisher : TEXT
  pubplace : TEXT
  pubyear : TEXT
  regionIso : TEXT
  regionWikidataid : TEXT
  sponsorGndId : TEXT
  startDate : DATE
  }
   class event_dblp << Entity >> {
   booktitle : TEXT
  cityWikidataid : TEXT
  countryIso : TEXT
  countryWikidataid : TEXT  
   doi : TEXT  
   ee : TEXT  
   endDate : TIMESTAMP
  isbn : TEXT
  location : TEXT  
   mdate : TEXT  
   publicationSeries : TEXT  
   regionIso : TEXT  
   regionWikidataid : TEXT  
   series : TEXT
  startDate : TIMESTAMP
  url : TEXT  
  }
  class event_crossref << Entity >> {
  cityWikidataid : TEXT
  countryIso : TEXT
  countryWikidataid : TEXT
  doi : TEXT
  endDate : DATE
  location : TEXT
  month : INTEGER
  name : TEXT
  number : TEXT
   regionIso : TEXT  
   regionWikidataid : TEXT
  sponsor : TEXT  
   startDate : DATE
   theme : TEXT  
   url : TEXT  
   }
   }
Note top of orapi_Event
   class event_wikidata << Entity >> {
[[https://www.openresearch.org/wiki/Main_Page OPENRESEARCH-api]]
   cityWikidataid : TEXT
9454 instances
   countryId : TEXT  
End note
   countryIso : TEXT  
   class orapi_Event << Entity >> {
   countryWikidataid : TEXT
   acceptedPapers : INTEGER
  dblpId : TEXT
   city : TEXT  
  describedAtUrl : TEXT
   country : TEXT  
  doi : TEXT
   creationDate : TIMESTAMP
   endDate : TIMESTAMP  
   endDate : TIMESTAMP  
   eventType : TEXT  
   eventInSeries : TEXT
  eventInSeriesId : TEXT
  eventTitle : TEXT
  followedById : TEXT
  gndId : TEXT  
   homepage : TEXT  
   homepage : TEXT  
   inEventSeries : TEXT  
   language : TEXT  
   lastEditor : TEXT  
   location : TEXT  
   modificationDate : TIMESTAMP
   locationId : TEXT
   ordinal : INTEGER
  mainSubject : TEXT
   pageTitle : TEXT <<PK>>
  ppn : TEXT
   region : TEXT  
   proceedings : TEXT
  proceedingsLabel : TEXT
   regionIso : TEXT  
   regionWikidataid : TEXT  
   startDate : TIMESTAMP  
   startDate : TIMESTAMP  
   submittedPapers : INTEGER
   url : TEXT
   yearStr : TEXT  
   wikiCfpId : TEXT  
   }
   }
   Event <|-- confref_Event
   Event <|-- event_confref
   Event <|-- crossref_Event
   Event <|-- event_gnd
   Event <|-- dblp_Event
   Event <|-- event_wikicfp
   Event <|-- wikidata_Event
   Event <|-- event_orclone
   Event <|-- wikicfp_Event
   Event <|-- event_tibkat
   Event <|-- orwiki_Event
   Event <|-- event_dblp
   Event <|-- orcapi_Event
   Event <|-- event_crossref
   Event <|-- orcwiki_Event
   Event <|-- event_wikidata
  Event <|-- orapi_Event
}
}
' BITPlan Corporate identity skin params
' BITPlan Corporate identity skin params
' Copyright (c) 2015-2020 BITPlan GmbH
' Copyright (c) 2015-2020 BITPlan GmbH
Line 334: Line 374:
title
title
ConfIDent  EventSeries
ConfIDent  EventSeries
2021-08-10
2021-08-21
[[https://projects.tib.eu/en/confident/ © 2019-2021 ConfIDent project]]
[[https://projects.tib.eu/en/confident/ © 2019-2021 ConfIDent project]]
see also [[http://ptp.bitplan.com/settings Proceedings Title Parser]]
see also [[http://ptp.bitplan.com/settings Proceedings Title Parser]]
Line 343: Line 383:
   source : TEXT  
   source : TEXT  
   }
   }
Note top of eventseries_orbackup
Note top of eventseries_dblp
[[https://www.openresearch.org/mediawiki/ OPENRESEARCH (or-backup)]]
[[https://dblp.org/ dblp computer science bibliography]]
1028 instances  
5256 instances  
End note
End note
   class eventseries_orbackup << Entity >> {
   class eventseries_dblp << Entity >> {
   acronym : TEXT  
   acronym : TEXT  
   core2018Rank : TEXT
   count : INTEGER
  dblpSeries : TEXT
   eventSeriesId : TEXT  
  homepage : TEXT
   maxYear : TEXT  
  logo : TEXT
   minYear : TEXT  
  pageTitle : TEXT <<PK>>
  period : TEXT
   title : TEXT  
   unit : TEXT  
   wikidataId : TEXT  
   }
   }
Note top of eventseries_or
Note top of eventseries_orclone
[[https://www.openresearch.org/mediawiki/ OPENRESEARCH (or-api)]]
[[https://confident.dbis.rwth-aachen.de/or OPENRESEARCH (orclone-api)]]
1058 instances  
1083 instances  
End note
End note
   class eventseries_or << Entity >> {
   class eventseries_orclone << Entity >> {
   acronym : TEXT  
   acronym : TEXT  
   core2018Rank : TEXT  
   core2018Rank : TEXT  
Line 372: Line 407:
   modificationDate : TIMESTAMP  
   modificationDate : TIMESTAMP  
   pageTitle : TEXT <<PK>>
   pageTitle : TEXT <<PK>>
  period : INTEGER
   title : TEXT  
   title : TEXT  
  unit : TEXT
  wikiCfpSeries : TEXT
   wikidataId : TEXT  
   wikidataId : TEXT  
   }
   }
Line 387: Line 425:
   pageTitle : TEXT <<PK>>
   pageTitle : TEXT <<PK>>
   period : TEXT  
   period : TEXT  
  title : TEXT
  unit : TEXT
  wikiCfpSeries : TEXT
  wikidataId : TEXT
  }
Note top of eventseries_orclone
[[https://confident.dbis.rwth-aachen.de/or OPENRESEARCH (orclone-api)]]
1083 instances
End note
  class eventseries_orclone << Entity >> {
  acronym : TEXT
  core2018Rank : TEXT
  creationDate : TIMESTAMP
  dblpSeries : TEXT
  homepage : TEXT
  lastEditor : TEXT
  modificationDate : TIMESTAMP
  pageTitle : TEXT <<PK>>
  period : INTEGER
   title : TEXT  
   title : TEXT  
   unit : TEXT  
   unit : TEXT  
Line 413: Line 432:
Note top of eventseries_wikicfp
Note top of eventseries_wikicfp
[[http://www.wikicfp.com WikiCFP]]
[[http://www.wikicfp.com WikiCFP]]
5000 instances  
6019 instances  
End note
End note
   class eventseries_wikicfp << Entity >> {
   class eventseries_wikicfp << Entity >> {
Line 424: Line 443:
   wikiCfpId : INTEGER  
   wikiCfpId : INTEGER  
   }
   }
Note top of eventseries_dblp
Note top of eventseries_confref
[[https://dblp.org/ dblp computer science bibliography]]
[[http://portal.confref.org ConfRef]]
5256 instances  
4857 instances  
End note
End note
   class eventseries_dblp << Entity >> {
   class eventseries_confref << Entity >> {
   acronym : TEXT  
   acronym : TEXT  
   count : INTEGER  
   count : INTEGER  
   eventSeriesId : TEXT  
   eventSeriesId : TEXT  
   maxYear : TEXT
   maxYear : INTEGER
   minYear : TEXT  
   minYear : INTEGER
  title : TEXT  
   }
   }
Note top of eventseries_wikidata
Note top of eventseries_wikidata
Line 449: Line 469:
   acronym : TEXT  
   acronym : TEXT  
   eventSeriesId : TEXT  
   eventSeriesId : TEXT  
  homepage : TEXT
   title : TEXT  
   title : TEXT  
  }
   url : TEXT  
Note top of eventseries_confref
[[http://portal.confref.org ConfRef]]
4857 instances
End note
  class eventseries_confref << Entity >> {
   acronym : TEXT
  count : INTEGER
  eventSeriesId : TEXT
  maxYear : INTEGER
  minYear : INTEGER
  title : TEXT  
   }
   }
Note top of eventseries_crossref
Note top of eventseries_crossref
Line 470: Line 480:
   eventSeriesId : TEXT  
   eventSeriesId : TEXT  
   }
   }
   EventSeries <|-- eventseries_orbackup
Note top of eventseries_or
   EventSeries <|-- eventseries_or
[[https://www.openresearch.org/mediawiki/ OPENRESEARCH (or-api)]]
1058 instances
End note
  class eventseries_or << Entity >> {
  acronym : TEXT
  core2018Rank : TEXT
  creationDate : TIMESTAMP
  dblpSeries : TEXT
  homepage : TEXT
  lastEditor : TEXT
  modificationDate : TIMESTAMP
  pageTitle : TEXT <<PK>>
  title : TEXT
  wikidataId : TEXT
  }
Note top of eventseries_orbackup
[[https://www.openresearch.org/mediawiki/ OPENRESEARCH (or-backup)]]
1028 instances
End note
  class eventseries_orbackup << Entity >> {
  acronym : TEXT
  core2018Rank : TEXT
  dblpSeries : TEXT
  homepage : TEXT
  logo : TEXT
  pageTitle : TEXT <<PK>>
  period : TEXT
  title : TEXT
  unit : TEXT
  wikidataId : TEXT
  }
   EventSeries <|-- eventseries_dblp
   EventSeries <|-- eventseries_orclone
   EventSeries <|-- eventseries_orclonebackup
   EventSeries <|-- eventseries_orclonebackup
  EventSeries <|-- eventseries_orclone
   EventSeries <|-- eventseries_wikicfp
   EventSeries <|-- eventseries_wikicfp
   EventSeries <|-- eventseries_dblp
   EventSeries <|-- eventseries_confref
   EventSeries <|-- eventseries_wikidata
   EventSeries <|-- eventseries_wikidata
  EventSeries <|-- eventseries_confref
   EventSeries <|-- eventseries_crossref
   EventSeries <|-- eventseries_crossref
  EventSeries <|-- eventseries_or
  EventSeries <|-- eventseries_orbackup
}
}


Line 558: Line 600:
}
}
hide Circle
hide Circle
' end of skinparams '
' end of skinparams '
' end of skinparams '
</uml>
</uml>
= Updating the database =
== OpenResearch ==
<source lang='bash'>
scripts/getbackup
</source>
This gets a copy of the nightly OpenResearch backups.
= Issues =
# {{Ticket
|number=33
|title=Event series completion
|project=ConferenceCorpus
|createdAt=2022-01-28 15:10:53+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=32
|title=regression TemplateNotFound: fb4common/base.html
|project=ConferenceCorpus
|createdAt=2022-01-22 08:04:01+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=31
|title=Provide RDF export of the data
|project=ConferenceCorpus
|createdAt=2022-01-21 15:04:52+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=30
|title=add ordinal distribution query
|project=ConferenceCorpus
|createdAt=2022-01-14 14:27:00+00:00
|closedAt=2022-01-14 14:27:38+00:00
|state=closed
}}
# {{Ticket
|number=29
|title=add scholar RESTFul API
|project=ConferenceCorpus
|createdAt=2022-01-14 06:55:23+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=28
|title=add generic search for scholarly items
|project=ConferenceCorpus
|createdAt=2022-01-14 06:54:39+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=27
|title=openresearch results missing in multiquery
|project=ConferenceCorpus
|createdAt=2022-01-13 10:22:50+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=26
|title=add bib file import
|project=ConferenceCorpus
|createdAt=2022-01-12 13:22:01+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=25
|title=make multiquery result available via webapi with content negotiation
|project=ConferenceCorpus
|createdAt=2022-01-11 08:16:01+00:00
|closedAt=2022-01-11 10:41:51+00:00
|state=closed
}}
# {{Ticket
|number=24
|title=allow updating the database via webserver
|project=ConferenceCorpus
|createdAt=2022-01-10 08:38:57+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=23
|title=dictOfLod Lookup result via commandline
|project=ConferenceCorpus
|createdAt=2022-01-06 15:34:46+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=22
|title=add multi query option
|project=ConferenceCorpus
|createdAt=2022-01-03 17:03:25+00:00
|closedAt=2022-01-11 08:14:31+00:00
|state=closed
}}
# {{Ticket
|number=21
|title=add Webserver
|project=ConferenceCorpus
|createdAt=2022-01-01 11:10:30+00:00
|closedAt=2022-01-03 16:58:11+00:00
|state=closed
}}
# {{Ticket
|number=20
|title=Work around upstream Nominatim OSM Pythontools issue
|project=ConferenceCorpus
|createdAt=2021-12-13 06:11:42+00:00
|closedAt=2021-12-15 13:19:46+00:00
|state=closed
}}
# {{Ticket
|number=19
|title=Update Openresearch Samples
|project=ConferenceCorpus
|createdAt=2021-12-02 13:31:46+00:00
|closedAt=2021-12-05 23:05:16+00:00
|state=closed
}}
# {{Ticket
|number=18
|title=Update requirements.txt
|project=ConferenceCorpus
|createdAt=2021-11-09 22:41:47+00:00
|closedAt=2021-12-12 16:43:29+00:00
|state=closed
}}
# {{Ticket
|number=17
|title=include ACM digital library as a source
|project=ConferenceCorpus
|createdAt=2021-11-04 08:15:21+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=16
|title=Steps towards csv upload
|project=ConferenceCorpus
|createdAt=2021-09-29 22:56:51+00:00
|closedAt=2021-10-08 11:51:32+00:00
|state=closed
}}
# {{Ticket
|number=15
|title=Filter obviously invalid Series and Event entries
|project=ConferenceCorpus
|createdAt=2021-08-10 12:12:27+00:00
|closedAt=2021-08-10 12:15:42+00:00
|state=closed
}}
# {{Ticket
|number=14
|title=wikiCFP 500 Internal Server and TimeOut Error Handling
|project=ConferenceCorpus
|createdAt=2021-08-07 05:52:42+00:00
|closedAt=2021-08-07 06:53:26+00:00
|state=closed
}}
# {{Ticket
|number=12
|title=Relevant FTX fields
|project=ConferenceCorpus
|createdAt=2021-08-04 12:18:12+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=11
|title=Locality fixes
|project=ConferenceCorpus
|createdAt=2021-08-04 08:08:10+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=10
|title=OpenResearch export option
|project=ConferenceCorpus
|createdAt=2021-08-04 07:51:24+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=9
|title=offline access to EventCorpus.db
|project=ConferenceCorpus
|createdAt=2021-08-04 07:48:55+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=8
|title=migrate confref data from Proceedings Title Parser here
|project=ConferenceCorpus
|createdAt=2021-08-04 07:17:09+00:00
|closedAt=2021-08-04 07:17:13+00:00
|state=closed
}}
# {{Ticket
|number=7
|title=migrate crossref data from proceedings title parser here
|project=ConferenceCorpus
|createdAt=2021-08-02 13:23:24+00:00
|closedAt=2021-08-02 13:24:54+00:00
|state=closed
}}
# {{Ticket
|number=6
|title=migrate dblp data source here from ptp and dblpconf
|project=ConferenceCorpus
|createdAt=2021-08-02 13:17:12+00:00
|closedAt=2021-08-02 13:17:15+00:00
|state=closed
}}
# {{Ticket
|number=5
|title=dblp xml parser skips some proceedings titles
|project=ConferenceCorpus
|createdAt=2021-08-01 04:16:18+00:00
|closedAt=
|state=open
}}
# {{Ticket
|number=4
|title=add commandline interface to CorpusLookup
|project=ConferenceCorpus
|createdAt=2021-07-31 18:51:20+00:00
|closedAt=2021-08-01 04:06:46+00:00
|state=closed
}}
# {{Ticket
|number=3
|title=add python api doc
|project=ConferenceCorpus
|createdAt=2021-07-31 06:04:06+00:00
|closedAt=2021-07-31 06:50:18+00:00
|state=closed
}}
# {{Ticket
|number=2
|title=Cache all SQL tables in the same SQLite database in a ".conferencecorpus" directory 
|project=ConferenceCorpus
|createdAt=2021-07-30 08:57:37+00:00
|closedAt=2021-07-30 12:44:44+00:00
|state=closed
}}
# {{Ticket
|number=1
|title=There should  be a common set of attributes for Event and EventSeries from different datasources
|project=ConferenceCorpus
|createdAt=2021-07-30 08:51:29+00:00
|closedAt=
|state=open
}}

Latest revision as of 07:17, 18 November 2023

OsProject

id  ConferenceCorpus
state  active
owner  WolfgangFahl
title  Scientific Event Corpus
url  https://github.com/WolfgangFahl/ConferenceCorpus
version  0.1.0
description  
date  2022-11-20
since  2021-07-26
until  

What Links Here

Installation

via pip

pip install ConferenceCorpus
# alternatively if your pip is not a python3 pip
pip3 install ConferenceCorpus

upgrade

pip install ConferenceCorpus -U
# alternatively if your pip is not a python3 pip
pip3 install ConferenceCorpus -U
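
To confirm which version pip actually installed, the standard library can be asked directly. The following is a minimal sketch, not part of ConferenceCorpus itself; the helper name `installed_version` is an assumption introduced for illustration.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str = "ConferenceCorpus") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper: it only reads the package metadata that pip wrote.
    """
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    print(version if version else "ConferenceCorpus is not installed")
```

If the printed version is older than expected, running the upgrade command above and repeating the check is a quick way to see whether the upgrade took effect.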

Usage

RESTful API

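Examples

  * https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=json
  * https://conferencecorpus.bitplan.com/eventseries/WEBIST?format=html
  * https://conferencecorpus.bitplan.com/eventseries/ISWC?format=json
  * https://conferencecorpus.bitplan.com/eventseries/ISWC?format=html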
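
The event series endpoint follows the pattern https://conferencecorpus.bitplan.com/eventseries/ACRONYM?format=json (or format=html). A minimal Python sketch for calling it could look like the following. The function names are assumptions made for illustration, and the structure of the JSON response is not documented on this page, so the result is returned as decoded without further interpretation.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import urlopen

BASE_URL = "https://conferencecorpus.bitplan.com"


def event_series_url(acronym: str, fmt: str = "json") -> str:
    """Build the event series URL for an acronym such as WEBIST or ISWC."""
    query = urlencode({"format": fmt})
    return f"{BASE_URL}/eventseries/{quote(acronym, safe='')}?{query}"


def fetch_event_series(acronym: str, timeout: float = 30.0):
    """Fetch the JSON variant and decode it (hypothetical helper).

    The shape of the decoded data is an assumption left to the caller.
    """
    with urlopen(event_series_url(acronym, "json"), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_event_series("ISWC")` needs network access to the server; `event_series_url` alone works offline and is handy for building links.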

Database View with SQLite

EventCorpus.db is an SQLite database.

Using sqlite-web

pip install sqlite-web
sqlite_web $HOME/.conferencecorpus/EventCorpus.db

There is a convenience script, ccsqliteweb, available in the scripts directory, which also kills any existing sqlite_web EventCorpus.db process and runs the server in the background using nohup.
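
Because the file is a plain SQLite database, it can also be inspected without a web server. The sketch below uses Python's built-in sqlite3 module to list the tables. It assumes the default location $HOME/.conferencecorpus/EventCorpus.db shown above, and the helper name `list_tables` is hypothetical.

```python
import sqlite3
from pathlib import Path
from typing import List

# Default location as used in the sqlite_web call above
DEFAULT_DB = Path.home() / ".conferencecorpus" / "EventCorpus.db"


def list_tables(db_path=DEFAULT_DB) -> List[str]:
    """Return the names of all tables in the database, sorted by name."""
    # Open read-only so that a mistyped path does not create an empty file.
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    connection = sqlite3.connect(uri, uri=True)
    try:
        rows = connection.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        connection.close()
    return [name for (name,) in rows]
```

Opening the file in read-only mode is a deliberate choice here: the database is a cache that takes a long time to rebuild, so an inspection script should not be able to modify it.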

Command Line

aelookup -h
usage: aelookup [-h] [-d] [-e ENDPOINT] [-v] [-u] [-f]
                [--datasources DATASOURCES]

Scientific Event Corpus and Lookup

  Created by Wolfgang Fahl on 2020-06-22.
  Copyright 2020-2021 Wolfgang Fahl. All rights reserved.

  Licensed under the Apache License 2.0
  http://www.apache.org/licenses/LICENSE-2.0

  Distributed on an "AS IS" basis without warranties
  or conditions of any kind, either express or implied.

USAGE

optional arguments:
  -h, --help            show this help message and exit
  -d, --debug           show debug info
  -e ENDPOINT, --endpoint ENDPOINT
                        SPARQL endpoint to use for wikidata queries
  -v, --version         show program's version number and exit
  -u, --uml             output plantuml diagram markup
  -f, --force           force Update - may take quite a time
  --datasources DATASOURCES
                        , delimited list of datasource lookup ids
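
The options above can be combined from a script. The following sketch builds an aelookup command line from the documented flags and runs it with subprocess. The wrapper functions are hypothetical, and the datasource ids in the usage note are assumptions, since the valid lookup ids are not listed in the help text.

```python
import subprocess
from typing import List, Optional, Sequence


def aelookup_command(
    datasources: Optional[Sequence[str]] = None,
    uml: bool = False,
    force: bool = False,
    debug: bool = False,
) -> List[str]:
    """Build the argument list for aelookup from the flags in the help text."""
    command = ["aelookup"]
    if debug:
        command.append("--debug")
    if uml:
        command.append("--uml")
    if force:
        command.append("--force")
    if datasources:
        # The help text asks for a comma-delimited list of lookup ids.
        command += ["--datasources", ",".join(datasources)]
    return command


def run_aelookup(**options) -> str:
    """Run aelookup and return its standard output (requires the tool on PATH)."""
    completed = subprocess.run(
        aelookup_command(**options), capture_output=True, text=True, check=True
    )
    return completed.stdout
```

For example, `run_aelookup(uml=True, datasources=["dblp", "wikidata"])` would request the PlantUML markup for two datasources, assuming "dblp" and "wikidata" are valid lookup ids.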

Overview

Datasources

You might want to open the diagrams in a new tab to be able to click the links depicted.
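
The instance counts shown in the diagram notes can be checked against a local copy of the database. The sketch below counts the rows of every table whose name starts with a given prefix. It assumes that each diagram class such as event_dblp or eventseries_wikicfp corresponds to a table of the same name, which this page does not state explicitly; the function name `count_instances` is hypothetical.

```python
import sqlite3
from typing import Dict


def count_instances(connection: sqlite3.Connection, prefix: str = "event_") -> Dict[str, int]:
    """Count the rows of every table whose name starts with the prefix."""
    names = [
        name
        for (name,) in connection.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        )
        # Filter in Python: "_" is a wildcard in SQL LIKE patterns.
        if name.startswith(prefix)
    ]
    counts = {}
    for name in names:
        # Quote the identifier, since table names cannot be bound as parameters.
        quoted = '"' + name.replace('"', '""') + '"'
        (count,) = connection.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()
        counts[name] = count
    return counts
```

With a connection to EventCorpus.db, `count_instances(connection)` covers the event tables and `count_instances(connection, "eventseries_")` the event series tables.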

Event

EventSeries

Updating the database

OpenResearch

scripts/getbackup

This gets a copy of the nightly OpenResearch backups.

Issues

  1. Issue 33 - Event series completion
  2. Issue 32 - regression TemplateNotFound: fb4common/base.html
  3. Issue 31 - Provide RDF export of the data
  4. Issue 30 - add ordinal distribution query
  5. Issue 29 - add scholar RESTFul API
  6. Issue 28 - add generic search for scholarly items
  7. Issue 27 - openresearch results missing in multiquery
  8. Issue 26 - add bib file import
  9. Issue 25 - make multiquery result available via webapi with content negotiation
  10. Issue 24 - allow updating the database via webserver
  11. Issue 23 - dictOfLod Lookup result via commandline
  12. Issue 22 - add multi query option
  13. Issue 21 - add Webserver
  14. Issue 20 - Work around upstream Nominatim OSM Pythontools issue
  15. Issue 19 - Update Openresearch Samples
  16. Issue 18 - Update requirements.txt
  17. Issue 17 - include ACM digital library as a source
  18. Issue 16 - Steps towards csv upload
  19. Issue 15 - Filter obviously invalid Series and Event entries
  20. Issue 14 - wikiCFP 500 Internal Server and TimeOut Error Handling
  21. Issue 12 - Relevant FTX fields
  22. Issue 11 - Locality fixes
  23. Issue 10 - OpenResearch export option
  24. Issue 9 - offline access to EventCorpus.db
  25. Issue 8 - migrate confref data from Proceedings Title Parser here
  26. Issue 7 - migrate crossref data from proceedings title parser here
  27. Issue 6 - migrate dblp data source here from ptp and dblpconf
  28. Issue 5 - dblp xml parser skips some proceedings titles
  29. Issue 4 - add commandline interface to CorpusLookup
  30. Issue 3 - add python api doc
  31. Issue 2 - Cache all SQL tables in the same SQLite database in a ".conferencecorpus" directory
  32. Issue 1 - There should be a common set of attributes for Event and EventSeries from different datasources