ConferenceCorpus/statistics

From BITPlan Wiki

Ordinal histograms

confref

Too few ordinals are available for analysis.

Crossref

Ordinalhistogramm event crossref.png Zipf event crossref.png

dblp

Ordinalhistogramm event dblp.png Zipf event dblp.png

GND

Ordinalhistogramm event gnd.png Zipf event gnd.png

OpenResearch

Ordinalhistogramm event orclone.png Zipf event orclone.png

TIBKAT

Ordinalhistogramm event tibkat.png Zipf event tibkat.png

WikiCFP

Ordinalhistogramm event wikicfp.png Zipf event wikicfp.png

Wikidata

Ordinalhistogramm event wikidata.png Zipf event wikidata.png

Eventseries completeness

dblp

sql query

SELECT
   series,
   min(ordinal) AS minOrdinal,
   max(ordinal) AS maxOrdinal,
   avg(ordinal) AS avgOrdinal,
   max(ordinal) - min(ordinal) AS available,
   (max(ordinal) - min(ordinal)) / (max(ordinal) - 1.0) AS completeness
FROM event_dblp
WHERE ordinal IS NOT NULL
GROUP BY series
ORDER BY 6 DESC
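
To see what the completeness column measures, the query can be run against a small in-memory table. This is a minimal sketch, not the setup used for the figures: the use of SQLite, the helper name `series_completeness`, the sample rows and the series names `A` and `B` are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical sample rows (series, ordinal); not taken from the real event_dblp table.
rows = [
    ("A", 1), ("A", 2), ("A", 3), ("A", 4),   # series starts at ordinal 1
    ("B", 3), ("B", 4), ("B", 5),             # first two editions are missing
]

QUERY = """
SELECT
   series,
   min(ordinal) AS minOrdinal,
   max(ordinal) AS maxOrdinal,
   avg(ordinal) AS avgOrdinal,
   max(ordinal) - min(ordinal) AS available,
   (max(ordinal) - min(ordinal)) / (max(ordinal) - 1.0) AS completeness
FROM event_dblp
WHERE ordinal IS NOT NULL
GROUP BY series
ORDER BY 6 DESC
"""

def series_completeness(rows):
    """Run the completeness query on an in-memory copy of the sample rows."""
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE event_dblp (series TEXT, ordinal INTEGER)")
    con.executemany("INSERT INTO event_dblp VALUES (?, ?)", rows)
    result = con.execute(QUERY).fetchall()
    con.close()
    return result

result = series_completeness(rows)
for row in result:
    print(row)
# ('A', 1, 4, 2.5, 3, 1.0)
# ('B', 3, 5, 4.0, 2, 0.5)
```

Series A starts at ordinal 1 and reaches completeness 1.0. Series B starts at ordinal 3, so its span of 2 is divided by 4 and yields 0.5. The formula compares the span between the lowest and highest ordinal with the span from 1 to the highest ordinal, so gaps inside that span do not lower the value.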

histogram

Dblp series completeness.png
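
A histogram like this can be derived by counting the completeness values per bin. The following is only a sketch under assumptions: the function name `completeness_histogram`, the choice of ten equal-width bins and the sample values are hypothetical and do not reflect how the figure was actually produced.

```python
from collections import Counter

def completeness_histogram(values, bins=10):
    """Count completeness values in [0, 1] per equal-width bin."""
    counts = Counter()
    for value in values:
        # Skip missing values; SQLite, for instance, yields NULL when
        # max(ordinal) is 1 and the divisor becomes zero.
        if value is None:
            continue
        # Clamp so that a value of exactly 1.0 lands in the last bin.
        index = min(int(value * bins), bins - 1)
        counts[index] += 1
    return [counts.get(i, 0) for i in range(bins)]

# Hypothetical completeness values, not results from the corpus.
sample = [1.0, 1.0, 0.5, 0.95, 0.0, None]
histogram = completeness_histogram(sample, bins=10)
print(histogram)
```

Each list position corresponds to one bin, so the last entry counts the series with a completeness of 0.9 or higher.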

openresearch

sql query

SELECT
   inEventSeries,
   min(ordinal) AS minOrdinal,
   max(ordinal) AS maxOrdinal,
   avg(ordinal) AS avgOrdinal,
   max(ordinal) - min(ordinal) AS available,
   (max(ordinal) - min(ordinal)) / (max(ordinal) - 1.0) AS completeness
FROM event_orclone
WHERE ordinal IS NOT NULL
GROUP BY inEventSeries
ORDER BY 6 DESC

histogram

Orclone series completeness.png

tibkat

sql query

For tibkat, a direct SQL query was not possible because series information is not available in that source. To get an indication, we therefore filtered the events by series acronym using Python code. This identified 3136 series, and the histogram was created from the resulting data.
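
The Python code used for this step is not shown on this page. As a rough sketch of the idea, events can be grouped by a series acronym and the same completeness formula applied per group. Everything specific here is an assumption: the record keys `acronym` and `ordinal`, the helper names, the heuristic of dropping a trailing year to obtain the series acronym, and the sample records.

```python
import re
from collections import defaultdict

def series_key(acronym):
    """Derive a series acronym by dropping a trailing year, e.g. 'ABC 2001' -> 'ABC'.

    This heuristic is an assumption for illustration only.
    """
    return re.sub(r"[\s'-]*\d{2,4}$", "", acronym.strip()).upper()

def completeness_by_acronym(events):
    """Group events by derived series acronym and apply the completeness formula."""
    ordinals = defaultdict(list)
    for event in events:
        if event.get("acronym") and event.get("ordinal") is not None:
            ordinals[series_key(event["acronym"])].append(event["ordinal"])
    result = {}
    for key, values in ordinals.items():
        lo, hi = min(values), max(values)
        # Same formula as in the SQL queries; undefined when the highest ordinal is 1.
        result[key] = (hi - lo) / (hi - 1.0) if hi != 1 else None
    return result

# Hypothetical records, not taken from tibkat.
events = [
    {"acronym": "ABC 2001", "ordinal": 1},
    {"acronym": "ABC 2002", "ordinal": 2},
    {"acronym": "ABC 2003", "ordinal": None},   # skipped: no ordinal
    {"acronym": "ABC 2004", "ordinal": 4},
    {"acronym": "xyz'19", "ordinal": 3},
    {"acronym": "XYZ'21", "ordinal": 5},
]
completeness = completeness_by_acronym(events)
print(completeness)
```

Grouping by acronym can only give an indication, since unrelated events may share an acronym and a single series may appear under several spellings.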

histogram

Tibkat series completeness.png

wikicfp

sql query

SELECT
   seriesId,
   min(ordinal) AS minOrdinal,
   max(ordinal) AS maxOrdinal,
   avg(ordinal) AS avgOrdinal,
   max(ordinal) - min(ordinal) AS available,
   (max(ordinal) - min(ordinal)) / (max(ordinal) - 1.0) AS completeness
FROM event_wikicfp
WHERE ordinal IS NOT NULL
GROUP BY seriesId
ORDER BY 6 DESC

histogram

Wikicfp series completeness.png

wikidata

sql query

SELECT
   eventInSeriesId,
   min(ordinal) AS minOrdinal,
   max(ordinal) AS maxOrdinal,
   avg(ordinal) AS avgOrdinal,
   max(ordinal) - min(ordinal) AS available,
   (max(ordinal) - min(ordinal)) / (max(ordinal) - 1.0) AS completeness
FROM event_wikidata
WHERE ordinal IS NOT NULL
GROUP BY eventInSeriesId
ORDER BY 6 DESC

histogram

Wikidata series completeness.png