Open PROIEL-style treebank for the entire Greek language
NKUA · Athens Digital Glossa Chronos Research Network
June 30, 2026
An open PROIEL-style dependency treebank covering Greek from Homer to Modern Greek, with verse-level New Testament cross-alignment to Latin, Gothic, and Old Church Slavonic.
20-minute LREC-COLING system demo · Athens-PROIEL
Open https://athdgc.github.io in the browser.
Select the Classical period tab and open a Plato sample (tlg0059.tlg022).
Observe:
… localStorage)
Open the public Neo4j endpoint (https://athdgc.github.io/graph).
Run:
Returns Greek aorist active verbs whose Vulgate counterpart is passive.
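As an offline illustration of the same filter, the Python sketch below applies the condition "Greek aorist active, Latin passive" to a list of aligned verb pairs. It assumes PROIEL-style 10-character morphology tags in the field order person, number, tense, mood, voice, gender, case, degree, strength, inflection, with `a` for aorist tense and `a`/`p` for active/passive voice. The `AlignedVerb` record, its field names, and the toy rows are hypothetical and are not the graph schema behind the endpoint.

```python
from dataclasses import dataclass

# Assumed offsets in a 10-character PROIEL-style morphology tag:
# person, number, tense, mood, voice, gender, case, degree, strength, inflection.
TENSE, VOICE = 2, 4


@dataclass
class AlignedVerb:
    """Hypothetical record for one Greek verb aligned to its Latin counterpart."""
    ref: str
    greek_morph: str
    latin_morph: str


def is_voice_mismatch(pair: AlignedVerb) -> bool:
    """True when the Greek verb is aorist active and the Latin verb is passive."""
    g, l = pair.greek_morph, pair.latin_morph
    if len(g) <= VOICE or len(l) <= VOICE:
        return False  # malformed or truncated tag: skip rather than guess
    return g[TENSE] == "a" and g[VOICE] == "a" and l[VOICE] == "p"


def voice_mismatches(pairs: list[AlignedVerb]) -> list[AlignedVerb]:
    """Keep only the aligned pairs that satisfy the mismatch condition."""
    return [p for p in pairs if is_voice_mismatch(p)]


# Toy rows for illustration only; they are not taken from the corpus.
SAMPLE = [
    AlignedVerb("toy-1", "3saia-----i", "3spip-----i"),  # aorist active vs passive
    AlignedVerb("toy-2", "3saia-----i", "3sria-----i"),  # aorist active vs active
    AlignedVerb("toy-3", "3spia-----i", "3spip-----i"),  # present active vs passive
]

if __name__ == "__main__":
    for pair in voice_mismatches(SAMPLE):
        print(pair.ref)
```

Only the first toy row meets both conditions, so it is the only one printed.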
In the showcase, click "Mark for review" on a problematic sample.
The flag is stored in localStorage and persists across reloads. Export the current review state via the console:
The exported JSON is then passed to the corpus-side script fix_corpus_data.py, which applies bulk corrections on ARIS.
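A minimal sketch of how a corpus-side script might read such an export before applying corrections. The JSON layout used here (a mapping from sample ID to an object with a `flagged` boolean and an optional `note`) is an assumption made for illustration; the actual export format and the internals of fix_corpus_data.py are not shown in this document.

```python
import json


def flagged_sample_ids(exported_json: str) -> list[str]:
    """Return the sorted IDs of samples marked for review.

    Assumes a hypothetical layout: {"<sample id>": {"flagged": bool, "note": str}}.
    Entries that are not objects, or that lack a truthy "flagged", are ignored.
    """
    state = json.loads(exported_json)
    return sorted(
        sample_id
        for sample_id, entry in state.items()
        if isinstance(entry, dict) and entry.get("flagged")
    )


# Hypothetical export, for illustration only.
EXAMPLE_EXPORT = json.dumps({
    "sample-002": {"flagged": True, "note": "wrong author"},
    "sample-001": {"flagged": True},
    "sample-003": {"flagged": False},
})

if __name__ == "__main__":
    print(flagged_sample_ids(EXAMPLE_EXPORT))
```

Sorting the IDs keeps the correction run deterministic regardless of the order in which samples were flagged in the browser.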
| Layer | Implementation |
|---|---|
| Discovery | 43_discover_sources.py - daily cron on ARIS |
| Filtering | 44_filter_candidates.py - Greek-script + apparatus filters |
| OCR / Conversion | 45_ocr_candidates.sbatch + 46_to_proiel_xml.py |
| Annotation | 47_annotate_inbox.sbatch - Stanza on A100 GPU |
| Merge | 48_merge_xml_to_corpus.py - JSONL partitioning |
| Corpus fix | fix_corpus_data.py - TLG author override + Stanza error fixes |
| Showcase build | 51_build_showcase_site.py - regenerates HTML from JSONL |
| Deploy | deploy_athdgc_v2.ps1 - scp + git push |
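The Greek-script check in the Filtering layer can be illustrated with a short sketch. This is not the code of 44_filter_candidates.py: the function names and the 0.8 threshold are assumptions, and the apparatus filters are left out. The idea is to measure what share of a candidate text's letters belong to the Greek script and keep the text only when that share is high enough.

```python
import unicodedata


def greek_ratio(text: str) -> float:
    """Share of alphabetic characters whose Unicode name marks them as Greek."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0  # no letters at all (digits, punctuation, empty input)
    greek = sum(1 for ch in letters if "GREEK" in unicodedata.name(ch, ""))
    return greek / len(letters)


def is_mostly_greek(text: str, threshold: float = 0.8) -> bool:
    """Accept a candidate when at least `threshold` of its letters are Greek.

    The default threshold is an illustrative assumption, not a project setting.
    """
    return greek_ratio(text) >= threshold


if __name__ == "__main__":
    for candidate in ("μῆνιν ἄειδε θεά", "the word λόγος", "12345"):
        print(candidate, is_mostly_greek(candidate))
```

Matching on the Unicode character name covers both the basic Greek block and the polytonic Greek Extended block, so accented and breathing-marked forms count as Greek without a hand-maintained range table.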
Lavidas, N., Nikiforidou, K., Haug, D., Kulikov, L., Geka, V., Symeonidis, V., Michalareas, T., Chionidi, S., Tsiropina, A., Plakoutsi, E., Argyropoulos, E., and the Athens Digital Glossa Chronos Research Network (2026). AthDGC: Athens Diachronic Glossa Chronos. Zenodo.
Funded by HFRI (Project No. 20577) · Greece 2.0 NRRP · Compute: GRNET ARIS (pa260305)
Funded by the Hellenic Foundation for Research and Innovation (HFRI) under the 3rd Call for HFRI Research Projects to support Post-Doctoral Researchers, Project No. 20577; with complementary support from the Greece 2.0 National Recovery and Resilience Plan. Compute supplied by GRNET ARIS (Greek national HPC), allocation pa260305.