CEURART document verification

2024-10-27 · 1 min read
blog computer-science

1 Key issues

  • Frequent errors are:
    • not using Libertinus fonts;
    • using an old CEURART template;
    • not using the correct copyright phrase;
    • not having selectable text in the PDF file, which prevents indexing by Google Scholar and similar services.

2 Utilities

2.1 check-pdf-errors

  • Source: https://ceur-ws.org/check-pdf-errors
  • Checks PDF files.
  • Checks for the presence of the phrase ‘Creative Commons’ in each PDF file.
    • This confirms that the text is selectable (that is, the document is not just an image).
  • Checks that Libertinus fonts are used.
  • Checks for duplicate PDF files.
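
To make the duplicate check concrete, here is a minimal Python sketch. It is not the code of check-pdf-errors, whose implementation is not described here; it assumes that "duplicate" means byte-identical files, and the function name `find_duplicate_pdfs` is hypothetical. The font and phrase checks are not covered, since they require a PDF parser.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def find_duplicate_pdfs(directory):
    """Return groups of PDF file names in `directory` that have identical content.

    Assumption: two files are duplicates only if they are byte-identical.
    """
    by_digest = defaultdict(list)
    for path in sorted(Path(directory).glob("*.pdf")):
        # Hash the whole file; identical content gives an identical digest.
        digest = hashlib.sha256(path.read_bytes()).hexdigest()
        by_digest[digest].append(path.name)
    # Keep only digests shared by more than one file.
    return [names for names in by_digest.values() if len(names) > 1]
```

Calling `find_duplicate_pdfs(".")` in a volume directory returns a list of groups, each group listing the file names that share the same content. PDFs that look the same but were generated separately would not be detected by this approach.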

2.2 check-index-errors

  • Source: https://ceur-ws.org/check-index-errors
  • Checks the index.html file.
  • Reports two kinds of mismatch:
    • paper PDFs that are in the directory but not listed in index.html;
    • papers that are linked in index.html but not included in the directory.
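
The two mismatches above can be illustrated with a short Python sketch that uses only the standard library. It is not the check-index-errors script itself; it assumes that papers are linked from index.html through plain `<a href="name.pdf">` links to files in the same directory, and the names `compare_index_with_directory` and `_PdfLinkCollector` are hypothetical.

```python
from html.parser import HTMLParser
from pathlib import Path


class _PdfLinkCollector(HTMLParser):
    """Collect href values of <a> tags that point to local PDF files."""

    def __init__(self):
        super().__init__()
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            # Assumption: papers sit next to index.html, so links with
            # a path or a URL scheme are ignored.
            if href.lower().endswith(".pdf") and "/" not in href:
                self.links.add(href)


def compare_index_with_directory(directory):
    """Return (PDFs on disk but not linked, PDFs linked but not on disk)."""
    directory = Path(directory)
    collector = _PdfLinkCollector()
    collector.feed((directory / "index.html").read_text(encoding="utf-8"))
    on_disk = {p.name for p in directory.glob("*.pdf")}
    unlisted = sorted(on_disk - collector.links)
    missing = sorted(collector.links - on_disk)
    return unlisted, missing
```

Both returned lists should be empty for a consistent submission directory.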

2.3 ceur-add-pagenum

  • Repo: https://github.com/amato-gianluca/ceur-add-pagenum
  • A small Python script for counting the number of pages in the PDF files linked to an index.html document (prepared for submission to CEUR-WS), and updating the CEURPAGES fields.
  • The script does not alter the PDF files, just the index.html.
  • The script depends on the lxml and PyPDF2 packages.
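
As an illustration of the kind of value that goes into a CEURPAGES field, the following Python sketch turns per-paper page counts into page ranges. It is not taken from ceur-add-pagenum: the function name `page_ranges` is hypothetical, and it assumes that papers are numbered consecutively in the order they appear in index.html, starting from page 1. The page counts themselves are taken as given here; with PyPDF2 they could be obtained as `len(PdfReader(path).pages)`.

```python
def page_ranges(page_counts, first_page=1):
    """Turn per-paper page counts into consecutive 'start-end' ranges.

    Assumption: papers are numbered one after another without gaps.
    """
    ranges = []
    start = first_page
    for count in page_counts:
        if count < 1:
            raise ValueError("each paper must have at least one page")
        end = start + count - 1
        ranges.append(f"{start}-{end}")
        start = end + 1  # the next paper begins on the following page
    return ranges


print(page_ranges([12, 8, 15]))  # ['1-12', '13-20', '21-35']
```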

Authors
Dmitry Sergeevich Kulyabov
Professor, Department of Probability Theory and Cyber Security
I am a professor at the Department of Probability Theory and Cyber Security of the Patrice Lumumba Peoples' Friendship University of Russia (RUDN University). My research interests lie in theoretical physics and mathematical modeling.