| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| v1.2.0-alpha | 2026-01-25 | ||
| v1.1-alpha | 2026-01-05 | ||
| v1.0alpha | 2025-12-21 | ||
| README.md | 2026-01-25 | 5.3 kB | |
| Totals: 4 Items | | 5.3 kB | 13 |
CERCA: Citation Extraction & Reference Checking Assistant
CERCA is an open-source research tool that supports verification of bibliographic references in scientific manuscripts. It extracts references from PDF files and checks their existence and consistency against authoritative metadata sources, producing explainable diagnostics, audit logs, and reproducible reports.

Key Features
- Flexible Reference Input:
  - Drag-and-Drop: Parse references automatically from PDF files.
  - Manual Entry: Paste reference lists directly for quick checks.
- Reference verification using Crossref, OpenAlex and Zenodo metadata
- Match scores based on title, authors, and DOI similarity
- Interactive Dashboard:
  - View real-time Pass/Fail statistics and verification rates.
  - Color-coded status badges for quick visual assessment.
- Export Data: Save verification reports for further analysis.
- CSV export for analysis
- Diagnosis report (TXT)
- Audit log for transparency and reproducibility
- Right-click search (Google / Google Scholar) for manual inspection
- Local privacy by design: PDFs never leave your machine
How to Run
Windows
- Download Cerca_windows.zip.
- Unzip the file.
- Double-click Cerca-1.0-alpha.jar.
If Windows shows a security warning, choose More info, then Run anyway.
macOS
- Download Cerca_mac.zip.
- Unzip it.
- Right-click Cerca-1.0-alpha.jar and select Open.
- Note: Since this is an unverified alpha app, you may need to go to System Settings > Privacy & Security to allow it to run.
Linux
- Download Cerca_linux.zip.
- Unzip it.
- Open a terminal in that folder and run:
```bash
java -jar Cerca-1.0-alpha.jar
```
Requirements
CERCA is a Java desktop application built with JavaFX.
To run CERCA, you need:
- Java 17 or newer
- A Java Runtime Environment that includes JavaFX
- I recommend installing Azul Zulu JRE with JavaFX
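If the application does not start, a small stand-alone check can show whether the installed runtime meets both requirements. The sketch below is not part of CERCA; the class name `RuntimeCheck` is hypothetical, and it assumes JavaFX is visible to the default class loader, which is typically the case for a JRE that bundles JavaFX.

```java
// Hypothetical pre-flight check; not shipped with CERCA.
final class RuntimeCheck {

    /** True when the running JVM reports feature version 17 or newer. */
    static boolean hasRequiredJava() {
        return Runtime.version().feature() >= 17;
    }

    /** True when the JavaFX Application class can be located by the class loader. */
    static boolean hasJavaFx() {
        try {
            // Look the class up without initializing it.
            Class.forName("javafx.application.Application", false,
                    RuntimeCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("Java version: " + Runtime.version());
        System.out.println("Java 17+:     " + hasRequiredJava());
        System.out.println("JavaFX found: " + hasJavaFx());
    }
}
```

Save it as RuntimeCheck.java and run `java RuntimeCheck.java` with the same Java installation you use to launch CERCA.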
Privacy & Local Processing
CERCA is designed with researcher privacy in mind.
- All PDF parsing and reference extraction are performed locally
- Manuscripts are never uploaded, stored, or shared
- CERCA performs metadata-only lookups (e.g., DOI, title, authors)
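To make "metadata-only" concrete, here is a minimal sketch of what such a lookup request might look like against the public Crossref REST API, using its `query.bibliographic` and `rows` parameters. Whether CERCA uses this exact endpoint or these parameters is an assumption; the class and method names are hypothetical. The point is that only the reference text appears in the request, never the PDF.

```java
import java.net.URI;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper; CERCA's real request code may differ.
final class CrossrefLookup {

    /** Builds a Crossref works query from the text of a single reference. */
    static URI buildQuery(String referenceText, int rows) {
        // Only this encoded string is sent to the service.
        String q = URLEncoder.encode(referenceText, StandardCharsets.UTF_8);
        return URI.create("https://api.crossref.org/works?query.bibliographic="
                + q + "&rows=" + rows);
    }

    public static void main(String[] args) {
        // Prints the request URL; no network call is made here.
        System.out.println(buildQuery(
                "CERMINE: automatic extraction of structured metadata from scientific literature", 1));
    }
}
```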
How It Works
- A PDF file is parsed locally to extract bibliographic references
- Each reference is queried against:
  - Crossref
  - Zenodo
  - OpenAlex
- Metadata fields (title, authors, DOI) are compared
- CERCA assigns:
  - A match score
  - A status (PASS / CHECK / FAIL)
  - A short diagnostic explanation
- Results can be saved as:
  - TXT report (diagnosis)
  - CSV table
  - Audit logs
- Logs are saved for transparency and reproducibility
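The comparison step can be illustrated with a simple heuristic. The sketch below is not CERCA's actual scoring code: it assumes token-level Jaccard similarity for titles and author surnames, a case-insensitive exact match for DOIs, and hypothetical weights of 0.5, 0.3, and 0.2.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.stream.Collectors;

// Illustrative heuristic only; weights and similarity measure are assumptions.
final class MatchScore {

    /** Lower-cases the text and splits it into letter/digit tokens. */
    static Set<String> tokens(String text) {
        return Arrays.stream(text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+"))
                .filter(t -> !t.isEmpty())
                .collect(Collectors.toSet());
    }

    /** Jaccard similarity: shared tokens divided by all distinct tokens. */
    static double jaccard(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 0.0; // nothing to compare counts as no evidence of a match
        }
        Set<String> shared = new HashSet<>(a);
        shared.retainAll(b);
        Set<String> all = new HashSet<>(a);
        all.addAll(b);
        return (double) shared.size() / all.size();
    }

    /** Weighted score in [0, 1] combining title, author, and DOI agreement. */
    static double score(String title, List<String> surnames, String doi,
                        String candTitle, List<String> candSurnames, String candDoi) {
        double titleSim = jaccard(tokens(title), tokens(candTitle));
        double authorSim = jaccard(tokens(String.join(" ", surnames)),
                tokens(String.join(" ", candSurnames)));
        double doiSim = (doi != null && doi.equalsIgnoreCase(candDoi)) ? 1.0 : 0.0;
        return 0.5 * titleSim + 0.3 * authorSim + 0.2 * doiSim;
    }

    public static void main(String[] args) {
        double s = score(
                "CERMINE: automatic extraction of structured metadata from scientific literature",
                List.of("Tkaczyk", "Szostek", "Fedoryszak", "Dendek", "Bolikowski"),
                "10.1007/s10032-015-0249-8",
                "CERMINE: automatic extraction of structured metadata from scientific literature",
                List.of("Tkaczyk", "Szostek", "Fedoryszak", "Dendek", "Bolikowski"),
                "10.1007/s10032-015-0249-8");
        System.out.println("score = " + s);
    }
}
```

A set-based measure like this ignores word order and tolerates punctuation differences, but it is sensitive to typos within a word; an edit-distance measure would be a reasonable alternative.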
Status Definitions
- PASS: Strong metadata agreement with authoritative sources
- CHECK: Partial or ambiguous match; manual inspection recommended
- FAIL: No reliable metadata match found at time of verification
CERCA is an experimental tool. It does not replace manual verification.
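One way to read the three statuses is as bands over a match score between 0 and 1. The sketch below assumes that reading; the cut-off values 0.85 and 0.50 are hypothetical and are not taken from CERCA.

```java
// Illustrative mapping from a score to a status; thresholds are assumptions.
final class StatusRule {

    enum Status { PASS, CHECK, FAIL }

    static final double PASS_AT = 0.85;   // hypothetical: strong agreement
    static final double CHECK_AT = 0.50;  // hypothetical: partial or ambiguous match

    /** Returns PASS at or above PASS_AT, CHECK at or above CHECK_AT, otherwise FAIL. */
    static Status classify(double score) {
        if (score >= PASS_AT) {
            return Status.PASS;
        }
        if (score >= CHECK_AT) {
            return Status.CHECK;
        }
        return Status.FAIL; // also reached for NaN, since both comparisons are false
    }

    public static void main(String[] args) {
        for (double s : new double[] {0.95, 0.60, 0.10}) {
            System.out.println(s + " -> " + classify(s));
        }
    }
}
```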
Outputs
CERCA generates the following artifacts:
- TXT report: Summary and per-reference diagnostics
- CSV file: Structured results for analysis or editorial review
- Audit log: Timestamped record of verification steps
These outputs support reproducibility, transparency, and review documentation.
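As a rough illustration of the CSV and audit-log artifacts, the sketch below formats one CSV row with RFC 4180 style quoting and one timestamped log line. The column layout and the log line format are assumptions, not CERCA's actual file formats.

```java
import java.time.Instant;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical formatting helpers; CERCA's real output layout may differ.
final class ReportWriter {

    /** Quotes a field when it contains a comma, a double quote, or a line break. */
    static String csvField(String value) {
        if (value == null) {
            return "";
        }
        boolean needsQuotes = value.contains(",") || value.contains("\"")
                || value.contains("\n") || value.contains("\r");
        // Embedded double quotes are doubled, as RFC 4180 describes.
        String escaped = value.replace("\"", "\"\"");
        return needsQuotes ? "\"" + escaped + "\"" : escaped;
    }

    /** Joins already-ordered fields into a single CSV line. */
    static String csvRow(List<String> fields) {
        return fields.stream().map(ReportWriter::csvField).collect(Collectors.joining(","));
    }

    /** Prefixes an event description with an ISO-8601 UTC timestamp. */
    static String auditLine(Instant when, String event) {
        return when + " " + event;
    }

    public static void main(String[] args) {
        System.out.println(csvRow(List.of("1", "Tkaczyk, D. et al.", "PASS")));
        System.out.println(auditLine(Instant.now(), "lookup finished for reference 1"));
    }
}
```

Quoting matters here because reference strings routinely contain commas, which would otherwise shift columns when the file is opened in a spreadsheet.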
Intended Use
CERCA is intended for:
- Researchers performing final manuscript checks
- Reviewers assessing reference consistency
- Editors supporting editorial quality control
- Meta-research and reproducibility workflows
Limitations
- Verification depends on availability and correctness of external metadata
- Some valid references (e.g., books, technical reports, older works) may not be indexed
- Match scores are heuristic and intended to support human analysis
License
This project is licensed under the
GNU Affero General Public License, Version 3.0 (AGPL-3.0).
See the LICENSE file for details.
Third-Party Credits
This software uses the CERMINE library, licensed under GNU AGPL v3.
CERMINE
Copyright © Centre for Open Science
Dominika Tkaczyk, Paweł Szostek, Mateusz Fedoryszak,
Piotr Jan Dendek, Łukasz Bolikowski
CERMINE: automatic extraction of structured metadata from scientific literature.
International Journal on Document Analysis and Recognition, 2015,
Vol. 18, No. 4, pp. 317-335, DOI: 10.1007/s10032-015-0249-8
Citation
If you use CERCA in your research, please cite it as research software.
Author
Lidiany Cerqueira, PhD
Computer Science Researcher
Acknowledgments
CERCA was developed to support rigorous, transparent, and responsible research practices.