This first version of the scraper is a rudimentary implementation and offers basic functionality.
- Information about available locations can be scraped, i.e. regions, places, cities, or parishes, as well as virtual entities for which Matricula Online has digitized content (usually a parish with digitized parish registers or similar content). This data consists only of metadata about these locations (geographical information, URL, name, date range, notes); the URL points to the parish's main page, which lists the actual digitized sources (see below). The results can be filtered by various parameters, or all locations can be scraped. The scraped page is https://data.matricula-online.eu/en/suchen/.
- Information about all the digitized sources of a parish can be scraped too. An example of a parish's page is https://data.matricula-online.eu/de/deutschland/muenster/muenster-st-servatii/. This operation also scrapes metadata only (name of the source, type, date range, URL to the actual content, notes).
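
To make the two kinds of metadata described above more concrete, here is a minimal Python sketch of how such records could be modelled. The class names, field names, and the `is_under` helper are all hypothetical and only mirror the fields listed in this section; they are not the scraper's actual output schema or API.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple
from urllib.parse import urlsplit


@dataclass(frozen=True)
class LocationRecord:
    """Hypothetical record for one scraped location (e.g. a parish)."""
    name: str
    url: str  # link to the parish's main page
    date_range: Optional[str] = None
    coordinates: Optional[Tuple[float, float]] = None  # assumed (lat, lon)
    notes: Optional[str] = None


@dataclass(frozen=True)
class SourceRecord:
    """Hypothetical record for one digitized source listed on a parish page."""
    name: str
    source_type: str
    url: str  # link to the actual content (not scraped in this version)
    date_range: Optional[str] = None
    notes: Optional[str] = None


def path_segments(url: str) -> List[str]:
    """Split the path of a URL into its non-empty segments, ignoring the query."""
    return [segment for segment in urlsplit(url).path.split("/") if segment]


def is_under(child_url: str, parent_url: str) -> bool:
    """Return True if child_url's path extends parent_url's path."""
    child, parent = path_segments(child_url), path_segments(parent_url)
    return len(child) > len(parent) and child[: len(parent)] == parent


parish = LocationRecord(
    name="Münster, St. Servatii",
    url="https://data.matricula-online.eu/de/deutschland/muenster/muenster-st-servatii/",
)
source = SourceRecord(
    name="KB001_2",
    source_type="unknown",  # placeholder; the real type is not given here
    url="https://data.matricula-online.eu/de/deutschland/muenster/muenster-st-servatii/KB001_2/?pg=1",
)
belongs_to_parish = is_under(source.url, parish.url)
```

The helper only relies on the URL layout visible in the examples of this section, where a source's URL extends the URL of its parish page by one path segment.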
Note that this very first version is not feature-complete. Not all resources Matricula offers can be scraped with this version, e.g. the actual content (images of parish registers such as https://data.matricula-online.eu/de/deutschland/muenster/muenster-st-servatii/KB001_2/?pg=1).
:warning: This is a semver version < 1.0.0. Bugs and breaking changes are to be expected. Please report any issues.