packages.debian.org extraction improvements
Mentor: GerfriedFuchs (Webmaster Team)
Summary: Rewrite the extraction parts so that they can receive push triggers
Required skills:
- perl
Description: The packages.debian.org script set currently contains a daily script that goes over the locally available regular Debian pool and extracts the changelog and copyright information of the packages. The proposed rewrite would remove the dependency on a local pool and split the work into the following tasks, each of which would need its own error handling, caching, fallbacks, locking and notification mails:
- Receive the update trigger from a specific pool (like the regular Debian pool, but in the future also from security.debian.org, volatile or backports)
- Calculate the difference against the formerly extracted data to know what needs to be pulled
- Pull the difference
- Extract the changelog and copyright information from the pulled files
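
The difference step above could look roughly like the following sketch. It is written in Python only for illustration (the actual script set is Perl), and everything in it is an assumption rather than part of the existing code: the function name `compute_difference`, the idea of keying the state by (package, version) pairs, and the example pool paths are all hypothetical.

```python
def compute_difference(extracted, available):
    """Compare formerly extracted data with what a pool currently offers.

    extracted: set of (package, version) pairs already extracted (assumed state format)
    available: dict mapping (package, version) -> path inside the pool (assumed index format)

    Returns (to_pull, to_drop):
      to_pull: dict of entries that are in the pool but not yet extracted
      to_drop: sorted list of extracted entries that the pool no longer offers
    """
    to_pull = {key: path for key, path in available.items() if key not in extracted}
    to_drop = sorted(extracted - set(available))
    return to_pull, to_drop


if __name__ == "__main__":
    # Hypothetical example data; the paths are made up for illustration.
    extracted = {("hello", "2.9-1"), ("zlib", "1.2.3-1")}
    available = {
        ("hello", "2.10-2"): "pool/main/h/hello/hello_2.10-2.dsc",
        ("zlib", "1.2.3-1"): "pool/main/z/zlib/zlib_1.2.3-1.dsc",
    }
    to_pull, to_drop = compute_difference(extracted, available)
    print("pull:", to_pull)
    print("drop:", to_drop)
```

Only the entries in `to_pull` would then need to be fetched and extracted, while `to_drop` lists previously extracted data that could be cleaned up. Error handling, caching, locking and notification mails are deliberately left out of this sketch.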
