This version adds new functionality for assessing and filtering results based on genome assembly metrics.
N50 and L50 values are now stored and can be used in isolate queries. The stats are automatically updated by a database trigger whenever, and however, contigs are added to the sequence bin.
%GC, number of Ns, and gaps in an assembly can be calculated and stored by an external script after assembly uploads. These can be used along with N50, total length and number of contigs in isolate queries.
The results of arbitrary analyses can now be stored in the isolate database as JSON values. These can be displayed within an isolate record using a templating system. rMLST species checks have been implemented as an exemplar.
Checks are now performed on submissions prior to upload to the database. Basic assembly stats are available to submitters and curators with automated warnings shown based on customisable thresholds. Hard limits can also be set so that submissions can be rejected automatically.
We can also run the rMLST species check on submitted genomes so that the results are available to curators shortly after submission and prior to handling.
Batch uploading data is now performed in a separate process, with progress notifications loaded by AJAX polling. This prevents browser timeout when hundreds of genomes are uploaded together.