Changelog
[Unreleased]
Added
Added
plasmidfinder(v2.1.6,https://depot.galaxyproject.org/singularity/plasmidfinder:2.1.6--py314hdfd78af_2) as a screening process for plasmid detection on assemblies; runs by default for all bacterial workflows except Mycobacterium tuberculosis (#281)Added
download_plasmidfinder_dbMakefile target andplasmidfinder_dbparameterAdded
--plasmidfinder,--plasmidfinder-genome-hits,--plasmidfinder-plasmid-seqs, and--software-info(plasmidfinder meta) inputs tocreate_yaml; theHit_in_genome_seq.fsaandPlasmid_seqs.fsapaths are now emitted in the analysis YAMLAdded optional
trimmomatic(v0.40,https://depot.galaxyproject.org/singularity/trimmomatic:0.40--hdfd78af_0) preprocessing module for Illumina adapter/quality trimming; off by default (use_trimmomatic = false) (#280)Added
use_trimmomaticandtrimmomatic_argsparametersAdded
shigatyper(v2.0.5,https://depot.galaxyproject.org/singularity/shigatyper:2.0.5--pyhdfd78af_0) as the Shigella discrimination tool on the E. coli workflow, replacing ShigaPass; uses raw reads, no separate DB (refs are bundled in the container) (#505)Added
--shigatyperinput tocreate_yaml
Fixed
Pipeline now fails loudly when any input sample does not produce a result JSON (previously the run could finish successfully with missing results) (#466)
Changed
Bumped
clinicalgenomicslund/jasentoolcontainer from1.0.0to1.1.0to add--plasmidfinderand--shigatypersupport increate-yaml
Removed
Removed
shigapassmodule,--shigapasscreate_yamlinput,shigapass_dbparameter, and thedownload_shigapass/update_shigapass_dbMakefile targets — superseded byshigatyper
[1.3.0]
Added
Added
free-disk-spaceto CI GA workflowAdded
jasentoolcontainer (clinicalgenomicslund/jasentool:1.0.0) for read counting, QC, YAML creation, and NCBI downloadsAdded
count_readsprocess to general workflows to report read countsAdded
concatenate_filesprocess config for version trackingAdded per-species MLST database
maketargets (setup_<species>_mlstdb_token,update_<species>_mlstdb) for S. aureus, E. coli, Klebsiella, and S. pyogenesAdded
download_klebsiella_pasteur_cgmlstdbMakefile target for downloading Klebsiella Pasteur cgMLST schemeAdded
log_file_dirparameter andworkflow.onCompletehandler writing pipeline completion logs to a configurable directoryAdded
log_file_dirto cluster profiles incmd.configAdded
kraken_batchprocess to run Kraken2 over all samples in a single job by loading the database into shared memory (/dev/shm) once, avoiding repeated DB loading overheadAdded
use_kraken_batchparameter (default:false) to enable batch Kraken2 classificationAdded
update_emmtyper_dbtarget toMakefileAdded flowchart of the pipeline to the first page
Added details to results section of the docs
Added
clair3(v2.0.0,docker://hkubal/clair3:v2.0.0) as the nanopore variant caller insubworkflows/variant_calling.nf, replacing freebayes for the nanopore platform; default modelr1041_e82_400bps_sup_v430_bacteria_finetunedAdded
clair3_assemblyprocess tosubworkflows/typing.nffor nanopore assembly-based variant calling whenuse_maskingis enabledAdded
params.reference_genome_faito all species profiles innextflow.configfor explicit FAI path configurationAdded
samtools_faidxprocess support for per-sample assembly FAI generation
Fixed
Fixed legend in the flowchart (swapped ONT and Ion Torrent)
Fixed
reads*.toRealPath()spread operator inkraken_batchcollectFileclosure — previously calledtoRealPath()directly on a list, causing aMissingMethodExceptionat runtimeFixed
parmas.citypo inmlstext.whencondition inmodules.configFixed
count_readsoutput channel binding (jsonemit) inquality_control.nfto resolvejoinerror on profiles without a reference genome (e.g.streptococcus)Fixed
error_corr_assembly.plto handle gzipped VCF input (.vcf.gz) viagzip -dcpipe, enabling compatibility with clair3 output
Changed
Renamed
clair3import toclair3_refandfreebayesimport tofreebayes_refinsubworkflows/variant_calling.nfto distinguish reference-based variant callingRenamed
freebayesimport tofreebayes_assemblyinsubworkflows/typing.nfto distinguish assembly-based variant callingUpdated
bacterial_general.nfto loadreference_genome_faidxfromparams.reference_genome_faiinstead of deriving it from the genome pathChanged
skesa,spades_illumina,spades_iontorrent,flye, andmedakato publish to a unifiedfasta/output directoryUpdated chewBBACA to v3.5.3 to enable use of unrestricted length of sample names
Updated resources in processes that read bam files
Removed unnecessary scripts from
bin/Moved
create_yamlprocess frommodules/local/yaml/main.nftomodules/local/jasentool/main.nfMoved
post_align_qcprocess frommodules/local/prp/main.nftomodules/local/jasentool/main.nfUpdated
post_align_qcsignature to removereference_genomeargumentUpdated
create_yaml,post_align_qc, and NCBI genome download commands to usejasentoolcontainer instead ofbonsai-prpReplaced
assets/mlstdb/update_mlstdb.shwith per-speciesMakefiletargets for MLST database updatesRemoved dead
add_igv_trackprocess frommodules/local/prp/main.nfand its config blockChanged cgmlst.org schema to
schema_idasname_id(Saur48) changes oftenUpdated AMRFinderPlus to v4.2.7
Updated kraken2 container to mulled image (kraken2=2.17.1 + coreutils=9.5) to provide GNU dd with iflag=nocache support
Updated bonsai-prp to v1.6.1
Reverted TBProfiler to v6.3.0
Updated emmtyper
ext.argsflag from--dbto--blast_dbUpdated
count_readspublishDir fromread_countstopostalignqcReplaced
curlwithwgetfor emmtyper database download to avoid SSL issuesUpdated Nextflow version in CI
Changed kraken2 singularity image to fetch
kraken2+coreutilscontainer frommulti-package-containersMoved variant calling for masking polymorphisms before cgmlst typing to
subworkflows/typing.nfUpdated flowchart with variant calling step
[1.2.0]
Added
Added
kleboratepipeline for K. pneumoniae and E. coli analysisAdded
--nanoporeoption to VirulenceFinder for analysing ONT dataAdded pulling of
.tar.gzdb files toMakefileAdded
NanoStats.txtfile to output from NanoPlot for easier parsing in prpAdded
bactopia-pyfor updating mlstdbAdded genome size used by Flye as a parameter
reference_sizeto profiles of all the speciesAdded option to turn off masking of the assembly before cgMLST analysis (default: true, if
nanoporeprofile is used, default is false)Added information about ONT workflow, input and output files to documentation
Added ONT test data with 10k reads for S. aureus
Added
filtlongto long-read workflows
Fixed
Updated wgmlst schema name for E. coli due to change in species ID for chewie-NS
Updated names of cgmlst schemas for all organisms to match changes at www.cgmlst.org
Fixed general profile
reference_sizein configFixed channel and spelling errors and mlstdb paths in
cmd.configFixed reporting of qc stats for ONT data in final json file
Fixed documentation describing sample sheet format
Fixed pipeline if
platform = iontorrentis setFixed medaka input channel
Fixed config calling specific versions of bonsai-prp when running offline
Changed
Updated Chewbbaca to v3.4.0 because of problems with E. coli schema preparation
Updated bonsai-prp to v1.5.0
Removed finder, ShigaPass & tbdb submodules
Neatened up mlst db updating scripts
Updated documentation regarding updating mlstdb
Changed
mlst_dbdir tomlstdbto matchbactopia-pyoutputRemoved blast db creation from
Makefileasbactopia-pydoes itRemoved deprecated scripts for downloading mlst db
Removed
_bonsaifrom yaml filenameUpdated
mlstto run if not CI runRemoved tb profiler instructions that are not longer relevant from docs
Removed the execution of
update_mlstdb&update_blast_dbfromupdate_databasesChanged mapping settings in minimap2 from
-x map-ontto-x lr:hqas quality of the data has improvedUpdated Flye 2.9.3 to version 2.9.6
Updated minimap2 2.28 to version 2.30
Updated Medaka 2.0.1 to version 2.2.0
Updated NanoPlot 1.43.0 to 1.46.2
Changed container for Medaka, it includes the models, so they don’t need to be downloaded while running analysis
Changed preset for masking of the assembly before cgMLST analysis for ONT data (default: false, as it is not tested and optimised for ONT data)
Changed
ch_emptytoch_sample_idChanged Klebsiella pneumoniae workflow to general Klebsiella workflow
Change Kpneumoniae cgmlst.org schema to Pasteur scgMLST629_S Klebsiella schema
Changed mask process to run with perl container
Changed flowchart in documentation to include recent updates
Changed hostile to explicitly call aligner to avoid ambiguity errors
[1.1.2]
Added
Fixed
Changed
Updated bonsai-prp to v1.3.3
Changed config variables for
staphylococcusUpdated methods for
staphylococcusUpdated VirulenceFinder to v3.2.0
Changed pulling of submodules in CI to
recursive
[1.1.1]
Added
Fixed
Fixed chewbbaca arg bug
Changed
Changed computational resources for hostile
[1.1.0]
Added
Added missing tools to S. aureus subworkflow tool list
Added spyogenes bed file representing cgmlst targets
Added
prodigalimage download to containerMakefileAdded module for mapping long reads with Minimap2
Added Minimap2 refence index
referenceGenomeMmifor all speciesAdded
samtools_coverageto get stats from mapping long reads to the reference and assemblyAdded
minimap2_align_assemblyAdded
minimap2_indexfor assembly indexingAdded
whenoperator tosamtoolsfornanoporeAdded
assayto yaml outputAdded
staphylococcus_nrlprofileAdded
release_life_cyclevia profilesAdded
create_prp_yaml.pyto binAdded
spatyperandsccmecto yamlAdded
gambitcoreas a moduleAdded
apptainerandsingularityprofiles
Fixed
Fixed stub-run for S.aureus ONT workflow
Fixed typing error in Makefile
Fixed checking out pipeline code with submodules
Fixed genome downloading via
bin/download_ncbi.pywith timed retriesFixed dirty submodules in
.gitmodulesFixed dubious ownership bug where finder dbs retrieve commit ID
Changed
Changed spyogenes genome from
GCF_000006785.2toGCF_005164585.1Updated
check_taxonmethod to include spyogenesChanged ptf downloading to generation of ptf via prodigal for
ecoli&spyogenesRenamed index created by BWA from
referenceGenomeIdxtoreferenceGenomeFaiRenamed
bwa_mem_deduptobwa_mem_assemblySplit
samtools_sortinto two modulesRemoved
abritamras it’s not usedChanged dirname
containertocontainersChanged dirname
configstoconfChanged dirname
nextflow-modulestomodulesChanged importing of
spadesUpdated
Makefilere dir name changesUpdated subworkflows
Grouped most workflows into one (
bacterial_general.nf) to remove duplicate codeChanged to have only one main config
nexflow.configChanged
cmdmodule name tocdmUpdated bonsai-prp to v1.3.1
Changed indentation structure
Moved
platformto config viaparams.platformChanged
hostileioUpdated docs regarding restructuring
Changed
prpsub commandsUpdated CI GA workflow re
container_dirRemoved
check-and-reinit-git-submodulesfrom CI GA workflowRemoved
kmasubmoduleUpdated finder submodules (
virulencefinder_db,resfinder_db,pointfinder_db,serotypefinder_db)Updated
resfinderversionChanged memory allowance for hostile
Updated docs regarding updating finder dbs
[1.0.0]
Added
Light installation doc updates
Updated NGP config file
Added ska2 module process (
ska_build)Added ska2 to
bacterial_base.nfAdded emmtyper module process (
emmtyper)Added emmtyper to
MakefileAdded downloading of Streptococcus & Streptococcus pyogenes files to
makefileAdded
get_taxonto methodsAdded
streptococcusandstreptococcus_pyogenesworkflows and profiles to configsAdded ska filepath to yaml
Added optional read downsampling using seqtk
Added
tbGradingRulesBedto IGV trackAdded module
nanoplotto check quality of raw reads from ONTAdded module
fastqto check qc of strep samplesAdded module
hostileto remove human readsAdded module
sccmecfor typing of SCCmec cassettes in assemblies of Staphylococcus speciesAdded
whenoperators to all modules that aren’t inbacterial_base.nfAdded additional arguments for Resfinder when analysing Nanopore data
Added downloading of hostile index to
MakefileAdded sw docs for
kpneumonia,spyogenes, &streptococcusAdded module
spatyperfor typing of Spa gene in Staphylococcus aureus
Fixed
Interrupted installation now hard resets E.Coli. to avoid fragmented installation
Installation target file for streptococcus changed, to resolve interrupted installation
Included htslib image to natively support tabix and bgzip
Fixed a bug that prevented pipeline from correctly guessing misspelled profiles
Fixed tbprofiler related installation rules in
MakefileAll modules can be called in one workflow -> profile determines whether they are actually executed using
whenstatementEmpty channels fixed
Chewbbaca collection of input fixed
Fixed resfinder
--speciesargFixed
nextflow.hopper.configsymlinkDirRemoved serotypefinder from saureus workflow
Fixed jasen running only on the first row/sample in csv
Fixed channel problem by changing
Channel.of([])toChannel.value([])Fixed medaka by changing
--threadsto-tin all the configsFixed
nextflow.hopper.configre singularity image pathFixed sccmec version file
Fixed handling of ONT samples re fastqc & postalignqc
Fixed io of spatyper and turned on postalignqc for ONT
Fixed io of resfinder for all workflows but saureus
Fixed input array for postalignqc
Changed
Removed sudo requirement from apptainer container creation (only required for older versions of apptainer) to streamline installation
Updated tbdb submodule
Moved taxon related methods to methods directory
Changed spyogenes genome from GCF_900475035.1 to GCF_000006785.2
Changed
containerDirfilepath for Lund configsUpdate
configs/nextflow.dev.configroot fpathChanged the freebayes output directory
Remove
copy_to_cronmoduleUpdated from Singularity v3.8.7 to Apptainer v1.3.6 in the CI pipeline
Updated PRP to version 0.11.4
Changed SerotypeFinder input from reads to assembly
Changed variable formatting of modules
Changed ska & sourmash filepath to symlink dir in
nextflow.hopper.config&nextflow.dev.configUpdated Kraken db filepath in
nextflow.hopper.config&nextflow.dev.configChanged
staphylococcus_aureus_alltosaureus_allinMakefileMoved all
whencommands to configsUpdated
fastqc&sccmecmem settingsChanged model that is used in
medaka_consensusto bacterial model (using--bacteriaargument)
[0.9.0]
Added
Added long-read test data (S. aureus)
Added
samplelist_nanopore.csvfor running long-read test dataAdded location of documentation to
READMEAdded
cdmDirto configAdded NanoPlot module
Added process for adding IGV annotation tracks with PRP.
Updated how
mycobacterium_tuberculosisworkflow adds IGV annotation tracks.
Fixed
Fixed
--qcargument filepath to be full filepath to outputFixed tbprofiler url in container Makefile
Fixed TB installation steps in main Makefile
Changed
Updated TbProfiler to version 6.3
Updated PRP to version 0.10.0
Removed delly annotation
Updated vcf args in prp module
[0.8.0]
Added
Added ShigaPass
Added mlstBlastDb to mlst
Added full path for bam and vcf filepaths
Added bam and bai to bonsai input for
staphylococcus_aureus,escherichia_coli&klebsiella_pneumoniaeAdded
bamDirandvcfDirto config paramsAdded run
bwa_memfrom only when profile is notmycobacterium_tuberculosisAdded
sample_nametoget_seqrun_metaAutomatically publish the pipeline documentation to read the docs.
Fixed
ShigaPass URL fixed
Fixed qc channel regarding
mycobacterium_tuberculosisFixed bwa output file bug and stub
Fixed README re jasentool cmds
Fixed
get_seqrun_metaif statementsFixed getting some software versions
Fixed tb workflow bug
Changed
Fixed output format for tbprofiler
Removed
samtools_sort_reffrom configsChanged
--symlink_dirarg for prpChanged sampleName to sampleID
Updated
flye,freebayes,mask,medaka,post_align_qc,skesa&spadesoutput filenamesUpdated bonsai-prp to v0.9.3
Updated the pipeline documentation.
Removed sudo from make (deprecated)
Updated NGP config for new hardware
Updated tbprofiler memory allocation
[0.7.0]
Added
Added two modules for assembly of long-read Nanopore data: Flye and Medaka
Added
referenceGenometoprp create-bonsai-inputcreated
.faifiles from all genomes inMakefileAdded
samtools_sort_refto tb workflow with 4GB memory (may need more)Added more cpus to tbprofiler
Fixed
Added nanopore to all workflows
Sort and index tbprofiler bam output
Changed
Updated saureus genome from
NC_002951.2toGCF_000012045.1Updated ecoli genome from
NC_000913.3toGCF_000005845.2Updated mtuberculosis genome from
NC_000962.3toGCF_000195955.2Updated kpneumoniae genome from
NC_016845.1toGCF_000240185.1Updated tbprofiler to v6.2.0
Updated
download_ncbi.pyto include.gfffilesUpdated bonsai-prp to v0.8.3
Removed FoHM variant duplicates
[0.6.0]
Added
Added converged_who_fohm_tbdb.csv
Added guide to create tbdb
Added
sequencing_runandlims_idto outputAdded
devModeflagAdded
lims_id&sequencing_runtometamoduleAdded
annotate_dellymoduleAdded prp versions
Added how-to guide on how to create bed file, bgzipped format and index for
annotate_dellyinputPrepare general
Makefilefor incorporation of tbprofiler v6.1.0 to auto create above files and create new tbdbAdd
create_yamlmodule for upload to bonsai
Fixed
Args for cdm
Changed
Run with converged/merged db
Publish bam & bai from tbprofiler
cronCopy set true in hopper config
bonsai-prp version upgraded from v0.5.0 to v0.6.0
Renamed
nextflow.hopper.configtonextflow.dev.configfor hopper developmentbonsai-prp version upgraded from v0.6.0 to v0.7.0
bonsai-prp version upgraded from v0.7.0 to v0.7.1
Changed blastDb to pubMlstDb re mlst
[0.5.0]
Added
Added a GitHub workflow to run a basic CI pipeline.
Build prp as singularity image from dockerhub in Makefile
Chewbacca and virulencefinder sif fetched from galaxyproject
Added species to amrfinderplus
Added getSpeciesTaxonName to amrfinderplus
Add serotypefinder to Makefiles, workflows, modules & configs
Fixed
Fixed config not pointing to the new lowercase repo name: jasen (instead of JASEN)
Changed
Changed to use full file paths in include statements for better navigation in text editors.
Upgraded the Skesa (container) to v2.5.1 to fix ownership issue with /tmp folder
Changed pythonScripts.sif filename to bonsai-prp.sif
Postalignqc added to prp and move to prp module
Makefile converts bonsai-prp from docker image to singularity
Makefile pulls all containers from online respositories
Configs can also pull images from galaxy project or dockerhub
Move post_align_qc to prp module
[0.4.0]
Added
Added tbdb as a submodule.
Downloaded new catalogue from WHO and convert to csv required for tbdb creation using jasentool.
DB creation included using
Makefile.Updated configs.
Updated
workflows/mycobacterium_tuberculosis.nf.Update TBProfiler version to v5.0.1.
Publish delly output when running tbprofiler.
Remove Tbprofiler run using WHO database until TBProfiler issue #313 resolved.
Fixed
Changed
Updated PRP to verion 0.3.0
0.3.0
Added
E.coli STX typing using Virulencefinder
Changed
Moved PRP to seperate repo
Updated PRP to v0.2.0
0.2.0
Added
The pipeline now supports IonTorrent input data
Both Single-End and Paired-End input is now supported
Test suite is entirely containerised within Singularity
Versions of all used software is conveniently written into a single file
Assemblies are automatically cleaned up
AMRFinder has replaced ARIBA
SKESA de novo assembly added
Readme contains simple usage instructions and more detailed breakdown of components
External nextflow-modules are now fully integrated
Changed
Extensive Nextflow code refactoring
Memory and CPU usage defaults optimised for all modules
Optimisation of module invocations to reduce overhead
PRP identifiers updated
Updated all utilised databases to latest version
Various publishDir updates
Fixed
Resolved multiple BLAST database referencing errors
Singularity images are now umasked properly
Minimum recommended RAM expanded to avoid runtime crashes
Fixed ‘failed to publish file’ error
MLST PRP input issues fixed
Metadata is now properly saved
Removed
Removed legacy modules (notably ARIBA)
Reduction to only two software requirements; Singularity and Nextflow