A component to download a set of FASTQ files under a certain project/study/experment by providing an accession ID from the European Nucleotide Archive (ENA). The component works by first getting a summary table of samples belonging to the provided accession number. In this summary table are paths (ftp) to zipped FASTQ files of individual samples.
Using these paths, zipped FASTQ files are downloaded and stored under a directory named data/<accessionID> inside <KNIME-workspace>
Supported accession types are Projects, Studies, BioSamples, Samples, Experiments, Runs and Analyses. Refer to https://ena-docs.readthedocs.io/en/latest/submit/general-guide/accessions.html to see details. The component is able to handle both single and paired library layouts in a seamless fashion.
Note:
Files will not be downloaded if their up-to-date version already exists under data/<accessionID>.
- Type: TableDownloaded FilesTable containing the list of files that have been transferred and if they have been transferred or were already up to date. If files are allowed to fail there is an additional column, that will indicate if the transfer was successful or did fail.