A component to download a set of FASTQ files under a certain project/study/experment by providing an accession ID from the European Nucleotide Archive (ENA). The component works by first getting a summary table of samples belonging to the provided accession number. In this summary table are paths (ftp) to zipped FASTQ files of individual samples.
Using these paths, zipped FASTQ files are downloaded and stored under a directory named data/<accessionID> inside <KNIME-workspace>
Supported accession types are Projects, Studies, BioSamples, Samples, Experiments, Runs and Analyses. Refer to https://ena-docs.readthedocs.io/en/latest/submit/general-guide/accessions.html to see details. The component is able to handle both single and paired library layouts in a seamless fashion.
Note:
Files will not be downloaded if their up-to-date version already exists under data/<accessionID>.
Using these paths, zipped FASTQ files are downloaded and stored under a directory named data/<accessionID> inside <KNIME-workspace>
Supported accession types are Projects, Studies, BioSamples, Samples, Experiments, Runs and Analyses. Refer to https://ena-docs.readthedocs.io/en/latest/submit/general-guide/accessions.html to see details. The component is able to handle both single and paired library layouts in a seamless fashion.
Note:
Files will not be downloaded if their up-to-date version already exists under data/<accessionID>.