This component reads DNA, RNA, or protein sequences from a FASTA file. The result is a three-column table (ID, Sequence, and SequenceLength) in which each sequence record is one row.
You can read more about the FASTA file format at:
https://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs&DOC_TYPE=BlastHelp
The component can handle multi-line FASTA entries, where one sequence is wrapped across several lines. It can also read gzip-compressed FASTA files (.fa.gz) directly. KNIME URLs such as "knime://knime.workflow/../file.fasta" are supported.
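The parsing behavior described above can be illustrated with a minimal Python sketch. It is not the component's actual implementation: the function names `open_fasta` and `parse_fasta` are hypothetical, the choice to use the whole header line (after ">") as the ID is an assumption, and KNIME URL resolution is not covered.

```python
import gzip
import io


def open_fasta(path):
    """Open a FASTA file as text; .gz files are decompressed on the fly."""
    if str(path).endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


def parse_fasta(handle):
    """Yield (ID, Sequence, SequenceLength) for each record in a text handle.

    Assumption: the ID is the full header line without the leading '>'.
    """
    seq_id, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if seq_id is not None:
                seq = "".join(chunks)
                yield seq_id, seq, len(seq)
            seq_id, chunks = line[1:], []
        elif seq_id is not None:
            # Multi-line entries: collect every sequence line of the record.
            chunks.append(line)
    if seq_id is not None:
        seq = "".join(chunks)
        yield seq_id, seq, len(seq)


# In-memory example with one multi-line entry and one single-line entry.
example = io.StringIO(">seq1 test\nACGT\nACG\n>seq2\nMKV\n")
rows = list(parse_fasta(example))
```

For a file on disk, the same generator could be used as `with open_fasta("file.fa.gz") as fh: rows = list(parse_fasta(fh))`, where the file name is only a placeholder.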
The component's view is a histogram that shows the length distribution of the sequences.
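As a rough illustration of what a length distribution is, the sketch below groups sequence lengths into fixed-width bins and prints a text histogram. The function name `length_histogram` and the bin width of 10 are assumptions made for this example; the binning the component's view actually uses is not stated here.

```python
from collections import Counter


def length_histogram(lengths, bin_width=10):
    """Count sequence lengths per bin; keys are the lower bound of each bin."""
    counts = Counter((n // bin_width) * bin_width for n in lengths)
    return dict(sorted(counts.items()))


# Example SequenceLength values (made up for illustration).
bin_width = 10
hist = length_histogram([7, 3, 12, 18, 25], bin_width=bin_width)
for start, count in hist.items():
    # One '#' per sequence whose length falls in [start, start + bin_width - 1].
    print(f"{start:>4}-{start + bin_width - 1:<4} {'#' * count}")
```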
- Type: Table
  Sequence records: Sequence records per row