This is an example workflow that demonstrates the FASTA Reader component. Here we read protein and gene sequences of all metabolite metabolizing enzymes on HMDB[1]. The files are included within the workflow but are also available at https://hmdb.ca/downloads. From the views of the components,one can see that the sequence length distribution of proteins is identical to that of the genes.
To visualize the length distribution right click on the component and click on "Interactive View: FASTA Reader". We can then filter for sequences of a certain length and revisit the length distribution using the Histogram node.
[1]: Wishart DS, Feunang YD, Marcu A, Guo AC, Liang K, et al., HMDB 4.0 — The Human Metabolome Database for 2018. Nucleic Acids Res. 2018. Jan 4;46(D1):D608-17. 29140435
Used extensions & nodes
Created with KNIME Analytics Platform version 4.2.2
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
Legal
By using or downloading the workflow, you agree to our terms and conditions.