This workflow loads two sets of ATA-box motifs from FASTA files. The first set contains wildtype motifs of the HBB (Hemoglobin Subunit Beta) gene; the second contains mutated motifs of the same gene.
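For readers unfamiliar with the FASTA format, the following is a minimal sketch of how such files can be read outside KNIME. The function name `read_fasta` and the two records are hypothetical placeholders for illustration; they are not the HBB motifs shipped with the workflow, and the workflow itself uses KNIME nodes rather than this code.

```python
from io import StringIO


def read_fasta(handle):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there is one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            # Sequence data may span several lines; collect them all.
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Placeholder input (assumption): toy sequences, not real HBB motifs.
example = StringIO(">wt_example\nACGTACGT\n>mut_example\nACGAACGT\n")
records = read_fasta(example)
```

In practice the same function could be called once per file, for example on the wildtype file and on the mutated file, before the sequences are passed to an aligner.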
The sequences are aligned with each other using the SeqanTcoffee node. One goal of a multiple alignment is to detect conserved regions in sequences. To visualize these regions, a sequence logo (or consensus logo) is often used, which shows how strongly each position of the sequence is conserved.
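The conservation that a sequence logo conveys is commonly measured as information content in bits per alignment column. The sketch below illustrates that idea under stated assumptions: a DNA alphabet of four letters, gaps written as `-` and ignored, and no small-sample correction. The function name `column_information` and the toy alignment are hypothetical, and this is not necessarily how the logo in this workflow is computed.

```python
import math
from collections import Counter


def column_information(aligned, alphabet_size=4):
    """Return the information content (bits) of each alignment column.

    Information = log2(alphabet_size) - Shannon entropy of the column.
    Gap characters ('-') are ignored; an all-gap column scores 0.0.
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")

    max_bits = math.log2(alphabet_size)
    result = []
    for i in range(length):
        column = [seq[i] for seq in aligned if seq[i] != "-"]
        if not column:
            result.append(0.0)
            continue
        total = len(column)
        counts = Counter(column)
        entropy = -sum((c / total) * math.log2(c / total)
                       for c in counts.values())
        result.append(max_bits - entropy)
    return result


# Toy alignment (assumption): made-up sequences, not the HBB motifs.
toy_alignment = ["ACGT", "ACGA", "ACCT", "ACGT"]
bits = column_information(toy_alignment)
```

Fully conserved columns reach the maximum of 2 bits for DNA, while columns in which the sequences differ score lower, which is why variable positions appear as shorter stacks in a logo.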
In this example, the Generic JavaScript View is used to create a sequence logo. The logo shows at which positions of the motif mutations occur; such mutations can lead to the disease β-thalassemia.
Workflow
Seqan Tcoffee (Multiple Alignment) and Sequence Logo
Used extensions & nodes
Created with KNIME Analytics Platform version 4.1.2.
Generic Workflow Nodes for KNIME
Freie Universitaet Berlin, Universitaet Tuebingen, and the GenericWorkflowNodes Team
Version 1.0.0