Joining two paired-end files into one file with joined (single-end) reads. Automatically reverse-complements reads of the second input file.

This simple program takes as input two mate pair or paired-end files and outputs a file where both mate sequences have been joined together. The FASTA file with joined mates is an required input file for the paired-end mode of Gustaf. The tool assumes the mates in the second file to be reverse complemented compared to the first file. This behaviour can be turned off using the command line argument "-rc".

Given only one input file and two output files, the program will split the reads from the input files at half length, and write the first half of each sequence as mates1 into the first output file and the reversed complemented second half of each sequence as mates2 into the second output file. Reverse complementing the sequences can again be turned off using "-rc".

To prepare the joined mate file for the Gustaf paired-end example, call

./gustaf_mate_joining adeno_modified_reads_mates1.fa adeno_modified_reads_mates2.fa -rc -o adeno_modified_reads_joinedMates.fa

Web Documentation for GustafMateJoining

Input Ports

  1. Type: URI Object

Output Ports

  1. Type: URI Object
    Set name of output FASTA/FASTQ file(s). [fasta,fa,fq,fastq]


This node is part of the extension



Short Link

Drag node into KNIME Analytics Platform