DESCRIPTION

fastaq_to_unique_by_id <infile> <outfile>

Removes duplicate sequences from a fasta/q file, based on their names. If the same name is found more than once, then the longest sequence is kept. Order of sequences is preserved in output

positional arguments:

infile

Name of input fasta/q file

outfile

Name of output fasta/q file

optional arguments:

-h, --help

show this help message and exit

AUTHOR

fastaq_to_unique_by_id was originally written by Martin Hunt ([email protected])

COPYING

Wellcome Trust Sanger Institute Copyright © 2013 Wellcome Trust Sanger Institute This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.