MSA File

This module defines functions and classes for parsing, manipulating, and analyzing multiple sequence alignments.

class MSAFile(msa, mode='r', format=None, aligned=True, **kwargs)[source]

Handle MSA files in FASTA, SELEX and Stockholm formats.

msa may be a filename or a stream. Multiple sequence alignments can be read from or written in FASTA (.fasta), Stockholm (.sth), or SELEX (.slx) format. For spesified extensions, format argument is not needed. If aligned is True, unaligned sequences in the file or stream will cause an IOError exception. filter, a function that returns a boolean, can be used for filtering sequences, see setFilter() for details. slice can be used to slice sequences, and is applied after filtering, see setSlice() for details.

close()[source]

Close the file. This method will not affect a stream.

getFilename()[source]

Return filename, or None if instance is handling a stream.

getFilter()[source]

Return function used for filtering sequences.

getFormat()[source]

Return file format.

getSlice()[source]

Return object used to slice sequences.

getTitle()[source]

Return title of the instance.

isAligned()[source]

Return True if MSA is aligned.

reset()[source]

Return to the beginning of the file.

setFilter(filter, filter_full=False)[source]

Set function used for filtering sequences. filter will be applied to split sequence label, by default. If filter_full is True, filter will be applied to the full label.

setSlice(slice)[source]

Set object used to slice sequences, which may be a slice() or a list() of numbers.

setTitle(title)[source]

Set title of the instance.

write(seq)[source]

Write seq, an Sequence instance, into the MSA file.

closed

True for closed file.

format

Format of the MSA file.

splitSeqLabel(label)

Return label, starting residue number, and ending residue number parsed from sequence label.

parseMSA(filename, **kwargs)[source]

Return an MSA instance that stores multiple sequence alignment and sequence labels parsed from Stockholm, SELEX, or FASTA format filename file, which may be a compressed file. Uncompressed MSA files are parsed using C code at a fraction of the time it would take to parse compressed files in Python.

writeMSA(filename, msa, **kwargs)[source]

Return filename containing msa, a MSA or MSAFile instance, in the specified format, which can be SELEX, Stockholm, or FASTA. If compressed is True or filename ends with .gz, a compressed file will be written. MSA instances will be written using C function into uncompressed files.

Read the Docs v: v1.5
Versions
latest
v1.5
Downloads
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.