Bio::AlignIO stockholm
SummaryIncluded librariesPackage variablesSynopsisDescriptionGeneral documentationMethods
Toolbar
WebCvsRaw content
Summary
Bio::AlignIO::stockholm - stockholm sequence input/output stream
Package variables
No package variables defined.
Included modules
Bio::AlignIO
Inherit
Bio::AlignIO
Synopsis
Do not use this module directly. Use it via the Bio::AlignIO class.
Description
This object can transform Bio::Align::AlignI objects to and from
stockholm flat file databases.
Methods
next_alnDescriptionCode
write_alnDescriptionCode
Methods description
next_alncode    nextTop
 Title   : next_aln
Usage : $aln = $stream->next_aln()
Function: returns the next alignment in the stream.
Returns : Bio::Align::AlignI object
Args : NONE
write_alncodeprevnextTop
 Title   : write_aln
Usage : $stream->write_aln(@aln)
Function: writes the $aln object into the stream in stockholm format ###Not yet implemented!###
Returns : 1 for success and 0 for error
Args : Bio::Align::AlignI object
Methods code
next_alndescriptionprevnextTop
sub next_aln {
    my $self = shift;
    my $entry;

    my ($start,$end,%align,$name,$seqname,$seq,$count,
	%hash,%c2name, %accession, $no);

    # in stockholm format, every non-blank line that does not start
# with '#=' is an alignment segment; the '#=' lines are mark up lines.
# Of particular interest are the '#=GF <name/st-ed> AC <accession>'
# lines, which give accession numbers for each segment
my $aln = Bio::SimpleAlign->new(-source => 'stockholm'); while( defined($entry = $self->_readline) ) { $entry !~ /\w+/ && next; if ($entry =~ /^#\s*STOCKHOLM\s+/) { last; } else { $self->throw("Not Stockholm format: Expecting\" # STOCKHOLM 1.0\"; Found\" $_\""); } } #
# Next section is same as for selex format
#
while( defined($entry = $self->_readline) ) { # Double slash (//) signals end of file. The flat Pfam-A data from
# ftp://ftp.sanger.ac.uk/pub/databases/Pfam/Pfam-A.full.gz consists
# of several concatenated Stockholm-formatted files. The following
# line makes it possible to parse it without this module trying to
# read the whole file into memory. Andreas Kähäri 10/3/2003.
last if $entry =~ /^\/\//; # Extra bonus: Get the name of the alignment.
# Andreas Kähäri 10/3/2003.
if ($entry =~ /^#=GF\s+AC\s+(\S+)/) { $aln->id($1); next; } $entry =~ /^#=GS\s+(\S+)\s+AC\s+(\S+)/ && do { $accession{ $1 } = $2; next; }; $entry =~ /^([A-Za-z.-]+)$/ && ( $align{$name} .= $1 ) && next; $entry !~ /^([^#]\S+)\s+([A-Za-z.-]+)\s*/ && next; $name = $1; $seq = $2; if( ! defined $align{$name} ) { $count++; $c2name{$count} = $name; } $align{$name} .= $seq; } # ok... now we can make the sequences
foreach $no ( sort { $a <=> $b } keys %c2name ) { $name = $c2name{$no}; if( $name =~ /(\S+)\/(\d+)-(\d+)/ ) { $seqname = $1; $start = $2; $end = $3; } else { $seqname=$name; $start = 1; $end = length($align{$name}); } $seq = new Bio::LocatableSeq('-seq'=>$align{$name}, '-id'=>$seqname, '-start'=>$start, '-end'=>$end, '-type'=>'aligned', '-accession_number' => $accession{$name}, ); $aln->add_seq($seq); } # If $end <= 0, we have either reached the end of
# file in <fh> or we have encountered some other error
#
if ($end <= 0) { undef $aln;} return $aln;
}
write_alndescriptionprevnextTop
sub write_aln {
    my ($self,@aln) = @_;

    $self->throw("Sorry: stockholm-format output, not yet implemented! /n");
}

1;
}
General documentation
FEEDBACKTop
Reporting BugsTop
Report bugs to the Bioperl bug tracking system to help us keep track
the bugs and their resolution. Bug reports can be submitted via email
or the web:
  bioperl-bugs@bio.perl.org
http://bugzilla.bioperl.org/
AUTHORS - Peter SchattnerTop
Email: schattner@alum.mit.edu
CONTRIBUTORSTop
Andreas Kahari, ak@ebi.ac.uk
APPENDIXTop
The rest of the documentation details each of the object
methods. Internal methods are usually preceded with a _