.TH "esl\-seqstat" 1 "@EASEL_DATE@" "Easel @EASEL_VERSION@" "Easel Manual"
.SH NAME
esl\-seqstat \- summarize contents of a sequence file
.SH SYNOPSIS
.B esl\-seqstat
[\fIoptions\fR]
.I seqfile
.SH DESCRIPTION
.PP
.B esl\-seqstat
summarizes the contents of the
.IR seqfile .
It prints the format, alphabet type, number of sequences, total number
of residues, and the mean, smallest, and largest sequence length.
.PP
If
.I seqfile
is \- (a single dash),
sequence input is read from stdin.
.SH OPTIONS
.TP
.B \-h
Print brief help; includes version number and summary of
all options, including expert options.
.TP
.B \-a
Additionally print one summary line per sequence, showing the name,
length, and description of that sequence. Each of these lines is
prefixed by an = character, so that they can easily be grepped out of
the output.
.TP
.B \-c
Additionally print the residue composition of the sequence file.
.SH EXPERT OPTIONS
.TP
.BI \-\-informat " <s>"
Assert that input
.I seqfile
is in format
.IR <s> ,
bypassing format autodetection.
Common choices for
.I <s>
include:
.BR fasta ,
.BR embl ,
.BR genbank .
Alignment formats also work;
common choices include:
.BR stockholm ,
.BR a2m ,
.BR afa ,
.BR psiblast ,
.BR clustal ,
.BR phylip .
For more information, and for codes for some less common formats,
see the main documentation.
The string
.I <s>
is case-insensitive (\fBfasta\fR and \fBFASTA\fR both work).
.TP
.B \-\-amino
Assert that the
.I seqfile
contains protein sequences.
.TP
.B \-\-dna
Assert that the
.I seqfile
contains DNA sequences.
.TP
.B \-\-rna
Assert that the
.I seqfile
contains RNA sequences.
.SH SEE ALSO
.nf
@EASEL_URL@
.fi
.SH COPYRIGHT
.nf
@EASEL_COPYRIGHT@
@EASEL_LICENSE@
.fi
.SH AUTHOR
.nf
http://eddylab.org
.fi