Skip to content
Snippets Groups Projects
Commit 8ef93d46 authored by Graham McVicker's avatar Graham McVicker
Browse files

updated documentation

parent 3b2f7f6f
No related branches found
No related tags found
No related merge requests found
......@@ -12,12 +12,15 @@ parse_embl.pl
=head1 DESCRIPTION
This script will peform simple loading of an annotated genome in EMBL
flat file format into an Otter database. The script is aimed at
loading bacterial and small eukaryotic genomes published by the
Pathogen Sequencing Unit at the Sanger Institue, but should be able to
load most suitable pre-processed EMBL files.
This is a script originally written by Keith James to load a pathogen
genome from an EMBL flat file into an Otter (modified ensembl) database.
It has been has been stripped down and modified to make it more general
purpose.
This script will peform simple loading of an annotated genome in EMBL
flat file format into an Ensembl database. The script loads a fairly
minimal set of gene information from an EMBL file, but could be
extended to do more.
Certain assumptions are made about the format of the EMBL file such
that it may require pre-processing before loading - these are listed
......@@ -61,10 +64,7 @@ its location it is assumed the ends have not been found.
=item *
The standard Ensembl translation mechanism is not used because it is
hardwired for eukaryotes. The codon table is specified in the command
line and affects all protein coding genes. The 'translation_table' tag
of a feature is ignored, if present.
Non-standard translation tables are currently not handled.
=item *
......@@ -88,6 +88,8 @@ Email kdj@sanger.ac.uk
Stephen Searle, sjms@sanger.ac.uk
Graham McVicker
=head1 METHODS
=cut
......
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment