Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
E
ensembl
Manage
Activity
Members
Labels
Plan
Issues
0
Issue boards
Milestones
Iterations
Wiki
Requirements
Jira
Code
Merge requests
1
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Test cases
Artifacts
Deploy
Releases
Package Registry
Container Registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Insights
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
ensembl-gh-mirror
ensembl
Commits
bee4cf7b
Commit
bee4cf7b
authored
21 years ago
by
Arne Stabenau
Browse files
Options
Downloads
Patches
Plain Diff
dnafrag is officially named seq_region now
parent
fd1888d9
No related branches found
Branches containing commit
No related tags found
Tags containing commit
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
docs/ensembl_changes_spec.txt
+33
-33
33 additions, 33 deletions
docs/ensembl_changes_spec.txt
with
33 additions
and
33 deletions
docs/ensembl_changes_spec.txt
+
33
−
33
View file @
bee4cf7b
...
...
@@ -31,9 +31,9 @@ SCHEMA MODIFICATIONS
Proposed New/Modified Tables:
-----------------------------
dnafrag
seq_region
-------
dnafrag
_id int
seq_region
_id int
name varchar
type varchar (or maybe enum)
length int
...
...
@@ -41,14 +41,14 @@ Proposed New/Modified Tables:
dna
---
dnafrag
_id int
seq_region
_id int
sequence varchar
assembly
--------
dnafrag
_id_assembled int
dnafrag
_id_component int
seq_region
_id_assembled int
seq_region
_id_component int
component_start int
component_end int
assembled_start int
...
...
@@ -58,15 +58,15 @@ Proposed New/Modified Tables:
gene
----
For faster retrieval and retrieval independently of transcripts and
exons genes will also have a
dnafrag_id, dnafrag
_start and
dnafrag
_end.
exons genes will also have a
seq_region_id, seq_region
_start and
seq_region
_end.
gene_id int
type varchar
analysis_id int
dnafrag
_id int
dnafrag
_start int
dnafrag
_end int
dnafrag
_strand int (or enum?)
seq_region
_id int
seq_region
_start int
seq_region
_end int
seq_region
_strand int (or enum?)
transcript_count - (is this necessary? - probably can go)
display_xref_id
...
...
@@ -74,7 +74,7 @@ Proposed New/Modified Tables:
transcript
----------
For faster retrieval and retrieval independently of genes and exons
transcripts will also have a
dnafrag_id, dnafrag
_start and
dnafrag
_end.
transcripts will also have a
seq_region_id, seq_region
_start and
seq_region
_end.
The translation_id will be removed, translations will point to transcripts
instead (and pseudogenes will have no translation). Prediction transcripts
will now be stored as normal transcripts without genes. In order to
...
...
@@ -86,10 +86,10 @@ Proposed New/Modified Tables:
transcript_id int
gene_id int (NULLABLE)
exon_count int - (is this necessary?)
dnafrag
_id int
dnafrag
_start int
dnafrag
_end int
dnafrag
_strand int (or enum?)
seq_region
_id int
seq_region
_start int
seq_region
_end int
seq_region
_strand int (or enum?)
display_xref_id int
analysis_id int
...
...
@@ -112,8 +112,8 @@ Proposed New/Modified Tables:
all feature tables
------------------
All feature tables would now have
dnafrag_id, dnafrag_start, dnafrag
_end,
dnafrag
_strand instead of contig_id, contig_start, contig_end
All feature tables would now have
seq_region_id, seq_region_start, seq_region
_end,
seq_region
_strand instead of contig_id, contig_start, contig_end
This includes the repeat_feature, simple_feature, dna_align_feature,
protein_align_feature, exon, marker_feature,
and qtl_feature tables.
...
...
@@ -131,14 +131,14 @@ Removed Tables
contig
------
Contigs are no longer needed. They are stored as entries in the
dnafrag
Contigs are no longer needed. They are stored as entries in the
seq_region
table with type 'contig'. The embl_offset and clone_id will not be
necessary as their relationship to clones can be described by the
assembly table.
clone
-----
Clones are no longer needed. Clones are stored as entries in the
dnafrag
Clones are no longer needed. Clones are stored as entries in the
seq_region
table with type 'clone'. The htg_phase, created and modified timestamps will
be discarded as they are no longer maintained anyway. The embl_acc, version,
and embl_version columns are redundant and will also be discarded. Versions
...
...
@@ -147,7 +147,7 @@ Removed Tables
chromosome
----------
This table is no longer needed. Chromosomes can be stored in the
dnafrag
table with type 'chromosome'.
seq_region
table with type 'chromosome'.
META INFORMATION
...
...
@@ -160,16 +160,16 @@ or it may be better to create a meta_assembly table that is more specific.
This includes the following:
The
dnafrag
type (coordinate system) that every type of feature is stored
The
seq_region
type (coordinate system) that every type of feature is stored
in. This may be based on either logic_names, or upon table names.
The top-level
dnafrag
type (coordinate system). For human
The top-level
seq_region
type (coordinate system). For human
this would be 'chromosome'. For briggsae this may be something like
'scaffold' or 'super contig'. This information would be used to construct
the web display and would possible be the default coordinate system when
a coordinate system is unspecified by a user.
The sequence
dnafrag
type. This describes the dna frag type (coordinate
The sequence
seq_region
type. This describes the dna frag type (coordinate
system) at which the sequence is stored at. For example in human this would
be at the contig or clone level.
...
...
@@ -200,7 +200,7 @@ Slice
A new slice method 'coord_system' will be added and will denote the type
of dna_frag the slice is built on.
Slices will represent a region on a
dnafrag
as opposed to a region on a
Slices will represent a region on a
seq_region
as opposed to a region on a
chromosome. Slices will be immutable (i.e. their attributes will not be
changeable). A new slice will have to be created if the attributes are to
be changed.
...
...
@@ -293,7 +293,7 @@ Chromosome
----------
The Chromosome object is no longer necessary in the new system. The
Chromosome is replaced by Slices with coord_system = 'chromosome' (or
whatever the top level
dnafrag
type is for that species). For backwards
whatever the top level
seq_region
type is for that species). For backwards
compatibility a minimal implementation can remain which inherits from the
Slice object. Statistical information (e.g. known genes, genes, snps) that
was on chromosomes should be possible to calculate directly from the
...
...
@@ -606,13 +606,13 @@ Haplotypes (and the MHC region)
assembly_exception
------------------
dnafrag
_id int
dnafrag
_start int
dnafrag
_end int
seq_region
_id int
seq_region
_start int
seq_region
_end int
exc_type enum('HP', 'PAR')
exc_
dnafrag
_id int
exc_
dnafrag
_start int
exc_
dnafrag
_end int
exc_
seq_region
_id int
exc_
seq_region
_start int
exc_
seq_region
_end int
ori int (may not be needed, may implicitly be 1)
It is possible to retrieve a slice on a haplotype just as any other slice
...
...
@@ -703,10 +703,10 @@ TBD
Circular Chromosomes
--------------------
We can handle circular chromosomes (or any arbitrary circular sequence) in
a similar way the the haplotypes. The
dnafrag
for the circular sequence can
a similar way the the haplotypes. The
seq_region
for the circular sequence can
have a flag set which indicates that it is circular. The slice would have
an additional method is_type('circular') which would return true if the
slice was on a circular
dnafrag
. The following is the algorithm for
slice was on a circular
seq_region
. The following is the algorithm for
retrieval of features on a circular slice:
(a) Split the slice into 3 regions:
(1) slice_start -> 0,
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment