Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
E
ensembl
Manage
Activity
Members
Labels
Plan
Issues
0
Issue boards
Milestones
Iterations
Wiki
Requirements
Jira
Code
Merge requests
1
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Test cases
Artifacts
Deploy
Releases
Package Registry
Container Registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Insights
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
ensembl-gh-mirror
ensembl
Commits
c6fa236a
Commit
c6fa236a
authored
20 years ago
by
Ian Longden
Browse files
Options
Downloads
Patches
Plain Diff
updated
parent
02190e1d
No related branches found
Branches containing commit
No related tags found
Tags containing commit
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
misc-scripts/xref_mapping/README
+28
-3
28 additions, 3 deletions
misc-scripts/xref_mapping/README
with
28 additions
and
3 deletions
misc-scripts/xref_mapping/README
+
28
−
3
View file @
c6fa236a
THe following is the initial first draft/initial mind dump which will be tidiedup.
THe following is the initial first draft/initial mind dump which will be tidied
up.
The Xref tables are created and populated by the scripts in this directory.
The process can be viewed as a two part process.
...
...
@@ -69,6 +70,30 @@ entries loaded. These can then be mapped to the ENSEMBL entitys with the
xref_mapper.pl script.
To add new data to the xrefs you will have to edit sql/populate_metadata.sql
and/or type in the sql.
Add a new source you will insert a new source code. i.e.
INSERT INTO source VALUES (2000, 'NEW', 1, 'Y', 4);
Becouse some sources are dependent on others being loaded the last argument is
the order. Lower numbers are processed first.
You will also have to specify the files to down load and the parser to use.
i.e.
INSERT INTO source_url (source_id, species_id, url, checksum, file_modified_date,
upload_date, parser) VALUES (2000, 9606,'ftp://ftp.new.org/new.gz', '',
now(), now(), "NEWParser");
You will have to create XrefParser/NEWparser.pm.
The parsers.
...
...
@@ -132,8 +157,8 @@ for that particular species.
NOTE: RefSeqParser.pm also exists and can be used to parse the fasta type
files for the Refseq's. At the moment
T
he genbank style files are passed for
both protein and rna files. But the xrefs are on a whole
are
just duplicated
files for the Refseq's. At the moment
t
he genbank style files are passed for
both protein and rna files. But the xrefs are on a whole just duplicated
as they contain bascially the same xref data. A decision will have to be made
as to the benefits/disadvantages of this. The alternative is to pass the rna
as a fasta. (which i think is what the old system used to do, judging by the
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment