This project is mirrored from https://:*****@github.com/Ensembl/ensembl.git. Pull mirroring updated .
  1. 02 Jan, 2019 1 commit
  2. 23 Dec, 2018 1 commit
  3. 19 Dec, 2018 6 commits
  4. 18 Dec, 2018 2 commits
  5. 17 Dec, 2018 1 commit
  6. 07 Dec, 2018 5 commits
  7. 06 Dec, 2018 1 commit
  8. 12 Nov, 2018 1 commit
  9. 07 Nov, 2018 1 commit
  10. 06 Nov, 2018 1 commit
  11. 26 Oct, 2018 1 commit
  12. 25 Oct, 2018 4 commits
  13. 24 Oct, 2018 6 commits
  14. 18 Oct, 2018 2 commits
  15. 17 Oct, 2018 1 commit
  16. 16 Oct, 2018 1 commit
  17. 15 Oct, 2018 3 commits
    • Wojtek Bazant's avatar
      Fix bug: use return instead of next · 4ab71f7e
      Wojtek Bazant authored
      return goes back one frame up the stack
      next goes back to the closest frame on the stack that supports the
      operation (that is close enough in RefSeqGPFFParser alone)
      It works unless I subclass create_xrefs, and then my Hive workers die:
      
      Lost control. Check your Runnable for loose 'next' statements that are
      not part of a loop       WORKER_ERROR
      4ab71f7e
    • Wojtek Bazant's avatar
      C. elegans specific parsing of RefSeq_dna file · 7d6346f7
      Wojtek Bazant authored
      - New xref: to a WormBase CDS feature
      - Modify WormbaseCElegansRefSeqGPFFParser to serve both kinds of files
      - extract a utility method from RefSeqGPFFParser
      - xref_config.ini stanza for wormbase_cds
      - tests for new functionality
      7d6346f7
    • Wojtek Bazant's avatar
      C. elegans references use WormBase mapping to INSDC protein ids · d66449b6
      Wojtek Bazant authored
      - maintain naming convention: WormBase specific stuff says Wormbase at the front
      - rewrite WormBaseDirectParser
      - WormBaseDirectParser populates protein_ids
      - superclass method to make dependent protein_ids as parent
      - tap into UniProtParser
        + also skip EMBL scaffold ids (we can't reliably assign them)
      - tap into RefSeqGPFFParser
        + extract a method
      - tests for new stuff
        + add %args to parametrise test_parser
      
      Benefits for RefSeqGPFFParser:
      RefSeq proteins have coordinates as part of their identity, so we
      can't reliably sequence match them, we will also pick up all paralogs.
      This change fixes this spurious mapping.
      Benefits for UniProtParser:
      Not the above: UniProt entries are not tied to coordinates so all
      paralogs map to the same entry. We can handle versioning and updates
      a bit better: if WormBase updates an entry and a protein id changes but
      UniProt doesn't reflect this yet, with the change we will still pick up
      the UniProt entry although we can't sequence match any more.
      d66449b6
  18. 01 Oct, 2018 2 commits