Skip to content

backport ENSCORESW-2767: multi-species support for protein feature

Marek Szuba requested to merge bugfix/multispecies_pf into master

Created by: magaliruffier


  • Filling out the template is required. Any pull request that does not include enough information to be reviewed in a timely manner may be closed at the maintainers' discretion;
  • Review the contributing guidelines for this repository; remember in particular:
    • do not modify code without testing for regression
    • provide simple unit tests to test the changes
    • if you change the schema you must patch the test databases as well, see Updating the schema
    • the PR must not fail unit testing


Using one or more sentences, describe in detail the proposed changes.

In e!92 was updated to be a BaseAlignFeatureAdaptor rather than a BaseAdaptor to support GIFTS import, with a side effect of failing for collection databases. There is already a hack in place for feature types that needs to be restricted to species_id (in coord_system table), which helps the UnmappedObjectAdaptor to get around it. Similar workaround is implemented for BaseAlignFeatureAdaptor.

Use case

Describe the problem. Please provide an example representing the motivation behind the need for having these changes in place.

The production pipeline (VEP cache creation pipeline on EG collection databases) and the transcript pages for species that are in collection db failed with similar error messages ( 'Unknown column 'pf.seq_region_id' in 'where clause' at


If applicable, describe the advantages the changes will have.

Both the above use cases works with the changes in place.

Possible Drawbacks

If applicable, describe any possible undesirable consequence of the changes.

We might be missing some scenarios where this could still fail.


Have you added/modified unit tests to test the changes? Yes

If so, do the tests pass/fail? Pass

Have you run the entire test suite and no regression was detected? Yes

Merge request reports