Commit ca112c52 authored by Will Spooner

Added code to remove anything in parens from the RefSeq entry sequence line. E.g. s/Oryza sativa (ssp. japonica)/Oryza sativa/
parent be16c052
@@ -116,6 +116,7 @@ sub create_xrefs {
my ($species) = $entry =~ /\s+ORGANISM\s+(.*)\n/;
$species = lc $species;
$species =~ s/^\s*//g;
$species =~ s/\s*\(.+\)//; # Ditch anything in parens
$species =~ s/\s+/_/g;
$species =~ s/\n//g;
my $species_id_check = $name2species_id{$species};
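
To make the effect of the added line concrete, here is a minimal standalone sketch that applies the same substitutions as the hunk above to a made-up record fragment. The helper name `normalise_species` and the sample `$entry` text are assumptions for illustration only; neither appears in the original source.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of the original file): applies the same
# substitutions as the hunk above to a raw species string.
sub normalise_species {
    my ($species) = @_;
    $species = lc $species;
    $species =~ s/^\s*//g;
    $species =~ s/\s*\(.+\)//;    # Ditch anything in parens
    $species =~ s/\s+/_/g;
    $species =~ s/\n//g;
    return $species;
}

# Made-up record fragment containing an ORGANISM line.
my $entry = "LOCUS       EXAMPLE\n  ORGANISM  Oryza sativa (ssp. japonica)\n";
my ($raw) = $entry =~ /\s+ORGANISM\s+(.*)\n/;

print normalise_species($raw), "\n";    # prints "oryza_sativa"
```

Because `.+` is greedy, a name with more than one parenthesised group loses everything from the first `(` to the last `)`, including any text in between.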