Skip to content
Snippets Groups Projects
Commit 2b6be9b3 authored by Abel Ureta-Vidal's avatar Abel Ureta-Vidal
Browse files

Added new species regexps filters for gene descriptions

parent 087f0a80
No related branches found
No related tags found
No related merge requests found
# regexp used for filter out useless description for Danio rerio
# add more as appropriate, line begining with # are supposed to be comments
^HYPOTHETICAL\s+PROTEIN\.?
^\s*\(?FRAGMENT\)?\.?\s*
^SIMILAR TO HUMAN CDNA KIAA\d+\.?
\ No newline at end of file
# regexp used for filter out useless description for Mus musculus
# add more as appropriate, line begining with # are supposed to be comments
^\(CLONE REM\d+\) ORF \(FRAGMENT\)\.*
^ORF\s*\d+\s+PROTEIN\.*
\(?[0-9A-Z]{10}RIK PROTEIN\)?[ \.]
RIKEN CDNA [0-9A-Z]{10}[ \.;]
.*RIKEN FULL-LENGTH ENRICHED LIBRARY.*PRODUCT:
.*RIKEN FULL-LENGTH ENRICHED LIBRARY.*
\(*HYPOTHETICAL\s+.*
^UNKNOWN\s+.*
CDNA SEQUENCE\s?,? [A-Z]+\d+[ \.;]
CLONE MGC:\d+[ \.;]
MGC:\s*\d+[ \.;]
HYPOTHETICAL PROTEIN,
HYPOTHETICAL PROTEIN \S+[\.;]
DNA SEGMENT, CHR.*
PROTEIN \S+ HOMOLOG\.?
^SIMILAR TO GENE.*
SIMILAR TO PUTATIVE[ \.]
^SIMILAR TO HYPOTHETICAL.*
SIMILAR TO (KIAA|LOC|RIKEN).*
SIMILAR TO GENBANK ACCESSION NUMBER\s+\S+
SIMILAR TO\s+$
EXPRESSED SEQUENCE [A-Z]+\d+[ \.;]
EST [A-Z]+\d+[ \.;]
^\s*\(FRAGMENT\)\.?\s*$
^\s*\(?GENE\)?\.?;?\s*$
\s*\(?GENE\)?\.?;?
\s*\(?PRECURSOR\)?\.?;?
^\s*\(\s*\)\s*$
^\s*\(\d*\)\s*[ \.]$
^\s+\(?\s*$
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment