Friday 28 June 2019

bioinformatics - Good poly-A filtering rules or tools


I am aligning a large number of ESTs. It seems poly-A tails show in many different ways. In addition to occurring at the very end, they can be flanked by the cloning sequence one one end, or have mismatches/errors. What is a good rule or available tools that will handle the usual cases?


A few examples of the non-trivial cases I found, with their Genbank Accs:


>EE409337
... AAAAAAAAAAAAAAAAAAAAAAAAAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAACCTTGTC
>EE409340
... TTTCTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACTTGTC

>EE409361
... TTGTTAAACTGAAAAAAAAAAAAAAAAAAAAAAAAAAAACCATGTCGGC
TTACTGAATTGAA
>EE420306
.... AAAAAAAGTTATGTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGAAAAAAA
AAAAAAAAAAAAAAAAA

Cross-posted on SeqAnswers,Biostars.




No comments:

Post a Comment

evolution - Are there any multicellular forms of life which exist without consuming other forms of life in some manner?

The title is the question. If additional specificity is needed I will add clarification here. Are there any multicellular forms of life whic...