1. Plant RNAseq

You are analyzing RNAseq data from a plant sequencing project and find that you have lots of hits to a protein with this sequence:

MIGRADIEGSKSNVAMNAWLPQASYPCGNFSDTSSFKFRRSKGSIGHAFTVRIRTGNQNQTSFYP
FVPHEISVLVELILGHLRYLLTDVPPQPNSPPDNVFRPDRPTKVGLGSKKRGSAPPPIHGIIGLE
SSSTGSSFPADSAKPVPLAVVSLDSRQGQGPKGPVPSPSPDRHASTRSRRGSSSSSPPTADGFET
GTPVPSPQSQSFSRGYGSILPTSLAYIVPSTRGCSPWRPDAVMSTTGHGRHSVLRIFKGRQGRTG
HHATCGALPAAGPYLRLSRFQGGQAICTDDRSAQAHAPGFAATVAPSYSSGPGPCRNGRVSAQLG
TVTQLPVHPASPVLLTKNGPLGALDSMAWLNRAATPSYLFKSFAPIPKSDERFARQYRCGPPPEF
PLASPRSGIVHHLSGPDRYALTRTLHKRSGSVGGATHKRIPPISFLAPNGFTHPLTRTHVRLLGP
CFKTGRMGSPQADARSAHVPKHAESARAVAHNRDDDVSASISTTQAWATITIRVGQFLFIFPSRY
LFAIGLSPIFSLGRNLPPDWGCIPKQPDSPTAPRGATGSEHNGALTLSGAPFQGTWARSAAEDAS
PDYNSNAEGDRFSWWAYPGSLAGRRALNLMASGATCVQRLDGSRDSAIHTKYRISLRSSSMQEPR
YPLPRVILYNVSKHNTHENRLRCHAGIDNDPSAGSPTETLLRLLLPLNDKVQWTSHNVAGSGPPT
SPQSEHFTGPFNRQIAPPTKNGHAPPPIESRKSSQSVNPYYVWTCGVLKATSADPWQMLSQLFVF
HKSKNFTSDYEIRMPPTVPVNHYSDPEGQHNRIRILCAGGTTRPIKARSASPAEGTSQPVHTIGG
PIDPTQAVSQAPSPESNPNSPSPVTTMVGHYPTIES

What does this protein do and why is it so abundant? And…. what’s wrong with it?

<– Previous exercise — | — Next exercise –>