BLASTX nr result
ID: Cocculus23_contig00006861
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus23_contig00006861 (3889 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 357 3e-95 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 354 2e-94 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 353 3e-94 dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like ... 353 3e-94 gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 347 2e-92 gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] 345 9e-92 gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc... 342 6e-91 gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal... 342 1e-90 gb|AAC95175.1| putative non-LTR retroelement reverse transcripta... 341 1e-90 gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm,... 340 4e-90 gb|AAC33226.1| putative non-LTR retroelement reverse transcripta... 330 3e-87 emb|CAN77580.1| hypothetical protein VITISV_015346 [Vitis vinifera] 322 7e-85 ref|XP_002278276.1| PREDICTED: pentatricopeptide repeat-containi... 318 1e-83 ref|XP_006841544.1| hypothetical protein AMTR_s00003p00166290 [A... 309 8e-81 gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] 307 2e-80 ref|NP_001154694.1| pentatricopeptide repeat-containing protein ... 306 5e-80 emb|CAB83319.1| putative protein [Arabidopsis thaliana] 306 5e-80 emb|CAB72467.1| putative protein [Arabidopsis thaliana] 305 1e-79 gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA... 303 3e-79 ref|XP_006289785.1| hypothetical protein CARUB_v10003387mg [Caps... 302 7e-79 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 357 bits (915), Expect = 3e-95 Identities = 210/630 (33%), Positives = 320/630 (50%), Gaps = 17/630 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 +L G N +P MLK+D+ KAFDS+ W F+ A++ + P +F W+ +CI T +F+ Sbjct: 574 DLVHGYNWSNISPRGMLKVDLKKAFDSVRWEFVIAALRALAIPEKFINWISQCISTPTFT 633 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG GFF +GLRQGDPLSPYLF + ++ S+ L + S K S+L++S Sbjct: 634 VSINGGNGGFFKSTKGLRQGDPLSPYLFVLAMEAFSNLLHSRYESGLIHYHPKASNLSIS 693 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSNS-GNDLSRIL 542 HL FADDV+IF + + + +C L +F WSGL++N+DKS +++AG N ++ + Sbjct: 694 HLMFADDVMIFFDGGSFSLHGICETLDDFASWSGLKVNKDKSHLYLAGLNQLESNANAAY 753 Query: 543 QVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQS 722 +G LP+++LGLP++ KL IA+ PL+ + W K LS+AGR++L+ +V+ Sbjct: 754 GFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKITARFRSWVNKCLSFAGRIQLISSVIFG 813 Query: 723 SYIFWTGAFPIPYSVCSKLESLMGSFLRG----KSKLRLISWATICRPLEEGGLGIRRIK 890 S FW F +P ++ESL FL ++K +SWA +C P EGGLG+RR+ Sbjct: 814 SINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPKSEGGLGLRRLL 873 Query: 891 DMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFA 1070 + NK +L+W ++ +K SLW W H L S W SW ++R+L +R Sbjct: 874 EWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTWKRLLSLRPLAH 933 Query: 1071 NHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPS 1250 VGNG +W W G L G+ R+P A + W P S Sbjct: 934 QFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASAFSEDGWRLPVS 993 Query: 1251 SSPMVRTAWRQF--QQIPKLGCDEEDQFVWSP----CPSGLFSVASAWEQIRHHYDVWEW 1412 S + +P ++ D++ WS C FS A WE IR V W Sbjct: 994 RSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQG--FSAAKTWEAIRPKATVKSW 1051 Query: 1413 TELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSHCELCWAGVESEDHLFFECP 1586 +WF +PK +F W L++L T+ +L +G C LC ES DHL C Sbjct: 1052 ASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASESRDHLLLICE 1111 Query: 1587 FSSEVWMRIKVKCWRNVQVVRGRFQESQTILRSF---GLNRADGVIRKLCYTVTVHFIWW 1757 FS++VW + +R + R R S + L S+ A ++RK+ V V+ +W Sbjct: 1112 FSAQVWRLV----FRRI-CPRQRLFSSWSELLSWVRQSSPEAPPLLRKIVSQVVVYNLWR 1166 Query: 1758 ERNMRLFNKGWRSATRLAEEII-QLVHQKV 1844 +RN L N + RLA +I +LV +++ Sbjct: 1167 QRNNLLHN-----SLRLAPAVIFKLVDREI 1191 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 354 bits (908), Expect = 2e-94 Identities = 206/603 (34%), Positives = 303/603 (50%), Gaps = 13/603 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 E+ G R N +P MLK+D+ KAFDS+ W F+ A++ + P R+ W+ +CI T SF+ Sbjct: 434 EMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFT 493 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG+T GFF +GLRQGDPLSPYLF + +++ S L + S K L++S Sbjct: 494 ISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSIS 553 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSN-SGNDLSRIL 542 HL FADDV+IF +S+ + +C L +F WSGL++N+DKS +F AG + S S Sbjct: 554 HLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSAAY 613 Query: 543 QVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQS 722 G P+++LGLP++ KL IAD PL+ + +L W +K LS+AGR +L+ +V+ Sbjct: 614 GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673 Query: 723 SYIFWTGAFPIPYSVCSKLESLMGSFLRGKS----KLRLISWATICRPLEEGGLGIRRIK 890 FW F +P K+ESL FL S K +SW C P EGGLG R Sbjct: 674 LINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFG 733 Query: 891 DMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFA 1070 + NK L +L+W ++ SLW QW L + S W W ++ +L +R Sbjct: 734 EWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLAE 793 Query: 1071 NHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPS 1250 VGNG FW W G L+ G+ RIP A + + + W P S Sbjct: 794 KFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWRLPLS 853 Query: 1251 SSPMVRTAWRQFQQIPKLG-CDEEDQFVWSPCPSGL----FSVASAWEQIRHHYDVWEWT 1415 S + +P D + W C + FS A WE +R V W Sbjct: 854 RSLTADSILSHLASLPPPSPLMVSDSYSW--CVDDVDCQGFSAAKTWEVLRPRRPVKRWA 911 Query: 1416 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSHCELCWAGVESEDHLFFECPF 1589 + VWF +PK +F W L++LPT+ +L +G + + C LC E+ DHL C F Sbjct: 912 KSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDHLLLLCDF 971 Query: 1590 SSEVWMRIKVK-CWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERN 1766 SS+VW + ++ C R Q + + E + R A ++RK+ + V+ +W +RN Sbjct: 972 SSQVWRMVFLRLCPR--QRLLCTWAELLSWTRQ-STAAAPSLLRKVVAQLVVYNLWRQRN 1028 Query: 1767 MRL 1775 + L Sbjct: 1029 LVL 1031 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 353 bits (907), Expect = 3e-94 Identities = 206/603 (34%), Positives = 302/603 (50%), Gaps = 13/603 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 E+ G R N +P MLK+D+ KAFDS+ W F+ A++ + P R+ W+ +CI T SF+ Sbjct: 434 EMVHGYNRLNISPRGMLKVDLKKAFDSVKWEFVTAALRALAIPERYINWIHQCITTPSFT 493 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG+T GFF +GLRQGDPLSPYLF + +++ S L + S K L++S Sbjct: 494 ISVNGATGGFFRSTKGLRQGDPLSPYLFVLAMEVFSKLLYSRYDSGYIHYHPKAGDLSIS 553 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSN-SGNDLSRIL 542 HL FADDV+IF +S+ + +C L +F WSGL++N+DKS +F AG + S S Sbjct: 554 HLMFADDVMIFFDGGSSSMHGICETLDDFADWSGLKVNKDKSQLFQAGLDLSERITSAAY 613 Query: 543 QVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQS 722 G P+++LGLP++ KL IAD PL+ + +L W +K LS+AGR +L+ +V+ Sbjct: 614 GFPAGTFPIRYLGLPLMCRKLRIADYGPLLEKLSARLRSWVSKALSFAGRTQLISSVIFG 673 Query: 723 SYIFWTGAFPIPYSVCSKLESLMGSFLRGKS----KLRLISWATICRPLEEGGLGIRRIK 890 FW F +P K+ESL FL S K +SW C P EGGLG R Sbjct: 674 LINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVDCCLPKSEGGLGFRSFG 733 Query: 891 DMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFA 1070 + NK L +L+W ++ SLW QW L + S W W ++ +L +R Sbjct: 734 EWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQTDPWTWKMLLNLRPLAE 793 Query: 1071 NHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPS 1250 VGNG FW W G L+ G+ RIP A + + + W P S Sbjct: 794 KFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSAKVADAIDGSGWRLPLS 853 Query: 1251 SSPMVRTAWRQFQQIPKLG-CDEEDQFVWSPCPSGL----FSVASAWEQIRHHYDVWEWT 1415 S + +P D + W C + FS A WE +R V W Sbjct: 854 RSLTADSILSHLASLPPPSPLMVSDSYSW--CVDDVDCQGFSAAKTWEVLRPRRPVKRWA 911 Query: 1416 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFG--AQSHCELCWAGVESEDHLFFECPF 1589 VWF +PK +F W L++LPT+ +L +G + + C LC E+ DHL C F Sbjct: 912 RSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSFDTETRDHLLLLCDF 971 Query: 1590 SSEVWMRIKVK-CWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERN 1766 SS+VW + ++ C R Q + + E + R A ++RK+ + V+ +W +RN Sbjct: 972 SSQVWRMVFLRLCPR--QRLLCTWAELLSWTRQ-STAAAPSLLRKVVAQLVVYNLWRQRN 1028 Query: 1767 MRL 1775 + L Sbjct: 1029 LVL 1031 >dbj|BAB09379.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 1223 Score = 353 bits (906), Expect = 3e-94 Identities = 203/624 (32%), Positives = 318/624 (50%), Gaps = 25/624 (4%) Frame = +3 Query: 54 LKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFSPLMNGSTCGFFFGKRG 233 +K+DI KAFDS+ W F+ + FP F W+ CI T SFS +NG G+F RG Sbjct: 596 IKIDISKAFDSVQWPFLINVFTILGFPREFIHWINICITTASFSVQVNGELAGYFQSSRG 655 Query: 234 LRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLSHLAFADDVIIFLKPSA 413 LRQG LSPYLF I + +LS L ++ F KC ++ L+HL+FADD+++ Sbjct: 656 LRQGCALSPYLFVICMDVLSKMLDKAAAARHFGYHPKCKTMGLTHLSFADDLMVLSDGKI 715 Query: 414 STANQLCNILMEFEGWSGLRLNRDKSTIFVAG--SNSGNDLSRILQVKLGQLPVKHLGLP 587 + ++ + EF WSGLR++ +KST+++AG + + N+++ GQLPV++LGLP Sbjct: 716 RSIERIIKVFDEFAKWSGLRISLEKSTVYLAGLSATARNEVADRFPFSSGQLPVRYLGLP 775 Query: 588 VIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQSSYIFWTGAFPIPYSV 767 +I +LS DC PL+ +++ W ++ LSYAGRL L+ +VL S FW AF +P Sbjct: 776 LITKRLSTTDCLPLLEQVRKRIGSWTSRFLSYAGRLNLISSVLWSICNFWLAAFRLPRKC 835 Query: 768 CSKLESLMGSFLRGKSKLR----LISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIY 935 +LE + +FL +++ ISW +C+P +EGGLG+R +K+ N KL+W I Sbjct: 836 IRELEKMCSAFLWSGTEMNSNKAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIV 895 Query: 936 SSKKSLWVQWIHSRFLRNNSIW-TATIPNDVSWVYRRILKIRNQFANHCFNLVGNGDATK 1112 S SLWV+W+ LRN S W + SW+++++LK R VGNG T Sbjct: 896 SHSNSLWVKWVDQHLLRNASFWEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGNGKQTS 955 Query: 1113 FWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV----REAGYWNSPPS-SSPMVRTAW 1277 FW W G LL+ G+ + I RR T++E R+ + N + ++ +W Sbjct: 956 FWYDNWSDLGQLLERTGDRGLIDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDALKKSW 1015 Query: 1278 RQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPK 1448 + ED+ +W S FS W R W +++WF PK Sbjct: 1016 -------DTRTETEDKVLWRGKSDVFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHATPK 1068 Query: 1449 CSFTCWRMLLSKLPTKDKLTRF--GAQSHCELCWAGVESEDHLFFECPFSSEVWMRIKVK 1622 SF W +LPT D++ + G + C C +E+ DHLFF C F+S +W Sbjct: 1069 YSFCSWLAAHGRLPTGDRMINWANGIATDCIFCQGTLETRDHLFFTCSFTSVIW------ 1122 Query: 1623 CWRNVQVVRGRFQESQT--------ILRSFGLNRADGVIRKLCYTVTVHFIWWERNMRLF 1778 V + RG F+ T + + +R + +R+ + T++ +W ERN R Sbjct: 1123 ----VDLARGIFKTQYTSHWQSIIEAITNSQHHRVEWFLRRYVFQATIYIVWRERNGRRH 1178 Query: 1779 NKGWRSATRLAEEIIQLVHQKVST 1850 + +A++L I + + ++S+ Sbjct: 1179 GEPPNTASQLVGWIDKQIRNQLSS 1202 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 347 bits (890), Expect = 2e-92 Identities = 202/637 (31%), Positives = 326/637 (51%), Gaps = 19/637 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL + + + + +K+DI KAFDS+ W F+ + M FP F W+ C+ T SFS Sbjct: 301 ELVKDYHKDSISTRCAMKIDISKAFDSLQWSFLTHVLAAMNFPGEFIHWISLCMSTASFS 360 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG G+F RGLRQG LSPYLF I + +LS L ++EF +C +L L+ Sbjct: 361 IQVNGELAGYFRSARGLRQGCSLSPYLFVISMDVLSRMLDKAAGAREFGYHPRCKTLGLT 420 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAG--SNSGNDLSRI 539 HL FADD++I + + + +L +F GL++ +K+T+++AG +S +S Sbjct: 421 HLCFADDLMILTDGKIRSVDGIVKVLNQFAAKLGLKICMEKTTLYLAGVSDHSRQLMSSR 480 Query: 540 LQVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQ 719 +G+LPV++LGLP++ +L+ +D +PL+ R++ W ++ LS+AGRL L+ +VL Sbjct: 481 YSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVLW 540 Query: 720 SSYIFWTGAFPIPYSVCSKLESLMGSFLRGKSKLR----LISWATICRPLEEGGLGIRRI 887 S FW AF +P +++ + + L +L +SW IC+P +EGGLG++ + Sbjct: 541 SITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQSL 600 Query: 888 KDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQ 1064 ++ NK KL+W + S + SLWV+W L+ S W+ + + SW++RR+LK R Sbjct: 601 REANKVSSLKLIWRLLSCQDSLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHREV 660 Query: 1065 FANHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSP 1244 + C V NG T FW W +G L++ G + I R T+ E W+ Sbjct: 661 AKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAEA-----WSRR 715 Query: 1245 PSSSPMVRTAWRQFQQI-----PKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYD 1400 V +F++I + ED +W FS W IR + Sbjct: 716 RRKRHRVEIL-NEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSSN 774 Query: 1401 VWEWTELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSHCELCWAGVESEDHLF 1574 W + VWF PK SF W + ++L T D++ + G + C C + +E+ DHLF Sbjct: 775 QRAWHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETRDHLF 834 Query: 1575 FECPFSSEVWMRIKVKCWRNVQVVRGRFQESQTI--LRSFGLNRADGVIRKLCYTVTVHF 1748 F+C +SSE+W I +NV R + S + + +R + + + V++H Sbjct: 835 FQCCYSSEIWTSIA----KNVYKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHS 890 Query: 1749 IWWERNMRLFNKGWRSATRLAEEIIQLVHQKVSTSKK 1859 IW ERN R + RSA+ L +I + + ++ST KK Sbjct: 891 IWRERNSRRHGEKSRSASNLIRQIDKTIRNQLSTIKK 927 >gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana] Length = 872 Score = 345 bits (885), Expect = 9e-92 Identities = 194/603 (32%), Positives = 310/603 (51%), Gaps = 14/603 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL + + + + +K+DI KAFDS+ W F+ + M F P F W+ CI T SFS Sbjct: 227 ELVKDYHKDSISARCAIKIDISKAFDSVQWSFLTNTLVAMNFSPTFIHWINLCITTASFS 286 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG G+F KRGLRQG LSPYLF I + +LS L ++F KC L L+ Sbjct: 287 VQVNGDLVGYFQSKRGLRQGCSLSPYLFVICMDVLSKMLDKAAGVRKFGFHPKCQRLGLT 346 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSNS--GNDLSRI 539 HL+FADD+++ + + + EF SGLR++ +KST+++AG + +++ Sbjct: 347 HLSFADDLMVLSDGKTRSIEGILEVFDEFCKRSGLRISLEKSTLYMAGVSPIIKQEIAAK 406 Query: 540 LQVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQ 719 +GQLPV++LGLP++ +L+ AD +PL+ +++ W + S+AGR L+K+VL Sbjct: 407 FLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATWTFRFFSFAGRFNLIKSVLW 466 Query: 720 SSYIFWTGAFPIPYSVCSKLESLMGSFLRGKSKL----RLISWATICRPLEEGGLGIRRI 887 S FW AF +P +++ L SFL S++ ISW +C+P EGGLG+R + Sbjct: 467 SICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKISWDIVCKPKAEGGLGLRNL 526 Query: 888 KDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQ 1064 K+ N KL+W I S+ SLW +W+ +R SIW+ + SW++R+ILKIR+ Sbjct: 527 KEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTSMGSWIWRKILKIRDV 586 Query: 1065 FANHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSP 1244 + VGNG++ FW W G L+DT G+ + IPR A++ + Sbjct: 587 AKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPREASVADAWTRRSRRRH 646 Query: 1245 PSSSPMVRTAWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWT 1415 +S +Q+I D ED +W + FS W I+ W Sbjct: 647 RTSLLNEIEEMMAYQRIHH--SDAEDTVLWRGKNDVFKPHFSTRDTWHLIKATSSTVSWH 704 Query: 1416 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGA----QSHCELCWAGVESEDHLFFEC 1583 + VWF PK + W + ++LPT D++ ++ + +C LC ++ +HLFF C Sbjct: 705 KGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCTNNSKTLEHLFFSC 764 Query: 1584 PFSSEVWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWER 1763 ++S VW + W+ R+ T + + +R +G + + + T++ +W ER Sbjct: 765 SYASTVWAALAKGIWKTRYST--RWSHLLTHISTHFQDRVEGFLTRYIFQATIYHVWRER 822 Query: 1764 NMR 1772 N R Sbjct: 823 NGR 825 >gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus] Length = 1214 Score = 342 bits (878), Expect = 6e-91 Identities = 199/627 (31%), Positives = 309/627 (49%), Gaps = 14/627 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL +G + N + +LK+D+ KAFDS+ W FI +K PPRF W+ +CI +TSFS Sbjct: 573 ELVQGFGQANISSRGVLKVDLRKAFDSVGWGFIIETLKAANAPPRFVNWIKQCITSTSFS 632 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 ++GS CG+F G +GLRQGDPLSP LF I +++LS L+ K K S + +S Sbjct: 633 INVSGSLCGYFKGSKGLRQGDPLSPSLFVIAMEILSRLLENKFSDGSIGYHPKASEVRIS 692 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAG-SNSGNDLSRIL 542 LAFADD++IF AS+ + ++L F+ SGL +N +KS ++ AG ++ + + Sbjct: 693 SLAFADDLMIFYDGKASSLRGIKSVLESFKNLSGLEMNTEKSAVYTAGLEDTDKEDTLAF 752 Query: 543 QVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQS 722 G P ++LGLP++ KL +D + L+ A + W K LS+AGRL+L+ +V+ S Sbjct: 753 GFVNGTFPFRYLGLPLLHRKLRRSDYSQLIDKIAARFNHWATKTLSFAGRLQLISSVIYS 812 Query: 723 SYIFWTGAFPIPYSVCSKLESLMGSFLRGKSKLR----LISWATICRPLEEGGLGIRRIK 890 + FW +F +P +E + FL G R +SW C P EGGLG+R Sbjct: 813 TVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPKAEGGLGLRNFW 872 Query: 891 DMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFA 1070 NK +L+W +++ + SLWV W H+ LR+ + W A + SW+++ IL +R Sbjct: 873 TWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIWKAILGLRPLAK 932 Query: 1071 NHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPS 1250 VGNG +W W G L++ G + T I A + E + W P + Sbjct: 933 RFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEASSSTGWILPSA 992 Query: 1251 ---SSPMVRTAWRQFQQIPKLGCDEEDQFVW--SPCPSGLFSVASAWEQIRHHYDVWEWT 1415 ++ + G ED + W S FS WE +R W Sbjct: 993 RTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWECLRQRDTTKLWA 1052 Query: 1416 ELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGAQ--SHCELCWAGVESEDHLFFECPF 1589 VW+ IPK +F W L++LP + + T + S C +C E+ DHLF C Sbjct: 1053 AAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETETRDHLFIHCTL 1112 Query: 1590 SSEVWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRA--DGVIRKLCYTVTVHFIWWER 1763 S +W ++ + R+ F+E + I+ N+ G ++KL + IW ER Sbjct: 1113 GSLIWQQVLARFGRSQM-----FREWKDIIEWMLSNQGSFSGTLKKLAVQTAIFHIWKER 1167 Query: 1764 NMRLFNKGWRSATRLAEEIIQLVHQKV 1844 N RL + S T + ++I + + + Sbjct: 1168 NSRLHSAMSASHTAIFKQIDRSIRDSI 1194 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 342 bits (876), Expect = 1e-90 Identities = 203/621 (32%), Positives = 313/621 (50%), Gaps = 23/621 (3%) Frame = +3 Query: 54 LKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFSPLMNGSTCGFFFGKRG 233 +K+DI KAFDS+ W F+ A+ M FP F W+ CI TTSFS +NG G+F RG Sbjct: 1 MKIDISKAFDSLQWSFLINALSAMNFPGEFIHWISRCITTTSFSVQVNGELAGYFRSARG 60 Query: 234 LRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLSHLAFADDVIIFLKPSA 413 +RQG LSPYLF I +++LS L K F KC +L L+HL FADD++I Sbjct: 61 IRQGCALSPYLFVISMEVLSKMLDQAAGGKRFGFHPKCKNLGLTHLCFADDLMILTDGKV 120 Query: 414 STANQLCNILMEFEGWSGLRLNRDKSTIFVAGSNSGNDLSRILQVK--LGQLPVKHLGLP 587 + + + ++ F SGL++N +K+T++ AG + N I + LGQLPV++LGLP Sbjct: 121 RSVDGIVEVMNLFAKRSGLQINMEKTTLYTAGVSDHNRYMMISRYPFGLGQLPVRYLGLP 180 Query: 588 VIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQSSYIFWTGAFPIPYSV 767 ++ +L+ D +PL ++ W ++ LS+AGRL L+ +VL S+ FW AF +P + Sbjct: 181 LVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSAC 240 Query: 768 CSKLESLMGSFLRGKSKLR----LISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIY 935 ++ S+ +FL +L +SW IC+P +EGGLG+R + + N + KL+W + Sbjct: 241 LKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVT 300 Query: 936 SSKKSLWVQWIHSRFLRNNSIWTATIPNDV--SWVYRRILKIRNQFANHCFNLVGNGDAT 1109 S+ SLWV+W L+ S W+ T PN SW+++++LK R V NG T Sbjct: 301 SNDDSLWVKWSKMNLLKQESFWSLT-PNSSLGSWMWKKMLKYRETAKPFSRVEVNNGART 359 Query: 1110 KFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPM------VRT 1271 FW W G L+D G+ ++ I R T+ E W++ + Sbjct: 360 SFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAEA-----WSNRRRRKHRTEQLNDIEA 414 Query: 1272 AWRQFQQIPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKI 1442 A Q Q L ED +W FS W Q+R + W + VWF Sbjct: 415 ALNQKYQTRNL--LREDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHST 472 Query: 1443 PKCSFTCWRMLLSKLPT--KDKLTRFGAQSHCELCWAGVESEDHLFFECPFSSEVWMRIK 1616 PK F W L ++L T + +L G+ C C +E+ DHLFF C ++S +W I Sbjct: 473 PKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSCSYASAIWTAIA 532 Query: 1617 VKCWRNVQVVRGRFQ-ESQTILRSFGLNRADGV---IRKLCYTVTVHFIWWERNMRLFNK 1784 V++ RF + QTI+ + D + + + + +TVH +W ERN R + Sbjct: 533 ------KNVLQHRFSTDWQTIVNYISETQTDRIRSFLSRYIFQLTVHTVWKERNDRRHGE 586 Query: 1785 GWRSATRLAEEIIQLVHQKVS 1847 R++ L + + + ++S Sbjct: 587 EPRTSANLISWMDKQIRNQLS 607 >gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1352 Score = 341 bits (875), Expect = 1e-90 Identities = 211/627 (33%), Positives = 321/627 (51%), Gaps = 24/627 (3%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL + + + + + LK+DI KAFD + W F+ +K + P F W+ CI T SFS Sbjct: 727 ELVKDYHKESISSRSALKIDISKAFDFVQWPFLINVLKAIHLPEMFIHWIELCIGTASFS 786 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG GFF +RGLRQG LSPYL+ I + +LS L K+ +C ++NL+ Sbjct: 787 VQVNGELSGFFRSERGLRQGCSLSPYLYVICMNVLSCMLDKAAVEKKISYHPRCRNMNLT 846 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSNSGNDLSRILQ 545 HL FADD+++F ++ + I +F S L+++ +KSTIF+AG S N + ILQ Sbjct: 847 HLCFADDIMVFSDGTSKSIQGTLAIFEKFAAMSWLKISLEKSTIFMAGI-SPNAKTSILQ 905 Query: 546 ---VKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVL 716 +LG LPVK+LGLP++ +++ +D PLV ++ W + LS+AGRL+L+K+VL Sbjct: 906 QFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGRLQLIKSVL 965 Query: 717 QSSYIFWTGAFPIPYSVCSKLESLMGSFLRG----KSKLRLISWATICRPLEEGGLGIRR 884 S FW F +P + ++E + +FL +K I+W+ +C+ EEGGLG++ Sbjct: 966 SSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKEEGGLGLKP 1025 Query: 885 IKDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRN 1061 +K+ N+ L KL+W I S++ SLWV+W++ +R + W+ + SW++R+ILK R+ Sbjct: 1026 LKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSVKENTGLGSWLWRKILKQRD 1085 Query: 1062 QFANHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV-------- 1217 + V +G T FW W P G L G + IP AT+ EV Sbjct: 1086 KARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAEVMNTHRRKR 1145 Query: 1218 REAGYWNSPPSSSPMVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGLFSVASAWEQIRHHY 1397 A + N S + R R L +ED F S FS + W+QIR Sbjct: 1146 HRADFLNQIKSQIELARQD-RSTDGDRSLWKQKEDTFKSS------FSSSKTWQQIRSIS 1198 Query: 1398 DVWEWTELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSHCELCWAGVESEDHL 1571 +W VWF PK SF W ++L T DK+ ++ GA+ C C +E+ DHL Sbjct: 1199 LRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGEELETRDHL 1258 Query: 1572 FFECPFSSEVWMRIK--VKCWRNV----QVVRGRFQESQTILRSFGLNRADGVIRKLCYT 1733 FF CP+SS VW + + RN+ + S+ L F L A + Sbjct: 1259 FFSCPYSSHVWFSLTKGLLNGRNILNWNLITPHLLDSSRPYLHVFTLRYA--------FQ 1310 Query: 1734 VTVHFIWWERNMRLFNKGWRSATRLAE 1814 ++H +W ERN R + A +LA+ Sbjct: 1311 ASIHSLWRERNCRRHGETAIPAAKLAK 1337 >gb|AAC28221.1| similar to reverse transcriptases (PFam: rvt.hmm, score: 60.13) [Arabidopsis thaliana] Length = 1164 Score = 340 bits (871), Expect = 4e-90 Identities = 188/543 (34%), Positives = 283/543 (52%), Gaps = 13/543 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL G ++N AP +MLK+D+ KAFDS+ W FI A++ + P +F W++EC+ T SFS Sbjct: 471 ELVHGYNKKNIAPSSMLKVDLRKAFDSVRWDFIVSALRALNVPEKFTCWILECLSTASFS 530 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 ++NG + G F+ +GLRQGDP+SPYLF + +++ S LQ + S K S L +S Sbjct: 531 VILNGHSAGHFWSSKGLRQGDPMSPYLFVLAMEVFSGLLQSRYTSGYIAYHPKTSQLEIS 590 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAG-SNSGNDLSRIL 542 HL FADDV+IF +S+ + + L +F GWSGL +N +K+ ++ AG S S +D Sbjct: 591 HLMFADDVMIFFDGKSSSLHGIVESLEDFAGWSGLLMNTNKTQLYHAGLSQSESDSMASY 650 Query: 543 QVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQS 722 KLG LPV++LGLP++ KL+IA+ APL+ + W ++LS+AGR++L+ +V+ Sbjct: 651 GFKLGSLPVRYLGLPLMSRKLTIAEYAPLIEKITARFNSWVVRLLSFAGRVQLLASVISG 710 Query: 723 SYIFWTGAFPIPYSVCSKLESLMGSFLRG----KSKLRLISWATICRPLEEGGLGIRRIK 890 FW +F +P K+ESL FL K + ++W+ +C P EGG+G+RR Sbjct: 711 IVNFWISSFILPLGCIKKIESLCSRFLWSSRIDKKGIAKVAWSQVCLPKAEGGIGLRRFA 770 Query: 891 DMNKAGLCKLLWWIYSSKKSLWVQWIHSRFL-RNNSIWTATIPNDVSWVYRRILKIRNQF 1067 N+ +++W ++S+ SLWV W L ++ S W SW ++ +L++R Sbjct: 771 VSNRTLYLRMIWLLFSNSGSLWVAWHKQHSLGKSTSFWNQPEKPHDSWNWKCLLRLRVVA 830 Query: 1068 ANHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPP 1247 VGNG FW W P G L+ G R+ A I +V + W+ Sbjct: 831 ERFIRCNVGNGRDASFWFDNWTPFGPLIKFLGNEGPRDLRVHLNAKISDVCTSEGWSIAD 890 Query: 1248 SSSPMVRTAWRQFQQIP-KLGCDEEDQFVW----SPCPSGLFSVASAWEQIRHHYDVWEW 1412 S + I + D + W C FS A+ W +R W Sbjct: 891 PRSDQALSLHTHLTNISMPSDAQDLDSYDWVVDNKVCQG--FSAAATWSALRPSSAPVPW 948 Query: 1413 TELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGAQ--SHCELCWAGVESEDHLFFECP 1586 VWF PK +F W L +LPTK +L +G Q + C LC E+ DHLF C Sbjct: 949 ARAVWFKGATPKHAFHLWTAHLDRLPTKVRLASWGMQIDTTCGLCSLHPETRDHLFLSCD 1008 Query: 1587 FSS 1595 F++ Sbjct: 1009 FAN 1011 >gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1529 Score = 330 bits (846), Expect = 3e-87 Identities = 197/628 (31%), Positives = 319/628 (50%), Gaps = 13/628 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL + + + P +K+DI KAFDS+ W+F+ ++ + FP F W+ CI T +FS Sbjct: 877 ELVKDYHKESVTPRCAMKIDISKAFDSVQWQFLLNTLEALNFPETFRHWIKLCISTATFS 936 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG GFF RGLRQG LSPYLF I + +LS + + KC + L+ Sbjct: 937 VQVNGELAGFFGSSRGLRQGCALSPYLFVICMNVLSHMIDEAAVHRNIGYHPKCEKIGLT 996 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSNSGNDLSRILQ 545 HL FADD+++F+ + + N+ EF G SGL+++ +KSTI++AG ++ + + + Sbjct: 997 HLCFADDLMVFVDGHQWSIEGVINVFKEFAGRSGLQISLEKSTIYLAGVSASDRVQTLSS 1056 Query: 546 VKL--GQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQ 719 GQLPV++LGLP++ +++ AD +PL+ K+ W A+ LSYAGRL L+ +V+ Sbjct: 1057 FPFANGQLPVRYLGLPLLTKQMTTADYSPLIEAVKTKISSWTARSLSYAGRLALLNSVIV 1116 Query: 720 SSYIFWTGAFPIPYSVCSKLESLMGSFLRG----KSKLRLISWATICRPLEEGGLGIRRI 887 S FW A+ +P ++E L +FL K I+W++IC+P +EGGLGI+ + Sbjct: 1117 SIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPKKEGGLGIKSL 1176 Query: 888 KDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQ 1064 + NK KL+W + S++ SLWV WI + +R + W+A + + SW+++++LK R Sbjct: 1177 AEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWMWKKLLKYREL 1236 Query: 1065 FANHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEV-REAGYWNS 1241 + V NG +T FW W G LLD G + IP ++ V R + Sbjct: 1237 AKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLETVLRTHQHRQH 1296 Query: 1242 PPSSSPMVRTAWRQFQQIPKLGCDEEDQFVWSPCPSGL---FSVASAWEQIRHHYDVWEW 1412 + + ++ QQ + D +W + F W +R H W Sbjct: 1297 RAAIYNRINAEIQRLQQQEREA--GPDISLWRSLKNDFNKRFITKVTWNNVRTHQPQQNW 1354 Query: 1413 TELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRF--GAQSHCELCWAGVESEDHLFFECP 1586 + VWF PK SF W + ++L T D++ + G C LC E+ DHLFF C Sbjct: 1355 YKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCNNAEETRDHLFFSCQ 1414 Query: 1587 FSSEVWMRIKVKCWRNVQVVRGRFQESQTILRSFGLNRADGVIRKLCYTVTVHFIWWERN 1766 ++S VW + + + R + T+L + L R + + + +++ IW ERN Sbjct: 1415 YTSYVWEALTQRL-LSTNYSRD-WNRLFTLLCTSNLPRDHLFLFRYVFQASIYHIWRERN 1472 Query: 1767 MRLFNKGWRSATRLAEEIIQLVHQKVST 1850 R + RL + I + V ++S+ Sbjct: 1473 ARRHGEISSPTNRLIKLIDKTVRNRISS 1500 >emb|CAN77580.1| hypothetical protein VITISV_015346 [Vitis vinifera] Length = 347 Score = 322 bits (826), Expect = 7e-85 Identities = 158/201 (78%), Positives = 182/201 (90%), Gaps = 1/201 (0%) Frame = -1 Query: 3322 LPPPYDPFSKKPVIEEPKDRKNLQEIFHKMRTEGLINNAIKMFDALSKDGLTHEAMELFA 3143 LPPPYDPFSKKP IEEPKD K+LQ+IFHKMRTEGL+ NA+KMFDALSKDGLTHEAMELFA Sbjct: 147 LPPPYDPFSKKPAIEEPKDPKDLQDIFHKMRTEGLVPNAVKMFDALSKDGLTHEAMELFA 206 Query: 3142 QIKDKGNMPDVVAHTAVIEAYANAGQSKEALKVYMRMLASGVKPNAYTYAVLIIGLARDG 2963 QIKD G+MPDVVAHTAVIEAYANAGQSKEA+KVYMRML SGV PNAYTY+VLI GLA D Sbjct: 207 QIKDHGHMPDVVAHTAVIEAYANAGQSKEAVKVYMRMLTSGVMPNAYTYSVLIKGLAGDA 266 Query: 2962 KLGDAQKYLLEMMGKGIRPNAATYTIVFEAFAREDKMEQGRQLLEKLKAKGFVPDEKAVT 2783 KLG+A+KY+LEMMGKG+RPNA TYT +FE FA+E K+E+GR+ LE++KAKGF PDEKAV Sbjct: 267 KLGEAKKYVLEMMGKGMRPNAGTYTALFEGFAKEQKVEEGREFLEQMKAKGFTPDEKAVR 326 Query: 2782 DHL-SKRGQVFRSVMNLLFHK 2723 + L ++RGQVFRS+M++LF K Sbjct: 327 EILKNRRGQVFRSIMDILFGK 347 >ref|XP_002278276.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like [Vitis vinifera] Length = 307 Score = 318 bits (816), Expect = 1e-83 Identities = 157/201 (78%), Positives = 181/201 (90%), Gaps = 1/201 (0%) Frame = -1 Query: 3322 LPPPYDPFSKKPVIEEPKDRKNLQEIFHKMRTEGLINNAIKMFDALSKDGLTHEAMELFA 3143 LPPPYDPFSKK IEEPKD K+LQ+IFHKMRTEGL+ NA+KMFDALSKDGLTHEAMELFA Sbjct: 107 LPPPYDPFSKKLAIEEPKDPKDLQDIFHKMRTEGLVPNAVKMFDALSKDGLTHEAMELFA 166 Query: 3142 QIKDKGNMPDVVAHTAVIEAYANAGQSKEALKVYMRMLASGVKPNAYTYAVLIIGLARDG 2963 QIKD G+MPDVVAHTAVIEAYANAGQSKEA+KVYMRML SGV PNAYTY+VLI GLA D Sbjct: 167 QIKDHGHMPDVVAHTAVIEAYANAGQSKEAVKVYMRMLTSGVMPNAYTYSVLIKGLAGDA 226 Query: 2962 KLGDAQKYLLEMMGKGIRPNAATYTIVFEAFAREDKMEQGRQLLEKLKAKGFVPDEKAVT 2783 KLG+A+KY+LEMMGKG+RPNA TYT +FE FA+E K+E+GR+ LE++KAKGF PDEKAV Sbjct: 227 KLGEAKKYVLEMMGKGMRPNAGTYTALFEGFAKEQKVEEGREFLEQMKAKGFTPDEKAVR 286 Query: 2782 DHL-SKRGQVFRSVMNLLFHK 2723 + L ++RGQVFRS+M++LF K Sbjct: 287 EILKNRRGQVFRSIMDILFGK 307 >ref|XP_006841544.1| hypothetical protein AMTR_s00003p00166290 [Amborella trichopoda] gi|548843565|gb|ERN03219.1| hypothetical protein AMTR_s00003p00166290 [Amborella trichopoda] Length = 323 Score = 309 bits (791), Expect = 8e-81 Identities = 154/200 (77%), Positives = 169/200 (84%) Frame = -1 Query: 3322 LPPPYDPFSKKPVIEEPKDRKNLQEIFHKMRTEGLINNAIKMFDALSKDGLTHEAMELFA 3143 LPPPYDPFSKKPV+EEP+D KNLQEIF+KM++EGLI NAIKMFDALSKDGLTHEAMELF Sbjct: 124 LPPPYDPFSKKPVVEEPEDPKNLQEIFYKMKSEGLIPNAIKMFDALSKDGLTHEAMELFG 183 Query: 3142 QIKDKGNMPDVVAHTAVIEAYANAGQSKEALKVYMRMLASGVKPNAYTYAVLIIGLARDG 2963 IKDKG MPD+VAHTAVIEAYANAGQSKEA+KVY RMLASGVKPNAYTY VLI GL+RDG Sbjct: 184 VIKDKGQMPDIVAHTAVIEAYANAGQSKEAMKVYTRMLASGVKPNAYTYTVLIKGLSRDG 243 Query: 2962 KLGDAQKYLLEMMGKGIRPNAATYTIVFEAFAREDKMEQGRQLLEKLKAKGFVPDEKAVT 2783 K +A K LLEMM KGI+PNA TYT FE +E K E+ LEK+KA+GF PDEKA Sbjct: 244 KFSEANKILLEMMDKGIKPNAGTYTTFFECLCKEGKTEEAGVYLEKMKARGFAPDEKATR 303 Query: 2782 DHLSKRGQVFRSVMNLLFHK 2723 + LSKRG VFRSVMNLLF K Sbjct: 304 EILSKRGHVFRSVMNLLFGK 323 >gb|AAF99785.1|AC012463_2 T2E6.4 [Arabidopsis thaliana] Length = 740 Score = 307 bits (787), Expect = 2e-80 Identities = 168/516 (32%), Positives = 272/516 (52%), Gaps = 11/516 (2%) Frame = +3 Query: 6 ELARGITRRNHAPFAMLKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFS 185 EL + + + +P +K+DI KAFDS+ W+F+ ++ + FP FC W+ CI T +FS Sbjct: 153 ELVKDYHKDSISPRCAMKIDISKAFDSVQWQFLLNTLEALNFPENFCHWIKLCISTATFS 212 Query: 186 PLMNGSTCGFFFGKRGLRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLS 365 +NG GFF KRGLRQG LSPYLF I + +LS + + KC L+L+ Sbjct: 213 VQVNGELAGFFGSKRGLRQGCALSPYLFVICMNVLSHMIDVAAVHRNIGYHPKCKKLSLT 272 Query: 366 HLAFADDVIIFLKPSASTANQLCNILMEFEGWSGLRLNRDKSTIFVAGSN--SGNDLSRI 539 HL FADD+++F+ + + NI EF G SGL ++ +KST+++AG + + N++ Sbjct: 273 HLCFADDLMVFIDGQQRSVEGVINIFKEFAGKSGLHISLEKSTLYLAGVSELNRNNILSA 332 Query: 540 LQVKLGQLPVKHLGLPVIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQ 719 GQLPV++LGLP++ +++ AD +PL+ K+ W A+ LSYAGRL L+ +V+ Sbjct: 333 FPFASGQLPVRYLGLPLLTKQMTTADYSPLLDKVRSKISSWTARSLSYAGRLALINSVIV 392 Query: 720 SSYIFWTGAFPIPYSVCSKLESLMGSFLRGKSKLR----LISWATICRPLEEGGLGIRRI 887 S FW A+ +P ++E L +FL +L I+W ++C+ +EGGLGI+ + Sbjct: 393 SLSNFWMSAYRLPAGCIKEIEKLCSAFLWSGPELNPKKAKITWTSLCKLKQEGGLGIKSL 452 Query: 888 KDMNKAGLCKLLWWIYSSKKSLWVQWIHSRFLRNNSIWTATIPNDV-SWVYRRILKIRNQ 1064 + NK KL+W + S + SLWV W+ + +R S W+A + + SW+++++LK R+ Sbjct: 453 LEANKVSCLKLIWRLVSRQSSLWVNWVWTYIIRKGSFWSANDRSSLGSWMWKKLLKYRDV 512 Query: 1065 FANHCFNLVGNGDATKFWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSP 1244 + C + +G +T FW W G L+D + IP AT+ V + + Sbjct: 513 AKSMCKVEIKSGSSTSFWYDNWSQLGQLVDVTNARRTIDMGIPLAATVATVLAS--HRTK 570 Query: 1245 PSSSPMVRTAWRQFQQI-PKLGCDEEDQFVWSPCPSGL---FSVASAWEQIRHHYDVWEW 1412 + + + Q I + D F+W F W IR + +W Sbjct: 571 HHRTAIYNKIEAEIQSILQRERSGAPDIFLWRSSGDNFRQSFITKVTWHNIRVIHTHRQW 630 Query: 1413 TELVWFYDKIPKCSFTCWRMLLSKLPTKDKLTRFGA 1520 + VWF PK SF W + +L T D++ ++ + Sbjct: 631 YKGVWFSYNTPKYSFLLWLAIHDRLSTGDRIKKWNS 666 >ref|NP_001154694.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|332003244|gb|AED90627.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 363 Score = 306 bits (784), Expect = 5e-80 Identities = 154/201 (76%), Positives = 172/201 (85%), Gaps = 2/201 (0%) Frame = -1 Query: 3319 PPPYDPFSKKPVIEEPKDRKNLQEIFHKMRTEGLINNAIKMFDALSKDGLTHEAMELFAQ 3140 PPPYDPFSKKP IEEP+D KNLQEIFHKMRTEG N A+KMFDALSKDG THEA+ELF+Q Sbjct: 163 PPPYDPFSKKPAIEEPEDPKNLQEIFHKMRTEGFTNEAVKMFDALSKDGRTHEALELFSQ 222 Query: 3139 IKDKGNMPDVVAHTAVIEAYANAGQSKEALKVYMRMLASGVKPNAYTYAVLIIGLARDGK 2960 IKDK MPDVVAHTA++EAYANAGQ+KE LKV+MRMLASGV PNAYTY+VLI GLA DGK Sbjct: 223 IKDKNRMPDVVAHTAIVEAYANAGQAKETLKVFMRMLASGVSPNAYTYSVLIKGLAADGK 282 Query: 2959 L-GDAQKYLLEMMGKGIRPNAATYTIVFEAFAREDKMEQGRQLLEKLKAKGFVPDEKAVT 2783 DA+KYLLEMMG G+ PNAATYT VFEAF RE K E R+LL+++K KGFVPDEKAV Sbjct: 283 THKDAKKYLLEMMGNGMSPNAATYTAVFEAFVREGKEESARELLQEMKGKGFVPDEKAVR 342 Query: 2782 DHLS-KRGQVFRSVMNLLFHK 2723 + L KRGQVFR+V+NLLF K Sbjct: 343 EALEYKRGQVFRTVINLLFDK 363 >emb|CAB83319.1| putative protein [Arabidopsis thaliana] Length = 482 Score = 306 bits (784), Expect = 5e-80 Identities = 154/201 (76%), Positives = 172/201 (85%), Gaps = 2/201 (0%) Frame = -1 Query: 3319 PPPYDPFSKKPVIEEPKDRKNLQEIFHKMRTEGLINNAIKMFDALSKDGLTHEAMELFAQ 3140 PPPYDPFSKKP IEEP+D KNLQEIFHKMRTEG N A+KMFDALSKDG THEA+ELF+Q Sbjct: 144 PPPYDPFSKKPAIEEPEDPKNLQEIFHKMRTEGFTNEAVKMFDALSKDGRTHEALELFSQ 203 Query: 3139 IKDKGNMPDVVAHTAVIEAYANAGQSKEALKVYMRMLASGVKPNAYTYAVLIIGLARDGK 2960 IKDK MPDVVAHTA++EAYANAGQ+KE LKV+MRMLASGV PNAYTY+VLI GLA DGK Sbjct: 204 IKDKNRMPDVVAHTAIVEAYANAGQAKETLKVFMRMLASGVSPNAYTYSVLIKGLAADGK 263 Query: 2959 L-GDAQKYLLEMMGKGIRPNAATYTIVFEAFAREDKMEQGRQLLEKLKAKGFVPDEKAVT 2783 DA+KYLLEMMG G+ PNAATYT VFEAF RE K E R+LL+++K KGFVPDEKAV Sbjct: 264 THKDAKKYLLEMMGNGMSPNAATYTAVFEAFVREGKEESARELLQEMKGKGFVPDEKAVR 323 Query: 2782 DHLS-KRGQVFRSVMNLLFHK 2723 + L KRGQVFR+V+NLLF K Sbjct: 324 EALEYKRGQVFRTVINLLFDK 344 >emb|CAB72467.1| putative protein [Arabidopsis thaliana] Length = 762 Score = 305 bits (781), Expect = 1e-79 Identities = 174/520 (33%), Positives = 257/520 (49%), Gaps = 12/520 (2%) Frame = +3 Query: 54 LKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFSPLMNGSTCGFFFGKRG 233 +K+DI KA DS+ W F+ + M FP F W+ CI T SFS +NG GFF RG Sbjct: 149 IKIDISKASDSVQWSFLINTLTAMHFPEMFIHWIRLCITTPSFSVQVNGELAGFFQSSRG 208 Query: 234 LRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLSHLAFADDVIIFLKPSA 413 LRQG LSPYLF I + +LS L V C + L+HL+FADD++I Sbjct: 209 LRQGCALSPYLFVICMDVLSKLLDKVVGIGRIGYHPHCKRMGLTHLSFADDLMILTDGQC 268 Query: 414 STANQLCNILMEFEGWSGLRLNRDKSTIFVAG--SNSGNDLSRILQVKLGQLPVKHLGLP 587 + + + F WSGL+++ +KSTIF AG S S L ++G+LP+++LGLP Sbjct: 269 RSIEGIIEVFDLFSKWSGLKISMEKSTIFSAGLSSTSRAQLHTHFPFEVGELPIRYLGLP 328 Query: 588 VIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQSSYIFWTGAFPIPYSV 767 ++ +LS D APL+ +++ W ++ LS+AGR L+ +++ SS FW AF +P + Sbjct: 329 LVTKRLSSVDYAPLIEQIRKRIGSWSSRFLSFAGRFNLISSIIWSSCNFWLSAFQLPRAC 388 Query: 768 CSKLESLMGSFLRG----KSKLRLISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIY 935 ++E L SFL SK ISW +C+P EGGLG+R +K+ N KL+W I Sbjct: 389 IQEIEKLCSSFLWSGTNLNSKKAKISWNQVCKPKSEGGLGLRSLKEANDVCCLKLVWRII 448 Query: 936 SSKKSLWVQWIHSRFLRNNSIW-TATIPNDVSWVYRRILKIRNQFANHCFNLVGNGDATK 1112 S SLWV+W+ L+ W N SW++++ILK R C VGNG++T Sbjct: 449 SHGDSLWVKWVEHNLLKREIFWIVKENANLGSWIWKKILKYRGVAKRFCKAEVGNGESTS 508 Query: 1113 FWLHRWHPQGMLLDTFGENCRLFTRIPRRATIKEVREAGYWNSPPSSSPMVRTAWRQFQQ 1292 FW W G L+D G + I R ++ + + Q Sbjct: 509 FWFDDWSLLGRLIDVAGIRGTIDMGISRTMSVADAWTSRRRRHHRQEILNTIEEVLSTQH 568 Query: 1293 IPKLGCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPKCSFTC 1463 + ++ + +W + FS + W +R + W + VWF PK SF Sbjct: 569 QKRTQQQQQGRVLWKGKNDIYKDKFSTKNTWNYLRTTSNEVAWHKGVWFPHATPKYSFCL 628 Query: 1464 WRMLLSKLPTKDKLTRF--GAQSHCELCWAGVESEDHLFF 1577 W +L T ++ ++ G C C G+E+ DHLFF Sbjct: 629 WLAAHDRLATGARMIKWNRGETGDCTFCRQGIETRDHLFF 668 >gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490 [Arabidopsis thaliana] Length = 657 Score = 303 bits (777), Expect = 3e-79 Identities = 169/531 (31%), Positives = 273/531 (51%), Gaps = 14/531 (2%) Frame = +3 Query: 54 LKLDIHKAFDSIDWRFIEWAMKRMEFPPRFCRWVMECIKTTSFSPLMNGSTCGFFFGKRG 233 L++D+ KA+D+++W F+ +K + PP F W+ CI T S+S NG GFF GK+G Sbjct: 137 LQVDLTKAYDNVNWEFLINILKALNLPPIFINWIWVCISTPSYSIAYNGELIGFFVGKKG 196 Query: 234 LRQGDPLSPYLFSIGLQMLSSYLQWKVFSKEFEIPSKCSSLNLSHLAFADDVIIFLKPSA 413 +RQGDP+S +LF + + +L+ L F + KC + ++HL+FADD+++F S Sbjct: 197 IRQGDPMSSHLFVLVMDILARSLDLGAVEGRFVLHPKCLAPMITHLSFADDILVFCDGSL 256 Query: 414 STANQLCNILMEFEGWSGLRLNRDKSTIFVAGSNSGND--LSRILQVKLGQLPVKHLGLP 587 S+ + +IL F+ SGL +N K+ + + G N + ++ L V G LPV++LG+P Sbjct: 257 SSLVAILDILDVFKKGSGLGINLQKTALLLDGGNFERNRIMAASLGVSQGSLPVRYLGVP 316 Query: 588 VIPGKLSIADCAPLVGMFARKLEGWQAKILSYAGRLELVKAVLQSSYIFWTGAFPIPYSV 767 ++ K+ D PLV + W A+ LS+AGRL+L+K+V+ S+ FW F +P Sbjct: 317 LMSQKMKKHDYQPLVDRINSRFTSWTARHLSFAGRLQLLKSVIYSTINFWASIFILPNQC 376 Query: 768 CSKLESLMGSFL----RGKSKLRLISWATICRPLEEGGLGIRRIKDMNKAGLCKLLWWIY 935 KLE + +FL ++ ISW +C E GGLG++R+ NK KL+W ++ Sbjct: 377 LHKLEQMCNAFLWSGAPNSAREAKISWDIVCSSKESGGLGLKRLSSWNKVLALKLIWLLF 436 Query: 936 SSKKSLWVQWIHSRFLRNNSIWTATIPNDVSWVYRRILKIRNQFANHCFNLVGNGDATKF 1115 ++ SLWV W V WV+R++ K+R VG+G +F Sbjct: 437 TASGSLWVSW-------------------VRWVWRKLCKLREVARPFVICEVGSGITARF 477 Query: 1116 WLHRWHPQGMLLDTFGENCRLFTRIPRRATIKE-VREAGYW-NSPPSSSPMVRTAWRQFQ 1289 W W G L+ G + + +++ +R +W S S +P++ Sbjct: 478 WQDNWTGHGPLIHLTGLTGPQLVGLSITSVVRDAIRNDDWWIASSRSRNPVILLLKSLLP 537 Query: 1290 QIPKL-GCDEEDQFVW---SPCPSGLFSVASAWEQIRHHYDVWEWTELVWFYDKIPKCSF 1457 + L C+ +D ++W PS FS A W ++ W + VWF +++PK +F Sbjct: 538 PVGNLVDCEHDDSYLWKVGDRVPSSKFSTADTWRALQPFSVSVSWHKAVWFTNQVPKHAF 597 Query: 1458 TCWRMLLSKLPTKDKLTRFG--AQSHCELCWAGVESEDHLFFECPFSSEVW 1604 W ++L T+D+L +G + C LC E+ DHLFF C FSS +W Sbjct: 598 ISWVTAWNRLHTRDRLRSWGLIVPAECVLCNLVDETRDHLFFACRFSSRIW 648 >ref|XP_006289785.1| hypothetical protein CARUB_v10003387mg [Capsella rubella] gi|482558491|gb|EOA22683.1| hypothetical protein CARUB_v10003387mg [Capsella rubella] Length = 362 Score = 302 bits (774), Expect = 7e-79 Identities = 160/239 (66%), Positives = 184/239 (76%), Gaps = 3/239 (1%) Frame = -1 Query: 3430 RFFSSEVDTKNGKT-FAFXXXXXXXXXXXXXXXXXXKLPPPYDPFSKKPVIEEPKDRKNL 3254 R FSSE N K F+ +LPPPYDPFSKKP IEEP+D KNL Sbjct: 124 RQFSSETKRVNTKVNFSLSDDDSDEETPVIEDSGKPELPPPYDPFSKKPAIEEPEDAKNL 183 Query: 3253 QEIFHKMRTEGLINNAIKMFDALSKDGLTHEAMELFAQIKDKGNMPDVVAHTAVIEAYAN 3074 Q IFHKMRTEG N A+KMFDALSKDG THEA+ELF+QIKDK MP+VVAHTA++EAYAN Sbjct: 184 QAIFHKMRTEGFTNEAVKMFDALSKDGRTHEALELFSQIKDKNQMPEVVAHTAIVEAYAN 243 Query: 3073 AGQSKEALKVYMRMLASGVKPNAYTYAVLIIGLARDGK-LGDAQKYLLEMMGKGIRPNAA 2897 AGQ+KEALKV+MRML+ GV PNAYTY VLI GLA DG+ L DA+KYLLEMMG GI PNAA Sbjct: 244 AGQAKEALKVFMRMLSCGVLPNAYTYTVLIKGLAADGRTLKDAKKYLLEMMGIGISPNAA 303 Query: 2896 TYTIVFEAFAREDKMEQGRQLLEKLKAKGFVPDEKAVTDHL-SKRGQVFRSVMNLLFHK 2723 TYT VFEAF +E+K + RQLL+++K KGFVPDEKAV + L SKRG VFR+V+NLLF+K Sbjct: 304 TYTPVFEAFVKEEKEDSARQLLQEMKGKGFVPDEKAVREALQSKRGPVFRTVINLLFNK 362