BLASTX nr result
ID: Catharanthus22_contig00021823
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00021823
         (1144 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                       (bits)  Value

dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]            102   3e-28
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]            101   3e-28
dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]            101   4e-28
emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...   69   7e-25
gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas...   54   2e-22
gb|AAD22368.1| putative non-LTR retroelement reverse transcripta...   72   4e-21
gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana]              68   1e-20
sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H pr...   68   1e-20
gb|AAC26674.1| putative non-LTR retroelement reverse transcripta...   62   7e-20
emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...   72   7e-18
gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...   67   3e-17
gb|EOY19161.1| Polynucleotidyl transferase, putative [Theobroma ...   71   5e-17
emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|73210...   49   7e-17
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   90   5e-16
gb|EMJ08167.1| hypothetical protein PRUPE_ppb019197mg [Prunus pe...   75   7e-16
ref|XP_002309989.1| predicted protein [Populus trichocarpa]           86   3e-15
gb|EOY02505.1| RNA-binding (RRM/RBD/RNP motifs) family protein i...   88   4e-15
ref|XP_002314708.1| predicted protein [Populus trichocarpa]           84   1e-14
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   84   7e-14
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   84   8e-14

>dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]
Length = 1366

Score = 102 bits (253), Expect(3) = 3e-28
Identities = 50/111 (45%), Positives = 77/111 (69%), Gaps = 1/111 (0%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A AGGL+R+E G V G+ +IGT + +AELW +R+GL+LA+++ + ++ E D+E VV
Sbjct: 1223 ASAGGLLRNENGLWVAGYICNIGTANSFLAELWGLREGLLLAKNRGFTKLIAETDSEAVV 1282

Query: 265 YLLQQEPVPLHPSFNILC-DCRSLLSIFQSVSIRHVHRVGNKCADFLANQG 116
+L+++ P+ P +IL DC+ LL FQ + + H+ R GN+CADFLAN G
Sbjct: 1283 QVLRKDG-PVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCADFLANLG 1332

Score = 43.9 bits (102), Expect(3) = 3e-28
Identities = 30/116 (25%), Positives = 46/116 (39%), Gaps = 2/116 (1%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
F+W + + D+A C C E+ TLD L RRC A W T
Sbjct: 1052 FMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSAVPPLTF 1111

Query: 764 TSS--LSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 603
+S L +H W++ + + S +F Y W +W RN+ F+ + P
Sbjct: 1112 QTSNHLHMHSWMKAACSSQQKDGYGT-NWSLIFPYILWNLWKARNRLVFDNNITAP 1166

Score = 27.3 bits (59), Expect(3) = 3e-28
Identities = 11/17 (64%), Positives = 13/17 (76%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
PA GF KLN+DGA K +
Sbjct: 1203 PAAGFTKLNSDGACKSH 1219

>dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]
Length = 1366

Score = 101 bits (252), Expect(3) = 3e-28
Identities = 50/111 (45%), Positives = 77/111 (69%), Gaps = 1/111 (0%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A AGGL+R+E G V G+ +IGT + +AELW +R+GL+LA+++ + ++ E D+E VV
Sbjct: 1223 ASAGGLLRNENGLWVAGYTCNIGTANSFLAELWGLREGLLLAKNRGFTKLIAETDSEAVV 1282

Query: 265 YLLQQEPVPLHPSFNILC-DCRSLLSIFQSVSIRHVHRVGNKCADFLANQG 116
+L+++ P+ P +IL DC+ LL FQ + + H+ R GN+CADFLAN G
Sbjct: 1283 QVLRKDG-PVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCADFLANLG 1332

Score = 44.3 bits (103), Expect(3) = 3e-28
Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 2/116 (1%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
F+W + + D+A C C E+ TLD L RRC A W T
Sbjct: 1052 FMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSAVPPLTF 1111

Query: 764 TSS--LSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 603
+S L +H W++ + ++ S +F Y W +W RN+ F+ + P
Sbjct: 1112 QTSNHLHMHSWMKAACSSQQKDGYST-NWSLIFPYILWNLWKARNRLVFDNNITAP 1166

Score = 27.3 bits (59), Expect(3) = 3e-28
Identities = 11/17 (64%), Positives = 13/17 (76%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
PA GF KLN+DGA K +
Sbjct: 1203 PAAGFTKLNSDGACKSH 1219

>dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]
Length = 1898

Score = 101 bits (252), Expect(3) = 4e-28
Identities = 50/111 (45%), Positives = 77/111 (69%), Gaps = 1/111 (0%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A AGGL+R+E G V G+ +IGT + +AELW +R+GL+LA+++ + ++ E D+E VV
Sbjct: 1755 ASAGGLLRNENGLWVAGYTCNIGTANSFLAELWGLREGLLLAKNRGFTKLIAETDSEAVV 1814

Query: 265 YLLQQEPVPLHPSFNILC-DCRSLLSIFQSVSIRHVHRVGNKCADFLANQG 116
+L+++ P+ P +IL DC+ LL FQ + + H+ R GN+CADFLAN G
Sbjct: 1815 QVLRKDG-PVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCADFLANLG 1864

Score = 43.9 bits (102), Expect(3) = 4e-28
Identities = 30/116 (25%), Positives = 46/116 (39%), Gaps = 2/116 (1%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
F+W + + D+A C C E+ TLD L RRC A W T
Sbjct: 1584 FMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSAVPPLTF 1643

Query: 764 TSS--LSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 603
+S L +H W++ + + S +F Y W +W RN+ F+ + P
Sbjct: 1644 QTSNHLHMHSWMKAACSSQQKDGYGT-NWSLIFPYILWNLWKARNRLVFDNNITAP 1698

Score = 27.3 bits (59), Expect(3) = 4e-28
Identities = 11/17 (64%), Positives = 13/17 (76%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
PA GF KLN+DGA K +
Sbjct: 1735 PAAGFTKLNSDGACKSH 1751

>emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1378

Score = 68.9 bits (167), Expect(3) = 7e-25
Identities = 44/111 (39%), Positives = 62/111 (55%), Gaps = 1/111 (0%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
AG GGLIR RG F + G+ T T AEL AV GL++A + N +++ VD+E+V
Sbjct: 1232 AGGGGLIRGPRGEIHEVFAINCGSCTCTKAELLAVLRGLMIAWEGNHKQVIVSVDSELVA 1291

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQ-SVSIRHVHRVGNKCADFLANQG 116
LL P P +I+ C SL++ + + I H +R N+ AD LAN G
Sbjct: 1292 KLLISNAPPSSPYIHIINRCLSLIARKEWKIVIEHCYRETNRAADRLANMG 1342

Score = 62.0 bits (149), Expect(3) = 7e-25
Identities = 48/160 (30%), Positives = 70/160 (43%), Gaps = 8/160 (5%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLG-LQNT 768
F+WL D R+ D C C E D +LRRCP A LW KLG L
Sbjct: 1059 FIWLAIQDRLMTNSNRFLRRLTDDPRCLVCGEVEENTDHILRRCPVARILWRKLGMLGEH 1118

Query: 767 LTSSLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP-DHKS 591
+++ W+ N D++ L +FA +CW +W +RN R F + +P D S
Sbjct: 1119 NREEINLGSWITKNLSADTMMGSEWL---RVFAVSCWWLWRWRNDRCFNRNPSIPIDQVS 1175

Query: 590 FCLSKAGEF---Y*RGPDNKGTRSTRSR--IISW-VPRQG 489
F ++ E R NK S R + ++ W P++G
Sbjct: 1176 FIFARVKEIKEAMDRNDTNKSQHSGRRKEILVRWQCPKEG 1215

Score = 31.2 bits (69), Expect(3) = 7e-25
Identities = 13/17 (76%), Positives = 15/17 (88%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
P G++KLNTDGASKGN
Sbjct: 1212 PKEGWVKLNTDGASKGN 1228

>gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase); Polynucleotidyl transferase, Ribonuclease H fold [Medicago truncatula]
Length = 729

Score = 53.9 bits (128), Expect(4) = 2e-22
Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 2/127 (1%)
Frame = -3

Query: 950 QDFLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQN 771
Q F+WL H V S C C E+ T+ +LR C + +WL+L N
Sbjct: 477 QTFIWLAAHGRILTNYRRSKWGVGISPTCPCCAREDETVIHVLRDCVHSTQVWLRLIPHN 536

Query: 770 TLTS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDH 597
+T+ S +W+ N I N T F CW +W +RNK FE P +
Sbjct: 537 YITNFFSFDCREWVFNNLNKKGIGD-NPATWQTTFMTTCWYLWNWRNKSIFEIGFQRPSN 595

Query: 596 KSFCLSK 576
+ + K
Sbjct: 596 PTLVIQK 602

Score = 50.8 bits (120), Expect(4) = 2e-22
Identities = 23/69 (33%), Positives = 41/69 (59%)
Frame = -1

Query: 466 WSFQG*FAGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLE 287
W G AG GGL+RD G + G+++ IG AE+W + GL +A +N +++++E
Sbjct: 644 WKGSGTLAGCGGLLRDSDGRWIKGYFKKIGMCDAFHAEMWGMYLGLDMAWRENTTHLIVE 703

Query: 286 VDAEVVVYL 260
D++++ L
Sbjct: 704 SDSKILSLL 712

Score = 44.7 bits (104), Expect(4) = 2e-22
Identities = 22/66 (33%), Positives = 36/66 (54%)
Frame = -1

Query: 1138 ELQKRIHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGNWDWVWDVKAA 959
++ +I A+P + PD W GTN +F+ +SA+NL + G+W +W+ K
Sbjct: 414 DIVNQILALPTPSDFDGPDTIGWGGTNTLKFTVQSAYNLQQENPFAVGGDWKTLWNWKGP 473

Query: 958 PRIKTF 941
RI+TF
Sbjct: 474 HRIQTF 479

Score = 23.9 bits (50), Expect(4) = 2e-22
Identities = 10/17 (58%), Positives = 13/17 (76%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
P G++KLN DGA KG+
Sbjct: 631 PPFGWVKLNCDGAWKGS 647

>gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 321

Score = 72.4 bits (176), Expect(3) = 4e-21
Identities = 44/109 (40%), Positives = 59/109 (54%)
Frame = -1

Query: 448 FAGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVV 269
FA AGG++RD G + GF +IG + +AELW V GL +A + + LEVD+++V
Sbjct: 175 FATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVDSKMV 234

Query: 268 VYLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 122
V L HP +L C LS V I HV+R N+ AD LAN
Sbjct: 235 VGFLTTGIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLADGLAN 283

Score = 45.1 bits (105), Expect(3) = 4e-21
Identities = 40/158 (25%), Positives = 64/158 (40%), Gaps = 6/158 (3%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
FLWL + D+ +C+ C+G E T+ +LR CP +W +L ++ +
Sbjct: 9 FLWLVVQQVIITNVERYRRHLSDTRVCQICQGGEETILHVLRDCPAMAGIWSRLVPRDQI 68

Query: 764 TS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 591
+ S+ +W+ N + + + P T+F A W W +R F + D
Sbjct: 69 RQFFTASLLEWIYKN-----LRERGSWP--TVFVMAVWWGWKWRCGNIFGGNGKCRDRVK 121

Query: 590 FCLSKAGEFY*RGPDNKGTR---STRSRIISWV-PRQG 489
F A E KG S R++SWV P G
Sbjct: 122 FIKDLAEEVAIANAFVKGNEVRVSRVERLVSWVSPEDG 159

Score = 32.0 bits (71), Expect(3) = 4e-21
Identities = 13/18 (72%), Positives = 16/18 (88%)
Frame = -2

Query: 501 SPAGGFLKLNTDGASKGN 448
SP G++KLNTDGAS+GN
Sbjct: 155 SPEDGWVKLNTDGASRGN 172

>gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana]
Length = 1055

Score = 67.8 bits (164), Expect(3) = 1e-20
Identities = 43/108 (39%), Positives = 57/108 (52%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A AGG++RD G GF +IG + AELW V GL A +K + + LEVD+EV+V
Sbjct: 584 ASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVDSEVIV 643

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 122
L+ HP ++ C L V I HV+R N+ AD LAN
Sbjct: 644 GFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLAN 691

Score = 49.7 bits (117), Expect(3) = 1e-20
Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 5/153 (3%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
FLWL + + S +C+ C+G ++ +LR CP + +W+++ Q
Sbjct: 412 FLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQ 471

Query: 764 TS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 591
S S+ +WL N S ++P ST+FA W W +R F ++ D
Sbjct: 472 QGFFSKSLFEWLYDN--LGDRSGCEDIPWSTIFAVIIWWGWKWRCGNIFGENTKCRDRVK 529

Query: 590 FCLSKAGEFY*RGPDNKGTRSTRSRI---ISWV 501
F A E Y N T+ R+ I WV
Sbjct: 530 FVKEWAVEVYRAHSGNVLVGITQPRVERMIGWV 562

Score = 30.4 bits (67), Expect(3) = 1e-20
Identities = 12/18 (66%), Positives = 16/18 (88%)
Frame = -2

Query: 501 SPAGGFLKLNTDGASKGN 448
SP G++K+NTDGAS+GN
Sbjct: 563 SPCVGWVKVNTDGASRGN 580

>sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H protein At1g65750
Length = 620

Score = 67.8 bits (164), Expect(3) = 1e-20
Identities = 43/108 (39%), Positives = 57/108 (52%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A AGG++RD G GF +IG + AELW V GL A +K + + LEVD+EV+V
Sbjct: 475 ASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVDSEVIV 534

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 122
L+ HP ++ C L V I HV+R N+ AD LAN
Sbjct: 535 GFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLAN 582

Score = 49.7 bits (117), Expect(3) = 1e-20
Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 5/153 (3%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
FLWL + + S +C+ C+G ++ +LR CP + +W+++ Q
Sbjct: 303 FLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQ 362

Query: 764 TS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 591
S S+ +WL N S ++P ST+FA W W +R F ++ D
Sbjct: 363 QGFFSKSLFEWLYDN--LGDRSGCEDIPWSTIFAVIIWWGWKWRCGNIFGENTKCRDRVK 420

Query: 590 FCLSKAGEFY*RGPDNKGTRSTRSRI---ISWV 501
F A E Y N T+ R+ I WV
Sbjct: 421 FVKEWAVEVYRAHSGNVLVGITQPRVERMIGWV 453

Score = 30.4 bits (67), Expect(3) = 1e-20
Identities = 12/18 (66%), Positives = 16/18 (88%)
Frame = -2

Query: 501 SPAGGFLKLNTDGASKGN 448
SP G++K+NTDGAS+GN
Sbjct: 454 SPCVGWVKVNTDGASRGN 471

>gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 970

Score = 62.0 bits (149), Expect(4) = 7e-20
Identities = 37/108 (34%), Positives = 56/108 (51%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A A G I + +G + GF +IG+ +AELW GL++A DK + L +D+E+VV
Sbjct: 825 AAASGAILNLQGEWLGGFALNIGSCDAPLAELWGAYYGLLIAWDKGFRRVELNLDSELVV 884

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 122
L HP ++ C+ + V + HV+R N+ AD LAN
Sbjct: 885 GFLSTGISKAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLAN 932

Score = 45.4 bits (106), Expect(4) = 7e-20
Identities = 33/131 (25%), Positives = 47/131 (35%), Gaps = 5/131 (3%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 765
F+WL +H + D A C C G + ++ +LR CP +W +L Q
Sbjct: 659 FIWLVSHMVIMTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLLPQRRQ 718

Query: 764 TSSLSVHKWLEINS*TDSISQFNNLPSS-----TLFAYACWTIWYYRNKRKFEPHSILPD 600
S +WL F NL + TLF+ W W +R F + D
Sbjct: 719 NEFFSQFEWL-----------FTNLDPAKGDWPTLFSMGIWWAWKWRCGDVFGERKLCRD 767

Query: 599 HKSFCLSKAGE 567
F A E
Sbjct: 768 RLKFIKDIAEE 778

Score = 32.7 bits (73), Expect(4) = 7e-20
Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Frame = -1

Query: 1135 LQKRIHAIPFSNL---ATRPDNTRWIGTNNGEFSSKSAWNLLLD--QEHTEDGNW-DWVW 974
LQ+++ A+ ++ A D W GT NG+F+ +SA+ LL +E G++ +W
Sbjct: 589 LQEQLSAVAKESISADALLSDELSWKGTQNGDFTVRSAYELLKPEAEERPLIGSFLKQIW 648

Query: 973 DVKAAPRIKTF 941
+ A R++ F
Sbjct: 649 KLVAPERVRVF 659

Score = 24.3 bits (51), Expect(4) = 7e-20
Identities = 9/18 (50%), Positives = 15/18 (83%)
Frame = -2

Query: 501 SPAGGFLKLNTDGASKGN 448
+P+ ++KL TDGAS+G+
Sbjct: 804 APSDRWVKLTTDGASRGH 821

>emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1363

Score = 72.0 bits (175), Expect(3) = 7e-18
Identities = 62/190 (32%), Positives = 90/190 (47%), Gaps = 9/190 (4%)
Frame = -1

Query: 589 FAYQRLVS--FTNVDQIIRAQDLHG----LELFHGFPGRGFSK--TEH*WSFQG*FAGAG 434
F Y R+V+ FTN+ + + + G + L P +GF K T+ W AG G
Sbjct: 1166 FTYNRVVADFFTNI-RAFQVNNTQGNGSKVVLRWKPPHQGFLKLNTDGAWKADWENAGIG 1224

Query: 433 GLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVVYLLQ 254
G+ RD GN +GF + + + AEL A+R+GL +A D N + +E DA+ VV LL
Sbjct: 1225 GVFRDAVGNWELGFAKRVDAGSPEAAELMAIREGLQVAWDCNYHKLEVECDAKGVVQLLA 1284

Query: 253 QE-PVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYVCLFE 77
+ HP I+ D LL+ SV H+ R GNK A LA + N + +
Sbjct: 1285 KPLEAENHPLGVIVMDICILLTRHWSVEFLHIKREGNKVAHCLAAEAVNQVEERVIFINP 1344

Query: 76 PTPALGVLLR 47
P A V +
Sbjct: 1345 PEHAKEVYFK 1354

Score = 37.7 bits (86), Expect(3) = 7e-18
Identities = 18/55 (32%), Positives = 29/55 (52%)
Frame = -1

Query: 1144 PSELQKRIHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGNWDW 980
P ++ K+I IP ++++ D+ W NG FS KSA+ L+ +E G W
Sbjct: 984 PPDILKQIKEIPLASMSEVEDDFTWNFEKNGTFSVKSAYYLINRREEETGGKGSW 1038

Score = 28.5 bits (62), Expect(3) = 7e-18
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 5/87 (5%)
Frame = -3

Query: 866 CEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQN-----TLTSSLSVHKWLEINS*TDSISQ 702
C C + L R C A +W+++ + L +L +W++ N ++Q
Sbjct: 1079 CVACDHPIEDMIHLFRDCCVASSVWIEILKHHKPNNQNLFFNLEWEEWIDFN-----LNQ 1133

Query: 701 FNNLPSSTLFAYACWTIWYYRNKRKFE 621
+ T F A W IW RNK FE
Sbjct: 1134 HDYWV--TKFTTAFWHIWCSRNKTVFE 1158

>gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
Length = 1231

Score = 66.6 bits (161), Expect(3) = 3e-17
Identities = 38/108 (35%), Positives = 58/108 (53%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A AGG IR+ +G + GF +IG+ +AELW GL++A DK + L++D ++VV
Sbjct: 1086 AAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDLDCKLVV 1145

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 122
L HP ++ C+ + V + HV+R N+ AD LAN
Sbjct: 1146 GFLSTGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLAN 1193

Score = 41.2 bits (95), Expect(3) = 3e-17
Identities = 34/128 (26%), Positives = 50/128 (39%), Gaps = 2/128 (1%)
Frame = -3

Query: 944 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKL--GLQN 771
F+WL + + + ++AIC C G E T+ +LR CP +W +L ++
Sbjct: 918 FIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLRDCPAMEPIWRRLLPLRRH 977

Query: 770 TLTSSLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 591
S S+ +WL N D + TLF W W +R F I D
Sbjct: 978 HEFFSQSLLEWLFTN--MDPVKGI----WPTLFGMGIWWAWKWRCCDVFGERKICRDRLK 1031

Query: 590 FCLSKAGE 567
F A E
Sbjct: 1032 FIKDMAEE 1039

Score = 28.1 bits (61), Expect(3) = 3e-17
Identities = 10/17 (58%), Positives = 15/17 (88%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
P+ G++K+ TDGAS+GN
Sbjct: 1066 PSDGWVKITTDGASRGN 1082

>gb|EOY19161.1| Polynucleotidyl transferase, putative [Theobroma cacao]
Length = 419

Score = 71.2 bits (173), Expect(3) = 5e-17
Identities = 35/106 (33%), Positives = 59/106 (55%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A +GG+IRDE GN + GF + IG T + AE W + GL L ++ + +E+D+ + +
Sbjct: 276 AASGGVIRDEYGNWIAGFCQKIGITFSLTAEPWGIYQGLTLCWNRGLRKFCVEIDSMLAL 335

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFL 128
+ + L P+ +L + LL V+I HVHR ++C D++
Sbjct: 336 QKIYSQSSMLDPNAQLLRRIKELLQQSWDVTISHVHREADQCTDWM 381

Score = 37.7 bits (86), Expect(3) = 5e-17
Identities = 27/115 (23%), Positives = 46/115 (40%), Gaps = 2/115 (1%)
Frame = -3

Query: 941 LWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKL--GLQNT 768
+W H+ + +A C TL LR C K+ LWL+L + ++
Sbjct: 108 IWRILHEALPTSEWLLKRHLRSTAFYFRCEAPVETLVHALRDCGKSKLLWLQLRPNIHSS 167

Query: 767 LTSSLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 603
S + W+ N +P + +F +A W +W++RN F+ I P
Sbjct: 168 DFFSEELKPWVLKN--LACKDPVEGIPWAIIFIHAIWLLWFWRNMNLFDKSFIWP 220

Score = 26.2 bits (56), Expect(3) = 5e-17
Identities = 9/16 (56%), Positives = 13/16 (81%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKG 451
P G++KLN DG++KG
Sbjct: 256 PKNGYVKLNVDGSAKG 271

>emb|CAB78008.1| putative protein [Arabidopsis thaliana]
 gi|7321072|emb|CAB82119.1| putative protein [Arabidopsis thaliana]
Length = 947

Score = 48.9 bits (115), Expect(4) = 7e-17
Identities = 26/63 (41%), Positives = 38/63 (60%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A GG++RD G+ GF IG + +AELW V GL +A ++ + + LEVD+E+VV
Sbjct: 864 ATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWGVYYGLYMAWERRFTRVELEVDSELVV 923

Query: 265 YLL 257
L
Sbjct: 924 GFL 926

Score = 43.5 bits (101), Expect(4) = 7e-17
Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 5/135 (3%)
Frame = -3

Query: 878 DSAICEGCRGEEVTLDDLLRRCPKAIDLWLKL-GLQNTLT-SSLSVHKWLEINS*TDSIS 705
D+++C+ C+G + T+ +L+ CP +W +L +Q + + S+ WL +N +
Sbjct: 714 DTSVCQVCKGGDETILHVLKDCPSIAGIWRRLVQVQRSYDFFNGSLFGWLYVNLGMKNAE 773

Query: 704 QFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKSFCLSKAGEF---Y*RGPDNKGT 534
+TLFA W W +R F D F A E + N G
Sbjct: 774 --TGYAWATLFAIVVWWSWKWRCGYVFGEVGKCRDRVKFFRDLAAEVSHAHAIHSQNGGL 831

Query: 533 RSTRSRIISWVPRQG 489
R+ R+++W P G
Sbjct: 832 RTRVERLVAWKPPDG 846

Score = 31.2 bits (69), Expect(4) = 7e-17
Identities = 13/18 (72%), Positives = 16/18 (88%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGNL 445
P G ++KLNTDGAS+GNL
Sbjct: 844 PDGEWVKLNTDGASRGNL 861

Score = 30.4 bits (67), Expect(4) = 7e-17
Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Frame = -1

Query: 1117 AIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGN----WDWVWDVKAAPRI 950
A+ ++ D W + +G F+ KSA+ LL + +H N +D +W V A R+
Sbjct: 647 AVVVDSVTGARDRLSWGYSADGVFTVKSAYRLLTE-DHDPRPNMAAFFDRLWRVVALERV 705

Query: 949 KTF 941
KTF
Sbjct: 706 KTF 708

>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
Length = 1014

Score = 90.1 bits (222), Expect(2) = 5e-16
Identities = 49/119 (41%), Positives = 70/119 (58%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A GGL+RD G V GF +IG + + AEL A+ GL+L +D+NI + +E+DA VV+
Sbjct: 873 AATGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVI 932

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYV 89
++QQ H +L R LS F S I H+ R GN+ ADFL+N+G + NL +
Sbjct: 933 QMIQQSKKGSHDIRYLLASIRKCLSFF-SFRISHIFREGNQAADFLSNKGHTHQNLQVI 990

Score = 21.9 bits (45), Expect(2) = 5e-16
Identities = 9/17 (52%), Positives = 11/17 (64%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGN 448
P G KLN DG+S+ N
Sbjct: 854 PVTGEYKLNVDGSSRHN 870

>gb|EMJ08167.1| hypothetical protein PRUPE_ppb019197mg [Prunus persica]
Length = 363

Score = 74.7 bits (182), Expect(3) = 7e-16
Identities = 42/112 (37%), Positives = 63/112 (56%), Gaps = 1/112 (0%)
Frame = -1

Query: 442 GAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVVY 263
GAGG+IRD G+ + GF ++G ELW + GL L K ++ + +E+D+ V
Sbjct: 239 GAGGIIRDSFGDWMGGFAVNLGIGQTLDDELWGLFFGLKLVAAKGVARLSIEMDSMTDVQ 298

Query: 262 LLQQE-PVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCN 110
L++Q P LHP ++ C +L+S F+ V + HV+R N AD LAN N
Sbjct: 299 LIKQHVPSCLHPCTGVIASCVALISKFEFVELTHVYRERNAAADCLANWSLN 350

Score = 32.3 bits (72), Expect(3) = 7e-16
Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 3/75 (4%)
Frame = -3

Query: 698 NNLPSSTLFAYACWT-IWYYRNKRKFEPHSILPDH-KSFCLSKAGEFY*RGPDNKGTRST 525
+NL S +++ W IW +RN R F + LP H K S E+ P++ R+
Sbjct: 149 SNLCSKSVYDLQPWLFIWKWRNSRVFNVEAELPFHPKRIIASAVSEWLQTCPNSISKRTQ 208

Query: 524 RSRIISW-VPRQGVF 483
+++W P GVF
Sbjct: 209 VQIMLAWEPPMNGVF 223

Score = 24.3 bits (51), Expect(3) = 7e-16
Identities = 9/18 (50%), Positives = 11/18 (61%)
Frame = -2

Query: 498 PAGGFLKLNTDGASKGNL 445
P G KLN DG+ KG +
Sbjct: 218 PMNGVFKLNVDGSRKGGI 235

>ref|XP_002309989.1| predicted protein [Populus trichocarpa]
Length = 245

Score = 85.5 bits (210), Expect(2) = 3e-15
Identities = 57/141 (40%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
AGAGG+IRD G + GF R+IG ++ AELWAV GL LA D+ + LE D++VVV
Sbjct: 99 AGAGGVIRDHLGAWIGGFARNIGICSSVNAELWAVYVGLQLAWDRGFRKVDLESDSKVVV 158

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLD--Y 92
L+ + V + ++NI+ + +L V+ HV+R N AD+LAN G LD
Sbjct: 159 GLINGDSVRVDRNYNIIMQIKGMLGRDWEVTTYHVYREANCVADWLANYGLTRDLLDRGS 218

Query: 91 VCLFEPTPALGVLLRFGC*GS 29
L EP L LL + GS
Sbjct: 219 DVLEEPPSGLYPLLYYDLIGS 239

Score = 23.9 bits (50), Expect(2) = 3e-15
Identities = 9/12 (75%), Positives = 10/12 (83%)
Frame = -2

Query: 483 LKLNTDGASKGN 448
+KLN DG SKGN
Sbjct: 84 IKLNVDGCSKGN 95

>gb|EOY02505.1| RNA-binding (RRM/RBD/RNP motifs) family protein isoform 1 [Theobroma cacao]
 gi|508710609|gb|EOY02506.1| RNA-binding (RRM/RBD/RNP motifs) family protein isoform 1 [Theobroma cacao]
Length = 344

Score = 88.2 bits (217), Expect(2) = 4e-15
Identities = 49/132 (37%), Positives = 77/132 (58%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
AGAGG+IR+++G +G+ R +G T+T AE W +RDGL LA + + +++++VD +VV+
Sbjct: 86 AGAGGIIRNDQGEWNVGYSRKLGQATSTCAEHWGLRDGLQLAVKRGLFDVIIKVDLQVVL 145

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYVC 86
L+ +E V H I+ +CRSLL + ++R N CAD LA G D+V
Sbjct: 146 DLICKEAVDSHTLGPIIKECRSLLEQIPNHRFCQINRDSNCCADHLARMGATMTK-DFVI 204

Query: 85 LFEPTPALGVLL 50
P + +LL
Sbjct: 205 FEFPPDCIKLLL 216

Score = 20.8 bits (42), Expect(2) = 4e-15
Identities = 8/14 (57%), Positives = 10/14 (71%)
Frame = -2

Query: 498 PAGGFLKLNTDGAS 457
P GF KLN+ G+S
Sbjct: 66 PPRGFFKLNSGGSS 79

>ref|XP_002314708.1| predicted protein [Populus trichocarpa]
Length = 245

Score = 84.0 bits (206), Expect(2) = 1e-14
Identities = 49/117 (41%), Positives = 69/117 (58%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
AGAGG+IRD G + GF R+IG ++ AELWAV GL LA D+ + LE D++VVV
Sbjct: 99 AGAGGVIRDHLGAWIGGFARNIGICSSVNAELWAVYVGLQLAWDRGFRKVDLESDSKVVV 158

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLD 95
L+ + V + ++NI+ + +L V+ HV+R N AD+LAN G LD
Sbjct: 159 GLINGDSVRVDRNYNIIMQIKGMLGRNWEVTTYHVYREANCVADWLANYGLTRDLLD 215

Score = 23.9 bits (50), Expect(2) = 1e-14
Identities = 9/12 (75%), Positives = 10/12 (83%)
Frame = -2

Query: 483 LKLNTDGASKGN 448
+KLN DG SKGN
Sbjct: 84 IKLNVDGCSKGN 95

>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
Length = 2251

Score = 84.3 bits (207), Expect(2) = 7e-14
Identities = 50/116 (43%), Positives = 66/116 (56%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A GG++RD G V GF ++GT + AEL A+ GLIL RD NI + +E+DA V+
Sbjct: 2110 AAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVI 2169

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANL 98
LLQ H ++ R LLS F S H+ R GN+ ADFLAN+G + NL
Sbjct: 2170 RLLQGNHRGPHAIRYLMVSLRQLLSHF-SFRFSHIFREGNQAADFLANRGHEHQNL 2224

Score = 20.4 bits (41), Expect(2) = 7e-14
Identities = 8/15 (53%), Positives = 11/15 (73%)
Frame = -2

Query: 498 PAGGFLKLNTDGASK 454
P+ G KLN DG++K
Sbjct: 2091 PSLGEFKLNVDGSAK 2105

>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
Length = 1702

Score = 84.3 bits (207), Expect = 8e-14
Identities = 47/116 (40%), Positives = 67/116 (57%)
Frame = -1

Query: 445 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 266
A GGL+RD G V GF +IG + + AEL A+ GL+L +++NI + +E+DA V +
Sbjct: 1393 AAIGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWIEMDALVAI 1452

Query: 265 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANL 98
++QQ H +L R LS F S I H+ R GN+ ADFL+N+G NL
Sbjct: 1453 QMIQQSQKGSHDIQYLLASIRKCLSFF-SFRISHIFREGNQVADFLSNKGHTQQNL 1507

Score = 73.2 bits (178), Expect = 2e-10
Identities = 56/189 (29%), Positives = 88/189 (46%), Gaps = 14/189 (7%)
Frame = -1

Query: 622 NPTLSFLIIKAFAYQRLVSFTNVDQIIRA-------QDLHGLE--LFHGFPGRGFSKTEH 470
N FL K Q L+ F+ + + A QD HG ++ P G K
Sbjct: 1491 NQVADFLSNKGHTQQNLLVFSEAEGELHAHWGLRYEQDSHGHPKIIYWSRPLMGEFKLNV 1550

Query: 469 *WSFQG*F--AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNI 296
+ F A +GG+ RD + GF + G +T AEL A+ GL+L + NIS +
Sbjct: 1551 DGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPYNSTQAELMALHRGLLLCNEYNISRV 1610

Query: 295 VLEVDAEVVVYLLQQEPVPLHPS---FNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLA 125
+E+DA+ +V +L + + + +C C S + S I H+HR N+ AD+L+
Sbjct: 1611 WIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGI----SYRISHIHRESNQAADYLS 1666

Query: 124 NQGCNYANL 98
NQG + +L
Sbjct: 1667 NQGHTHQSL 1675