BLASTX nr result
ID: Catharanthus23_contig00023893
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00023893 (1183 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] 102 1e-31 dbj|BAE79382.1| unnamed protein product [Ipomoea batatas] 101 1e-31 dbj|BAE79384.1| unnamed protein product [Ipomoea batatas] 101 2e-31 emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga... 69 5e-25 gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptas... 54 2e-22 gb|AAD22368.1| putative non-LTR retroelement reverse transcripta... 72 4e-21 gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana] 68 1e-20 sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H pr... 68 1e-20 gb|AAC26674.1| putative non-LTR retroelement reverse transcripta... 64 3e-20 gb|AAC63844.1| putative non-LTR retroelement reverse transcripta... 67 3e-17 emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga... 72 4e-17 gb|EOY19161.1| Polynucleotidyl transferase, putative [Theobroma ... 71 6e-17 emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|73210... 49 7e-17 gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] 90 5e-16 gb|EMJ08167.1| hypothetical protein PRUPE_ppb019197mg [Prunus pe... 75 8e-16 gb|EOY02505.1| RNA-binding (RRM/RBD/RNP motifs) family protein i... 89 2e-15 ref|XP_002309989.1| predicted protein [Populus trichocarpa] 85 6e-15 ref|XP_002314708.1| predicted protein [Populus trichocarpa] 84 1e-14 gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] 84 8e-14 gb|AAF97302.1|AC007843_5 Hypothetical protein [Arabidopsis thali... 75 8e-14 >dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 102 bits (253), Expect(4) = 1e-31 Identities = 50/111 (45%), Positives = 77/111 (69%), Gaps = 1/111 (0%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGGL+R+E G V G+ +IGT + +AELW +R+GL+LA+++ + ++ E D+E VV Sbjct: 1223 ASAGGLLRNENGLWVAGYICNIGTANSFLAELWGLREGLLLAKNRGFTKLIAETDSEAVV 1282 Query: 308 YLLQQEPVPLHPSFNILC-DCRSLLSIFQSVSIRHVHRVGNKCADFLANQG 159 +L+++ P+ P +IL DC+ LL FQ + + H+ R GN+CADFLAN G Sbjct: 1283 QVLRKDG-PVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCADFLANLG 1332 Score = 43.9 bits (102), Expect(4) = 1e-31 Identities = 30/116 (25%), Positives = 46/116 (39%), Gaps = 2/116 (1%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 F+W + + D+A C C E+ TLD L RRC A W T Sbjct: 1052 FMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSAVPPLTF 1111 Query: 807 TSS--LSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 646 +S L +H W++ + + S +F Y W +W RN+ F+ + P Sbjct: 1112 QTSNHLHMHSWMKAACSSQQKDGYGT-NWSLIFPYILWNLWKARNRLVFDNNITAP 1166 Score = 32.0 bits (71), Expect(4) = 1e-31 Identities = 16/69 (23%), Positives = 30/69 (43%) Frame = -3 Query: 1166 IHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGNWDWVWDVKAAPRIKT 987 + A P + + + D W + G + SA++L+ + +D + DW+W +IK Sbjct: 993 VRATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIAGHDG-DDRSHDWIWRATCTEKIKL 1051 Query: 986 FFGSATMTG 960 F G Sbjct: 1052 FMWKIVKNG 1060 Score = 27.3 bits (59), Expect(4) = 1e-31 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 PA GF KLN+DGA K + Sbjct: 1203 PAAGFTKLNSDGACKSH 1219 >dbj|BAE79382.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 101 bits (252), Expect(4) = 1e-31 Identities = 50/111 (45%), Positives = 77/111 (69%), Gaps = 1/111 (0%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGGL+R+E G V G+ +IGT + +AELW +R+GL+LA+++ + ++ E D+E VV Sbjct: 1223 ASAGGLLRNENGLWVAGYTCNIGTANSFLAELWGLREGLLLAKNRGFTKLIAETDSEAVV 1282 Query: 308 YLLQQEPVPLHPSFNILC-DCRSLLSIFQSVSIRHVHRVGNKCADFLANQG 159 +L+++ P+ P +IL DC+ LL FQ + + H+ R GN+CADFLAN G Sbjct: 1283 QVLRKDG-PVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCADFLANLG 1332 Score = 44.3 bits (103), Expect(4) = 1e-31 Identities = 30/116 (25%), Positives = 47/116 (40%), Gaps = 2/116 (1%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 F+W + + D+A C C E+ TLD L RRC A W T Sbjct: 1052 FMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSAVPPLTF 1111 Query: 807 TSS--LSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 646 +S L +H W++ + ++ S +F Y W +W RN+ F+ + P Sbjct: 1112 QTSNHLHMHSWMKAACSSQQKDGYST-NWSLIFPYILWNLWKARNRLVFDNNITAP 1166 Score = 32.0 bits (71), Expect(4) = 1e-31 Identities = 16/69 (23%), Positives = 30/69 (43%) Frame = -3 Query: 1166 IHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGNWDWVWDVKAAPRIKT 987 + A P + + + D W + G + SA++L+ + +D + DW+W +IK Sbjct: 993 VRATPIAINSEQEDFLSWPHSTTGMVTVSSAYSLIAGHDG-DDRSHDWIWRATCTEKIKL 1051 Query: 986 FFGSATMTG 960 F G Sbjct: 1052 FMWKIVKNG 1060 Score = 27.3 bits (59), Expect(4) = 1e-31 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 PA GF KLN+DGA K + Sbjct: 1203 PAAGFTKLNSDGACKSH 1219 >dbj|BAE79384.1| unnamed protein product [Ipomoea batatas] Length = 1898 Score = 101 bits (252), Expect(4) = 2e-31 Identities = 50/111 (45%), Positives = 77/111 (69%), Gaps = 1/111 (0%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGGL+R+E G V G+ +IGT + +AELW +R+GL+LA+++ + ++ E D+E VV Sbjct: 1755 ASAGGLLRNENGLWVAGYTCNIGTANSFLAELWGLREGLLLAKNRGFTKLIAETDSEAVV 1814 Query: 308 YLLQQEPVPLHPSFNILC-DCRSLLSIFQSVSIRHVHRVGNKCADFLANQG 159 +L+++ P+ P +IL DC+ LL FQ + + H+ R GN+CADFLAN G Sbjct: 1815 QVLRKDG-PVTPDASILVKDCKLLLDHFQEIKVTHILREGNQCADFLANLG 1864 Score = 43.9 bits (102), Expect(4) = 2e-31 Identities = 30/116 (25%), Positives = 46/116 (39%), Gaps = 2/116 (1%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 F+W + + D+A C C E+ TLD L RRC A W T Sbjct: 1584 FMWKIVKNGLMVNVERKRRGLADAASCPVCGEEDETLDHLFRRCLLAEACWDSAVPPLTF 1643 Query: 807 TSS--LSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 646 +S L +H W++ + + S +F Y W +W RN+ F+ + P Sbjct: 1644 QTSNHLHMHSWMKAACSSQQKDGYGT-NWSLIFPYILWNLWKARNRLVFDNNITAP 1698 Score = 31.2 bits (69), Expect(4) = 2e-31 Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 1/70 (1%) Frame = -3 Query: 1166 IHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDG-NWDWVWDVKAAPRIK 990 + A P + + + D W + G + SA++L+ H DG + DW+W +IK Sbjct: 1525 VRATPIAINSEQEDFPSWPHSTTGMVTVSSAYSLIAG--HDGDGRSHDWIWRATCTEKIK 1582 Query: 989 TFFGSATMTG 960 F G Sbjct: 1583 LFMWKIVKNG 1592 Score = 27.3 bits (59), Expect(4) = 2e-31 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 PA GF KLN+DGA K + Sbjct: 1735 PAAGFTKLNSDGACKSH 1751 >emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1378 Score = 69.3 bits (168), Expect(3) = 5e-25 Identities = 47/128 (36%), Positives = 69/128 (53%), Gaps = 1/128 (0%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 AG GGLIR RG F + G+ T T AEL AV GL++A + N +++ VD+E+V Sbjct: 1232 AGGGGLIRGPRGEIHEVFAINCGSCTCTKAELLAVLRGLMIAWEGNHKQVIVSVDSELVA 1291 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQ-SVSIRHVHRVGNKCADFLANQGCNYANLDYV 132 LL P P +I+ C SL++ + + I H +R N+ AD LAN G ++ V Sbjct: 1292 KLLISNAPPSSPYIHIINRCLSLIARKEWKIVIEHCYRETNRAADRLANMG--VCAVERV 1349 Query: 131 CLFEPTPE 108 + E P+ Sbjct: 1350 VMIEAIPK 1357 Score = 62.0 bits (149), Expect(3) = 5e-25 Identities = 48/160 (30%), Positives = 70/160 (43%), Gaps = 8/160 (5%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLG-LQNT 811 F+WL D R+ D C C E D +LRRCP A LW KLG L Sbjct: 1059 FIWLAIQDRLMTNSNRFLRRLTDDPRCLVCGEVEENTDHILRRCPVARILWRKLGMLGEH 1118 Query: 810 LTSSLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP-DHKS 634 +++ W+ N D++ L +FA +CW +W +RN R F + +P D S Sbjct: 1119 NREEINLGSWITKNLSADTMMGSEWL---RVFAVSCWWLWRWRNDRCFNRNPSIPIDQVS 1175 Query: 633 FCLSKAGEF---Y*RGPDNKGTRSTRSR--IISW-VPRQG 532 F ++ E R NK S R + ++ W P++G Sbjct: 1176 FIFARVKEIKEAMDRNDTNKSQHSGRRKEILVRWQCPKEG 1215 Score = 31.2 bits (69), Expect(3) = 5e-25 Identities = 13/17 (76%), Positives = 15/17 (88%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 P G++KLNTDGASKGN Sbjct: 1212 PKEGWVKLNTDGASKGN 1228 >gb|ABD28505.2| RNA-directed DNA polymerase (Reverse transcriptase); Polynucleotidyl transferase, Ribonuclease H fold [Medicago truncatula] Length = 729 Score = 53.9 bits (128), Expect(4) = 2e-22 Identities = 36/127 (28%), Positives = 52/127 (40%), Gaps = 2/127 (1%) Frame = -2 Query: 993 QDFLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQN 814 Q F+WL H V S C C E+ T+ +LR C + +WL+L N Sbjct: 477 QTFIWLAAHGRILTNYRRSKWGVGISPTCPCCAREDETVIHVLRDCVHSTQVWLRLIPHN 536 Query: 813 TLTS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDH 640 +T+ S +W+ N I N T F CW +W +RNK FE P + Sbjct: 537 YITNFFSFDCREWVFNNLNKKGIGD-NPATWQTTFMTTCWYLWNWRNKSIFEIGFQRPSN 595 Query: 639 KSFCLSK 619 + + K Sbjct: 596 PTLVIQK 602 Score = 50.8 bits (120), Expect(4) = 2e-22 Identities = 23/69 (33%), Positives = 41/69 (59%) Frame = -3 Query: 509 WSFQG*FAGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLE 330 W G AG GGL+RD G + G+++ IG AE+W + GL +A +N +++++E Sbjct: 644 WKGSGTLAGCGGLLRDSDGRWIKGYFKKIGMCDAFHAEMWGMYLGLDMAWRENTTHLIVE 703 Query: 329 VDAEVVVYL 303 D++++ L Sbjct: 704 SDSKILSLL 712 Score = 44.7 bits (104), Expect(4) = 2e-22 Identities = 22/66 (33%), Positives = 36/66 (54%) Frame = -3 Query: 1181 ELQKRIHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGNWDWVWDVKAA 1002 ++ +I A+P + PD W GTN +F+ +SA+NL + G+W +W+ K Sbjct: 414 DIVNQILALPTPSDFDGPDTIGWGGTNTLKFTVQSAYNLQQENPFAVGGDWKTLWNWKGP 473 Query: 1001 PRIKTF 984 RI+TF Sbjct: 474 HRIQTF 479 Score = 23.9 bits (50), Expect(4) = 2e-22 Identities = 10/17 (58%), Positives = 13/17 (76%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 P G++KLN DGA KG+ Sbjct: 631 PPFGWVKLNCDGAWKGS 647 >gb|AAD22368.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 321 Score = 72.4 bits (176), Expect(3) = 4e-21 Identities = 44/109 (40%), Positives = 59/109 (54%) Frame = -3 Query: 491 FAGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVV 312 FA AGG++RD G + GF +IG + +AELW V GL +A + + LEVD+++V Sbjct: 175 FATAGGVLRDHNGAWIGGFAVNIGVCSAPLAELWGVYYGLFIAWGRGARRVELEVDSKMV 234 Query: 311 VYLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 165 V L HP +L C LS V I HV+R N+ AD LAN Sbjct: 235 VGFLTTGIADSHPLSFLLRLCYDFLSKGWIVRISHVYREANRLADGLAN 283 Score = 45.1 bits (105), Expect(3) = 4e-21 Identities = 40/158 (25%), Positives = 64/158 (40%), Gaps = 6/158 (3%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 FLWL + D+ +C+ C+G E T+ +LR CP +W +L ++ + Sbjct: 9 FLWLVVQQVIITNVERYRRHLSDTRVCQICQGGEETILHVLRDCPAMAGIWSRLVPRDQI 68 Query: 807 TS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 634 + S+ +W+ N + + + P T+F A W W +R F + D Sbjct: 69 RQFFTASLLEWIYKN-----LRERGSWP--TVFVMAVWWGWKWRCGNIFGGNGKCRDRVK 121 Query: 633 FCLSKAGEFY*RGPDNKGTR---STRSRIISWV-PRQG 532 F A E KG S R++SWV P G Sbjct: 122 FIKDLAEEVAIANAFVKGNEVRVSRVERLVSWVSPEDG 159 Score = 32.0 bits (71), Expect(3) = 4e-21 Identities = 13/18 (72%), Positives = 16/18 (88%) Frame = -1 Query: 544 SPAGGFLKLNTDGASKGN 491 SP G++KLNTDGAS+GN Sbjct: 155 SPEDGWVKLNTDGASRGN 172 >gb|AAF23831.1|AC007234_3 F1E22.12 [Arabidopsis thaliana] Length = 1055 Score = 67.8 bits (164), Expect(3) = 1e-20 Identities = 43/108 (39%), Positives = 57/108 (52%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGG++RD G GF +IG + AELW V GL A +K + + LEVD+EV+V Sbjct: 584 ASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVDSEVIV 643 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 165 L+ HP ++ C L V I HV+R N+ AD LAN Sbjct: 644 GFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLAN 691 Score = 49.7 bits (117), Expect(3) = 1e-20 Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 5/153 (3%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 FLWL + + S +C+ C+G ++ +LR CP + +W+++ Q Sbjct: 412 FLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQ 471 Query: 807 TS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 634 S S+ +WL N S ++P ST+FA W W +R F ++ D Sbjct: 472 QGFFSKSLFEWLYDN--LGDRSGCEDIPWSTIFAVIIWWGWKWRCGNIFGENTKCRDRVK 529 Query: 633 FCLSKAGEFY*RGPDNKGTRSTRSRI---ISWV 544 F A E Y N T+ R+ I WV Sbjct: 530 FVKEWAVEVYRAHSGNVLVGITQPRVERMIGWV 562 Score = 30.4 bits (67), Expect(3) = 1e-20 Identities = 12/18 (66%), Positives = 16/18 (88%) Frame = -1 Query: 544 SPAGGFLKLNTDGASKGN 491 SP G++K+NTDGAS+GN Sbjct: 563 SPCVGWVKVNTDGASRGN 580 >sp|P0C2F6.1|RNHX1_ARATH RecName: Full=Putative ribonuclease H protein At1g65750 Length = 620 Score = 67.8 bits (164), Expect(3) = 1e-20 Identities = 43/108 (39%), Positives = 57/108 (52%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGG++RD G GF +IG + AELW V GL A +K + + LEVD+EV+V Sbjct: 475 ASAGGVLRDCTGAWCGGFSLNIGRCSAPQAELWGVYYGLYFAWEKKVPRVELEVDSEVIV 534 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 165 L+ HP ++ C L V I HV+R N+ AD LAN Sbjct: 535 GFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLAN 582 Score = 49.7 bits (117), Expect(3) = 1e-20 Identities = 40/153 (26%), Positives = 63/153 (41%), Gaps = 5/153 (3%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 FLWL + + S +C+ C+G ++ +LR CP + +W+++ Q Sbjct: 303 FLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQ 362 Query: 807 TS--SLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 634 S S+ +WL N S ++P ST+FA W W +R F ++ D Sbjct: 363 QGFFSKSLFEWLYDN--LGDRSGCEDIPWSTIFAVIIWWGWKWRCGNIFGENTKCRDRVK 420 Query: 633 FCLSKAGEFY*RGPDNKGTRSTRSRI---ISWV 544 F A E Y N T+ R+ I WV Sbjct: 421 FVKEWAVEVYRAHSGNVLVGITQPRVERMIGWV 453 Score = 30.4 bits (67), Expect(3) = 1e-20 Identities = 12/18 (66%), Positives = 16/18 (88%) Frame = -1 Query: 544 SPAGGFLKLNTDGASKGN 491 SP G++K+NTDGAS+GN Sbjct: 454 SPCVGWVKVNTDGASRGN 471 >gb|AAC26674.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 970 Score = 63.5 bits (153), Expect(4) = 3e-20 Identities = 43/127 (33%), Positives = 64/127 (50%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A A G I + +G + GF +IG+ +AELW GL++A DK + L +D+E+VV Sbjct: 825 AAASGAILNLQGEWLGGFALNIGSCDAPLAELWGAYYGLLIAWDKGFRRVELNLDSELVV 884 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYVC 129 L HP ++ C+ + V + HV+R N+ AD LAN + L + C Sbjct: 885 GFLSTGISKAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLANYAF-FLPLGFHC 943 Query: 128 LFEPTPE 108 FE PE Sbjct: 944 -FEICPE 949 Score = 45.4 bits (106), Expect(4) = 3e-20 Identities = 33/131 (25%), Positives = 47/131 (35%), Gaps = 5/131 (3%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQNTL 808 F+WL +H + D A C C G + ++ +LR CP +W +L Q Sbjct: 659 FIWLVSHMVIMTNVERVRRHLSDIATCSVCNGADESILHVLRDCPAMTPIWQRLLPQRRQ 718 Query: 807 TSSLSVHKWLEINS*TDSISQFNNLPSS-----TLFAYACWTIWYYRNKRKFEPHSILPD 643 S +WL F NL + TLF+ W W +R F + D Sbjct: 719 NEFFSQFEWL-----------FTNLDPAKGDWPTLFSMGIWWAWKWRCGDVFGERKLCRD 767 Query: 642 HKSFCLSKAGE 610 F A E Sbjct: 768 RLKFIKDIAEE 778 Score = 32.7 bits (73), Expect(4) = 3e-20 Identities = 21/71 (29%), Positives = 38/71 (53%), Gaps = 6/71 (8%) Frame = -3 Query: 1178 LQKRIHAIPFSNL---ATRPDNTRWIGTNNGEFSSKSAWNLLLD--QEHTEDGNW-DWVW 1017 LQ+++ A+ ++ A D W GT NG+F+ +SA+ LL +E G++ +W Sbjct: 589 LQEQLSAVAKESISADALLSDELSWKGTQNGDFTVRSAYELLKPEAEERPLIGSFLKQIW 648 Query: 1016 DVKAAPRIKTF 984 + A R++ F Sbjct: 649 KLVAPERVRVF 659 Score = 24.3 bits (51), Expect(4) = 3e-20 Identities = 9/18 (50%), Positives = 15/18 (83%) Frame = -1 Query: 544 SPAGGFLKLNTDGASKGN 491 +P+ ++KL TDGAS+G+ Sbjct: 804 APSDRWVKLTTDGASRGH 821 >gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1231 Score = 66.6 bits (161), Expect(3) = 3e-17 Identities = 38/108 (35%), Positives = 58/108 (53%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGG IR+ +G + GF +IG+ +AELW GL++A DK + L++D ++VV Sbjct: 1086 AAAGGAIRNGQGEWLGGFALNIGSCAAPLAELWGAYYGLLIAWDKGFRRVELDLDCKLVV 1145 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLAN 165 L HP ++ C+ + V + HV+R N+ AD LAN Sbjct: 1146 GFLSTGVSNAHPLSFLVRLCQGFFTRDWLVRVSHVYREANRLADGLAN 1193 Score = 41.2 bits (95), Expect(3) = 3e-17 Identities = 34/128 (26%), Positives = 50/128 (39%), Gaps = 2/128 (1%) Frame = -2 Query: 987 FLWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKL--GLQN 814 F+WL + + + ++AIC C G E T+ +LR CP +W +L ++ Sbjct: 918 FIWLVSQNVIMTNVERVRRHLSENAICSVCNGAEETILHVLRDCPAMEPIWRRLLPLRRH 977 Query: 813 TLTSSLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKS 634 S S+ +WL N D + TLF W W +R F I D Sbjct: 978 HEFFSQSLLEWLFTN--MDPVKGI----WPTLFGMGIWWAWKWRCCDVFGERKICRDRLK 1031 Query: 633 FCLSKAGE 610 F A E Sbjct: 1032 FIKDMAEE 1039 Score = 28.1 bits (61), Expect(3) = 3e-17 Identities = 10/17 (58%), Positives = 15/17 (88%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 P+ G++K+ TDGAS+GN Sbjct: 1066 PSDGWVKITTDGASRGN 1082 >emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1363 Score = 71.6 bits (174), Expect(3) = 4e-17 Identities = 59/169 (34%), Positives = 84/169 (49%), Gaps = 9/169 (5%) Frame = -3 Query: 632 FAYQRLVS--FTNVDQIIRAQDLHG----LELFHGFPGRGFSK--TEH*WSFQG*FAGAG 477 F Y R+V+ FTN+ + + + G + L P +GF K T+ W AG G Sbjct: 1166 FTYNRVVADFFTNI-RAFQVNNTQGNGSKVVLRWKPPHQGFLKLNTDGAWKADWENAGIG 1224 Query: 476 GLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVVYLLQ 297 G+ RD GN +GF + + + AEL A+R+GL +A D N + +E DA+ VV LL Sbjct: 1225 GVFRDAVGNWELGFAKRVDAGSPEAAELMAIREGLQVAWDCNYHKLEVECDAKGVVQLLA 1284 Query: 296 QE-PVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCN 153 + HP I+ D LL+ SV H+ R GNK A LA + N Sbjct: 1285 KPLEAENHPLGVIVMDICILLTRHWSVEFLHIKREGNKVAHCLAAEAVN 1333 Score = 35.4 bits (80), Expect(3) = 4e-17 Identities = 17/53 (32%), Positives = 28/53 (52%) Frame = -3 Query: 1181 ELQKRIHAIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGNWDW 1023 ++ K+I IP ++++ D+ W NG FS KSA+ L+ +E G W Sbjct: 986 DILKQIKEIPLASMSEVEDDFTWNFEKNGTFSVKSAYYLINRREEETGGKGSW 1038 Score = 28.5 bits (62), Expect(3) = 4e-17 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 5/87 (5%) Frame = -2 Query: 909 CEGCRGEEVTLDDLLRRCPKAIDLWLKLGLQN-----TLTSSLSVHKWLEINS*TDSISQ 745 C C + L R C A +W+++ + L +L +W++ N ++Q Sbjct: 1079 CVACDHPIEDMIHLFRDCCVASSVWIEILKHHKPNNQNLFFNLEWEEWIDFN-----LNQ 1133 Query: 744 FNNLPSSTLFAYACWTIWYYRNKRKFE 664 + T F A W IW RNK FE Sbjct: 1134 HDYWV--TKFTTAFWHIWCSRNKTVFE 1158 >gb|EOY19161.1| Polynucleotidyl transferase, putative [Theobroma cacao] Length = 419 Score = 71.2 bits (173), Expect(3) = 6e-17 Identities = 35/106 (33%), Positives = 59/106 (55%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A +GG+IRDE GN + GF + IG T + AE W + GL L ++ + +E+D+ + + Sbjct: 276 AASGGVIRDEYGNWIAGFCQKIGITFSLTAEPWGIYQGLTLCWNRGLRKFCVEIDSMLAL 335 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFL 171 + + L P+ +L + LL V+I HVHR ++C D++ Sbjct: 336 QKIYSQSSMLDPNAQLLRRIKELLQQSWDVTISHVHREADQCTDWM 381 Score = 37.7 bits (86), Expect(3) = 6e-17 Identities = 27/115 (23%), Positives = 46/115 (40%), Gaps = 2/115 (1%) Frame = -2 Query: 984 LWLCNHDXXXXXXXXXXXRVVDSAICEGCRGEEVTLDDLLRRCPKAIDLWLKL--GLQNT 811 +W H+ + +A C TL LR C K+ LWL+L + ++ Sbjct: 108 IWRILHEALPTSEWLLKRHLRSTAFYFRCEAPVETLVHALRDCGKSKLLWLQLRPNIHSS 167 Query: 810 LTSSLSVHKWLEINS*TDSISQFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILP 646 S + W+ N +P + +F +A W +W++RN F+ I P Sbjct: 168 DFFSEELKPWVLKN--LACKDPVEGIPWAIIFIHAIWLLWFWRNMNLFDKSFIWP 220 Score = 26.2 bits (56), Expect(3) = 6e-17 Identities = 9/16 (56%), Positives = 13/16 (81%) Frame = -1 Query: 541 PAGGFLKLNTDGASKG 494 P G++KLN DG++KG Sbjct: 256 PKNGYVKLNVDGSAKG 271 >emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|7321072|emb|CAB82119.1| putative protein [Arabidopsis thaliana] Length = 947 Score = 48.9 bits (115), Expect(4) = 7e-17 Identities = 26/63 (41%), Positives = 38/63 (60%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A GG++RD G+ GF IG + +AELW V GL +A ++ + + LEVD+E+VV Sbjct: 864 ATTGGVLRDGIGHWCGGFALDIGVCSAPLAELWGVYYGLYMAWERRFTRVELEVDSELVV 923 Query: 308 YLL 300 L Sbjct: 924 GFL 926 Score = 43.5 bits (101), Expect(4) = 7e-17 Identities = 34/135 (25%), Positives = 58/135 (42%), Gaps = 5/135 (3%) Frame = -2 Query: 921 DSAICEGCRGEEVTLDDLLRRCPKAIDLWLKL-GLQNTLT-SSLSVHKWLEINS*TDSIS 748 D+++C+ C+G + T+ +L+ CP +W +L +Q + + S+ WL +N + Sbjct: 714 DTSVCQVCKGGDETILHVLKDCPSIAGIWRRLVQVQRSYDFFNGSLFGWLYVNLGMKNAE 773 Query: 747 QFNNLPSSTLFAYACWTIWYYRNKRKFEPHSILPDHKSFCLSKAGEF---Y*RGPDNKGT 577 +TLFA W W +R F D F A E + N G Sbjct: 774 --TGYAWATLFAIVVWWSWKWRCGYVFGEVGKCRDRVKFFRDLAAEVSHAHAIHSQNGGL 831 Query: 576 RSTRSRIISWVPRQG 532 R+ R+++W P G Sbjct: 832 RTRVERLVAWKPPDG 846 Score = 31.2 bits (69), Expect(4) = 7e-17 Identities = 13/18 (72%), Positives = 16/18 (88%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGNL 488 P G ++KLNTDGAS+GNL Sbjct: 844 PDGEWVKLNTDGASRGNL 861 Score = 30.4 bits (67), Expect(4) = 7e-17 Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 4/63 (6%) Frame = -3 Query: 1160 AIPFSNLATRPDNTRWIGTNNGEFSSKSAWNLLLDQEHTEDGN----WDWVWDVKAAPRI 993 A+ ++ D W + +G F+ KSA+ LL + +H N +D +W V A R+ Sbjct: 647 AVVVDSVTGARDRLSWGYSADGVFTVKSAYRLLTE-DHDPRPNMAAFFDRLWRVVALERV 705 Query: 992 KTF 984 KTF Sbjct: 706 KTF 708 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 90.1 bits (222), Expect(2) = 5e-16 Identities = 49/119 (41%), Positives = 70/119 (58%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A GGL+RD G V GF +IG + + AEL A+ GL+L +D+NI + +E+DA VV+ Sbjct: 873 AATGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKDRNIEKLWIEMDALVVI 932 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYV 132 ++QQ H +L R LS F S I H+ R GN+ ADFL+N+G + NL + Sbjct: 933 QMIQQSKKGSHDIRYLLASIRKCLSFF-SFRISHIFREGNQAADFLSNKGHTHQNLQVI 990 Score = 21.9 bits (45), Expect(2) = 5e-16 Identities = 9/17 (52%), Positives = 11/17 (64%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 P G KLN DG+S+ N Sbjct: 854 PVTGEYKLNVDGSSRHN 870 >gb|EMJ08167.1| hypothetical protein PRUPE_ppb019197mg [Prunus persica] Length = 363 Score = 74.7 bits (182), Expect(3) = 8e-16 Identities = 42/112 (37%), Positives = 63/112 (56%), Gaps = 1/112 (0%) Frame = -3 Query: 485 GAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVVY 306 GAGG+IRD G+ + GF ++G ELW + GL L K ++ + +E+D+ V Sbjct: 239 GAGGIIRDSFGDWMGGFAVNLGIGQTLDDELWGLFFGLKLVAAKGVARLSIEMDSMTDVQ 298 Query: 305 LLQQE-PVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCN 153 L++Q P LHP ++ C +L+S F+ V + HV+R N AD LAN N Sbjct: 299 LIKQHVPSCLHPCTGVIASCVALISKFEFVELTHVYRERNAAADCLANWSLN 350 Score = 32.3 bits (72), Expect(3) = 8e-16 Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 3/75 (4%) Frame = -2 Query: 741 NNLPSSTLFAYACWT-IWYYRNKRKFEPHSILPDH-KSFCLSKAGEFY*RGPDNKGTRST 568 +NL S +++ W IW +RN R F + LP H K S E+ P++ R+ Sbjct: 149 SNLCSKSVYDLQPWLFIWKWRNSRVFNVEAELPFHPKRIIASAVSEWLQTCPNSISKRTQ 208 Query: 567 RSRIISW-VPRQGVF 526 +++W P GVF Sbjct: 209 VQIMLAWEPPMNGVF 223 Score = 24.3 bits (51), Expect(3) = 8e-16 Identities = 9/18 (50%), Positives = 11/18 (61%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGNL 488 P G KLN DG+ KG + Sbjct: 218 PMNGVFKLNVDGSRKGGI 235 >gb|EOY02505.1| RNA-binding (RRM/RBD/RNP motifs) family protein isoform 1 [Theobroma cacao] gi|508710609|gb|EOY02506.1| RNA-binding (RRM/RBD/RNP motifs) family protein isoform 1 [Theobroma cacao] Length = 344 Score = 89.4 bits (220), Expect(2) = 2e-15 Identities = 52/134 (38%), Positives = 80/134 (59%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 AGAGG+IR+++G +G+ R +G T+T AE W +RDGL LA + + +++++VD +VV+ Sbjct: 86 AGAGGIIRNDQGEWNVGYSRKLGQATSTCAEHWGLRDGLQLAVKRGLFDVIIKVDLQVVL 145 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYVC 129 L+ +E V H I+ +CRSLL + ++R N CAD LA G D+V Sbjct: 146 DLICKEAVDSHTLGPIIKECRSLLEQIPNHRFCQINRDSNCCADHLARMGATMTK-DFV- 203 Query: 128 LFEPTPELGVLLRF 87 +FE P+ LL F Sbjct: 204 IFEFPPDCIKLLLF 217 Score = 20.8 bits (42), Expect(2) = 2e-15 Identities = 8/14 (57%), Positives = 10/14 (71%) Frame = -1 Query: 541 PAGGFLKLNTDGAS 500 P GF KLN+ G+S Sbjct: 66 PPRGFFKLNSGGSS 79 >ref|XP_002309989.1| predicted protein [Populus trichocarpa] Length = 245 Score = 84.7 bits (208), Expect(2) = 6e-15 Identities = 57/141 (40%), Positives = 78/141 (55%), Gaps = 2/141 (1%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 AGAGG+IRD G + GF R+IG ++ AELWAV GL LA D+ + LE D++VVV Sbjct: 99 AGAGGVIRDHLGAWIGGFARNIGICSSVNAELWAVYVGLQLAWDRGFRKVDLESDSKVVV 158 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLD--Y 135 L+ + V + ++NI+ + +L V+ HV+R N AD+LAN G LD Sbjct: 159 GLINGDSVRVDRNYNIIMQIKGMLGRDWEVTTYHVYREANCVADWLANYGLTRDLLDRGS 218 Query: 134 VCLFEPTPELGVLLRFGC*GS 72 L EP L LL + GS Sbjct: 219 DVLEEPPSGLYPLLYYDLIGS 239 Score = 23.9 bits (50), Expect(2) = 6e-15 Identities = 9/12 (75%), Positives = 10/12 (83%) Frame = -1 Query: 526 LKLNTDGASKGN 491 +KLN DG SKGN Sbjct: 84 IKLNVDGCSKGN 95 >ref|XP_002314708.1| predicted protein [Populus trichocarpa] Length = 245 Score = 84.0 bits (206), Expect(2) = 1e-14 Identities = 49/117 (41%), Positives = 69/117 (58%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 AGAGG+IRD G + GF R+IG ++ AELWAV GL LA D+ + LE D++VVV Sbjct: 99 AGAGGVIRDHLGAWIGGFARNIGICSSVNAELWAVYVGLQLAWDRGFRKVDLESDSKVVV 158 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLD 138 L+ + V + ++NI+ + +L V+ HV+R N AD+LAN G LD Sbjct: 159 GLINGDSVRVDRNYNIIMQIKGMLGRNWEVTTYHVYREANCVADWLANYGLTRDLLD 215 Score = 23.9 bits (50), Expect(2) = 1e-14 Identities = 9/12 (75%), Positives = 10/12 (83%) Frame = -1 Query: 526 LKLNTDGASKGN 491 +KLN DG SKGN Sbjct: 84 IKLNVDGCSKGN 95 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 84.3 bits (207), Expect(2) = 8e-14 Identities = 50/116 (43%), Positives = 66/116 (56%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A GG++RD G V GF ++GT + AEL A+ GLIL RD NI + +E+DA V+ Sbjct: 2110 AAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVI 2169 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANL 141 LLQ H ++ R LLS F S H+ R GN+ ADFLAN+G + NL Sbjct: 2170 RLLQGNHRGPHAIRYLMVSLRQLLSHF-SFRFSHIFREGNQAADFLANRGHEHQNL 2224 Score = 20.4 bits (41), Expect(2) = 8e-14 Identities = 8/15 (53%), Positives = 11/15 (73%) Frame = -1 Query: 541 PAGGFLKLNTDGASK 497 P+ G KLN DG++K Sbjct: 2091 PSLGEFKLNVDGSAK 2105 >gb|AAF97302.1|AC007843_5 Hypothetical protein [Arabidopsis thaliana] gi|55978717|gb|AAV68820.1| hypothetical protein AT1G17390 [Arabidopsis thaliana] Length = 272 Score = 75.5 bits (184), Expect(2) = 8e-14 Identities = 48/128 (37%), Positives = 72/128 (56%) Frame = -3 Query: 488 AGAGGLIRDERGNKVMGFYRHIGTTTNTIAELWAVRDGLILARDKNISNIVLEVDAEVVV 309 A AGG++RD GN GF +IG + +AELW GL +A ++ ++ + +E+D+E+VV Sbjct: 127 ATAGGVVRDGDGNWCYGFSLNIGICSAPLAELWGAYYGLNIAWERGVTQLEMEIDSEMVV 186 Query: 308 YLLQQEPVPLHPSFNILCDCRSLLSIFQSVSIRHVHRVGNKCADFLANQGCNYANLDYVC 129 L+ HP ++ C LLS SV I HV+R N+ AD LAN + L + Sbjct: 187 GFLRTGIDDSHPLSFLVRLCHGLLSKDWSVRISHVYREANRLADGLANYAF-FLPLGF-H 244 Query: 128 LFEPTPEL 105 LF TP++ Sbjct: 245 LFNSTPDI 252 Score = 29.3 bits (64), Expect(2) = 8e-14 Identities = 12/17 (70%), Positives = 14/17 (82%) Frame = -1 Query: 541 PAGGFLKLNTDGASKGN 491 P G+ KLNTDGAS+GN Sbjct: 107 PRVGWFKLNTDGASRGN 123