BLASTX nr result
ID: Catharanthus22_contig00023502
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00023502
         (1283 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAC63844.1| putative non-LTR retroelement reverse transcripta...  138  6e-30
dbj|BAE79385.1| unnamed protein product [Ipomoea batatas]            136  2e-29
dbj|BAE79382.1| unnamed protein product [Ipomoea batatas]            136  2e-29
dbj|BAE79384.1| unnamed protein product [Ipomoea batatas]            135  3e-29
gb|AAD37021.1| putative non-LTR retrolelement reverse transcript...  133  1e-28
ref|XP_004306074.1| PREDICTED: putative ribonuclease H protein A...  131  7e-28
ref|XP_006480844.1| PREDICTED: putative ribonuclease H protein A...  130  1e-27
emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|73210...  124  1e-25
emb|CAB10337.1| reverse transcriptase like protein [Arabidopsis ...  115  2e-25
gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis ...  122  3e-25
ref|XP_004305156.1| PREDICTED: putative ribonuclease H protein A...  115  4e-23
ref|XP_006490008.1| PREDICTED: uncharacterized protein LOC102624...  114  7e-23
emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulga...  112  3e-22
emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. vulga...  111  6e-22
gb|EOY24339.1| RNA-directed DNA polymerase (Reverse transcriptas...  110  1e-21
emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulga...  108  5e-21
gb|EEC76774.1| hypothetical protein OsI_14862 [Oryza sativa Indi...  108  5e-21
gb|EEC74955.1| hypothetical protein OsI_10942 [Oryza sativa Indi...
108 5e-21 gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana] 107 8e-21 gb|AAG03119.1|AC004133_13 F5A9.24 [Arabidopsis thaliana] 107 8e-21 >gb|AAC63844.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1231 Score = 138 bits (347), Expect = 6e-30 Identities = 83/248 (33%), Positives = 134/248 (54%), Gaps = 14/248 (5%) Frame = +3 Query: 6 RLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKS 185 R+L+ FC S QKV+L + +FF V S ++ IS GI + LG YLGMP+L K Sbjct: 564 RVLERFCEASGQKVSLEKSKIFFSHNV-SREMEQLISEESGIGCTKELGKYLGMPILQKR 622 Query: 186 VSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLM 365 ++ T+ ++E + +L+ WKGR +S+A R L + ++ + M PVSTL+ L Sbjct: 623 MNKETFGEVLERVSARLAGWKGRSLSLAGRITLTKAVLSSIPVHVMSAILLPVSTLDTLD 682 Query: 366 RFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRSL-----AKLAWRF*GNLII 530 R++R FLWGS+ ++ + +S RK+C+PK+ GG+G R+ R + AK+ WR ++ Sbjct: 683 RYSRTFLWGSTMEKKKQHLLSWRKICKPKAEGGIGLRSARDMNKALVAKVGWR-----LL 737 Query: 531 YGQEFCILNMVTLRRWI--------LESSGGLLHIHGSLIEGYR-LLRQGLEWEVGDGCD 683 +E +V + + L+ S+ G R ++ +G+ W GDGC Sbjct: 738 QDKESLWARVVRKKYKVGGVQDTSWLKPQPRWSSTWRSVAVGLREVVVKGVGWVPGDGCT 797 Query: 684 ARFWVDVW 707 RFW+D W Sbjct: 798 IRFWLDRW 805 >dbj|BAE79385.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 136 bits (342), Expect = 2e-29 Identities = 77/241 (31%), Positives = 129/241 (53%), Gaps = 9/241 (3%) Frame = +3 Query: 12 LKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVS 191 L F + S KVN +++F V + +K I + +P++ +LG YLG+P+L + VS Sbjct: 707 LDSFSNASGLKVNFSKSLLFCSSNV-NAGLKRAIGSILQVPVAESLGTYLGIPMLKERVS 765 Query: 192 SGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRF 371 T++ +++ + KLS+WK ++MA R +L+Q S A +Y MQ PVST N++ + Sbjct: 766 RNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTCNEIDKT 825 Query: 372 TRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GNL---- 524 R FLWG R +++V+ ++C+P++ GGLG R R L K+AW+ N+ Sbjct: 826 
CRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLW 885 Query: 525 IIYGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDARFWVDV 704 + +E + N L L+S S+++G +L ++W VG+G FW D Sbjct: 886 VKVLREKYVKNADFLH---LQSQSNCSWGWRSIMKGKDVLAGAIKWNVGNGRKINFWNDW 942 Query: 705 W 707 W Sbjct: 943 W 943 >dbj|BAE79382.1| unnamed protein product [Ipomoea batatas] Length = 1366 Score = 136 bits (342), Expect = 2e-29 Identities = 77/241 (31%), Positives = 129/241 (53%), Gaps = 9/241 (3%) Frame = +3 Query: 12 LKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVS 191 L F + S KVN +++F V + +K I + +P++ +LG YLG+P+L + VS Sbjct: 707 LDSFSNASGLKVNFSKSLLFCSSNV-NAGLKRAIGSILQVPVAESLGTYLGIPMLKERVS 765 Query: 192 SGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRF 371 T++ +++ + KLS+WK ++MA R +L+Q S A +Y MQ PVST N++ + Sbjct: 766 RNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTCNEIDKT 825 Query: 372 TRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GNL---- 524 R FLWG R +++V+ ++C+P++ GGLG R R L K+AW+ N+ Sbjct: 826 CRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLW 885 Query: 525 IIYGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDARFWVDV 704 + +E + N L L+S S+++G +L ++W VG+G FW D Sbjct: 886 VKVLREKYVKNADFLH---LQSQSNCSWGWRSIMKGKDVLAGAIKWNVGNGRKINFWNDW 942 Query: 705 W 707 W Sbjct: 943 W 943 >dbj|BAE79384.1| unnamed protein product [Ipomoea batatas] Length = 1898 Score = 135 bits (341), Expect = 3e-29 Identities = 77/241 (31%), Positives = 128/241 (53%), Gaps = 9/241 (3%) Frame = +3 Query: 12 LKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVS 191 L F S KVN +++F V + +K I + +P++ +LG YLG+P+L + VS Sbjct: 1239 LDSFSDASGLKVNFSKSLLFCSSNV-NAGLKRAIGSILQVPVAESLGTYLGIPMLKERVS 1297 Query: 192 SGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRF 371 T++ +++ + KLS+WK ++MA R +L+Q S A +Y MQ PVST N++ + Sbjct: 1298 RNTFNAVIDKMRTKLSSWKASSLNMAGRRVLVQASLATVPTYTMQVMALPVSTCNEIDKT 1357 Query: 372 TRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GNL---- 524 R FLWG R 
+++V+ ++C+P++ GGLG R R L K+AW+ N+ Sbjct: 1358 CRNFLWGHDTNTRKLHSVNWAEICKPRNEGGLGLRMARDFNRAFLTKMAWQIFSNIDKLW 1417 Query: 525 IIYGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDARFWVDV 704 + +E + N L L+S S+++G +L ++W VG+G FW D Sbjct: 1418 VKVLREKYVKNADFLH---LQSQSNCSWGWRSIMKGKDVLAGAIKWNVGNGRKINFWNDW 1474 Query: 705 W 707 W Sbjct: 1475 W 1475 >gb|AAD37021.1| putative non-LTR retrolelement reverse transcriptase [Arabidopsis thaliana] Length = 732 Score = 133 bits (335), Expect = 1e-28 Identities = 83/243 (34%), Positives = 125/243 (51%), Gaps = 9/243 (3%) Frame = +3 Query: 6 RLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKS 185 R+L+ FC S QKV+L + +FF + V S + IS GI + LG YLGMPVL + Sbjct: 226 RVLERFCVASGQKVSLEKSKIFFSENV-SRDLGKLISDESGISSTRELGKYLGMPVLQRR 284 Query: 186 VSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLM 365 ++ T+ ++E L +L+ WKGR +S+A R L + ++ + M T P STL+ L Sbjct: 285 INKDTFGDILEKLTTRLAGWKGRFLSLAGRVTLTKAVLSSIPVHTMSTIALPKSTLDGLD 344 Query: 366 RFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFR-----NLRSLAKLAWRF*GNLII 530 + +R FLWGSS QR + +S ++VC+P+S GGLG R N L+K+ WR + Sbjct: 345 KVSRSFLWGSSVTQRKQHLISWKRVCKPRSEGGLGIRKAQDMNKALLSKVGWRLIQDYHS 404 Query: 531 YGQEF--CILNMVTLR--RWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDARFWV 698 C + +R W S + ++ GL W +GDG + FW+ Sbjct: 405 LWARIMRCNYRVQDVRDGAWTKVRSVCSSTWRSVALGMREVVIPGLSWVIGDGREILFWM 464 Query: 699 DVW 707 D W Sbjct: 465 DKW 467 >ref|XP_004306074.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 407 Score = 131 bits (329), Expect = 7e-28 Identities = 73/203 (35%), Positives = 115/203 (56%), Gaps = 8/203 (3%) Frame = +3 Query: 126 GIPISTNLGMYLGMPVLHKSVSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAA 305 G P++ +LG YLGMP++H V+ TYD + + ++ +LS+WK +++SMA R LIQ+ ++A Sbjct: 3 GSPLTNDLGKYLGMPLIHSRVNKHTYDGIFDQVQSRLSSWKSKVLSMAGRLTLIQSVSSA 62 Query: 306 FSSYAMQTSRTPVSTLNQLMRFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFR--- 476 +YAMQT++ PVS L + R FLWG + ++ ++ V+ VC+PK G +G + Sbjct: 63 IPNYAMQTAKFPVSLCENLDKLNRNFLWGDTEIKKKVHLVNWDVVCQPKQLGDIGIKKTE 122 Query: 477 --NLRSLAKLAWR-F*GNLIIYGQEFC--ILNMVTLRRWILESSGGLLHIHGSLIEGYRL 641 N LAK++WR F + ++ F L L E + +I G +L Sbjct: 123 DMNQAMLAKISWRMFQCDKGLWASMFAEKYLKNCCLFDDNYEVAVDCSSTWRGIIFGAKL 182 Query: 642 LRQGLEWEVGDGCDARFWVDVWF 710 LR L+W +GDG +FW D WF Sbjct: 183 LRSNLKWRLGDGKSIKFWHDYWF 205 >ref|XP_006480844.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Citrus sinensis] Length = 768 Score = 130 bits (327), Expect = 1e-27 Identities = 83/246 (33%), Positives = 122/246 (49%), Gaps = 13/246 (5%) Frame = +3 Query: 9 LLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSV 188 +L DFC S KVN T V+F + VP +V I G ++ +LG YLGMP+LH V Sbjct: 132 VLGDFCLSSGTKVNQSKTHVYFSKNVPD-AVATRIWRDLGYTVTKDLGKYLGMPLLHSRV 190 Query: 189 SSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMR 368 S TY +++ ++KL W +S+A R L Q+ A YAMQT+ P S +L + Sbjct: 191 SQQTYQGILDKTDQKLLGWAASQLSLAGRITLTQSVLQAVPIYAMQTTNLPGSIKTKLDQ 250 Query: 369 FTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNL-----RSLAKLAWRF*GNLIIY 533 R FLW + R M VS +C+PK +GGLGF+ L L K+AW +LI Sbjct: 251 ICRRFLWSGNDELRKMSLVSWHNICQPKMAGGLGFKRLDIMNEALLLKVAW----HLITE 306 Query: 534 GQEFCILNMVT--------LRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDAR 689 + C+ + T + + G H+ S+ + ++G+ W VG+G + Sbjct: 307 PNKLCVQVLSTKYGVPPLEIPHTLPTRYGS--HLWKSVGRVWDYAKRGIRWIVGNGWKVK 364 Query: 690 FWVDVW 707 FW D W Sbjct: 365 FWWDCW 370 >emb|CAB78008.1| putative protein [Arabidopsis thaliana] gi|7321072|emb|CAB82119.1| putative protein 
[Arabidopsis thaliana] Length = 947 Score = 124 bits (310), Expect = 1e-25 Identities = 82/240 (34%), Positives = 123/240 (51%), Gaps = 6/240 (2%) Frame = +3 Query: 6 RLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKS 185 R+L+ FC S QKV+L + +FF + V S ++ IS GI + LG YLGMP+L + Sbjct: 378 RILETFCIASGQKVSLDKSKIFFSKNV-SRDLEKLISKESGIKSTRELGKYLGMPILQRR 436 Query: 186 VSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLM 365 ++ T+ ++E + +L+ WKGR +S A R L ++ + + M T P STL L Sbjct: 437 INKDTFGEVLERVSSRLAGWKGRSLSFAGRLTLTKSVLSLIPIHTMSTISLPQSTLEGLD 496 Query: 366 RFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFR-----NLRSLAKLAWRF*GNLII 530 + R FL GSSA ++ ++ V+ +VC PKS GGLG R N ++K+ WR Sbjct: 497 KLARVFLLGSSAEKKKLHLVAWDRVCLPKSEGGLGIRTSKCMNKALVSKVGWRL------ 550 Query: 531 YGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYR-LLRQGLEWEVGDGCDARFWVDVW 707 I + +L IL S + G R ++ +G W VG+G D FW D W Sbjct: 551 ------INDRYSLWARILRSKYRV---------GLREVVSRGSRWVVGNGRDILFWSDNW 595 >emb|CAB10337.1| reverse transcriptase like protein [Arabidopsis thaliana] gi|7268307|emb|CAB78601.1| reverse transcriptase like protein [Arabidopsis thaliana] Length = 929 Score = 115 bits (289), Expect(2) = 2e-25 Identities = 64/163 (39%), Positives = 95/163 (58%), Gaps = 5/163 (3%) Frame = +3 Query: 36 SQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVSSGTYDFLV 215 +QKV+L + +FF V S ++G I+ GI + LG YLGMPVL K ++ T+ ++ Sbjct: 509 AQKVSLEKSKIFFSNNV-SRDLEGLITAETGIGSTRELGKYLGMPVLQKRINKDTFGEVL 567 Query: 216 ENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRFTRYFLWGS 395 E + +LS WK R +S+A R L + + + M + P S L QL + +R FLWGS Sbjct: 568 ERVSSRLSGWKSRSLSLAGRITLTKAVLMSIPIHTMSSILLPASLLEQLDKVSRNFLWGS 627 Query: 396 SAGQRTMYTVS*RKVCRPKSSGGLGFR-----NLRSLAKLAWR 509 + +R + +S +KVCRPK++GGLG R N LAK+ WR Sbjct: 628 TVEKRKQHLLSWKKVCRPKAAGGLGLRASKDMNRALLAKVGWR 670 Score = 28.5 bits (62), Expect(2) = 2e-25 Identities = 21/64 (32%), Positives = 25/64 (39%), Gaps = 1/64 (1%) Frame = +2 Query: 482 
SILSKTCLAFLRESDHLWARVLHSKYGDIKEVDFGK*WRAPSHTWQPD*GLQAPATRIGV 661 ++L+K L + LWARVL KY D W P TW R GV Sbjct: 662 ALLAKVGWRLLNDKVSLWARVLRRKYKVTDVHDSS--WLVPKATWSSTWRSIGVGLREGV 719 Query: 662 G-GW 670 GW Sbjct: 720 AKGW 723 >gb|ABW81175.1| non-LTR retrotransposon transposase [Arabidopsis cebennensis] Length = 799 Score = 122 bits (306), Expect = 3e-25 Identities = 76/243 (31%), Positives = 122/243 (50%), Gaps = 5/243 (2%) Frame = +3 Query: 6 RLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKS 185 ++L+ FC S QKV+L + +FF Q V ++ IS GI + LG YLGMPVL K Sbjct: 156 KVLEKFCIASGQKVSLEKSKIFFSQNV-HRDLEKFISDESGIKSTKELGKYLGMPVLQKR 214 Query: 186 VSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLM 365 ++ T+ ++ + +L+ WKGRM+S+A R L ++ ++ + M T P +TL+ Sbjct: 215 INKDTFGEILLRVSSRLAGWKGRMLSLAGRLTLTKSVLSSIPIHTMSTIALPKATLDGFD 274 Query: 366 RFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GNLII 530 R ++ F+WGSS ++ + ++ K+ K +GGLG R+ R+ LAK+ WR L Sbjct: 275 RISKSFVWGSSTEKKKQHLLAWNKIYCTKQAGGLGIRSSRAMNTALLAKIGWRL---LQD 331 Query: 531 YGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDARFWVDVWF 710 + + M+ +R ++ G+ W VGDG RFW D W Sbjct: 332 KSSLWARVIMLGMREVVIP---------------------GVSWVVGDGQTTRFWADKWL 370 Query: 711 KRT 719 T Sbjct: 371 MNT 373 >ref|XP_004305156.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. 
vesca] Length = 813 Score = 115 bits (288), Expect = 4e-23 Identities = 76/246 (30%), Positives = 115/246 (46%), Gaps = 10/246 (4%) Frame = +3 Query: 9 LLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSV 188 +L D+ S Q VN G + + F + P S++ +I+ G+ + YLG+P Sbjct: 203 VLADYEKASGQLVNFGKSNIVFSKGTP-VSLQSSIAGELGVGVVVKHEKYLGLPTYVGKS 261 Query: 189 SSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMR 368 + T+ F+ E L KKL W+G+++S A + ILI+ A A SY+M P S L + Sbjct: 262 KTETFAFIKERLSKKLEGWQGKLLSGAGKGILIRVVAQALPSYSMSCFLLPKSFYAALHQ 321 Query: 369 FTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFR-----NLRSLAKLAWRF*GNLIIY 533 F GS R ++ +S K+CRPK GG+GFR N+ LAK WR I+ Sbjct: 322 KCARFWLGSKQEDRKIHWLSWEKLCRPKERGGMGFRDLYAHNIAMLAKQGWR-----ILQ 376 Query: 534 GQEFCILNMVTLRRWILESSGGLLHIHGS-----LIEGYRLLRQGLEWEVGDGCDARFWV 698 + + + R + S GS + E +L +G+ W+VGDG W Sbjct: 377 FPDSLVARLFRARYFPSSSFWSATATDGSACWKGIAEARSVLARGMRWQVGDGTRVCIWE 436 Query: 699 DVWFKR 716 D W R Sbjct: 437 DPWLPR 442 >ref|XP_006490008.1| PREDICTED: uncharacterized protein LOC102624085 [Citrus sinensis] Length = 1635 Score = 114 bits (286), Expect = 7e-23 Identities = 76/265 (28%), Positives = 126/265 (47%), Gaps = 14/265 (5%) Frame = +3 Query: 6 RLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKS 185 +++ +F + S KVN T+V+F + S I + G ++ NLG YLG+P+ H Sbjct: 1197 KIIDEFSASSGAKVNKSKTLVYFSANI-SAMEASRIGSDLGYSVTDNLGKYLGVPLCHSR 1255 Query: 186 VSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLM 365 +S TY +V+ ++++LS W +++A R L Q+ A S YAMQT++ P S ++ Sbjct: 1256 ISKQTYQSIVDKIDQRLSGWNASHLTLAGRITLAQSVLQAISVYAMQTTKLPRSIKMKID 1315 Query: 366 RFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNL-----RSLAKLAWRF*GNLII 530 + R F+W SA + M V+ +C PK GGLGF+ L L K WR + Sbjct: 1316 QLCRRFIWSGSAEHQKMSLVNWDMICTPKCKGGLGFKKLDIMNHALLMKNTWR------L 1369 Query: 531 YGQEFCILNMVTLRRW---------ILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCD 683 + + N V L ++ L + G H+ ++ + R G+ W +GDG Sbjct: 1370 ITEPTKLSNQVLLTKYGVHLDEFPTSLPTRYG-SHLWKAMGSTWEQTRVGMCWNIGDGKR 1428 Query: 684 
ARFWVDVWFKRTIRGGHCTSPSEGS 758 R+ + I H + S G+ Sbjct: 1429 VRYLLPNNILLKIASVHPPTASHGA 1453 >emb|CCA66009.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1378 Score = 112 bits (280), Expect = 3e-22 Identities = 70/246 (28%), Positives = 116/246 (47%), Gaps = 14/246 (5%) Frame = +3 Query: 12 LKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVS 191 L FC S KVN + ++F ++ + T + + + G YLG+P ++ S Sbjct: 709 LDRFCEASGSKVNEDKSKIYF-SANTHLDIRDAVCNTLAMEATADFGKYLGVPTINGRSS 767 Query: 192 SGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRF 371 Y +LV+ + KL+ WK + +S+A RA LIQ++ ++ Y MQ+++ P ST + + R Sbjct: 768 KREYQYLVDRINGKLAGWKTKTLSIAGRATLIQSAFSSIPYYTMQSTKLPRSTCDDIDRK 827 Query: 372 TRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GN----- 521 +R FLWG G+R ++ V+ + + K GGLG R++R L KL WR Sbjct: 828 SRSFLWGEQEGKRRVHLVAWENISKSKKEGGLGIRSMRQANSAFLVKLGWRLLAEPSSLW 887 Query: 522 ----LIIYGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDAR 689 Y C ++M E S G ++ ++R+G+ VG+G Sbjct: 888 SRILRAKYCDNRCDIDM------FKEKSNASSTWRG-ILSSIDVVRKGINSAVGNGAKTL 940 Query: 690 FWVDVW 707 FW W Sbjct: 941 FWHHRW 946 >emb|CCA65995.1| hypothetical protein [Beta vulgaris subsp. 
vulgaris] Length = 1389 Score = 111 bits (278), Expect = 6e-22 Identities = 71/248 (28%), Positives = 120/248 (48%), Gaps = 14/248 (5%) Frame = +3 Query: 6 RLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSV--KGNISTTFGIPISTNLGMYLGMPVLH 179 ++L +C S Q VN + QC P+ K N ++ G+ S+ LG YLG P+++ Sbjct: 717 QILDKYCLMSGQLVNYHKSA---FQCSPNVRDIDKVNFASILGMQESSELGDYLGCPIIN 773 Query: 180 KSVSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQ 359 V+ T+ ++ ++L WK +S A R +LIQ++ A+ +S+ MQ+ P L Sbjct: 774 SRVTKETFAGVISKTVQQLPKWKANSLSQAGRTVLIQSNLASKASFQMQSFTLPKKVLTT 833 Query: 360 LMRFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFR-----NLRSLAKLAWRF*GNL 524 L R F W ++ + K+C+PKS GG+GFR N+ KL W+ Sbjct: 834 LDTTYRNFFWNKDPAAKSANFIGWNKICQPKSVGGVGFRKAEVTNIALQMKLLWK----- 888 Query: 525 IIYGQEFCILNMVTLRRWILESSGGLLHIHG-------SLIEGYRLLRQGLEWEVGDGCD 683 I+ ++ + +VT ++++ E + + I +L+ +GL W +GDG D Sbjct: 889 IMVSKDNIWVKLVT-QKYLKEQNLLVCKIPSNASWQWKNLLRHRNFFSKGLRWLIGDGQD 947 Query: 684 ARFWVDVW 707 FW D W Sbjct: 948 ISFWTDNW 955 >gb|EOY24339.1| RNA-directed DNA polymerase (Reverse transcriptase), Polynucleotidyl transferase, Ribonuclease H fold-like protein [Theobroma cacao] Length = 616 Score = 110 bits (276), Expect = 1e-21 Identities = 66/251 (26%), Positives = 117/251 (46%), Gaps = 13/251 (5%) Frame = +3 Query: 21 FCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVSSGT 200 F S +KVN+ + ++ V ++ N+ G+ STNLG YLG+P+ H + Sbjct: 7 FSKISGEKVNVHKSSFYYSANVSKECIE-NLRNISGLSYSTNLGNYLGVPLFHGRKRITS 65 Query: 201 YDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRFTRY 380 + FL + + KLS WK +S A L+++ + Y MQ P+ + ++ R+ + Sbjct: 66 FKFLEDKVRSKLSGWKAFSLSFAGILTLVKSVLSTIPYYVMQIVSIPLDSCKRMERYCQN 125 Query: 381 FLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLR-----SLAKLAWRF*GNLIIYGQEF 545 FLWG A + ++ + ++CRPK LG + L L KL W+ L+ + Sbjct: 126 FLWGGDADHKRIHLIRCNQICRPKEERSLGVKRLHVMNNAFLMKLLWQ----LVTRPKSL 181 Query: 546 CILNMVTLRRWILESSGGLLHIHG------SLIEGYRLLRQGLEWEVGDGCDARFWVDVW 707 + + + ++ ++ HG +L + + + L W +GDG RFW D+W Sbjct: 182 
WVSIIRGKYNFNMDRRSSSIYCHGASHTWNALSKLWNVFNNNLRWVLGDGLSIRFWKDIW 241 Query: 708 FKRT--IRGGH 734 + T + GH Sbjct: 242 LEDTPLLEQGH 252 >emb|CCA65997.1| hypothetical protein [Beta vulgaris subsp. vulgaris] Length = 1363 Score = 108 bits (270), Expect = 5e-21 Identities = 69/246 (28%), Positives = 122/246 (49%), Gaps = 11/246 (4%) Frame = +3 Query: 3 ERLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHK 182 + +L++F + S +VN+ ++ F + + + ++ + ST+ G YLG +L Sbjct: 703 QNVLEEFGNISGLRVNMSKSLAIFPPKM-NPQRRRMLADFLTMKGSTSFGKYLGCNILPN 761 Query: 183 SVSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQL 362 + G YD L+E ++ ++ W+ + ++MA R LI++ ++F Y MQ+S PVS +N++ Sbjct: 762 KLRRGDYDGLLEKVKSAINGWQAKYLNMAGRCTLIKSVVSSFPVYGMQSSLLPVSVMNEI 821 Query: 363 MRFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFR-----NLRSLAKLAWRF*GNLI 527 + R FLW + +S ++C P GGLGFR NL +AKL W +I Sbjct: 822 EKDCRKFLWNKMDKSHYLARMSWDRICSPTGKGGLGFRRLHNWNLAFMAKLGW-----MI 876 Query: 528 IYGQEFCILNMVTLRRW----ILESSGGLLH--IHGSLIEGYRLLRQGLEWEVGDGCDAR 689 I + + ++ R W L + G H I +++G LL +GL +G+G Sbjct: 877 IKDETKLWVRILKARYWERGSFLSAVGKNHHSPIWRDIVKGRELLEKGLVRRIGNGRSTS 936 Query: 690 FWVDVW 707 W W Sbjct: 937 LWYHWW 942 >gb|EEC76774.1| hypothetical protein OsI_14862 [Oryza sativa Indica Group] Length = 1860 Score = 108 bits (270), Expect = 5e-21 Identities = 69/243 (28%), Positives = 116/243 (47%), Gaps = 11/243 (4%) Frame = +3 Query: 12 LKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHKSVS 191 L+ +C S QK+NL + +FF Q P +K ++ T + + YLGMP S Sbjct: 1477 LQAYCKASGQKINLQKSSIFFGQNCPE-DIKNSVKETLQVSVEILQDTYLGMPTEIGRAS 1535 Query: 192 SGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRF 371 +G++ FL E + +++++W R MS A + +++ A A +Y M + PVST ++ Sbjct: 1536 TGSFHFLPERVWRRVNSWNDRPMSRAGKETMLKAVAQAIPTYVMSCFKLPVSTCEKMKSC 1595 Query: 372 TRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNL-----RSLAKLAWRF*GNL---- 524 WG G++ M+ S + PKS GG+GFR+L LA+ WR +L Sbjct: 1596 ILDHWWGFEDGKKKMHWRSWEWMTTPKSLGGMGFRDLGLFNQAMLARQGWRIVTDLVSLC 1655 Query: 525 
--IIYGQEFCILNMVTLRRWILESSGGLLHIHGSLIEGYRLLRQGLEWEVGDGCDARFWV 698 ++ G+ F ++ W S++ G LLR+G+ W +G+G + Sbjct: 1656 ARVLKGRYFPNSDL-----WNAPKPTATSFTWRSILFGRDLLRKGVRWGIGNGSSVKILK 1710 Query: 699 DVW 707 D W Sbjct: 1711 DHW 1713 >gb|EEC74955.1| hypothetical protein OsI_10942 [Oryza sativa Indica Group] Length = 961 Score = 108 bits (270), Expect = 5e-21 Identities = 68/234 (29%), Positives = 110/234 (47%), Gaps = 17/234 (7%) Frame = +3 Query: 66 VFFIQCVPS---TSVKGNISTTFGIPISTNLGMYLGMPVLHKSVSSGTYDFLVENLEKKL 236 + ++ C S ++KG I T + ++ YLG+P + +G + L E EK+L Sbjct: 457 LLWLMCAQSHCDETIKGTIKTVLQVQQASFDDKYLGLPTPLGRMKAGRFQALKERFEKRL 516 Query: 237 SNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQLMRFTRYFLWGSSAGQRTM 416 S W + +SM + +LI++ A +Y M R P S + M+ R F WG R + Sbjct: 517 SEWCEKNLSMGGKEVLIKSVLQALPTYVMSVFRLPASLCEEYMQLIRKFWWGEDQNNRKV 576 Query: 417 YTVS*RKVCRPKSSGGLGFRNLR-----SLAKLAWRF*GNLIIYGQEFCILNMVTLRRWI 581 + +S +++ +PK GG+GFR+L+ LA+ AWR LI Y C V ++ Sbjct: 577 HWISWQQLIKPKGQGGIGFRDLKLFNQALLARQAWR----LIQYPSSLCA--QVLKAKYF 630 Query: 582 LESSGGLL---------HIHGSLIEGYRLLRQGLEWEVGDGCDARFWVDVWFKR 716 SG L+ ++ G LL++GL W + DG + W D W R Sbjct: 631 --PSGDLIDTAFPVDSSETWKGIMHGLELLKKGLIWRISDGSKVKIWRDNWIPR 682 >gb|AAF97969.1|AC000103_19 F21J9.30 [Arabidopsis thaliana] Length = 1270 Score = 107 bits (268), Expect = 8e-21 Identities = 74/247 (29%), Positives = 121/247 (48%), Gaps = 12/247 (4%) Frame = +3 Query: 3 ERLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHK 182 +++LK + + + Q +NL + + F + V +KG I T GI G YLG+P Sbjct: 663 QKILKVYGNATGQTINLNKSSITFGEKVDE-QLKGTIRTCLGIFTEGGAGTYLGLPECFS 721 Query: 183 SVSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQL 362 +L + L++KL W R +S + +L+++ A A +AM + P++T L Sbjct: 722 GSKVDMLHYLKDRLKEKLDVWFTRCLSQGGKEVLLKSVALAMPVFAMSCFKLPITTCENL 781 Query: 363 MRFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GNLI 527 F W S R ++ S ++C PK SGGLGFR+++S LAK AWR L+ Sbjct: 782 ESAMASFWWDSCDHSRKIHWQSWERLCLPKDSGGLGFRDIQSFNQALLAKQAWR----LL 837 Query: 528 
IYGQEFCILNMVTLRRW-----ILESSGGLLHIHG--SLIEGYRLLRQGLEWEVGDGCDA 686 + C+L+ + R+ L+++ G S++ G LL +GL+ VGDG Sbjct: 838 HFPD--CLLSRLLKSRYFDATDFLDAALSQRPSFGWRSILFGRELLSKGLQKRVGDGASL 895 Query: 687 RFWVDVW 707 W+D W Sbjct: 896 FVWIDPW 902 >gb|AAG03119.1|AC004133_13 F5A9.24 [Arabidopsis thaliana] Length = 1254 Score = 107 bits (268), Expect = 8e-21 Identities = 74/247 (29%), Positives = 121/247 (48%), Gaps = 12/247 (4%) Frame = +3 Query: 3 ERLLKDFCSQSSQKVNLGNTVVFFIQCVPSTSVKGNISTTFGIPISTNLGMYLGMPVLHK 182 +++LK + + + Q +NL + + F + V +KG I T GI G YLG+P Sbjct: 666 QKILKVYGNATGQTINLNKSSITFGEKVDE-QLKGTIRTCLGIFTEGGAGTYLGLPECFS 724 Query: 183 SVSSGTYDFLVENLEKKLSNWKGRMMSMAARAILIQTSAAAFSSYAMQTSRTPVSTLNQL 362 +L + L++KL W R +S + +L+++ A A +AM + P++T L Sbjct: 725 GSKVDMLHYLKDRLKEKLDVWFTRCLSQGGKEVLLKSVALAMPVFAMSCFKLPITTCENL 784 Query: 363 MRFTRYFLWGSSAGQRTMYTVS*RKVCRPKSSGGLGFRNLRS-----LAKLAWRF*GNLI 527 F W S R ++ S ++C PK SGGLGFR+++S LAK AWR L+ Sbjct: 785 ESAMASFWWDSCDHSRKIHWQSWERLCLPKDSGGLGFRDIQSFNQALLAKQAWR----LL 840 Query: 528 IYGQEFCILNMVTLRRW-----ILESSGGLLHIHG--SLIEGYRLLRQGLEWEVGDGCDA 686 + C+L+ + R+ L+++ G S++ G LL +GL+ VGDG Sbjct: 841 HFPD--CLLSRLLKSRYFDATDFLDAALSQRPSFGWRSILFGRELLSKGLQKRVGDGASL 898 Query: 687 RFWVDVW 707 W+D W Sbjct: 899 FVWIDPW 905