BLASTX nr result
ID: Catharanthus22_contig00016978
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00016978
         (1137 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

emb|CAN83285.1| hypothetical protein VITISV_004139 [Vitis vinifera]  219   3e-66
emb|CAN60809.1| hypothetical protein VITISV_044451 [Vitis vinifera]  185   2e-55
gb|EPS73343.1| hypothetical protein M569_01413 [Genlisea aurea]      199   2e-48
emb|CAB40035.1| retrotransposon like protein [Arabidopsis thalia...  163   9e-48
gb|AAC35532.1| contains similarity to proteases [Arabidopsis tha...  162   2e-47
emb|CAN73071.1| hypothetical protein VITISV_032383 [Vitis vinifera]  167   1e-46
emb|CAN78022.1| hypothetical protein VITISV_015518 [Vitis vinifera]  165   4e-46
emb|CAN61640.1| hypothetical protein VITISV_021909 [Vitis vinifera]  164   1e-45
gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Ar...  155   2e-45
gb|AAD43604.1|AC005698_3 T3P18.3 [Arabidopsis thaliana]              162   3e-45
gb|AAK51235.1|AF287471_1 polyprotein [Arabidopsis thaliana]          162   9e-45
emb|CAA19715.1| putative protein [Arabidopsis thaliana] gi|72695...  155   1e-44
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]        160   2e-44
emb|CAN71553.1| hypothetical protein VITISV_034738 [Vitis vinifera]  164   3e-44
emb|CAN72527.1| hypothetical protein VITISV_009255 [Vitis vinifera]  157   3e-44
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 
153 7e-44 emb|CAN67588.1| hypothetical protein VITISV_036280 [Vitis vinifera] 154 1e-43 emb|CAN75478.1| hypothetical protein VITISV_020209 [Vitis vinifera] 159 5e-43 emb|CAN64816.1| hypothetical protein VITISV_010668 [Vitis vinifera] 154 3e-42 gb|AAR88589.1| putative copia-like retrotransposon protein [Oryz... 160 1e-41 >emb|CAN83285.1| hypothetical protein VITISV_004139 [Vitis vinifera] Length = 1556 Score = 219 bits (559), Expect(2) = 3e-66 Identities = 113/177 (63%), Positives = 133/177 (75%), Gaps = 2/177 (1%) Frame = +3 Query: 444 QDAIYKPNRKYALLAT--GSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPR 617 Q I KPN KYAL +T + IPR +I+ AL H GWK AMDEE+QALH N+TW LVPR Sbjct: 980 QRGIIKPNPKYALTSTTNSTSIPREPHNIRDALAHPGWKAAMDEELQALHTNKTWVLVPR 1039 Query: 618 DSSMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAF 797 S+MHVIG KWV KPKLKP+GSLDRLKAR+VAK Y Q+ G+D T TF VIK GTIR+ Sbjct: 1040 TSNMHVIGSKWVFKPKLKPDGSLDRLKARVVAKGYHQVDGLDYTETFSPVIKPGTIRMVL 1099 Query: 798 SLALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 ++ALV IRQLDV N+FLHG++ EDI+MEQ PGM D + P HVCKLQ LYGLK+ Sbjct: 1100 TIALVKKWPIRQLDVKNAFLHGLISEDIHMEQPPGMADLEHPTHVCKLQKALYGLKQ 1156 Score = 60.8 bits (146), Expect(2) = 3e-66 Identities = 26/33 (78%), Positives = 29/33 (87%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWFDRFS FLLKYGF SLADPSLF+F+ Sbjct: 1155 KQAPRAWFDRFSAFLLKYGFFCSLADPSLFIFH 1187 >emb|CAN60809.1| hypothetical protein VITISV_044451 [Vitis vinifera] Length = 377 Score = 185 bits (469), Expect(2) = 2e-55 Identities = 91/136 (66%), Positives = 108/136 (79%) Frame = +3 Query: 561 MDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGV 740 MDEE+ ALH+N+TW LVPR S MHVIG KWV KPKLKP+GSLDRLKAR+VAK Y Q+ G+ Sbjct: 1 MDEELDALHKNKTWVLVPRTSDMHVIGSKWVFKPKLKPDGSLDRLKARVVAKGYHQVDGL 60 Query: 741 D*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKF 920 D T TF LVIK GTIR+ ++ALV I QLDV N+FLHG+++EDI+MEQ PGM D + Sbjct: 61 DYTKTFSLVIKLGTIRMVITIALVQKWPICQLDVKNAFLHGLILEDIHMEQPPGMADLEH 120 
Query: 921 PNHVCKLQ*TLYGLKK 968 P HVCKLQ LYGLK+ Sbjct: 121 PTHVCKLQKALYGLKQ 136 Score = 58.9 bits (141), Expect(2) = 2e-55 Identities = 25/33 (75%), Positives = 29/33 (87%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWFDRFS FLLKYGF SLA+PSLF+F+ Sbjct: 135 KQAPRAWFDRFSAFLLKYGFFCSLANPSLFIFH 167 >gb|EPS73343.1| hypothetical protein M569_01413 [Genlisea aurea] Length = 776 Score = 199 bits (505), Expect = 2e-48 Identities = 102/168 (60%), Positives = 121/168 (72%), Gaps = 1/168 (0%) Frame = +3 Query: 429 PWSLPQDAIYKPNRKYALLATGSD-IPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWS 605 P + Q I KPN KYAL + D IP K++K A+ H GWK AM+EE+ ALH N+TW Sbjct: 607 PRTRSQSGIVKPNPKYALYISHFDSIPLEPKTVKEAISHPGWKAAMEEELAALHHNETWL 666 Query: 606 LVPRDSSMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTI 785 LVPRD + H IGCKWV K KL+P+GSLDRLKARLVAK Y QI GVD T TF VI+ GTI Sbjct: 667 LVPRDEASHTIGCKWVFKTKLRPDGSLDRLKARLVAKGYNQIDGVDYTETFSPVIRPGTI 726 Query: 786 RLAFSLALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNH 929 RL S+AL + DIRQLDV N+FLHG + EDI+MEQ PGM + +FP H Sbjct: 727 RLVLSIALTRNWDIRQLDVKNAFLHGKISEDIFMEQPPGMNNSQFPAH 774 >emb|CAB40035.1| retrotransposon like protein [Arabidopsis thaliana] gi|7267767|emb|CAB81170.1| retrotransposon like protein [Arabidopsis thaliana] Length = 1515 Score = 163 bits (412), Expect(2) = 9e-48 Identities = 91/202 (45%), Positives = 122/202 (60%), Gaps = 5/202 (2%) Frame = +3 Query: 378 PPLTSLIYLFL-----SHITITPWSLPQDAIYKPNRKYALLATGSDIPRISKSIKSALRH 542 PPL S+I SH IT + I KPN KYAL + S+ P KS+K AL+ Sbjct: 874 PPLQSVISSTTAAPETSHPMITR---AKSGITKPNPKYALFSVKSNYPE-PKSVKEALKD 929 Query: 543 DGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRY 722 +GW AM EEM +H+ TW LVP + ++GCKWV K KL +GSLDRLKARLVA+ Y Sbjct: 930 EGWTNAMGEEMGTMHETDTWDLVPPEMVDRLLGCKWVFKTKLNSDGSLDRLKARLVARGY 989 Query: 723 RQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPG 902 Q GVD T+ V++ T+R +A ++ ++QLDV N+FLH L E ++M Q PG Sbjct: 990 EQEEGVDYVETYSPVVRSATVRSILHVATINKWSLKQLDVKNAFLHDELKETVFMTQPPG 1049 
Query: 903 MTDPKFPNHVCKLQ*TLYGLKK 968 DP P++VCKL+ +Y LK+ Sbjct: 1050 FEDPSRPDYVCKLKKAIYDLKQ 1071 Score = 55.5 bits (132), Expect(2) = 9e-48 Identities = 22/32 (68%), Positives = 29/32 (90%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVF 1059 K+ PRAWFD+FS++LLKYGF+ S +DPSLFV+ Sbjct: 1070 KQAPRAWFDKFSSYLLKYGFICSFSDPSLFVY 1101 >gb|AAC35532.1| contains similarity to proteases [Arabidopsis thaliana] Length = 1392 Score = 162 bits (410), Expect(2) = 2e-47 Identities = 92/230 (40%), Positives = 124/230 (53%), Gaps = 28/230 (12%) Frame = +3 Query: 363 YLCQHPPLTSLIYLFLSHITITPWSLPQDAIY---------------------------- 458 Y C HPP T +Y+ H+ P IY Sbjct: 722 YRCLHPP-TGKVYI-CRHVLFDERKFPYSDIYSQFQTISGSPLFTAWQKGFSSTALSRIT 779 Query: 459 KPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVI 638 KPN KYAL + S+ P KS+K AL+ +GW AM EEM +H+ TW LVP + ++ Sbjct: 780 KPNPKYALFSVKSNYPE-PKSVKEALKDEGWTNAMGEEMGTMHETDTWDLVPPEMVDRLL 838 Query: 639 GCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH 818 GCKWV K KL +GSLDRLKARLVA+ Y Q GVD T+ V++ T+R +A ++ Sbjct: 839 GCKWVFKTKLNSDGSLDRLKARLVARGYEQEEGVDYVETYSPVVRSATVRSILHVATINK 898 Query: 819 *DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 ++QLDV N+FLH L E ++M Q PG DP P++VCKL+ +Y LK+ Sbjct: 899 WSLKQLDVKNAFLHDELKETVFMTQPPGFEDPSRPDYVCKLKKAIYDLKQ 948 Score = 55.5 bits (132), Expect(2) = 2e-47 Identities = 22/32 (68%), Positives = 29/32 (90%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVF 1059 K+ PRAWFD+FS++LLKYGF+ S +DPSLFV+ Sbjct: 947 KQAPRAWFDKFSSYLLKYGFICSFSDPSLFVY 978 >emb|CAN73071.1| hypothetical protein VITISV_032383 [Vitis vinifera] Length = 1239 Score = 167 bits (423), Expect(2) = 1e-46 Identities = 82/151 (54%), Positives = 107/151 (70%) Frame = +3 Query: 516 KSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRL 695 K KSA ++ W AMDEE+QAL QN TW LVPR + +++G KWV + K P+GS++RL Sbjct: 708 KGFKSAAKNPAWLAAMDEEVQALQQNGTWILVPRPVNTNIVGSKWVFRTKYFPDGSVERL 767 Query: 696 
KARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIE 875 KARLVAK Y Q+ G+D T TF V+K T+R+ SLA+ + +RQLDV N+FL+G L E Sbjct: 768 KARLVAKGYTQVPGLDYTDTFSPVVKATTVRVVLSLAVTNKWPLRQLDVKNAFLNGTLTE 827 Query: 876 DIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 +YMEQ PG DP+FP HVC L+ LYGLK+ Sbjct: 828 HVYMEQPPGYIDPRFPTHVCLLKKALYGLKQ 858 Score = 47.8 bits (112), Expect(2) = 1e-46 Identities = 22/33 (66%), Positives = 25/33 (75%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWF RFS+FLL GF S AD SLFVF+ Sbjct: 857 KQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFH 889 >emb|CAN78022.1| hypothetical protein VITISV_015518 [Vitis vinifera] Length = 1501 Score = 165 bits (418), Expect(2) = 4e-46 Identities = 81/151 (53%), Positives = 106/151 (70%) Frame = +3 Query: 516 KSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRL 695 K KSA ++ W AMDEE+QAL QN TW LVPR + +++G KWV + K P+GS++RL Sbjct: 825 KGFKSAAKNPAWLAAMDEEVQALQQNGTWILVPRPVNTNIVGSKWVFRTKYLPDGSVERL 884 Query: 696 KARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIE 875 KARLVAK Y + G+D T TF V+K T+R+ SLA+ + +RQLDV N+FL+G L E Sbjct: 885 KARLVAKGYTXVPGLDYTDTFSPVVKATTVRVVLSLAVTNKWPLRQLDVKNAFLNGTLTE 944 Query: 876 DIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 +YMEQ PG DP+FP HVC L+ LYGLK+ Sbjct: 945 HVYMEQPPGYIDPRFPTHVCLLKKALYGLKQ 975 Score = 47.8 bits (112), Expect(2) = 4e-46 Identities = 22/33 (66%), Positives = 25/33 (75%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWF RFS+FLL GF S AD SLFVF+ Sbjct: 974 KQAPRAWFQRFSSFLLTLGFSCSRADTSLFVFH 1006 >emb|CAN61640.1| hypothetical protein VITISV_021909 [Vitis vinifera] Length = 1361 Score = 164 bits (414), Expect(2) = 1e-45 Identities = 77/151 (50%), Positives = 107/151 (70%) Frame = +3 Query: 516 KSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRL 695 + KSA +H W AMD+E+ AL +N TW LVPR +V+GC+W+ K KL +GS++R Sbjct: 841 RGFKSAAKHPEWLSAMDDEIHALKKNDTWVLVPRPQHHNVVGCRWIFKTKLHSDGSIERH 900 Query: 696 KARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIE 875 KARLVA+ + 
Q++G+D TF V++ T+R+ SLA+ S + QLDV N+FLHG L E Sbjct: 901 KARLVAQGFSQVHGLDFGDTFSPVVRPATVRIILSLAVTSGWRLHQLDVKNAFLHGFLNE 960 Query: 876 DIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 ++YMEQ PG TDP+FP HVC+L+ LYGLK+ Sbjct: 961 EVYMEQPPGYTDPQFPQHVCRLKRALYGLKQ 991 Score = 47.4 bits (111), Expect(2) = 1e-45 Identities = 21/33 (63%), Positives = 26/33 (78%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWF RFS+FLLK+GF S AD SLF ++ Sbjct: 990 KQAPRAWFHRFSSFLLKHGFHSSQADSSLFFYH 1022 >gb|AAF02855.1|AC009324_4 Similar to retrotransposon proteins [Arabidopsis thaliana] Length = 1522 Score = 155 bits (392), Expect(2) = 2e-45 Identities = 79/175 (45%), Positives = 112/175 (64%) Frame = +3 Query: 444 QDAIYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDS 623 ++ I KPN++Y LL IP K++ AL+H GW AM EEM + +TW+LVP Sbjct: 894 KEGISKPNKRYVLLTHKVSIPE-PKTVTEALKHPGWNNAMQEEMGNCKETETWTLVPYSP 952 Query: 624 SMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSL 803 +M+V+G WV + KL +GSLD+LKARLVAK ++Q G+D T+ V++ T+RL + Sbjct: 953 NMNVLGSMWVFRTKLHADGSLDKLKARLVAKGFKQEEGIDYLETYSPVVRTPTVRLILHV 1012 Query: 804 ALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 A V +++Q+DV N+FLHG L E +YM Q G D P+HVC L +LYGLK+ Sbjct: 1013 ATVLKWELKQMDVKNAFLHGDLTETVYMRQPAGFVDKSKPDHVCLLHKSLYGLKQ 1067 Score = 55.1 bits (131), Expect(2) = 2e-45 Identities = 28/58 (48%), Positives = 34/58 (58%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFYCXXXXXXXXXXXXXXXXTGLDSQLL 1137 K+ PRAWFDRFS FLL++GF+ SL DPSLFV+ TG +SQ L Sbjct: 1066 KQSPRAWFDRFSNFLLEFGFICSLFDPSLFVYSSNNDVILLLLYVDDMVITGNNSQSL 1123 >gb|AAD43604.1|AC005698_3 T3P18.3 [Arabidopsis thaliana] Length = 1309 Score = 162 bits (410), Expect(2) = 3e-45 Identities = 82/175 (46%), Positives = 112/175 (64%) Frame = +3 Query: 444 QDAIYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDS 623 +D I KPN +YAL+ + S K+I +A++H GW A+ +E+ +H TWSLVP Sbjct: 699 KDGIQKPNPRYALIVSKSSFDE-PKTITTAMKHPGWNAAVMDEIDRIHMLNTWSLVPATE 757 Query: 624 
SMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSL 803 M+++ KWV K KLKP+G++D+LKARLVAK + Q GVD TF V++ TIRL Sbjct: 758 DMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDT 817 Query: 804 ALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 A + ++QLDV N+FLHG L E ++M Q G DP PNHVC+L LYGLK+ Sbjct: 818 ATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQ 872 Score = 47.8 bits (112), Expect(2) = 3e-45 Identities = 21/31 (67%), Positives = 24/31 (77%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFV 1056 K+ PRAWFD FS FLL +GF S +DPSLFV Sbjct: 871 KQAPRAWFDTFSNFLLDFGFECSTSDPSLFV 901 >gb|AAK51235.1|AF287471_1 polyprotein [Arabidopsis thaliana] Length = 1453 Score = 162 bits (411), Expect(2) = 9e-45 Identities = 83/172 (48%), Positives = 113/172 (65%) Frame = +3 Query: 453 IYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMH 632 I+K N +YALL + + KSI AL H GW A+++EM+ +H TWSLV M+ Sbjct: 867 IHKSNTRYALLTSKFSVEE-PKSIDEALNHPGWNNAVNDEMRTIHMLHTWSLVQPTEDMN 925 Query: 633 VIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALV 812 ++GC+WV K KLKP+GS+D+LKARLVAK + Q G+D TF V++ TIRL +A Sbjct: 926 ILGCRWVFKTKLKPDGSVDKLKARLVAKGFHQEEGLDYLETFSPVVRTATIRLVLDVATA 985 Query: 813 SH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 +I+QLDV N+FLHG L E +YM Q PG D + P++VC+L LYGLK+ Sbjct: 986 KGWNIKQLDVSNAFLHGELKEPVYMLQPPGFVDQEKPSYVCRLTKALYGLKQ 1037 Score = 45.8 bits (107), Expect(2) = 9e-45 Identities = 23/58 (39%), Positives = 29/58 (50%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFYCXXXXXXXXXXXXXXXXTGLDSQLL 1137 K+ PRAWFD S +LL +GF S +DPSLF ++ TG D LL Sbjct: 1036 KQAPRAWFDTISNYLLDFGFSCSKSDPSLFTYHKNGKTLVLLLYVDDILLTGSDHNLL 1093 >emb|CAA19715.1| putative protein [Arabidopsis thaliana] gi|7269574|emb|CAB79576.1| putative protein [Arabidopsis thaliana] Length = 1318 Score = 155 bits (391), Expect(2) = 1e-44 Identities = 80/172 (46%), Positives = 110/172 (63%) Frame = +3 Query: 453 IYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMH 632 I 
KPN +Y L+ P K++ +AL+H GW AM EE+ + QTWSLVP S MH Sbjct: 695 ISKPNPRYVFLSHKVSYPE-PKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMH 753 Query: 633 VIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALV 812 V+G KWV + KL +G+L++LKAR+VAK + Q G+D T+ V++ T+RL LA Sbjct: 754 VLGSKWVFRTKLHADGTLNKLKARIVAKGFLQEEGIDYLETYSPVVRTPTVRLVLHLATA 813 Query: 813 SH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 + DI+Q+DV N+FLHG L E +YM Q G DP P+HVC L ++YGLK+ Sbjct: 814 LNWDIKQMDVKNAFLHGDLKETVYMTQPAGFVDPSKPDHVCLLHKSIYGLKQ 865 Score = 53.1 bits (126), Expect(2) = 1e-44 Identities = 21/32 (65%), Positives = 28/32 (87%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVF 1059 K+ PRAWFD+FSTFLL++GF S +DPSLF++ Sbjct: 864 KQSPRAWFDKFSTFLLEFGFFCSKSDPSLFIY 895 >emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] Length = 1466 Score = 160 bits (404), Expect(2) = 2e-44 Identities = 81/175 (46%), Positives = 111/175 (63%) Frame = +3 Query: 444 QDAIYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDS 623 +D I KPN +YAL+ + S K+I +A++H W A+ +E+ +H TWSLVP Sbjct: 856 KDGIQKPNPRYALIVSKSSFDE-PKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLVPATE 914 Query: 624 SMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSL 803 M+++ KWV K KLKP+G++D+LKARLVAK + Q GVD TF V++ TIRL Sbjct: 915 DMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDT 974 Query: 804 ALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 A + ++QLDV N+FLHG L E ++M Q G DP PNHVC+L LYGLK+ Sbjct: 975 ATANEWPLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQ 1029 Score = 47.8 bits (112), Expect(2) = 2e-44 Identities = 21/31 (67%), Positives = 24/31 (77%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFV 1056 K+ PRAWFD FS FLL +GF S +DPSLFV Sbjct: 1028 KQAPRAWFDTFSNFLLDFGFECSTSDPSLFV 1058 >emb|CAN71553.1| hypothetical protein VITISV_034738 [Vitis vinifera] Length = 1312 Score = 164 bits (414), Expect(2) = 3e-44 Identities = 83/151 (54%), Positives = 107/151 (70%) Frame = +3 Query: 516 
KSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRL 695 +++K AL+ WK AM++E QAL +NQTWSLVP S+ +IGCKWV K K KPNGS+DR Sbjct: 800 RTVKQALQDPNWKVAMEQEYQALLKNQTWSLVPPPSNAKIIGCKWVFKLKHKPNGSIDRY 859 Query: 696 KARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIE 875 KARLVA+ + Q YG+D TF V+K TIRL S+A+ S+ I+QLDV N+FL+G L E Sbjct: 860 KARLVAQGFHQTYGIDFFETFSPVVKPCTIRLVLSIAVSSNWPIKQLDVHNAFLNGDLQE 919 Query: 876 DIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 ++M Q PG D P HVC+LQ LYGLK+ Sbjct: 920 QVFMMQPPGFEDNSCPTHVCRLQKALYGLKQ 950 Score = 42.7 bits (99), Expect(2) = 3e-44 Identities = 19/33 (57%), Positives = 24/33 (72%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWF + S+FLL+ GF S AD SLF F+ Sbjct: 949 KQAPRAWFHKLSSFLLQIGFQCSRADASLFYFH 981 >emb|CAN72527.1| hypothetical protein VITISV_009255 [Vitis vinifera] Length = 1095 Score = 157 bits (397), Expect(2) = 3e-44 Identities = 81/165 (49%), Positives = 108/165 (65%) Frame = +3 Query: 474 YALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWV 653 +ALLAT K KSA ++ W MD+E++AL N TW LVPR S+ +++G KWV Sbjct: 573 HALLATFEP-----KGFKSAAKNPAWLATMDDEIKALQTNHTWDLVPRPSNTNIVGSKWV 627 Query: 654 LKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQ 833 + K +GS++R KARLVAK Y Q+ G+D TF V+K T+R+ SLA+ +RQ Sbjct: 628 FRTKFLSDGSIERFKARLVAKGYTQLPGLDYKDTFSPVVKASTVRVVLSLAVSHKWPLRQ 687 Query: 834 LDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 LDV N FL+G+L E +YMEQ PG DP+ P HVCKL+ LYGLK+ Sbjct: 688 LDVKNVFLNGILHETVYMEQPPGYVDPRHPLHVCKLKKALYGLKQ 732 Score = 49.3 bits (116), Expect(2) = 3e-44 Identities = 22/32 (68%), Positives = 25/32 (78%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVF 1059 K+ PRAWF RFS+FLLK GF + AD SLFVF Sbjct: 731 KQAPRAWFQRFSSFLLKLGFFCNCADTSLFVF 762 >gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from Arabidopsis thaliana BAC gb|AF080119 and is a member of the reverse transcriptase family PF|00078 [Arabidopsis thaliana] Length = 1415 Score = 153 bits (387), Expect(2) 
= 7e-44 Identities = 79/172 (45%), Positives = 109/172 (63%) Frame = +3 Query: 453 IYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMH 632 I KPN +YAL+ + + K++ SA++H GW A+ EE+ +H TWSLVP M+ Sbjct: 841 IQKPNTRYALITSRMNTAE-PKTLASAMKHPGWNEAVHEEINRVHMLHTWSLVPPTDDMN 899 Query: 633 VIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALV 812 ++ KWV K KL P+GS+D+LKARLVAK + Q GVD TF V++ TIRL ++ Sbjct: 900 ILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDVSTS 959 Query: 813 SH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 I+QLDV N+FLHG L E ++M Q G DP+ P HVC+L +YGLK+ Sbjct: 960 KGWPIKQLDVSNAFLHGELQEPVFMYQPSGFIDPQKPTHVCRLTKAIYGLKQ 1011 Score = 52.0 bits (123), Expect(2) = 7e-44 Identities = 27/58 (46%), Positives = 31/58 (53%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFYCXXXXXXXXXXXXXXXXTGLDSQLL 1137 K+ PRAWFD FS FLL YGF+ S +DPSLFV + TG D LL Sbjct: 1010 KQAPRAWFDTFSNFLLDYGFVCSKSDPSLFVCHQDGKILYLLLYVDDILLTGSDQSLL 1067 >emb|CAN67588.1| hypothetical protein VITISV_036280 [Vitis vinifera] Length = 1379 Score = 154 bits (390), Expect(2) = 1e-43 Identities = 80/165 (48%), Positives = 108/165 (65%) Frame = +3 Query: 474 YALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWV 653 +ALLAT K KSA ++ W AMD++++AL N TW LVPR S+ +++G KWV Sbjct: 270 HALLATSEP-----KGFKSAAKNPAWLAAMDDKIKALQTNHTWDLVPRPSNTNIVGSKWV 324 Query: 654 LKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQ 833 + K GS++R KARLVAK Y Q+ G+D TF V+K T+R+ SLA+ +RQ Sbjct: 325 FRTKFLSYGSIERFKARLVAKGYTQLPGLDYKDTFSPVVKASTVRVVLSLAVSHKWPLRQ 384 Query: 834 LDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 LDV N+FL+G+L E +YMEQ G DP+ P HVCKL+ LYGLK+ Sbjct: 385 LDVKNAFLNGILHETVYMEQPLGYVDPRHPLHVCKLKKALYGLKQ 429 Score = 50.1 bits (118), Expect(2) = 1e-43 Identities = 23/32 (71%), Positives = 25/32 (78%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVF 1059 K+ PRAWF RFS+FLLK GF S AD SLFVF Sbjct: 428 KQAPRAWFQRFSSFLLKLGFFCSRADTSLFVF 459 >emb|CAN75478.1| hypothetical protein VITISV_020209 
[Vitis vinifera] Length = 1074 Score = 159 bits (403), Expect(2) = 5e-43 Identities = 81/151 (53%), Positives = 105/151 (69%) Frame = +3 Query: 516 KSIKSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRL 695 +++K L+ WK AM++E QAL +NQTWSLVP S+ +I CKWV K K KPNGS+DR Sbjct: 546 RTVKQTLQDPNWKLAMEQEYQALLKNQTWSLVPPPSNAKIIRCKWVFKLKHKPNGSIDRY 605 Query: 696 KARLVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIE 875 KARLVA+ + Q YG+D TF V+K TIRL S+A+ S+ I+QLDV N+FL+G L E Sbjct: 606 KARLVAQGFHQTYGIDFFETFSPVVKPCTIRLVLSIAVSSNWPIKQLDVHNAFLNGDLQE 665 Query: 876 DIYMEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 ++M Q PG D P HVC+LQ LYGLK+ Sbjct: 666 QVFMMQPPGFEDSSCPTHVCRLQKALYGLKQ 696 Score = 43.1 bits (100), Expect(2) = 5e-43 Identities = 19/33 (57%), Positives = 25/33 (75%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+GPRAWF + S+FLL+ GF S A+ SLF F+ Sbjct: 695 KQGPRAWFHKLSSFLLQIGFQCSRANASLFYFH 727 >emb|CAN64816.1| hypothetical protein VITISV_010668 [Vitis vinifera] Length = 1212 Score = 154 bits (388), Expect(2) = 3e-42 Identities = 74/148 (50%), Positives = 105/148 (70%) Frame = +3 Query: 525 KSALRHDGWKCAMDEEMQALHQNQTWSLVPRDSSMHVIGCKWVLKPKLKPNGSLDRLKAR 704 K A +H W AMD+E+QAL N TW LVP ++ +++G +W+ K KL+ +GS++ KAR Sbjct: 738 KPATKHPEWLSAMDDEIQALKTNDTWVLVPCPTNHNMVGYRWIFKTKLQLDGSIEHHKAR 797 Query: 705 LVAKRYRQIYGVD*TGTFLLVIKFGTIRLAFSLALVSH*DIRQLDV*NSFLHGVLIEDIY 884 LVA+ + QI+G+D TF V++ TIR+ SLA+ S + QLDV N+FLHG L E++Y Sbjct: 798 LVAQGFSQIHGLDFGDTFSPVVRLATIRIILSLAITSGWRLHQLDVKNAFLHGFLNEEVY 857 Query: 885 MEQSPGMTDPKFPNHVCKLQ*TLYGLKK 968 MEQ PG T+P+FP HVC+L+ LYGLK+ Sbjct: 858 MEQPPGYTNPQFPQHVCRLKRALYGLKQ 885 Score = 46.2 bits (108), Expect(2) = 3e-42 Identities = 20/33 (60%), Positives = 25/33 (75%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVFY 1062 K+ PRAWF RFS+FLLK GF + D SLFV++ Sbjct: 884 KQAPRAWFHRFSSFLLKLGFFSNQXDSSLFVYH 916 >gb|AAR88589.1| putative copia-like retrotransposon protein [Oryza sativa Japonica Group] Length = 1399 Score = 160 bits (405), Expect(2) 
= 1e-41 Identities = 89/181 (49%), Positives = 115/181 (63%) Frame = +3 Query: 426 TPWSLPQDAIYKPNRKYALLATGSDIPRISKSIKSALRHDGWKCAMDEEMQALHQNQTWS 605 T ++P+ A+ P R L +G R KS++ A+ + WK AMD E AL +N+TW Sbjct: 860 TDQAMPEAAV-APIRPKTRLQSGI---RKEKSLEEAVNNKHWKEAMDAEYMALIENKTWH 915 Query: 606 LVPRDSSMHVIGCKWVLKPKLKPNGSLDRLKARLVAKRYRQIYGVD*TGTFLLVIKFGTI 785 LVP +VI CKWV K K K +GSLDR KARLVAK ++Q YG+D TF V+K TI Sbjct: 916 LVPPQKGRNVIDCKWVYKVKRKADGSLDRYKARLVAKGFKQRYGIDYEDTFSPVVKAATI 975 Query: 786 RLAFSLALVSH*DIRQLDV*NSFLHGVLIEDIYMEQSPGMTDPKFPNHVCKLQ*TLYGLK 965 R+ SLA+ +RQLDV N+FLHGVL E++YM+Q PG PN+VCKL LYGLK Sbjct: 976 RIVLSLAVSRGWSLRQLDVKNAFLHGVLEEEVYMKQPPGYEKKSMPNYVCKLDKALYGLK 1035 Query: 966 K 968 + Sbjct: 1036 Q 1036 Score = 37.7 bits (86), Expect(2) = 1e-41 Identities = 17/32 (53%), Positives = 22/32 (68%) Frame = +1 Query: 964 KKGPRAWFDRFSTFLLKYGFLYSLADPSLFVF 1059 K+ PRAW+ R ST L + GF+ S AD SLF + Sbjct: 1035 KQAPRAWYSRLSTKLSELGFVPSKADTSLFFY 1066