BLASTX nr result
ID: Cocculus23_contig00006964
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00006964
         (1081 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_007099662.1| Gag protease polyprotein-like protein [Theob...   126   1e-26
ref|XP_007028010.1| Gag protease polyprotein [Theobroma cacao] g...   126   1e-26
ref|XP_007023594.1| Gag protease polyprotein [Theobroma cacao] g...   125   3e-26
prf||1510387A retrotransposon del1-46                                 125   3e-26
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...   124   7e-26
ref|XP_007049818.1| Gag protease polyprotein [Theobroma cacao] g...   120   1e-24
ref|XP_007014287.1| Gag protease polyprotein [Theobroma cacao] g...   119   2e-24
emb|CAE03019.3| OSJNBa0091D06.9 [Oryza sativa Japonica Group]         119   2e-24
emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera]   119   3e-24
gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]    118   4e-24
gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. melo]    118   4e-24
ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, p...   118   5e-24
ref|XP_007027895.1| Gag protease polyprotein [Theobroma cacao] g...   118   5e-24
emb|CAE04240.1| OSJNBa0089N06.1 [Oryza sativa Japonica Group] gi...   117   6e-24
ref|NP_001176105.1| Os10g0357350 [Oryza sativa Japonica Group] g...   117   6e-24
emb|CAE04228.1| OSJNBa0011F23.1 [Oryza sativa Japonica Group]         117   6e-24
gb|ABA96609.1| retrotransposon protein, putative, Ty3-gypsy subc...   117   8e-24
emb|CAE04051.2| OSJNBb0062B06.9 [Oryza sativa Japonica Group]         117   8e-24
emb|CAH68033.1| OSIGBa0139N19-OSIGBa0137L10.2 [Oryza sativa Indi...
117 8e-24 emb|CAH66707.1| OSIGBa0147J19.11 [Oryza sativa Indica Group] 117 8e-24 >ref|XP_007099662.1| Gag protease polyprotein-like protein [Theobroma cacao] gi|508728474|gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao] Length = 665 Score = 126 bits (317), Expect = 1e-26 Identities = 92/316 (29%), Positives = 143/316 (45%), Gaps = 19/316 (6%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREW 845 SK ++E + F G +D T A+DWI+ + + ++LD++ ++ ATR+L+ AR W Sbjct: 46 SKKLKEARQLGCVSFTGELDATVAKDWINQVSKTLSDMRLDDDMKLMVATRLLEKRARTW 105 Query: 844 WLSVLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSK 665 W SV + +TWS F FD +YF + +K EF +LKQGN++V +YE +F L Sbjct: 106 WNSVKSRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELML 165 Query: 664 YAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPER 485 Y PD V +E + F GLR + + +T G + + E V AL R + + E Sbjct: 166 YVPDLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMAL-------RAEKLAIEN 218 Query: 484 QSVPQNSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 + + + +A +R S V G S S + V+ T+ P Sbjct: 219 RRI-RTEFAKRRNPG-----------MSSSQPVKRGKDSAISGSTTSVSVTSPRPPFPPS 266 Query: 304 QRLWQNFQR--LRGGGRG-SGYKRAASDG---------------CGKSGHKRHVCPN-AS 182 Q+ F R + G GR G R + G CG++GH R CP Sbjct: 267 QQRPSRFSRSAMTGSGRSFGGSDRCRNCGNYHSGLCREPTRCFQCGQTGHIRSNCPRLGR 326 Query: 181 TTMTIGSTPQHVDGER 134 T+ S+P D +R Sbjct: 327 ATVVASSSPARTDIQR 342 >ref|XP_007028010.1| Gag protease polyprotein [Theobroma cacao] gi|508716615|gb|EOY08512.1| Gag protease polyprotein [Theobroma cacao] Length = 404 Score = 126 bits (317), Expect = 1e-26 Identities = 88/308 (28%), Positives = 141/308 (45%), Gaps = 18/308 (5%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREW 845 SK ++E + F G +D T A+DWI+ + + + LD++ ++ ATR+L+ AR W Sbjct: 110 SKKLKEARQLGCVSFTGELDATVAKDWINQVSETLSDMGLDDDMKLMVATRLLEKRARTW 169 Query: 844 WLSVLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSK 665 W SV + +TWS F FD +YF + 
+K EF +LKQGN++V +YE +F L Sbjct: 170 WNSVKSRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELML 229 Query: 664 YAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPER 485 Y PD V +E + F GLR + + +T +G + + E V AL R + + E Sbjct: 230 YVPDLVKSEQDQASYFEEGLRNEIRERMTVIGREPHKEVVQMAL-------RAEKLATEN 282 Query: 484 QSVPQNSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 + + + +A +R G+ S V G S S + ++ T+ P Sbjct: 283 RRI-RTKFAKRR------NLGM-----SSSQPVKRGKDSATSGSTTSISVTSPRPPFPPS 330 Query: 304 QRLWQNFQR--LRGGGRG-SGYKRAASDG---------------CGKSGHKRHVCPNAST 179 Q+ F R + G G+ G+ R + G CG++GH R CP Sbjct: 331 QQRPSRFSRSAMTGSGKSLGGFDRCRNCGNYHSGLCRGPTRCFQCGQTGHIRSNCPQLGR 390 Query: 178 TMTIGSTP 155 S+P Sbjct: 391 ATVAASSP 398 >ref|XP_007023594.1| Gag protease polyprotein [Theobroma cacao] gi|508778960|gb|EOY26216.1| Gag protease polyprotein [Theobroma cacao] Length = 426 Score = 125 bits (314), Expect = 3e-26 Identities = 91/316 (28%), Positives = 145/316 (45%), Gaps = 19/316 (6%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREW 845 SK ++E + F G +D T A+DWI+ + + +KL+++ ++ ATR+L+ AR W Sbjct: 110 SKKLKEARQLGCVSFTGELDATVAKDWINQVSETLSDMKLNDDMKLMVATRLLEKRARTW 169 Query: 844 WLSVLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSK 665 W SV + +TWS F FD +YF + +K EF +LKQGN++V +YE +F L Sbjct: 170 WNSVKSRSATPQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELML 229 Query: 664 YAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPER 485 Y PD V +E + F GLR + + +T G + + E V AL R + + E Sbjct: 230 YVPDLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMAL-------RAEKLATEN 282 Query: 484 QSVPQNSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 + + + +A +R G + VK G S S + ++ T+ P Sbjct: 283 RRI-RTEFAKRR----NPGMSYSQSVK-------RGKDSAISRSTTSISVTSPRPPFPPS 330 Query: 304 QRLWQNFQR--LRGGGRG-SGYKRAASDG---------------CGKSGHKRHVCPN-AS 182 Q+ F R + G G+ G R + G CG++GH R CP Sbjct: 331 QQRPSRFSRSAMTGSGKSFGGSDRCRNCGNYHSGLCREPTRCFQCGQTGHIRSNCPRLGR 390 Query: 181 TTMTIGSTPQHVDGER 
134 T+ S+P D +R Sbjct: 391 ATVVASSSPARTDIQR 406 >prf||1510387A retrotransposon del1-46 Length = 1443 Score = 125 bits (314), Expect = 3e-26 Identities = 71/182 (39%), Positives = 105/182 (57%), Gaps = 3/182 (1%) Frame = -1 Query: 1021 KLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWW 842 +L++EF +P F G D EA WI ++ +I TL + +E V+ A+ L+ +A WW Sbjct: 55 RLIKEFKGLNPPIFKGDPDPLEAHRWIRHVTKILDTLGVTDEQKVILASFQLQGEAEFWW 114 Query: 841 LSVLNVHKEVKT---WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRL 671 + + ++ T W +F E+F EK+FP RD LE F L QG+++V QYE KF L Sbjct: 115 DAKVRSREDDTTQIKWDEFVEVFTEKFFPDTVRDD-LERFMTLVQGSLTVAQYEAKFEEL 173 Query: 670 SKYAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSP 491 S+YAP V I+KVKRF GL+ ++ K L++ ++ Y E + RAL E EQR Q Sbjct: 174 SRYAPYQVDINIRKVKRFEQGLKLSITKQLSSHLIKDYREVITRALSVEKREQREAQIMA 233 Query: 490 ER 485 +R Sbjct: 234 KR 235 >gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa] gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica Group] gi|31431495|gb|AAP53268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1230 Score = 124 bits (311), Expect = 7e-26 Identities = 79/211 (37%), Positives = 118/211 (55%), Gaps = 7/211 (3%) Frame = -1 Query: 1009 EFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLSVL 830 EF K P F+G + EAE+WI ME+ + ++ ++ AT ML+S A EWW + Sbjct: 73 EFQKLKPPTFSGTANPLEAEEWIVAMEKSFEAMGCTDKEKIIYATYMLQSSAFEWWDAHK 132 Query: 829 NVHKE--VKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKYAP 656 + E TW FKE F +KYFP + K +EF LKQGN SV +YE++F+RL+++AP Sbjct: 133 KSYSERIFITWELFKEAFYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAP 192 Query: 655 DDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRA--LDFEHAEQRGTQYSPERQ 482 + V T+ K +RF GLR L + + A + + E V++A L+ + EQR P+++ Sbjct: 193 EFVQTDGSKARRFESGLRQPLKRRVEAFELTIFREVVSKAQLLEKGYHEQRIEHGQPQKK 252 Query: 481 ---SVPQNSYANKRRGNYEKGFGLEKRVKSE 398 + PQN + RGNY G +R SE 
Sbjct: 253 FKTNNPQNQ--GRFRGNYS---GQMQRKSSE 278 >ref|XP_007049818.1| Gag protease polyprotein [Theobroma cacao] gi|508702079|gb|EOX93975.1| Gag protease polyprotein [Theobroma cacao] Length = 548 Score = 120 bits (300), Expect = 1e-24 Identities = 78/260 (30%), Positives = 124/260 (47%), Gaps = 2/260 (0%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREW 845 SK ++E + F G +D T A+DWI+ + + ++LD++ ++ ATR+L+ AR W Sbjct: 110 SKKLKEVRQLGCVSFTGELDATRAKDWINQVSETLSDMRLDDDMKLMVATRLLQKRARTW 169 Query: 844 WLSVLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSK 665 W SV + +TWS F FD +YF + +K EF +LKQGN++V +YE +F L Sbjct: 170 WNSVKSRSATAQTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELML 229 Query: 664 YAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPER 485 Y PD V +E + F GLR + + +T G + + E V AL R + + E Sbjct: 230 YVPDLVKSEQDQASYFEEGLRNEIRERMTVTGREPHKEVVQMAL-------RAEKLATEN 282 Query: 484 QSVPQNSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 + + + +A +R + S V G S S + V+ T+ P Sbjct: 283 RRI-RTEFAKRRNPS-----------MSSSQPVKRGKDSAISGSTTSVSVTSPRPPFPPS 330 Query: 304 QRLWQNFQR--LRGGGRGSG 251 Q+ F R + G GR G Sbjct: 331 QQRPSRFSRSAMTGSGRSFG 350 >ref|XP_007014287.1| Gag protease polyprotein [Theobroma cacao] gi|508784650|gb|EOY31906.1| Gag protease polyprotein [Theobroma cacao] Length = 389 Score = 119 bits (298), Expect = 2e-24 Identities = 86/296 (29%), Positives = 134/296 (45%), Gaps = 18/296 (6%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREW 845 SK ++E + F G +D T A+DWI+ + + + LD++ ++ ATR+L+ AR W Sbjct: 110 SKKLKEARQLGCVSFTGELDATVAKDWINQVFETLSDMGLDDDMKLMVATRLLEKRARTW 169 Query: 844 WLSVLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSK 665 W SV + TWS F FD +YF + +K EF +LKQGN++V +YE +F L Sbjct: 170 WNSVKSRSATPHTWSDFLREFDGQYFTYFHQKEKKREFLSLKQGNLTVEEYETRFNELML 229 Query: 664 YAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPER 485 Y PD V +E + F GLR + + +T +G + + E V AL R + + E Sbjct: 
230 YVPDLVKSEQDQASYFEEGLRNEIRERMTVIGREPHKEVVQMAL-------RAEKLATEN 282 Query: 484 QSVPQNSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 + + + +A +R S V G S S + V+ T+ P Sbjct: 283 RRI-RTEFAKRRNPG-----------MSSSQPVKRGKDSATSGSTTSVSVTSPRPPFPPS 330 Query: 304 QRLWQNFQR--LRGGGRG-SGYKRAASDG---------------CGKSGHKRHVCP 191 Q+ F R + G G+ G R + G CG++GH R CP Sbjct: 331 QQRPSRFSRSAMIGSGKSLGGSDRCRNCGNYHSGLCRGPTRCFQCGQTGHIRSNCP 386 >emb|CAE03019.3| OSJNBa0091D06.9 [Oryza sativa Japonica Group] Length = 1762 Score = 119 bits (298), Expect = 2e-24 Identities = 84/323 (26%), Positives = 141/323 (43%), Gaps = 3/323 (0%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLS 836 + +F++ P +F+ ++ EA+DW+ ++++ ++ L A+ L+ A +WW + Sbjct: 419 LTDFLRSRPPEFSQTVEPVEADDWLKDVDRKLNLVQCTPVEKTLYASHQLRGPAADWWEN 478 Query: 835 VLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 N H E W +F F + P D K EEF LKQGN S+ +Y +F +L++Y Sbjct: 479 YCNAHPEPTNIAWDEFATAFRAAHVPESTIDMKKEEFNRLKQGNSSINEYLSQFNKLARY 538 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPERQ 482 AP++V T+ KK+++F+ G+ + L A T+ +N+AL E A + T+ +R+ Sbjct: 539 APEEVDTDKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINKALLLEDARKEATEEYKKRK 598 Query: 481 SVPQ-NSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 S Q NS R Y + + V Q + Q A PS Sbjct: 599 SNHQGNSSRGAPRPRYGQPMQYHQSVTQANRQPGYAPRPQINRPAPQPQQRA--PSGNTA 656 Query: 304 QRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGSTPQHVDGERTHA 125 +F+ +G ++ C + GH CP T G H +G Sbjct: 657 PNSVTSFKSPQGPSAVQCFR------CNQMGHYARQCPQNPTNTNSG----HANGSTART 706 Query: 124 VTEPNQEATENLIEGSSRAKGNG 56 P AT++ SS+A G G Sbjct: 707 ---PTPAATQS--RPSSQASGQG 724 >emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera] Length = 1495 Score = 119 bits (297), Expect = 3e-24 Identities = 76/205 (37%), Positives = 105/205 (51%), Gaps = 14/205 (6%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEA-EDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWL 839 M+ FM P FNG EA E W+ M +I L + EE V 
AT ML A WW Sbjct: 116 MKRFMVMQPPSFNGEPSAAEAAEHWLRRMRRILVGLDIPEERRVGLATYMLVDKADFWWE 175 Query: 838 SVLNVHK-EVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 S+ V+ EV TW +F+ +F KYF V + K EF +L QG MSV +YE +F+ LS++ Sbjct: 176 SMKRVYDTEVMTWEEFERIFLGKYFGEVAKHAKRMEFEHLIQGTMSVLEYESRFSELSRF 235 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEH------------A 518 A +S E +K +RF GLRPA+ L + ++ Y+E V RAL E Sbjct: 236 ALGMISEEGEKARRFQQGLRPAIRNRLVPLAIRDYSELVKRALLVEQDIDETNQIREQKG 295 Query: 517 EQRGTQYSPERQSVPQNSYANKRRG 443 +++G Q E PQ ++RG Sbjct: 296 DRKGKQRMGESSQGPQQRQRTQQRG 320 >gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo] Length = 871 Score = 118 bits (296), Expect = 4e-24 Identities = 66/186 (35%), Positives = 106/186 (56%), Gaps = 4/186 (2%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGI-DLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDARE 848 +K +++F KY+PT F+G + D T A+ W+S++E I R +K E+ V A ML Sbjct: 329 AKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTA 388 Query: 847 WWLS---VLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFT 677 WW + +L TW +FKE F K+F RD K +EF NL+QG+M+V QY+ +F Sbjct: 389 WWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFD 448 Query: 676 RLSKYAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQY 497 LS++AP+ ++TE + +F+ GLR + + A T+A+A+ A+D E+ + Sbjct: 449 MLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSK 508 Query: 496 SPERQS 479 + R S Sbjct: 509 TAGRGS 514 >gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. 
melo] Length = 429 Score = 118 bits (296), Expect = 4e-24 Identities = 66/186 (35%), Positives = 106/186 (56%), Gaps = 4/186 (2%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGI-DLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDARE 848 +K +++F KY+PT F+G + D T A+ W+S++E I R +K E+ V A ML Sbjct: 57 AKHLRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTA 116 Query: 847 WWLS---VLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFT 677 WW + +L TW +FKE F K+F RD K +EF NL+QG+M+V QY+ +F Sbjct: 117 WWETTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFD 176 Query: 676 RLSKYAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQY 497 LS++AP+ ++TE + +F+ GLR + + A T+A+A+ A+D E+ + Sbjct: 177 MLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSK 236 Query: 496 SPERQS 479 + R S Sbjct: 237 TAGRGS 242 >ref|XP_006833015.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda] gi|548837601|gb|ERM98293.1| hypothetical protein AMTR_s00876p00007370, partial [Amborella trichopoda] Length = 366 Score = 118 bits (295), Expect = 5e-24 Identities = 70/198 (35%), Positives = 105/198 (53%), Gaps = 4/198 (2%) Frame = -1 Query: 1030 RHSKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAR 851 R + + F K HP F GG D EAE+W+ +E I ++L V A +LK DAR Sbjct: 100 RWEPIYERFRKQHPPNFEGGSDPMEAEEWLRTVEGIVEYMRLGNGDSVACAASLLKKDAR 159 Query: 850 EWWLSVLNVHKEVK--TWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFT 677 WW V+ ++V TW+ F ++F++KY+ R ++ EF NL+QG +V +Y +F Sbjct: 160 IWW-DVIKQTRDVAAMTWADFVQVFNKKYYSEAIRSARVNEFTNLRQGKSTVTEYARQFD 218 Query: 676 RLSKYAPDDVSTEIKKVKRFIMGL--RPALDKALTAVGVQTYAEAVNRALDFEHAEQRGT 503 RL+K+A D V TE ++ RF GL R + D A++ V TYAE N A G Sbjct: 219 RLAKFATDLVPTEFLRIHRFTEGLDSRISRDIAMSGVRATTYAEKDNTARWEARKASNGG 278 Query: 502 QYSPERQSVPQNSYANKR 449 + + Q++ A+KR Sbjct: 279 GDNKRKLPSNQHNEADKR 296 >ref|XP_007027895.1| Gag protease polyprotein [Theobroma cacao] gi|508716500|gb|EOY08397.1| Gag protease polyprotein [Theobroma cacao] Length = 502 Score = 118 bits (295), Expect = 5e-24 
Identities = 86/308 (27%), Positives = 134/308 (43%), Gaps = 18/308 (5%) Frame = -1 Query: 1024 SKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREW 845 SK ++E + F G +D T A+DWI+ + + + LD++ ++ ATR+L+ AR W Sbjct: 46 SKKLKEARQLGCVSFTGELDATVAKDWINQVSETLSDIGLDDDMKLMVATRLLEMRARTW 105 Query: 844 WLSVLNVHKEVKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSK 665 W SV + + WS F FD +YF + +K +EF +LKQ N+ V +YE +F L Sbjct: 106 WNSVKSRFATPQIWSDFLREFDGQYFTYFHQKEKKKEFLSLKQRNLIVEEYETRFYELML 165 Query: 664 YAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPER 485 Y PD V +E+ + F GLR + + +T G + + E V AL E + + E Sbjct: 166 YVPDLVKSEMDQASYFEEGLRNEIRERMTVTGREPHKEVVQMALQAE-------KLATEN 218 Query: 484 QSVPQNSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 + + K G S V G ST S + V+ T+ P Sbjct: 219 RRIRTEFAKRKNPG------------MSSSQPVKRGKESTTSGSTTSVSVTSPRPPFPPS 266 Query: 304 QRLWQNFQR--LRGGGRG-SGYKRAASDG---------------CGKSGHKRHVCPNAST 179 Q+ F R + G G+ G R + G CG++G R CP Sbjct: 267 QQRLSRFTRSAMTGSGKSFGGSDRCRNCGNYHSGLCRGPTRCFQCGQTGDIRSNCPQLGR 326 Query: 178 TMTIGSTP 155 + S+P Sbjct: 327 ATVVASSP 334 >emb|CAE04240.1| OSJNBa0089N06.1 [Oryza sativa Japonica Group] gi|57834090|emb|CAE04759.2| OSJNBb0060E08.22 [Oryza sativa Japonica Group] Length = 1851 Score = 117 bits (294), Expect = 6e-24 Identities = 83/323 (25%), Positives = 141/323 (43%), Gaps = 3/323 (0%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLS 836 + +F++ P +F+ I+ EA+DW+ ++++ ++ L A+ L+ A +WW + Sbjct: 418 LTDFLRSRPPEFSQTIEPVEADDWLKDVDRKLNLVQCTPVEKTLYASHQLRGPAADWWEN 477 Query: 835 VLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 N H E W +F F + P D K EEF LKQG+ SV +Y +F +L++Y Sbjct: 478 YCNAHPEPTNIAWDEFATAFRVAHVPESTIDMKKEEFNRLKQGHSSVNEYLSQFNKLARY 537 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPERQ 482 AP++V T+ KK+++F+ G+ + L A T+ +N+AL E A + T+ +R+ Sbjct: 538 APEEVDTDKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINKALLLEDARKEATEEYKKRK 597 Query: 481 
SVPQ-NSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 S Q NS R Y + + V Q + Q A PS Sbjct: 598 SNHQGNSSRGAPRPRYGQPMQYHQSVTQANRQSSYAPRPQMNRPAPPPQQRA--PSGNTA 655 Query: 304 QRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGSTPQHVDGERTHA 125 +F+ +G ++ C + GH CP T ++P H +G Sbjct: 656 PNSVTSFKSPQGPSAVQCFR------CNQMGHYARQCPQNPT----NTSPGHANGSTART 705 Query: 124 VTEPNQEATENLIEGSSRAKGNG 56 T ++ SS+A G G Sbjct: 706 PTPAAAQS-----RPSSQASGQG 723 >ref|NP_001176105.1| Os10g0357350 [Oryza sativa Japonica Group] gi|255679330|dbj|BAH94833.1| Os10g0357350 [Oryza sativa Japonica Group] Length = 443 Score = 117 bits (294), Expect = 6e-24 Identities = 65/167 (38%), Positives = 96/167 (57%), Gaps = 2/167 (1%) Frame = -1 Query: 1009 EFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLSVL 830 EF K P F+G + EAE+WI ME+ + ++ ++ AT ML+S A EWW + Sbjct: 161 EFQKLKPPTFSGTANPLEAEEWIVAMEKSFEAMGCTDKEKIIYATYMLQSSAFEWWDAHK 220 Query: 829 NVHKE--VKTWSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKYAP 656 + E TW FKE F +KYFP + K +EF LKQGN SV +YE++F+RL+++AP Sbjct: 221 KSYSERIFITWELFKEAFYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAP 280 Query: 655 DDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAE 515 + V T+ K +RF GLR L + + A + + E V++A E E Sbjct: 281 EFVQTDGSKARRFESGLRQPLKRRVEAFELTIFREVVSKAQLLEKGE 327 >emb|CAE04228.1| OSJNBa0011F23.1 [Oryza sativa Japonica Group] Length = 1787 Score = 117 bits (294), Expect = 6e-24 Identities = 87/334 (26%), Positives = 144/334 (43%), Gaps = 6/334 (1%) Frame = -1 Query: 1039 ISLRHSKLMQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKS 860 +S SKL +F++ P +F+ ++ EA+DW+ ++++ ++ L A+ L+ Sbjct: 411 MSNNRSKLT-DFLRSRPPEFSQTVEPVEADDWLKDVDRKLNLVQCTPVEKTLYASHQLRG 469 Query: 859 DAREWWLSVLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYEL 686 A +WW + N H E W +F F + P D K EEF LKQGN SV +Y Sbjct: 470 PAADWWENYCNAHPEPTNIAWDEFATAFRAAHVPESTIDMKKEEFNRLKQGNSSVNEYLS 529 Query: 685 KFTRLSKYAPDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRG 506 F +L++YAP++V T+ KK+++F+ G+ + L A T+ +N+AL E A + 
Sbjct: 530 MFNKLARYAPEEVGTDKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINKALLLEDARKEA 589 Query: 505 TQYSPERQSVPQNSYANKRRG----NYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVA 338 T+ +R+S N N RG Y + + V Q + Sbjct: 590 TEEYKKRKS---NHQGNSSRGAPCPRYGQPMQYHQSVTQANRQPGYAPRPQMNRPAPQPQ 646 Query: 337 QTAVIPSNTGYQRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGST 158 Q A PS +F+ +G ++ C + GH CP T ++G Sbjct: 647 QRA--PSGNTAPNSVTSFKSPQGPSAVQCFR------CNQMGHYARQCPQNPTNTSLG-- 696 Query: 157 PQHVDGERTHAVTEPNQEATENLIEGSSRAKGNG 56 H +G T ++ SS+A G G Sbjct: 697 --HANGSTARTPTPAAAQS-----RPSSQASGQG 723 >gb|ABA96609.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1556 Score = 117 bits (293), Expect = 8e-24 Identities = 85/323 (26%), Positives = 141/323 (43%), Gaps = 3/323 (0%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLS 836 + +F++ P +F+ ++ EA+DW+ ++++ ++ L A+ L+ A +WW + Sbjct: 139 LTDFLRSCPPEFSQTVEPVEADDWLKDVDRKLNFVQCTPVEKTLYASHQLRGPAADWWEN 198 Query: 835 VLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 N H E W +F F + P D K EEF LKQGN SV +Y +F +L++Y Sbjct: 199 YCNAHPEPTNIAWDEFATAFRAAHVPESTIDMKKEEFNRLKQGNSSVNEYLSQFNKLARY 258 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPERQ 482 AP++V T+ KK+++F+ G+ + L A T+ +N+AL E A + T+ +R+ Sbjct: 259 APEEVDTDKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINKALLLEDARKEATEEYKKRK 318 Query: 481 SVPQ-NSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 S Q NS R Y + + V Q + Q A PS Sbjct: 319 SNHQGNSSRGAPRPRYGQPMQYHQSVTQANHQPGYAPRPQMNRPAPQPQQRA--PSGNTA 376 Query: 304 QRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGSTPQHVDGERTHA 125 +F+ +G ++ C + GH CP T G H +G Sbjct: 377 PNSVTSFKSPQGPSAVQCFR------CNRMGHYARQCPRNPTNTNSG----HANGSTART 426 Query: 124 VTEPNQEATENLIEGSSRAKGNG 56 P A ++L SS+A G G Sbjct: 427 ---PAPAAAQSL--PSSQASGQG 444 >emb|CAE04051.2| OSJNBb0062B06.9 [Oryza sativa Japonica Group] Length = 1680 Score = 117 bits (293), Expect = 8e-24 Identities = 83/323 (25%), Positives 
= 140/323 (43%), Gaps = 3/323 (0%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLS 836 + +F++ P +F+ ++ EA+DW+ ++++ ++ L A+ L+ A +WW + Sbjct: 414 LTDFLRSRPPEFSQTVEPVEADDWLKDVDRKLNIVQCTPVEKTLYASHQLRGPAADWWEN 473 Query: 835 VLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 N H E W +F F + P D K EEF LKQGN SV +Y +F +L++Y Sbjct: 474 YCNAHPEPTNIAWDEFATAFRAAHVPESTIDMKKEEFNRLKQGNNSVNEYLSQFNKLARY 533 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPERQ 482 AP++V T+ KK+++F+ G+ + L A T+ +N+AL E A + T+ +R+ Sbjct: 534 APEEVDTDKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINKALLLEDARKEATEEYRKRK 593 Query: 481 SVPQ-NSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 S Q NS R Y + + V Q + Q A PS Sbjct: 594 SNHQGNSSRGAPRPRYGQPMQYHQSVTQANRQPGYAPRPQMNRPTPQPQQRA--PSGNTA 651 Query: 304 QRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGSTPQHVDGERTHA 125 +F+ G ++ C + GH CP ST + +H +G Sbjct: 652 PNSVTSFKSPLGPSAVQCFR------CNQMGHYARQCPQNST----NTNSEHANGSTART 701 Query: 124 VTEPNQEATENLIEGSSRAKGNG 56 T ++ SS+A G G Sbjct: 702 PTPAAAQS-----RPSSQASGQG 719 >emb|CAH68033.1| OSIGBa0139N19-OSIGBa0137L10.2 [Oryza sativa Indica Group] Length = 1680 Score = 117 bits (293), Expect = 8e-24 Identities = 83/323 (25%), Positives = 140/323 (43%), Gaps = 3/323 (0%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLS 836 + +F++ P +F+ ++ EA+DW+ ++++ ++ L A+ L+ A +WW + Sbjct: 414 LTDFLRSRPPEFSQTVEPVEADDWLKDVDRKLNIVQCTPVEKTLYASHQLRGPAADWWEN 473 Query: 835 VLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 N H E W +F F + P D K EEF LKQGN SV +Y +F +L++Y Sbjct: 474 YCNAHPEPTNIAWDEFATAFRAAHVPESTIDMKKEEFNRLKQGNNSVNEYLSQFNKLARY 533 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPERQ 482 AP++V T+ KK+++F+ G+ + L A T+ +N+AL E A + T+ +R+ Sbjct: 534 APEEVDTDKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINKALLLEDARKEATEEYRKRK 593 Query: 481 SVPQ-NSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 S Q NS R Y + + V 
Q + Q A PS Sbjct: 594 SNHQGNSSRGAPRPRYGQPMQYHQSVTQANRQPGYAPRPQMNRPTPQPQQRA--PSGNTA 651 Query: 304 QRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGSTPQHVDGERTHA 125 +F+ G ++ C + GH CP ST + +H +G Sbjct: 652 PNSVTSFKSPLGPSAVQCFR------CNQMGHYARQCPQNST----NTNSEHANGSTART 701 Query: 124 VTEPNQEATENLIEGSSRAKGNG 56 T ++ SS+A G G Sbjct: 702 PTPAAAQS-----RPSSQASGQG 719 >emb|CAH66707.1| OSIGBa0147J19.11 [Oryza sativa Indica Group] Length = 1851 Score = 117 bits (293), Expect = 8e-24 Identities = 82/323 (25%), Positives = 141/323 (43%), Gaps = 3/323 (0%) Frame = -1 Query: 1015 MQEFMKYHPTKFNGGIDLTEAEDWISNMEQIARTLKLDEESLVLSATRMLKSDAREWWLS 836 + +F++ P +F+ ++ EA+DW+ ++++ ++ L A+ L+ A +WW + Sbjct: 418 LTDFLRSRPPEFSQTVEPVEADDWLKDVDRKLNLVQCTPVEKTLYASHQLRGPAADWWEN 477 Query: 835 VLNVHKEVKT--WSKFKELFDEKYFPMVERDKKLEEFRNLKQGNMSVRQYELKFTRLSKY 662 N H E W +F F + P D K EEF LKQGN SV +Y +F +L++Y Sbjct: 478 YCNAHPEPTNIVWDEFAMAFRAAHVPESTIDMKKEEFNRLKQGNSSVNEYLSQFNKLARY 537 Query: 661 APDDVSTEIKKVKRFIMGLRPALDKALTAVGVQTYAEAVNRALDFEHAEQRGTQYSPERQ 482 AP++V+T+ KK+++F+ G+ + + A T+ +N+AL E A + T+ +R+ Sbjct: 538 APEEVNTDKKKIRKFLKGIAVGMRLQMLAHDFPTFQHMINKALLLEDARKEATEEYKKRK 597 Query: 481 SVPQ-NSYANKRRGNYEKGFGLEKRVKSEGSQVVTGTHSTASNTIAVVAQTAVIPSNTGY 305 S Q NS R Y + + V Q + Q A PS Sbjct: 598 SNHQGNSSRGAPRPRYGQPMQYHQSVTQANHQPGYAPRPQMNRPAPQPQQRA--PSGNTA 655 Query: 304 QRLWQNFQRLRGGGRGSGYKRAASDGCGKSGHKRHVCPNASTTMTIGSTPQHVDGERTHA 125 +F+ +G ++ C ++GH CP T G H +G Sbjct: 656 PNSVTSFKSPQGPSAVQCFR------CNQTGHYARQCPQNPTNTNSG----HANGSTART 705 Query: 124 VTEPNQEATENLIEGSSRAKGNG 56 T ++ SS+A G G Sbjct: 706 PTPAVAQS-----RPSSQASGQG 723