BLASTX nr result
ID: Rheum21_contig00009053
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00009053
         (1394 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_003622194.1| Cellular nucleic acid-binding protein-like p...  121   6e-25
ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355...  121   8e-25
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]  116   3e-23
emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]  115   3e-23
ref|XP_003605727.1| Cellular nucleic acid-binding protein [Medic...  108   5e-21
gb|ABA97418.2| retrotransposon protein, putative, Ty3-gypsy subc...  107   9e-21
ref|XP_003635931.1| Cellular nucleic acid-binding protein [Medic...  107   2e-20
gb|AAX96504.1| retrotransposon protein, putative, Ty3-gypsy sub-...  106   3e-20
gb|ABD28293.1| RNA-directed DNA polymerase (Reverse transcriptas...  104   8e-20
gb|ABC94893.1| polyprotein [Oryza australiensis]                     103   1e-19
gb|EMJ28398.1| hypothetical protein PRUPE_ppa019381mg [Prunus pe...  102   4e-19
gb|AAO45751.1| gag-protease polyprotein [Cucumis melo subsp. melo]   102   4e-19
ref|XP_006598445.1| PREDICTED: uncharacterized protein LOC102661...  102   5e-19
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...  101   9e-19
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]        101   9e-19
emb|CAD41297.2| OSJNBa0020J04.2 [Oryza sativa Japonica Group]        101   9e-19
gb|EOY16854.1| DNA/RNA polymerases superfamily protein [Theobrom...  100   1e-18
gb|AAX94938.1| retrotransposon protein, putative, Ty3-gypsy sub-...  100   1e-18
gb|ABA98459.1| retrotransposon protein, putative, Ty3-gypsy subc...
100 1e-18 gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo] 100 1e-18 >ref|XP_003622194.1| Cellular nucleic acid-binding protein-like protein, partial [Medicago truncatula] gi|355497209|gb|AES78412.1| Cellular nucleic acid-binding protein-like protein, partial [Medicago truncatula] Length = 509 Score = 121 bits (304), Expect = 6e-25 Identities = 100/369 (27%), Positives = 157/369 (42%), Gaps = 13/369 (3%) Frame = -3 Query: 1071 ALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWE 892 A V + P F G+ + W +ER+F ++ TE ++V + L +EA +WW Sbjct: 54 AQAVGHQNHPPTFKGRYDLDGAQTWLKEIERVFRVMQCTEVQKVRFGTHMLAEEADDWWI 113 Query: 891 S---TDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLL 721 S D V+ W +F+ + DR+F +R +K EFL K + VTE A KF L Sbjct: 114 SLLPVLKQDGAVVTWAVFRREFLDRYFLEDVRGKKEIEFLELKQGNMSVTEYAAKFVELA 173 Query: 720 QYAEPKITSEAQKIWY---FHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKR 550 ++ P T+E K F + +I + + T V + EE TK Sbjct: 174 KFY-PHYTAETAKFSKCIKFENGLRAEIKRAIGYQKIRTFSDLVSSCR----IYEEDTKA 228 Query: 549 RNR---QRSMQGQSQATGFKRPAPPSNFEKSGKART--EMTPVXXXXXXXXXXXXXQCFN 385 + +R ++GQ P P S GK R E P C+ Sbjct: 229 HYKIVNERKVKGQQSC-----PKPYSAPADKGKQRMVDERRP-----RKKDAHVEIVCYT 278 Query: 384 CGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTE-SQQGSV-NQNTRPKFKPAE 211 CG GHK+ CP++ K CF CG++GH C+ +++G + +Q +PK Sbjct: 279 CGEKGHKSNACPRDVKRCFCCGKKGHTLAECKHDDIVCFNCNEEGHIGSQCKKPK----- 333 Query: 210 TPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGA 31 AQ G+ A++G Q + + +GT Y ST + ++ D+GA Sbjct: 334 --------KAQTTGRVFALTGTQ----------TESEDHLIRGTCYFDSTPLVVIIDTGA 375 Query: 30 THSFISENC 4 TH FI+ +C Sbjct: 376 THCFIAIDC 384 >ref|XP_003605752.1| Pol polyprotein [Medicago truncatula] gi|355506807|gb|AES87949.1| Pol polyprotein [Medicago truncatula] Length = 745 Score = 121 bits (303), Expect = 8e-25 Identities = 97/366 (26%), Positives = 161/366 (43%), Gaps = 8/366 (2%) Frame = -3 Query: 1074 RALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWW 895 R L+ P F G+ + W +ER+F ++ TE 
++V + QL +EA +WW Sbjct: 35 RMLETFMKKNPPTFKGRCDPDGAQTWLKEIERIFRVMQCTEDQKVRFGTHQLAEEADDWW 94 Query: 894 ES---TDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHL 724 + T + V+ W +F+ + R+F +R +K EFL K + VTE A KF L Sbjct: 95 VALLPTLGQEGAVVTWAVFRREFLRRYFPEDVRGKKEIEFLELKQGNMSVTEYAAKFVEL 154 Query: 723 LQYAEPKITSEA---QKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTK 553 ++ P T+E + F + PDI + + + V++ E + K Sbjct: 155 SKFY-PHYTAENAEFSRCIKFENGLRPDIKRAIGYQQLRVFQDLVNSCRIYEEDTKAHYK 213 Query: 552 RRNRQRSMQGQSQATGFKRPAPPSNFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAY 373 N ++ QS+ + PA K + +M V FNCG Sbjct: 214 VVNERKGKGQQSRPKPYSAPAD--------KGKQKMVDVRRPKKKDAAEIVY--FNCGEK 263 Query: 372 GHKNIECPKEKKSCFICGREGHLKQFC-RLGKQTGTESQQGSV-NQNTRPKFKPAETPSP 199 GHK+ CP+E K C CG++GH+ C R + +G + +Q T+PK P Sbjct: 264 GHKSNACPEEIKKCVRCGKKGHVVADCNRTDIVCFNCNGEGHISSQCTQPKRAP------ 317 Query: 198 SMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATHSF 19 G+ A++G Q + D+ + +GT Y+++T + + D+GATH F Sbjct: 318 -------TTGRVFALTGTQTE------SEDR----LIRGTCYINNTPLVAIIDTGATHCF 360 Query: 18 ISENCV 1 I+ +CV Sbjct: 361 IAFDCV 366 >emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera] Length = 1573 Score = 116 bits (290), Expect = 3e-23 Identities = 88/356 (24%), Positives = 151/356 (42%), Gaps = 7/356 (1%) Frame = -3 Query: 1050 MRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWESTD-LDD 874 M+P +F+G+ + HW M R+ ++ E +V +A+ L D+A WWES + D Sbjct: 213 MQPPSFNGEPSAEASEHWLRRMRRILVGLDIPEERRVGLATYMLVDKADFWWESMKRVYD 272 Query: 873 SPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEPKITS 694 + V+ W F+ ++F + K EF + + V E ++F+ L ++A I+ Sbjct: 273 TEVMTWEEFERIFLGKYFGEVAKHAKRMEFEHLIQGTMLVLEYESRFSELSRFALGMISE 332 Query: 693 EAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQRSMQGQSQ 514 E +K F + + P I +V + V A +E ++E+ + R ++ +G+ Q Sbjct: 333 EGEKARRFQQGLRPIIRNRLVPLAIRDYSELVKRALLVEQDIDETNQIREQKGDKKGK-Q 391 Query: 513 ATGFKRPAPPSNFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKS 334 G P R C+ CGA H CP Sbjct: 392 
RMGESSQGPQQRQRTQQFERRPSFYAGEGQIAQRAATNRVCYGCGAGDHLWRACPLR--- 448 Query: 333 CFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETPSPSMQTTAQNKGKAPAV 154 G Q QGS Q P+ PA + + +Q + + Sbjct: 449 ----------------GAQXAQPQSQGSSQQQPMPQLPPAAQGTRTTTMNSQTRSSQGSN 492 Query: 153 SGGQGQ----RLYEM--READKPNAPVAQGTLYLHSTSVCILFDSGATHSFISENC 4 + G+G+ R++ + E DK +A + +G + ++ST V +LFD+GATHSFIS +C Sbjct: 493 ARGRGRPAAGRVFALTPTEPDK-DALLVEGMILVYSTWVRVLFDTGATHSFISASC 547 >emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera] Length = 1387 Score = 115 bits (289), Expect = 3e-23 Identities = 90/361 (24%), Positives = 160/361 (44%), Gaps = 5/361 (1%) Frame = -3 Query: 1071 ALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWE 892 A+K M+P +F+G+ + HW M R+ ++ E +V +A+ L D+A WWE Sbjct: 51 AMKRFMVMQPPSFNGEPSAEAAEHWLXRMRRILVGLDIPEERRVGLATYMLVDKADFWWE 110 Query: 891 STD-LDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQY 715 S + D+ V+ W F+ ++F + K EF + + V E ++F+ L ++ Sbjct: 111 SMKRVYDTEVMTWEEFERIFLGKYFGEVAKHAKRMEFEHLIQGTMSVLEYESRFSELSRF 170 Query: 714 AEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQR 535 A I+ E +K F + + P I +V + V A +E ++E+ + R ++R Sbjct: 171 ALGMISEEGEKARRFQQGLRPAIRNRLVPLAIRDYSELVKRALLVEQDIDETNQIREKKR 230 Query: 534 SMQGQSQATGFKRPAPPSNFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNIE 355 +G+ Q G P ++ + E P + A G + + Sbjct: 231 DRKGK-QRMGESSQGPQ---QRQRTQQFERRP-----------------SFYAEGGQIAQ 269 Query: 354 CPKEKKSCFICGREGHLKQFCRL-GKQTGTESQQGSVNQNTRPKFKPAETPSP--SMQTT 184 + C+ CG HL + C L Q QGS Q + F+P + P M Sbjct: 270 RAAANRVCYGCGAGDHLWRACPLRDTQQARPQSQGSSQQQSVVSFQPPQFQLPYYQMPQL 329 Query: 183 AQNKGKAPAVSGGQGQRLYEMREAD-KPNAPVAQGTLYLHSTSVCILFDSGATHSFISEN 7 G+ +G R++ + + + +A + +G + ++ST V +LFD+GATHSFIS + Sbjct: 330 PPTTGRGRQAAG----RVFALTPTESEEDALLVKGMILVYSTWVRVLFDTGATHSFISAS 385 Query: 6 C 4 C Sbjct: 386 C 386 >ref|XP_003605727.1| Cellular nucleic acid-binding protein [Medicago truncatula] gi|355506782|gb|AES87924.1| Cellular nucleic acid-binding protein [Medicago 
truncatula] Length = 458 Score = 108 bits (270), Expect = 5e-21 Identities = 92/356 (25%), Positives = 152/356 (42%), Gaps = 9/356 (2%) Frame = -3 Query: 1044 PGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWES---TDLDD 874 P F G+ + W +ER+F ++ +E ++V + L +EA +WW S D Sbjct: 44 PPTFKGRYDPDGAQKWLKEVERIFRVMQCSEVQKVRFGTHMLAEEADDWWVSLLPVLEQD 103 Query: 873 SPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEPKITS 694 V+ W +F+ + +R+F +R +K EFL K + VTE A KF L ++ P T+ Sbjct: 104 GAVVTWAVFRREFLNRYFPEDVRGKKEIEFLELKQGDMSVTEYAAKFVELAKFY-PHYTA 162 Query: 693 EAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQRSMQGQSQ 514 E + ++ + D+ A YK+ + +R QS+ Sbjct: 163 EIAEFSKCIKFENEDMK----------------AHYKV----------MSERRGKGQQSR 196 Query: 513 ATGFKRPAPPS----NFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNIECPK 346 + PA N E+ K R T + CF CG GHK+ C + Sbjct: 197 LKPYSAPADKGKQRLNDERRPKRRDAPTDIV-------------CFKCGEKGHKSNVCDR 243 Query: 345 EKKSCFICGREGHLKQFCRLGKQTGTE-SQQGSV-NQNTRPKFKPAETPSPSMQTTAQNK 172 EKK CF CG++GH C+ G +++G + +Q T+PK + Sbjct: 244 EKKKCFRCGQKGHTLADCKHGDVVCYNCNEEGHISSQCTQPK-------------KVRTG 290 Query: 171 GKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATHSFISENC 4 GK A++G Q + E R + +GT + +ST + + D+ A H FI+ +C Sbjct: 291 GKVFALNG--TQTVNEDR--------LIRGTCFFNSTPLIAIIDTSALHYFIAVDC 336 >gb|ABA97418.2| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1807 Score = 107 bits (268), Expect = 9e-21 Identities = 87/351 (24%), Positives = 146/351 (41%), Gaps = 5/351 (1%) Frame = -3 Query: 1047 RPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWES-TDLDDS 871 RP F E V+ W ++R + V+ T E+ AS QLR A++WWE+ + Sbjct: 425 RPPEFSQTVEPVEADDWLKDVDRKLNLVQCTPVEKTLYASHQLRGPAADWWENYCNAHPE 484 Query: 870 PV-LNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEPKITS 694 P + W F + + K +EF K + E ++FN L +YA ++ + Sbjct: 485 PTNIAWDEFATAFRAAHVPESTIDMKKEEFNRLKQGNSSINEYLSQFNKLARYAPEEVDT 544 Query: 693 EAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQRSMQGQSQ 514 + +KI F + ++ + 
++ H T + ++ A LE +E+T+ +++S + Sbjct: 545 DKKKIRKFLKGIAVGMRLQLLAHDFPTFQHMINNALLLEDARKEATEEYKKRKSNHQGNS 604 Query: 513 ATGFKRP--APPSNFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNIECPKEK 340 + G RP P + +SG + + CF C GH +CP Sbjct: 605 SRGAPRPRYGQPMQYHQSGPSAVQ------------------CFRCNQMGHYARQCP--- 643 Query: 339 KSCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETPS-PSMQTTAQNKGKA 163 Q T + G N +T PA T S PS Q + Q G Sbjct: 644 --------------------QNPTNTNSGHANGSTARTPTPAATQSRPSSQASGQ--GSR 681 Query: 162 PAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATHSFISE 10 + + G+G+ + E + V G ++S +LFDSGA+HSFIS+ Sbjct: 682 ASNNFGRGRVNHIQAETAQDAPDVVMGMFSVNSVPAIVLFDSGASHSFISQ 732 >ref|XP_003635931.1| Cellular nucleic acid-binding protein [Medicago truncatula] gi|355501866|gb|AES83069.1| Cellular nucleic acid-binding protein [Medicago truncatula] Length = 558 Score = 107 bits (266), Expect = 2e-20 Identities = 85/365 (23%), Positives = 141/365 (38%), Gaps = 8/365 (2%) Frame = -3 Query: 1074 RALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWW 895 R L+ P F + + +W +ER+F ++ +E ++V + L +EA +WW Sbjct: 77 RMLETFLRNHPPTFKERYDPDGAQNWLKEVERVFRVMQCSEVQKVRFGAHMLAEEAEDWW 136 Query: 894 ESTDL---DDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHL 724 S D + W +F+ + +R+F +R +K EFL K + VTE KF L Sbjct: 137 VSLLPILEQDGVAVTWAVFRREFLNRYFPEDVRGKKEIEFLELKQGDMSVTEYVAKFVEL 196 Query: 723 LQYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRN 544 ++ P T+E K F + DI + + V + E + K + Sbjct: 197 AKFY-PHYTAEFSKCIKFKNGLRADIKRAIGYQKIRNFYDLVSSCRIYEEDTKAHYKVMS 255 Query: 543 RQRSMQGQSQATGFKRPAPPS----NFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGA 376 +R QS+ + PA N E+ + R T + CF CG Sbjct: 256 ERRGKGQQSRPKPYSAPANKVKQRLNDERRPRRRDAPTEIV-------------CFKCGE 302 Query: 375 YGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTE-SQQGSVNQNTRPKFKPAETPSP 199 GHK+ C +++K CF CG++GH C+ G ++G ++ R Sbjct: 303 KGHKSNVCDRDEKKCFRCGKKGHTLADCKRGDVVCYNCDEEGHISSQCR----------- 351 Query: 198 SMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATHSF 19 P Q L+L+ST + + D+GATH F Sbjct: 
352 ---------------------------------KPTYQRYLFLYSTPLIAIIDTGATHCF 378 Query: 18 ISENC 4 I+ +C Sbjct: 379 IAVDC 383 >gb|AAX96504.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza sativa Japonica Group] gi|77550471|gb|ABA93268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1506 Score = 106 bits (264), Expect = 3e-20 Identities = 85/369 (23%), Positives = 150/369 (40%), Gaps = 20/369 (5%) Frame = -3 Query: 1059 VNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWES--T 886 V P +F + + W +E + E+ A+ L+ A+ WWE+ T Sbjct: 89 VQRTHPPHFSSAADPLAADDWLRDIEIKLNLCRCDPVEKATFAAYYLQGAAAAWWETYKT 148 Query: 885 DLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEP 706 + + W +F+ + E K KEFL K + E +FN+L +YA Sbjct: 149 LIPPDEPITWTVFREGFRSAHIPAGLMEIKKKEFLNLKQGNMPFMEFMERFNYLGRYAAS 208 Query: 705 KITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQ---R 535 + +E +K+ + ++P++ + H ++++ VD A ++E + +E + R R+ + Sbjct: 209 DLNTETKKVELCRDRLAPELKHALAAHEITSMKTLVDKALRVESSEKEVVEDRKRKWAAK 268 Query: 534 SMQGQSQATGFK----------RPAPPSNFEKSGKARTEMT----PVXXXXXXXXXXXXX 397 G S +T + P PP + +T++ Sbjct: 269 KFAGSSSSTRPRLAPSPAVRPMAPQPPRQMYVPPRPQTQLVRQVPRAVQAAGDASRNANV 328 Query: 396 QCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKP 217 C+NCG GH + CP K ++G SQ P+ +P Sbjct: 329 TCYNCGKKGHYSPSCPYPKTG------------------KSGPYSQGAPQQPRGPPQVQP 370 Query: 216 AETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAP-VAQGTLYLHSTSVCILFD 40 + +P++ PA + G+G RL + + +AP V GT +HS + +LFD Sbjct: 371 GQRGAPAV--------PKPAPTFGRG-RLNHVTAEEATDAPGVVLGTFLVHSIPLTVLFD 421 Query: 39 SGATHSFIS 13 SGATHSF+S Sbjct: 422 SGATHSFMS 430 >gb|ABD28293.1| RNA-directed DNA polymerase (Reverse transcriptase); Zinc finger, CCHC-type; Peptidase aspartic, active site; Retrotransposon gag protein [Medicago truncatula] Length = 912 Score = 104 bits (260), Expect = 8e-20 Identities = 92/330 (27%), Positives = 145/330 (43%), Gaps = 11/330 (3%) Frame = -3 Query: 957 
TEAEQVHIASLQLRDEASEWWES---TDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKE 787 TE ++V + QL +EA +WW + T + VL W +F+ + R+F +R +K E Sbjct: 63 TEDQKVRFGTHQLVEEADDWWVALLPTLGQEGAVLTWAVFRREFLRRYFPEDVRGKKEIE 122 Query: 786 FLYPKTDGLKVTELATKFNHLLQYAEPKITSEA---QKIWYFHEWMSPDINPWMVHHTCS 616 FL K + VTE A KF L ++ P T+E + F + PDI + + Sbjct: 123 FLELKQGNMSVTEYAAKFVELSKFY-PHYTAENAEFSRCIKFENGLRPDIKRAIGYQQLR 181 Query: 615 TLEQYVDAAYKLEVTLEESTKRRNR---QRSMQGQSQATGFKRPAPPSNFEKSGKARTEM 445 + V++ EE+TK + +R+ +GQ RP P S GK + Sbjct: 182 VFQDLVNSCR----IYEENTKAHYKVVNERNGKGQQS-----RPKPYSAPADKGKQKM-- 230 Query: 444 TPVXXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFC-RLGKQTGT 268 V CFNCG GHK+ P+E K C CG++GH+ C R Sbjct: 231 --VDVRRPKKKDAVEIVCFNCGEKGHKSNVYPEEIKKCVRCGKKGHVVADCNRTDIVCFN 288 Query: 267 ESQQGSV-NQNTRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPV 91 + +G + +Q T+PK P G+ A++G Q + + Sbjct: 289 CNGEGHISSQCTQPKRAP-------------TTGRVFALTGTQTEN----------EDRL 325 Query: 90 AQGTLYLHSTSVCILFDSGATHSFISENCV 1 +GT Y+ +T + + D+GATH FI+ +CV Sbjct: 326 IRGTCYISNTPLVAIIDTGATHCFIAFDCV 355 >gb|ABC94893.1| polyprotein [Oryza australiensis] Length = 1469 Score = 103 bits (258), Expect = 1e-19 Identities = 92/375 (24%), Positives = 145/375 (38%), Gaps = 17/375 (4%) Frame = -3 Query: 1086 GEYGRALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEA 907 G G +L +P F E + W +E+ V EA++V A+ QL A Sbjct: 42 GRGGSSLGEFMRAKPPTFSTAEEPMDAEDWLRVIEKKLTLVRVREADRVVFATNQLEGPA 101 Query: 906 SEWWES---TDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATK 736 S+WW++ T +D+ W F A + F A+ K EF + V E K Sbjct: 102 SDWWDTYKETRAEDAGEPTWEEFTAAFRENFVPAAVMRMKKNEFRRLRQGNTSVQEYLNK 161 Query: 735 FNHLLQYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEEST 556 F L +YA + E +KI F E ++ ++ M+ ++ + ++ +LE + Sbjct: 162 FTQLARYATSDLADEEEKIDKFIEGLNDELRGPMIGQDHTSFQSLINKVVRLEHDQKVVD 221 Query: 555 KRRNRQRSMQGQSQAT----------GFKRPAP----PSNFEKSGKARTEMTPVXXXXXX 418 R R+ +M Q T G+K P P + ++ T Sbjct: 222 NNRKRRLAMARPFQGTPQRPKGATPSGWKPNVPATGRPLASDHVNRSATPQLRTPTPTLA 281 
Query: 417 XXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQGSVNQN 238 CFNCG +GH + CPK + + G + GT + Sbjct: 282 APGRRNVSCFNCGEFGHYSNSCPKPRNTPVRTG--------ANVTPVRGTPTPAAG---- 329 Query: 237 TRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTS 58 R F+ TP P+ T +G+ V + Q + V G ++ST Sbjct: 330 -RGLFR---TPLPNEAATGFRRGQVNHVRAEEAQE----------DQSVLMGMFSINSTL 375 Query: 57 VCILFDSGATHSFIS 13 V +LFDSGA+HSFIS Sbjct: 376 VKVLFDSGASHSFIS 390 >gb|EMJ28398.1| hypothetical protein PRUPE_ppa019381mg [Prunus persica] Length = 505 Score = 102 bits (254), Expect = 4e-19 Identities = 97/408 (23%), Positives = 157/408 (38%), Gaps = 43/408 (10%) Frame = -3 Query: 1095 RNRGEYGRALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLR 916 R R +K V + F G + + + +ER+F ++ + ++V +A+ L+ Sbjct: 70 RRRNTESSDIKRVKELGANEFHGSADPAEADACLTDVERIFEVLQCPDRDRVRLAAFLLK 129 Query: 915 DEASEWWEST--DLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELA 742 A W++ + L W F+ D+F+ + + EK EFL+ + + V E Sbjct: 130 GNAYHGWKAVRRGYANPAALTWEEFQRVFFDQFYPHSYKNEKKSEFLHLRQGSMSVLEYE 189 Query: 741 TKFNHLLQYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKL--EVTL 568 KFN L ++A +T+E + F E + DI + +T T+ AA ++ + +L Sbjct: 190 HKFNELSRFAPELVTTEEDRCTRFEEGLWLDIQAVVTANTYPTMRALAQAADRVARKYSL 249 Query: 567 EESTKRRNRQRS-----MQGQSQATGFKRPAPPSNF-----EKSGKARTEMTP------- 439 RR R S QG S+ G + S + SG R+ P Sbjct: 250 GAGISRRRRDSSGFGEPSQGPSKRGGSSSSSAGSEWSGGRGSSSGSRRSGSRPAWSQHSG 309 Query: 438 ---VXXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKK----------SCFICGREGHLKQ 298 V C CG GH +CP+ + SC+ CG+ GH + Sbjct: 310 QQSVASTAKDFSQQYNATCHGCGQTGHLRRDCPQRGQTSGPSRRSGVSCYHCGQAGHYRS 369 Query: 297 FCRL-------GKQTGTESQQGSVNQNTRPKFKPAETPSPSMQTTAQN--KGKAPAVSGG 145 C L GK+T + Q S Q + S + Q+ +G++ G Sbjct: 370 ECPLLTVGGTAGKETWAQQGQSSRGQGQTESGASSSAAGSSSSSGVQSTFRGRSGRSQRG 429 Query: 144 QGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATHSFISENCV 1 Q R AQ T L + V L D ATHSFI+ + V Sbjct: 430 QSGRSTTHARVFSMTHHEAQATPDLITARV--LIDPRATHSFITPSFV 475 >gb|AAO45751.1| gag-protease polyprotein [Cucumis melo 
subsp. melo] Length = 429 Score = 102 bits (254), Expect = 4e-19 Identities = 96/364 (26%), Positives = 149/364 (40%), Gaps = 12/364 (3%) Frame = -3 Query: 1068 LKVVNSMRPGNFDGKGES-VKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWE 892 L+ P FDG E + W S +E +F ++ E ++V A L D + WWE Sbjct: 60 LRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWE 119 Query: 891 STDL---DDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLL 721 +T+ D + W FK +FFS ++R+ K +EFL + + V + +F+ L Sbjct: 120 TTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLS 179 Query: 720 QYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNR 541 ++A I +EA + F + DI + +T + DA L + ++ S + R Sbjct: 180 RFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPAT---HADA---LRLAVDLSLQERAN 233 Query: 540 QRSMQGQSQATGFKR-------PAPPSNFEKSGKART-EMTPVXXXXXXXXXXXXXQCFN 385 G+ +G KR P P NF G+ R+ + P C Sbjct: 234 SSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKP---FEAGEAARGKPLCTT 290 Query: 384 CGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETP 205 CG H C ++CF C +EGH C L + TG QG+ Sbjct: 291 CGK--HHLGRCLFGTRTCFKCRQEGHTADRCPL-RVTGIAQNQGA--------------- 332 Query: 204 SPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATH 25 A ++G+A A + EA+K V GTL + +LFDSG++H Sbjct: 333 ------GAPHQGRAFATN---------RTEAEKAGT-VVTGTLPVLGHYALVLFDSGSSH 376 Query: 24 SFIS 13 SFIS Sbjct: 377 SFIS 380 >ref|XP_006598445.1| PREDICTED: uncharacterized protein LOC102661177 [Glycine max] Length = 670 Score = 102 bits (253), Expect = 5e-19 Identities = 89/382 (23%), Positives = 145/382 (37%), Gaps = 34/382 (8%) Frame = -3 Query: 1044 PGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWEST----DLD 877 P F G + W +E++F +E + ++V A+ L DEA WWE+T + Sbjct: 45 PPTFKGGYDPEGAEAWLREIEKIFRVMECQDHQKVLFATHMLADEAEYWWENTRPRLEGA 104 Query: 876 DSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEPKIT 697 V+ W F+ +++F ++ K EFL K + + V E A +F +L++Y P Sbjct: 105 GGVVVQWETFRQTFLEKYFPEDVKNRKEMEFLELKQESMTVAEYAARFENLVRYF-PHYQ 163 Query: 696 
SEA---QKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQRSMQ 526 EA K F + P++ + +H Q + + E T + Sbjct: 164 GEAGERSKCVKFVNGLRPEVKMMVNYHGIHNFAQLTNMCRIFDEDQREKTAFYRNANASH 223 Query: 525 GQSQATGFKRPAPP--------SNFEKSGKARTEMTPV------------------XXXX 424 G+ + A P N + + + PV Sbjct: 224 GKDKKPVTHNRAKPYSAPPGKYGNHSRGQRTSGGLQPVGGSSQPINRVSQSAGRSSGGSG 283 Query: 423 XXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQGSV- 247 +C CG GH EC + +CF C +GHL C ++ E + GS+ Sbjct: 284 APAIVTTPLRCGKCGRLGHIARECTDREVTCFNCQGKGHLNTSCPYPRR---EKRSGSLN 340 Query: 246 NQNTRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTLYLH 67 NQ+ RP+ G+ A+SG + E+ QG ++ Sbjct: 341 NQSGRPR----------------TTGRVFALSGADAAQSDEL----------IQGMCFIS 374 Query: 66 STSVCILFDSGATHSFISENCV 1 + +L+DSGATHSFIS CV Sbjct: 375 QVPLVVLYDSGATHSFISRVCV 396 >gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa] gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica Group] gi|31431495|gb|AAP53268.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1230 Score = 101 bits (251), Expect = 9e-19 Identities = 94/374 (25%), Positives = 149/374 (39%), Gaps = 24/374 (6%) Frame = -3 Query: 1050 MRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWES--TDLD 877 ++P F G ++ W ME+ F + T+ E++ A+ L+ A EWW++ Sbjct: 77 LKPPTFSGTANPLEAEEWIVAMEKSFEAMGCTDKEKIIYATYMLQSSAFEWWDAHKKSYS 136 Query: 876 DSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEPKIT 697 + + W +FK ++F +++ K KEFL K V E +F+ L ++A + Sbjct: 137 ERIFITWELFKEAFYKKYFPESVKRMKEKEFLELKQGNKSVAEYEIEFSRLARFAPEFVQ 196 Query: 696 SEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNRQRSMQGQS 517 ++ K F + + + + + V A LE K + QR GQ Sbjct: 197 TDGSKARRFESGLRQPLKRRVEAFELTIFREVVSKAQLLE-------KGYHEQRIEHGQP 249 Query: 516 QATGFKRPAPPSNFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKK 337 Q FK P + G +M +C C H CP Sbjct: 250 
QKK-FKTNNPQNQGRFRGNYSGQM------QRKSSENQGRKCPICQG-SHVPSICPNCWG 301 Query: 336 SCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETPSPSM----QTTAQNKG 169 CF CG GH + C L + + V+ T+P K TP PS+ ++A N G Sbjct: 302 RCFECGEAGHTRYQCPL-----LQKGKNRVSSTTQPNTK-VLTPVPSLYLPGPSSANNHG 355 Query: 168 ----------------KAPAVSGGQGQRLYEMRE--ADKPNAPVAQGTLYLHSTSVCILF 43 ++ GG R+Y + + A++ N V G + + S +LF Sbjct: 356 PNQGKPLANTNTTRGMRSNNSQGGNHARVYNLTKSTAEESNT-VVTGNVLICSYPGKVLF 414 Query: 42 DSGATHSFISENCV 1 DSGATHSFIS N V Sbjct: 415 DSGATHSFISTNFV 428 >gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum] Length = 4543 Score = 101 bits (251), Expect = 9e-19 Identities = 106/453 (23%), Positives = 185/453 (40%), Gaps = 51/453 (11%) Frame = -3 Query: 1218 VMLSKGDLGVLHVPHARGVSVPPEVPRDEEQESFAPKEGVIRNRGEYGRA---------- 1069 + LS + ++ P G P DE++ +AP ++++GE G+ Sbjct: 1477 IQLSTPESVIVMPPRRVGRGRLPRCYVDEQELPYAPG---VQDQGEIGQQRGARQEGADT 1533 Query: 1068 --LKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEW- 898 ++ M P +F G + + ++ ++++F + ++ E+V +A+ QL+D A W Sbjct: 1534 SRIREFLGMNPSSFTGSSTTEDLENFIEELKKIFDVMHMSDTERVELAAYQLKDVARTWF 1593 Query: 897 --WESTDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHL 724 W+ ++++P +W F+ FF R ++E K +EFL K + L V E + KF L Sbjct: 1594 DQWKGGRVENAPPASWACFEEAFLGHFFPRELKEVKVREFLTLKQESLSVHEYSLKFTQL 1653 Query: 723 LQYAEPKITSEAQKIWYFHEWMS---------------PDINPWMVHHTCSTLEQYVDAA 589 +YA + ++ F +S DI+ MV+ E+ D Sbjct: 1654 SRYAPEMVADMRNRMSLFVAGLSRLSSKEGRAAMLIGDMDISRLMVYVQQVEEEKLRDR- 1712 Query: 588 YKLEVTLEESTKRRNRQRSMQGQSQATGF----KRPAPPSNFEKSGKARTE--------- 448 E + K RN +G S + F K PA S + + R E Sbjct: 1713 ---EEFRNKRVKTRNESGQQRGNSNRSSFQQRQKGPATSSARAPAPRYRGEHNVQNSKDF 1769 Query: 447 -MTPV-XXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQT 274 +TP C CG H +C + + CF CG+EGH + C KQ+ Sbjct: 1770 KVTPAQSSGSVVRGGSSFPACAKCGRV-HPG-KCRQGQTCCFRCGQEGHFMKECPKNKQS 1827 Query: 273 ----GTESQQGSVNQNTRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREA-D 109 G+ +Q S+ +P M + A + +GG RLY + + Sbjct: 1828 
SEKLGSRAQSSSI------------SPLDRMASRG-----ATSSTGGGANRLYAITSRHE 1870 Query: 108 KPNAP-VAQGTLYLHSTSVCILFDSGATHSFIS 13 + N+P V G + + +V L D GA+ SF++ Sbjct: 1871 QENSPNVVTGMIKVFVFNVYALLDPGASLSFVT 1903 Score = 101 bits (251), Expect = 9e-19 Identities = 106/453 (23%), Positives = 185/453 (40%), Gaps = 51/453 (11%) Frame = -3 Query: 1218 VMLSKGDLGVLHVPHARGVSVPPEVPRDEEQESFAPKEGVIRNRGEYGRA---------- 1069 + LS + ++ P G P DE++ +AP ++++GE G+ Sbjct: 2987 IQLSTPESVIVMPPRRVGRGRLPRCYVDEQELPYAPG---VQDQGEIGQQRGARQEGADT 3043 Query: 1068 --LKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEW- 898 ++ M P +F G + + ++ ++++F + ++ E+V +A+ QL+D A W Sbjct: 3044 SRIREFLGMNPSSFTGSSTTEDLENFIEELKKIFDVMHMSDTERVELAAYQLKDVARTWF 3103 Query: 897 --WESTDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHL 724 W+ ++++P +W F+ FF R ++E K +EFL K + L V E + KF L Sbjct: 3104 DQWKGGRVENAPPASWACFEEAFLGHFFPRELKEVKVREFLTLKQESLSVHEYSLKFTQL 3163 Query: 723 LQYAEPKITSEAQKIWYFHEWMS---------------PDINPWMVHHTCSTLEQYVDAA 589 +YA + ++ F +S DI+ MV+ E+ D Sbjct: 3164 SRYAPEMVADMRNRMSLFVAGLSRLSSKEGRAAMLIGDMDISRLMVYVQQVEEEKLRDR- 3222 Query: 588 YKLEVTLEESTKRRNRQRSMQGQSQATGF----KRPAPPSNFEKSGKARTE--------- 448 E + K RN +G S + F K PA S + + R E Sbjct: 3223 ---EEFRNKRVKTRNESGQQRGNSNRSSFQQRQKGPATSSARAPAPRYRGEHNVQNSKDF 3279 Query: 447 -MTPV-XXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQT 274 +TP C CG H +C + + CF CG+EGH + C KQ+ Sbjct: 3280 KVTPAQSSGSVVRGGSSFPACAKCGRV-HPG-KCRQGQTCCFRCGQEGHFMKECPKNKQS 3337 Query: 273 ----GTESQQGSVNQNTRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREA-D 109 G+ +Q S+ +P M + A + +GG RLY + + Sbjct: 3338 SEKLGSRAQSSSI------------SPLDRMASRG-----ATSSTGGGANRLYAITSRHE 3380 Query: 108 KPNAP-VAQGTLYLHSTSVCILFDSGATHSFIS 13 + N+P V G + + +V L D GA+ SF++ Sbjct: 3381 QENSPNVVTGMIKVFVFNVYALLDPGASLSFVT 3413 Score = 97.8 bits (242), Expect = 9e-18 Identities = 96/400 (24%), Positives = 163/400 (40%), Gaps = 39/400 (9%) Frame = -3 Query: 1095 RNRGEYGRALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLR 916 R G ++ M P 
+F G + + ++ ++++F + ++ E+V +A+ QL+ Sbjct: 17 RQEGADTSRIREFLGMNPSSFTGSSTTEDLENFIEELKKIFDVMHMSDTERVELAAYQLK 76 Query: 915 DEASEW---WESTDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTEL 745 D A W W+ ++++P +W F+ FF R ++E K +EFL K + L V E Sbjct: 77 DVARTWFDQWKGGRVENAPPASWACFEEAFLGHFFPRELKEVKVREFLTLKQESLSVHEY 136 Query: 744 ATKFNHLLQYAEPKITSEAQKIWYFHEWMS---------------PDINPWMVHHTCSTL 610 + KF L +YA + ++ F +S DI+ MV+ Sbjct: 137 SLKFTQLSRYAPEMVADMRNRMSLFVAGLSRLSSKEGRAAMLIGDMDISRLMVYVQQVEE 196 Query: 609 EQYVDAAYKLEVTLEESTKRRNRQRSMQGQSQATGF----KRPAPPSNFEKSGKARTE-- 448 E+ D E + K RN +G S + F K PA S + + R E Sbjct: 197 EKLRDR----EEFRNKRVKTRNESGQQRGNSNRSSFQQRQKGPATSSARAPAPRYRGEHN 252 Query: 447 --------MTPV-XXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQF 295 +TP C CG H +C + + CF CG+EGH + Sbjct: 253 VQNSKDFKVTPAQSSGSVVRGGSSFPACAKCGRV-HPG-KCRQGQTCCFRCGQEGHFMKE 310 Query: 294 CRLGKQT----GTESQQGSVNQNTRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLY 127 C KQ+ G+ +Q S+ +P M + A + +GG RLY Sbjct: 311 CPKNKQSSEKLGSRAQSSSI------------SPPDRMASRG-----ATSSTGGGANRLY 353 Query: 126 EMREA-DKPNAP-VAQGTLYLHSTSVCILFDSGATHSFIS 13 + ++ N+P V G + + +V L D GA+ SF++ Sbjct: 354 AITSRHEQENSPNVVTGMIKVFVFNVYALLDPGASLSFVT 393 >emb|CAD41297.2| OSJNBa0020J04.2 [Oryza sativa Japonica Group] Length = 1537 Score = 101 bits (251), Expect = 9e-19 Identities = 94/349 (26%), Positives = 144/349 (41%), Gaps = 20/349 (5%) Frame = -3 Query: 999 WFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWES---TDLDDSPVLNWGMFKAKMTD 829 W +E+ V E ++V A QL AS+WW++ +D+ W F A + Sbjct: 6 WLRIIEKKLTLVRVRETDKVIFAVNQLEGPASDWWDTYKEARENDAGEPTWEEFTAAFRE 65 Query: 828 RFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLLQYAEPKITSEAQKIWYFHEWMSPD 649 F A+ K EF + + V E KF L +YA + E +KI F E ++ + Sbjct: 66 NFVPAAVMRMKKNEFRWLRQGNTTVQEYLNKFTQLARYAIGDLADEEEKIDKFIEGLNDE 125 Query: 648 INPWMVHHTCSTLEQYVDAAYKLEV---TLEESTKRR---NRQRSMQGQ----SQATGFK 499 + M+ + + ++ +LE T+E + KRR +R + Q + ++G+K Sbjct: 126 LRGPMIGQDHESFQSLINKVVRLENDQRTVEHNRKRRLAMSRLPQIVPQRLKGATSSGWK 185 Query: 498 
-------RPAPPSNFEKSGKARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNIECPKEK 340 RPA PSNF + A TP CFNCG YGH CP + Sbjct: 186 PPIVATNRPAAPSNFNRP-VAIQNRTPTPTLAAPGAKMNVN-CFNCGGYGHYANNCPHPR 243 Query: 339 KSCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETPSPSMQTTAQNKGKAP 160 K+ +TGT + +V T P +P TA G+ Sbjct: 244 KT----------------PVRTGTNAM--TVRGTTTPVTGRGLFKTPQSNRTATGLGR-- 283 Query: 159 AVSGGQGQRLYEMREADKPNAPVAQGTLYLHSTSVCILFDSGATHSFIS 13 GQ + E + + + G ++ST V +LFDSGA+HSFIS Sbjct: 284 ------GQVNHVRAEEAQEDQGILMGMFSINSTPVKVLFDSGASHSFIS 326 >gb|EOY16854.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 737 Score = 100 bits (250), Expect = 1e-18 Identities = 100/424 (23%), Positives = 172/424 (40%), Gaps = 38/424 (8%) Frame = -3 Query: 1170 RGVSVPPEVPRDEEQESFAPKEGVIRNRGEYGRALKVVNSMRPGNFDGKGESVKVAHWFS 991 R V P V + A + RG +L ++P F G S K + Sbjct: 97 RVVEGRPTVQESPSSQGQADHQHHEEERGHLDISLPDFLKLKPPTFSGSDASEKPQVFLD 156 Query: 990 HMERLFHNVEFTEAEQVHIASLQLRDEASEWWESTDLD---DSPVLNWGMFKAKMTDRFF 820 +E++ + + V + + QL D A EW+ S ++ L W F DRF Sbjct: 157 KVEKICKALGCSSVRSVELTAFQLEDVAQEWYSSLCRGRPTNATPLAWSEFSVAFLDRFL 216 Query: 819 SRAMREEKHKEF-LYPKTDGLKVTELATKFNHLLQYAEPKITSEAQKIWYFHEWMSPDIN 643 ++R + +EF +T + ++E KF L +YA ++++ KI F + + + Sbjct: 217 PLSVRNARAREFETLVQTSSMTMSEYDIKFTQLARYAPYLVSTKEMKIQRFVDGLVEPLF 276 Query: 642 PWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNR-----------QRSMQGQSQATGFKR 496 + +T VD A ++E+ ES R+R +R G ++ + Sbjct: 277 RAVASRDFTTYSAAVDRAQRIEMRTSESRAARDRAKRGKTEGYQGRRDFSGGGSSSSRQG 336 Query: 495 PAPPSNFEKSGK------ARTEMTPVXXXXXXXXXXXXXQCFNCGAYGHKNI-ECPKEKK 337 P S + G R +C YG ++ C K Sbjct: 337 PQRDSRLPQQGSDAPGANIRVGQRTFSSRRQQDSRQSSQVIRSCDTYGRRHSGRCFLTTK 396 Query: 336 SCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETPSPSMQTTAQNKGKA-- 163 +C+ CG+ GH+++ C + Q+ +S +GS T+P S + + ++G+ Sbjct: 397 TCYRCGQPGHIRRDCPMAHQS-PDSARGS----TQPASSAPSVTVSSGREVSGSRGRGAG 451 Query: 162 ------PAVSG-----GQGQ-RLYEM--READKPNAPVAQGTLYLHSTSVCILFDSGATH 25 P+ SG G+GQ R++ + +EA NA V G L + + + +LFD GATH Sbjct: 452 
TSSQGRPSGSGHQSSIGRGQARVFALTQQEAQTSNA-VVSGILSVCNINARVLFDPGATH 510 Query: 24 SFIS 13 SFIS Sbjct: 511 SFIS 514 >gb|AAX94938.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza sativa Japonica Group] gi|77550206|gb|ABA93003.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1436 Score = 100 bits (250), Expect = 1e-18 Identities = 99/381 (25%), Positives = 152/381 (39%), Gaps = 20/381 (5%) Frame = -3 Query: 1095 RNRGEYGRALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLR 916 R R +G ++ +P F E ++ W +E+ V EA++V A QL Sbjct: 43 RGRSSFGEFMRT----KPPTFTTADEPMEAEDWLRIIEKKLTLVRVREADKVIFAVNQLE 98 Query: 915 DEASEWWES---TDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTEL 745 A + W++ +D+ W F A + F A+ K EF + V E Sbjct: 99 GPAGDRWDTYKEAREEDAGEPTWEEFTAAFQENFVPAAVMRMKKNEFRRMRQGNTTVQEY 158 Query: 744 ATKFNHLLQYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEV--- 574 +F L +YA + E +KI F E ++ ++ M+ + + ++ +LE Sbjct: 159 LNRFTQLARYAIGDLADEEEKIDKFIEGLNDELRGPMIGQDHESFQSLINKVVRLENDQR 218 Query: 573 TLEESTKRR-NRQRSMQGQSQ------ATGFK-------RPAPPSNFEKSGKARTEMTPV 436 T+E + KRR R QG Q ++G+K RPA PSNF + + TP Sbjct: 219 TVEHNHKRRLAMNRPPQGVPQRLKGATSSGWKPPIVAPNRPAAPSNFNRPVVIQNR-TPT 277 Query: 435 XXXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQ 256 CFNCG YGH CP +K+ +TG + Sbjct: 278 PTLAAPGAKKNVD-CFNCGEYGHYANNCPHPRKT----------------PVRTGANAM- 319 Query: 255 GSVNQNTRPKFKPAETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGTL 76 +V T P +P T A G+GQ + E + + V G Sbjct: 320 -TVRGTTTPAAGRGLFKTPQTNRT--------ATGFGRGQVNHVRAEEAQEDQGVLMGMF 370 Query: 75 YLHSTSVCILFDSGATHSFIS 13 ++ST V +LFDSGA+HSFIS Sbjct: 371 SINSTPVKVLFDSGASHSFIS 391 >gb|ABA98459.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1470 Score = 100 bits (249), Expect = 1e-18 Identities = 102/382 (26%), Positives = 149/382 (39%), Gaps = 22/382 (5%) Frame = -3 Query: 1092 NRGEYGRALKVVNSMRPGNFDGKGESVKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRD 913 NRG G + +P F E 
++ W +E+ V EA++V A QL Sbjct: 42 NRG--GSSFGEFMRTKPPTFATADEPMEAEDWLRIIEKKLTLVRVREADKVIFAVNQLEG 99 Query: 912 EASEWWES---TDLDDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELA 742 A +WW++ D + W F A + F A+ K EF + V E Sbjct: 100 PAGDWWDTYKEAREDGAGEPTWEEFTAAFRENFVPTAVMRMKKNEFRRLRQGNTTVQEYL 159 Query: 741 TKFNHLLQYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLE---VT 571 KF L +YA + E +KI F E ++ ++ M+ + + ++ +LE T Sbjct: 160 NKFTQLARYAIGDLADEEEKIDKFIEGLNDELRGPMIGQDHESFQSLINKVVRLENDQRT 219 Query: 570 LEESTKRR-NRQRSMQGQSQ------ATGFK-------RPAPPSNFEKSGKARTEMTPVX 433 +E + KRR R Q Q +G+K RPA SNF + A TP Sbjct: 220 VEHNRKRRLAMSRPPQTMPQRLKGATPSGWKPPVMVTNRPAALSNFNRP-VALQNRTPT- 277 Query: 432 XXXXXXXXXXXXQCFNCGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQG 253 CFNCG YGH CP +K+ +TG + Sbjct: 278 PTLAAPGAKKNVDCFNCGKYGHYANNCPHPRKT----------------PVRTGANAM-- 319 Query: 252 SVNQNTRPKFKPA--ETPSPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNAPVAQGT 79 +V T P +TP P+ TT +G+ V + Q + V G Sbjct: 320 TVRGTTTPAVGRGLFKTPQPNRTTTGFGRGQVNHVRAEEAQE----------DQGVLMGM 369 Query: 78 LYLHSTSVCILFDSGATHSFIS 13 L+ST + +LFDSGA HSFIS Sbjct: 370 FSLNSTPIKVLFDSGALHSFIS 391 >gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. 
melo] Length = 871 Score = 100 bits (249), Expect = 1e-18 Identities = 94/365 (25%), Positives = 146/365 (40%), Gaps = 13/365 (3%) Frame = -3 Query: 1068 LKVVNSMRPGNFDGKGES-VKVAHWFSHMERLFHNVEFTEAEQVHIASLQLRDEASEWWE 892 L+ P FDG E + W S +E +F ++ E ++V A L D + WWE Sbjct: 332 LRDFRKYNPTTFDGSLEDPTRAQMWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTAWWE 391 Query: 891 STDL---DDSPVLNWGMFKAKMTDRFFSRAMREEKHKEFLYPKTDGLKVTELATKFNHLL 721 +T+ D + W FK +FFS ++R+ K +EFL + + V + +F+ L Sbjct: 392 TTERMLGGDVSQITWQQFKESFYAKFFSASLRDAKRQEFLNLEQGDMTVEQYDAEFDMLS 451 Query: 720 QYAEPKITSEAQKIWYFHEWMSPDINPWMVHHTCSTLEQYVDAAYKLEVTLEESTKRRNR 541 ++A I +EA + F + DI + +T + DA L + ++ S + R Sbjct: 452 RFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPAT---HADA---LRLAVDLSLQERAN 505 Query: 540 QRSMQGQSQATGFKR-------PAPPSNFEKSGKART-EMTPVXXXXXXXXXXXXXQCFN 385 G+ +G KR P P NF G+ R+ + P C Sbjct: 506 SSKTAGRGSTSGQKRKAEQQPVPVPQRNFRPGGEFRSFQQKPFEAGEAARGKPL---CTT 562 Query: 384 CGAYGHKNIECPKEKKSCFICGREGHLKQFCRLGKQTGTESQQGSVNQNTRPKFKPAETP 205 CG H C ++CF C +EGH C L + TG Sbjct: 563 CGK--HHLGRCLFGTRTCFKCRQEGHTADRCPL-RPTGI--------------------- 598 Query: 204 SPSMQTTAQNKGKAPAVSGGQGQRLYEMREADKPNA-PVAQGTLYLHSTSVCILFDSGAT 28 AQN+G + G R++ + A V GTL + +LFDSG++ Sbjct: 599 -------AQNQGAGAPLQG----RVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSS 647 Query: 27 HSFIS 13 HSFIS Sbjct: 648 HSFIS 652