BLASTX nr result
ID: Catharanthus22_contig00014972
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00014972
         (2648 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]  489   e-135
emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera]  435   e-119
gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobrom...  429   e-117
emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]  427   e-117
gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao]  397   e-107
gb|EMJ22494.1| hypothetical protein PRUPE_ppa024499mg, partial [...  395   e-107
gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea bat...  390   e-105
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]    387   e-104
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                377   e-101
gb|AAM94350.1| gag-pol polyprotein [Zea mays]                        376   e-101
gb|AAX95495.1| Retrotransposon gag protein, putative [Oryza sati...  375   e-101
gb|AAX96717.1| retrotransposon protein, putative, Ty3-gypsy sub-...  375   e-101
ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300...  372   e-100
gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus pe...  370   2e-99
gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus pe...  370   2e-99
gb|EMJ00160.1| hypothetical protein PRUPE_ppa020671mg, partial [...  369   3e-99
gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Ja...  369   4e-99
emb|CAE04927.2| OSJNBa0017P10.4 [Oryza sativa Japonica Group] gi...
368 8e-99 gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum] 367 1e-98 gb|AAQ56338.1| putative gag-pol polyprotein [Oryza sativa Japoni... 366 3e-98 >emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera] Length = 1521 Score = 489 bits (1260), Expect = e-135 Identities = 270/537 (50%), Positives = 347/537 (64%), Gaps = 10/537 (1%) Frame = -1 Query: 1583 QPTISLIMADRNRSQMTES---KRQDANLECFNCGLRGHYAWECLKKKNLHIGVE-PNDE 1416 QPT ++ + N+ + + S ++ DA CF CG GHYA C K LH VE P E Sbjct: 297 QPTSNVAHQNGNKGKNSMSNGDRKVDATPLCFKCGGHGHYAVVC-PTKGLHFCVEEPESE 355 Query: 1415 QETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQ-KKDWRGTTILQTL 1239 E+ KE + E + VVR +L+ PK + ++DWR +I QT Sbjct: 356 LESYLKKEETYNEDEVSEECDYYDGMTEGHSL--VVRPLLTIPKVKGEEDWRRISIFQTR 413 Query: 1238 VCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSGIPVSKRCLVTF 1059 + C +IIDGGSS+N+ S+ VEKLNL TE HP+P++VAW++ + IPVS RCLVTF Sbjct: 414 ISCHGRLCTMIIDGGSSLNIASQELVEKLNLKTERHPNPFRVAWVNDTSIPVSFRCLVTF 473 Query: 1058 THGT-YTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSE 882 G + +S+WC+V+ + ++HILLG PWL+DR+V+HDG E+TY+ N ++ +LRP Sbjct: 474 LFGKDFEESVWCEVLPIKVSHILLGRPWLFDRKVQHDGYENTYALIHNGRKKILRP---- 529 Query: 881 AMNNKRATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELK 702 M K N + + + E + +IF + ++V+ + E Y + Sbjct: 530 -MKEVPPIKKSNENAQPKKVLTMCQFENESKETXVIFALMARKVEEFKEQDKE-YPANAR 587 Query: 701 QLLVDL*DV----APEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELL 534 ++L D D+ P +LPPMR+IQHAID + G+ LPNL YRM+ EH ELKRQV+ELL Sbjct: 588 KILDDFSDLWPVELPNELPPMRDIQHAIDLIPGASLPNLPAYRMNPTEHAELKRQVDELL 647 Query: 533 DDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDST 354 G IRESLSPC VPALLTPKKD +WRMC D R INKIT+KYRFPIPRLDD LDMM S Sbjct: 648 TKGFIRESLSPCGVPALLTPKKDGSWRMCVDSRAINKITIKYRFPIPRLDDMLDMMVGSV 707 Query: 353 IYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPF 174 I+SKIDL Y+++RIR GDEWKT+FKTKDG YEWLVMPFGLTNAPSTFMR MTQVL+PF Sbjct: 708 IFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPF 767 Query: 173 
IGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 IGRF+VVYF + +HL+QVMR LR EK YINLKKC+FM SVVFLGF Sbjct: 768 IGRFVVVYFDDILIYSRSCEDHEEHLKQVMRTLRAEKFYINLKKCTFMSPSVVFLGF 824 Score = 88.2 bits (217), Expect(2) = 2e-18 Identities = 42/111 (37%), Positives = 68/111 (61%), Gaps = 1/111 (0%) Frame = -2 Query: 2179 EKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKE 2000 +K ++V F GK++P AF DW ++++ +FDW M + RKVR+ KL G A++WW E Sbjct: 86 KKVRLEVAEFYGKLNPTAFLDWIMSMEDYFDWYAMPENRKVRFVKAKLKGAARLWWHNIE 145 Query: 1999 FDLQLAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 G + TW+EMKL++K L Y+Q ++ +L +L+Q + +V EY Sbjct: 146 NQAHRTGQPPIDTWDEMKLKMKEHFLPTDYEQLMYTKLFSLKQGTKSVEEY 196 Score = 33.9 bits (76), Expect(2) = 2e-18 Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 13/90 (14%) Frame = -3 Query: 1782 FKQGLKPEIWN*MLTHQVNNVDDAFQLAYMMES--QKQPAKRFSSQVGE----------- 1642 +K GL+ EI M+ VDD +QLA +E + + ++ SSQ+G Sbjct: 220 YKAGLRMEIQLEMIAAHTYTVDDVYQLALKIEEGLKFRVSRHPSSQIGSTFSNRTTSKPL 279 Query: 1641 ATNTRKFTANIRGTNSTAATANNKFNNGRQ 1552 +T+ + + ++ G ++T T+N NG + Sbjct: 280 STSNFRTSIHVNGGDNTQPTSNVAHQNGNK 309 >emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera] Length = 1292 Score = 435 bits (1118), Expect = e-119 Identities = 229/430 (53%), Positives = 299/430 (69%), Gaps = 6/430 (1%) Frame = -1 Query: 1313 VVRRILSTPK-QQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTE 1137 VVR +L+ PK ++++DWR T+I QT + C +IIDGGSS+N+ S+ VEKLNL TE Sbjct: 306 VVRPLLTVPKVKREEDWRRTSIFQTRISCQGRLCTMIIDGGSSLNIASQELVEKLNLKTE 365 Query: 1136 PHPDPYKVAWIDSSGIPVSKRCLVTFTHGT-YTDSIWCDVILMTITHILLG*PWLYDREV 960 HP+P++VAW++ + IPVS RCLVTF G + +S+WC+V+ + ++HILLG PWL+DR V Sbjct: 366 RHPNPFRVAWVNDTSIPVSFRCLVTFLFGKDFEESVWCEVLPIKVSHILLGRPWLFDRXV 425 Query: 959 KHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*EIV*ERSKGG 780 +HDG E+TY+ N + +LRP+ + K D+ ++ S + E + +K Sbjct: 426 QHDGYENTYALIHNGCKTILRPMKEVSPIKK---SDENAQPKKVLSMCQFENESKETK-- 480 Query: 779 
LIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV----APEDLPPMREIQHAIDFVLGS 612 +IF + ++V+ + E Y ++++L D D P LPPMR++QHAID + G+ Sbjct: 481 VIFALMARKVEESKEQDKE-YPANVRKILDDFSDFWPTELPNQLPPMRDVQHAIDLIPGA 539 Query: 611 QLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRT 432 LPNL YRM+ EH ELKRQV+ELL G IRESLSP VPALLTPKKD +WRMC D R Sbjct: 540 SLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPYGVPALLTPKKDGSWRMCVDSRA 599 Query: 431 INKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYE 252 +NKIT+KYRFPIPRLDD LDMM S I+SKIDL Y+++RIR GDEWKT+FKTKDG YE Sbjct: 600 MNKITIKYRFPIPRLDDMLDMMVRSVIFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYE 659 Query: 251 WLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLR 72 WLVM FGLTNAPSTFMR MTQVL+PFIGRF+VVYF + +HL+QVM L+ Sbjct: 660 WLVMLFGLTNAPSTFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCEDHEEHLKQVMCTLK 719 Query: 71 REKLYINLKK 42 EK YINLKK Sbjct: 720 AEKFYINLKK 729 >gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 429 bits (1103), Expect = e-117 Identities = 239/531 (45%), Positives = 326/531 (61%), Gaps = 14/531 (2%) Frame = -1 Query: 1553 RNRSQMTESKR-QDANLECFNCGLRGHYAWECLKKK----NLHIGVEPNDEQETEEGKEV 1389 +N S + +KR ++++ CF CG +GH ++ C ++K L +EP ++ EE +E+ Sbjct: 234 QNSSGSSTNKRGSNSHIRCFTCGEKGHTSFACPQRKVNLAELGEELEPVYDEYKEEVEEI 293 Query: 1388 DFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTP-KQQKKDWRGTTILQTLVCCGNVTRK 1212 D VVRRI++T ++ +DW+ +I +T V C Sbjct: 294 DVYPAQGESL---------------VVRRIMTTTVNEEAEDWKRRSIFRTRVVCEGKVCD 338 Query: 1211 LIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDS 1035 L+IDGGS N++S+ V KL L T HP PYK+ W+ +PV+ +CLV FT G +D Sbjct: 339 LVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGDNSDD 398 Query: 1034 -IWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMN--NKR 864 CDV+ M + HIL+G PWLYD ++ H K +TYSF N K+ L PL E N + Sbjct: 399 EALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTLYPLREETKKSANHK 458 Query: 863 ATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL 684 +K R E E S+ G+++ V K +K+ + + Y E++QLL + Sbjct: 
459 ISKITRYLSAENFEA-------EGSEMGIMYALVTKHLKSDQMSKSPQYPTEIQQLLKEF 511 Query: 683 *DVAPEDLP----PMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIR 516 ++ EDLP P+R IQHAID V G+ LPNL YRM + E++RQVEEL + GL+R Sbjct: 512 GELFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRAEVQRQVEELFEKGLVR 571 Query: 515 ESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKID 336 ES SPCA PALL PKKD +WRMC D R INKIT+KYRFPIPRLD+ LD + S ++SKID Sbjct: 572 ESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRLDEMLDQLVGSRVFSKID 631 Query: 335 LTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLV 156 L Y+++R+R GDEWKTAFKT DG +EWLVMPFGL+NAPSTFMR M +VL+PF+ F+V Sbjct: 632 LKSGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAPSTFMRVMAEVLKPFLNSFVV 691 Query: 155 VYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 VYF + + HL+QV+ VL++E+LYINLKKCSFM VVFLGF Sbjct: 692 VYFDDILIYSHTKEKHLKHLRQVLEVLQKEQLYINLKKCSFMQPEVVFLGF 742 >emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera] Length = 1323 Score = 427 bits (1099), Expect = e-117 Identities = 234/480 (48%), Positives = 299/480 (62%), Gaps = 8/480 (1%) Frame = -1 Query: 1562 MADRNRSQMTES-----KRQDANLECFNCGLRGHYAWECLKKKNLHIGVE-PNDEQETEE 1401 +A +N ++ T S ++ D CF CG GHYA C K LH VE P E E+ Sbjct: 205 VAHKNGNKGTNSMSNGDRKVDVTPLCFKCGGHGHYAVVC-PTKGLHFRVEEPESELESYP 263 Query: 1400 GKEVDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQ-KKDWRGTTILQTLVCCGN 1224 +E + E + VVR +L+ PK + +KDWR T+I QT + C Sbjct: 264 KEEETYNEDEVSEECDYYDGMTEGHSL--VVRPLLTVPKVKGEKDWRXTSIFQTRISCQG 321 Query: 1223 VTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSGIPVSKRCLVTFTHGT- 1047 +IIDGGSS+N+ S+ VEKLNL TE HP+P++VAW++ + IP S RCL TF G Sbjct: 322 RLCTMIIDGGSSLNIASQELVEKLNLKTERHPNPFRVAWVNDTSIPXSFRCLXTFLFGKD 381 Query: 1046 YTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNK 867 + + +WC+V+ + ++HILLG PWL+DR V+HDG E+TY+ N ++ +LRP M Sbjct: 382 FEEFVWCEVLPIKVSHILLGRPWLFDRRVQHDGYENTYALIHNXRKKILRP-----MKEV 436 Query: 866 RATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVD 687 K N + + + E + +IF + ++V+ + E Y L Sbjct: 437 
PPIKKSNENAQPKKVLTMCQFENESKETKVIFALMARKVEEFKEQDKE-YPANL------ 489 Query: 686 L*DVAPEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESL 507 P LPPMR++QHAID + G+ LPNL YRM+ EH ELKRQV+ELL IRESL Sbjct: 490 -----PNQLPPMRDVQHAIDLIPGASLPNLXAYRMNPTEHXELKRQVDELLTKCFIRESL 544 Query: 506 SPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK 327 SPC VP LLTPKKD +WRMC D R INKIT KY+FPIPRLDD LDMM S I+SKIDL Sbjct: 545 SPCGVPTLLTPKKDGSWRMCVDSRAINKITTKYQFPIPRLDDMLDMMVGSVIFSKIDLRS 604 Query: 326 *YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF 147 Y+++R RLGDEWKT+FKTKDG YEWLVMPFGLTNAPSTFMR MTQVL+PFIGRF VVYF Sbjct: 605 GYHQIRXRLGDEWKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPFIGRFFVVYF 664 >gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao] Length = 1306 Score = 397 bits (1020), Expect = e-107 Identities = 220/503 (43%), Positives = 306/503 (60%), Gaps = 7/503 (1%) Frame = -1 Query: 1517 DANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXX 1338 ++++ CF CG GH ++ +++ + E +E +E++ I+ Sbjct: 249 NSHIRCFTCGENGHTSFAGPQRRVNLAELREELEPVYDEYEEIEEIDVYPAQGESL---- 304 Query: 1337 XXDTTFLSVVRRILSTP-KQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATV 1161 VVRR+++T ++ +DW+ +I +T V C L+IDGGS N++S+ V Sbjct: 305 --------VVRRVMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAV 356 Query: 1160 EKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHG-TYTDSIWCDVILMTITHILLG 987 KL L T HP PYK+ W+ +PV+ + LV FT G D CDV+ M + HIL+G Sbjct: 357 NKLKLPTNKHPYPYKIGWLKKGHEVPVTTQYLVKFTMGDNLDDEALCDVVPMDVGHILVG 416 Query: 986 *PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*E 807 PWLYD ++ H + +TYSF + K+ PL E K++ K S E Sbjct: 417 RPWLYDHDMVHKTEPNTYSFYNDNKRYTSYPLKEET---KKSANSKINKITGYLSVENFE 473 Query: 806 IV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLP----PMREIQ 639 E S+ G+++ V K +K+ + Y E++QLL + ++ EDLP P+R IQ Sbjct: 474 A--EGSEMGIMYALVTKHLKSDQMGKSPQYPTEIQQLLKEFGELFNEDLPKSLPPLRSIQ 531 Query: 638 HAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDET 459 HAID V G+ LPNL YRM + E++RQVEELL+ GL+RES SPCA 
PALL PKKD + Sbjct: 532 HAIDLVPGAALPNLPAYRMPPMQRVEVQRQVEELLEKGLVRESKSPCACPALLAPKKDGS 591 Query: 458 WRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTA 279 WRMC D R INKIT+KYRFPIPRLD+ LD + S ++SKIDL Y+++R+R GDEWKTA Sbjct: 592 WRMCVDSRAINKITIKYRFPIPRLDEMLDQLVGSRVFSKIDLKSEYHQIRMRDGDEWKTA 651 Query: 278 FKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDH 99 FKT DG +EWLVMPFGL+NAPSTFMR M +VL+PF+ F+VVYF + + H Sbjct: 652 FKTPDGLFEWLVMPFGLSNAPSTFMRVMAEVLKPFLNSFVVVYFDDILIYSHTKEKHLKH 711 Query: 98 LQQVMRVLRREKLYINLKKCSFM 30 L+QV+ VL++E+LYINLKKCSFM Sbjct: 712 LRQVLEVLQKEQLYINLKKCSFM 734 Score = 61.6 bits (148), Expect = 2e-06 Identities = 32/93 (34%), Positives = 51/93 (54%), Gaps = 1/93 (1%) Frame = -2 Query: 2125 FTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQLAGNYSV-TWEEMK 1949 + DW +L+ +F+W M++ RKV + +KL G A W E + TWE MK Sbjct: 53 YLDWEASLENYFEWKPMAENRKVLFVKLKLKGTALQWLKRVEEQRARQSKLKISTWEHMK 112 Query: 1948 LELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 +L+++ L Y EL+++ L+Q +MTV EY Sbjct: 113 SKLRKQFLPADYTMELYEKFHCLKQNNMTVEEY 145 >gb|EMJ22494.1| hypothetical protein PRUPE_ppa024499mg, partial [Prunus persica] Length = 1364 Score = 395 bits (1014), Expect = e-107 Identities = 223/490 (45%), Positives = 295/490 (60%), Gaps = 2/490 (0%) Frame = -1 Query: 1508 LECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXD 1329 +ECF+C +GH A C ++ L I +D + E VD +E + Sbjct: 287 IECFHCHAKGHIASRC-PQRTLTISASTDDHCDVEI---VDPLEGVYDPEIDDCFDDDIL 342 Query: 1328 TTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLN 1149 +SV+R I S W+ T+I T V C N T KL+ID GS+MNV+S++ V +LN Sbjct: 343 HQ-VSVMRCIYSA-LALLDSWKRTSIFHTYVPCNNQTCKLVIDSGSTMNVISKSAVTRLN 400 Query: 1148 LLTEPHPDPYKVAWIDSSGIPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYD 969 L EPHP P+ VAW+D + +PV++RCLV+ GT + I+ D++ M + H+LLG PWLYD Sbjct: 401 LKPEPHPHPFHVAWVDKTKLPVTERCLVSLKLGTCDEDIYLDLLPMNVAHVLLGRPWLYD 460 Query: 968 REVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*EIV*--E 795 V++ G+E+TY+F K I LRP + K +Q S + ++ E Sbjct: 461 
HCVQNCGRENTYTFQHEGKSITLRPANPAIKPTKTNITTSSPSQTGNVSGHQLALLSYGE 520 Query: 794 RSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLPPMREIQHAIDFVLG 615 K + +Q + L NE + L L P +LPPMR+IQHAID V G Sbjct: 521 FEKEKISAAPSYQQPEPLHQLLNEFSDVMLDDL--------PNELPPMRDIQHAIDLVPG 572 Query: 614 SQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCR 435 SQL NL YRM+ +E EL Q++ LLD G IR SLS CAVP LLTPKKD +WRMC D R Sbjct: 573 SQLLNLPHYRMNSSERAELNTQIQGLLDKGFIRHSLSSCAVPVLLTPKKDGSWRMCVDSR 632 Query: 434 TINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFY 255 INKITVKYRFPIPRL+ L+ + S +SKIDL Y+++RIR GDEWKTAFKT DG Y Sbjct: 633 AINKITVKYRFPIPRLEAMLEELAGSKWFSKIDLRSGYHQIRIREGDEWKTAFKTPDGLY 692 Query: 254 EWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVL 75 EWLVMPFG++NAPSTFMR MT VL+P+IG+FLVVYF + HL+ + +L Sbjct: 693 EWLVMPFGMSNAPSTFMRVMTHVLRPYIGKFLVVYFDDILIYSHSKEDHLQHLRTIFHML 752 Query: 74 RREKLYINLK 45 R+EKL++NLK Sbjct: 753 RQEKLFVNLK 762 >gb|AAV88076.1| putative retrotransposon polyprotein [Ipomoea batatas] Length = 1358 Score = 390 bits (1002), Expect = e-105 Identities = 236/556 (42%), Positives = 312/556 (56%), Gaps = 47/556 (8%) Frame = -1 Query: 1529 SKRQDANLECFNCGLRGHYAWECLK-KKNLHIGVEPNDEQETEEGKEVDFI---ERIXXX 1362 SK++ + + C+ C RGHYA EC KK L G + + E + + ER Sbjct: 308 SKQKVSTVTCYRCQGRGHYARECPNTKKILTTGKDEREYMSANESDDEELEPIGERQKDD 367 Query: 1361 XXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMN 1182 F VV + LST ++ + I T IIDGGS N Sbjct: 368 HSEEEVQEDDALHFNCVVHKALSTLVVLDQEEQRENIFYGKCKIPGATCSFIIDGGSCTN 427 Query: 1181 VVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTI 1005 V+SE V + + T HP PYK+ W++ G + V K+ L++ + G Y D + CDVI M Sbjct: 428 VISEDVVNAMKIPTIQHPQPYKLQWLNDDGELKVHKQALISISIGKYQDDVLCDVIPMHA 487 Query: 1004 THILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQ---- 837 HILLG PW YDR+ H GK + Y+ + K+ L PL+ + + N + K R + Sbjct: 488 CHILLGRPWQYDRDTLHHGKTNKYTIHKGGKKYTLTPLAPKEVYNLQVQSKKLREELAQK 547 Query: 836 
-----EETTSNSR*EIV*ERS--KGGLIFMAVVKQVKNLLNTNNE--------------- 723 +ETTS + I E+ K G+ + NLL T E Sbjct: 548 AKEAMKETTSGKQNTIAHEKKQRKEGMK-KDTTQSSHNLLMTKREVEQALRRGEGVFLLY 606 Query: 722 --DYSL----------ELKQLLVDL*DVAPEDLP----PMREIQHAIDFVLGSQLPNLLG 591 D+ L ++ LL + DV PE+LP P+R I+H ID + G+ LPN Sbjct: 607 PIDFCLNVIKSEIIPSDVSALLSEFADVFPEELPKGLPPIRGIEHQIDLIPGASLPNRPA 666 Query: 590 YRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVK 411 YR + E +E++RQV+ELL G I+ESLSPCAVP LL PKKD TWRMC DCR IN ITVK Sbjct: 667 YRTNPDEAKEIQRQVDELLQAGFIQESLSPCAVPVLLVPKKDGTWRMCVDCRAINNITVK 726 Query: 410 YRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFG 231 YR+PIPRLDD LD + + I+SKIDL + Y+++R++ GDEWKTAFKTK+G YEWLVMPFG Sbjct: 727 YRYPIPRLDDMLDELHGAKIFSKIDLRRGYHQIRMQKGDEWKTAFKTKNGLYEWLVMPFG 786 Query: 230 LTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYIN 51 LTNAPSTFMR M VL+ FIG+F+VVYF +D ++ I HL++V VLRRE+LY N Sbjct: 787 LTNAPSTFMRLMNHVLRNFIGKFVVVYFDDILIYSKDPQKHIIHLKEVFLVLRREQLYAN 846 Query: 50 LKKCSFMCSSVVFLGF 3 L+KC F SVVFLGF Sbjct: 847 LEKCYFGVESVVFLGF 862 >gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1713 Score = 387 bits (995), Expect = e-104 Identities = 224/563 (39%), Positives = 324/563 (57%), Gaps = 44/563 (7%) Frame = -1 Query: 1559 ADRNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNLHIG----VEPNDEQETEEGKE 1392 A S S + + ++CF CG RGH A EC + + + E E+E E+ +E Sbjct: 369 AANTSSTSVGSSTKSSGIQCFKCGGRGHVARECPNNRTIVVNDQGEYESTSEEEQEDSEE 428 Query: 1391 VDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRK 1212 + +E+ ++ VV +ILS ++ + + QT + K Sbjct: 429 ENNLEK---------DICEFESGAALVVTQILSVQMSDAENGQRHNLFQTRAKVQDKVVK 479 Query: 1211 LIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDS 1035 +IIDGGS N+ S+ VEKL L HP PY V W+++SG I +++R V F G Y D+ Sbjct: 480 VIIDGGSCHNLASKEMVEKLGLKLLKHPHPYHVQWLNNSGSIKIAQRVKVPFKIGEYIDT 539 Query: 1034 IWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLS----------- 888 + CDV MT+ H+LLG PW YDR H G+ + Y+ + K+++L+P++ Sbjct: 540 
MECDVAPMTVCHMLLGRPWQYDRSSLHCGRTNQYTIKWKGKELILKPMTPQQILAEHLQK 599 Query: 887 SEAMNNKRATKDKRRN---QEETTSNSR*EIV*ERSKG---GLIFMAVVKQVKN------ 744 S + N+ A + ++ N ++ S S + + K L+ +A ++++ Sbjct: 600 SSEVRNESAKEGQKNNLSAPHKSVSESHKPNMRDNKKREGENLVMIATKSEMRDVRRNPE 659 Query: 743 -----------LLNTNN-EDYSLELKQLLVDL*DVAPED----LPPMREIQHAIDFVLGS 612 LL+ N+ + ++L + DV PE+ LPP+R I+H ID + G+ Sbjct: 660 QVLFILVCKDTLLSANDLTSVPSVVARVLQEYEDVFPEETPVGLPPLRGIEHQIDLIPGA 719 Query: 611 QLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRT 432 LPN YR + E +E++RQV+ LLD G +RESLSPCAVP +L PKKD +WRMC DCR Sbjct: 720 TLPNRPAYRTNPEETKEIQRQVQALLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRA 779 Query: 431 INKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYE 252 IN ITV+YR PIPRLDD LD ++ S I+SKIDL ++++R+++GDEWKTAFKTK G YE Sbjct: 780 INNITVRYRHPIPRLDDMLDELSGSMIFSKIDLRSGFHQIRMKIGDEWKTAFKTKFGLYE 839 Query: 251 WLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLR 72 WLVMPFGLTNAPSTFMR M VL+ FIG+F+VVYF + + H+QQV+ VLR Sbjct: 840 WLVMPFGLTNAPSTFMRLMNHVLRAFIGKFVVVYFDDILIYSKTLEEHVAHIQQVLDVLR 899 Query: 71 REKLYINLKKCSFMCSSVVFLGF 3 +E+LY NL+KC+F VVFLGF Sbjct: 900 KEQLYANLEKCTFCTDQVVFLGF 922 Score = 73.6 bits (179), Expect = 4e-10 Identities = 36/112 (32%), Positives = 64/112 (57%), Gaps = 1/112 (0%) Frame = -2 Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003 + K +P F+G DP + W + + + F + S+ +KV ++ G A IWW Sbjct: 137 FGKLKFTMPKFEGGSDPEVYLTWELKVDKIFRLHNYSERKKVAMAALEFDGYALIWWEQM 196 Query: 2002 EFDLQLAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 + + AG V +W EMK E++ + + ++Y+++LFD+L NL+Q S++V EY Sbjct: 197 LNEREEAGQGDVRSWAEMKREMRARFVPKHYRRDLFDKLQNLKQGSLSVDEY 248 >gb|ADP20179.1| gag-pol polyprotein [Silene latifolia] Length = 1475 Score = 377 bits (969), Expect = e-101 Identities = 221/536 (41%), Positives = 308/536 (57%), Gaps = 18/536 (3%) Frame = -1 Query: 1556 DRNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNL------HIGVEP----NDEQET 1407 D+ ++ T K+ +C+ C GH+A EC K+ L H G + ++E E Sbjct: 291 
DKGKAAETSQKKTMPLKKCYQCQGYGHFAKECPTKRALSSFEVVHWGDDEILVCDEEVEG 350 Query: 1406 EEGKEVDFIERIXXXXXXXXXXXXXDTTFLSVVR-RILST-PKQQKKDWRGTTILQTLVC 1233 + +E D + LS+V R++ T P+ + D R Sbjct: 351 TDHEEDDVV---------------MPDAGLSLVTWRVMHTQPQPLEMDQRQQIFRSRCTI 395 Query: 1232 CGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFT 1056 G V LIIDGGS NV S +EKL+L T+ HP PYK+ W++ + V K+CLVTF+ Sbjct: 396 KGRVCN-LIIDGGSCTNVASSTLIEKLSLPTQDHPSPYKLRWLNKGAEVRVDKQCLVTFS 454 Query: 1055 HG-TYTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEA 879 G Y+D CDV+ M H+LLG PW +DR+ H G+++TY+F F ++++L PL Sbjct: 455 IGKNYSDEALCDVLPMDACHLLLGRPWEFDRDSVHHGRDNTYTFKFRSRKVILTPLPPVL 514 Query: 878 MNNKRATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQ 699 + +E + E++ E ++ + K V + N E+++ Sbjct: 515 KHT--TPPSMLEPSKEVLLINEAEMLQELKGDEDVYALIAKDV---VFGQNVSLPKEVQE 569 Query: 698 LLVDL*DVAPEDLP----PMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLD 531 LL DV P +LP P+R I+H IDF+ G+ LPN YR +EL++Q+ EL+ Sbjct: 570 LLQSYEDVFPNELPSGLPPLRGIEHQIDFIPGATLPNKAAYRSDPKATQELQQQIGELVS 629 Query: 530 DGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTI 351 G +RESLSPC+VPALL PKKD +WRMC D R IN IT+KYRFPIPRLDD LD ++ + + Sbjct: 630 KGFVRESLSPCSVPALLVPKKDGSWRMCTDSRAINNITIKYRFPIPRLDDILDELSGAQL 689 Query: 350 YSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFI 171 +SKIDL + Y+++RI+ GDEWKTAFKTK G YEWLVMPFGL+NAPSTFMR MT+VL+P++ Sbjct: 690 FSKIDLRQGYHQVRIKEGDEWKTAFKTKHGLYEWLVMPFGLSNAPSTFMRLMTEVLRPYL 749 Query: 170 GRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 GRF+VVYF + HLQ + LR KLY L+KCSFM + V FLGF Sbjct: 750 GRFVVVYFDDILVYSPSKEEHLKHLQVLFETLREHKLYGKLEKCSFMQNEVQFLGF 805 Score = 66.6 bits (161), Expect = 5e-08 Identities = 30/107 (28%), Positives = 60/107 (56%), Gaps = 1/107 (0%) Frame = -2 Query: 2167 IDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQ 1988 +++P+F G ++P DWF T++R F++ SD + + ++KL G A +W+ + + Sbjct: 90 VEIPDFHGSLNPEDLLDWFRTIERVFEFKGYSDGKAFKVAILKLKGYASLWYENLKNQRR 149 Query: 1987 
LAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 G + +W ++K +L K + + Y Q++F +LT L+Q + Y Sbjct: 150 RDGKEPIKSWLKLKKKLNEKFIPKEYTQDIFIKLTQLKQDQQPLESY 196 >gb|AAM94350.1| gag-pol polyprotein [Zea mays] Length = 1618 Score = 376 bits (965), Expect = e-101 Identities = 200/464 (43%), Positives = 280/464 (60%), Gaps = 27/464 (5%) Frame = -1 Query: 1313 VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEP 1134 +V+R+LS ++ + + T+ QT + +LIIDGGS N+ S VEKL L T+P Sbjct: 462 IVQRVLSAQMEKAEQNQRHTLFQTKCVIKERSCRLIIDGGSCNNLASSDMVEKLALTTKP 521 Query: 1133 HPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVK 957 HP PY + W+++SG + V+K + F G+Y D + CDV+ M +ILLG PW +D + Sbjct: 522 HPHPYHIQWLNNSGKVKVTKLVRINFAIGSYRDVVDCDVVPMDACNILLGRPWQFDSDCM 581 Query: 956 HDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQEETTSNSR*EIV*ERSKG-- 783 H G+ + YS + K+I+L P+S EA+ K + +T +N ++V G Sbjct: 582 HHGRSNQYSLIHHDKKIILLPMSPEAIVRDDVAK---ATKAKTENNKNIKVVGNNKDGIK 638 Query: 782 --GLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVA--------------------- 672 G +A V L + Y+L K L+ + D+ Sbjct: 639 LKGHCLLATKTDVNELFASTTVAYALVCKDALISIQDMQHSLPPVITNILQEYSDVFPSE 698 Query: 671 -PEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCA 495 PE LPP+R I+H ID + G+ LPN YR + E +E++RQV+ELLD G +RESLSPCA Sbjct: 699 IPEGLPPIRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVQELLDKGYVRESLSPCA 758 Query: 494 VPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYR 315 VP +L PKKD TWRMC DCR IN IT++YR PIPRLDD LD ++ + ++SK+DL Y++ Sbjct: 759 VPVILVPKKDGTWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGAIVFSKVDLRSGYHQ 818 Query: 314 LRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT 135 +R++LGDEWKTAFKTK G YEWLVMPFGLTNAPSTFMR M +VL+ FIG+F+VVYF Sbjct: 819 IRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRAFIGKFVVVYFDDIL 878 Query: 134 *I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 + +DH++ V LR +L+ NL+KC+F V FLG+ Sbjct: 879 IYSKSMDEHVDHMRAVFNALRDARLFGNLEKCTFCTDRVSFLGY 922 Score = 60.8 bits (146), Expect = 3e-06 Identities = 30/111 (27%), Positives = 55/111 (49%) Frame = 
-2 Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003 + K +P FDGK DP A+ W + + + F + + +VR + A +WW Sbjct: 144 FSKVKFKIPPFDGKYDPDAYITWEIAVDQKFACHEFPENARVRAATSEFTEFASVWWI-- 201 Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 E + N TW+ +K ++ + + YY +++ ++L LRQ + +V EY Sbjct: 202 EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDMLNKLQQLRQGTKSVEEY 252 >gb|AAX95495.1| Retrotransposon gag protein, putative [Oryza sativa Japonica Group] Length = 1739 Score = 375 bits (964), Expect = e-101 Identities = 214/560 (38%), Positives = 317/560 (56%), Gaps = 11/560 (1%) Frame = -1 Query: 1649 SGRLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDANLECFNCGLRGHYA 1470 +GR +P ++ T A +++ + S + +++C C GH Sbjct: 701 AGRTASPSSTPTTSRAAPPPSSDKSVTKAAQPAPSASSMVSTGRMRDVQCHRCKGFGHVQ 760 Query: 1469 WECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLS------VV 1308 +C K+ L V+ + E + + D + + + +V Sbjct: 761 RDCPSKRVLV--VKNDGEYSSASDFDDDTLALLAADHADNEPPEEHIGAAFADHYESLIV 818 Query: 1307 RRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHP 1128 +R+LS ++ + + T+ QT ++IIDGGS N+ S VEKL L T+PHP Sbjct: 819 QRVLSAQMEKAEQNQRHTLFQTKCVLKERCCRMIIDGGSCNNLASSEMVEKLALSTKPHP 878 Query: 1127 DPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVKHD 951 PY + W+++SG + V+K + F G Y D + CDV+ M +ILLG PW +DR+ H Sbjct: 879 HPYYIQWLNNSGKVKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDRDSMHH 938 Query: 950 GKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTSNSR*-EIV*ERSKG 783 G+ + YSF ++ K+IVL P+SSE + + +A K K + ++ S+ + E + + K Sbjct: 939 GRSNQYSFLYHDKKIVLHPMSSEDILRDDVAKAAKSKCESDKKAQSDGKKPETINLKPK- 997 Query: 782 GLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLPPMREIQHAIDFVLGSQLP 603 +A + L+ + + Y+LE + P LPP+R I+H ID + G+ LP Sbjct: 998 --CLLATKSDITELIASPSVAYALEYSDVFPK---EVPPGLPPVRGIEHQIDLIPGASLP 1052 Query: 602 NLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINK 423 N YR + E +E++RQV ELLD G +RESLSPCAVP +L PKKD +WRMC DCR IN Sbjct: 1053 NRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINN 1112 Query: 
422 ITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLV 243 IT++YR PIPRLDD LD ++ S ++SK+DL Y+++R++LGDEWKT FKTK G YEWLV Sbjct: 1113 ITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQIRMKLGDEWKTTFKTKFGLYEWLV 1172 Query: 242 MPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREK 63 MPFGLTNAPSTFMR M +VL+PFIG+F+VVYF + +HL+ V LR + Sbjct: 1173 MPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDAR 1232 Query: 62 LYINLKKCSFMCSSVVFLGF 3 L+ NL+KC+F V FLG+ Sbjct: 1233 LFGNLEKCTFCTDRVSFLGY 1252 Score = 64.7 bits (156), Expect = 2e-07 Identities = 32/111 (28%), Positives = 55/111 (49%) Frame = -2 Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003 + K +P FDGK DP AF W + + + F + + + +VR + A +WW Sbjct: 508 FSKIKFKIPPFDGKYDPDAFLSWEIAVDQKFAYHEFPENTRVRAATSEFTDFASVWWI-- 565 Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 E + N TW+ +K ++ + + YY ++L + L LRQ + +V EY Sbjct: 566 EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 616 >gb|AAX96717.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza sativa Japonica Group] gi|108864301|gb|ABA93040.2| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1748 Score = 375 bits (964), Expect = e-101 Identities = 214/560 (38%), Positives = 317/560 (56%), Gaps = 11/560 (1%) Frame = -1 Query: 1649 SGRLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDANLECFNCGLRGHYA 1470 +GR +P ++ T A +++ + S + +++C C GH Sbjct: 710 AGRTASPSSTPTTSRAAPPPSSDKSVTKAAQPAPSASSMVSTGRMRDVQCHRCKGFGHVQ 769 Query: 1469 WECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLS------VV 1308 +C K+ L V+ + E + + D + + + +V Sbjct: 770 RDCPSKRVLV--VKNDGEYSSASDFDDDTLALLAADHADNEPPEEHIGAAFADHYESLIV 827 Query: 1307 RRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHP 1128 +R+LS ++ + + T+ QT ++IIDGGS N+ S VEKL L T+PHP Sbjct: 828 QRVLSAQMEKAEQNQRHTLFQTKCVLKERCCRMIIDGGSCNNLASSEMVEKLALSTKPHP 887 Query: 1127 DPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVKHD 951 PY + W+++SG + V+K + F 
G Y D + CDV+ M +ILLG PW +DR+ H Sbjct: 888 HPYYIQWLNNSGKVKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDRDSMHH 947 Query: 950 GKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTSNSR*-EIV*ERSKG 783 G+ + YSF ++ K+IVL P+SSE + + +A K K + ++ S+ + E + + K Sbjct: 948 GRSNQYSFLYHDKKIVLHPMSSEDILRDDVAKAAKSKCESDKKAQSDGKKPETINLKPK- 1006 Query: 782 GLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDLPPMREIQHAIDFVLGSQLP 603 +A + L+ + + Y+LE + P LPP+R I+H ID + G+ LP Sbjct: 1007 --CLLATKSDITELIASPSVAYALEYSDVFPK---EVPPGLPPVRGIEHQIDLIPGASLP 1061 Query: 602 NLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINK 423 N YR + E +E++RQV ELLD G +RESLSPCAVP +L PKKD +WRMC DCR IN Sbjct: 1062 NRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINN 1121 Query: 422 ITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLV 243 IT++YR PIPRLDD LD ++ S ++SK+DL Y+++R++LGDEWKT FKTK G YEWLV Sbjct: 1122 ITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQIRMKLGDEWKTTFKTKFGLYEWLV 1181 Query: 242 MPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREK 63 MPFGLTNAPSTFMR M +VL+PFIG+F+VVYF + +HL+ V LR + Sbjct: 1182 MPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDAR 1241 Query: 62 LYINLKKCSFMCSSVVFLGF 3 L+ NL+KC+F V FLG+ Sbjct: 1242 LFGNLEKCTFCTDRVSFLGY 1261 Score = 64.7 bits (156), Expect = 2e-07 Identities = 32/111 (28%), Positives = 55/111 (49%) Frame = -2 Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003 + K +P FDGK DP AF W + + + F + + + +VR + A +WW Sbjct: 517 FSKIKFKIPPFDGKYDPDAFLSWEIAVDQKFAYHEFPENTRVRAATSEFTDFASVWWI-- 574 Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 E + N TW+ +K ++ + + YY ++L + L LRQ + +V EY Sbjct: 575 EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 625 >ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300012 [Fragaria vesca subsp. 
vesca] Length = 1034 Score = 372 bits (954), Expect = e-100 Identities = 216/491 (43%), Positives = 294/491 (59%), Gaps = 14/491 (2%) Frame = -1 Query: 1433 VEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTT 1254 +E +DEQ EE +E + +E + V +R+L + KQ+ + + Sbjct: 397 IEGDDEQHEEE-EEDEVVEEAEEYSGDDRE-------YNLVTQRLLCSTKQENQRH---S 445 Query: 1253 ILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSS-GIPVSK 1077 I ++ LIID GS N VS+ VE NLLT H PY + WI + +++ Sbjct: 446 IFRSTCTIKEKPMSLIIDSGSCENFVSKKVVEHFNLLTMKHRAPYAIGWIKKGLEVRITE 505 Query: 1076 RCLVTFTHGT-YTDSIWCDVILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVL 900 C V+ + G Y D + CDV+ M +H+LLG PW +D H+G+E+T SF + K I L Sbjct: 506 TCKVSISIGKFYQDEVECDVVDMDASHVLLGKPWQHDVNTIHNGRENTVSFIWEKHHITL 565 Query: 899 RP------LSSEAMNNKRATKDKRRNQEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLL 738 +P L S +N + EE ++ I+ VV++V + Sbjct: 566 KPKTKPTNLVSPKESNFLIVAEPCEKVEELVKDAE-----------AIYPLVVREVM-VA 613 Query: 737 NTNNEDYSL--ELKQLLVD----L*DVAPEDLPPMREIQHAIDFVLGSQLPNLLGYRMSL 576 N E+ + E++QLL D L D P +LPPMR+IQH ID V G+ LPNL YRMS Sbjct: 614 EDNKEEKKIPKEVQQLLQDFEELLADDLPNELPPMRDIQHQIDLVSGASLPNLPHYRMSP 673 Query: 575 AEHEELKRQVEELLDDGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPI 396 E+E LK ++EELL G IRES+SPCAVP LL PKKD +WRMC D R INKIT+KYRFPI Sbjct: 674 KENEILKEKIEELLRKGHIRESMSPCAVPVLLVPKKDRSWRMCVDSRAINKITIKYRFPI 733 Query: 395 PRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAP 216 P+L+D LD++ S ++SKIDL Y+++RI+LGDEWKTAFK+KDG YEWLVMPFGL+NAP Sbjct: 734 PQLEDMLDVLGGSVVFSKIDLRSGYHQIRIKLGDEWKTAFKSKDGLYEWLVMPFGLSNAP 793 Query: 215 STFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCS 36 STFMR M QVL+P+IG +VVYF + + HL++V+ VL+ KLY+NLKKCS Sbjct: 794 STFMRVMNQVLKPYIGTCVVVYFDDILIYSKSKEEHLQHLRKVLEVLQENKLYVNLKKCS 853 Query: 35 FMCSSVVFLGF 3 FM ++FLG+ Sbjct: 854 FMTKKLLFLGY 864 >gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica] Length = 1485 Score = 370 bits (950), Expect = 2e-99 Identities = 216/519 (41%), Positives = 302/519 (58%), Gaps = 2/519 
(0%) Frame = -1 Query: 1553 RNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIER 1374 RN+SQ +K C+ C GH + C + K + E ++++E +E E D+ Sbjct: 336 RNQSQNLYAKPMTDI--CYRCQKPGHRSNVCPELKQANFIEEADEDEENDEVGENDYA-- 391 Query: 1373 IXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGG 1194 V++R+L P+++ + +I ++L N +I+D G Sbjct: 392 -----GAEFAVEEGMEKITLVLQRVLLAPREEGQRH---SIFRSLCSIKNKVCDVIVDNG 443 Query: 1193 SSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSS-GIPVSKRCLVTFTHGT-YTDSIWCDV 1020 S N VS+ VE L L TEPH PY + W+ + V++ C V + G Y D + CDV Sbjct: 444 SCENFVSKKLVEYLQLSTEPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDEVLCDV 503 Query: 1019 ILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRN 840 I M HILLG PW +D + G+++ F++N ++I + + +K + + K R+ Sbjct: 504 IDMDACHILLGRPWQFDVDATFKGRDNVILFSWNNRKIAM----TTTQPSKPSVEVKTRS 559 Query: 839 QEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DVAPEDL 660 T S + + E K + + V+ +L+ E +S L P +L Sbjct: 560 SSFLTLISNEQELNEAVKEAEGEGDIPQDVQQILSQFQELFSENL-----------PNEL 608 Query: 659 PPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCAVPALL 480 PPMR+IQH ID V G+ L NL YRMS E++ L+ Q+EELL G IRESLSPCAVP LL Sbjct: 609 PPMRDIQHRIDLVPGASLQNLPHYRMSPKENDILREQIEELLRKGFIRESLSPCAVPVLL 668 Query: 479 TPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRL 300 PKKD+TWRMC D R INKITVKYRFPIPRL+D LD+++ S ++SKIDL Y+++RIR Sbjct: 669 VPKKDKTWRMCVDSRAINKITVKYRFPIPRLEDMLDVLSGSKVFSKIDLRSGYHQIRIRP 728 Query: 299 GDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QD 120 GDEWKTAFK+KDG +EWLVMPFGL+N PSTFMR M QVL+PFIG F+VVYF Sbjct: 729 GDEWKTAFKSKDGLFEWLVMPFGLSNTPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSTT 788 Query: 119 GRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 + HL+QV+ VLR KL++NLKKC+F + ++FLGF Sbjct: 789 KEEHLVHLRQVLDVLRENKLFVNLKKCTFCTNKLLFLGF 827 >gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica] Length = 1493 Score = 370 bits (949), Expect = 2e-99 Identities = 217/528 (41%), Positives = 301/528 (57%), Gaps = 11/528 (2%) Frame = -1 Query: 1553 
RNRSQMTESKRQDANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIER 1374 RN+SQ +K C+ C GH + C ++K + E ++++E +E E D+ Sbjct: 347 RNQSQNPYAKPMTDI--CYRCQKPGHRSNVCPERKQANFIEEADEDEEKDEVGENDYA-- 402 Query: 1373 IXXXXXXXXXXXXXDTTFLSVVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGG 1194 V++R+L PK++ + I ++L N +I+D G Sbjct: 403 -----GAEFAVEEGIEKITLVLQRVLLAPKEEGQRHN---IFRSLCSIKNKVCDVIVDNG 454 Query: 1193 SSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSS-GIPVSKRCLVTFTHGT-YTDSIWCDV 1020 S N VS+ VE L L TEPH PY + W+ + V++ C V + G Y D + CDV Sbjct: 455 SCENFVSKKLVEYLQLSTEPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDDVLCDV 514 Query: 1019 ILMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRN 840 I M HILLG PW +D + G+++ F++N ++I + AT R Sbjct: 515 IDMDACHILLGRPWQFDVDATFKGRDNVILFSWNNRKIAM------------ATTQPSRK 562 Query: 839 QEETTSNSR*EIV*ERSKGGLIFMAVVKQVKNLLNTNNE-----DYSLELKQLLVDL*DV 675 QE +S+ F+ ++ + L E D +++Q+L ++ Sbjct: 563 QELRSSS---------------FLTLISNEQELNEAVKEAEGEGDIPQDVQQILSQFQEL 607 Query: 674 A----PEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESL 507 P +LPPMR+IQH ID V G+ LPNL YRMS E++ L+ Q+EELL G IRESL Sbjct: 608 LSENLPNELPPMRDIQHRIDLVHGASLPNLPHYRMSPKENDILREQIEELLRKGFIRESL 667 Query: 506 SPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK 327 SPCAVP LL PKKD+TWRMC D R +NKI VKYRF IPRL+D LD+++ S ++SKIDL Sbjct: 668 SPCAVPVLLVPKKDKTWRMCVDSRAVNKIKVKYRFSIPRLEDILDVLSGSKVFSKIDLRS 727 Query: 326 *YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF 147 Y+++RIR GDEWKTAFK+KDG +EWLVMPFGL+NAPSTFMR M QVL+PFIG F+VVYF Sbjct: 728 GYHQIRIRPGDEWKTAFKSKDGLFEWLVMPFGLSNAPSTFMRLMNQVLRPFIGSFVVVYF 787 Query: 146 *GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 + HL+QV+ VLR KLY+NLKKC+F + ++FLGF Sbjct: 788 DDILIYSTTKEEHLVHLRQVLDVLRENKLYVNLKKCTFCTNKLLFLGF 835 >gb|EMJ00160.1| hypothetical protein PRUPE_ppa020671mg, partial [Prunus persica] Length = 1460 Score = 369 bits (948), Expect = 3e-99 Identities = 197/408 (48%), Positives = 257/408 (62%), Gaps = 10/408 (2%) Frame = -1 Query: 1196 
GSSMNVVSEATVEKLNLLTEPHPDPYKVAWIDSSGIPVSKRCLVTFTHGTYTDSIWCDVI 1017 GS+MNV+S++ V +LNL EPHP P+ VAW+D + +PV++ CLV+ GT + I+ D + Sbjct: 436 GSTMNVISKSAVTRLNLKPEPHPHPFHVAWVDKTKLPVTEWCLVSLKLGTCDEDIYLDQL 495 Query: 1016 LMTITHILLG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAMNNKRATKDKRRNQ 837 M + H+LLG PWLYD V++ G+E+TY+F K I+LRP + K +Q Sbjct: 496 PMNVAHVLLGRPWLYDHRVQNCGRENTYTFQHEGKSIMLRPANPAIKPTKTNITTSSPSQ 555 Query: 836 EETTSNSR*------EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV 675 S R E E + G++F V+K++ + + L Q L + DV Sbjct: 556 TGNMSGHRLALLSYGEFEKESLETGVVFALVIKEISAAPSYQQPE---PLHQFLNEFSDV 612 Query: 674 APEDLP----PMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESL 507 P+DLP PMR+IQHAID V GSQLPNL YRM+ +EH EL Q++ LLD G IR SL Sbjct: 613 MPDDLPNELPPMRDIQHAIDLVPGSQLPNLPHYRMNSSEHAELNTQIQGLLDKGFIRHSL 672 Query: 506 SPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK 327 SPCAVP L TPKKD +WRMC D R INKIT D LD + S +SKIDL Sbjct: 673 SPCAVPVLFTPKKDGSWRMCVDSRAINKIT-----------DMLDELAGSKWFSKIDLHS 721 Query: 326 *YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF 147 Y+++RIR GDEWKTAFKT DG YEWLVMPFG++NAPSTFMR MT V +P+IG+FLVVYF Sbjct: 722 GYHQIRIREGDEWKTAFKTPDGLYEWLVMPFGMSNAPSTFMRVMTHVFRPYIGKFLVVYF 781 Query: 146 *GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 + HL+ + +LR+EKL++NLKKCSF+ V+FLGF Sbjct: 782 DDILIYSHSKEDHLQHLRTIFHMLRQEKLFVNLKKCSFLQEQVLFLGF 829 Score = 79.7 bits (195), Expect(3) = 7e-16 Identities = 35/88 (39%), Positives = 55/88 (62%), Gaps = 1/88 (1%) Frame = -2 Query: 2158 PNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQLAG 1979 P+FDG+ DP F DW ++ +F+W DMSD +++R+ +KLVG + +W E LQ G Sbjct: 198 PDFDGRGDPTLFVDWISAMEDYFEWDDMSDAQRIRFAKLKLVGAVKQYWKATEHHLQQLG 257 Query: 1978 NYSV-TWEEMKLELKRKNLLRYYQQELF 1898 V W+EMKL+L+ + L +Y Q+ + Sbjct: 258 QTPVILWDEMKLKLREQYLPSFYLQDYY 285 Score = 28.5 bits (62), Expect(3) = 7e-16 Identities = 21/91 (23%), Positives = 42/91 (46%) Frame = -3 Query: 1845 KFKEMKICF*GAEDSR*TLS*FKQGLKPEIWN*MLTHQVNNVDDAFQLAYMMESQKQPAK 1666 +F E K+ E+ T+S 
F GL+ +I + + + ++DA+ A E+ +P + Sbjct: 287 RFVEHKLHSALQEELAVTVSRFIHGLRIDIKREVSRSRPDVLEDAYCQALEAETYLRPQR 346 Query: 1665 RFSSQVGEATNTRKFTANIRGTNSTAATANN 1573 R+ G+ T T + G + + +N Sbjct: 347 RYPGYPGQPTTTNQARTTTSGLKTEFSEPSN 377 Score = 24.6 bits (52), Expect(3) = 7e-16 Identities = 11/38 (28%), Positives = 21/38 (55%) Frame = -1 Query: 1517 DANLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETE 1404 ++++E F+C +GH A C ++ L I +D + E Sbjct: 391 NSHIEWFHCHAKGHIASRC-PQRTLTISASTDDHCDVE 427 >gb|AAK51582.1|AC022352_18 Putative retroelement [Oryza sativa Japonica Group] gi|31431012|gb|AAP52850.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 2447 Score = 369 bits (947), Expect = 4e-99 Identities = 200/464 (43%), Positives = 282/464 (60%), Gaps = 27/464 (5%) Frame = -1 Query: 1313 VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEP 1134 +V+R+LS ++ + + T+ QT ++IIDGGS N+ S VEKL L T+P Sbjct: 459 IVQRVLSAQMEKAEQNQRHTLFQTKCVVKERCCRMIIDGGSCNNLASSEMVEKLALSTKP 518 Query: 1133 HPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVK 957 HP PY + W+++SG V+K + F G Y D + CDV+ M +ILLG PW +DR+ Sbjct: 519 HPHPYYIQWLNNSGKAKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDRDSM 578 Query: 956 HDGKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTSNSR*-EIV*ERS 789 H G+ + YSF ++ K+IVL +S E + + +A K K + ++ S+ + E + + Sbjct: 579 HHGRSNQYSFLYHDKKIVLHSMSPEDILRDDVAKAAKSKCESDKKAQSDGKKPETINLKP 638 Query: 788 KGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV---------------------- 675 K +A + L+ + + Y+L K L+ L D+ Sbjct: 639 K---CLLATKSDINELIASPSVAYALVCKDALISLHDMQHSLPPAVANILQEYSDVFPKE 695 Query: 674 APEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCA 495 P LPP+R I+H ID + G+ LPN YR + E +E++RQV ELLD G +RESLSPCA Sbjct: 696 VPPGLPPVRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCA 755 Query: 494 VPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYR 315 VP +L PKKD +WRMC DCR IN IT++YR PIPRLDD LD ++ S ++SK+DL Y++ Sbjct: 756 
VPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQ 815 Query: 314 LRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT 135 +R++LGDEWKTAFKTK G YEWLVMPFGLTNAPSTFMR M +VL+PFIG+F+VVYF Sbjct: 816 IRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDIL 875 Query: 134 *I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 + +HL+ V LR +L+ NL+KC+F V FLG+ Sbjct: 876 IYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFCTDRVSFLGY 919 Score = 68.2 bits (165), Expect = 2e-08 Identities = 49/151 (32%), Positives = 72/151 (47%), Gaps = 5/151 (3%) Frame = -1 Query: 443 DCRTINKIT---VKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYRLRIRLGDEWKT--A 279 D TI+ T +++ PI R R D ++D +++ +GD+ T + Sbjct: 1563 DINTIDTSTSPHIQHDGPITRARARQLNYQDECPRGRVDA---HHKFAATIGDDRATNPS 1619 Query: 278 FKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT*I*QDGRRTIDH 99 G YE+ VM FGLTNAP+ FM M +V ++ +F+VV+ Q H Sbjct: 1620 GFASYGLYEFTVMSFGLTNAPAFFMNLMNKVFMEYLDKFVVVFIDDILVYSQSEEDHQHH 1679 Query: 98 LQQVMRVLRREKLYINLKKCSFMCSSVVFLG 6 L+ V+ LR +LY L KC F S V FLG Sbjct: 1680 LRLVLGKLREHQLYAKLSKCEFWLSEVKFLG 1710 Score = 62.0 bits (149), Expect = 1e-06 Identities = 31/111 (27%), Positives = 54/111 (48%) Frame = -2 Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003 + K +P FDGK DP A+ W + + + F + + +VR + A +WW Sbjct: 150 FSKIKFKIPPFDGKYDPDAYLSWEIAVDQKFACHEFPESTRVRAATSEFTDFASVWWI-- 207 Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 E + N TW+ +K ++ + + YY ++L + L LRQ + +V EY Sbjct: 208 EHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 258 >emb|CAE04927.2| OSJNBa0017P10.4 [Oryza sativa Japonica Group] gi|38345441|emb|CAE03293.2| OSJNBb0046P18.9 [Oryza sativa Japonica Group] Length = 1134 Score = 368 bits (944), Expect = 8e-99 Identities = 200/464 (43%), Positives = 286/464 (61%), Gaps = 27/464 (5%) Frame = -1 Query: 1313 VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEP 1134 +V+R+LST ++ + + T+ QT ++IIDGGS N+ S VEKL L T+P Sbjct: 642 
IVQRVLSTQMEKAEQNQRHTLFQTKCVVKERCCRMIIDGGSCNNLASSEMVEKLALSTKP 701 Query: 1133 HPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVK 957 HP PY + W+++SG V+K + F G Y D + CDV+ M +ILLG PW +D++ Sbjct: 702 HPHPYYIQWLNNSGKAKVTKLVHINFAIGNYHDVVECDVVPMQACNILLGRPWQFDKDSL 761 Query: 956 HDGKESTYSFNFNKKQIVLRPLSSEAMNNK---RATKDKRRNQEETTSNSR*-EIV*ERS 789 H G+ + YSF ++ K+IVL P+SSE + + +A K K + ++ S+ + E + + Sbjct: 762 HHGRSNQYSFLYHDKKIVLHPMSSEDILHDDVAKAAKSKCESDKKAQSDGKKPETINLKP 821 Query: 788 KGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*D------------------VAPED 663 K +A + L+ + + Y+L K L+ L D V P++ Sbjct: 822 K---CLLATKSDINELIASPSVAYALVCKDALISLHDMQHSLPPAIANILQEYSDVFPKE 878 Query: 662 LPP----MREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSPCA 495 +PP + I+H ID +LG+ LPN YR + E +E++RQV ELLD G +RESLSPCA Sbjct: 879 VPPGLLPVHGIEHQIDLILGASLPNRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPCA 938 Query: 494 VPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*YYR 315 VP +L PKKD +WRMC DCR IN IT++YR PIPRLDD LD ++ S ++SK+DL Y++ Sbjct: 939 VPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYHQ 998 Query: 314 LRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*GHT 135 +R++LGDEWKTAFKTK G YEWLVMPFGLTNAP+TFMR M +VL+PFI +F+VVYF Sbjct: 999 IRMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPNTFMRLMNEVLRPFIEKFVVVYFDDIL 1058 Query: 134 *I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 + +HL+ V LR +L+ NL+KC+F V FLG+ Sbjct: 1059 IYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFCIDRVSFLGY 1102 Score = 63.9 bits (154), Expect = 3e-07 Identities = 32/111 (28%), Positives = 55/111 (49%) Frame = -2 Query: 2182 YEKCAIDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGK 2003 + K +P FDGK DP A+ W + + + F + + +VR + A +WW Sbjct: 333 FSKIKFKIPPFDGKYDPDAYLSWEIAVDQKFACHEFPENTRVRAATSEFTDFASVWWI-- 390 Query: 2002 EFDLQLAGNYSVTWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 E + N S TW+ +K ++ + + YY ++L + L LRQ + +V EY Sbjct: 391 EHGKKNPNNMSQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKSVEEY 441 >gb|ADP20180.1| mutant gag-pol polyprotein [Pisum sativum] Length = 1004 Score 
= 367 bits (943), Expect = 1e-98 Identities = 226/586 (38%), Positives = 323/586 (55%), Gaps = 39/586 (6%) Frame = -1 Query: 1643 RLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDAN--LECFNCGLRGHYA 1470 R T NSQ + ++ + + ++ ++ + S N ++CF C +GH A Sbjct: 246 RNSTTFNSQSWKDKTKKEGASSSKEATVENKGKTITSSSSSVSTNKSVKCFKCQGQGHIA 305 Query: 1469 WECLKKKNLHIGVEPNDEQETEEGKEVD--FIERIXXXXXXXXXXXXXDTTFLSVVRRIL 1296 +C K+ + + E N+E EE + D F E I L +VRR+L Sbjct: 306 SQCPTKRTMLM--EENEEIVEEEDGDYDKEFGEEIPSGD-------------LLMVRRML 350 Query: 1295 STPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSEATVEKLNLLTEPHPDPYK 1116 + +++ + + LIIDGGS NV S V +L L T+PHP PYK Sbjct: 351 GSQIKEEDTSQRENLFHIRCFVQGKVCSLIIDGGSCTNVASTRLVSRLKLETKPHPKPYK 410 Query: 1115 VAWIDSS-GIPVSKRCLVTFTHGTYTDSIWCDVILMTITHILLG*PWLYDREVKHDGKES 939 + W++ S + V+K+ + F G Y D + CDV+ M +H+LLG PW +DR+ HDG + Sbjct: 411 LQWLNESVEMLVNKQVEICFKIGKYEDVVLCDVVPMEASHLLLGRPWQFDRKANHDGYSN 470 Query: 938 TYSFNFNKKQIVLRPLS-----------SEAMNNKRATKDKRRNQEETTSNSR*EIV*ER 792 YSF ++ ++I L PL+ SE + +R K+K + + E N + E Sbjct: 471 KYSFMYHDQKINLVPLNPSEVREDQRKMSEKYDQERKEKEKEKEKNEKKKNDKRE----- 525 Query: 791 SKGGLIFMAVVKQVKNLLNTNNEDYSLELKQ-------------------LLVDL*DVAP 669 K LI A ++ VK + ++ Y L K+ LL + ++ P Sbjct: 526 KKQSLI--AKIRDVKEAIVSHQPLYLLFCKEVPLLTTISNEKKLPNCIESLLQEFKELFP 583 Query: 668 ED----LPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLDDGLIRESLSP 501 E+ LPP+R I+H ID G+ LPN YR + + +E++RQV EL+ G +RESLSP Sbjct: 584 EEVPSGLPPIRGIEHHIDLNPGASLPNRPAYRSNPQQTQEIQRQVAELISKGWVRESLSP 643 Query: 500 CAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTIYSKIDLTK*Y 321 CAVP +L PKKD +WRMC DCR I+ IT+KYR PIPRLDD LD + + ++SKIDL Y Sbjct: 644 CAVPIILVPKKDGSWRMCTDCRAISNITIKYRHPIPRLDDLLDELFGACLFSKIDLKSGY 703 Query: 320 YRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFIGRFLVVYF*G 141 +++RIR GDEWKTAFKTK G YEW+VMPFGLTNAPSTFMR M VL+ F+G+F+VVYF Sbjct: 704 HQIRIREGDEWKTAFKTKFGLYEWMVMPFGLTNAPSTFMRLMNHVLREFLGKFVVVYFDD 763 Query: 140 HT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 ++ HL+ V++VLR E LY NL+KC F 
V+FLGF Sbjct: 764 ILIYSKNLDDHCIHLKAVLQVLRYENLYANLEKCVFCTDHVIFLGF 809 Score = 67.0 bits (162), Expect = 4e-08 Identities = 37/107 (34%), Positives = 58/107 (54%), Gaps = 1/107 (0%) Frame = -2 Query: 2167 IDVPNFDGKIDPRAFTDWFVTLKRFFDW*DMSDERKVRYTVMKLVGQAQIWWSGKEFDLQ 1988 I VP F GK DP A+ +W L++ F+ + S+ KV+ ++ A +WW D + Sbjct: 74 IKVPTFVGKSDPEAYLEWETKLEQIFNCHNYSNLEKVQVASIEFKEYALVWWDQLTKDRR 133 Query: 1987 LAGNYSV-TWEEMKLELKRKNLLRYYQQELFDELTNLRQRSMTVIEY 1850 + TWEEMK ++R+ + YY +EL ++L L Q S +V EY Sbjct: 134 RYAERPIDTWEEMKRIMRRRFVPSYYHRELHNKLQRLTQGSKSVEEY 180 >gb|AAQ56338.1| putative gag-pol polyprotein [Oryza sativa Japonica Group] Length = 1619 Score = 366 bits (939), Expect = 3e-98 Identities = 217/596 (36%), Positives = 321/596 (53%), Gaps = 33/596 (5%) Frame = -1 Query: 1691 WKARNNQLRDFLLKSGRLQTPENSQPTLEALTRQLQQPTISLIMADRNRSQMTESKRQDA 1512 W+ R L +GR +P ++ T A + + + S + Sbjct: 314 WQTRTTPL------AGRTASPSSTPTTSRAAPPPSSDKSATKAAQPAPSASSMASTGRMR 367 Query: 1511 NLECFNCGLRGHYAWECLKKKNLHIGVEPNDEQETEEGKEVDFIERIXXXXXXXXXXXXX 1332 +++C C GH +C K+ L V+ + E + + D + + Sbjct: 368 DVQCHRCKGFGHVQRDCPSKRVLV--VKNDGEYSSASDFDDDTLALLAADHADNEPPEEH 425 Query: 1331 DTTFLS------VVRRILSTPKQQKKDWRGTTILQTLVCCGNVTRKLIIDGGSSMNVVSE 1170 + +V+R+LS ++ + + T+ QT ++IIDGGS N+ S Sbjct: 426 IGAAFADHYESLIVQRVLSAQMEKAEQNQRHTLFQTKCVVKERCCRMIIDGGSCNNLASS 485 Query: 1169 ATVEKLNLLTEPHPDPYKVAWIDSSG-IPVSKRCLVTFTHGTYTDSIWCDVILMTITHIL 993 VEKL L T+PHP Y + W+++SG V+K + F G Y D + CDV+ M +IL Sbjct: 486 EMVEKLALSTKPHPHSYYIQWLNNSGKAKVTKLVHINFAIGNYHDVVECDVVPMQACNIL 545 Query: 992 LG*PWLYDREVKHDGKESTYSFNFNKKQIVLRPLSSEAM---NNKRATKDKRRNQEETTS 822 LG PW +DR+ H G+ + YSF ++ K+IVL P+S E + + +A K K + ++ S Sbjct: 546 LGRPWQFDRDSMHHGRSNQYSFLYHDKKIVLHPMSPEDILRDDVAKAAKSKCESDKKAQS 605 Query: 821 NSR*-EIV*ERSKGGLIFMAVVKQVKNLLNTNNEDYSLELKQLLVDL*DV---------- 675 + + E + + K +A + L+ + + Y+L K L+ L D+ Sbjct: 606 DGKKPETINLKPK---CLLATKSDINELIASPSVAYALVCKDALISLHDMQHSLPPAVAN 662 Query: 674 
------------APEDLPPMREIQHAIDFVLGSQLPNLLGYRMSLAEHEELKRQVEELLD 531 P LPP+R I+H ID + G+ LPN YR + E +E++RQV ELLD Sbjct: 663 ILQEYSDVFPKEVPPGLPPVRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVHELLD 722 Query: 530 DGLIRESLSPCAVPALLTPKKDETWRMCCDCRTINKITVKYRFPIPRLDDRLDMMTDSTI 351 G +RESLSPCAVP +L PKKD +WRMC DCR IN IT++YR PIPRLDD LD ++ S + Sbjct: 723 KGYVRESLSPCAVPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIV 782 Query: 350 YSKIDLTK*YYRLRIRLGDEWKTAFKTKDGFYEWLVMPFGLTNAPSTFMRFMTQVLQPFI 171 +SK++L Y+++ ++LGDEWKTAFKTK G YEWLVMPFGLTNAPSTFMR M +VL+PFI Sbjct: 783 FSKVELRSGYHQIHMKLGDEWKTAFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRPFI 842 Query: 170 GRFLVVYF*GHT*I*QDGRRTIDHLQQVMRVLRREKLYINLKKCSFMCSSVVFLGF 3 G+F+VVYF + +HL+ V LR +L+ NL+KC+F V FLG+ Sbjct: 843 GKFVVVYFDDILIYSKSMGEHFNHLRAVFNALRDARLFGNLEKCTFCTDRVSFLGY 898