BLASTX nr result
ID: Mentha27_contig00019411
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00019411
         (1980 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                 522   e-145
emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera]   518   e-144
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                 509   e-141
ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prun...   508   e-141
ref|XP_007207232.1| hypothetical protein PRUPE_ppa026856mg [Prun...   505   e-140
ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobrom...   498   e-138
emb|CAN64427.1| hypothetical protein VITISV_029384 [Vitis vinifera]   498   e-138
ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prun...   496   e-137
gb|AAX95495.1| Retrotransposon gag protein, putative [Oryza sati...   478   e-132
gb|AAX96717.1| retrotransposon protein, putative, Ty3-gypsy sub-...   478   e-132
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...   468   e-129
emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera]   456   e-125
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...   436   e-119
ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300...   433   e-118
ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223...   433   e-118
emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera]   423   e-115
ref|XP_004292437.1| PREDICTED: uncharacterized protein LOC101306...   388   e-105
ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobrom...   387   e-104
ref|XP_007198961.1| hypothetical protein PRUPE_ppa020671mg, part...
379 e-102 ref|XP_007019611.1| Uncharacterized protein TCM_035724 [Theobrom... 365 3e-98 >gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] Length = 1518 Score = 522 bits (1345), Expect = e-145 Identities = 282/667 (42%), Positives = 378/667 (56%), Gaps = 18/667 (2%) Frame = +3 Query: 3 GRAQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVD 182 G A W+ LK R++ GK + SW K KK + + F+ ++ ++L+ + NL+Q ++V+ Sbjct: 137 GYASLWYDNLKHQRLKEGKDPLRSWSKLKKKMLAKFVTKDYTQDLFIKLSNLKQKEKTVE 196 Query: 183 DYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQA 362 Y EF + + ++N+ Q ++R++ G+ + + M + + + + EK Sbjct: 197 AYLREFEQLTLQCEINEKSEQRIARFLEGLDKNIAAEVRMQPLWSYDDVVNLSLRVEKMG 256 Query: 363 ARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYS--NPSQGRPGFRG---CFN 527 + A PK T QS +G + NP P R CF Sbjct: 257 KTKPVATRPKPVFRPYSSVKINDPPKT---TPQSTVDKGKAPMNPKINPPLSRDKIKCFQ 313 Query: 528 CGDLSHRQADCPKPPT--------GSRGLFTDDVESEPLPLFDTPIXXXXXXXXXXXXXS 683 C H + DCP T R + E E L L + + Sbjct: 314 CQGFGHFRKDCPSARTLTAIEVAEWEREGLVEYEEDEALVLEE--VESEKETSPDQIVAH 371 Query: 684 GDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVSK 863 D G L L R + S +A R+ +F+S CT+ G+VC II+ GSC NV S VSK Sbjct: 372 PDTGHSLFLWRVMHSQQAPLEADQRSMIFRSRCTVQGRVCNLIINGGSCTNVASTTMVSK 431 Query: 864 LNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVV-PMDACHLLLGR 1040 L L T+ HP PY+L WLS+ + V V K+ +++FSIG Y+D + CDVV PMDACHLLLGR Sbjct: 432 LGLPTQEHPNPYKLRWLSKDSGVRVDKQCIISFSIGKMYKDEVLCDVVVPMDACHLLLGR 491 Query: 1041 PWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXX----LLSRVPFQT 1208 PW+YD H G+ N Y F +GKK+ L LS Sbjct: 492 PWEYDRNTTHQGKDNVYIFKHQGKKVTLTPLPPNQRDYGSPNVPEEMSGVLFLSEAAMIK 551 Query: 1209 AMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDIQHHIDLV 1388 + ++ V +LL++ + + F +VFP+ LPS LPPLR I+HHIDLV Sbjct: 552 EIRQAQPVLMLLSREVNQEENTVVPTAVAPLIQRFQEVFPDELPSGLPPLRGIEHHIDLV 611 Query: 1389 PGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRMCVD 1568 PG+ LPN+P YR P +EL+ Q+EEL+A+G +RESLSPCAVPALL PKKDGTWRMC D Sbjct: 612 
PGSVLPNKPAYRCDPNATKELQHQIEELMAKGFVRESLSPCAVPALLVPKKDGTWRMCTD 671 Query: 1569 SRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKTAFKTREG 1748 SRAIN ITV+YRFPIPRLDD+LD+L GA +FSK+DL+ GYHQ+RIR GDEWKTAFKT+ G Sbjct: 672 SRAINNITVKYRFPIPRLDDMLDELSGASIFSKIDLRQGYHQVRIREGDEWKTAFKTKHG 731 Query: 1749 LYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQHLREVLL 1928 LYEWLVMPFGLSNAPSTFMR+M + LRP +GKF VVYFDDIL+YS H++HL V Sbjct: 732 LYEWLVMPFGLSNAPSTFMRLMTEVLRPCLGKFAVVYFDDILVYSKTKGEHLKHLEVVFK 791 Query: 1929 VLRRDHL 1949 +LR L Sbjct: 792 ILREQKL 798 >emb|CAN79321.1| hypothetical protein VITISV_018984 [Vitis vinifera] Length = 1521 Score = 518 bits (1335), Expect = e-144 Identities = 280/675 (41%), Positives = 399/675 (59%), Gaps = 28/675 (4%) Frame = +3 Query: 3 GRAQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVD 182 G A+ WW ++ R G+P I +WD+ K ++ FLP ++++ +Y + +L+QG++SV+ Sbjct: 135 GAARLWWHNIENQAHRTGQPPIDTWDEMKLKMKEHFLPTDYEQLMYTKLFSLKQGTKSVE 194 Query: 183 DYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQA 362 +Y+ EF+E R V +S QL +RY G+R+++Q + TV + +Q A + E+ Sbjct: 195 EYTEEFHELSIRNQVXESDAQLAARYKAGLRMEIQLEMIAAHTYTVDDVYQLALKIEEGL 254 Query: 363 ARR----------TTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQGRPGF 512 R +T + R + Q + N ++G+ Sbjct: 255 KFRVSRHPSSQIGSTFSNRTTSKPLSTSNFRTSIHVNGGDNTQPTSNVAHQNGNKGKNSM 314 Query: 513 RG----------CFNCGDLSHRQADCPKPPTGSRGLF--TDDVESEPLPLF--DTPIXXX 650 CF CG H CP ++GL ++ ESE + Sbjct: 315 SNGDRKVDATPLCFKCGGHGHYAVVCP-----TKGLHFCVEEPESELESYLKKEETYNED 369 Query: 651 XXXXXXXXXXSGDVGPMLMLRRTLLSPRAL-ETEWLRNNLFQSTCTIGGKVCTFIIDAGS 827 G L++R L P+ E +W R ++FQ+ + G++CT IID GS Sbjct: 370 EVSEECDYYDGMTEGHSLVVRPLLTIPKVKGEEDWRRISIFQTRISCHGRLCTMIIDGGS 429 Query: 828 CENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVV 1007 N+ S+ V KLNL TE HP P+R++W++ T + VS R LV F G +++S++C+V+ Sbjct: 430 SLNIASQELVEKLNLKTERHPNPFRVAWVND-TSIPVSFRCLVTFLFGKDFEESVWCEVL 488 Query: 1008 PMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRG-KKIVLVXXXXXXXXXXXXXXXXXXL 1184 P+ H+LLGRPW +D 
VQHDG NTY+ + G KKI+ + Sbjct: 489 PIKVSHILLGRPWLFDRKVQHDGYENTYALIHNGRKKILRPMKEVPPIKKSNENAQPKKV 548 Query: 1185 LSRVPFQTAMEESGLVFVLLAQPLGD--STSXXXXXXXXXXXXXFADVFPESLPSTLPPL 1358 L+ F+ +E+ ++F L+A+ + + F+D++P LP+ LPP+ Sbjct: 549 LTMCQFENESKETXVIFALMARKVEEFKEQDKEYPANARKILDDFSDLWPVELPNELPPM 608 Query: 1359 RDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPK 1538 RDIQH IDL+PGA+LPN P YRM+P EH EL+RQV+ELL +G IRESLSPC VPALLTPK Sbjct: 609 RDIQHAIDLIPGASLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPCGVPALLTPK 668 Query: 1539 KDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDE 1718 KDG+WRMCVDSRAINKIT++YRFPIPRLDD+LD + G+ +FSK+DL+SGYHQIRIR GDE Sbjct: 669 KDGSWRMCVDSRAINKITIKYRFPIPRLDDMLDMMVGSVIFSKIDLRSGYHQIRIRPGDE 728 Query: 1719 WKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPIL 1898 WKT+FKT++GLYEWLVMPFGL+NAPSTFMR+M Q L+PFIG+FVVVYFDDILIYS Sbjct: 729 WKTSFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCED 788 Query: 1899 HIQHLREVLLVLRRD 1943 H +HL++V+ LR + Sbjct: 789 HEEHLKQVMRTLRAE 803 >gb|ADP20179.1| gag-pol polyprotein [Silene latifolia] Length = 1475 Score = 509 bits (1312), Expect = e-141 Identities = 266/658 (40%), Positives = 380/658 (57%), Gaps = 10/658 (1%) Frame = +3 Query: 3 GRAQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVD 182 G A W++ LK R R GK I SW K KK + F+P + ++++ + L+Q + ++ Sbjct: 135 GYASLWYENLKNQRRRDGKEPIKSWLKLKKKLNEKFIPKEYTQDIFIKLTQLKQDQQPLE 194 Query: 183 DYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQA 362 Y +F + + ++N+ P Q ++R++ G+ ++ + M + EA A + EK Sbjct: 195 SYLRDFEQLTLQCELNEKPEQKIARFVEGLDTKIAHRVRMQQVWSFDEAVNLALRVEKMG 254 Query: 363 ARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQGRPGF--RGCFNCGD 536 + T + + P+ + + +G + + + + C+ C Sbjct: 255 KGKATTTKPTTKPATFRPPTSFKINEPPSQNKTTILDKGKAAETSQKKTMPLKKCYQCQG 314 Query: 537 LSHRQADCPKPPTGSRGLFTDDVE---SEPLPLFDTPIXXXXXXXXXXXXXSGDVGPMLM 707 H +CP R L + +V + + + D + D G L+ Sbjct: 315 YGHFAKECPT----KRALSSFEVVHWGDDEILVCDEEVEGTDHEEDDVVMP--DAGLSLV 368 Query: 708 
LRRTL-LSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVSKLNLTTES 884 R + P+ LE + R +F+S CTI G+VC IID GSC NV S + KL+L T+ Sbjct: 369 TWRVMHTQPQPLEMDQ-RQQIFRSRCTIKGRVCNLIIDGGSCTNVASSTLIEKLSLPTQD 427 Query: 885 HPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVVPMDACHLLLGRPWQYDNTV 1064 HP PY+L WL++G +V V K+ LV FSIG Y D CDV+PMDACHLLLGRPW++D Sbjct: 428 HPSPYKLRWLNKGAEVRVDKQCLVTFSIGKNYSDEALCDVLPMDACHLLLGRPWEFDRDS 487 Query: 1065 QHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXX----LLSRVPFQTAMEESGLV 1232 H GR NTY+F FR +K++L L++ ++ V Sbjct: 488 VHHGRDNTYTFKFRSRKVILTPLPPVLKHTTPPSMLEPSKEVLLINEAEMLQELKGDEDV 547 Query: 1233 FVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDIQHHIDLVPGAALPNR 1412 + L+A+ + + + DVFP LPS LPPLR I+H ID +PGA LPN+ Sbjct: 548 YALIAKDVVFGQNVSLPKEVQELLQSYEDVFPNELPSGLPPLRGIEHQIDFIPGATLPNK 607 Query: 1413 PHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRMCVDSRAINKIT 1592 YR PK +EL++Q+ EL+++G +RESLSPC+VPALL PKKDG+WRMC DSRAIN IT Sbjct: 608 AAYRSDPKATQELQQQIGELVSKGFVRESLSPCSVPALLVPKKDGSWRMCTDSRAINNIT 667 Query: 1593 VRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKTAFKTREGLYEWLVMP 1772 ++YRFPIPRLDD+LD+L GA +FSK+DL+ GYHQ+RI+ GDEWKTAFKT+ GLYEWLVMP Sbjct: 668 IKYRFPIPRLDDILDELSGAQLFSKIDLRQGYHQVRIKEGDEWKTAFKTKHGLYEWLVMP 727 Query: 1773 FGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQHLREVLLVLRRDH 1946 FGLSNAPSTFMR+M + LRP++G+FVVVYFDDIL+YS H++HL +VL R+H Sbjct: 728 FGLSNAPSTFMRLMTEVLRPYLGRFVVVYFDDILVYSPSKEEHLKHL-QVLFETLREH 784 >ref|XP_007210190.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica] gi|462405925|gb|EMJ11389.1| hypothetical protein PRUPE_ppa017790mg [Prunus persica] Length = 1485 Score = 508 bits (1307), Expect = e-141 Identities = 288/675 (42%), Positives = 380/675 (56%), Gaps = 28/675 (4%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVDDY 188 A WW QL+ R R GK ++ +W K K + FLP ++++ LY+ + QG+ SV +Y Sbjct: 153 AAVWWDQLQNLRQRQGKQRVRTWRKMKSLMMEQFLPTDYEQILYRMYLGCAQGTHSVSEY 212 Query: 189 SNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQAAR 368 + EF R + 
++ Q V+RY G+++ +Q+ + M + T+ EA A +AE Sbjct: 213 TEEFMRLAERNHLTETDNQKVARYNNGLKISIQEKIGMQNIWTLQEAINMALKAELLEKE 272 Query: 369 RTTANLR---LXXXXXXXXXXXXIVPKVPAPTQQSA------------------------ 467 + N R K A Q S Sbjct: 273 KRQPNFRRNTTEASDYTAGASSGAGDKGKAQQQSSGGMTKPTTVGQNKNFNEGSSRNYNR 332 Query: 468 -PPRGYSNPSQGRPGFRGCFNCGDLSHRQADCPKPPTGSRGLFTDDVESEPLPLFDTPIX 644 PR S +P C+ C HR CP+ + F ++ + + + + Sbjct: 333 GQPRNQSQNLYAKPMTDICYRCQKPGHRSNVCPELKQAN---FIEEADEDEE---NDEVG 386 Query: 645 XXXXXXXXXXXXSGDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAG 824 G L+L+R LL+PR E R+++F+S C+I KVC I+D G Sbjct: 387 ENDYAGAEFAVEEGMEKITLVLQRVLLAPRE---EGQRHSIFRSLCSIKNKVCDVIVDNG 443 Query: 825 SCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDV 1004 SCEN +S+ V L L+TE H PY L W+ +G V V++ V SIG Y+D + CDV Sbjct: 444 SCENFVSKKLVEYLQLSTEPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDEVLCDV 503 Query: 1005 VPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXL 1184 + MDACH+LLGRPWQ+D GR N F + +KI + Sbjct: 504 IDMDACHILLGRPWQFDVDATFKGRDNVILFSWNNRKIAMTTTQPSKPSVEVKTRSSS-F 562 Query: 1185 LSRVPFQTAMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRD 1364 L+ + + + E+ V A+ GD F ++F E+LP+ LPP+RD Sbjct: 563 LTLISNEQELNEA----VKEAEGEGDIPQDVQQILSQ-----FQELFSENLPNELPPMRD 613 Query: 1365 IQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKD 1544 IQH IDLVPGA+L N PHYRMSPKE++ LR Q+EELL +G IRESLSPCAVP LL PKKD Sbjct: 614 IQHRIDLVPGASLQNLPHYRMSPKENDILREQIEELLRKGFIRESLSPCAVPVLLVPKKD 673 Query: 1545 GTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWK 1724 TWRMCVDSRAINKITV+YRFPIPRL+D+LD L G+ VFSK+DL+SGYHQIRIR GDEWK Sbjct: 674 KTWRMCVDSRAINKITVKYRFPIPRLEDMLDVLSGSKVFSKIDLRSGYHQIRIRPGDEWK 733 Query: 1725 TAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHI 1904 TAFK+++GL+EWLVMPFGLSN PSTFMR+MNQ LRPFIG FVVVYFDDILIYST H+ Sbjct: 734 TAFKSKDGLFEWLVMPFGLSNTPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHL 793 Query: 1905 QHLREVLLVLRRDHL 1949 HLR+VL VLR + L Sbjct: 794 VHLRQVLDVLRENKL 808 >ref|XP_007207232.1| hypothetical 
protein PRUPE_ppa026856mg [Prunus persica] gi|462402874|gb|EMJ08431.1| hypothetical protein PRUPE_ppa026856mg [Prunus persica] Length = 1493 Score = 505 bits (1300), Expect = e-140 Identities = 286/675 (42%), Positives = 374/675 (55%), Gaps = 28/675 (4%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVDDY 188 A WW QL+ R R GK ++ +W K K + FLP ++++ LY+ + QG+RSV +Y Sbjct: 164 AAVWWDQLQNLRQRQGKQRVRTWRKMKSLMMERFLPTDYEQILYRMYLGCAQGTRSVSEY 223 Query: 189 SNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQAAR 368 + EF R + ++ Q V+RY G++ +Q+ + M + T+ EA A +AE Sbjct: 224 TEEFMRLAERNHLTETDNQKVARYNNGLKSSIQEKIGMQNIWTLQEAINMALKAELLEKE 283 Query: 369 RTTANLR---LXXXXXXXXXXXXIVPKVPAPTQQSA------------------------ 467 + N R K A Q S Sbjct: 284 KRQPNFRRNKTEASDYTAGASSGAGDKEKAQQQNSGGMTKPATVGQNKNFNEGSSRNYNR 343 Query: 468 -PPRGYSNPSQGRPGFRGCFNCGDLSHRQADCPKPPTGSRGLFTDDVESEPLPLFDTPIX 644 PR S +P C+ C HR CP+ + D+ E + + Sbjct: 344 GQPRNQSQNPYAKPMTDICYRCQKPGHRSNVCPERKQANFIEEADEDEEKD------EVG 397 Query: 645 XXXXXXXXXXXXSGDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAG 824 G L+L+R LL+P+ E R+N+F+S C+I KVC I+D G Sbjct: 398 ENDYAGAEFAVEEGIEKITLVLQRVLLAPKE---EGQRHNIFRSLCSIKNKVCDVIVDNG 454 Query: 825 SCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDV 1004 SCEN +S+ V L L+TE H PY L W+ +G V V++ V SIG Y+D + CDV Sbjct: 455 SCENFVSKKLVEYLQLSTEPHVSPYSLGWVKKGPSVRVAETCRVPLSIGKHYRDDVLCDV 514 Query: 1005 VPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXL 1184 + MDACH+LLGRPWQ+D GR N F + +KI + + Sbjct: 515 IDMDACHILLGRPWQFDVDATFKGRDNVILFSWNNRKIAMATTQPSRKQELRSSSFLTLI 574 Query: 1185 LSRVPFQTAMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRD 1364 + A++E A+ GD F ++ E+LP+ LPP+RD Sbjct: 575 SNEQELNEAVKE--------AEGEGDIPQDVQQILSQ-----FQELLSENLPNELPPMRD 621 Query: 1365 IQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKD 1544 IQH IDLV GA+LPN PHYRMSPKE++ LR Q+EELL +G IRESLSPCAVP LL PKKD Sbjct: 622 IQHRIDLVHGASLPNLPHYRMSPKENDILREQIEELLRKGFIRESLSPCAVPVLLVPKKD 681 
Query: 1545 GTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWK 1724 TWRMCVDSRA+NKI V+YRF IPRL+D+LD L G+ VFSK+DL+SGYHQIRIR GDEWK Sbjct: 682 KTWRMCVDSRAVNKIKVKYRFSIPRLEDILDVLSGSKVFSKIDLRSGYHQIRIRPGDEWK 741 Query: 1725 TAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHI 1904 TAFK+++GL+EWLVMPFGLSNAPSTFMR+MNQ LRPFIG FVVVYFDDILIYST H+ Sbjct: 742 TAFKSKDGLFEWLVMPFGLSNAPSTFMRLMNQVLRPFIGSFVVVYFDDILIYSTTKEEHL 801 Query: 1905 QHLREVLLVLRRDHL 1949 HLR+VL VLR + L Sbjct: 802 VHLRQVLDVLRENKL 816 >ref|XP_007010495.1| Uncharacterized protein TCM_044370 [Theobroma cacao] gi|508727408|gb|EOY19305.1| Uncharacterized protein TCM_044370 [Theobroma cacao] Length = 1306 Score = 498 bits (1283), Expect = e-138 Identities = 279/654 (42%), Positives = 375/654 (57%), Gaps = 5/654 (0%) Frame = +3 Query: 3 GRAQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVD 182 G A W ++++ R R K KI++W+ K +R FLP ++ ELY++F L+Q + +V+ Sbjct: 84 GTALQWLKRVEEQRARQSKLKISTWEHMKSKLRKQFLPADYTMELYEKFHCLKQNNMTVE 143 Query: 183 DYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQA 362 +Y +EF RV + +S Q+ SRY+ G+ ++D + + + +A Q A AEK+ Sbjct: 144 EYISEFNNLSIRVGLAESNEQITSRYLAGLNHFIRDEMGVVRLYNIEDARQYALSAEKRI 203 Query: 363 ARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQGRPGFRGCFNCGDLS 542 R P Q A +N R CF CG+ Sbjct: 204 LRYGARKPLYGTHWQNNSEARRGYP-TSQQNYQGAATINKTNRGGSNSHIR-CFTCGENG 261 Query: 543 HRQADCPKPPTGSRGLFTDDVESEPLPLFDTPIXXXXXXXXXXXXXSGDVGPMLMLRRTL 722 H P+ R + ++ E P++D G L++RR + Sbjct: 262 HTSFAGPQ-----RRVNLAELREELEPVYDEYEEIEEIDVYPAQ------GESLVVRRVM 310 Query: 723 LSPRALETE-WLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVSKLNLTTESHPKPY 899 + E E W R ++F++ GKVC +ID GS EN+IS+ AV+KL L T HP PY Sbjct: 311 TTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPY 370 Query: 900 RLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVVPMDACHLLLGRPWQYDNTVQHDGR 1079 ++ WL +G +V V+ + LV F++G D CDVVPMD H+L+GRPW YD+ + H Sbjct: 371 KIGWLKKGHEVPVTTQYLVKFTMGDNLDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTE 430 Query: 1080 
CNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXX-LLSRVPFQTAMEESGLVFVLLAQPL 1256 NTYSF K+ LS F+ E G+++ L+ + L Sbjct: 431 PNTYSFYNDNKRYTSYPLKEETKKSANSKINKITGYLSVENFEAEGSEMGIMYALVTKHL 490 Query: 1257 GDST---SXXXXXXXXXXXXXFADVFPESLPSTLPPLRDIQHHIDLVPGAALPNRPHYRM 1427 S F ++F E LP +LPPLR IQH IDLVPGAALPN P YRM Sbjct: 491 KSDQMGKSPQYPTEIQQLLKEFGELFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRM 550 Query: 1428 SPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRMCVDSRAINKITVRYRF 1607 P + E++RQVEELL +G +RES SPCA PALL PKKDG+WRMCVDSRAINKIT++YRF Sbjct: 551 PPMQRVEVQRQVEELLEKGLVRESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRF 610 Query: 1608 PIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKTAFKTREGLYEWLVMPFGLSN 1787 PIPRLD++LDQL G+ VFSK+DLKS YHQIR+R GDEWKTAFKT +GL+EWLVMPFGLSN Sbjct: 611 PIPRLDEMLDQLVGSRVFSKIDLKSEYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSN 670 Query: 1788 APSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQHLREVLLVLRRDHL 1949 APSTFMRVM + L+PF+ FVVVYFDDILIYS H++HLR+VL VL+++ L Sbjct: 671 APSTFMRVMAEVLKPFLNSFVVVYFDDILIYSHTKEKHLKHLRQVLEVLQKEQL 724 >emb|CAN64427.1| hypothetical protein VITISV_029384 [Vitis vinifera] Length = 1392 Score = 498 bits (1281), Expect = e-138 Identities = 279/664 (42%), Positives = 387/664 (58%), Gaps = 25/664 (3%) Frame = +3 Query: 3 GRAQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVD 182 G A+ WW ++ R G+P I +WD+ K ++ FLP ++++ +Y + +L+QG++SV+ Sbjct: 145 GAARLWWHNIENQAHRTGQPPIDTWDEMKLKMKEHFLPTDYEQLMYTKLFSLKQGTKSVE 204 Query: 183 DYSNEFYEFLA-RVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEK- 356 +Y+ EF+E L+ R V +S QL +RY G+R+++Q + TV + +Q A + E+ Sbjct: 205 EYTEEFHELLSIRNQVRESDAQLAARYKAGLRMEIQLEMIAAHTYTVDDVYQLALKIEEG 264 Query: 357 ---QAARR-------TTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQGRP 506 + +RR T +N TQQ++ Y N ++G+ Sbjct: 265 LKFRVSRRPSSQIGSTFSNRTASKPLSTSNFRTPNHVNGGGNTQQTSNV-AYKNGNKGKN 323 Query: 507 GFRG----------CFNCGDLSHRQADCPKPPTGSRGLFTDDVESE--PLPLFDTPIXXX 650 CF CG H CP T S ++ ESE P + Sbjct: 324 SMSNGDRKVDVTPLCFKCGGHGHYAVVCP---TKSLHFCVEEPESELESYPKEEETYNED 380 Query: 651 
XXXXXXXXXXSGDVGPMLMLRRTLLSPRAL-ETEWLRNNLFQSTCTIGGKVCTFIIDAGS 827 G L++R L P+ E +W R ++FQ+ + G++CT IID GS Sbjct: 381 EVSEECDYYDGMTEGXSLVVRPLLTVPKVKGEEDWRRTSIFQTRISCQGRLCTMIIDGGS 440 Query: 828 CENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVV 1007 N+ S+ V KLNL TE HP P+R++W++ T + VS R LV F G +++S++C+V+ Sbjct: 441 SLNIASQELVEKLNLKTERHPNPFRVAWVND-TSIPVSFRCLVTFLFGKDFEESVWCEVL 499 Query: 1008 PMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXLL 1187 P+ H+LLGRPW +D VQHDG NTY+ + G+K +L + Sbjct: 500 PIKVSHILLGRPWLFDRKVQHDGYENTYALIHNGRKKILRP------------------M 541 Query: 1188 SRVPFQTAMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDI 1367 VP +E+ AQP + LP+ LPP+RDI Sbjct: 542 KEVPPIKKSDEN-------AQP------------------------KKELPNELPPMRDI 570 Query: 1368 QHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDG 1547 QH IDL+PGA+LPN P YRM+P EH EL+RQV+ELL +G IRESLSPC VPALLTPKKDG Sbjct: 571 QHAIDLIPGASLPNLPAYRMNPTEHAELKRQVDELLTKGFIRESLSPCGVPALLTPKKDG 630 Query: 1548 TWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKT 1727 +WRMCVDSRAINKIT++YRFPIPRLDD+LD + G+ +FSK+DL+SGYHQIRIR GDEWKT Sbjct: 631 SWRMCVDSRAINKITIKYRFPIPRLDDMLDMMVGSVIFSKIDLRSGYHQIRIRPGDEWKT 690 Query: 1728 AFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQ 1907 +FKT++GLYEWLVMPFGL+NAPSTFMR+M Q L+PFIG+FVVVYFDDILIYS H + Sbjct: 691 SFKTKDGLYEWLVMPFGLTNAPSTFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCEDHEE 750 Query: 1908 HLRE 1919 HL++ Sbjct: 751 HLKQ 754 >ref|XP_007220740.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica] gi|462417202|gb|EMJ21939.1| hypothetical protein PRUPE_ppa023598mg [Prunus persica] Length = 1457 Score = 496 bits (1278), Expect = e-137 Identities = 290/682 (42%), Positives = 385/682 (56%), Gaps = 35/682 (5%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVDDY 188 A WW QL+ SR R GK ++ +W K K + FLP ++++ LY+ + QG+RSV +Y Sbjct: 129 AAVWWDQLQNSRQRQGKQRVRTWRKMKSLMMERFLPTDYEQILYRMYLGCTQGNRSVSEY 188 Query: 189 
SNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAE----- 353 + EF R + ++ Q V+RY G+++ +Q+ + M + T+ EA A +AE Sbjct: 189 TEEFMHLAERNHLTETDNQKVARYNNGLKISIQEKIGMQNIWTLQEAINMAMKAELLEKE 248 Query: 354 ---KQAARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPP---------------RG 479 R TT V + P T + A RG Sbjct: 249 KRQPNFRRNTTEASEYATGASSGSGDKGKVQQQPRGTTKPATTVQNKNFNESSSRTFNRG 308 Query: 480 YS-NPSQG---RPGFRGCFNCGDLSHRQADCPKPPTGSRGLFTDDVESEPLPLFDTPIXX 647 S N SQ +P C+ C HR CP+ ++ F ++V+ + + Sbjct: 309 QSRNQSQNPYAKPRTDICYRCQKPGHRSNVCPE---WTQANFIEEVDEDEEK---DEVGE 362 Query: 648 XXXXXXXXXXXSGDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAGS 827 +L+L+R LL+P+ E R+++ +S C+I KVC I+D GS Sbjct: 363 DDYAGAEFAIEERMERIILVLQRVLLAPKE---EGQRHSICRSLCSIKNKVCDVIVDNGS 419 Query: 828 CENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVV 1007 CEN +S+ V L L+TE H +PY L W+ +G V V++ V SIG Y D + CDV+ Sbjct: 420 CENFVSKKLVEHLQLSTEPHVRPYSLGWVKKGPSVRVAETYSVPLSIGKHYIDDVLCDVI 479 Query: 1008 PMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXLL 1187 MDACH+LLG+ WQ+D + GR N F + +KI + L Sbjct: 480 DMDACHILLGQLWQFDVDATYKGRDNVILFSWNNRKIAMATTKPSKQSVEPKTRSSSFLT 539 Query: 1188 ---SRVPFQTAMEESGLVFVLLAQPL-----GDSTSXXXXXXXXXXXXXFADVFPESLPS 1343 S ++E+ L+ + L G+S F ++ E LP+ Sbjct: 540 LISSEQELNKVVKEAEYFCPLVLKGLLKLGRGESD---IPQDVQKILSQFQELLSEKLPN 596 Query: 1344 TLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPA 1523 LP +RDIQH IDLVPGA LPN PHYRMSPKE++ LR Q+EELL +G IRESLSPCAVP Sbjct: 597 ELPSMRDIQHRIDLVPGANLPNLPHYRMSPKENDILREQIEELLQKGFIRESLSPCAVPV 656 Query: 1524 LLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRI 1703 LL PKKD TWRMCVDSRAINKITV+ RFPIPRL+D+LD L G+ VFSK+DL+SGYHQIRI Sbjct: 657 LLVPKKDKTWRMCVDSRAINKITVKSRFPIPRLEDMLDVLSGSRVFSKIDLRSGYHQIRI 716 Query: 1704 RTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYS 1883 R GDEWKTAFK+++GL+EWLVMPFGLSNAPSTFMR+MNQ LRPFIG FVVVYFDDILIYS Sbjct: 717 RPGDEWKTAFKSKDGLFEWLVMPFGLSNAPSTFMRLMNQVLRPFIGSFVVVYFDDILIYS 776 Query: 1884 TDPILHIQHLREVLLVLRRDHL 1949 
T H+ HLR+VL VLR + L Sbjct: 777 TTKEEHLVHLRQVLDVLRENKL 798 >gb|AAX95495.1| Retrotransposon gag protein, putative [Oryza sativa Japonica Group] Length = 1739 Score = 478 bits (1229), Expect = e-132 Identities = 274/682 (40%), Positives = 370/682 (54%), Gaps = 39/682 (5%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITS----WDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRS 176 A WW + HGK + WD K+ +R+ F+P + R+L R Q LRQG++S Sbjct: 560 ASVWW-------IEHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKS 612 Query: 177 VDDYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEK 356 V++Y E L R ++ ++ ++R++GG+ ++ D+++ D + A +AE+ Sbjct: 613 VEEYYQELQMGLLRCNLEETEDTAMARFLGGLNREIYDIVDYKDYTNMTRLFHLACKAER 672 Query: 357 QA-ARRTTANLRLXXXXXXXXXXXXIVP--KVPAP-----TQQSAPP------------- 473 + RR +A P + +P T ++APP Sbjct: 673 EVQGRRASAKANFSAGKTSSWQTRTTPPAGRTASPSSTPTTSRAAPPPSSDKSVTKAAQP 732 Query: 474 --RGYSNPSQGRPGFRGCFNCGDLSHRQADCPKPPT---GSRGLFTD--DVESEPLPLFD 632 S S GR C C H Q DCP + G ++ D + + L L Sbjct: 733 APSASSMVSTGRMRDVQCHRCKGFGHVQRDCPSKRVLVVKNDGEYSSASDFDDDTLALLA 792 Query: 633 TPIXXXXXXXXXXXXXSGDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFI 812 D L+++R L + + R+ LFQ+ C + + C I Sbjct: 793 ADHADNEPPEEHIGAAFADHYESLIVQRVLSAQMEKAEQNQRHTLFQTKCVLKERCCRMI 852 Query: 813 IDAGSCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSI 992 ID GSC N+ S V KL L+T+ HP PY + WL+ V V+K V +NF+IG Y D + Sbjct: 853 IDGGSCNNLASSEMVEKLALSTKPHPHPYYIQWLNNSGKVKVTKLVHINFAIG-NYHDVV 911 Query: 993 YCDVVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXX 1172 CDVVPM AC++LLGRPWQ+D H GR N YSF++ KK + Sbjct: 912 ECDVVPMQACNILLGRPWQFDRDSMHHGRSNQYSFLYHDKK---IVLHPMSSEDILRDDV 968 Query: 1173 XXXLLSRVPFQTAMEESG-------LVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPE 1331 S+ + G L L D T ++DVFP+ Sbjct: 969 AKAAKSKCESDKKAQSDGKKPETINLKPKCLLATKSDITELIASPSVAYALE-YSDVFPK 1027 Query: 1332 SLPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPC 1511 +P LPP+R I+H IDL+PGA+LPNR YR +P+E +E++RQV ELL +G++RESLSPC Sbjct: 1028 
EVPPGLPPVRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPC 1087 Query: 1512 AVPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYH 1691 AVP +L PKKDG+WRMCVD RAIN IT+RYR PIPRLDD+LD+L G+ VFSK+DL+SGYH Sbjct: 1088 AVPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYH 1147 Query: 1692 QIRIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDI 1871 QIR++ GDEWKT FKT+ GLYEWLVMPFGL+NAPSTFMR+MN+ LRPFIGKFVVVYFDDI Sbjct: 1148 QIRMKLGDEWKTTFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDI 1207 Query: 1872 LIYSTDPILHIQHLREVLLVLR 1937 LIYS H HLR V LR Sbjct: 1208 LIYSKSMGEHFNHLRAVFNALR 1229 >gb|AAX96717.1| retrotransposon protein, putative, Ty3-gypsy sub-class [Oryza sativa Japonica Group] gi|108864301|gb|ABA93040.2| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1748 Score = 478 bits (1229), Expect = e-132 Identities = 274/682 (40%), Positives = 370/682 (54%), Gaps = 39/682 (5%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITS----WDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRS 176 A WW + HGK + WD K+ +R+ F+P + R+L R Q LRQG++S Sbjct: 569 ASVWW-------IEHGKKNPNNMPQTWDALKRVMRARFVPSYYARDLLNRLQQLRQGAKS 621 Query: 177 VDDYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEK 356 V++Y E L R ++ ++ ++R++GG+ ++ D+++ D + A +AE+ Sbjct: 622 VEEYYQELQMGLLRCNLEETEDTAMARFLGGLNREIYDIVDYKDYTNMTRLFHLACKAER 681 Query: 357 QA-ARRTTANLRLXXXXXXXXXXXXIVP--KVPAP-----TQQSAPP------------- 473 + RR +A P + +P T ++APP Sbjct: 682 EVQGRRASAKANFSAGKTSSWQTRTTPPAGRTASPSSTPTTSRAAPPPSSDKSVTKAAQP 741 Query: 474 --RGYSNPSQGRPGFRGCFNCGDLSHRQADCPKPPT---GSRGLFTD--DVESEPLPLFD 632 S S GR C C H Q DCP + G ++ D + + L L Sbjct: 742 APSASSMVSTGRMRDVQCHRCKGFGHVQRDCPSKRVLVVKNDGEYSSASDFDDDTLALLA 801 Query: 633 TPIXXXXXXXXXXXXXSGDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFI 812 D L+++R L + + R+ LFQ+ C + + C I Sbjct: 802 ADHADNEPPEEHIGAAFADHYESLIVQRVLSAQMEKAEQNQRHTLFQTKCVLKERCCRMI 861 Query: 813 IDAGSCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSI 992 ID GSC N+ S V KL L+T+ HP PY + WL+ V 
V+K V +NF+IG Y D + Sbjct: 862 IDGGSCNNLASSEMVEKLALSTKPHPHPYYIQWLNNSGKVKVTKLVHINFAIG-NYHDVV 920 Query: 993 YCDVVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXX 1172 CDVVPM AC++LLGRPWQ+D H GR N YSF++ KK + Sbjct: 921 ECDVVPMQACNILLGRPWQFDRDSMHHGRSNQYSFLYHDKK---IVLHPMSSEDILRDDV 977 Query: 1173 XXXLLSRVPFQTAMEESG-------LVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPE 1331 S+ + G L L D T ++DVFP+ Sbjct: 978 AKAAKSKCESDKKAQSDGKKPETINLKPKCLLATKSDITELIASPSVAYALE-YSDVFPK 1036 Query: 1332 SLPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPC 1511 +P LPP+R I+H IDL+PGA+LPNR YR +P+E +E++RQV ELL +G++RESLSPC Sbjct: 1037 EVPPGLPPVRGIEHQIDLIPGASLPNRAPYRTNPEETKEIQRQVHELLDKGYVRESLSPC 1096 Query: 1512 AVPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYH 1691 AVP +L PKKDG+WRMCVD RAIN IT+RYR PIPRLDD+LD+L G+ VFSK+DL+SGYH Sbjct: 1097 AVPVILVPKKDGSWRMCVDCRAINNITIRYRHPIPRLDDMLDELSGSIVFSKVDLRSGYH 1156 Query: 1692 QIRIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDI 1871 QIR++ GDEWKT FKT+ GLYEWLVMPFGL+NAPSTFMR+MN+ LRPFIGKFVVVYFDDI Sbjct: 1157 QIRMKLGDEWKTTFKTKFGLYEWLVMPFGLTNAPSTFMRLMNEVLRPFIGKFVVVYFDDI 1216 Query: 1872 LIYSTDPILHIQHLREVLLVLR 1937 LIYS H HLR V LR Sbjct: 1217 LIYSKSMGEHFNHLRAVFNALR 1238 >ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508703673|gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 468 bits (1205), Expect = e-129 Identities = 267/630 (42%), Positives = 353/630 (56%), Gaps = 32/630 (5%) Frame = +3 Query: 156 LRQGSRSVDDYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQ 335 + Q + +V++Y++EF RV + +S Q+ SRY+ G+ ++D + + + +A Q Sbjct: 104 IEQNNMTVEEYTSEFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVVRLYNIEDARQ 163 Query: 336 RASQAEKQAARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAP--------------- 470 A AEK+ R P Q +A Sbjct: 164 YALSAEKRVLRYGARKPLYGTHWQNNSEARRGYPTSQQNYQGAATINKTNRGATNVEKND 223 Query: 471 ------PRGYSNPSQGRPGFRG------CFNCGDLSHRQADCPKPPTGSRGLFTDDVESE 614 P G N S RG CF CG+ H CP+ R + ++ E Sbjct: 
224 KGKSIMPYGGQNSSGSSTNKRGSNSHIRCFTCGEKGHTSFACPQ-----RKVNLAELGEE 278 Query: 615 PLPLFDTPIXXXXXXXXXXXXXSGDVGPMLMLRRTLLSPRALETE-WLRNNLFQSTCTIG 791 P++D G L++RR + + E E W R ++F++ Sbjct: 279 LEPVYDEYKEEVEEIDVYPAQ-----GESLVVRRIMTTTVNEEAEDWKRRSIFRTRVVCE 333 Query: 792 GKVCTFIIDAGSCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIG 971 GKVC +ID GS EN+IS+ AV+KL L T HP PY++ WL +G +V V+ + LV F++G Sbjct: 334 GKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMG 393 Query: 972 PTYQDSIYCDVVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXX 1151 D CDVVPMD H+L+GRPW YD+ + H + NTYSF K+ L Sbjct: 394 DNSDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTLYPLREETKK 453 Query: 1152 XXXXXXXXXX-LLSRVPFQTAMEESGLVFVLLAQPLGD---STSXXXXXXXXXXXXXFAD 1319 LS F+ E G+++ L+ + L S S F + Sbjct: 454 SANHKISKITRYLSAENFEAEGSEMGIMYALVTKHLKSDQMSKSPQYPTEIQQLLKEFGE 513 Query: 1320 VFPESLPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRES 1499 +F E LP +LPPLR IQH IDLVPGAALPN P YRM P + E++RQVEEL +G +RES Sbjct: 514 LFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRAEVQRQVEELFEKGLVRES 573 Query: 1500 LSPCAVPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLK 1679 SPCA PALL PKKDG+WRMCVDSRAINKIT++YRFPIPRLD++LDQL G+ VFSK+DLK Sbjct: 574 KSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRLDEMLDQLVGSRVFSKIDLK 633 Query: 1680 SGYHQIRIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVY 1859 SGYHQIR+R GDEWKTAFKT +GL+EWLVMPFGLSNAPSTFMRVM + L+PF+ FVVVY Sbjct: 634 SGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAPSTFMRVMAEVLKPFLNSFVVVY 693 Query: 1860 FDDILIYSTDPILHIQHLREVLLVLRRDHL 1949 FDDILIYS H++HLR+VL VL+++ L Sbjct: 694 FDDILIYSHTKEKHLKHLRQVLEVLQKEQL 723 >emb|CAN69233.1| hypothetical protein VITISV_003380 [Vitis vinifera] Length = 1292 Score = 456 bits (1173), Expect = e-125 Identities = 252/598 (42%), Positives = 355/598 (59%), Gaps = 6/598 (1%) Frame = +3 Query: 168 SRSVDDYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQ 347 ++SV++Y+ EF+E R V +S QL +RY G R+++Q + + TV + +Q A + Sbjct: 149 
TKSVEEYTEEFHELSIRNQVRESDAQLAARYKVGFRMEIQLEMIVAHTYTVDDVYQLALK 208 Query: 348 AEKQAARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQGRPGFRGCFN 527 E+ R V K P+ S +SN + +P F Sbjct: 209 IEEGLKFR--------------------VSKRPSSQIGST----FSNRTTSKPLSISNFR 244 Query: 528 CGDLSHRQADCPKPPTGS--RGLFTDDVESEPLPLFDTPIXXXXXXXXXXXXXSGDVGPM 701 + + + + + G + E E P + G Sbjct: 245 TSNHVNGGGNTQQTSNVAYKNGNKEPESELESYPKEEETYNEDEVSEECDYYDGMTEGHS 304 Query: 702 LMLRRTLLSPRAL-ETEWLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVSKLNLTT 878 L++R L P+ E +W R ++FQ+ + G++CT IID GS N+ S+ V KLNL T Sbjct: 305 LVVRPLLTVPKVKREEDWRRTSIFQTRISCQGRLCTMIIDGGSSLNIASQELVEKLNLKT 364 Query: 879 ESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVVPMDACHLLLGRPWQYDN 1058 E HP P+R++W++ T + VS R LV F G +++S++C+V+P+ H+LLGRPW +D Sbjct: 365 ERHPNPFRVAWVND-TSIPVSFRCLVTFLFGKDFEESVWCEVLPIKVSHILLGRPWLFDR 423 Query: 1059 TVQHDGRCNTYSFMFRGKKIVL-VXXXXXXXXXXXXXXXXXXLLSRVPFQTAMEESGLVF 1235 VQHDG NTY+ + G K +L +LS F+ +E+ ++F Sbjct: 424 XVQHDGYENTYALIHNGCKTILRPMKEVSPIKKSDENAQPKKVLSMCQFENESKETKVIF 483 Query: 1236 VLLAQPLGDSTSXXXXXXXXXXXXX--FADVFPESLPSTLPPLRDIQHHIDLVPGAALPN 1409 L+A+ + +S F+D +P LP+ LPP+RD+QH IDL+PGA+LPN Sbjct: 484 ALMARKVEESKEQDKEYPANVRKILDDFSDFWPTELPNQLPPMRDVQHAIDLIPGASLPN 543 Query: 1410 RPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRMCVDSRAINKI 1589 P YRM+P EH EL+RQV+ELL +G IRESLSP VPALLTPKKDG+WRMCVDSRA+NKI Sbjct: 544 LPAYRMNPTEHAELKRQVDELLTKGFIRESLSPYGVPALLTPKKDGSWRMCVDSRAMNKI 603 Query: 1590 TVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKTAFKTREGLYEWLVM 1769 T++YRFPIPRLDD+LD + + +FSK+DL+SGYHQIRIR GDEWKT+FKT++GLYEWLVM Sbjct: 604 TIKYRFPIPRLDDMLDMMVRSVIFSKIDLRSGYHQIRIRPGDEWKTSFKTKDGLYEWLVM 663 Query: 1770 PFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQHLREVLLVLRRD 1943 FGL+NAPSTFMR+M Q L+PFIG+FVVVYFDDILIYS H +HL++V+ L+ + Sbjct: 664 LFGLTNAPSTFMRIMTQVLKPFIGRFVVVYFDDILIYSRSCEDHEEHLKQVMCTLKAE 721 >ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao] gi|508704828|gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao] Length 
= 794 Score = 436 bits (1120), Expect = e-119 Identities = 233/552 (42%), Positives = 319/552 (57%), Gaps = 2/552 (0%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVDDY 188 A WW+ LK R R G+ KI +WDK ++ ++ FLP ++ +E++ +F NLRQ + +V++Y Sbjct: 127 ASIWWENLKRQREREGRNKIRTWDKMRRELKRKFLPEHYRQEIFIKFHNLRQKTMTVEEY 186 Query: 189 SNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQAAR 368 + EF + + DV++ Q V+RY+GG+ + + DV+ + + + + A + EKQ R Sbjct: 187 TMEFEQLHMKCDVHEPEEQTVARYLGGLNVGIADVVQLQPYWNLNDVIRLALKVEKQQLR 246 Query: 369 RTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQGRPGF-RGCFNCGDLSH 545 +++ + P S S P + CF C H Sbjct: 247 KSSMSSSRQKDSTSNRGRQSSATIPPPKVNSSKTINHKETTSTRAPNVNKKCFKCQGFGH 306 Query: 546 RQADCPKPPTGSRGLFTDDVESEPLPLFDTPIXXXXXXXXXXXXXSGDVGPMLMLRRTLL 725 +DCP S L ++V EP L + S D G L++RR L Sbjct: 307 IASDCPNRRIIS--LIEEEVMEEP-SLEEVDDELEIFNNEEIEEVSADHGEALVVRRNLN 363 Query: 726 SPRALETE-WLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVSKLNLTTESHPKPYR 902 + E E WLR+N+F + CT GKVC IID+GSCENVI+ V KL L TE HP PY+ Sbjct: 364 TAMLTEDESWLRHNIFHTRCTSQGKVCNVIIDSGSCENVIANYMVKKLKLQTEVHPHPYK 423 Query: 903 LSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVVPMDACHLLLGRPWQYDNTVQHDGRC 1082 L WL +G +V V+KR V FSIG Y+D ++CDV+PMDACHLLLGRPWQYD HDG Sbjct: 424 LQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDVIPMDACHLLLGRPWQYDRRAHHDGYK 483 Query: 1083 NTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXLLSRVPFQTAMEESGLVFVLLAQPLGD 1262 NTYSF+ G KI+L +S + A +S L+++LL + Sbjct: 484 NTYSFIKDGAKIMLTPLKPEDCPKKQEKDKALITMSGL--NKAFRKSSLLYLLLVCEENE 541 Query: 1263 STSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEH 1442 +S F DV PE +P LPP+RDIQH ID +PG+ +PN+P YRMSP+EH Sbjct: 542 VSSPLSKDVKPIIEE-FCDVVPEEIPHGLPPMRDIQHAIDFIPGSIIPNKPAYRMSPQEH 600 Query: 1443 EELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRL 1622 +EL+ QV++LL +G +RES+SPCAVPALL PKKDGTWRMC+DSRA+NKIT++YRFPIPRL Sbjct: 601 KELQHQVKQLLEKGLVRESVSPCAVPALLVPKKDGTWRMCIDSRAVNKITIKYRFPIPRL 660 Query: 1623 DDLLDQLGGACV 1658 DDLLDQL G V Sbjct: 661 DDLLDQLHGYVV 672 
>ref|XP_004309164.1| PREDICTED: uncharacterized protein LOC101300012 [Fragaria vesca subsp. vesca] Length = 1034 Score = 433 bits (1114), Expect = e-118 Identities = 222/430 (51%), Positives = 290/430 (67%), Gaps = 7/430 (1%) Frame = +3 Query: 681 SGDVGPMLMLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVS 860 SGD ++ + LL E + R+++F+STCTI K + IID+GSCEN +S+ V Sbjct: 420 SGDDREYNLVTQRLLCSTKQENQ--RHSIFRSTCTIKEKPMSLIIDSGSCENFVSKKVVE 477 Query: 861 KLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVVPMDACHLLLGR 1040 NL T H PY + W+ +G +V +++ V+ SIG YQD + CDVV MDA H+LLG+ Sbjct: 478 HFNLLTMKHRAPYAIGWIKKGLEVRITETCKVSISIGKFYQDEVECDVVDMDASHVLLGK 537 Query: 1041 PWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXLLSRVPFQTAME- 1217 PWQ+D H+GR NT SF++ I L L+ P + E Sbjct: 538 PWQHDVNTIHNGRENTVSFIWEKHHITL--KPKTKPTNLVSPKESNFLIVAEPCEKVEEL 595 Query: 1218 --ESGLVFVLLAQPL----GDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDIQHHI 1379 ++ ++ L+ + + + F ++ + LP+ LPP+RDIQH I Sbjct: 596 VKDAEAIYPLVVREVMVAEDNKEEKKIPKEVQQLLQDFEELLADDLPNELPPMRDIQHQI 655 Query: 1380 DLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRM 1559 DLV GA+LPN PHYRMSPKE+E L+ ++EELL +GHIRES+SPCAVP LL PKKD +WRM Sbjct: 656 DLVSGASLPNLPHYRMSPKENEILKEKIEELLRKGHIRESMSPCAVPVLLVPKKDRSWRM 715 Query: 1560 CVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKTAFKT 1739 CVDSRAINKIT++YRFPIP+L+D+LD LGG+ VFSK+DL+SGYHQIRI+ GDEWKTAFK+ Sbjct: 716 CVDSRAINKITIKYRFPIPQLEDMLDVLGGSVVFSKIDLRSGYHQIRIKLGDEWKTAFKS 775 Query: 1740 REGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQHLRE 1919 ++GLYEWLVMPFGLSNAPSTFMRVMNQ L+P+IG VVVYFDDILIYS H+QHLR+ Sbjct: 776 KDGLYEWLVMPFGLSNAPSTFMRVMNQVLKPYIGTCVVVYFDDILIYSKSKEEHLQHLRK 835 Query: 1920 VLLVLRRDHL 1949 VL VL+ + L Sbjct: 836 VLEVLQENKL 845 Score = 62.4 bits (150), Expect = 7e-07 Identities = 32/92 (34%), Positives = 56/92 (60%), Gaps = 2/92 (2%) Frame = +3 Query: 120 NFDRELYQRFQNLRQGSRSVDDYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLN 299 ++++ L++++Q + Q +RSV D++ +FY + R + ++ Q V+RYI G+ Q+QD + Sbjct: 206 
DYEQTLFEQYQEVSQENRSVQDFTTDFYRLVERNKLTETKAQQVARYIRGLNPQIQDKIG 265 Query: 300 MFDPLTVAEAHQRASQAEKQAAR--RTTANLR 389 + V EAH+ A +AEK A TT N R Sbjct: 266 LLTFKDVGEAHKMALKAEKLAKSTIATTNNRR 297 >ref|XP_004161321.1| PREDICTED: uncharacterized protein LOC101223713 [Cucumis sativus] Length = 645 Score = 433 bits (1114), Expect = e-118 Identities = 256/665 (38%), Positives = 341/665 (51%), Gaps = 27/665 (4%) Frame = +3 Query: 9 AQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVDDY 188 A WW QL+++R R GK I SW+K KK ++ Sbjct: 78 ASTWWDQLEINRQRCGKQSIRSWEKMKKLLK----------------------------- 108 Query: 189 SNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQAAR 368 ++R++GG++L +++ + + ++EA A E+ A Sbjct: 109 --------------------IARFVGGLQLDIKEKVKLQPFRFLSEAISFAETVEEMIAV 148 Query: 369 RTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSNPSQ----------------- 497 R+ R K S +G +Q Sbjct: 149 RSKNLKRRPAWKTTSTRMNNYADKTNDQPSTSTKGKGKEVENQEVVVERKNEQAFKTSSQ 208 Query: 498 ---GRPGFRGCFNCGDLSHRQADCPKPPT-----GSRGLFTDDVESEPLPLFDTPIXXXX 653 RP F CG H +CP+ T R + D +E Sbjct: 209 NNYSRPLLGKFFRCGQTEHLSNNCPQRKTIAIAEEGRQMSEDSKGAED------------ 256 Query: 654 XXXXXXXXXSGDVGPML--MLRRTLLSPRALETEWLRNNLFQSTCTIGGKVCTFIIDAGS 827 D G + +++R L++P+ E + R+ LF++ CTI G+VC IID S Sbjct: 257 ----EIELIEADDGERVSCVIQRVLITPKE-EKKQQRHCLFKARCTINGRVCDVIIDNDS 311 Query: 828 CENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVV 1007 +N +++ V+ LNL E+HP Y++ W+ + + VS+ V SI Y+D I CDV+ Sbjct: 312 SKNFVAKKLVTVLNLKAEAHPTSYKIGWVRKEGEATVSEICTVPLSIENAYKDQIVCDVI 371 Query: 1008 PMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXXLL 1187 MD CHLLLGRPWQYD H GR NTY G+K+VL+ Sbjct: 372 EMDVCHLLLGRPWQYDTQSLHKGRENTYELQLMGRKVVLLPI------------------ 413 Query: 1188 SRVPFQTAMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDI 1367 T + GL G+ D+ P LPPLRDI Sbjct: 414 ------TRKNKEGL--------RGEKQLFTTVSGKNMLKEREQDLLGLEEPEGLPPLRDI 459 Query: 1368 QHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDG 1547 QHHIDL+PGA+LPN HYRMSP+E++ L +EELL +GHI+ 
SLSPCAVPALLT KKDG Sbjct: 460 QHHIDLIPGASLPNLAHYRMSPQEYKTLHDHIEELLKKGHIKPSLSPCAVPALLTLKKDG 519 Query: 1548 TWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKT 1727 +WRMCVDSRAIN+ITV+YRF IPR+ DLLDQLG A +FSK+DLKSGYHQIRIR GDEWKT Sbjct: 520 SWRMCVDSRAINRITVKYRFSIPRISDLLDQLGKASIFSKIDLKSGYHQIRIRPGDEWKT 579 Query: 1728 AFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQ 1907 FKT+EGL+EW+VMPFGLSNAP+TFMR+MNQ L PF+ KF+VVYFDDIL+YST+ H+ Sbjct: 580 TFKTKEGLFEWMVMPFGLSNAPNTFMRLMNQILHPFLNKFIVVYFDDILVYSTNNEEHLL 639 Query: 1908 HLREV 1922 HLR++ Sbjct: 640 HLRKM 644 >emb|CAN71532.1| hypothetical protein VITISV_018180 [Vitis vinifera] Length = 1323 Score = 423 bits (1087), Expect = e-115 Identities = 217/469 (46%), Positives = 292/469 (62%), Gaps = 2/469 (0%) Frame = +3 Query: 519 CFNCGDLSHRQADCPKPPTGSRGLFTDDVESEPLPLFDTPIXXXXXXXXXXXXXSGDVGP 698 CF CG H CP R + + E E P + G Sbjct: 230 CFKCGGHGHYAVVCPTKGLHFR-VEEPESELESYPKEEETYNEDEVSEECDYYDGMTEGH 288 Query: 699 MLMLRRTLLSPRAL-ETEWLRNNLFQSTCTIGGKVCTFIIDAGSCENVISEVAVSKLNLT 875 L++R L P+ E +W ++FQ+ + G++CT IID GS N+ S+ V KLNL Sbjct: 289 SLVVRPLLTVPKVKGEKDWRXTSIFQTRISCQGRLCTMIIDGGSSLNIASQELVEKLNLK 348 Query: 876 TESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCDVVPMDACHLLLGRPWQYD 1055 TE HP P+R++W++ T + S R L F G +++ ++C+V+P+ H+LLGRPW +D Sbjct: 349 TERHPNPFRVAWVND-TSIPXSFRCLXTFLFGKDFEEFVWCEVLPIKVSHILLGRPWLFD 407 Query: 1056 NTVQHDGRCNTYSFMFRG-KKIVLVXXXXXXXXXXXXXXXXXXLLSRVPFQTAMEESGLV 1232 VQHDG NTY+ + KKI+ +L+ F+ +E+ ++ Sbjct: 408 RRVQHDGYENTYALIHNXRKKILRPMKEVPPIKKSNENAQPKKVLTMCQFENESKETKVI 467 Query: 1233 FVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESLPSTLPPLRDIQHHIDLVPGAALPNR 1412 F L+A+ + + +P +LP+ LPP+RD+QH IDL+PGA+LPN Sbjct: 468 FALMARKVEEFKEQDKE-------------YPANLPNQLPPMRDVQHAIDLIPGASLPNL 514 Query: 1413 PHYRMSPKEHEELRRQVEELLARGHIRESLSPCAVPALLTPKKDGTWRMCVDSRAINKIT 1592 YRM+P EH EL+RQV+ELL + IRESLSPC VP LLTPKKDG+WRMCVDSRAINKIT Sbjct: 515 XAYRMNPTEHXELKRQVDELLTKCFIRESLSPCGVPTLLTPKKDGSWRMCVDSRAINKIT 574 Query: 1593 
VRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQIRIRTGDEWKTAFKTREGLYEWLVMP 1772 +Y+FPIPRLDD+LD + G+ +FSK+DL+SGYHQIR R GDEWKT+FKT++GLYEWLVMP Sbjct: 575 TKYQFPIPRLDDMLDMMVGSVIFSKIDLRSGYHQIRXRLGDEWKTSFKTKDGLYEWLVMP 634 Query: 1773 FGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILIYSTDPILHIQHLRE 1919 FGL+NAPSTFMR+M Q L+PFIG+F VVYFDDILIYS H +HL++ Sbjct: 635 FGLTNAPSTFMRIMTQVLKPFIGRFFVVYFDDILIYSRXCEDHKEHLKQ 683 >ref|XP_004292437.1| PREDICTED: uncharacterized protein LOC101306407 [Fragaria vesca subsp. vesca] Length = 1300 Score = 388 bits (996), Expect = e-105 Identities = 193/324 (59%), Positives = 233/324 (71%) Frame = +3 Query: 978 YQDSIYCDVVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXX 1157 YQDS +CDV MDACHLLLGRP QYD HDG NTY+F+ G K++L Sbjct: 530 YQDSQWCDVALMDACHLLLGRPSQYDRKYVHDGHLNTYTFVKDGNKVIL-GPSRYEHKPS 588 Query: 1158 XXXXXXXXLLSRVPFQTAMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVFPESL 1337 L+ F +E G+ ++L+ + D+ + F DV PE L Sbjct: 589 SKHAEGDNFLTMCNFLNESKEEGMFYMLIGREANDN-AHEAPEVVASLLKEFVDVVPEEL 647 Query: 1338 PSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCAV 1517 P LPPLRDIQHHID VPGA+LPN+PHYRMSP+E++EL + V ELL +G IRES+SPC V Sbjct: 648 PVGLPPLRDIQHHIDFVPGASLPNKPHYRMSPQEYDELNKYVTELLKKGVIRESMSPCVV 707 Query: 1518 PALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQI 1697 ALLTPKKDGTW+MCVDSRAINKI VRYRFPIPRL+D+LD L GA VFSK+DL+SGYHQI Sbjct: 708 SALLTPKKDGTWQMCVDSRAINKIAVRYRFPIPRLEDMLDHLAGAKVFSKIDLRSGYHQI 767 Query: 1698 RIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFDDILI 1877 R+R GDEWKTAFKTR+GL+EW+VMPFGL+NAPSTFMR++ Q FIGKFVVVYFDDIL+ Sbjct: 768 RMRPGDEWKTAFKTRDGLFEWMVMPFGLTNAPSTFMRIIIQVFCSFIGKFVVVYFDDILV 827 Query: 1878 YSTDPILHIQHLREVLLVLRRDHL 1949 YS+D ++HLR+V VLR + L Sbjct: 828 YSSDVSQLMEHLRQVFEVLRAEKL 851 Score = 95.5 bits (236), Expect = 8e-17 Identities = 70/259 (27%), Positives = 115/259 (44%), Gaps = 7/259 (2%) Frame = +3 Query: 42 RVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVDDYSNEFYEFLARV 221 R+ HG +K +R F+ N+ + + + N+RQGSR+VDD++ EF R Sbjct: 274 
RLEHGGSPWLITGAMRKELRKKFMHENYLQNNFLKLHNIRQGSRTVDDFTKEFDLLTMRC 333 Query: 222 DVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAEKQAARR--TTANLRLX 395 + + Q V+RY+ G+R ++ DV+ + + +E +Q A Q EKQ R A+ Sbjct: 334 GLAEEEEQTVARYLAGLRREIHDVVVLQPCWSYSEVYQLAIQVEKQLQSRYKRGASEDYE 393 Query: 396 XXXXXXXXXXXIVPKVPA----PTQQSAPPRGYSNPSQGRPGFRGCFNCGDLSHRQADCP 563 I P + A P + A + + S + CF C L H +DCP Sbjct: 394 AKKIASSSTPKITPMLDANIREPLKNQAEHKAEARESNKGKNVK-CFKCSGLGHIASDCP 452 Query: 564 KPPTGSRGLFTDDVESEPLPLFDTPIXXXXXXXXXXXXXSGDVGPMLMLRRTLLSPRAL- 740 + L + ES L D P D G L++R+T+ + + Sbjct: 453 NRRVVN--LVEELGESSSAGLDDMPTSDDYGDQDEEEITWSDHGESLVIRQTMSASKVED 510 Query: 741 ETEWLRNNLFQSTCTIGGK 797 ++EWL++N+F + CT GK Sbjct: 511 DSEWLKHNIFHTKCTSNGK 529 >ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobroma cacao] gi|508724802|gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao] Length = 1392 Score = 387 bits (994), Expect = e-104 Identities = 241/646 (37%), Positives = 328/646 (50%), Gaps = 36/646 (5%) Frame = +3 Query: 3 GRAQAWWQQLKVSRVRHGKPKITSWDKFKKHIRSAFLPYNFDRELYQRFQNLRQGSRSVD 182 G A W ++++ R R GK KI++W+ K +R FLP ++ ELY++F L+Q + +V+ Sbjct: 149 GTALQWRKRVEEQRARQGKLKISTWEHMKSKLRKQFLPADYTMELYEKFHCLKQNNMTVE 208 Query: 183 DYSNEFYEFLARVDVNDSPTQLVSRYIGGMRLQLQDVLNMFDPLTVAEAHQRASQAE--- 353 +Y++EF RV + +S Q SRY+ G+ ++D + + + +A Q A AE Sbjct: 209 EYTSEFNNLSIRVGLVESNEQNTSRYLAGLNHSIRDEMGVVRLYNIEDARQYALSAEKRV 268 Query: 354 -KQAARRTTANLRLXXXXXXXXXXXXIVPKVPAPTQQSAPPRGYSN-------------- 488 + AR+ + RG +N Sbjct: 269 LRYGARKPLYGTHWQNNSEARRGYPTSQQNYQGAATINKTNRGATNFEKNDKGKGIMPYG 328 Query: 489 --PSQGRPGFRG-------CFNCGDLSHRQADCPKPPTGSRGLFTDDVESEPLPLFDTPI 641 S G +G CF CG+ H CP+ R + + E P++D Sbjct: 329 GQNSSGSSTNKGGSNSHIRCFTCGEKGHTSFACPQ-----RRVNLAKLAEELEPVYDE-- 381 Query: 642 XXXXXXXXXXXXXSGDVGPM----LMLRRTLLSPRALETE-WLRNNLFQSTCTIGGKVCT 806 DV P L++RR + + E E W R Sbjct: 382 -------YEEEVEEIDVYPAQRDSLVVRRVMTTTVNEEAEDWKRR--------------- 419 Query: 807 
FIIDAGSCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQD 986 ++KL L T HP PY++ WL + +V V+ + LV F++G D Sbjct: 420 ----------------MNKLKLPTNRHPYPYKIGWLKKEHEVPVTTQCLVKFTMGDNLDD 463 Query: 987 SIYCDVVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXX 1166 CDVVPMD H+L+GRPW YD+ + H + NTYSF K+ L Sbjct: 464 EALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTLYPLREETKKSANNK 523 Query: 1167 XXXXX-LLSRVPFQTAMEESGLVFVLLAQPLGD---STSXXXXXXXXXXXXXFADVFPES 1334 LS F+ E G+++ L+ + L S S F ++F E Sbjct: 524 ISKITGYLSAENFEAEGSEMGIMYALVTKHLKSDQMSKSPQYPTEIQQLLKEFGELFNED 583 Query: 1335 LPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLSPCA 1514 LP +LP LR IQH IDLVPGAALPN P Y+M P + E++RQVEELL +G +RES SPCA Sbjct: 584 LPKSLPHLRSIQHAIDLVPGAALPNLPAYKMPPMQRTEVQRQVEELLEKGLVRESKSPCA 643 Query: 1515 VPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSGYHQ 1694 PALL PKKDG+WRMCVDSRAINKIT++ RFPIPRLD++LDQL G+ VFSK+DLKSGYHQ Sbjct: 644 CPALLAPKKDGSWRMCVDSRAINKITIKSRFPIPRLDEMLDQLVGSRVFSKIDLKSGYHQ 703 Query: 1695 IRIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRP 1832 IR+R GDE KTAFKT +GL+EWLVMPFGLSNAPSTFM + L+P Sbjct: 704 IRMRDGDERKTAFKTPDGLFEWLVMPFGLSNAPSTFMSHGRKGLKP 749 >ref|XP_007198961.1| hypothetical protein PRUPE_ppa020671mg, partial [Prunus persica] gi|462394256|gb|EMJ00160.1| hypothetical protein PRUPE_ppa020671mg, partial [Prunus persica] Length = 1460 Score = 379 bits (974), Expect = e-102 Identities = 201/388 (51%), Positives = 252/388 (64%), Gaps = 12/388 (3%) Frame = +3 Query: 822 GSCENVISEVAVSKLNLTTESHPKPYRLSWLSQGTDVMVSKRVLVNFSIGPTYQDSIYCD 1001 GS NVIS+ AV++LNL E HP P+ ++W+ + T + V++ LV+ +G T + IY D Sbjct: 436 GSTMNVISKSAVTRLNLKPEPHPHPFHVAWVDK-TKLPVTEWCLVSLKLG-TCDEDIYLD 493 Query: 1002 VVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXXXXXXXXXXXXXX 1181 +PM+ H+LLGRPW YD+ VQ+ GR NTY+F GK I+L Sbjct: 494 QLPMNVAHVLLGRPWLYDHRVQNCGRENTYTFQHEGKSIMLRPANPAIKPTKTNITTSSP 553 Query: 1182 ------------LLSRVPFQTAMEESGLVFVLLAQPLGDSTSXXXXXXXXXXXXXFADVF 1325 LLS F+ E+G+VF L+ + + + S F+DV Sbjct: 554 
SQTGNMSGHRLALLSYGEFEKESLETGVVFALVIKEISAAPSYQQPEPLHQFLNEFSDVM 613 Query: 1326 PESLPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIRESLS 1505 P+ LP+ LPP+RDIQH IDLVPG+ LPN PHYRM+ EH EL Q++ LL +G IR SLS Sbjct: 614 PDDLPNELPPMRDIQHAIDLVPGSQLPNLPHYRMNSSEHAELNTQIQGLLDKGFIRHSLS 673 Query: 1506 PCAVPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLDLKSG 1685 PCAVP L TPKKDG+WRMCVDSRAINKIT D+LD+L G+ FSK+DL SG Sbjct: 674 PCAVPVLFTPKKDGSWRMCVDSRAINKIT-----------DMLDELAGSKWFSKIDLHSG 722 Query: 1686 YHQIRIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVVVYFD 1865 YHQIRIR GDEWKTAFKT +GLYEWLVMPFG+SNAPSTFMRVM RP+IGKF+VVYFD Sbjct: 723 YHQIRIREGDEWKTAFKTPDGLYEWLVMPFGMSNAPSTFMRVMTHVFRPYIGKFLVVYFD 782 Query: 1866 DILIYSTDPILHIQHLREVLLVLRRDHL 1949 DILIYS H+QHLR + +LR++ L Sbjct: 783 DILIYSHSKEDHLQHLRTIFHMLRQEKL 810 >ref|XP_007019611.1| Uncharacterized protein TCM_035724 [Theobroma cacao] gi|508724939|gb|EOY16836.1| Uncharacterized protein TCM_035724 [Theobroma cacao] Length = 475 Score = 365 bits (938), Expect = 3e-98 Identities = 187/332 (56%), Positives = 228/332 (68%), Gaps = 4/332 (1%) Frame = +3 Query: 966 IGPTYQDSIYCDVVPMDACHLLLGRPWQYDNTVQHDGRCNTYSFMFRGKKIVLVXXXXXX 1145 +G D CDVVPMD H+L+GRPW YD+ + H + NTYSF K+ L Sbjct: 1 MGNNLDDEALCDVVPMDVGHILVGRPWLYDHDMVHKTKPNTYSFYKNNKRYTLYPLREET 60 Query: 1146 XXXXXXXXXXXX-LLSRVPFQTAMEESGLVFVLLAQPLGD---STSXXXXXXXXXXXXXF 1313 LS F+ E G+++ L+ + L S S F Sbjct: 61 KKSANNKISKITGYLSAENFEAEGSEMGIMYALVTKHLKSDQMSKSPQYPTEIQQLLKEF 120 Query: 1314 ADVFPESLPSTLPPLRDIQHHIDLVPGAALPNRPHYRMSPKEHEELRRQVEELLARGHIR 1493 ++F E LP +LPPLR IQH IDLVPGAALPN P YRM P + E++RQVEELL +G +R Sbjct: 121 GELFNEDLPKSLPPLRSIQHAIDLVPGAALPNLPAYRMPPMQRAEVQRQVEELLEKGLVR 180 Query: 1494 ESLSPCAVPALLTPKKDGTWRMCVDSRAINKITVRYRFPIPRLDDLLDQLGGACVFSKLD 1673 ES SPCA PALL PKKDG+WRMCVDSRAINKIT++YRFPIPRLD++LDQL G+ VFSK+D Sbjct: 181 ESKSPCACPALLAPKKDGSWRMCVDSRAINKITIKYRFPIPRLDEMLDQLVGSRVFSKID 240 Query: 1674 LKSGYHQIRIRTGDEWKTAFKTREGLYEWLVMPFGLSNAPSTFMRVMNQALRPFIGKFVV 1853 LKSGYHQIR+R GDEWKTAFKT 
+GL+EWLVMPFGLSNAPSTFMRVM + L+PF+ FVV Sbjct: 241 LKSGYHQIRMRDGDEWKTAFKTPDGLFEWLVMPFGLSNAPSTFMRVMAEVLKPFLNSFVV 300 Query: 1854 VYFDDILIYSTDPILHIQHLREVLLVLRRDHL 1949 VYFDDILIYS H+++LR+VL VL+++ L Sbjct: 301 VYFDDILIYSHTKEKHLKYLRQVLEVLQKEQL 332