BLASTX nr result

ID: Paeonia24_contig00005809 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Paeonia24_contig00005809
         (2446 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854...   872   0.0  
emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera]   872   0.0  
ref|XP_002527444.1| protein dimerization, putative [Ricinus comm...   853   0.0  
ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615...   828   0.0  
ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr...   827   0.0  
ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom...   234   1e-58
ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [A...   227   2e-56
ref|XP_004299161.1| PREDICTED: uncharacterized protein LOC101293...   221   2e-54
ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]...   214   1e-52
ref|XP_003543854.2| PREDICTED: uncharacterized protein LOC100780...   209   6e-51
ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662...   207   1e-50
ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prun...   207   1e-50
ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu...   207   1e-50
ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627...   207   2e-50
ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307...   206   5e-50
ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   204   2e-49
ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222...   204   2e-49
ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, part...   203   3e-49
ref|XP_002273287.1| PREDICTED: uncharacterized protein LOC100260...   198   1e-47
ref|XP_006603987.1| PREDICTED: uncharacterized protein LOC102660...   197   2e-47

>ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera]
          Length = 635

 Score =  872 bits (2254), Expect = 0.0
 Identities = 432/613 (70%), Positives = 504/613 (82%), Gaps = 4/613 (0%)
 Frame = -2

Query: 2280 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2101
            M +ESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP
Sbjct: 1    MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60

Query: 2100 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 1921
            AIDRS+REAFQILEEERLARKKKRTSGSGK GKRIRTSQ S+  +WK+I+KEDVDD+VAR
Sbjct: 61   AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120

Query: 1920 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 1741
            FFYA+GL+ ++ NSPYF E+   +A+FGPGYE P+ +KLS  FL KEKA+IEK++A VRE
Sbjct: 121  FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180

Query: 1740 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 1561
            SWP TGCTILCVN L  T+G    N+FVSSPRGLMF KA+ +NDGD ++N+F   +SDAI
Sbjct: 181  SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240

Query: 1560 MEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1381
            MEV  TNVLQII NLGH SESFESL++ KF  +FWSPCTSHSI  LME+I +LDWIKPIV
Sbjct: 241  MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300

Query: 1380 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1201
            LCAK I++C+LT+QRSS  V T    +SSDPLS KFAPSY +V RIFE+KQAL       
Sbjct: 301  LCAKEIDECILTYQRSSLCVLT---LESSDPLSTKFAPSYCIVERIFELKQALLGVVVSE 357

Query: 1200 XXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1021
                 KL T+ ED +++E +ILGDNFWS     LQ  EPFVRLL T +I+KSVMGDVF+W
Sbjct: 358  EWKQWKL-TIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416

Query: 1020 RMRALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQAKDKTAMR 841
            R++ALEA++ KG+DD  LNQ+E+L+ES+WDM FSPLHA+GYILNP+YFG  Q+KDKT MR
Sbjct: 417  RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476

Query: 840  GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 661
            GWK+TL+RYESDS  RRVLREQLSSYWR+EGS G+EDAVDCRDKMDPVAWWENFGFETPH
Sbjct: 477  GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536

Query: 660  LQTLAVKILCQVSSVGICQV----SDIPCQEAANRLKVERVEDLVFVQNNLRLHSQRIGN 493
            LQTLA+KIL QVSSV + Q     ++  CQ A N L VER EDLVFV+NNLRLHSQR GN
Sbjct: 537  LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERTEDLVFVRNNLRLHSQRNGN 596

Query: 492  LNSPYGVKHGMAS 454
             +S  G ++  +S
Sbjct: 597  SSSSPGNRNQSSS 609


>emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera]
          Length = 635

 Score =  872 bits (2252), Expect = 0.0
 Identities = 431/613 (70%), Positives = 504/613 (82%), Gaps = 4/613 (0%)
 Frame = -2

Query: 2280 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2101
            M +ESDKWGWKHVSVFGGFDKGSGTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP
Sbjct: 1    MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60

Query: 2100 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 1921
            AIDRS+REAFQILEEERLARKKKRTSGSGK GKRIRTSQ S+  +WK+I+KEDVDD+VAR
Sbjct: 61   AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120

Query: 1920 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 1741
            FFYA+GL+ ++ NSPYF E+   +A+FGPGYE P+ +KLS  FL KEKA+IEK++A VRE
Sbjct: 121  FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180

Query: 1740 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 1561
            SWP TGCTILCVN L  T+G    N+FVSSPRGLMF KA+ +NDGD ++N+F   +SDAI
Sbjct: 181  SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240

Query: 1560 MEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1381
            MEV  TNVLQII NLGH SESFESL++ KF  +FWSPCTSHSI  LME+I +LDWIKPIV
Sbjct: 241  MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300

Query: 1380 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1201
            LCAK I++C+LT+QRSS  V T    +SSDPLS KFAPSY +V RIFE+KQAL       
Sbjct: 301  LCAKEIDECILTYQRSSLCVLT---LESSDPLSTKFAPSYCIVERIFELKQALLGVVVSE 357

Query: 1200 XXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1021
                 KL T+ ED +++E +ILGDNFWS     LQ  EPFVRLL T +I+KSVMGDVF+W
Sbjct: 358  EWKQWKL-TIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416

Query: 1020 RMRALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQAKDKTAMR 841
            R++ALEA++ KG+DD  LNQ+E+L+ES+WDM FSPLHA+GYILNP+YFG  Q+KDKT MR
Sbjct: 417  RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476

Query: 840  GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 661
            GWK+TL+RYESDS  RRVLREQLSSYWR+EGS G+EDAVDCRDKMDPVAWWENFGFETPH
Sbjct: 477  GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536

Query: 660  LQTLAVKILCQVSSVGICQV----SDIPCQEAANRLKVERVEDLVFVQNNLRLHSQRIGN 493
            LQTLA+KIL QVSSV + Q     ++  CQ A N L VER EDLVFV+NNLRLHSQR GN
Sbjct: 537  LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERAEDLVFVRNNLRLHSQRNGN 596

Query: 492  LNSPYGVKHGMAS 454
             +S  G ++  +S
Sbjct: 597  SSSSPGNRNQSSS 609


>ref|XP_002527444.1| protein dimerization, putative [Ricinus communis]
            gi|223533179|gb|EEF34936.1| protein dimerization,
            putative [Ricinus communis]
          Length = 633

 Score =  853 bits (2205), Expect = 0.0
 Identities = 416/613 (67%), Positives = 501/613 (81%), Gaps = 4/613 (0%)
 Frame = -2

Query: 2280 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2101
            M SESDKWGW+HVSVFGGFD+GSGTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP
Sbjct: 1    MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60

Query: 2100 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 1921
            AIDRS+REAFQILEEERL RKKK+ S +GKPGKR R SQ S++  WK+I+KEDVDD+VAR
Sbjct: 61   AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118

Query: 1920 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 1741
            FFYA+GLNI V NSPYF+E++  + +FG GYE PS+DKLS SFL KEK RIEKSLA +RE
Sbjct: 119  FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178

Query: 1740 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 1561
            SWP TGCTILCV  LD   GC  IN+FVSSPRGL+F KAV V+D D  ++V  GA+SDAI
Sbjct: 179  SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238

Query: 1560 MEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1381
            +EVG +NVLQII +LG   +S ES ++ KFP IFWSPCTSHSI  LME IAEL+W+KPIV
Sbjct: 239  LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSILMLMEEIAELEWVKPIV 298

Query: 1380 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1201
            LCA+ IEQC++T+Q ++  +F Q  K+S D +SAKFAPSY+ V RIFE++Q LQ      
Sbjct: 299  LCARRIEQCIMTYQHATSCIFMQSPKESCDLISAKFAPSYFFVQRIFELRQTLQEVVVSE 358

Query: 1200 XXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1021
                    ++ ++  SIE++ILGD+FWS +HL LQL EPF++LL   +IDKSV+G V+DW
Sbjct: 359  QWKH----SIGDNVESIESAILGDDFWSKSHLLLQLYEPFIKLLGLLDIDKSVIGAVYDW 414

Query: 1020 RMRALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQAKDKTAMR 841
            R++ALEA+R K IDDD LNQ+EVL+E++WD+ FSPLHA GYILNPRY G  Q KDK+ MR
Sbjct: 415  RVQALEALRSKAIDDDILNQLEVLIENKWDVLFSPLHATGYILNPRYIGKFQTKDKSVMR 474

Query: 840  GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 661
            GWK+TLERYE +S ARRVLREQLSSYWR+EGSLGDEDAVDCRDKMDPVAWWENFGFETP 
Sbjct: 475  GWKATLERYEGESTARRVLREQLSSYWRLEGSLGDEDAVDCRDKMDPVAWWENFGFETPS 534

Query: 660  LQTLAVKILCQVSSVGIC----QVSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQRIGN 493
            LQTLA+K+L QVSSV +C    Q +D  CQEAANRL V+RVEDL+FV+NNLRLH Q+  N
Sbjct: 535  LQTLAIKVLSQVSSVALCQEIWQTNDFSCQEAANRLGVQRVEDLLFVRNNLRLHYQKNCN 594

Query: 492  LNSPYGVKHGMAS 454
            L++  G+++ ++S
Sbjct: 595  LSTSPGLRNTISS 607


>ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus
            sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED:
            uncharacterized protein LOC102615434 isoform X2 [Citrus
            sinensis]
          Length = 636

 Score =  828 bits (2138), Expect = 0.0
 Identities = 412/604 (68%), Positives = 483/604 (79%), Gaps = 4/604 (0%)
 Frame = -2

Query: 2280 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2101
            M SESDKWGW+HVSVFGGF++GSGTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP
Sbjct: 1    MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60

Query: 2100 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 1921
            AIDRSMRE FQILEEER+ARKKKRTSG  K GKRIR  Q S+  + K+ISKEDVD++VAR
Sbjct: 61   AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSI--VSKAISKEDVDEMVAR 118

Query: 1920 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 1741
            FFYA GLN++V NSPYF E++ ++A+FG GY+ PS++ LS SFL KEK +IEK +ASVRE
Sbjct: 119  FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178

Query: 1740 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 1561
            SWP TGCTILCV+ LD   GC    +FVSSPRGL+F KA+ ++D D  EN+F   +SDAI
Sbjct: 179  SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238

Query: 1560 MEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1381
            +EVG  NVLQII +LGH  +S+ESL++ KFP IF SPCT  SI   ME IA L+WIK  V
Sbjct: 239  LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298

Query: 1380 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1201
            LCAK IEQ ++ +Q + P +F  +LK+SSD +S K APSY  V RI E+KQ LQ      
Sbjct: 299  LCAKRIEQHIMYYQHAYPCLFPHNLKESSDQVSTKIAPSYCFVQRIIELKQVLQEAVVSE 358

Query: 1200 XXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1021
                 KL ++  D+  +E++ILGD+FW   HLFLQLCEPFVRLLATF+IDKSVMG V+DW
Sbjct: 359  EFKQWKL-SMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417

Query: 1020 RMRALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQAKDKTAMR 841
            R +ALEA+R KGID  ALNQ+EVL E+RWD  FSPLHAAGYILNPRYFG  Q KDKT MR
Sbjct: 418  RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477

Query: 840  GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 661
            GWKSTLERYESDS  RR+LREQLSSYWR+EGSLG+EDAVD RDKM+PVAWWENFGFE  H
Sbjct: 478  GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537

Query: 660  LQTLAVKILCQVSSVGICQV----SDIPCQEAANRLKVERVEDLVFVQNNLRLHSQRIGN 493
            LQTLA+K+L QVSSV ICQ     +D PC+EAANR  VER EDL+FV+NNLRLH+QR  N
Sbjct: 538  LQTLAIKVLSQVSSVAICQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNVN 597

Query: 492  LNSP 481
            L+SP
Sbjct: 598  LSSP 601


>ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina]
            gi|557526284|gb|ESR37590.1| hypothetical protein
            CICLE_v10028008mg [Citrus clementina]
          Length = 636

 Score =  827 bits (2136), Expect = 0.0
 Identities = 411/604 (68%), Positives = 483/604 (79%), Gaps = 4/604 (0%)
 Frame = -2

Query: 2280 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2101
            M SESDKWGW+HVSVFGGF++GSGTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP
Sbjct: 1    MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60

Query: 2100 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 1921
            AIDRSMRE FQILEEER+ARKKKRTSG  K GKRIR  Q S+  + K+ISKEDVD++VAR
Sbjct: 61   AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSI--VSKAISKEDVDEMVAR 118

Query: 1920 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 1741
            FFYA GLN++V NSPYF E++ ++A+FG GY+ PS++ LS SFL KEK +IEK +ASVRE
Sbjct: 119  FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178

Query: 1740 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 1561
            SWP TGCTILCV+ LD   GC    +FVSSPRGL+F KA+ ++D D  EN+F   +SDAI
Sbjct: 179  SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238

Query: 1560 MEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1381
            ++VG  NVLQII +LGH  +S+ESL++ KFP IF SPCT  SI   ME IA L+WIK  V
Sbjct: 239  LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298

Query: 1380 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1201
            LCAK IEQ +L +Q + P +F  +LK+SSD +S K APSY  V RI E+KQ LQ      
Sbjct: 299  LCAKRIEQHILYYQHAYPCLFPHNLKESSDQVSTKIAPSYCFVQRIIELKQVLQEAVVSE 358

Query: 1200 XXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1021
                 KL ++  D+  +E++ILGD+FW   HLFLQLCEPFVRLLATF+IDKSVMG V+DW
Sbjct: 359  EFKQWKL-SMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417

Query: 1020 RMRALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQAKDKTAMR 841
            R +ALEA+R KGID  ALNQ+EVL E+RWD  FSPLHAAGYILNPRYFG  Q KDKT MR
Sbjct: 418  RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477

Query: 840  GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 661
            GWKSTLERYESDS  RR+LREQLSSYWR+EGSLG+EDAVD RDKM+PVAWWENFGFE  H
Sbjct: 478  GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537

Query: 660  LQTLAVKILCQVSSVGICQV----SDIPCQEAANRLKVERVEDLVFVQNNLRLHSQRIGN 493
            LQTLA+K+L QVSSV +CQ     +D PC+EAANR  VER EDL+FV+NNLRLH+QR  N
Sbjct: 538  LQTLAIKVLSQVSSVAVCQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNVN 597

Query: 492  LNSP 481
            L+SP
Sbjct: 598  LSSP 601


>ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobroma cacao]
            gi|508784897|gb|EOY32153.1| Uncharacterized protein
            TCM_039722 [Theobroma cacao]
          Length = 381

 Score =  234 bits (597), Expect = 1e-58
 Identities = 113/191 (59%), Positives = 142/191 (74%)
 Frame = -2

Query: 1176 TLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALEAI 997
            ++ +D + IEASILGD FWS  H+ LQL +PF +LLA  +IDKSVMG ++DWR++ALE +
Sbjct: 163  SILKDILIIEASILGDEFWSNAHMMLQLFKPFAKLLAMLDIDKSVMGAIYDWRVQALEVV 222

Query: 996  RRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQAKDKTAMRGWKSTLER 817
            R K ID+ ALNQ+EVL+E++W++ FS LHAAGYILNP YFG                   
Sbjct: 223  RSKEIDETALNQLEVLIENKWNVLFSLLHAAGYILNPGYFGK------------------ 264

Query: 816  YESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHLQTLAVKI 637
                  AR VLR+QLSSYWR+EGS G+EDA+DCRDKMD VAWWENFGFETPHLQTLA+K+
Sbjct: 265  ------ARWVLRKQLSSYWRLEGSFGEEDALDCRDKMDLVAWWENFGFETPHLQTLAIKV 318

Query: 636  LCQVSSVGICQ 604
            L QVS++ +CQ
Sbjct: 319  LSQVSTISMCQ 329



 Score =  167 bits (422), Expect = 3e-38
 Identities = 77/91 (84%), Positives = 84/91 (92%)
 Frame = -2

Query: 2280 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2101
            M SE DKWGW+HV+VFG FD+GSGTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC 
Sbjct: 1    MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60

Query: 2100 AIDRSMREAFQILEEERLARKKKRTSGSGKP 2008
            AI+R++REAF ILEEERLARKKKRT GSGKP
Sbjct: 61   AINRTLREAFHILEEERLARKKKRTFGSGKP 91



 Score = 79.0 bits (193), Expect = 1e-11
 Identities = 36/52 (69%), Positives = 40/52 (76%)
 Frame = -2

Query: 1845 SFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDC 1690
            +FG GYE PS+DKLS  FL KEK RIEKS+  VRESWP TG T+LCV CL C
Sbjct: 92   TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRESWPHTGYTVLCVGCLGC 143


>ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda]
            gi|548843859|gb|ERN03513.1| hypothetical protein
            AMTR_s00003p00270420 [Amborella trichopoda]
          Length = 732

 Score =  227 bits (578), Expect = 2e-56
 Identities = 172/640 (26%), Positives = 278/640 (43%), Gaps = 56/640 (8%)
 Frame = -2

Query: 2253 WKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREA 2074
            W ++   G    G G    +C  C   + GSY+RV++HLLG  G GVK C  ID      
Sbjct: 35   WAYMEKIGRCHTGGGNWMLRCVLCKAEFKGSYTRVKSHLLGKVGTGVKRCLGIDNETLAT 94

Query: 2073 FQILEEERLARKKKRTSGSGKPGKRIRTSQLSL------NHIWKSIS---KEDVDDVVAR 1921
               L +E   RK + +S S  P  ++ +  + L      N + K +    K+ +D ++AR
Sbjct: 95   LLRLNDEGSTRKIRSSSRSSVPLLKVNSGSIGLKKRRGANDLVKLLDLAPKDVLDRMIAR 154

Query: 1920 FFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDKLSGSFLIKEKARIEKSLASVR 1744
             FYA G+++++  SPYF ++I         GY  P+ D L  S L  EKA IE+S+   R
Sbjct: 155  CFYASGISLNLIRSPYFRDMIRYACENSLEGYVLPTFDNLRTSLLDAEKANIEQSVKPFR 214

Query: 1743 ESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKA----VRVNDGDTLENVFTGA 1576
             SW   G ++L     D T     IN   +S  G +F KA    V + + D ++N+F   
Sbjct: 215  SSWGSRGVSLLTDGWTDTTAKRPLINFMAASDIGSIFLKAIDSSVEMMNTDYMKNLFL-- 272

Query: 1575 ISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELD- 1399
              + + EVG T+V+QII +           +    P IFW+PC  H++   ++NI   D 
Sbjct: 273  --EMVAEVGPTSVVQIITDNSPICRVAGQRVEGMHPYIFWTPCVIHTLNLALKNICSPDD 330

Query: 1398 -----------WIKPIVLCAKGI------EQCMLTFQRSSPNVFTQDLKQSSDPLSAKFA 1270
                       WI+ +    K I         +LT     P +    + +S      +FA
Sbjct: 331  ERKAEKYLHCQWIRDLDRDVKMIRSFVVDHNAVLTIYSQYPTLRLLSVTES------RFA 384

Query: 1269 PSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLC 1090
             +  +V RI E+K AL             +   +E    +++ ++ D +W      +   
Sbjct: 385  STVIIVKRIKEVKPAL-CRMVVDSYWKVLVEEDAEKARRVKSCLVDDLWWEKIEFLIAFT 443

Query: 1089 EPFVRLLATFNIDKSVMGDVFDWRMRALEAIRRKGIDDDALN----------QVEVLLES 940
            EP + +L   + D+  + +V+D     +E +R     ++  N           +  +L  
Sbjct: 444  EPILAMLRAIDTDEPTLHEVYDMWATMIEEVRGIIFRNEGKNIFLNESSFYEDIHRILVG 503

Query: 939  RWDMYFSPLHAAGYILNPRYFGNN----------QAKDKTAMRGWKSTLERYESDSGARR 790
             W+   +PL    + LNP+Y+ +             KD+    G      R        +
Sbjct: 504  SWNKSKTPLQCLAHSLNPKYYSDEWLGEVPSRLPPHKDREVSDGRNVCFARLFPAPSELQ 563

Query: 789  VLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHLQTLAVKILCQVSSVGI 610
             + E+   +   +G  G  D +  R  M P++WWENFG   P L  LA ++L Q SS   
Sbjct: 564  KVHEEFEMFSMCKGHFGHWDVMSSRFSMSPISWWENFGAHVPRLAKLADRLLSQPSSSSC 623

Query: 609  CQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQR 502
            C+       +  +   NRL  +R EDLV+V +NLRL S+R
Sbjct: 624  CERNWGTFSLIKKIKQNRLASQRAEDLVYVHSNLRLLSRR 663


>ref|XP_004299161.1| PREDICTED: uncharacterized protein LOC101293587 [Fragaria vesca
            subsp. vesca]
          Length = 730

 Score =  221 bits (562), Expect = 2e-54
 Identities = 168/656 (25%), Positives = 295/656 (44%), Gaps = 72/656 (10%)
 Frame = -2

Query: 2253 WKHVSVFGGFDKGSGTK-RWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDR-SMR 2080
            WK+V++     KG G    ++C+ C +++NGS+ RV+ HLL   G GV+ C  I     R
Sbjct: 31   WKYVTITREAKKGQGGNCEFQCSFCKIKFNGSHYRVKHHLLQIIGKGVRKCEKIPPPKKR 90

Query: 2079 EAFQILEEERLARK------------KKRTSGSGKP---------------GKRIRTSQL 1981
            E   ++E   L++K             K  S SG                  K+ +    
Sbjct: 91   ELMALMESYELSKKMAGPRLVPLPSSSKDPSSSGSTFGFGQDLLDDIVVDTSKKRKEVGG 150

Query: 1980 SLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDKL 1804
            SL   + + ++E +D  +AR FY  GL+ ++A +P++    N   ++   GY  P+ + L
Sbjct: 151  SLEKSFNNGAREQLDGEIARMFYTGGLSFNLAKNPHYIRAFNRACAYPIAGYRPPNYNAL 210

Query: 1803 SGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKA 1624
              + L KE+  IE+ L  ++ +W Q G ++ C +    T+    IN+  +   G MF +A
Sbjct: 211  RTTLLEKERNHIERLLEPIKLTWKQKGVSV-CSDGWSDTQRRPLINVMAACESGPMFLRA 269

Query: 1623 VRVNDGDTLENVFTGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCT 1444
                     ++  +  + ++I+E+G T+V+Q+I +     ++  +++  +FP IFW+PC 
Sbjct: 270  ENCEGESKDKHFISDLLIESILEIGPTHVVQVITDNASNCKAAGAIINARFPHIFWTPCV 329

Query: 1443 SHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQ--D 1309
             H++   ++NI              E  WI  I      ++  ++        +F Q  +
Sbjct: 330  VHTLNLALKNICAPSSIPTKRAAYDECHWISEIADDVYFVKNFIMNHGMRLA-MFNQHSE 388

Query: 1308 LKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNI----SIEAS 1141
            LK  S     +FA +  ++ R  +IKQ+LQ              T  +D++    ++   
Sbjct: 389  LKMLS-VAETRFASAVVMLKRFKKIKQSLQRMMISDEWD-----TYKDDDVGKARAVSDY 442

Query: 1140 ILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALEAIRR-------KGI 982
            IL + +W      +    P   +L   + DK  +  V++W     E ++        K  
Sbjct: 443  ILSNEWWRKIDYIISFTLPIYTMLRRCDTDKPCLHKVYEWWDTMFEEVKVAIYINECKEY 502

Query: 981  DDDA--LNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNNQA----------KDKTAMRG 838
            ++++   N V  +L SRW    +PLH   + LNPRY+               +D    + 
Sbjct: 503  EEESPFYNVVYSILLSRWTKSSTPLHCMAHSLNPRYYSTEYLSGAPNRTPPHQDSEIAKE 562

Query: 837  WKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHL 658
             K  L++Y ++    R++ E+ +S+        + D++  R KMDP+ WW   G  TP+L
Sbjct: 563  RKECLKKYYANEDQMRLVNEEFASFSACLDEFANSDSMSDRGKMDPMKWWIVHGSTTPNL 622

Query: 657  QTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQR 502
            Q +A+K+L Q  S   C+              NR+  +R EDLVFV NNLRL S R
Sbjct: 623  QKIALKLLGQPCSSSCCERNWSTYTFIHSLRRNRITPQRAEDLVFVHNNLRLLSTR 678


>ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]
            gi|508777206|gb|EOY24462.1| HAT transposon superfamily
            [Theobroma cacao]
          Length = 674

 Score =  214 bits (545), Expect = 1e-52
 Identities = 157/609 (25%), Positives = 282/609 (46%), Gaps = 46/609 (7%)
 Frame = -2

Query: 2196 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQILEEERLARKKKRTSGS 2017
            +CN+C+  ++G   R++ HL       +  C  +   +R+  Q +     + KK++T   
Sbjct: 23   RCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRDHIQTILN---SPKKQKTPKK 79

Query: 2016 GKPGKRIRTSQ---------LSLNH---------------------------IWKSISKE 1945
             K  K +   Q         L LNH                             +   +E
Sbjct: 80   PKVDKAVANDQQNSSSASGGLHLNHGSSGQHGSTCPSLLFPRPSPSEQPAVDDGQKQKQE 139

Query: 1944 DVDDVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIE 1765
            D D  +A FF+   +    A S Y+ E+++ +A  G GY++PS + L  + L K K  I 
Sbjct: 140  DADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYENLRSTLLEKVKGDIH 199

Query: 1764 KSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVF 1585
                  R+ W +TGCTILC +  D  R  S +   V+ P+G +F K+V V+  +   +  
Sbjct: 200  DCYKKYRDEWKETGCTILCDSWSD-GRTKSFVIFSVTCPKGTLFLKSVDVSGHEDDASYL 258

Query: 1584 TGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAE 1405
               +   ++EVG  NV+Q+I +          L+M K+  +FWSPC S+ I +++E+I++
Sbjct: 259  FELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCASYCINKMLEDISK 318

Query: 1404 LDWIKPIVLCAKGIEQCMLT--FQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIK 1231
             +W+  ++  AK I Q + +  +  +    FT   ++   P   +F  +Y  +  I  I+
Sbjct: 319  QEWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGG-RELMRPRITRFVANYLTLRSII-IQ 376

Query: 1230 QALQXXXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNID 1051
            +               + +   D  +I++ +  + FW   H  + + EP V++L   + D
Sbjct: 377  EDNLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSEPLVKILRIVDGD 436

Query: 1050 KSVMGDVFDWRMRALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPRY 880
               MG +++   RA  AI+   KG+++  +  +  +++ RW+M   SPLHAA   LNP  
Sbjct: 437  MPAMGYIYEGIERAKVAIKAYYKGLEEKYM-PIWDIIDRRWNMQLHSPLHAAAAFLNPSI 495

Query: 879  FGNNQAKDKTAMR-GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMD 703
            F N   K    MR G++  + +  +    +  + ++   Y   +G+LG + A+  R    
Sbjct: 496  FYNPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGTDFAIMGRTLNA 555

Query: 702  PVAWWENFGFETPHLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVF 535
            P  WW ++G+E P LQ +A++IL Q  S   C+      +    +  N++++E+  DLVF
Sbjct: 556  PGDWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNKVELEKFNDLVF 615

Query: 534  VQNNLRLHS 508
            V  NL L +
Sbjct: 616  VHCNLCLQA 624


>ref|XP_003543854.2| PREDICTED: uncharacterized protein LOC100780312 [Glycine max]
          Length = 701

 Score =  209 bits (531), Expect = 6e-51
 Identities = 171/657 (26%), Positives = 293/657 (44%), Gaps = 60/657 (9%)
 Frame = -2

Query: 2295 PSHESMQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVG 2116
            PS    Q +  K  W +V+       G G    KCN C+  +NGSY+RVRAHLL  TG G
Sbjct: 6    PSQAKEQDDDTKPLWTYVTKIKSV-AGGGNYEIKCNICDFTFNGSYTRVRAHLLKMTGKG 64

Query: 2115 VKSCPAIDRSMREAFQILEEE---RLARKKKR--------------TSGSGKPGKRIRTS 1987
            V+ C  +  +     + ++ E   R+ R K +              T+  G   K+ +TS
Sbjct: 65   VRVCQKVTVAKLIDLKKIDNEATLRVERSKTKSVSLPPVSTQHQMDTNTLGIDPKKRKTS 124

Query: 1986 QLSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVD 1810
              S+ + +   ++E +D  +AR FY+ GL  H+A +P++ +     A+    GY+ P  +
Sbjct: 125  --SVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAANNQISGYQPPGYN 182

Query: 1809 KLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFF 1630
            KL  + L  E+  +E  L  ++ +W Q G +I+     D  R  S IN  V +  G MF 
Sbjct: 183  KLRTTLLQNERRHVENLLQPIKNAWSQKGVSIVSDGWSDPQRR-SLINFMVVTESGPMFL 241

Query: 1629 KAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSP 1450
            KA+  ++    ++     + + IMEVG +NV+QI+ +     ++   ++  +FP I+W+P
Sbjct: 242  KAIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVTDNAAVCKAAGLIIEAEFPSIYWTP 301

Query: 1449 CTSHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCML--TFQRSSPNVFT 1315
            C  H++   ++NI              E  WI  I   A  ++  ++  + + S  N F 
Sbjct: 302  CVVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMSHSMRLSIFNSFN 361

Query: 1314 QDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNIS----IE 1147
                 S  P   +FA +  ++ R  ++K+ LQ              +  ED+++    ++
Sbjct: 362  SLKLLSIAP--TRFASTIVMLKRFKQLKKGLQEMVISDQW-----SSYKEDDVAKAKFVK 414

Query: 1146 ASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALEAIR-------RK 988
             ++L D +W      L    P   +L   + + S +  V++     +E ++       RK
Sbjct: 415  DTLLDDKWWDKVDYILSFTSPIYDVLRRTDTEASSLHLVYEMWDSMIEKVKNAIYQYERK 474

Query: 987  GIDDDA--LNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNN----------QAKDKTAM 844
               + +     V  +L  RW    +PLH   + LNPRY+ +             +D    
Sbjct: 475  EESEGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELT 534

Query: 843  RGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETP 664
            R      +R+  D   RR +  + +++        D D+++ R +MDP AWW   G   P
Sbjct: 535  RERLKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHGINAP 594

Query: 663  HLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQ 505
             LQ +A+K+L Q  S   C+              N++   R EDLVFV +NLRL S+
Sbjct: 595  ILQKIALKLLAQPCSSSCCERNWSTYSFIHSLKRNKMTPHRAEDLVFVHSNLRLLSR 651


>ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662659 [Glycine max]
          Length = 847

 Score =  207 bits (528), Expect = 1e-50
 Identities = 170/657 (25%), Positives = 293/657 (44%), Gaps = 60/657 (9%)
 Frame = -2

Query: 2295 PSHESMQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVG 2116
            PS    Q +  K  W +V+       G GT   KCN C+  +NGSY+RVRAHLL  TG G
Sbjct: 152  PSQAKEQDDDTKPLWTYVTKIKSV-AGGGTYEIKCNICDFTFNGSYTRVRAHLLKMTGKG 210

Query: 2115 VKSCPAIDRSMREAFQILEEE---RLARKKKR--------------TSGSGKPGKRIRTS 1987
            V+ C  +  +     + ++ E   R+ R K +              T+  G   K+ +TS
Sbjct: 211  VRVCQKVTVAKLIDLKKIDNEATLRVERSKTKSVSLPPVSTQHQMDTNTLGVDPKKRKTS 270

Query: 1986 QLSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVD 1810
              S+ + +   ++E +D  +AR FY+ GL  H+A +P++ +     A+    GY+ P  +
Sbjct: 271  --SVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAANNQISGYQPPGYN 328

Query: 1809 KLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFF 1630
            KL  + L  E+  +E  L  ++ +W Q G +I+        R  S IN  V +  G MF 
Sbjct: 329  KLRITLLQNERRHVENLLQPIKNAWSQKGVSIVSDGWSGPQRR-SLINFMVVTESGPMFL 387

Query: 1629 KAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSP 1450
            KA+  ++    ++     + + IMEVG +NV+QI+ +     ++   ++  +FP I+W+P
Sbjct: 388  KAIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVTDNAAVCKAAGLIIEAEFPSIYWTP 447

Query: 1449 CTSHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCML--TFQRSSPNVFT 1315
            C  H++   ++NI              E  WI  I   A  ++  ++  + + S  N F 
Sbjct: 448  CVVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMSHSMRLSIFNSFN 507

Query: 1314 QDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNIS----IE 1147
                 S  P   +FA +  ++ R  ++K+ LQ              +  ED+++    ++
Sbjct: 508  SLKLLSIAP--TRFASTIVMLKRFKQLKKGLQEMVISDQW-----SSYKEDDVAKAKFVK 560

Query: 1146 ASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALEAIR------RKG 985
             ++L D +W      L    P   +L   + + S +  V++     +E ++       + 
Sbjct: 561  DTLLDDKWWDKVDYILSFTSPIYDVLRRTDTEASSLHLVYEMWDSMIEKVKNAIYQYERN 620

Query: 984  IDDDALNQVEV---LLESRWDMYFSPLHAAGYILNPRYFGNN----------QAKDKTAM 844
             + +     EV   +L  RW    +PLH   + LNPRY+ +             +D    
Sbjct: 621  EESEGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELT 680

Query: 843  RGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETP 664
            R      +R+  D   RR +  + +++        D D+++ R +MDP AWW   G   P
Sbjct: 681  RERLKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHGINAP 740

Query: 663  HLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQ 505
             LQ +A+K+L Q  S   C+              N++   R EDLVFV +NLRL S+
Sbjct: 741  ILQKIALKLLAQPCSSSCCERNWSTYSFIHSLKRNKMTPHRAEDLVFVHSNLRLLSR 797


>ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica]
            gi|462411014|gb|EMJ16063.1| hypothetical protein
            PRUPE_ppa018860mg [Prunus persica]
          Length = 805

 Score =  207 bits (528), Expect = 1e-50
 Identities = 174/670 (25%), Positives = 289/670 (43%), Gaps = 70/670 (10%)
 Frame = -2

Query: 2301 IEPSHESMQSESDKWG-WKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFT 2125
            + PS +    ++D    WK+V       K  G   ++CN+C   + GSY RV++HLL   
Sbjct: 111  LAPSRQLKHGDNDNTPLWKYVKKLEKDGKAGGNTSFQCNYCQKTFKGSYFRVKSHLLKLK 170

Query: 2124 GVGVKSCPAIDRS-MREAFQILEEERLARKKKR-------TSGSGKPGKRIRTSQLSLNH 1969
            G GV SC  +  S + E  +++EE  L  K  +       TS +   G    +S L ++ 
Sbjct: 171  GNGVASCTKVTNSHLMEMEKVVEEAELRVKMAQLRDVPLPTSNTSSQGGS--SSGLGMSS 228

Query: 1968 IWKSISK----------------EDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG 1837
             W S SK                E +D  +AR FY  GL+   + +P++        S  
Sbjct: 229  NWCSDSKKRKGNPIEKAFNNNLREQLDGEIARMFYTGGLSFQFSRNPHYVNAFRIACSKT 288

Query: 1836 -PGYESPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMF 1660
             PGY+ P  + L  + L KEK  IE+ ++   + W       L             IN+ 
Sbjct: 289  LPGYQPPGYNMLRTTLLQKEKNNIEEWVSVCSDGWSDAQRRPL-------------INVM 335

Query: 1659 VSSPRGLMFFKAVRVNDGDTLENVF-TGAISDAIMEVGSTNVLQIILNLGHGSESFESLM 1483
                 G MF KA+   +G+  +  F    + ++I E+G  NV+Q++ +     ++   ++
Sbjct: 336  AICESGPMFLKAINC-EGECKDKFFMANLLIESIREIGPQNVVQVVTDNAPVCKAAGHIV 394

Query: 1482 MPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQD-- 1309
              KF  IFW+PC  H++   ++NI       P+    +  EQC      SS   F ++  
Sbjct: 395  EAKFKHIFWTPCVVHTLNLALKNICS-----PVPRNPEVYEQCSWISTISSDAWFIKNFI 449

Query: 1308 ------LKQSSDPLSAK--------FAPSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTL 1171
                  L   +D    K        FA +  ++ R  ++KQ L+                
Sbjct: 450  MNHNMRLSMYNDHCKLKLLSVAETRFASTIVMLRRFKQVKQGLEQMVISEQWDIY----- 504

Query: 1170 SEDNI----SIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALE 1003
             ED++    +++  IL + FW      L    P   +L   + D   +  +++W    +E
Sbjct: 505  KEDDVVKARTVKEKILDECFWEDIDYILNFTSPIYEMLRLSDTDMPCLHLIYEWWDSMIE 564

Query: 1002 AIR-------RKGIDDDAL--NQVEVLLESRWDMYFSPLHAAGYILNPRYF-------GN 871
             ++       RK ++++++  N V  +L  RW    +PLH   + LNP+Y+        +
Sbjct: 565  KVKTIIYRKERKQLNEESMFFNVVHEILVDRWTKSSTPLHCFAHSLNPKYYCKEWLDMAH 624

Query: 870  NQA---KDKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDP 700
            N+    KD    R  K  +ER+ S+   RR + E+ +S+          D++  R  M P
Sbjct: 625  NRCPPHKDIEITRERKQCIERFFSNEVERRAVNEEYASFSACIEDFSGMDSMKDRGFMAP 684

Query: 699  VAWWENFGFETPHLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFV 532
            V WW   G  TP LQT+A+K+L   SS   C+      +       N++  ER EDLVFV
Sbjct: 685  VKWWVIHGASTPKLQTIALKLLGHPSSSSCCERNWSTYNFIHSIKRNKITPERAEDLVFV 744

Query: 531  QNNLRLHSQR 502
             +NLRL S++
Sbjct: 745  HSNLRLLSRK 754


>ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis]
            gi|223549490|gb|EEF50978.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 670

 Score =  207 bits (528), Expect = 1e-50
 Identities = 157/614 (25%), Positives = 286/614 (46%), Gaps = 46/614 (7%)
 Frame = -2

Query: 2196 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQIL----EEERLARKKK- 2032
            +CN+CN  ++G   R++ HL       +  C  +   +R   Q +    ++++  +K+K 
Sbjct: 23   RCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRNHIQSILSTPKKQKTPKKQKT 82

Query: 2031 -------------------RTSGSGKPG---------KRIRTSQLSLNHIWKSISKEDVD 1936
                                   SG+ G         + + TSQ  ++   ++  + + D
Sbjct: 83   DQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPTSQPVVDDA-QNEKQNNAD 141

Query: 1935 DVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSL 1756
              +A FF+   +    A S Y+ E+ + VA  G GY++PS +KL  S L K K  I    
Sbjct: 142  KRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFEKLRSSLLEKVKGDIHDWY 201

Query: 1755 ASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGA 1576
               R+ W +TGCTILC    D  R  S I   V+ P+G +F K+V ++  +   N     
Sbjct: 202  RKYRDDWKETGCTILCDGWSD-GRTKSVIVFSVTCPKGTLFLKSVDISGHENDANYLFEL 260

Query: 1575 ISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDW 1396
            +   ++EVG  NV+Q+I +          L+M K+  +FWSPC S+ + +++E+I++ +W
Sbjct: 261  LESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPCASYCVNKMLEDISKQEW 320

Query: 1395 IKPIVLCAKGIEQCMLT--FQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQAL 1222
            +  ++  A  I + + +  +  +    FT   ++   P   ++  S YL  R   I++  
Sbjct: 321  VGTVMEEANTITKYIYSHAWTLNMMRRFTGG-RELIRPRITRYV-SNYLSLRAIVIQEDN 378

Query: 1221 QXXXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSV 1042
                         + +   D   +++ +  D FW   H  + + EP +++L   + D   
Sbjct: 379  LKHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSISEPLIKILRIVDGDMPA 438

Query: 1041 MGDVFDWRMRALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPRYFGN 871
            MG +++   RA  +I+   KGI+D  +   E+ ++ RW++   SPLHAA   LNP  F N
Sbjct: 439  MGYIYEVLERAKVSIKAYYKGIEDKYMPIWEI-IDRRWNIQLHSPLHAAAAFLNPSIFYN 497

Query: 870  NQAKDKTAMR-GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVA 694
               K    MR G++  + +  +    +  + ++   Y   +G+LG + A+  R    P  
Sbjct: 498  QNFKIDLRMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGALGTDFAIMGRTLNSPGD 557

Query: 693  WWENFGFETPHLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQN 526
            WW  +G+E P LQ +A+++L Q  S   C+      +    +  N+ ++E++ DLVFV  
Sbjct: 558  WWAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKRNKAELEKLNDLVFVHC 617

Query: 525  NL---RLHSQRIGN 493
            NL    ++  R+GN
Sbjct: 618  NLWLQAIYQSRVGN 631


>ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis]
          Length = 674

 Score =  207 bits (527), Expect = 2e-50
 Identities = 160/611 (26%), Positives = 282/611 (46%), Gaps = 48/611 (7%)
 Frame = -2

Query: 2196 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQ-ILEEERLARKKKR--- 2029
            +CN+C   ++G   R++ HL       +  C  +   +R+  Q IL   +  +  KR   
Sbjct: 23   RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRDHIQRILSIPKKQKNPKRPKV 82

Query: 2028 -----------TSGSGKPGKRIRTS--------QLSLNHIWKSIS----------KEDVD 1936
                       +S SG   +  R+S         L   H   SI           ++D D
Sbjct: 83   EKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQPIVDDTQKQRQDDTD 142

Query: 1935 DVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSL 1756
              +A FF+   +    A S Y+ E++N +A  G GY +PS +KL  + L K K  I+   
Sbjct: 143  KKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVDIDDCC 202

Query: 1755 ASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGA 1576
               RE W +TGCTILC N  D  R  S +   V+ P+G +F K+V V+  +         
Sbjct: 203  KKYREEWKETGCTILCDNWSD-ERTKSLVVFSVACPKGTLFLKSVDVSGHEEDATFLFEL 261

Query: 1575 ISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDW 1396
            +   +++VG  NV+Q+I +          L+M K+  +FWSPC ++ I +++E+I++ +W
Sbjct: 262  LESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCIDKMLEDISKQEW 321

Query: 1395 IKPIVLCAKGIEQCMLTFQRSSPNVFTQDL-------KQSSDPLSAKFAPSYYLVHRIFE 1237
            +  ++  AK I +   +      + +T ++       ++   P   +F  +Y  +  I  
Sbjct: 322  VAMVLEEAKTITKYFYS------HAWTLNMMRKLTGGRELIRPRITRFVANYLSLRSIVI 375

Query: 1236 IKQALQXXXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFN 1057
             ++ L+            + +   D  +I++ +  D FW   H  + + EP V++L   +
Sbjct: 376  HEENLK-HMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLVKILRIVD 434

Query: 1056 IDKSVMGDVFDWRMRALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNP 886
             D   MG +++   RA  AI+   KG+++  +  +  +++ RW+M   SPLHAA   LNP
Sbjct: 435  GDMPAMGYMYEGIERAKLAIQAYYKGVEEKYV-PIWDIIDRRWNMQLHSPLHAAAAFLNP 493

Query: 885  RYFGNNQAKDKTAMR-GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDK 709
              F N   K    MR G++  + +  +    +  + ++   Y   +G+LG + AV  R  
Sbjct: 494  SIFYNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFAVLGRKL 553

Query: 708  MDPVAWWENFGFETPHLQTLAVKILCQVSSV----GICQVSDIPCQEAANRLKVERVEDL 541
              P  WW ++G+E P LQ  A++IL Q  S           +    +  N++++E+  DL
Sbjct: 554  NAPGDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEMEKFNDL 613

Query: 540  VFVQNNLRLHS 508
            +FV  NLRL +
Sbjct: 614  LFVHCNLRLQA 624


>ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307174 [Fragaria vesca
            subsp. vesca]
          Length = 719

 Score =  206 bits (523), Expect = 5e-50
 Identities = 158/661 (23%), Positives = 287/661 (43%), Gaps = 62/661 (9%)
 Frame = -2

Query: 2298 EPSHESMQSES-DKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTG 2122
            EPS ES +S+  D   WK+V++  G DK  G   + CN C  +  GS+SRV++HLL   G
Sbjct: 9    EPSVESTKSQRLDAPLWKYVTITSGSDKSGGNVAFTCNFCGGKLTGSHSRVKSHLLRIKG 68

Query: 2121 VGVKSCPAIDRSMREAFQIL-----------EEERLARKKKRTSGSG----------KPG 2005
             GVK  P I R      Q L            + ++A      +GSG             
Sbjct: 69   TGVKIYPTITRDQTVELQALLDHCDQQLNAKAQHKVALPPSSMTGSGISYFPLREREDEV 128

Query: 2004 KRIRTSQLSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYE 1825
            K+ R     L+  ++   + + D  VAR FY+ GL  +VA +P + E   ++AS  PGY 
Sbjct: 129  KKRRGLSPQLSKAFRQEDRRECDASVARLFYSSGLAFNVARNPNYRESY-SLASKIPGYV 187

Query: 1824 SPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPR 1645
             P  + L  + L  EK  IE++L  ++++W +TG + LC +     +    INM  ++  
Sbjct: 188  PPGYNALRTTLLDNEKRHIERTLLPIKKTWKETGVS-LCSDGWTDGQKRPLINMMAAAKD 246

Query: 1644 GLMFFKAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPR 1465
            G M  KA+        +      + ++I E+G  NV+Q++ +    S +  +++    P 
Sbjct: 247  GAMMLKAINCEGVTKSKEEIGRLLLESINEIGPENVVQVVTDNAPVSAAAGAIVEITHPH 306

Query: 1464 IFWSPCTSHSIRQLMEN-------------IAELDWIKPIVLCAKGIEQCMLTFQRSSPN 1324
            IFW+PC  H++   +++             + EL W+  +      I+  ++        
Sbjct: 307  IFWTPCVVHTLNLALKDLLKAKSYLPGETVVEELGWLMEVYNDVWFIKNFVVNHNMRLAM 366

Query: 1323 VFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNISIEA 1144
                   +       +FA  + ++ R  ++K  LQ           K    S+  + ++ 
Sbjct: 367  YHEHCALRLLQVAPTRFASHFIVLKRFRDVKSGLQQMVISQRWDLYKEDDASKARV-VKE 425

Query: 1143 SILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALEAIRRKGIDDDAL- 967
             +L + FW      + L  P   ++   ++D+  +  V++W    +E +++   + + + 
Sbjct: 426  MLLKEKFWEQIDFLIALMGPIYEMIRMSDMDRPCLHLVYEWWNSMIEKVKKAVFNPEFVH 485

Query: 966  ------------NQVEVLLESRWDMYFSPLHAAGYILNPRYFGNN----------QAKDK 853
                        + V  +L +RW    +PLH   + LNP+Y+ +             +D 
Sbjct: 486  VITEHCDVTRFYDVVYPILTARWTKSCTPLHCLAHSLNPKYYSSQWLEEDPNRVPPHRDA 545

Query: 852  TAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGF 673
                  +   ++   DS  R  + E+ + +    G     DA++ +   +P+ WW ++G 
Sbjct: 546  ELNNERRRCFQKLFPDSQTRNKVMEEFARFSLNMGDFSSSDALENKFCFEPLTWWVSYGP 605

Query: 672  ETPHLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQ 505
             TP LQ+LA+K+L Q  S   C+              N+L+  R +DLV+V  NLRL ++
Sbjct: 606  STPLLQSLALKLLNQPCSSSCCERNWSTYAFIQGLKRNKLQPRRAQDLVYVHTNLRLLAR 665

Query: 504  R 502
            +
Sbjct: 666  K 666


>ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis
            sativus]
          Length = 673

 Score =  204 bits (518), Expect = 2e-49
 Identities = 153/603 (25%), Positives = 275/603 (45%), Gaps = 42/603 (6%)
 Frame = -2

Query: 2196 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQIL----EEERLARKKKR 2029
            +CN+C   ++G   R++ HL       +  C  +   +R+  Q +    ++++  +K K 
Sbjct: 23   RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKV 82

Query: 2028 TSGSGKPGKRIRTSQLSLNHIWKS---------------------------ISKEDVDDV 1930
               +   G++  +S     H   S                             K++ D  
Sbjct: 83   DMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKK 142

Query: 1929 VARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLAS 1750
            VA FF+   +    A S Y+ E+++ +A +G GY++PS +KL  + L K K  I  S   
Sbjct: 143  VAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKK 202

Query: 1749 VRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAIS 1570
             R+ W +TGCTILC +  D  +  S + + V+  +G +F K+V ++  +      +  + 
Sbjct: 203  HRDEWKETGCTILCDSWSD-GQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLE 261

Query: 1569 DAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIK 1390
              I+EVG  NV+QII +          L+M K+  +FWSPC S+ + Q++E+I++++W+ 
Sbjct: 262  TIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVS 321

Query: 1389 PIVLCAKGIEQCMLTFQR--SSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQX 1216
             ++  AK I + + +     ++   FT   K+   P   +F  ++  +  I  ++  L+ 
Sbjct: 322  AVLEEAKIITRYIYSHASILNTMRKFTGG-KELIRPRITRFVTNFLSLRSIVILEDNLK- 379

Query: 1215 XXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMG 1036
                       + +   D  +I + +  D FW   H  + +CEP +R+L   + D   MG
Sbjct: 380  HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439

Query: 1035 DVFDWRMRALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPRYFGNNQ 865
             +F+   RA   I+    G +D  +   E  ++ RW++   + LH A   LNP  F N  
Sbjct: 440  YIFEGIERAKVEIKTYYNGFEDKYMPIWET-IDRRWNLQLHTTLHTAAAFLNPSXFYNPN 498

Query: 864  AK-DKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWW 688
             K D     G++  + +  +    +  +  +  +Y   +G+LG + A+  R    P  WW
Sbjct: 499  FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558

Query: 687  ENFGFETPHLQTLAVKILCQVSSVGIC-----QVSDIPCQEAANRLKVERVEDLVFVQNN 523
              +G+E P LQ  AV+IL Q  S   C        +    +  +R + E++ DLVFVQ N
Sbjct: 559  SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618

Query: 522  LRL 514
            L L
Sbjct: 619  LWL 621


>ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus]
          Length = 673

 Score =  204 bits (518), Expect = 2e-49
 Identities = 153/603 (25%), Positives = 275/603 (45%), Gaps = 42/603 (6%)
 Frame = -2

Query: 2196 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQIL----EEERLARKKKR 2029
            +CN+C   ++G   R++ HL       +  C  +   +R+  Q +    ++++  +K K 
Sbjct: 23   RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKV 82

Query: 2028 TSGSGKPGKRIRTSQLSLNHIWKS---------------------------ISKEDVDDV 1930
               +   G++  +S     H   S                             K++ D  
Sbjct: 83   DMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPIDDAQKQKKDETDKK 142

Query: 1929 VARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLAS 1750
            VA FF+   +    A S Y+ E+++ +A +G GY++PS +KL  + L K K  I  S   
Sbjct: 143  VAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKK 202

Query: 1749 VRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAIS 1570
             R+ W +TGCTILC +  D  +  S + + V+  +G +F K+V ++  +      +  + 
Sbjct: 203  HRDEWKETGCTILCDSWSD-GQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLE 261

Query: 1569 DAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIK 1390
              I+EVG  NV+QII +          L+M K+  +FWSPC S+ + Q++E+I++++W+ 
Sbjct: 262  TIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVS 321

Query: 1389 PIVLCAKGIEQCMLTFQR--SSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQX 1216
             ++  AK I + + +     ++   FT   K+   P   +F  ++  +  I  ++  L+ 
Sbjct: 322  AVLEEAKIITRYIYSHASILNTMRKFTGG-KELIRPRITRFVTNFLSLRSIVILEDNLK- 379

Query: 1215 XXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMG 1036
                       + +   D  +I + +  D FW   H  + +CEP +R+L   + D   MG
Sbjct: 380  HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439

Query: 1035 DVFDWRMRALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPRYFGNNQ 865
             +F+   RA   I+    G +D  +   E  ++ RW++   + LH A   LNP  F N  
Sbjct: 440  YIFEGIERAKVEIKTYYNGFEDKYMPIWET-IDRRWNLQLHTTLHTAAAFLNPSVFYNPN 498

Query: 864  AK-DKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWW 688
             K D     G++  + +  +    +  +  +  +Y   +G+LG + A+  R    P  WW
Sbjct: 499  FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558

Query: 687  ENFGFETPHLQTLAVKILCQVSSVGIC-----QVSDIPCQEAANRLKVERVEDLVFVQNN 523
              +G+E P LQ  AV+IL Q  S   C        +    +  +R + E++ DLVFVQ N
Sbjct: 559  SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618

Query: 522  LRL 514
            L L
Sbjct: 619  LWL 621


>ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris]
            gi|561034735|gb|ESW33265.1| hypothetical protein
            PHAVU_001G056200g, partial [Phaseolus vulgaris]
          Length = 702

 Score =  203 bits (516), Expect = 3e-49
 Identities = 161/618 (26%), Positives = 273/618 (44%), Gaps = 32/618 (5%)
 Frame = -2

Query: 2256 GWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMRE 2077
            GWKH     G D     K+ KC++C+   +G   R + HL G T    + C ++   +R+
Sbjct: 24   GWKH-----GIDINGNGKKVKCSYCSKTMSGGIFRFKHHLAG-TREDSEPCCSVPEEIRD 77

Query: 2076 AF-QILEEERLARKKKR----------------------TSGSGKPGKRIRTSQLSLNHI 1966
               +I+ E + A  KKR                      + G  K G R    Q ++N +
Sbjct: 78   LMIKIVAEAKQASLKKRKLNIIDEDQGCEGLEERQHIFGSKGKEKVGSR-GAVQATINQM 136

Query: 1965 WKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLI 1786
             K   KE+VD  VA FFY   +  +V  +P F ++   +  +G GY+ PS   +    L 
Sbjct: 137  MKKGYKEEVDAQVAEFFYTSAIPFNVIKNPAFTKMCEMIGKYGAGYKPPSYHDIREKLLK 196

Query: 1785 KEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVND- 1609
            +   + +  L   +E W +TGCTI+     D  R   C N  V+SP+G +F  ++  +D 
Sbjct: 197  QAIDKTDLVLQEYKEEWKKTGCTIMSDGWTDKKRRSIC-NFLVNSPKGTVFMYSLDTSDI 255

Query: 1608 GDTLENVFTGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPCTSHSIR 1429
              T + VF   + D +  VG  NV+Q++ +     ++   L+M K   ++W+PC +H I 
Sbjct: 256  SKTADKVFK-MLDDVVELVGEENVVQVVTDNAANFKAAGELLMQKREHLYWTPCAAHCID 314

Query: 1428 QLMENIAELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQDLKQSSD---PLSAKFAPSYY 1258
               E+  +   +  + +  KG +     + RS      +   +  D   P   +FA +Y 
Sbjct: 315  LSFEDFEKKLKVHELTI-KKGRKITTYIYGRSMLISMLKKFTKERDLIRPGVTRFATAYL 373

Query: 1257 LVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFV 1078
             +  + E+K +L            K  T S++   +E  IL + FW      L++  P +
Sbjct: 374  TLGCLHELKASLLTMFSSEEWKTSKFGT-SQEGKKVENMILDNRFWKNISTCLKVAAPLM 432

Query: 1077 RLLATFNID-KSVMGDVFDWRMRALEAIRRK-GIDDDALNQVEVLLESRWD-MYFSPLHA 907
             +L   + D K  MG +++   RA E I+        +  +V  ++++RWD     PLHA
Sbjct: 433  VVLRLVDSDAKPAMGFIYEEMDRAKEKIKNNFNHIKKSYEEVWKIIDARWDNQLHRPLHA 492

Query: 906  AGYILNPR--YFGNNQAKDKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDE 733
            A Y LNP+  Y    ++ D     G  +++ R   D+  RR++  QL  Y    G+   +
Sbjct: 493  AAYYLNPQFHYEPEFRSDDPEVKEGLYTSMRRLVKDAAERRIINVQLVEYHFGRGAFAMD 552

Query: 732  DAVDCRDKMDPVAWWENFGFETPHLQTLAVKILCQVSSVGICQVSDIPCQEAANRLKVER 553
            DA + R  + P  WWE FG+ TP L                         +  N L  ++
Sbjct: 553  DAKESRKTILPGEWWEMFGYRTPEL-------------------------KRRNHLHQKK 587

Query: 552  VEDLVFVQNNLRLHSQRI 499
            + DL++V  NL+L +++I
Sbjct: 588  MNDLLYVMYNLKLSNKQI 605


>ref|XP_002273287.1| PREDICTED: uncharacterized protein LOC100260844 [Vitis vinifera]
          Length = 758

 Score =  198 bits (503), Expect = 1e-47
 Identities = 163/658 (24%), Positives = 289/658 (43%), Gaps = 73/658 (11%)
 Frame = -2

Query: 2256 GWKHVSVFGGFDKGSGTKRWKCNHCN-LRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMR 2080
            GW H  +  G     G ++ KC +C+ +   G  SR++ HL G  G  V  C  +   ++
Sbjct: 31   GWAHGIMVNG-----GRQKIKCKYCHKVILGGGISRLKQHLAGERG-NVAPCEEVPEDVK 84

Query: 2079 E------AFQILEE-------------------------ERLARKKKRTSGSG----KPG 2005
                    F++LE+                         + + R  K  S  G    + G
Sbjct: 85   VQIQQHLGFKVLEKLKRQKGLKSSKNSLVPYYQDREGGADDVQRSPKAASARGISRKRRG 144

Query: 2004 KRIR--------------------TSQLSLNHIWKSI-SKEDVDDVVARFFYAEGLNIHV 1888
            K I                      +Q+S+++ + S  S +  D  VARF Y  G+    
Sbjct: 145  KEIDEGTSYKKKRHKKQLFPTATPVAQVSIHNSFASQESMDQADMAVARFMYEAGVPFSA 204

Query: 1887 ANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILC 1708
            ANS YF ++ + +A+ GPGY+ PS   L G  L +    +E     +R SW  TGC+++ 
Sbjct: 205  ANSYYFQQMADAIAAVGPGYKMPSCHSLRGKLLNRSVQDVEGLCEELRRSWEVTGCSVMV 264

Query: 1707 VNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQI 1528
              C D T G + +N +V  P+G +F ++V  +D               + EVG  N++  
Sbjct: 265  DRCTDRT-GHTVLNFYVYCPKGTVFLRSVYASDIANSTEALLSLFVSVVEEVGPKNIVNF 323

Query: 1527 ILNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIVLCAKGIEQCM- 1351
            + +     ++   L+M ++   FWS C +H I  ++E + + D +K ++  AK I Q + 
Sbjct: 324  VTDTTPTYKAAGKLLMGRYKTFFWSACGAHCIDLMLEEVGKRDEVKELLAKAKRITQFIY 383

Query: 1350 -------LTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXX 1192
                   LT +R+      +D+ Q +     +FA ++  +  I   K+AL          
Sbjct: 384  NNTWVLNLTRKRTG----GRDIVQLA---ITRFASNFLTLQSIVSFKEALH-QMFTSATW 435

Query: 1191 XXKLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNI-DKSVMGDVFDWRM 1015
                 +     + +   I+   FWS     L++ +P + +L   +  ++  +G ++D   
Sbjct: 436  MQSAFSKQRAGVEVAEIIVDPTFWSMCDRALKVSKPLLAVLHLIDCEERPSVGYIYDAME 495

Query: 1014 RALEAIRRKGIDDDA-LNQVEVLLESRW-DMYFSPLHAAGYILNPRYFGN-NQAKDKTAM 844
            +A ++I     D ++  +    +++  W + + SPLHAA Y LNP  F N + + +K   
Sbjct: 496  KAKKSIILAFDDKESDYSPYLKIIDCIWKEEFHSPLHAAAYYLNPSIFYNPSFSTNKVIQ 555

Query: 843  RGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETP 664
            +G    +E  E +   + ++   ++ Y    G      A+  R+ + P  WW  +  + P
Sbjct: 556  KGLLDCIESLEPNLSTQVMITSHINYYEEAVGDFSRPVALRGRESLAPATWWSLYAADYP 615

Query: 663  HLQTLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQR 502
             LQ LAV+IL Q  SV  C+    +S+    +  NRL+ +R+ DL+FV  NLRL  +R
Sbjct: 616  DLQRLAVRILSQTCSVTRCETSWSMSERVHSKQRNRLEHQRLSDLIFVHYNLRLQEKR 673


>ref|XP_006603987.1| PREDICTED: uncharacterized protein LOC102660926 [Glycine max]
          Length = 698

 Score =  197 bits (501), Expect = 2e-47
 Identities = 164/654 (25%), Positives = 290/654 (44%), Gaps = 58/654 (8%)
 Frame = -2

Query: 2292 SHESMQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGV 2113
            S    Q +  K  W +++       G G    KCN C+  +NGSY+RVRAHLL  TG GV
Sbjct: 7    SQAKEQDDDTKPIWTYITKIKSV-AGGGNYEIKCNICDFTFNGSYTRVRAHLLKMTGKGV 65

Query: 2112 KSCPAIDRSMREAFQILEEE---RLARKKKR--------------TSGSGKPGKRIRTSQ 1984
            + C  +  +   A + ++ +   R+ R K +              T+  G   K+ +TS 
Sbjct: 66   RVCQKVTVAKLIALKKIDNKATLRVVRSKTKSVSLPPVSTQHQMDTNTLGVDPKKRKTS- 124

Query: 1983 LSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDK 1807
             S+ + +   ++E +D  +AR FY+ GL  H+A +P++ +     A+    GY+    +K
Sbjct: 125  -SVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKTFAYAANNQISGYQPSGYNK 183

Query: 1806 LSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFK 1627
            L  + L  E+  +E  L  ++ +W Q G +I+     D  R  S IN  V +  G MF K
Sbjct: 184  LRTTLLQNERRHVENLLQPIKNAWNQKGVSIVSDGWSDPQRR-SLINFMVVTESGPMFLK 242

Query: 1626 AVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIILNLGHGSESFESLMMPKFPRIFWSPC 1447
            A+  ++    ++     + + IMEVG +NV+QI+++     ++   ++  +FP I+W+PC
Sbjct: 243  AIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVIDNAAVCKAAGLIIEAEFPSIYWTPC 302

Query: 1446 TSHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQDL 1306
              H++   ++NI              E  WI  I   A  ++  +++      ++F    
Sbjct: 303  VVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKIFIMSHSMRL-SIFNSLK 361

Query: 1305 KQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXKLMTLSEDNIS----IEASI 1138
              S  P   +FA +  ++ R  ++K+ LQ              +  ED+++    ++ ++
Sbjct: 362  LLSIAP--TRFASTIVMLKRFKQLKKGLQEMVISDQW-----SSYKEDDVAKAKFVKDTL 414

Query: 1137 LGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMRALEAIR-------RKGID 979
            L D +W      L    P   +L   +   S +  V++     +E ++       RK   
Sbjct: 415  LDDKWWDKVDYILSFTSPIYDVLRRTDTKVSSLHLVYEMWDSMIEKVKNAIYQYERKEES 474

Query: 978  DDA--LNQVEVLLESRWDMYFSPLHAAGYILNPRYFGNN----------QAKDKTAMRGW 835
            + +     V  +L  RW    +PLH   + LNPRY+ +             +D    R  
Sbjct: 475  EGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELTRER 534

Query: 834  KSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHLQ 655
                +R+  D   RR +  + +++        D D+++ R +MDP AWW       P LQ
Sbjct: 535  LKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHDINAPILQ 594

Query: 654  TLAVKILCQVSSVGICQ----VSDIPCQEAANRLKVERVEDLVFVQNNLRLHSQ 505
             +A+K+L Q  S   C+              N++   R E+LVFV +NLRL S+
Sbjct: 595  KIALKLLAQPCSSSCCERNWSTYSFIHSLKRNKMTPHRAENLVFVHSNLRLLSR 648

