BLASTX nr result

ID: Mentha29_contig00017021 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00017021
         (1366 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU33290.1| hypothetical protein MIMGU_mgv1a022895mg [Mimulus...   467   e-129
ref|XP_006349291.1| PREDICTED: uncharacterized protein LOC102579...   439   e-120
ref|XP_006349292.1| PREDICTED: uncharacterized protein LOC102579...   438   e-120
ref|XP_004230425.1| PREDICTED: uncharacterized protein LOC101259...   438   e-120
ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255...   438   e-120
gb|ABE91850.1| Protein of unknown function DUF707 [Medicago trun...   437   e-120
ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prun...   430   e-118
ref|XP_006598790.1| PREDICTED: uncharacterized protein LOC100797...   429   e-117
ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797...   429   e-117
ref|XP_003592883.1| hypothetical protein MTR_2g005330 [Medicago ...   429   e-117
ref|XP_003592882.1| hypothetical protein MTR_2g005330 [Medicago ...   429   e-117
ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma...   428   e-117
ref|XP_006592649.1| PREDICTED: uncharacterized protein LOC100526...   426   e-116
ref|XP_007148587.1| hypothetical protein PHAVU_006G221000g [Phas...   421   e-115
ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma...   420   e-115
ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [...   420   e-115
ref|XP_006598789.1| PREDICTED: uncharacterized protein LOC100797...   419   e-114
ref|XP_006598788.1| PREDICTED: uncharacterized protein LOC100797...   419   e-114
ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma...   419   e-114
ref|XP_002528866.1| conserved hypothetical protein [Ricinus comm...   419   e-114

>gb|EYU33290.1| hypothetical protein MIMGU_mgv1a022895mg [Mimulus guttatus]
          Length = 336

 Score =  467 bits (1201), Expect = e-129
 Identities = 214/290 (73%), Positives = 247/290 (85%)
 Frame = +1

Query: 385  LKFSPGFGLKTADQDTISHKCKENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSH 564
            ++ SPGF + T  Q T ++KC++ C+P+G EALP GIIS  +N+E   LWGPVSE+NK  
Sbjct: 8    MQLSPGFRMNTTQQHTDTYKCEQKCRPVGSEALPNGIISIHANMEMRPLWGPVSEDNKPK 67

Query: 565  NTTGLLAMAVGIDQKGLVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVN 744
            + TGLLA+AVGI+QK LVN+IVKKFLEN FVVMLFHYDG V+KW+D +WS+RV+HISV N
Sbjct: 68   HGTGLLAVAVGINQKELVNKIVKKFLENDFVVMLFHYDGFVDKWHDFDWSNRVLHISVKN 127

Query: 745  QTKWWFAKRFMHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKS 924
            QTKWWFAKRF+HPDIVA+YEYIFLWDEDL V++FHP RY+SIVKEEGLEISQPALDPGKS
Sbjct: 128  QTKWWFAKRFLHPDIVAEYEYIFLWDEDLGVEDFHPKRYISIVKEEGLEISQPALDPGKS 187

Query: 925  EVHHPITIXXXXXXXXXXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLI 1104
            EVHHPIT           YYKFKG GRCD  ST+PPCVGWVEMMAPVFSRAAW CVW++I
Sbjct: 188  EVHHPITARRHKSRVHRRYYKFKGSGRCDEKSTSPPCVGWVEMMAPVFSRAAWRCVWYMI 247

Query: 1105 QNDLIHAWGLDRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGGFLDKNK 1254
            QNDLIHAWGLD +LGYCAQGDRTVK+GVVDEEYIVHLG+PTLG F D+NK
Sbjct: 248  QNDLIHAWGLDMQLGYCAQGDRTVKIGVVDEEYIVHLGLPTLGVFSDRNK 297


>ref|XP_006349291.1| PREDICTED: uncharacterized protein LOC102579538 isoform X1 [Solanum
            tuberosum]
          Length = 427

 Score =  439 bits (1129), Expect = e-120
 Identities = 224/396 (56%), Positives = 274/396 (69%), Gaps = 27/396 (6%)
 Frame = +1

Query: 259  MKFYYILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQD-- 429
            MK + I+ + D KSR  + SL +++ LIC VYF GS  + KDF  FS GF + +  Q+  
Sbjct: 1    MKSWNIVSVPDAKSRSCICSLFLTLALICAVYFTGSALMAKDFRAFS-GFTMNSTKQNGQ 59

Query: 430  ---------------------TISHKCKENCKPIGVEALPKGIISATSNLETHSLWGPVS 546
                                   ++KC++ C+P+G EALP+GIIS TSNLE   LWG V 
Sbjct: 60   CGKCKVPPPREEKQESHVTENVQNNKCQKKCRPLGSEALPEGIISKTSNLEMRPLWGDVE 119

Query: 547  ENNKSHNTTGLLAMAVGIDQKGLVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVI 726
            +  KS ++  LL +AVGI QK +VN+IVKKFLE+ FVVMLFHYDGVV++WNDLEWS+R I
Sbjct: 120  K--KSPHSVNLLGIAVGIKQKEMVNKIVKKFLEHDFVVMLFHYDGVVDEWNDLEWSNRAI 177

Query: 727  HISVVNQTKWWFAKRFMHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPA 906
            H+S +NQTKWWFAKRF+HPDIV++Y+YIFLWDEDL V+NFHP +Y+SIV+EEGLEISQP 
Sbjct: 178  HVSAMNQTKWWFAKRFLHPDIVSEYDYIFLWDEDLGVENFHPEKYISIVREEGLEISQPG 237

Query: 907  LDPGKSEVHHPITIXXXXXXXXXXYYKFKGGGR-CDGNSTAPPCVGWVEMMAPVFSRAAW 1083
            LD  KSEVHH IT+          +Y+   GGR CD NST PPCVGWVEMMAPVFS+AAW
Sbjct: 238  LDASKSEVHHHITVRRGRSKVHRRFYRLNRGGRTCDNNSTEPPCVGWVEMMAPVFSKAAW 297

Query: 1084 CCVWHLIQNDLIHAWGLDRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGGFLDKNKTNI 1263
             C W+++QNDLIHAWGLD KLGYCAQGDRT KVGVVD EYI HL VP+LGG  D      
Sbjct: 298  RCAWYMVQNDLIHAWGLDMKLGYCAQGDRTKKVGVVDAEYITHLAVPSLGGNSDVETVIK 357

Query: 1264 ELD--TAHXXXXXXXXXLVADMYKLDNRSAVRIRSF 1365
            ELD  +           L A + K DNRS VR +S+
Sbjct: 358  ELDNNSLQGKNLSDSDTLAAPVEKFDNRSLVRRQSY 393


>ref|XP_006349292.1| PREDICTED: uncharacterized protein LOC102579538 isoform X2 [Solanum
            tuberosum]
          Length = 426

 Score =  438 bits (1127), Expect = e-120
 Identities = 224/396 (56%), Positives = 273/396 (68%), Gaps = 27/396 (6%)
 Frame = +1

Query: 259  MKFYYILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQD-- 429
            MK + I+ + D KSR  + SL +++ LIC VYF GS  + KDF  FS GF + +  Q+  
Sbjct: 1    MKSWNIVSVPDAKSRSCICSLFLTLALICAVYFTGSALMAKDFRAFS-GFTMNSTKQNGQ 59

Query: 430  ---------------------TISHKCKENCKPIGVEALPKGIISATSNLETHSLWGPVS 546
                                   ++KC++ C+P+G EALP+GIIS TSNLE   LWG V 
Sbjct: 60   CGKCKVPPPREEKQESHVTENVQNNKCQKKCRPLGSEALPEGIISKTSNLEMRPLWGDVE 119

Query: 547  ENNKSHNTTGLLAMAVGIDQKGLVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVI 726
               KS ++  LL +AVGI QK +VN+IVKKFLE+ FVVMLFHYDGVV++WNDLEWS+R I
Sbjct: 120  ---KSPHSVNLLGIAVGIKQKEMVNKIVKKFLEHDFVVMLFHYDGVVDEWNDLEWSNRAI 176

Query: 727  HISVVNQTKWWFAKRFMHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPA 906
            H+S +NQTKWWFAKRF+HPDIV++Y+YIFLWDEDL V+NFHP +Y+SIV+EEGLEISQP 
Sbjct: 177  HVSAMNQTKWWFAKRFLHPDIVSEYDYIFLWDEDLGVENFHPEKYISIVREEGLEISQPG 236

Query: 907  LDPGKSEVHHPITIXXXXXXXXXXYYKFKGGGR-CDGNSTAPPCVGWVEMMAPVFSRAAW 1083
            LD  KSEVHH IT+          +Y+   GGR CD NST PPCVGWVEMMAPVFS+AAW
Sbjct: 237  LDASKSEVHHHITVRRGRSKVHRRFYRLNRGGRTCDNNSTEPPCVGWVEMMAPVFSKAAW 296

Query: 1084 CCVWHLIQNDLIHAWGLDRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGGFLDKNKTNI 1263
             C W+++QNDLIHAWGLD KLGYCAQGDRT KVGVVD EYI HL VP+LGG  D      
Sbjct: 297  RCAWYMVQNDLIHAWGLDMKLGYCAQGDRTKKVGVVDAEYITHLAVPSLGGNSDVETVIK 356

Query: 1264 ELD--TAHXXXXXXXXXLVADMYKLDNRSAVRIRSF 1365
            ELD  +           L A + K DNRS VR +S+
Sbjct: 357  ELDNNSLQGKNLSDSDTLAAPVEKFDNRSLVRRQSY 392


>ref|XP_004230425.1| PREDICTED: uncharacterized protein LOC101259678 [Solanum
            lycopersicum]
          Length = 428

 Score =  438 bits (1127), Expect = e-120
 Identities = 221/395 (55%), Positives = 274/395 (69%), Gaps = 26/395 (6%)
 Frame = +1

Query: 259  MKFYYILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQD-- 429
            MK +  + + DPKSR ++ SL +++ LIC VYF GS  + KDF  FS GF + +  Q+  
Sbjct: 1    MKSWNTVSVPDPKSRSFICSLFLTLALICAVYFTGSALMAKDFRAFS-GFTINSTKQNGQ 59

Query: 430  --------------------TISHKCKENCKPIGVEALPKGIISATSNLETHSLWGPVSE 549
                                  ++KC++ C+P+G EALP+GI+S TSNLE   LWG V +
Sbjct: 60   CGKCEVPPREEKQESHVTENVQNNKCQKKCRPLGSEALPEGIVSKTSNLEMRPLWGDVEK 119

Query: 550  NNKSHNTTGLLAMAVGIDQKGLVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIH 729
              KS ++  LL +AVGI QK LVN+IVK+FLE+ FVVMLFHYDGVV++WNDLEWS+R IH
Sbjct: 120  --KSPHSVNLLGIAVGIKQKELVNKIVKRFLEHDFVVMLFHYDGVVDEWNDLEWSNRAIH 177

Query: 730  ISVVNQTKWWFAKRFMHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPAL 909
            +S +NQTKWWFAKRF+HPDIV++Y+YIFLWDEDL V+NFHP +Y+SIV+EEGLEISQP L
Sbjct: 178  VSAMNQTKWWFAKRFLHPDIVSEYDYIFLWDEDLGVENFHPEKYISIVREEGLEISQPGL 237

Query: 910  DPGKSEVHHPITIXXXXXXXXXXYYKFKGGGR-CDGNSTAPPCVGWVEMMAPVFSRAAWC 1086
            D  KSEVHH IT+          +Y+   GGR CD NST PPCVGWVEMMAPVFS+AAW 
Sbjct: 238  DASKSEVHHHITVRRGRSKVHRRFYRLNRGGRTCDNNSTEPPCVGWVEMMAPVFSKAAWR 297

Query: 1087 CVWHLIQNDLIHAWGLDRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGGFLDKNKTNIE 1266
            C W+++QNDLIHAWGLD KLGYCAQGDRT KVGVVD EYI HL +P+LG   D      E
Sbjct: 298  CAWYMVQNDLIHAWGLDMKLGYCAQGDRTKKVGVVDAEYITHLAIPSLGANSDVETVIKE 357

Query: 1267 LD--TAHXXXXXXXXXLVADMYKLDNRSAVRIRSF 1365
            LD  +           L A + K DNRS VR +S+
Sbjct: 358  LDNNSPQGKNLSDSDTLAAPVEKFDNRSLVRRQSY 392


>ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera]
            gi|297739491|emb|CBI29673.3| unnamed protein product
            [Vitis vinifera]
          Length = 413

 Score =  438 bits (1126), Expect = e-120
 Identities = 221/372 (59%), Positives = 262/372 (70%), Gaps = 12/372 (3%)
 Frame = +1

Query: 286  SDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTAD-------QDTISH 441
            SDPKSR +L SL +   L CGVYFI S +  KD+   S  + +           Q+T S 
Sbjct: 11   SDPKSRSYLCSLFIGACLFCGVYFIASEFTVKDYKDRSSRWQISVFQNAHSNSIQNTQSS 70

Query: 442  KCKENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVN 621
            KCK  C+P G EALP+GI+  TSNLE   LWG      KS  +  LLAMAVGI QK +VN
Sbjct: 71   KCKNQCRPSGSEALPEGIVVKTSNLEVQPLWGATLNGEKSSPSKSLLAMAVGIKQKEIVN 130

Query: 622  EIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADY 801
            +IV+KF+ + FVVMLFHYDGVV++W +  WSD  IH++VVNQTKWWFAKRF+HPDIVA+Y
Sbjct: 131  QIVEKFILSNFVVMLFHYDGVVDEWREFAWSDHAIHVTVVNQTKWWFAKRFLHPDIVAEY 190

Query: 802  EYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXY 981
             YIFLWDEDL V+NFHPGRY+SIV++EGLEISQPALDP KS VHH IT            
Sbjct: 191  NYIFLWDEDLGVENFHPGRYVSIVEDEGLEISQPALDPKKSRVHHQITARVRNSRVHRRT 250

Query: 982  YKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQ 1161
            YK +G GRCD  STAPPCVGWVEMMAPVFS+AAW CVWH+IQN+LIHAWG+D +LGYCAQ
Sbjct: 251  YKHRGSGRCDDQSTAPPCVGWVEMMAPVFSKAAWRCVWHMIQNELIHAWGVDMQLGYCAQ 310

Query: 1162 GDRTVKVGVVDEEYIVHLGVPTLGGFLDKNKTNIELDTAHXXXXXXXXXLVA----DMYK 1329
            GDRT  VGVVD EY+VHL +PTL G LD+N+   E    H          VA    + +K
Sbjct: 311  GDRTKNVGVVDSEYVVHLALPTL-GVLDENELRGE-GHDHSSLREKLPKSVALAQSEFHK 368

Query: 1330 LDNRSAVRIRSF 1365
            +DNRSAVR +SF
Sbjct: 369  VDNRSAVRRQSF 380


>gb|ABE91850.1| Protein of unknown function DUF707 [Medicago truncatula]
            gi|92893916|gb|ABE91966.1| Protein of unknown function
            DUF707 [Medicago truncatula]
          Length = 355

 Score =  437 bits (1125), Expect = e-120
 Identities = 205/321 (63%), Positives = 253/321 (78%), Gaps = 5/321 (1%)
 Frame = +1

Query: 289  DPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTI---SHKCKEN 456
            DPK+R +L +L++ ++LICG YF+G+ +  K++ +    +GL  +  DT    S+ CK+ 
Sbjct: 14   DPKNRLFLWTLLILLSLICGAYFLGNAFSAKEYKQRLARWGLIYSMPDTTTSNSNACKKQ 73

Query: 457  CKPIGVEALPKGIISATSNLETHSLWGPVSENNK-SHNTTGLLAMAVGIDQKGLVNEIVK 633
            C+P G +ALP+GI++ TSNLET  LW   + NN+ S++   LLA++VG+ QK +V++IVK
Sbjct: 74   CRPSGTQALPQGIVARTSNLETRPLWDDSAVNNRISNHPLNLLAISVGVKQKEVVDKIVK 133

Query: 634  KFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIF 813
            KF  + FVVMLFHYDG V+ W +L WS+R IH+S +NQTKWWFAKRF+HPDIVADY YIF
Sbjct: 134  KFPSSDFVVMLFHYDGFVDGWKNLAWSNRAIHVSAINQTKWWFAKRFLHPDIVADYNYIF 193

Query: 814  LWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYKFK 993
            LWDEDL VDNF P RYLSIVKEEGLEISQPALDPGKSE+HHP+T+          YYKFK
Sbjct: 194  LWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPGKSEIHHPLTVHKAGSKVHRRYYKFK 253

Query: 994  GGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGDRT 1173
            G GRCD NSTAPPC+GWVEMMAPVFS+ +W CVWH+IQNDLIHAWGLDR+LGYCAQGDR 
Sbjct: 254  GSGRCDDNSTAPPCLGWVEMMAPVFSKKSWQCVWHMIQNDLIHAWGLDRQLGYCAQGDRM 313

Query: 1174 VKVGVVDEEYIVHLGVPTLGG 1236
              VGVVD EYIVHLG+PTLGG
Sbjct: 314  KNVGVVDSEYIVHLGLPTLGG 334


>ref|XP_007222756.1| hypothetical protein PRUPE_ppa006529mg [Prunus persica]
            gi|462419692|gb|EMJ23955.1| hypothetical protein
            PRUPE_ppa006529mg [Prunus persica]
          Length = 407

 Score =  430 bits (1105), Expect = e-118
 Identities = 203/318 (63%), Positives = 243/318 (76%), Gaps = 1/318 (0%)
 Frame = +1

Query: 283  LSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKCKENC 459
            L DPK+R +  SL +  +LICG YFIG   + K++ +    + +    Q+T    CK  C
Sbjct: 11   LPDPKNRSFYCSLFIVASLICGAYFIGGASIAKEYKERLTRWKVIYTRQNTKFDTCKNRC 70

Query: 460  KPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEIVKKF 639
            +P+G EALP+GI++ TS+LE   LWG    N  S  +  LLA+AVGI QK +V+ IVKKF
Sbjct: 71   QPLGSEALPEGIVAKTSDLEVRPLWGSSVNNENSKPSMSLLAIAVGIKQKEIVDRIVKKF 130

Query: 640  LENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIFLW 819
            L + FVVMLFHYDG V+KW DL WSDR IH+SV+NQTKWWFAKRF+HPDIV++YEYIFLW
Sbjct: 131  LSSDFVVMLFHYDGAVDKWRDLNWSDRAIHVSVMNQTKWWFAKRFLHPDIVSEYEYIFLW 190

Query: 820  DEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYKFKGG 999
            DEDL V+NF P RYLSIV+EEGLEISQPALDP KS+V+HPIT           +YKFKG 
Sbjct: 191  DEDLGVENFDPKRYLSIVREEGLEISQPALDPDKSDVYHPITARVKKLKVHRRFYKFKGS 250

Query: 1000 GRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGDRTVK 1179
            GRCD +S+APPC GWVEMMAPVFS+AAW CVW++IQNDLIHAWGLD +LGYCAQGDRT  
Sbjct: 251  GRCDNHSSAPPCAGWVEMMAPVFSKAAWQCVWYMIQNDLIHAWGLDVQLGYCAQGDRTKN 310

Query: 1180 VGVVDEEYIVHLGVPTLG 1233
            VGVVD EYIVHLG+PTLG
Sbjct: 311  VGVVDSEYIVHLGLPTLG 328


>ref|XP_006598790.1| PREDICTED: uncharacterized protein LOC100797710 isoform X4 [Glycine
            max]
          Length = 387

 Score =  429 bits (1102), Expect = e-117
 Identities = 204/323 (63%), Positives = 241/323 (74%), Gaps = 1/323 (0%)
 Frame = +1

Query: 271  YILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKC 447
            +ILP  DPK+R +L S+ + ++LI G YF+G+ +  K++ +    +GL     D+  + C
Sbjct: 12   FILP--DPKNRLFLWSVFLVVSLISGAYFVGNAFFAKEYKQRLARWGLIHTMPDSKFNSC 69

Query: 448  KENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEI 627
            K  C P G EALP+GII+ TSNLE   LW    +N        LLAMAVG++QK +VN+I
Sbjct: 70   KRQCLPFGSEALPEGIIARTSNLEMRPLWDSGKDNGILKRPLNLLAMAVGLEQKEIVNKI 129

Query: 628  VKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEY 807
            V+KFL + FVVMLFHYDG V+ W  L WS R IH+S +NQTKWWFAKRF+HPDIV +Y Y
Sbjct: 130  VEKFLSSDFVVMLFHYDGFVDGWKSLAWSSRAIHVSAINQTKWWFAKRFLHPDIVVEYNY 189

Query: 808  IFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYK 987
            IFLWDEDL VDNF P RYLSIVKEEGLEISQPALDP KSEVHHP+T+          YYK
Sbjct: 190  IFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPTKSEVHHPLTVHKAGSKVHRRYYK 249

Query: 988  FKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGD 1167
             KG GRCD  STAPPC+GWVEMMAPVFS+ +W CVWHLIQNDLIHAWGLDR+LGYCAQGD
Sbjct: 250  LKGSGRCDDKSTAPPCIGWVEMMAPVFSKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGD 309

Query: 1168 RTVKVGVVDEEYIVHLGVPTLGG 1236
            R   VGVVD EYIVHLG+PTLGG
Sbjct: 310  RMQNVGVVDSEYIVHLGLPTLGG 332


>ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797710 isoform X1 [Glycine
            max]
          Length = 385

 Score =  429 bits (1102), Expect = e-117
 Identities = 204/323 (63%), Positives = 241/323 (74%), Gaps = 1/323 (0%)
 Frame = +1

Query: 271  YILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKC 447
            +ILP  DPK+R +L S+ + ++LI G YF+G+ +  K++ +    +GL     D+  + C
Sbjct: 10   FILP--DPKNRLFLWSVFLVVSLISGAYFVGNAFFAKEYKQRLARWGLIHTMPDSKFNSC 67

Query: 448  KENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEI 627
            K  C P G EALP+GII+ TSNLE   LW    +N        LLAMAVG++QK +VN+I
Sbjct: 68   KRQCLPFGSEALPEGIIARTSNLEMRPLWDSGKDNGILKRPLNLLAMAVGLEQKEIVNKI 127

Query: 628  VKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEY 807
            V+KFL + FVVMLFHYDG V+ W  L WS R IH+S +NQTKWWFAKRF+HPDIV +Y Y
Sbjct: 128  VEKFLSSDFVVMLFHYDGFVDGWKSLAWSSRAIHVSAINQTKWWFAKRFLHPDIVVEYNY 187

Query: 808  IFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYK 987
            IFLWDEDL VDNF P RYLSIVKEEGLEISQPALDP KSEVHHP+T+          YYK
Sbjct: 188  IFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPTKSEVHHPLTVHKAGSKVHRRYYK 247

Query: 988  FKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGD 1167
             KG GRCD  STAPPC+GWVEMMAPVFS+ +W CVWHLIQNDLIHAWGLDR+LGYCAQGD
Sbjct: 248  LKGSGRCDDKSTAPPCIGWVEMMAPVFSKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGD 307

Query: 1168 RTVKVGVVDEEYIVHLGVPTLGG 1236
            R   VGVVD EYIVHLG+PTLGG
Sbjct: 308  RMQNVGVVDSEYIVHLGLPTLGG 330


>ref|XP_003592883.1| hypothetical protein MTR_2g005330 [Medicago truncatula]
            gi|355481931|gb|AES63134.1| hypothetical protein
            MTR_2g005330 [Medicago truncatula]
          Length = 406

 Score =  429 bits (1102), Expect = e-117
 Identities = 205/334 (61%), Positives = 253/334 (75%), Gaps = 18/334 (5%)
 Frame = +1

Query: 289  DPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTI---SHKCKEN 456
            DPK+R +L +L++ ++LICG YF+G+ +  K++ +    +GL  +  DT    S+ CK+ 
Sbjct: 14   DPKNRLFLWTLLILLSLICGAYFLGNAFSAKEYKQRLARWGLIYSMPDTTTSNSNACKKQ 73

Query: 457  CKPIGVEALPKGIISATSNLETHSLWGPVSENNK-SHNTTGLLAMAVGIDQKGLVNEIVK 633
            C+P G +ALP+GI++ TSNLET  LW   + NN+ S++   LLA++VG+ QK +V++IVK
Sbjct: 74   CRPSGTQALPQGIVARTSNLETRPLWDDSAVNNRISNHPLNLLAISVGVKQKEVVDKIVK 133

Query: 634  KFL-------------ENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRF 774
            KF               + FVVMLFHYDG V+ W +L WS+R IH+S +NQTKWWFAKRF
Sbjct: 134  KFKLISADLYRWWQFPSSDFVVMLFHYDGFVDGWKNLAWSNRAIHVSAINQTKWWFAKRF 193

Query: 775  MHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXX 954
            +HPDIVADY YIFLWDEDL VDNF P RYLSIVKEEGLEISQPALDPGKSE+HHP+T+  
Sbjct: 194  LHPDIVADYNYIFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPGKSEIHHPLTVHK 253

Query: 955  XXXXXXXXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGL 1134
                    YYKFKG GRCD NSTAPPC+GWVEMMAPVFS+ +W CVWH+IQNDLIHAWGL
Sbjct: 254  AGSKVHRRYYKFKGSGRCDDNSTAPPCLGWVEMMAPVFSKKSWQCVWHMIQNDLIHAWGL 313

Query: 1135 DRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGG 1236
            DR+LGYCAQGDR   VGVVD EYIVHLG+PTLGG
Sbjct: 314  DRQLGYCAQGDRMKNVGVVDSEYIVHLGLPTLGG 347


>ref|XP_003592882.1| hypothetical protein MTR_2g005330 [Medicago truncatula]
            gi|355481930|gb|AES63133.1| hypothetical protein
            MTR_2g005330 [Medicago truncatula]
          Length = 368

 Score =  429 bits (1102), Expect = e-117
 Identities = 205/334 (61%), Positives = 253/334 (75%), Gaps = 18/334 (5%)
 Frame = +1

Query: 289  DPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTI---SHKCKEN 456
            DPK+R +L +L++ ++LICG YF+G+ +  K++ +    +GL  +  DT    S+ CK+ 
Sbjct: 14   DPKNRLFLWTLLILLSLICGAYFLGNAFSAKEYKQRLARWGLIYSMPDTTTSNSNACKKQ 73

Query: 457  CKPIGVEALPKGIISATSNLETHSLWGPVSENNK-SHNTTGLLAMAVGIDQKGLVNEIVK 633
            C+P G +ALP+GI++ TSNLET  LW   + NN+ S++   LLA++VG+ QK +V++IVK
Sbjct: 74   CRPSGTQALPQGIVARTSNLETRPLWDDSAVNNRISNHPLNLLAISVGVKQKEVVDKIVK 133

Query: 634  KFL-------------ENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRF 774
            KF               + FVVMLFHYDG V+ W +L WS+R IH+S +NQTKWWFAKRF
Sbjct: 134  KFKLISADLYRWWQFPSSDFVVMLFHYDGFVDGWKNLAWSNRAIHVSAINQTKWWFAKRF 193

Query: 775  MHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXX 954
            +HPDIVADY YIFLWDEDL VDNF P RYLSIVKEEGLEISQPALDPGKSE+HHP+T+  
Sbjct: 194  LHPDIVADYNYIFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPGKSEIHHPLTVHK 253

Query: 955  XXXXXXXXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGL 1134
                    YYKFKG GRCD NSTAPPC+GWVEMMAPVFS+ +W CVWH+IQNDLIHAWGL
Sbjct: 254  AGSKVHRRYYKFKGSGRCDDNSTAPPCLGWVEMMAPVFSKKSWQCVWHMIQNDLIHAWGL 313

Query: 1135 DRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGG 1236
            DR+LGYCAQGDR   VGVVD EYIVHLG+PTLGG
Sbjct: 314  DRQLGYCAQGDRMKNVGVVDSEYIVHLGLPTLGG 347


>ref|XP_007041142.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508705077|gb|EOX96973.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 405

 Score =  428 bits (1100), Expect = e-117
 Identities = 211/364 (57%), Positives = 257/364 (70%), Gaps = 3/364 (0%)
 Frame = +1

Query: 283  LSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKCKENC 459
            +SDPK+R  L  L V  +LICG YFI   ++ K++      + +    Q++ S+ CK  C
Sbjct: 10   VSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICKIRC 69

Query: 460  KPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEIVKKF 639
            +P G EALP+GI+  TSNLE   LW    +N     ++ LLA+AVGI QK +VN+I+KKF
Sbjct: 70   RPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKF 129

Query: 640  LENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIFLW 819
              + FVVMLFHYDG+V++W DLEWSD  IH+S VNQTKWWFAKRF+HPDIVADY+Y+FLW
Sbjct: 130  PSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLW 189

Query: 820  DEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYKFKGG 999
            DEDL VDNF P +YLSIV++EGLEISQPALDP KSEVHH IT            YKFKG 
Sbjct: 190  DEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKFKGS 249

Query: 1000 GRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGDRTVK 1179
            GRCDG STAPPC+GWVEMMAPVFSRAAW C W++IQNDLIHAWGLD +LGYCAQGDR   
Sbjct: 250  GRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRMKN 309

Query: 1180 VGVVDEEYIVHLGVPTLGGFLDK--NKTNIELDTAHXXXXXXXXXLVADMYKLDNRSAVR 1353
            VGVVD EYIVHLG+ TLG   +   N T + + T             ++ +K+DNR  VR
Sbjct: 310  VGVVDAEYIVHLGLSTLGVLAENELNSTRVNI-TRRQPSSDSETLAPSESHKVDNRPEVR 368

Query: 1354 IRSF 1365
             +SF
Sbjct: 369  RQSF 372


>ref|XP_006592649.1| PREDICTED: uncharacterized protein LOC100526994 isoform X1 [Glycine
            max]
          Length = 385

 Score =  426 bits (1094), Expect = e-116
 Identities = 203/323 (62%), Positives = 240/323 (74%), Gaps = 1/323 (0%)
 Frame = +1

Query: 271  YILPLSDPKSRWLS-SLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKC 447
            ++LP  DPK+R L  S+++ ++LI G YF+G+ +  K++ +    +GL      +  + C
Sbjct: 10   FVLP--DPKNRLLLWSVLILVSLISGAYFVGNAFFAKEYKQRLARWGLIHTMPHSKFNAC 67

Query: 448  KENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEI 627
            K  C P G EALP+GII+ TSNLE   LW    +N        LLAMAVG+ QK +VN+I
Sbjct: 68   KRQCLPFGSEALPEGIIARTSNLEMRPLWDSGKDNRILKRPLNLLAMAVGLKQKEIVNKI 127

Query: 628  VKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEY 807
            V+KFL +GFVVMLFHYDG V+ W  L WS   IH+S +NQTKWWFAKRF+HPDIVA+Y Y
Sbjct: 128  VEKFLSSGFVVMLFHYDGFVDGWKSLAWSSCAIHVSAINQTKWWFAKRFLHPDIVAEYNY 187

Query: 808  IFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYK 987
            IFLWDEDL VDNF P RYLSIVKEEGLEISQPALDP KSEVHHP+T+          YYK
Sbjct: 188  IFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPTKSEVHHPLTVHKAVSKVHRRYYK 247

Query: 988  FKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGD 1167
             KG GRCD  STAPPC+GWVEMMAPVFS+ +W CVWHLIQNDLIHAWGLDR+LGYCAQGD
Sbjct: 248  LKGSGRCDDKSTAPPCIGWVEMMAPVFSKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGD 307

Query: 1168 RTVKVGVVDEEYIVHLGVPTLGG 1236
            R   VGVVD EYIVHLG+PTLGG
Sbjct: 308  RMRNVGVVDSEYIVHLGLPTLGG 330


>ref|XP_007148587.1| hypothetical protein PHAVU_006G221000g [Phaseolus vulgaris]
            gi|561021810|gb|ESW20581.1| hypothetical protein
            PHAVU_006G221000g [Phaseolus vulgaris]
          Length = 387

 Score =  421 bits (1082), Expect = e-115
 Identities = 198/321 (61%), Positives = 241/321 (75%), Gaps = 1/321 (0%)
 Frame = +1

Query: 277  LPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKCKE 453
            L L +PK+R +L S+++ ++LI G YF+G+ +   ++ +    +GL     D+ S+ CK 
Sbjct: 10   LVLPEPKNRLFLWSVLIVVSLISGAYFVGNAFFANEYKQRLARWGLIRTIPDSKSNACKR 69

Query: 454  NCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEIVK 633
             C P G +ALP+GII+ TSNLE   LW    ++        LLAMAVG+ QK +V++IV+
Sbjct: 70   QCWPFGSDALPEGIIARTSNLEMRPLWDSGRDHRIIKRPLNLLAMAVGLKQKEIVSKIVE 129

Query: 634  KFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIF 813
            KFL + FVVMLFHYDG V+ W  L WS+ VIH+S +NQTKWWFAKRF+HPDI+A+Y YIF
Sbjct: 130  KFLSSDFVVMLFHYDGSVDGWKSLAWSNHVIHVSAINQTKWWFAKRFLHPDIIAEYNYIF 189

Query: 814  LWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYKFK 993
            LWDEDL VDNF P RYLSIVKEEGLEISQPALDP KSEVHHP+T+          YYK K
Sbjct: 190  LWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPTKSEVHHPLTVHKAGSKVHRRYYKLK 249

Query: 994  GGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGDRT 1173
            G GRCD +S APPC+GWVEMMAPVFS+ +W CVWHLIQNDLIHAWGLDR+LGYCAQGDR 
Sbjct: 250  GSGRCDDDSIAPPCIGWVEMMAPVFSKKSWQCVWHLIQNDLIHAWGLDRQLGYCAQGDRM 309

Query: 1174 VKVGVVDEEYIVHLGVPTLGG 1236
              VGVVD EYIVHLG+PTLGG
Sbjct: 310  KNVGVVDSEYIVHLGLPTLGG 330


>ref|XP_007041145.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508705080|gb|EOX96976.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 374

 Score =  420 bits (1079), Expect = e-115
 Identities = 203/327 (62%), Positives = 243/327 (74%), Gaps = 1/327 (0%)
 Frame = +1

Query: 283  LSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKCKENC 459
            +SDPK+R  L  L V  +LICG YFI   ++ K++      + +    Q++ S+ CK  C
Sbjct: 10   VSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICKIRC 69

Query: 460  KPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEIVKKF 639
            +P G EALP+GI+  TSNLE   LW    +N     ++ LLA+AVGI QK +VN+I+KKF
Sbjct: 70   RPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKF 129

Query: 640  LENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIFLW 819
              + FVVMLFHYDG+V++W DLEWSD  IH+S VNQTKWWFAKRF+HPDIVADY+Y+FLW
Sbjct: 130  PSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLW 189

Query: 820  DEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYKFKGG 999
            DEDL VDNF P +YLSIV++EGLEISQPALDP KSEVHH IT            YKFKG 
Sbjct: 190  DEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKFKGS 249

Query: 1000 GRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGDRTVK 1179
            GRCDG STAPPC+GWVEMMAPVFSRAAW C W++IQNDLIHAWGLD +LGYCAQGDR   
Sbjct: 250  GRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRMKN 309

Query: 1180 VGVVDEEYIVHLGVPTLGGFLDKNKTN 1260
            VGVVD EYIVHLG+ TL G L +N+ N
Sbjct: 310  VGVVDAEYIVHLGLSTL-GVLAENELN 335


>ref|XP_007041144.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
            gi|508705079|gb|EOX96975.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
          Length = 438

 Score =  420 bits (1079), Expect = e-115
 Identities = 203/327 (62%), Positives = 243/327 (74%), Gaps = 1/327 (0%)
 Frame = +1

Query: 283  LSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKCKENC 459
            +SDPK+R  L  L V  +LICG YFI   ++ K++      + +    Q++ S+ CK  C
Sbjct: 92   VSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICKIRC 151

Query: 460  KPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEIVKKF 639
            +P G EALP+GI+  TSNLE   LW    +N     ++ LLA+AVGI QK +VN+I+KKF
Sbjct: 152  RPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKF 211

Query: 640  LENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIFLW 819
              + FVVMLFHYDG+V++W DLEWSD  IH+S VNQTKWWFAKRF+HPDIVADY+Y+FLW
Sbjct: 212  PSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLW 271

Query: 820  DEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXXXXYYKFKGG 999
            DEDL VDNF P +YLSIV++EGLEISQPALDP KSEVHH IT            YKFKG 
Sbjct: 272  DEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHRRMYKFKGS 331

Query: 1000 GRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGYCAQGDRTVK 1179
            GRCDG STAPPC+GWVEMMAPVFSRAAW C W++IQNDLIHAWGLD +LGYCAQGDR   
Sbjct: 332  GRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQLGYCAQGDRMKN 391

Query: 1180 VGVVDEEYIVHLGVPTLGGFLDKNKTN 1260
            VGVVD EYIVHLG+ TL G L +N+ N
Sbjct: 392  VGVVDAEYIVHLGLSTL-GVLAENELN 417


>ref|XP_006598789.1| PREDICTED: uncharacterized protein LOC100797710 isoform X3 [Glycine
            max]
          Length = 400

 Score =  419 bits (1078), Expect = e-114
 Identities = 204/338 (60%), Positives = 242/338 (71%), Gaps = 16/338 (4%)
 Frame = +1

Query: 271  YILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKC 447
            +ILP  DPK+R +L S+ + ++LI G YF+G+ +  K++ +    +GL     D+  + C
Sbjct: 10   FILP--DPKNRLFLWSVFLVVSLISGAYFVGNAFFAKEYKQRLARWGLIHTMPDSKFNSC 67

Query: 448  KENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHN---------------TTGLL 582
            K  C P G EALP+GII+ TSNLE   LW    +N    +                  LL
Sbjct: 68   KRQCLPFGSEALPEGIIARTSNLEMRPLWDSGKDNAYKSSFPLDCLSCDQGILKRPLNLL 127

Query: 583  AMAVGIDQKGLVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWF 762
            AMAVG++QK +VN+IV+KFL + FVVMLFHYDG V+ W  L WS R IH+S +NQTKWWF
Sbjct: 128  AMAVGLEQKEIVNKIVEKFLSSDFVVMLFHYDGFVDGWKSLAWSSRAIHVSAINQTKWWF 187

Query: 763  AKRFMHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPI 942
            AKRF+HPDIV +Y YIFLWDEDL VDNF P RYLSIVKEEGLEISQPALDP KSEVHHP+
Sbjct: 188  AKRFLHPDIVVEYNYIFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPTKSEVHHPL 247

Query: 943  TIXXXXXXXXXXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIH 1122
            T+          YYK KG GRCD  STAPPC+GWVEMMAPVFS+ +W CVWHLIQNDLIH
Sbjct: 248  TVHKAGSKVHRRYYKLKGSGRCDDKSTAPPCIGWVEMMAPVFSKKSWQCVWHLIQNDLIH 307

Query: 1123 AWGLDRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGG 1236
            AWGLDR+LGYCAQGDR   VGVVD EYIVHLG+PTLGG
Sbjct: 308  AWGLDRQLGYCAQGDRMQNVGVVDSEYIVHLGLPTLGG 345


>ref|XP_006598788.1| PREDICTED: uncharacterized protein LOC100797710 isoform X2 [Glycine
            max]
          Length = 402

 Score =  419 bits (1078), Expect = e-114
 Identities = 204/338 (60%), Positives = 242/338 (71%), Gaps = 16/338 (4%)
 Frame = +1

Query: 271  YILPLSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKC 447
            +ILP  DPK+R +L S+ + ++LI G YF+G+ +  K++ +    +GL     D+  + C
Sbjct: 12   FILP--DPKNRLFLWSVFLVVSLISGAYFVGNAFFAKEYKQRLARWGLIHTMPDSKFNSC 69

Query: 448  KENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHN---------------TTGLL 582
            K  C P G EALP+GII+ TSNLE   LW    +N    +                  LL
Sbjct: 70   KRQCLPFGSEALPEGIIARTSNLEMRPLWDSGKDNAYKSSFPLDCLSCDQGILKRPLNLL 129

Query: 583  AMAVGIDQKGLVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWF 762
            AMAVG++QK +VN+IV+KFL + FVVMLFHYDG V+ W  L WS R IH+S +NQTKWWF
Sbjct: 130  AMAVGLEQKEIVNKIVEKFLSSDFVVMLFHYDGFVDGWKSLAWSSRAIHVSAINQTKWWF 189

Query: 763  AKRFMHPDIVADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPI 942
            AKRF+HPDIV +Y YIFLWDEDL VDNF P RYLSIVKEEGLEISQPALDP KSEVHHP+
Sbjct: 190  AKRFLHPDIVVEYNYIFLWDEDLLVDNFDPKRYLSIVKEEGLEISQPALDPTKSEVHHPL 249

Query: 943  TIXXXXXXXXXXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIH 1122
            T+          YYK KG GRCD  STAPPC+GWVEMMAPVFS+ +W CVWHLIQNDLIH
Sbjct: 250  TVHKAGSKVHRRYYKLKGSGRCDDKSTAPPCIGWVEMMAPVFSKKSWQCVWHLIQNDLIH 309

Query: 1123 AWGLDRKLGYCAQGDRTVKVGVVDEEYIVHLGVPTLGG 1236
            AWGLDR+LGYCAQGDR   VGVVD EYIVHLG+PTLGG
Sbjct: 310  AWGLDRQLGYCAQGDRMQNVGVVDSEYIVHLGLPTLGG 347


>ref|XP_007041143.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508705078|gb|EOX96974.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 416

 Score =  419 bits (1078), Expect = e-114
 Identities = 211/375 (56%), Positives = 257/375 (68%), Gaps = 14/375 (3%)
 Frame = +1

Query: 283  LSDPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDTISHKCKENC 459
            +SDPK+R  L  L V  +LICG YFI   ++ K++      + +    Q++ S+ CK  C
Sbjct: 10   VSDPKTRSCLCRLFVVASLICGAYFISGAFIAKEYKDRLSRWEVINMLQNSKSNICKIRC 69

Query: 460  KPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKGLVNEIVKKF 639
            +P G EALP+GI+  TSNLE   LW    +N     ++ LLA+AVGI QK +VN+I+KKF
Sbjct: 70   RPPGSEALPQGIVVKTSNLEMRPLWSDTVKNGNLEPSSNLLAIAVGIKQKEIVNQIIKKF 129

Query: 640  LENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIVADYEYIFLW 819
              + FVVMLFHYDG+V++W DLEWSD  IH+S VNQTKWWFAKRF+HPDIVADY+Y+FLW
Sbjct: 130  PSSDFVVMLFHYDGIVDEWRDLEWSDHAIHVSAVNQTKWWFAKRFLHPDIVADYKYLFLW 189

Query: 820  DEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITI-----------XXXXXX 966
            DEDL VDNF P +YLSIV++EGLEISQPALDP KSEVHH IT                  
Sbjct: 190  DEDLGVDNFDPKQYLSIVEDEGLEISQPALDPVKSEVHHQITARRRNSRVHSYDTINPSR 249

Query: 967  XXXXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKL 1146
                 YKFKG GRCDG STAPPC+GWVEMMAPVFSRAAW C W++IQNDLIHAWGLD +L
Sbjct: 250  LNRRMYKFKGSGRCDGRSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDMQL 309

Query: 1147 GYCAQGDRTVKVGVVDEEYIVHLGVPTLGGFLDK--NKTNIELDTAHXXXXXXXXXLVAD 1320
            GYCAQGDR   VGVVD EYIVHLG+ TLG   +   N T + + T             ++
Sbjct: 310  GYCAQGDRMKNVGVVDAEYIVHLGLSTLGVLAENELNSTRVNI-TRRQPSSDSETLAPSE 368

Query: 1321 MYKLDNRSAVRIRSF 1365
             +K+DNR  VR +SF
Sbjct: 369  SHKVDNRPEVRRQSF 383


>ref|XP_002528866.1| conserved hypothetical protein [Ricinus communis]
            gi|223531717|gb|EEF33540.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 370

 Score =  419 bits (1077), Expect = e-114
 Identities = 202/329 (61%), Positives = 245/329 (74%), Gaps = 2/329 (0%)
 Frame = +1

Query: 259  MKFYYILPLS-DPKSR-WLSSLIVSITLICGVYFIGSTWLRKDFLKFSPGFGLKTADQDT 432
            MK  Y    S DPKSR +L +L V  +LIC  YFIG +++ K++ +    + +    Q T
Sbjct: 1    MKSLYCASASPDPKSRSYLCTLFVVASLICSAYFIGGSFIGKEYKERLARWQVIETVQST 60

Query: 433  ISHKCKENCKPIGVEALPKGIISATSNLETHSLWGPVSENNKSHNTTGLLAMAVGIDQKG 612
             S  C++ CKP G +ALP+GI+  TS+ E   LW    E+NK   +  LLA+AVGI+QK 
Sbjct: 61   KSTNCEDQCKPTGTKALPQGIVRKTSDFEMRPLWNSSLEDNKQKLSKSLLALAVGINQKV 120

Query: 613  LVNEIVKKFLENGFVVMLFHYDGVVEKWNDLEWSDRVIHISVVNQTKWWFAKRFMHPDIV 792
            +V++IVKKF  + FVVMLFHYDGVV+KW DL WSD  IH+S VNQTKWWFAKRF+HPDIV
Sbjct: 121  VVDQIVKKFPLSDFVVMLFHYDGVVDKWRDLPWSDHAIHVSAVNQTKWWFAKRFLHPDIV 180

Query: 793  ADYEYIFLWDEDLRVDNFHPGRYLSIVKEEGLEISQPALDPGKSEVHHPITIXXXXXXXX 972
            ++Y+Y+FLWDEDL V+NF+P RYLSI+++EGLEISQPALDP KS V+HPIT         
Sbjct: 181  SEYDYLFLWDEDLGVENFNPKRYLSIIRDEGLEISQPALDPTKSAVYHPITARQPKSTVH 240

Query: 973  XXYYKFKGGGRCDGNSTAPPCVGWVEMMAPVFSRAAWCCVWHLIQNDLIHAWGLDRKLGY 1152
               YKFKG GRC GNST+PPC+GWVEMMAPVFS AAW C WH+IQNDLIHAWGLD +LGY
Sbjct: 241  RRIYKFKGSGRCYGNSTSPPCIGWVEMMAPVFSTAAWRCAWHMIQNDLIHAWGLDFQLGY 300

Query: 1153 CAQGDRTVKVGVVDEEYIVHLGVPTLGGF 1239
            CAQGDRT  VGVVD EYIVHLG+ TLG F
Sbjct: 301  CAQGDRTKNVGVVDSEYIVHLGLLTLGVF 329


Top