BLASTX nr result

ID: Mentha25_contig00028677 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00028677
         (1042 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU39971.1| hypothetical protein MIMGU_mgv1a000318mg [Mimulus...   208   3e-51
ref|XP_003632681.1| PREDICTED: uncharacterized protein LOC100257...   150   1e-33
emb|CBI29995.3| unnamed protein product [Vitis vinifera]              140   7e-31
ref|XP_003625298.1| hypothetical protein MTR_7g093630 [Medicago ...   134   8e-29
ref|XP_002322738.1| hypothetical protein POPTR_0016s06020g [Popu...   133   1e-28
ref|XP_002530363.1| conserved hypothetical protein [Ricinus comm...   130   1e-27
ref|XP_007203211.1| hypothetical protein PRUPE_ppa000350mg [Prun...   126   2e-26
ref|XP_004493617.1| PREDICTED: uncharacterized protein LOC101489...   125   4e-26
gb|EXC30858.1| hypothetical protein L484_028037 [Morus notabilis]     122   2e-25
ref|XP_007029040.1| Uncharacterized protein isoform 2 [Theobroma...   122   3e-25
ref|XP_007029039.1| Uncharacterized protein isoform 1 [Theobroma...   122   3e-25
ref|XP_004303344.1| PREDICTED: uncharacterized protein LOC101309...   119   3e-24
ref|XP_006576869.1| PREDICTED: uncharacterized protein LOC100786...   114   5e-23
ref|XP_003520543.1| PREDICTED: uncharacterized protein LOC100786...   114   5e-23
ref|XP_003553437.1| PREDICTED: uncharacterized protein LOC100813...   114   7e-23
ref|XP_006492833.1| PREDICTED: uncharacterized protein LOC102619...   113   1e-22
ref|XP_006429914.1| hypothetical protein CICLE_v100109261mg, par...   113   1e-22
ref|XP_004249188.1| PREDICTED: uncharacterized protein LOC101258...   110   1e-21
ref|XP_002881811.1| hypothetical protein ARALYDRAFT_321881 [Arab...   109   2e-21
ref|XP_004497878.1| PREDICTED: uncharacterized protein LOC101509...   108   4e-21

>gb|EYU39971.1| hypothetical protein MIMGU_mgv1a000318mg [Mimulus guttatus]
          Length = 1263

 Score =  208 bits (529), Expect = 3e-51
 Identities = 136/278 (48%), Positives = 165/278 (59%), Gaps = 28/278 (10%)
 Frame = +1

Query: 106  HKVVDDQITPTRQNANSHEP-ASGAINSLSSLPAETAIYCESGKASVGVGMDVQPQSVLP 282
            H  V++Q T   Q+A S  P  S   NS+ SL  E A YCE+ KA+V +G  VQPQSVLP
Sbjct: 995  HGTVENQSTSRGQDAKSQAPPTSTGTNSVGSLVKEAAPYCENTKANVSIG--VQPQSVLP 1052

Query: 283  TMMHNQNIQYPIYQAPTMGCY---PVSWH--TNGLMSFHYSNHYLFANGFGYDLNTNAQL 447
              MHN+NI +P++QAPTMG Y   PVSW   TNGLMSF +SNHYLFAN +GY LN NA+ 
Sbjct: 1053 -QMHNKNINFPVFQAPTMGYYHQNPVSWAGPTNGLMSFPHSNHYLFANTYGYGLNGNARF 1111

Query: 448  MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVCTKNQKVSN----------------- 576
            M YG L  QH+P  LIN +H+PV+  V+    V   +  KV++                 
Sbjct: 1112 MQYGAL--QHMPPQLINHVHVPVYQPVSQVNGVNLNEPAKVAHLPGLKEGQPRIKKVEHP 1169

Query: 577  PETPTVAVAEDGQNGMISEKVADKGSNDFSLFHFGGPVALSEGFKAEAV-TXXXXXXXXX 753
             E PTV  A   QNG   +   D G+N FSLFHFGGPVALS GFKA+ +           
Sbjct: 1170 AEVPTVLDAV--QNGKPDK--MDMGNNGFSLFHFGGPVALSTGFKADPIPLKEGFMGNAS 1225

Query: 754  XXXXXXXPDG---CNRKDSIEEYNLFAASN-GIKFSIY 855
                    DG   C++KDSIEEYNLFAA+N GIKFSIY
Sbjct: 1226 PNSSINCTDGDHTCDKKDSIEEYNLFAATNGGIKFSIY 1263


>ref|XP_003632681.1| PREDICTED: uncharacterized protein LOC100257222 [Vitis vinifera]
          Length = 1284

 Score =  150 bits (378), Expect = 1e-33
 Identities = 120/307 (39%), Positives = 160/307 (52%), Gaps = 41/307 (13%)
 Frame = +1

Query: 58   GRNMLHCLETTRFVDCHKVVDD--QITPTRQNANSHEPAS----GAINSL-SSLPAETAI 216
            GR    C++   F +CH+VV +  QI   ++   S   A      A NSL ++ P +TA 
Sbjct: 986  GRETSVCIQNG-FPECHEVVVEKKQIENGKEAFRSKMSAGFSPDSARNSLPANAPTKTAQ 1044

Query: 217  YCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAP-TMGCY---PVSW---HTNGLM 375
              +SGK +V +G   Q Q +LPTM H QN+ YP++QAP TM  Y   PVSW     NGLM
Sbjct: 1045 NLDSGKPNVSMGS--QHQGMLPTM-HKQNLHYPMFQAPSTMSYYHQNPVSWPAASANGLM 1101

Query: 376  SFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVC 552
             F + NHYLF +  GY LN +++L M Y  LQ  HL  P++NP  +PV+  +T    V  
Sbjct: 1102 PFPHPNHYLFTSPLGYGLNGSSRLCMQYSALQ--HLTPPVLNPGQLPVYHPITKANGVNS 1159

Query: 553  TKNQKV-------------------SNPETPTVAV--AEDGQNGMISEKVADKGSNDFSL 669
             + +K+                   S    PT A    +DGQNG  ++     G+  FSL
Sbjct: 1160 EEQEKIFKTGGAQEAFNEAKKERVPSAGPRPTDAPPNGDDGQNGNSAK--LHTGNQSFSL 1217

Query: 670  FHFGGPVALSEGFKAEAV-TXXXXXXXXXXXXXXXXPDG---CNRKD-SIEEYNLFAASN 834
            FHFGGPVALS G K   V +                 DG   CN+K+ +IEEYNLFAASN
Sbjct: 1218 FHFGGPVALSTGNKVNPVPSKEGNVGDYSSKFSADHVDGDHACNKKETTIEEYNLFAASN 1277

Query: 835  GIKFSIY 855
            G+KFS +
Sbjct: 1278 GMKFSFF 1284


>emb|CBI29995.3| unnamed protein product [Vitis vinifera]
          Length = 1196

 Score =  140 bits (354), Expect = 7e-31
 Identities = 102/245 (41%), Positives = 131/245 (53%), Gaps = 14/245 (5%)
 Frame = +1

Query: 163  PASGAINSL-SSLPAETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAP-TM 336
            P   A NSL ++ P +TA   +SGK +V +G   Q Q +LPTM H QN+ YP++QAP TM
Sbjct: 969  PEYSARNSLPANAPTKTAQNLDSGKPNVSMGS--QHQGMLPTM-HKQNLHYPMFQAPSTM 1025

Query: 337  GCY---PVSW---HTNGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLI 495
              Y   PVSW     NGLM F + NHYLF +  GY LN +++L M Y  LQ  HL  P++
Sbjct: 1026 SYYHQNPVSWPAASANGLMPFPHPNHYLFTSPLGYGLNGSSRLCMQYSALQ--HLTPPVL 1083

Query: 496  NPLHMPVFPTVTTQANVVCTKNQKVSNPETPTVAVAEDGQNGMISEKVADKGSNDFSLFH 675
            NP  +PV+  +T    V   + +K+        A  E             K    FSLFH
Sbjct: 1084 NPGQLPVYHPITKANGVNSEEQEKIFKTGGAQEAFNEA------------KKERSFSLFH 1131

Query: 676  FGGPVALSEGFKAEAV-TXXXXXXXXXXXXXXXXPDG---CNRKDS-IEEYNLFAASNGI 840
            FGGPVALS G K   V +                 DG   CN+K++ IEEYNLFAASNG+
Sbjct: 1132 FGGPVALSTGNKVNPVPSKEGNVGDYSSKFSADHVDGDHACNKKETTIEEYNLFAASNGM 1191

Query: 841  KFSIY 855
            KFS +
Sbjct: 1192 KFSFF 1196


>ref|XP_003625298.1| hypothetical protein MTR_7g093630 [Medicago truncatula]
            gi|355500313|gb|AES81516.1| hypothetical protein
            MTR_7g093630 [Medicago truncatula]
          Length = 1261

 Score =  134 bits (336), Expect = 8e-29
 Identities = 109/296 (36%), Positives = 147/296 (49%), Gaps = 44/296 (14%)
 Frame = +1

Query: 100  DCHKVVDDQITPTRQNANSHEPASGAINSLSSLPAETAIYCESGKA-----SVGVGMDV- 261
            DCH+V  +      QNAN  E  S + +SL+    +      SG       S G G    
Sbjct: 982  DCHEVAMEN----NQNANG-ESLSRSSSSLTGASFDGTRSDASGNFVEIGHSFGNGFSTT 1036

Query: 262  ----QPQSVLPTMMHNQNIQYPIYQAP-TMGCY---PVSWH---TNGLMSFHYSNHYLFA 408
                QPQ++ P ++ NQNIQ+P +QAP TMG +   PVSW    TNGLM F + NHYL+A
Sbjct: 1037 NVCSQPQNLFP-LVSNQNIQFPAFQAPSTMGYFHQNPVSWPAAPTNGLMPFAHPNHYLYA 1095

Query: 409  NGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVCTKN-QKVSNPE 582
               GY LN + +  + YG LQ    P P+ NP  +PV+  V  +ANV+  +   +VS P 
Sbjct: 1096 GPLGYGLNEDPRFCLQYGSLQQ---PTPMFNPA-IPVYQPVA-RANVLNAEEWAQVSKP- 1149

Query: 583  TPTVAVAEDGQNGMISEKVADKGSN-----------------------DFSLFHFGGPVA 693
                A  ++  NG I+E+    G+N                       DFSLFHFGGPVA
Sbjct: 1150 ----ASLQEHINGSIAERAVSSGNNLKIPVFNGEVKHDRSAKSQENNGDFSLFHFGGPVA 1205

Query: 694  LSEGFKAEAVTXXXXXXXXXXXXXXXXPDGCNRKD--SIEEYNLFAASNGIKFSIY 855
            LS G K+   +                   CN+KD  ++EEYNLFAASN ++FSI+
Sbjct: 1206 LSTGCKSALASSNGDVSLKSSADHAEKVHTCNKKDTTTMEEYNLFAASNNLRFSIF 1261


>ref|XP_002322738.1| hypothetical protein POPTR_0016s06020g [Populus trichocarpa]
            gi|222867368|gb|EEF04499.1| hypothetical protein
            POPTR_0016s06020g [Populus trichocarpa]
          Length = 1180

 Score =  133 bits (334), Expect = 1e-28
 Identities = 89/216 (41%), Positives = 117/216 (54%), Gaps = 16/216 (7%)
 Frame = +1

Query: 256  DVQPQSVLPTMMHNQNIQYPIYQAP-TMGCY----PVSWHT---NGLMSFHYSNHYLFAN 411
            ++QP  V P M HN N+Q+P++QAP TMG Y    PVSW     NGLM F   NHYL+A 
Sbjct: 977  ELQPDVVFPPM-HNHNLQFPVFQAPSTMGYYHHQTPVSWPAAPANGLMPFPQPNHYLYAG 1035

Query: 412  GFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVCTKNQKVSNPETP 588
              GY LN N++  M YG +Q  HL  P+ NP  +PV+  V  +  +         N E  
Sbjct: 1036 SLGYGLNGNSRFCMQYGPVQ--HLATPVFNPGPVPVYQPVAKEYGL---------NSEVR 1084

Query: 589  TVAVAE-DGQNGMISEKVA-DKGSNDFSLFHFGGPVALSEGFKAEAVT----XXXXXXXX 750
            T   A   G++G +        G++ FSLFHFGGPVALS G K++ V             
Sbjct: 1085 TETQAPPSGESGKVDNSAKLPNGNSGFSLFHFGGPVALSTGCKSDPVPSKNGIIGDFSSK 1144

Query: 751  XXXXXXXXPDGCNRKD-SIEEYNLFAASNGIKFSIY 855
                       CN+K+ ++EEYNLFAASNGI+FSI+
Sbjct: 1145 VTTNQIENDPACNKKEIAMEEYNLFAASNGIRFSIF 1180


>ref|XP_002530363.1| conserved hypothetical protein [Ricinus communis]
            gi|223530110|gb|EEF32024.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1239

 Score =  130 bits (326), Expect = 1e-27
 Identities = 89/267 (33%), Positives = 134/267 (50%), Gaps = 36/267 (13%)
 Frame = +1

Query: 163  PASGAINSLSSLPAETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAPTMGC 342
            P    +N+L ++P + A   ++G  +V +G   Q QS+ PTM  NQN+Q+P++ +P++  
Sbjct: 986  PECPRLNALGNMPTKAAQNTDNGIPAVAIGS--QQQSMFPTMQ-NQNLQFPVFHSPSLNY 1042

Query: 343  Y---PVSWHT---NGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINP 501
            Y   PV+W     NGLM F + NHYL+A+   Y LN N++L M Y  +   HL  P+ NP
Sbjct: 1043 YHQNPVAWPAAPPNGLMPFPHPNHYLYASPLSYGLNGNSRLCMQYSPVH--HLATPVFNP 1100

Query: 502  LHMPVFPTVTTQANVVCTKNQKVSNPETPTVAVAEDGQNGMISEKVADKGSN-------- 657
              +PV+  V         K   +++ E     + ++     ++EK A  GS+        
Sbjct: 1101 GPVPVYQAVG--------KANGLNSEERIKTCIVQEALTDDMAEKKASAGSHLTEGPPSG 1152

Query: 658  ----------------DFSLFHFGGPVALSEGFKAEAVTXXXXXXXXXXXXXXXXP---- 777
                             FSLFHFGGPVALS G K E+V+                     
Sbjct: 1153 EGGKMDNSAKLHVSDSSFSLFHFGGPVALSTGCKPESVSKKDGLVGDLSSKVSADQIENN 1212

Query: 778  DGCNRKD-SIEEYNLFAASNGIKFSIY 855
              CN+K+ ++EEYNLFAASNG++FS +
Sbjct: 1213 SACNKKETTVEEYNLFAASNGLRFSFF 1239


>ref|XP_007203211.1| hypothetical protein PRUPE_ppa000350mg [Prunus persica]
            gi|462398742|gb|EMJ04410.1| hypothetical protein
            PRUPE_ppa000350mg [Prunus persica]
          Length = 1257

 Score =  126 bits (316), Expect = 2e-26
 Identities = 91/289 (31%), Positives = 143/289 (49%), Gaps = 35/289 (12%)
 Frame = +1

Query: 94   FVDCHKVVDDQITPTRQNANSHEPASGAINS-----LSSLPAETAIYCESGKASVGVGMD 258
            F +CH + ++Q     ++  S   +  ++N      L +     A   ++G +++ VG  
Sbjct: 980  FPECHGMENNQDAKRGESMESRALSGPSLNGAGSNILGNPSTNIAQRFDNGLSAISVGS- 1038

Query: 259  VQPQSVLPTMMHNQNIQYPIYQAPTMGCY---PVSW---HTNGLMSFHYSNHYLFANGFG 420
             Q   +L T MHNQN+ +P++QAP+MG Y    VSW    T+G+MSF + NHYL+A   G
Sbjct: 1039 -QHHGML-TPMHNQNVHFPLFQAPSMGYYHQSSVSWPAAPTSGMMSFPHPNHYLYAGPLG 1096

Query: 421  YDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVCTKNQKVSNP------ 579
            Y +N N+   M Y  +  QH+P PL  P  +P++P + T+      +  ++SNP      
Sbjct: 1097 YGMNGNSGFCMPYSPV--QHVPTPLFTPGPVPIYPAINTE------EQTQISNPGVQESL 1148

Query: 580  -ETPTVAVAEDGQNGMISEKVADKGSND-----------FSLFHFGGPVALSEGFKAEAV 723
             E  T +V   G   M +    ++  +D           FSLFH+GGP+A   G  +  +
Sbjct: 1149 YEANTESVDPSGPYSMQAPASGERAEDDNSGRLHTSNDSFSLFHYGGPLADPPGCNSNLM 1208

Query: 724  TXXXXXXXXXXXXXXXXPD----GCNRKD-SIEEYNLFAASNGIKFSIY 855
                              +     CN+K+ +IEEYNLFAASNGI+FS +
Sbjct: 1209 PLEEQTVGDFPQKCSDHVENDHHACNKKEATIEEYNLFAASNGIRFSFF 1257


>ref|XP_004493617.1| PREDICTED: uncharacterized protein LOC101489385 [Cicer arietinum]
          Length = 1264

 Score =  125 bits (313), Expect = 4e-26
 Identities = 101/266 (37%), Positives = 135/266 (50%), Gaps = 38/266 (14%)
 Frame = +1

Query: 172  GAINSLSSLPAETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAP-TMGCY- 345
            G  +S S   AE A   ++G ++  V    QPQS+LP +  NQNIQ+P + AP T+G + 
Sbjct: 1012 GTRSSASGNFAEIAQNFDNGFSTTNVCS--QPQSMLPAVS-NQNIQFPAFHAPSTIGYFH 1068

Query: 346  --PVSWH---TNGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLH 507
              PVSW    TNGLM F + NHYL+A   GY LN +    + YG LQ    PAPL NP  
Sbjct: 1069 QSPVSWPAAPTNGLMPFPHPNHYLYAGPLGYGLNEDPHFCLQYGALQQ---PAPLFNPA- 1124

Query: 508  MPVFPTVTTQANVVCTKN-QKVSNPETPTVAVAEDGQNGMISEKVADKGSN--------- 657
            +PV+  V  +ANV+  +   +VS P     A  ++  NG I+E+    G+N         
Sbjct: 1125 VPVYQPVA-RANVLNVEEWTRVSKP-----ASLQEHINGSIAERAVSSGTNYKKPEFSGE 1178

Query: 658  --------------DFSLFHFGGPVALSEGFKAEAV----TXXXXXXXXXXXXXXXXPDG 783
                          DFSLFHFGGPVALS G K+                           
Sbjct: 1179 VKHDRSAKSQENNSDFSLFHFGGPVALSTGCKSSLAFSNGNAADDFSLKSSADHAEKVHT 1238

Query: 784  CNRKD--SIEEYNLFAASNGIKFSIY 855
            CN+K+  ++EEYNLFAASN ++FSI+
Sbjct: 1239 CNKKETTTMEEYNLFAASNNLRFSIF 1264


>gb|EXC30858.1| hypothetical protein L484_028037 [Morus notabilis]
          Length = 1339

 Score =  122 bits (307), Expect = 2e-25
 Identities = 102/283 (36%), Positives = 139/283 (49%), Gaps = 45/283 (15%)
 Frame = +1

Query: 142  QNANSHEPASGAI-----------NSLSSLPAETAIYCESGKASVGVGMDVQPQSVLPTM 288
            QNAN  EP  G             N++ +   + A   ++G ++V +G   Q QS +  M
Sbjct: 1069 QNANGGEPIGGRTSVGSQNGVLGNNAIGTPTTKIAHAFDNGLSAVNMGS--QHQSTISPM 1126

Query: 289  MHNQNIQYPIYQAP-TMGCY---PVSWHT---NGLMSFHYSNHYLFANGFGYDLNTNAQL 447
                   +P++QAP T+G Y   PVSW     NGL+ F + NHYL+A+  GY +N N++ 
Sbjct: 1127 ------HFPVFQAPSTLGYYHQNPVSWPAAPNNGLIPFSHPNHYLYADPLGYGMNGNSRF 1180

Query: 448  -MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVCTKNQ-KVSNP---ETPTVAVAEDG 612
             M YG +Q  HL  PL  P  +P +  +  +ANV+  + Q ++S P   E P VA  EDG
Sbjct: 1181 CMQYGPMQ--HLATPLYAPGPVPFYQPIA-KANVINPEEQTQISKPHVQEAPNVAT-EDG 1236

Query: 613  QN--GMISEKVA---------------DKGSNDFSLFHFGGPVALSEGFKAEAVTXXXXX 741
             +  G  S + A               + G   FSLFHFGGPVALS G K   V      
Sbjct: 1237 TDLVGRHSTQAAPSGEGFQRDDPGKPHNTGDKSFSLFHFGGPVALSSGCKPNPVPSKEEI 1296

Query: 742  XXXXXXXXXXXP----DGCNRKD-SIEEYNLFAASNGIKFSIY 855
                       P      CN+K+ +IEEYNLFAASNGI FS +
Sbjct: 1297 VGDFSTKCPTDPVEGDPACNKKEATIEEYNLFAASNGISFSFF 1339


>ref|XP_007029040.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508717645|gb|EOY09542.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1174

 Score =  122 bits (305), Expect = 3e-25
 Identities = 94/250 (37%), Positives = 125/250 (50%), Gaps = 33/250 (13%)
 Frame = +1

Query: 205  ETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAP-TMGCY---PVSWHT--- 363
            +TA   ++GK +  +G   Q      T +HNQ+IQ+P+YQAP TMG Y   PVSW     
Sbjct: 932  KTAENSDNGKPTAVMGSQHQGMF---TSVHNQHIQFPVYQAPSTMGYYHQNPVSWPASPA 988

Query: 364  NGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQA 540
            NGLM F   N YL+A   GY LN N++L M YG LQ  HL  PL NP  +PV+  V+   
Sbjct: 989  NGLMPFP-PNPYLYAGPLGYGLNGNSRLCMPYGTLQ--HLATPLFNPGPVPVYQPVSKVN 1045

Query: 541  NVVCTKNQKVSNPETPTVAVAE--------------------DGQNGMISEKVADKGSND 660
             +   +  ++  P T   A  E                    +G+   +S K+    ++ 
Sbjct: 1046 GLYSEEQTQIPKPGTTKEAFTEVNTERVVPGRLHPTEQAANGEGRQNDVSAKLHTDNTS- 1104

Query: 661  FSLFHFGGPVALSEGFKAEAV----TXXXXXXXXXXXXXXXXPDGCNRKD-SIEEYNLFA 825
            FSLFHFGGPVALS G K+  V                        CN+K+ +IEEYNLFA
Sbjct: 1105 FSLFHFGGPVALSTGCKSNPVPLKDEIVGELSSQFSVDHVENGHACNKKETTIEEYNLFA 1164

Query: 826  ASNGIKFSIY 855
            ASNGI+F  +
Sbjct: 1165 ASNGIRFPFF 1174


>ref|XP_007029039.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508717644|gb|EOY09541.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1271

 Score =  122 bits (305), Expect = 3e-25
 Identities = 94/250 (37%), Positives = 125/250 (50%), Gaps = 33/250 (13%)
 Frame = +1

Query: 205  ETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAP-TMGCY---PVSWHT--- 363
            +TA   ++GK +  +G   Q      T +HNQ+IQ+P+YQAP TMG Y   PVSW     
Sbjct: 1029 KTAENSDNGKPTAVMGSQHQGMF---TSVHNQHIQFPVYQAPSTMGYYHQNPVSWPASPA 1085

Query: 364  NGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQA 540
            NGLM F   N YL+A   GY LN N++L M YG LQ  HL  PL NP  +PV+  V+   
Sbjct: 1086 NGLMPFP-PNPYLYAGPLGYGLNGNSRLCMPYGTLQ--HLATPLFNPGPVPVYQPVSKVN 1142

Query: 541  NVVCTKNQKVSNPETPTVAVAE--------------------DGQNGMISEKVADKGSND 660
             +   +  ++  P T   A  E                    +G+   +S K+    ++ 
Sbjct: 1143 GLYSEEQTQIPKPGTTKEAFTEVNTERVVPGRLHPTEQAANGEGRQNDVSAKLHTDNTS- 1201

Query: 661  FSLFHFGGPVALSEGFKAEAV----TXXXXXXXXXXXXXXXXPDGCNRKD-SIEEYNLFA 825
            FSLFHFGGPVALS G K+  V                        CN+K+ +IEEYNLFA
Sbjct: 1202 FSLFHFGGPVALSTGCKSNPVPLKDEIVGELSSQFSVDHVENGHACNKKETTIEEYNLFA 1261

Query: 826  ASNGIKFSIY 855
            ASNGI+F  +
Sbjct: 1262 ASNGIRFPFF 1271


>ref|XP_004303344.1| PREDICTED: uncharacterized protein LOC101309464 [Fragaria vesca
            subsp. vesca]
          Length = 1267

 Score =  119 bits (297), Expect = 3e-24
 Identities = 92/280 (32%), Positives = 136/280 (48%), Gaps = 26/280 (9%)
 Frame = +1

Query: 94   FVDCHKV-VDDQITPTRQNANSHEPASGAINSLSSLPAETAIY-CESGKASVGVGMDVQP 267
            F +C++V +++ +   R         +G   +  + P    ++  ++  AS+G     Q 
Sbjct: 995  FTECNEVGIENNLNVKRGEFAESRAFTGLPPNEGTNPLTNVLHNFDTSAASMGS----QQ 1050

Query: 268  QSVLPTMMHNQNIQYPIYQAP-TMGCY---PVSWH---TNGLMSFHYSNHYLFANGFGYD 426
            QS+LP M  NQ + +P++QAP TMG Y   PVSW    TNGL+ F + NHYL+A+  GY 
Sbjct: 1051 QSMLPPMK-NQTVHFPVFQAPSTMGYYHQSPVSWPPAPTNGLLPFTHPNHYLYASPLGYG 1109

Query: 427  LNTNAQL-MHYGGLQSQHLPAPLINPLHMPVF-PTVTTQANVVCTKNQKVSNP-ETPTVA 597
            +N N+ L M Y  +Q   LP PL  P  +P+F P + T+      K+     P E  T  
Sbjct: 1110 INGNSGLCMQYSPMQQ--LPTPLFTPTPVPMFQPLINTEEQAQIFKSGVQEYPIEVNTDN 1167

Query: 598  VAEDGQNGMISEKVADKGSND-----------FSLFHFGGPVALSEGFKAEAVTXXXXXX 744
                G   M +    +   ND           FSLFHFGGPVALS G  +  +       
Sbjct: 1168 SDAIGHFSMQTSSTGEGAHNDNSGKLHMNNGGFSLFHFGGPVALSSGGNSNPMPSQEELV 1227

Query: 745  XXXXXXXXXXPDG---CNRKDSIEEYNLFAASNGIKFSIY 855
                       +    CN++ ++EEYNLFAASNG++F  +
Sbjct: 1228 RDSPIKHADHIENDHACNKEATMEEYNLFAASNGMRFKFF 1267


>ref|XP_006576869.1| PREDICTED: uncharacterized protein LOC100786822 isoform X3 [Glycine
            max]
          Length = 1266

 Score =  114 bits (286), Expect = 5e-23
 Identities = 111/308 (36%), Positives = 148/308 (48%), Gaps = 43/308 (13%)
 Frame = +1

Query: 61   RNMLHCLETTRFVDCHKVVDDQITPTRQNANSHEPASGAINSLSSLPAE-TAIYCESGKA 237
            RN   C+ET     CH+V  +      QNA S E  +   +SL  L  + T  Y      
Sbjct: 971  RNNSDCVETV-LSHCHEVAVEN----SQNA-SGEGLTRKSSSLIGLSLDGTRNYALGNLV 1024

Query: 238  SVGVGMD---------VQPQSVLPTMMHNQNIQYPIYQAPT-MGCY---PVSWH---TNG 369
                  D          Q QS+LP +  NQNI +P++QAP+ MG +   PVSW    TNG
Sbjct: 1025 ETAQNFDNCFSTTNVCSQLQSMLPPLS-NQNIHFPVFQAPSAMGYFHQNPVSWPAAPTNG 1083

Query: 370  LMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANV 546
            L+ F +SN YLFA   GY LN + +  + YG LQ    P  L NP  +PV+  V  +ANV
Sbjct: 1084 LIPFPHSNPYLFAGPLGYGLNEDPRFSLRYGALQQ---PTSLFNP-GVPVYQPVA-RANV 1138

Query: 547  VCTK-----NQKVSNPETPTVAVAEDG-QNGMISEKVADKGS-------------NDFSL 669
            +  +     ++  S PE    +VAE     G IS++ A  G              NDFSL
Sbjct: 1139 LNAEERTQVSKPASLPEHLNGSVAEMVFPAGPISKRPASHGEVRHDNSSKPLENKNDFSL 1198

Query: 670  FHFGGPVALSEGFKAEAVT----XXXXXXXXXXXXXXXXPDGCNRKD--SIEEYNLFAAS 831
            FHFGGPVALS G K+   +                       CN+K+  ++EEYNLFAAS
Sbjct: 1199 FHFGGPVALSTGCKSAFTSLNGDTVGDFSSKSSADHVEKVHNCNKKETPAMEEYNLFAAS 1258

Query: 832  NGIKFSIY 855
            N ++FSI+
Sbjct: 1259 NNLRFSIF 1266


>ref|XP_003520543.1| PREDICTED: uncharacterized protein LOC100786822 isoform X1 [Glycine
            max] gi|571445665|ref|XP_006576868.1| PREDICTED:
            uncharacterized protein LOC100786822 isoform X2 [Glycine
            max]
          Length = 1274

 Score =  114 bits (286), Expect = 5e-23
 Identities = 111/308 (36%), Positives = 148/308 (48%), Gaps = 43/308 (13%)
 Frame = +1

Query: 61   RNMLHCLETTRFVDCHKVVDDQITPTRQNANSHEPASGAINSLSSLPAE-TAIYCESGKA 237
            RN   C+ET     CH+V  +      QNA S E  +   +SL  L  + T  Y      
Sbjct: 979  RNNSDCVETV-LSHCHEVAVEN----SQNA-SGEGLTRKSSSLIGLSLDGTRNYALGNLV 1032

Query: 238  SVGVGMD---------VQPQSVLPTMMHNQNIQYPIYQAPT-MGCY---PVSWH---TNG 369
                  D          Q QS+LP +  NQNI +P++QAP+ MG +   PVSW    TNG
Sbjct: 1033 ETAQNFDNCFSTTNVCSQLQSMLPPLS-NQNIHFPVFQAPSAMGYFHQNPVSWPAAPTNG 1091

Query: 370  LMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQANV 546
            L+ F +SN YLFA   GY LN + +  + YG LQ    P  L NP  +PV+  V  +ANV
Sbjct: 1092 LIPFPHSNPYLFAGPLGYGLNEDPRFSLRYGALQQ---PTSLFNP-GVPVYQPVA-RANV 1146

Query: 547  VCTK-----NQKVSNPETPTVAVAEDG-QNGMISEKVADKGS-------------NDFSL 669
            +  +     ++  S PE    +VAE     G IS++ A  G              NDFSL
Sbjct: 1147 LNAEERTQVSKPASLPEHLNGSVAEMVFPAGPISKRPASHGEVRHDNSSKPLENKNDFSL 1206

Query: 670  FHFGGPVALSEGFKAEAVT----XXXXXXXXXXXXXXXXPDGCNRKD--SIEEYNLFAAS 831
            FHFGGPVALS G K+   +                       CN+K+  ++EEYNLFAAS
Sbjct: 1207 FHFGGPVALSTGCKSAFTSLNGDTVGDFSSKSSADHVEKVHNCNKKETPAMEEYNLFAAS 1266

Query: 832  NGIKFSIY 855
            N ++FSI+
Sbjct: 1267 NNLRFSIF 1274


>ref|XP_003553437.1| PREDICTED: uncharacterized protein LOC100813046 [Glycine max]
          Length = 1274

 Score =  114 bits (285), Expect = 7e-23
 Identities = 111/310 (35%), Positives = 151/310 (48%), Gaps = 45/310 (14%)
 Frame = +1

Query: 61   RNMLHCLETTRFVDCHKVVDDQITPTRQNAN----SHEPASGAINSLSSLP--------A 204
            RN L C+ET     CH    D      QNAN    +  P+S   +SL             
Sbjct: 978  RNNLDCVETV-LSHCH----DVSIVNSQNANGEGLTRNPSSLISSSLDGTRNYALGNPIV 1032

Query: 205  ETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAPT-MGCY---PVSWH---T 363
            ETA   ++  ++  V    Q QS+LP +  NQNI +P++QAP+ MG +   PVSW    T
Sbjct: 1033 ETAQNFDNCFSTTNVCS--QSQSMLPPVS-NQNIHFPVFQAPSAMGYFHQNPVSWPAAPT 1089

Query: 364  NGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTVTTQA 540
            NGL+ F +SN YL+A   GY LN + +  + YG LQ    P  L NP  +PV+  V + A
Sbjct: 1090 NGLIPFPHSNPYLYAGPLGYGLNEDHRFCLQYGALQQ---PTSLFNP-GVPVYQPVAS-A 1144

Query: 541  NVVCTK-----NQKVSNPETPTVAVAEDG-QNGMISEKVADKG-------------SNDF 663
            NV+  +     ++  S PE    + AE     G IS+K A  G             +NDF
Sbjct: 1145 NVLNAEERTRVSKTASLPEHLNGSFAERVFPAGPISKKPASHGEVRHDNSAKSLENNNDF 1204

Query: 664  SLFHFGGPVALSEGFKAEAVT----XXXXXXXXXXXXXXXXPDGCNRKD--SIEEYNLFA 825
            SLFHFGGPVALS G K+   +                       CN+K+  ++EEYNLFA
Sbjct: 1205 SLFHFGGPVALSTGCKSAFTSLNGDTVGDFSSKSSADHVEKVHNCNKKETPAMEEYNLFA 1264

Query: 826  ASNGIKFSIY 855
             SN ++FSI+
Sbjct: 1265 TSNNLRFSIF 1274


>ref|XP_006492833.1| PREDICTED: uncharacterized protein LOC102619076 [Citrus sinensis]
          Length = 1277

 Score =  113 bits (282), Expect = 1e-22
 Identities = 90/251 (35%), Positives = 124/251 (49%), Gaps = 26/251 (10%)
 Frame = +1

Query: 181  NSLSSLPAETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAPT-MGCY---P 348
            N   +LP +TA   + G  +  VG   Q Q + P + H+QN+Q P +Q P+ MG Y   P
Sbjct: 1033 NFSGNLPEKTAQNPDKGIPTASVGS--QHQGIFPPL-HSQNVQIPAFQPPSAMGYYHQNP 1089

Query: 349  VSWHT---NGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPV 516
            VSW     NGLM F + N YL+    GY LN N++L M YGG   QH+  P+ NP  +PV
Sbjct: 1090 VSWPAAPANGLMPFTHPNQYLYTGPLGYGLNGNSRLCMQYGG-ALQHVATPVFNPSPVPV 1148

Query: 517  FPTVTTQANVVCTKNQKVSNPETP----------TVAVAEDGQNGMISEKVADKGSND-F 663
            + ++  +AN +  K      P  P            A+A       +++      +ND F
Sbjct: 1149 YQSI-AKANSM-EKRPHDGKPGAPQEAFNDTNAERAALARSHLTDALAKGEGGHQNNDGF 1206

Query: 664  SLFHFGGPVALSEGFKAEAV----TXXXXXXXXXXXXXXXXPDGCNRKD-SIEEYNLFAA 828
            SLFHFGGPV LS G K   +                        CN+K+ +IE+YNLFAA
Sbjct: 1207 SLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSADHVENDHACNKKETTIEQYNLFAA 1266

Query: 829  S--NGIKFSIY 855
            S  NGI+FS +
Sbjct: 1267 SNGNGIRFSFF 1277


>ref|XP_006429914.1| hypothetical protein CICLE_v100109261mg, partial [Citrus clementina]
            gi|557531971|gb|ESR43154.1| hypothetical protein
            CICLE_v100109261mg, partial [Citrus clementina]
          Length = 769

 Score =  113 bits (282), Expect = 1e-22
 Identities = 88/251 (35%), Positives = 125/251 (49%), Gaps = 26/251 (10%)
 Frame = +1

Query: 181  NSLSSLPAETAIYCESGKASVGVGMDVQPQSVLPTMMHNQNIQYPIYQAPT-MGCY---P 348
            N   +LP +TA   + G  +V V    Q QS+ P + H+QN+Q P +Q P+ MG Y   P
Sbjct: 526  NFSGNLPEKTAQNPDKGIPTVSVSS--QHQSIFPPL-HSQNVQIPAFQPPSAMGYYHQNP 582

Query: 349  VSWHT---NGLMSFHYSNHYLFANGFGYDLNTNAQL-MHYGGLQSQHLPAPLINPLHMPV 516
            VSW     NGL+ F + N YL+    GY LN N++L M YG LQ  H+  P++NP  +PV
Sbjct: 583  VSWPAAPANGLVPFTHPNQYLYTGPLGYGLNGNSRLCMQYGALQ--HVATPVLNPSPVPV 640

Query: 517  FPTVTTQANVVCTKNQKVSNPETPTVAVAEDGQ-----------NGMISEKVADKGSNDF 663
            + ++  +AN +  K      P  P  A  +              + +   +   + ++ F
Sbjct: 641  YQSIA-KANSM-EKRTHDGKPGAPQEAFNDTNAERSAPARSHLTDALAKGEGGHQNNDGF 698

Query: 664  SLFHFGGPVALSEGFKAEAV----TXXXXXXXXXXXXXXXXPDGCNRKD-SIEEYNLFAA 828
            SLFHFGGPV LS G K   +                        CN+K+ +IEEYNLFAA
Sbjct: 699  SLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSADHVENDHACNKKETTIEEYNLFAA 758

Query: 829  S--NGIKFSIY 855
            S  NGI+FS +
Sbjct: 759  SNGNGIRFSFF 769


>ref|XP_004249188.1| PREDICTED: uncharacterized protein LOC101258014 [Solanum
            lycopersicum]
          Length = 1254

 Score =  110 bits (275), Expect = 1e-21
 Identities = 87/269 (32%), Positives = 123/269 (45%), Gaps = 18/269 (6%)
 Frame = +1

Query: 94   FVDCHKVVDDQITPTRQNANSHEPASGAINSLSSLPAETAIYCESGKASVGVGMDVQPQS 273
            F +C++V  ++ T     A   + +S   NS+ +          S  A+V   + ++PQS
Sbjct: 994  FAECYEVAQEKRTAA---AKGEDVSSLTPNSVGTTVGSFPTTAASTNANVNGTLGMRPQS 1050

Query: 274  VLPTMMHNQNIQYPIYQAPTMGCY---PVSWHT---NGLMSFHYSNHYLFANGFGYDLNT 435
            + P + H+Q   +P +Q P M  Y   P SW T   NG + F + NHY+FA  F Y LN 
Sbjct: 1051 LRPPV-HSQGTHFPRFQVPAMDYYYQTPPSWATTPVNGFIPFPHPNHYVFATPFSYGLNA 1109

Query: 436  NAQLMHYGGLQSQHLPAPLINPLHMPVFPTVTTQANVVCTKNQKVS-----NPETPTVAV 600
            NA  M +G L  QHL  P IN  H+PVF +V   ++    +N +VS       E     +
Sbjct: 1110 NAHFMQHGAL--QHLIPPPINHGHLPVFQSVAPTSDRCIKENARVSTVGRLKEEANVQRM 1167

Query: 601  AEDGQNGMISEKVADKGSND------FSLFHF-GGPVALSEGFKAEAVTXXXXXXXXXXX 759
            A  GQ+ M     A  G  +      FSLF F   P +L EG      +           
Sbjct: 1168 APVGQHTMEKSTTAGSGETEESRNSGFSLFSFTPDPFSLKEGMARNLSS-------NLRT 1220

Query: 760  XXXXXPDGCNRKDSIEEYNLFAASNGIKF 846
                   GCN+K+ IEEYN FA  N I+F
Sbjct: 1221 NHIAGESGCNKKEPIEEYNPFA--NRIEF 1247


>ref|XP_002881811.1| hypothetical protein ARALYDRAFT_321881 [Arabidopsis lyrata subsp.
            lyrata] gi|297327650|gb|EFH58070.1| hypothetical protein
            ARALYDRAFT_321881 [Arabidopsis lyrata subsp. lyrata]
          Length = 1218

 Score =  109 bits (272), Expect = 2e-21
 Identities = 93/286 (32%), Positives = 129/286 (45%), Gaps = 34/286 (11%)
 Frame = +1

Query: 100  DCHKVVDDQITPTRQNAN--------SHEPASGAINSLSSLPAETAIY-CESGKASVGVG 252
            DCH+ + +++T  R +          S+ PA    + LS  P        E+    V  G
Sbjct: 933  DCHEKMVEKVTEMRMDERDVLRIKNMSNLPADNGESKLSGTPFMVPSQNMENMVPGVNTG 992

Query: 253  MDV-QPQSVLPTMMHNQNIQYPIYQAP-TMGCY---PVSWHT---NGLMSFHYSNHYLFA 408
              + QPQ+++   M NQ+I  P++QAP TMG Y   PVSW +   NGLM F + NHY++ 
Sbjct: 993  SYLSQPQNMMFPQMLNQSIPLPVFQAPSTMGYYHQAPVSWSSAPANGLMQFPHPNHYVYT 1052

Query: 409  NGFGYDLNTNAQLMHYGGLQSQHLPAPLINPLHMPVF-PTVTTQANVVCTKNQKVSNPET 585
               GY LN  + L    G    H  AP  N   +PVF P   T       + Q +   E 
Sbjct: 1053 GPLGYSLNGESPLCVQYGTPLSHSAAPFFNSGAVPVFHPYAETTTMNTVDQAQALEPLEH 1112

Query: 586  PTVAVAEDGQ-NGMISEKVADKG------SNDFSLFHFGGPVALSEGFKAEAV-----TX 729
              +  A + + N M   +   +G        +FSLFHFGGPVALS G K+          
Sbjct: 1113 SFLKEANERKLNEMPPMETPRRGGLQHDSDENFSLFHFGGPVALSTGSKSNPARSKDGIL 1172

Query: 730  XXXXXXXXXXXXXXXPDGCNRKDSI----EEYNLFAASNGIKFSIY 855
                           P G ++KD      EEYNLFA SN ++FSI+
Sbjct: 1173 GDFSLQFSGDHVFGDPTGNSKKDKENTVGEEYNLFATSNSLRFSIF 1218


>ref|XP_004497878.1| PREDICTED: uncharacterized protein LOC101509839 isoform X1 [Cicer
            arietinum] gi|502122737|ref|XP_004497879.1| PREDICTED:
            uncharacterized protein LOC101509839 isoform X2 [Cicer
            arietinum]
          Length = 1253

 Score =  108 bits (270), Expect = 4e-21
 Identities = 91/285 (31%), Positives = 141/285 (49%), Gaps = 34/285 (11%)
 Frame = +1

Query: 103  CHKVVDDQITPTRQNANSHEPASGAINSLSSLPAETAIY----CESGKASVGVGMDVQPQ 270
            C+K V ++      NAN  + +S + +  S   AE+  +     E+G  S  V    QP+
Sbjct: 979  CYKAVIEKT----HNANGEDLSSRSPSVPSLDVAESEAFGNHVFENGFTSTNVCS--QPE 1032

Query: 271  SVLPTMMHNQNIQYPIYQAPT-MGCY---PVSWHT---NGLMSFHYSNHYLFANGFGYDL 429
            S+LP M  N+NIQ+P++Q P+ MG Y   PVSW +   NGLM F + N+YL++   GY+L
Sbjct: 1033 SMLPPMP-NRNIQFPVFQTPSAMGYYHQNPVSWQSAPANGLMPFVHPNNYLYSGPLGYNL 1091

Query: 430  NTNAQL-MHYGGLQSQHLPAPLINPLHMPVFPTV--------------TTQANVVCTKNQ 564
              + +  + YG LQ    P P  N   +PV+  V              +  A++    N+
Sbjct: 1092 TEDPRFCLQYGALQQ---PTPQFNSAAIPVYHPVARAKGLNGEELSQISKSASMQDHFNE 1148

Query: 565  KVSNPETPTVA----VAEDGQNGMISEKVADKGSNDFSLFHFGGPVALSEGFKAEAVTXX 732
             ++    P  A     A +G++   +   + + +  FSLFHFGGPVA S   K  A +  
Sbjct: 1149 SIAERVVPVAANSRKSALNGEDRYGNSAKSQESNGGFSLFHFGGPVAFSNERKTVAASSE 1208

Query: 733  XXXXXXXXXXXXXXPD--GCNRKDS--IEEYNLFAASNGIKFSIY 855
                              GC++K++  +EEYNLFAASN ++FSI+
Sbjct: 1209 NVGDFNSKISLDQVEKDRGCSKKETAFVEEYNLFAASNTLRFSIF 1253

