BLASTX nr result

ID: Perilla23_contig00000478 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00000478
         (1758 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011078982.1| PREDICTED: uncharacterized protein LOC105162...   325   6e-86
ref|XP_011075079.1| PREDICTED: uncharacterized protein LOC105159...   299   5e-78
ref|XP_012834423.1| PREDICTED: uncharacterized protein LOC105955...   286   3e-74
emb|CDP10180.1| unnamed protein product [Coffea canephora]            235   8e-59
ref|XP_003632681.1| PREDICTED: uncharacterized protein LOC100257...   191   2e-45
emb|CBI29995.3| unnamed protein product [Vitis vinifera]              172   1e-39
ref|XP_003625298.1| hypothetical protein MTR_7g093630 [Medicago ...   166   6e-38
ref|XP_006429914.1| hypothetical protein CICLE_v100109261mg, par...   165   1e-37
gb|KDO70881.1| hypothetical protein CISIN_1g000806mg [Citrus sin...   162   7e-37
gb|KDO70880.1| hypothetical protein CISIN_1g000806mg [Citrus sin...   162   7e-37
gb|KDO70879.1| hypothetical protein CISIN_1g000806mg [Citrus sin...   162   7e-37
ref|XP_006492833.1| PREDICTED: uncharacterized protein LOC102619...   158   1e-35
ref|XP_011039488.1| PREDICTED: uncharacterized protein LOC105136...   156   6e-35
ref|XP_004493617.1| PREDICTED: uncharacterized protein LOC101489...   155   1e-34
gb|KJB82794.1| hypothetical protein B456_013G213300 [Gossypium r...   154   2e-34
ref|XP_007203211.1| hypothetical protein PRUPE_ppa000350mg [Prun...   154   2e-34
ref|XP_007029040.1| Uncharacterized protein isoform 2 [Theobroma...   154   3e-34
ref|XP_007029039.1| Uncharacterized protein isoform 1 [Theobroma...   154   3e-34
gb|KRG95423.1| hypothetical protein GLYMA_19G149800 [Glycine max]     149   8e-33
gb|KRG95422.1| hypothetical protein GLYMA_19G149800 [Glycine max]     149   8e-33

>ref|XP_011078982.1| PREDICTED: uncharacterized protein LOC105162607 [Sesamum indicum]
          Length = 1221

 Score =  325 bits (834), Expect = 6e-86
 Identities = 201/428 (46%), Positives = 239/428 (55%), Gaps = 21/428 (4%)
 Frame = -1

Query: 1743 KVDTTESDRVHESSGIAXXXXXXXXXXXTHNKDNELRDLTRSGDEPCTDTENGFHAREKS 1564
            KV+ +E+D++ ESS  A           T+N+ N++R     G E C D EN FHA  KS
Sbjct: 809  KVENSETDQLPESSATANSNEAVDTSVETNNEANDVR-----GPENCGDRENQFHAMTKS 863

Query: 1563 ENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXXXXXXXXSEGDSNTFSSNGQNLXXX 1384
            E Y K   AEDGE C +ARSP R V                 SEGDSNT SSN QNL   
Sbjct: 864  EKYSKEAVAEDGEGCSLARSPHRRVDSSMSSSSNSDNCSSCLSEGDSNTSSSNPQNLEST 923

Query: 1383 XXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDANSQEPASGAANSSCSLP 1204
                        E R+  HCLESR  E   V++D+S TR  D   Q PASG  N+  SLP
Sbjct: 924  STSDSEESSPNSEARETSHCLESRSTECCSVLEDQSITRGHDTKGQTPASGITNTLGSLP 983

Query: 1203 EVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQYPIYQAPTMGYYRQSPVSWPAGPTDG 1024
               + YCESG+AN+  + Q QS  P  MH+Q+I YP++ AP+MGYY QSP+SW  GP +G
Sbjct: 984  TEVATYCESGRANISRSVQSQSVPP--MHSQNIPYPVFHAPSMGYYHQSPLSWQTGP-NG 1040

Query: 1023 LMSFHYSNHYLFANTNSFGYDLNTNAQLMQYGGLQXXXXXXXLINPVHMPVLPPVTQGNV 844
            LMS+ +SNHYLFA  N+FGYDLN N   MQYG LQ       L+NP HMPV P V Q N 
Sbjct: 1041 LMSYPHSNHYLFA--NAFGYDLNGNGGFMQYGALQ--HLAPPLLNPAHMPVYPLVAQANG 1096

Query: 843  VYPKDHATAASPPGVVVKEVVHHGI---QTGEGHS--------AGQT------DEGNNDF 715
            V  K+H    +   +     VHH I    + E HS        AGQ       D+GNN F
Sbjct: 1097 VSTKEHCKGTN---LCAPREVHHSINKVDSAETHSAETPTVVDAGQNGKSDKIDKGNNGF 1153

Query: 714  SLFHFGGPVALSTRFK----XXXXXXXXXXXXXXGDGNGNGNHPCNRKDSIEEYNLFAAS 547
            SLFHFGGPVALST F                    D + +GNHPCN+KDSIEEYNLFAAS
Sbjct: 1154 SLFHFGGPVALSTGFSADPVSLKEGTMGNTALDLSDNSADGNHPCNKKDSIEEYNLFAAS 1213

Query: 546  NGIKFSIF 523
            NGIKFSIF
Sbjct: 1214 NGIKFSIF 1221


>ref|XP_011075079.1| PREDICTED: uncharacterized protein LOC105159649 [Sesamum indicum]
          Length = 1277

 Score =  299 bits (766), Expect = 5e-78
 Identities = 185/428 (43%), Positives = 237/428 (55%), Gaps = 20/428 (4%)
 Frame = -1

Query: 1746 PKVDTTESDRVHESSGIAXXXXXXXXXXXTHNKDNELRDLTRSGDEPCTDTENGFHAREK 1567
            PKV+ +ESD++ ES   +            H+ D ++  LTRS  E C D +NGF + EK
Sbjct: 860  PKVEASESDQLPESCSTSSDEVTDISVHTNHD-DTDVLHLTRSRAENCGDLDNGFLSVEK 918

Query: 1566 SENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXXXXXXXXSEGDSNTFSSNGQNLXX 1387
             +N+ K E+  DGELC    S   T+                 SEG+SN +S N QNL  
Sbjct: 919  PQNHSK-EDVADGELCPTKSSAIGTLDSSMSSSSNSDNCSSCLSEGESNMYS-NPQNLES 976

Query: 1386 XXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDANSQEPASGAANSSCSL 1207
                         EGR+   C E+     H+VV+D++T+  Q+A SQ P S   NSS SL
Sbjct: 977  TSTSDSEESNQNSEGREASDCNENGITASHRVVEDQNTSSGQEAKSQGPVSAGTNSSGSL 1036

Query: 1206 PEVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQYPIYQAPTMGYYRQSPVSWPAGPTD 1027
             +  +  C++G+ NV V+AQPQ  LP  MHNQSI YP++QAP MGYY Q+PVSWPA PT+
Sbjct: 1037 LKEAAPDCDNGRVNVSVSAQPQCMLP-QMHNQSISYPLFQAPAMGYYHQNPVSWPAAPTN 1095

Query: 1026 GLMSFHYSNHYLFANTNSFGYDLNTNAQLMQYGGLQXXXXXXXLINPVHMPVLPPVTQGN 847
            GLMSF +SNHYL+ANT  FGY LN NA+ +QYG LQ       L+N  H+PV  PV+Q N
Sbjct: 1096 GLMSFPHSNHYLYANT--FGYGLNGNARFLQYGALQ--HLGPPLLNHAHVPVFQPVSQVN 1151

Query: 846  VVYPKDHATAASPPGVVVKEVVHH----------------GIQTGEGHSAGQTDEGNNDF 715
             V   + +  A   G  +KE  H                 G+  G+   A + D GNN F
Sbjct: 1152 GVSTNEPSKVAHVSG--LKEAQHSMQKVVSTDQHPANAPTGVDAGQNGKADKMDMGNNGF 1209

Query: 714  SLFHFGGPVALSTRFK----XXXXXXXXXXXXXXGDGNGNGNHPCNRKDSIEEYNLFAAS 547
            SLFHFGGPVALS+ FK                   D +  G+HP N+KDSIEEYNLFAA+
Sbjct: 1210 SLFHFGGPVALSSGFKADPVSLKDGIMGDASPNSSDNSPGGDHPSNKKDSIEEYNLFAAT 1269

Query: 546  NGIKFSIF 523
            NGIKFSIF
Sbjct: 1270 NGIKFSIF 1277


>ref|XP_012834423.1| PREDICTED: uncharacterized protein LOC105955253 [Erythranthe
            guttatus] gi|604336125|gb|EYU39971.1| hypothetical
            protein MIMGU_mgv1a000318mg [Erythranthe guttata]
          Length = 1263

 Score =  286 bits (733), Expect = 3e-74
 Identities = 185/424 (43%), Positives = 234/424 (55%), Gaps = 16/424 (3%)
 Frame = -1

Query: 1746 PKVDTTESDRVHESSGIAXXXXXXXXXXXTHNKDNELRDLTRSGDEPCTDTENGFHAREK 1567
            PKV  +ESD++ E    +            H +DN +RDL RS  E C D  +G   +E 
Sbjct: 848  PKVVASESDQLPECCSTSSDEVTDISVQANH-EDNNMRDLARSKAENCRDIGSGLQTKET 906

Query: 1566 SENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXXXXXXXXSEGDSNTFSSNGQNLXX 1387
              NY K   AE+GELC M RSP  T                  SEG++N + SN QNL  
Sbjct: 907  PGNYSKEAVAEEGELCSMTRSPLGTSDSSMNSSSNSDNCSSCLSEGENNNY-SNPQNLES 965

Query: 1386 XXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDANSQ-EPASGAANSSCS 1210
                         EG +   C+E+     H  V+++ST+R QDA SQ  P S   NS  S
Sbjct: 966  TSTSDSEESSHNSEGIETSCCVENGVTGSHGTVENQSTSRGQDAKSQAPPTSTGTNSVGS 1025

Query: 1209 LPEVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQYPIYQAPTMGYYRQSPVSWPAGPT 1030
            L +  + YCE+ KANV +  QPQS LP  MHN++I +P++QAPTMGYY Q+PVSW AGPT
Sbjct: 1026 LVKEAAPYCENTKANVSIGVQPQSVLP-QMHNKNINFPVFQAPTMGYYHQNPVSW-AGPT 1083

Query: 1029 DGLMSFHYSNHYLFANTNSFGYDLNTNAQLMQYGGLQXXXXXXXLINPVHMPVLPPVTQG 850
            +GLMSF +SNHYLFANT  +GY LN NA+ MQYG LQ       LIN VH+PV  PV+Q 
Sbjct: 1084 NGLMSFPHSNHYLFANT--YGYGLNGNARFMQYGALQ--HMPPQLINHVHVPVYQPVSQV 1139

Query: 849  NVVYPKDHATAASPPGVV-----VKEVVHHG-----IQTGEGHSAGQTDEGNNDFSLFHF 700
            N V   + A  A  PG+      +K+V H       +   +     + D GNN FSLFHF
Sbjct: 1140 NGVNLNEPAKVAHLPGLKEGQPRIKKVEHPAEVPTVLDAVQNGKPDKMDMGNNGFSLFHF 1199

Query: 699  GGPVALSTRFKXXXXXXXXXXXXXXGDGNG----NGNHPCNRKDSIEEYNLFAASN-GIK 535
            GGPVALST FK                 +     +G+H C++KDSIEEYNLFAA+N GIK
Sbjct: 1200 GGPVALSTGFKADPIPLKEGFMGNASPNSSINCTDGDHTCDKKDSIEEYNLFAATNGGIK 1259

Query: 534  FSIF 523
            FSI+
Sbjct: 1260 FSIY 1263


>emb|CDP10180.1| unnamed protein product [Coffea canephora]
          Length = 1251

 Score =  235 bits (600), Expect = 8e-59
 Identities = 156/413 (37%), Positives = 207/413 (50%), Gaps = 6/413 (1%)
 Frame = -1

Query: 1743 KVDTTESDRVHESSGIAXXXXXXXXXXXTHNKDNELRDLTRSGDEPCTDTENGFHAREKS 1564
            K +TTESD+  ESS  +             ++D +L  + +S  E   + ENGFH +EKS
Sbjct: 851  KGETTESDQAPESSIASSSDNLMGITIQIKHEDKDLHAVIKSEPEAERNGENGFHPKEKS 910

Query: 1563 ENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXXXXXXXXSEGDSNTFSSNGQNLXXX 1384
            + Y +  + EDGELC M+RS Q T+                 SEGDSN  SSN Q     
Sbjct: 911  QQYKEATD-EDGELCPMSRSLQATLDSSLSSSSNSDNCSSCLSEGDSNISSSNPQTTESS 969

Query: 1383 XXXXXXXXXXXXEGRDVLHCLESRFA--ERHKVVDDRSTTRRQDANSQEPASGAANSSCS 1210
                        EGR+   C +S     +   +V   +T   +    +     A N+  +
Sbjct: 970  SSSDSDDASQNSEGRETSVCFQSGITVCQDAGMVKGENTCGVEHVKGEVVNDAATNTWGT 1029

Query: 1209 LPEVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQYPIYQAPTMGYYRQSPVSWPAGPT 1030
            L    +   E+G+AN+ +NAQPQ  LP  +HNQS+Q+PI+Q+P MGYY QSP+SWPA PT
Sbjct: 1030 LSSKAN--SENGRANMSINAQPQVVLP-QLHNQSMQFPIFQSPPMGYYHQSPLSWPAAPT 1086

Query: 1029 DGLMSFHYSNHYLFANTNSFGYDLNTNAQLMQYGGLQXXXXXXXLINPVHMPVLPPVTQG 850
            +G M+F   NHYLFA  + FGY LN N+ LMQYG LQ       ++N  H+PV   V Q 
Sbjct: 1087 NGFMAFPSPNHYLFA--SPFGYGLNGNSHLMQYGTLQ--HPTPQMLNRSHVPVFQSVAQS 1142

Query: 849  NVVYPKDHATAASPPGVVVKEVVHHGIQTGEGHSAGQTDEGNNDFSLFHFGGPVALSTRF 670
            N +  KDH   ++  G +      H    G       +D  N  FSLFHFGGPV +    
Sbjct: 1143 NGINGKDHMKISNVGGTIET----HAGANGMNLKTEGSDVRNTGFSLFHFGGPVDVPPGL 1198

Query: 669  KXXXXXXXXXXXXXXGD----GNGNGNHPCNRKDSIEEYNLFAASNGIKFSIF 523
            K                     +  G+  CN+K SIEEYNLFAASNGIKFS F
Sbjct: 1199 KSEPASLKEEIGTDLSSKLSADHSEGDQTCNKKSSIEEYNLFAASNGIKFSFF 1251


>ref|XP_003632681.1| PREDICTED: uncharacterized protein LOC100257222 [Vitis vinifera]
          Length = 1284

 Score =  191 bits (485), Expect = 2e-45
 Identities = 141/407 (34%), Positives = 193/407 (47%), Gaps = 32/407 (7%)
 Frame = -1

Query: 1647 DNELRDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSP--QRTVXXXXX 1474
            DN L + + S     TD +NGFH  EK E Y   E A++        +P    T      
Sbjct: 884  DNHLNESSNSSSIMDTDCQNGFHVGEK-EPYYSTEAADEVTGLSSMTNPCLDETSEPTMS 942

Query: 1473 XXXXXXXXXXXXSEGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHK 1294
                        SEGDSNT SSN  NL               EGR+   C+++ F E H+
Sbjct: 943  STSNSDNCSSCLSEGDSNTASSNPLNLESSSTSDSEDASQQSEGRETSVCIQNGFPECHE 1002

Query: 1293 VV-------DDRSTTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSR 1135
            VV       + +   R + +    P S   +   + P  T+   +SGK NV + +Q Q  
Sbjct: 1003 VVVEKKQIENGKEAFRSKMSAGFSPDSARNSLPANAPTKTAQNLDSGKPNVSMGSQHQGM 1062

Query: 1134 LPAMMHNQSIQYPIYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDL 958
            LP M H Q++ YP++QAP TM YY Q+PVSWPA   +GLM F + NHYLF  T+  GY L
Sbjct: 1063 LPTM-HKQNLHYPMFQAPSTMSYYHQNPVSWPAASANGLMPFPHPNHYLF--TSPLGYGL 1119

Query: 957  NTNAQL-MQYGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGV--VVKE 787
            N +++L MQY  LQ       ++NP  +PV  P+T+ N V  ++        G      E
Sbjct: 1120 NGSSRLCMQYSALQ--HLTPPVLNPGQLPVYHPITKANGVNSEEQEKIFKTGGAQEAFNE 1177

Query: 786  VVHHGIQT--------------GEGHSAGQTDEGNNDFSLFHFGGPVALSTRFKXXXXXX 649
                 + +              G+  ++ +   GN  FSLFHFGGPVALST  K      
Sbjct: 1178 AKKERVPSAGPRPTDAPPNGDDGQNGNSAKLHTGNQSFSLFHFGGPVALSTGNKVNPVPS 1237

Query: 648  XXXXXXXXGD----GNGNGNHPCNRKD-SIEEYNLFAASNGIKFSIF 523
                           + +G+H CN+K+ +IEEYNLFAASNG+KFS F
Sbjct: 1238 KEGNVGDYSSKFSADHVDGDHACNKKETTIEEYNLFAASNGMKFSFF 1284


>emb|CBI29995.3| unnamed protein product [Vitis vinifera]
          Length = 1196

 Score =  172 bits (435), Expect = 1e-39
 Identities = 130/382 (34%), Positives = 174/382 (45%), Gaps = 7/382 (1%)
 Frame = -1

Query: 1647 DNELRDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXXXX 1468
            DN L + + S     TD +NGFH  E +     +    + + C    S            
Sbjct: 884  DNHLNESSNSSSIMDTDCQNGFHTSEPT-----MSSTSNSDNCSSCLS------------ 926

Query: 1467 XXXXXXXXXXSEGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVV 1288
                       EGDSNT SSN  NL               EGR+   C+++ F E     
Sbjct: 927  -----------EGDSNTASSNPLNLESSSTSDSEDASQQSEGRETSVCIQNGFPE----- 970

Query: 1287 DDRSTTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMMHNQS 1108
                      A +  PA+         P  T+   +SGK NV + +Q Q  LP M H Q+
Sbjct: 971  --------YSARNSLPANA--------PTKTAQNLDSGKPNVSMGSQHQGMLPTM-HKQN 1013

Query: 1107 IQYPIYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQ 934
            + YP++QAP TM YY Q+PVSWPA   +GLM F + NHYLF  T+  GY LN +++L MQ
Sbjct: 1014 LHYPMFQAPSTMSYYHQNPVSWPAASANGLMPFPHPNHYLF--TSPLGYGLNGSSRLCMQ 1071

Query: 933  YGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVKEVVHHGIQTGEG 754
            Y  LQ       ++NP  +PV  P+T+ N V  ++                    +TG  
Sbjct: 1072 YSALQ--HLTPPVLNPGQLPVYHPITKANGVNSEEQEKI---------------FKTGGA 1114

Query: 753  HSAGQTDEGNNDFSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGD----GNGNGNHPCNR 586
              A    +    FSLFHFGGPVALST  K                     + +G+H CN+
Sbjct: 1115 QEAFNEAKKERSFSLFHFGGPVALSTGNKVNPVPSKEGNVGDYSSKFSADHVDGDHACNK 1174

Query: 585  KD-SIEEYNLFAASNGIKFSIF 523
            K+ +IEEYNLFAASNG+KFS F
Sbjct: 1175 KETTIEEYNLFAASNGMKFSFF 1196


>ref|XP_003625298.1| hypothetical protein MTR_7g093630 [Medicago truncatula]
            gi|355500313|gb|AES81516.1| hypothetical protein
            MTR_7g093630 [Medicago truncatula]
          Length = 1261

 Score =  166 bits (420), Expect = 6e-38
 Identities = 119/331 (35%), Positives = 173/331 (52%), Gaps = 27/331 (8%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDA 1255
            EGD+NT SSN +N                E RD   C+E   ++ H+V  + +     ++
Sbjct: 939  EGDNNTTSSNHENQESSTTSDSEDVCQQSEVRDNSACVEKVLSDCHEVAMENNQNANGES 998

Query: 1254 NSQEPAS--GAA------NSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQY 1099
             S+  +S  GA+      ++S +  E+   +  +G +   V +QPQ+  P ++ NQ+IQ+
Sbjct: 999  LSRSSSSLTGASFDGTRSDASGNFVEIGHSF-GNGFSTTNVCSQPQNLFP-LVSNQNIQF 1056

Query: 1098 PIYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQYGG 925
            P +QAP TMGY+ Q+PVSWPA PT+GLM F + NHYL+A     GY LN + +  +QYG 
Sbjct: 1057 PAFQAPSTMGYFHQNPVSWPAAPTNGLMPFAHPNHYLYA--GPLGYGLNEDPRFCLQYGS 1114

Query: 924  LQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPP-------GVVVKEVVHHG-- 772
            LQ       + NP  +PV  PV + NV+  ++ A  + P        G + +  V  G  
Sbjct: 1115 LQ---QPTPMFNPA-IPVYQPVARANVLNAEEWAQVSKPASLQEHINGSIAERAVSSGNN 1170

Query: 771  ----IQTGE--GHSAGQTDEGNNDFSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGDGNG 610
                +  GE     + ++ E N DFSLFHFGGPVALST  K                 + 
Sbjct: 1171 LKIPVFNGEVKHDRSAKSQENNGDFSLFHFGGPVALSTGCKSALASSNGDVSLKSSADHA 1230

Query: 609  NGNHPCNRKD--SIEEYNLFAASNGIKFSIF 523
               H CN+KD  ++EEYNLFAASN ++FSIF
Sbjct: 1231 EKVHTCNKKDTTTMEEYNLFAASNNLRFSIF 1261


>ref|XP_006429914.1| hypothetical protein CICLE_v100109261mg, partial [Citrus clementina]
            gi|557531971|gb|ESR43154.1| hypothetical protein
            CICLE_v100109261mg, partial [Citrus clementina]
          Length = 769

 Score =  165 bits (418), Expect = 1e-37
 Identities = 134/394 (34%), Positives = 177/394 (44%), Gaps = 23/394 (5%)
 Frame = -1

Query: 1635 RDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXX 1456
            RDLT S D      +NG H   K   Y      +D  LC    S    +           
Sbjct: 384  RDLTHSTDGIY---QNGCHVEAKDAFYSTGAAYDDSGLCHARNSAFNGISDPIMGSSSNS 440

Query: 1455 XXXXXXS-EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKV---- 1291
                    EGDSNT SSN  NL               EGRD   C ++ F+E  +V    
Sbjct: 441  DNCSSCLSEGDSNTVSSNHGNLESSSTSDSEDASQQSEGRDTSACTQNGFSEFQEVGMGK 500

Query: 1290 ---VDDRSTTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMM 1120
                D   T  R+        S  +N S +LPE T+   + G   V V++Q QS  P + 
Sbjct: 501  KLITDGGETLGRRAFVGLPSDSMGSNFSGNLPEKTAQNPDKGIPTVSVSSQHQSIFPPL- 559

Query: 1119 HNQSIQYPIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQ 943
            H+Q++Q P +Q P+ MGYY Q+PVSWPA P +GL+ F + N YL+  T   GY LN N++
Sbjct: 560  HSQNVQIPAFQPPSAMGYYHQNPVSWPAAPANGLVPFTHPNQYLY--TGPLGYGLNGNSR 617

Query: 942  L-MQYGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVKEVVHHGIQ 766
            L MQYG LQ       ++NP  +PV   + + N +  + H      P     +       
Sbjct: 618  LCMQYGALQ--HVATPVLNPSPVPVYQSIAKANSMEKRTHDGKPGAPQEAFNDTNAERSA 675

Query: 765  TGEGH-----SAGQTDEGNND-FSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGD----G 616
                H     + G+    NND FSLFHFGGPV LST  K                     
Sbjct: 676  PARSHLTDALAKGEGGHQNNDGFSLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSAD 735

Query: 615  NGNGNHPCNRKD-SIEEYNLFAAS--NGIKFSIF 523
            +   +H CN+K+ +IEEYNLFAAS  NGI+FS F
Sbjct: 736  HVENDHACNKKETTIEEYNLFAASNGNGIRFSFF 769


>gb|KDO70881.1| hypothetical protein CISIN_1g000806mg [Citrus sinensis]
          Length = 748

 Score =  162 bits (411), Expect = 7e-37
 Identities = 132/394 (33%), Positives = 177/394 (44%), Gaps = 23/394 (5%)
 Frame = -1

Query: 1635 RDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXX 1456
            RDL+ S D      +NG H   K   Y      +D  LC    S    +           
Sbjct: 363  RDLSHSTDGIY---QNGCHVEAKGAFYSTGAAYDDSGLCHTRNSTFNGISDPIMGSSSNS 419

Query: 1455 XXXXXXS-EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKV---- 1291
                    EGDSNT SSN  NL               EGRD   C ++ F+E  +V    
Sbjct: 420  DNCSSCLSEGDSNTVSSNHGNLESSSTSDSEDASQQSEGRDTSACTQNGFSEFQEVGMGK 479

Query: 1290 ---VDDRSTTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMM 1120
                D   T  R+        S  +N S +LPE T+   + G   V V++Q QS  P + 
Sbjct: 480  KLITDGGETLGRRAFVGLPSDSMGSNFSGNLPEKTAQNPDKGIPTVSVSSQHQSIFPPL- 538

Query: 1119 HNQSIQYPIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQ 943
            H+Q++Q P +Q P+ MGYY Q+PVSWPA P +GL+ F + N YL+  T   GY LN N++
Sbjct: 539  HSQNVQIPAFQPPSAMGYYHQNPVSWPAAPANGLVPFTHPNQYLY--TGPLGYGLNGNSR 596

Query: 942  L-MQYGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVKEVVHHGIQ 766
            L MQYG LQ       ++NP  +PV   + + N +  + H      P     +       
Sbjct: 597  LCMQYGALQ--HVATPVLNPSPVPVYQSIAKANSMEKRTHDGKPGAPQEAFNDTNAERSA 654

Query: 765  TGEGH-----SAGQTDEGNND-FSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGD----G 616
                H     + G+    NND FSLFHFGGPV LST  K                     
Sbjct: 655  PARSHLTDALAKGEGGHQNNDGFSLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSAD 714

Query: 615  NGNGNHPCNRKD-SIEEYNLFAAS--NGIKFSIF 523
            +   +H CN+K+ +IE+YNLFAAS  NGI+FS F
Sbjct: 715  HVENDHACNKKETTIEQYNLFAASNGNGIRFSFF 748


>gb|KDO70880.1| hypothetical protein CISIN_1g000806mg [Citrus sinensis]
          Length = 994

 Score =  162 bits (411), Expect = 7e-37
 Identities = 132/394 (33%), Positives = 177/394 (44%), Gaps = 23/394 (5%)
 Frame = -1

Query: 1635 RDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXX 1456
            RDL+ S D      +NG H   K   Y      +D  LC    S    +           
Sbjct: 609  RDLSHSTDGIY---QNGCHVEAKGAFYSTGAAYDDSGLCHTRNSTFNGISDPIMGSSSNS 665

Query: 1455 XXXXXXS-EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKV---- 1291
                    EGDSNT SSN  NL               EGRD   C ++ F+E  +V    
Sbjct: 666  DNCSSCLSEGDSNTVSSNHGNLESSSTSDSEDASQQSEGRDTSACTQNGFSEFQEVGMGK 725

Query: 1290 ---VDDRSTTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMM 1120
                D   T  R+        S  +N S +LPE T+   + G   V V++Q QS  P + 
Sbjct: 726  KLITDGGETLGRRAFVGLPSDSMGSNFSGNLPEKTAQNPDKGIPTVSVSSQHQSIFPPL- 784

Query: 1119 HNQSIQYPIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQ 943
            H+Q++Q P +Q P+ MGYY Q+PVSWPA P +GL+ F + N YL+  T   GY LN N++
Sbjct: 785  HSQNVQIPAFQPPSAMGYYHQNPVSWPAAPANGLVPFTHPNQYLY--TGPLGYGLNGNSR 842

Query: 942  L-MQYGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVKEVVHHGIQ 766
            L MQYG LQ       ++NP  +PV   + + N +  + H      P     +       
Sbjct: 843  LCMQYGALQ--HVATPVLNPSPVPVYQSIAKANSMEKRTHDGKPGAPQEAFNDTNAERSA 900

Query: 765  TGEGH-----SAGQTDEGNND-FSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGD----G 616
                H     + G+    NND FSLFHFGGPV LST  K                     
Sbjct: 901  PARSHLTDALAKGEGGHQNNDGFSLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSAD 960

Query: 615  NGNGNHPCNRKD-SIEEYNLFAAS--NGIKFSIF 523
            +   +H CN+K+ +IE+YNLFAAS  NGI+FS F
Sbjct: 961  HVENDHACNKKETTIEQYNLFAASNGNGIRFSFF 994


>gb|KDO70879.1| hypothetical protein CISIN_1g000806mg [Citrus sinensis]
          Length = 1276

 Score =  162 bits (411), Expect = 7e-37
 Identities = 132/394 (33%), Positives = 177/394 (44%), Gaps = 23/394 (5%)
 Frame = -1

Query: 1635 RDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXX 1456
            RDL+ S D      +NG H   K   Y      +D  LC    S    +           
Sbjct: 891  RDLSHSTDGIY---QNGCHVEAKGAFYSTGAAYDDSGLCHTRNSTFNGISDPIMGSSSNS 947

Query: 1455 XXXXXXS-EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKV---- 1291
                    EGDSNT SSN  NL               EGRD   C ++ F+E  +V    
Sbjct: 948  DNCSSCLSEGDSNTVSSNHGNLESSSTSDSEDASQQSEGRDTSACTQNGFSEFQEVGMGK 1007

Query: 1290 ---VDDRSTTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMM 1120
                D   T  R+        S  +N S +LPE T+   + G   V V++Q QS  P + 
Sbjct: 1008 KLITDGGETLGRRAFVGLPSDSMGSNFSGNLPEKTAQNPDKGIPTVSVSSQHQSIFPPL- 1066

Query: 1119 HNQSIQYPIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQ 943
            H+Q++Q P +Q P+ MGYY Q+PVSWPA P +GL+ F + N YL+  T   GY LN N++
Sbjct: 1067 HSQNVQIPAFQPPSAMGYYHQNPVSWPAAPANGLVPFTHPNQYLY--TGPLGYGLNGNSR 1124

Query: 942  L-MQYGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVKEVVHHGIQ 766
            L MQYG LQ       ++NP  +PV   + + N +  + H      P     +       
Sbjct: 1125 LCMQYGALQ--HVATPVLNPSPVPVYQSIAKANSMEKRTHDGKPGAPQEAFNDTNAERSA 1182

Query: 765  TGEGH-----SAGQTDEGNND-FSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGD----G 616
                H     + G+    NND FSLFHFGGPV LST  K                     
Sbjct: 1183 PARSHLTDALAKGEGGHQNNDGFSLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSAD 1242

Query: 615  NGNGNHPCNRKD-SIEEYNLFAAS--NGIKFSIF 523
            +   +H CN+K+ +IE+YNLFAAS  NGI+FS F
Sbjct: 1243 HVENDHACNKKETTIEQYNLFAASNGNGIRFSFF 1276


>ref|XP_006492833.1| PREDICTED: uncharacterized protein LOC102619076 [Citrus sinensis]
          Length = 1277

 Score =  158 bits (400), Expect = 1e-35
 Identities = 131/394 (33%), Positives = 175/394 (44%), Gaps = 23/394 (5%)
 Frame = -1

Query: 1635 RDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXXXXXXXX 1456
            RDL+ S D      +NG H   K   Y      +D  LC    S    +           
Sbjct: 891  RDLSHSTDGIY---QNGCHVEAKGAFYSTGAAYDDSGLCHTRNSTFNGISDPIMGSSSNS 947

Query: 1455 XXXXXXS-EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERH------ 1297
                    EGDSNT SSN  NL               EGRD   C ++ F+E        
Sbjct: 948  DNCSSCLSEGDSNTVSSNHGNLESSSTSDSEDASQQSEGRDTSACTQNGFSEFQEVGMGK 1007

Query: 1296 KVVDDRSTTRRQDANSQEPA-SGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMM 1120
            K++ D   T  + A    P+ S  +N S +LPE T+   + G     V +Q Q   P + 
Sbjct: 1008 KLITDGGETLGRGAFVGLPSDSMGSNFSGNLPEKTAQNPDKGIPTASVGSQHQGIFPPL- 1066

Query: 1119 HNQSIQYPIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQ 943
            H+Q++Q P +Q P+ MGYY Q+PVSWPA P +GLM F + N YL+  T   GY LN N++
Sbjct: 1067 HSQNVQIPAFQPPSAMGYYHQNPVSWPAAPANGLMPFTHPNQYLY--TGPLGYGLNGNSR 1124

Query: 942  L-MQYGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVKEVVHHGIQ 766
            L MQYGG         + NP  +PV   + + N +  + H      P     +       
Sbjct: 1125 LCMQYGG-ALQHVATPVFNPSPVPVYQSIAKANSMEKRPHDGKPGAPQEAFNDTNAERAA 1183

Query: 765  TGEGH-----SAGQTDEGNND-FSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGD----G 616
                H     + G+    NND FSLFHFGGPV LST  K                     
Sbjct: 1184 LARSHLTDALAKGEGGHQNNDGFSLFHFGGPVGLSTGCKVNPMPSKDEIVGNFSSQFSAD 1243

Query: 615  NGNGNHPCNRKD-SIEEYNLFAAS--NGIKFSIF 523
            +   +H CN+K+ +IE+YNLFAAS  NGI+FS F
Sbjct: 1244 HVENDHACNKKETTIEQYNLFAASNGNGIRFSFF 1277


>ref|XP_011039488.1| PREDICTED: uncharacterized protein LOC105136031 [Populus euphratica]
          Length = 1278

 Score =  156 bits (394), Expect = 6e-35
 Identities = 124/343 (36%), Positives = 168/343 (48%), Gaps = 39/343 (11%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDA 1255
            EGDSNT SSN ++                EGRD   C  + F+  H++V D   +   D 
Sbjct: 952  EGDSNTVSSNNEHPESSSTSDSEDTSPQSEGRDTSTCSGNGFSNSHELVLDNKPSTNGDE 1011

Query: 1254 --NSQEP----ASGAANSSCSLPEVTSIYC-ESGKANVGVNAQPQSRLPAMMHNQSIQYP 1096
               S++P      G   ++   P  T++   ++G   V V  Q Q   P + HN ++Q+P
Sbjct: 1012 VFGSKKPFELQPDGLRLNTLGNPPTTTVQNPDNGIPTVSVGLQRQVVFPPV-HNHNLQFP 1070

Query: 1095 IYQAP-TMGYYR-QSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQYGG 925
            ++QAP TMGYY  Q+PVSWPA P +GLM F   NHYL+A   S GY LN N++  MQYG 
Sbjct: 1071 VFQAPSTMGYYHHQTPVSWPAAPANGLMPFPQPNHYLYA--GSLGYGLNGNSRFCMQYGP 1128

Query: 924  LQXXXXXXXLINPVHMPVLPPV---------------------TQGN---VVYPKDHATA 817
            +Q       + NP  +PV  PV                     T+ N   +V  K  +T 
Sbjct: 1129 VQ--HLATPVFNPSPVPVYQPVAKEYGLNSEVRTETRMMQETLTEANKERMVPAKSRSTE 1186

Query: 816  ASPPGVVVKEVVHHGIQTGEGHSAGQTDEGNNDFSLFHFGGPVALSTRFKXXXXXXXXXX 637
            A P G           ++G+  ++ +   G++ FSLFHFGGPVALST  K          
Sbjct: 1187 APPSG-----------ESGKVDNSAKLPNGSSGFSLFHFGGPVALSTGCKSDPVLSKNGI 1235

Query: 636  XXXXGD---GNGNGNHP-CNRKD-SIEEYNLFAASNGIKFSIF 523
                      N   N P CN+K+ ++EEYNLFAASNGIKFSIF
Sbjct: 1236 IGDFSSKVTTNQIENDPACNKKEIAMEEYNLFAASNGIKFSIF 1278


>ref|XP_004493617.1| PREDICTED: uncharacterized protein LOC101489385 [Cicer arietinum]
          Length = 1264

 Score =  155 bits (391), Expect = 1e-34
 Identities = 117/335 (34%), Positives = 164/335 (48%), Gaps = 31/335 (9%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFA-------ERHKVVDDRS 1276
            EGD+NT SSN  N                E RD   C+E   +       E ++  +  +
Sbjct: 938  EGDNNTTSSNHDNQESSTTSDSEDVSQQSEVRDNSACVEKALSDCPEVPMENNQNANGET 997

Query: 1275 TTRRQDANSQEPASGAANS-SCSLPEVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQY 1099
              R   +       G  +S S +  E+   + ++G +   V +QPQS LPA+  NQ+IQ+
Sbjct: 998  FVRSSSSLISRSLDGTRSSASGNFAEIAQNF-DNGFSTTNVCSQPQSMLPAVS-NQNIQF 1055

Query: 1098 PIYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQYGG 925
            P + AP T+GY+ QSPVSWPA PT+GLM F + NHYL+A     GY LN +    +QYG 
Sbjct: 1056 PAFHAPSTIGYFHQSPVSWPAAPTNGLMPFPHPNHYLYA--GPLGYGLNEDPHFCLQYGA 1113

Query: 924  LQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPP-------GVVVKEVVHHGIQ 766
            LQ       L NP  +PV  PV + NV+  ++    + P        G + +  V  G  
Sbjct: 1114 LQ---QPAPLFNPA-VPVYQPVARANVLNVEEWTRVSKPASLQEHINGSIAERAVSSGTN 1169

Query: 765  TGEGHSAGQ--------TDEGNNDFSLFHFGGPVALSTRFK----XXXXXXXXXXXXXXG 622
              +   +G+        + E N+DFSLFHFGGPVALST  K                   
Sbjct: 1170 YKKPEFSGEVKHDRSAKSQENNSDFSLFHFGGPVALSTGCKSSLAFSNGNAADDFSLKSS 1229

Query: 621  DGNGNGNHPCNRKD--SIEEYNLFAASNGIKFSIF 523
              +    H CN+K+  ++EEYNLFAASN ++FSIF
Sbjct: 1230 ADHAEKVHTCNKKETTTMEEYNLFAASNNLRFSIF 1264


>gb|KJB82794.1| hypothetical protein B456_013G213300 [Gossypium raimondii]
          Length = 1258

 Score =  154 bits (389), Expect = 2e-34
 Identities = 119/335 (35%), Positives = 161/335 (48%), Gaps = 31/335 (9%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAE-------RHKVVDDRS 1276
            EGDSNT +SN  NL               + RD   C+E+ F+E       + +  D   
Sbjct: 931  EGDSNTSASNHGNLESSSTSDSEDACQQSDRRDASICIENGFSECQVKGMDKKQDADGGV 990

Query: 1275 TTRRQDANSQEPASGAANSSCSLPEVTSIYCESGKANVGVNAQPQSRLPAMMHNQSIQYP 1096
               RQ     +P      +  +LP  T+   ++GK    + +Q Q    ++ HNQ IQ+P
Sbjct: 991  ALERQALFGHQPDGTGNKAPGNLPTKTAENSDNGKPTAFMGSQHQGMFTSV-HNQHIQFP 1049

Query: 1095 IYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQYGGL 922
            +Y  P TMGYY QSPVSWPA P +GL+ F   N YL+  T   GY LN N+ L M YG L
Sbjct: 1050 VYPTPSTMGYYHQSPVSWPATPANGLVPFP-PNPYLY--TGPLGYGLNGNSHLCMPYGAL 1106

Query: 921  QXXXXXXXLINPVHMPVLPPVTQGNVVY-------PKDHATAASPPGVVVKEVVHHGIQT 763
            Q         NP  +PV  PV++ N +Y       PK   T+ +      + VV   +  
Sbjct: 1107 Q--HLAAPPFNPDPVPVYQPVSEANGLYAEERTLIPKPGRTSEAFTEFSAERVVPGRLHA 1164

Query: 762  GEGHSAGQ---------TDEGNNDFSLFHFGGPVALSTRFK-----XXXXXXXXXXXXXX 625
             E  + G+         ++  ++ FSLFHFGGPVALST  K                   
Sbjct: 1165 TEKTAIGEVWQNDVSVKSNADDSSFSLFHFGGPVALSTGCKTSPVPLKDEIVEELSSQFS 1224

Query: 624  GDGNGNGNHPCNRKDS-IEEYNLFAASNGIKFSIF 523
             D   NG H CN+K+S IE+YNLFAASNG++FS F
Sbjct: 1225 ADHVENG-HGCNKKESTIEQYNLFAASNGLRFSFF 1258


>ref|XP_007203211.1| hypothetical protein PRUPE_ppa000350mg [Prunus persica]
            gi|462398742|gb|EMJ04410.1| hypothetical protein
            PRUPE_ppa000350mg [Prunus persica]
          Length = 1257

 Score =  154 bits (389), Expect = 2e-34
 Identities = 120/398 (30%), Positives = 183/398 (45%), Gaps = 21/398 (5%)
 Frame = -1

Query: 1653 NKDNELRDLTRSGDEPCTDTENGFHAREKSENYIKVEEAEDGELCFMARSPQRTVXXXXX 1474
            ++DN L++L +S        +NGFHA  +      ++ A +G    M  S   +      
Sbjct: 881  DEDNNLKELRKSSIGMDVSCQNGFHAGAQDS----IDTALNGISDSMVGSSSNS------ 930

Query: 1473 XXXXXXXXXXXXSEGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHK 1294
                        SEGDSNT SSN  N                 G++    +++ F E H 
Sbjct: 931  -----DNCSSCLSEGDSNTTSSNHGNQESSSTSDSEDASQKSGGKETSLSIQNGFPECHG 985

Query: 1293 VVDDRSTTRRQDANSQE---PASGAANSSCSLPEVTSIY--CESGKANVGVNAQPQSRLP 1129
            + +++   R +   S+    P+   A S+      T+I    ++G + + V +Q    L 
Sbjct: 986  MENNQDAKRGESMESRALSGPSLNGAGSNILGNPSTNIAQRFDNGLSAISVGSQHHGMLT 1045

Query: 1128 AMMHNQSIQYPIYQAPTMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTN 949
             M HNQ++ +P++QAP+MGYY QS VSWPA PT G+MSF + NHYL+A     GY +N N
Sbjct: 1046 PM-HNQNVHFPLFQAPSMGYYHQSSVSWPAAPTSGMMSFPHPNHYLYA--GPLGYGMNGN 1102

Query: 948  AQL-MQYGGLQXXXXXXXLINPVHMPVLPPVT---QGNVVYP-------KDHATAASPPG 802
            +   M Y  +Q          PV  P+ P +    Q  +  P       + +  +  P G
Sbjct: 1103 SGFCMPYSPVQHVPTPLFTPGPV--PIYPAINTEEQTQISNPGVQESLYEANTESVDPSG 1160

Query: 801  VVVKEVVHHGIQTGEGHSAGQTDEGNNDFSLFHFGGPVA----LSTRFKXXXXXXXXXXX 634
                +    G +  E  ++G+    N+ FSLFH+GGP+A     ++              
Sbjct: 1161 PYSMQAPASG-ERAEDDNSGRLHTSNDSFSLFHYGGPLADPPGCNSNLMPLEEQTVGDFP 1219

Query: 633  XXXGDGNGNGNHPCNRKD-SIEEYNLFAASNGIKFSIF 523
                D   N +H CN+K+ +IEEYNLFAASNGI+FS F
Sbjct: 1220 QKCSDHVENDHHACNKKEATIEEYNLFAASNGIRFSFF 1257


>ref|XP_007029040.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508717645|gb|EOY09542.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1174

 Score =  154 bits (388), Expect = 3e-34
 Identities = 127/340 (37%), Positives = 160/340 (47%), Gaps = 36/340 (10%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDA 1255
            EGDSNT SSN  NL               +GRD   C ++ F+E    V  +   ++QD 
Sbjct: 847  EGDSNTSSSNHGNLESSSTSDSEDASQQSDGRDTSVCHQNGFSE----VQVKGMDKKQDV 902

Query: 1254 NSQ----------EPASGAANSSCSLPEV-TSIYCESGKANVGVNAQPQSRLPAMMHNQS 1108
            N                G  N     P   T+   ++GK    + +Q Q    ++ HNQ 
Sbjct: 903  NGGVALGSQALFGNTPDGRGNKVPGNPLTKTAENSDNGKPTAVMGSQHQGMFTSV-HNQH 961

Query: 1107 IQYPIYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQ 934
            IQ+P+YQAP TMGYY Q+PVSWPA P +GLM F   N YL+A     GY LN N++L M 
Sbjct: 962  IQFPVYQAPSTMGYYHQNPVSWPASPANGLMPFP-PNPYLYA--GPLGYGLNGNSRLCMP 1018

Query: 933  YGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVK---EVVHHGIQT 763
            YG LQ       L NP  +PV  PV++ N +Y ++  T    PG   +   EV    +  
Sbjct: 1019 YGTLQ--HLATPLFNPGPVPVYQPVSKVNGLYSEEQ-TQIPKPGTTKEAFTEVNTERVVP 1075

Query: 762  GEGHSAGQTDEG--------------NNDFSLFHFGGPVALSTRFK-----XXXXXXXXX 640
            G  H   Q   G              N  FSLFHFGGPVALST  K              
Sbjct: 1076 GRLHPTEQAANGEGRQNDVSAKLHTDNTSFSLFHFGGPVALSTGCKSNPVPLKDEIVGEL 1135

Query: 639  XXXXXGDGNGNGNHPCNRKD-SIEEYNLFAASNGIKFSIF 523
                  D   NG H CN+K+ +IEEYNLFAASNGI+F  F
Sbjct: 1136 SSQFSVDHVENG-HACNKKETTIEEYNLFAASNGIRFPFF 1174


>ref|XP_007029039.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508717644|gb|EOY09541.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1271

 Score =  154 bits (388), Expect = 3e-34
 Identities = 127/340 (37%), Positives = 160/340 (47%), Gaps = 36/340 (10%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDA 1255
            EGDSNT SSN  NL               +GRD   C ++ F+E    V  +   ++QD 
Sbjct: 944  EGDSNTSSSNHGNLESSSTSDSEDASQQSDGRDTSVCHQNGFSE----VQVKGMDKKQDV 999

Query: 1254 NSQ----------EPASGAANSSCSLPEV-TSIYCESGKANVGVNAQPQSRLPAMMHNQS 1108
            N                G  N     P   T+   ++GK    + +Q Q    ++ HNQ 
Sbjct: 1000 NGGVALGSQALFGNTPDGRGNKVPGNPLTKTAENSDNGKPTAVMGSQHQGMFTSV-HNQH 1058

Query: 1107 IQYPIYQAP-TMGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQ 934
            IQ+P+YQAP TMGYY Q+PVSWPA P +GLM F   N YL+A     GY LN N++L M 
Sbjct: 1059 IQFPVYQAPSTMGYYHQNPVSWPASPANGLMPFP-PNPYLYA--GPLGYGLNGNSRLCMP 1115

Query: 933  YGGLQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDHATAASPPGVVVK---EVVHHGIQT 763
            YG LQ       L NP  +PV  PV++ N +Y ++  T    PG   +   EV    +  
Sbjct: 1116 YGTLQ--HLATPLFNPGPVPVYQPVSKVNGLYSEEQ-TQIPKPGTTKEAFTEVNTERVVP 1172

Query: 762  GEGHSAGQTDEG--------------NNDFSLFHFGGPVALSTRFK-----XXXXXXXXX 640
            G  H   Q   G              N  FSLFHFGGPVALST  K              
Sbjct: 1173 GRLHPTEQAANGEGRQNDVSAKLHTDNTSFSLFHFGGPVALSTGCKSNPVPLKDEIVGEL 1232

Query: 639  XXXXXGDGNGNGNHPCNRKD-SIEEYNLFAASNGIKFSIF 523
                  D   NG H CN+K+ +IEEYNLFAASNGI+F  F
Sbjct: 1233 SSQFSVDHVENG-HACNKKETTIEEYNLFAASNGIRFPFF 1271


>gb|KRG95423.1| hypothetical protein GLYMA_19G149800 [Glycine max]
          Length = 1236

 Score =  149 bits (376), Expect = 8e-33
 Identities = 118/337 (35%), Positives = 168/337 (49%), Gaps = 33/337 (9%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDA 1255
            EGD+NT SS+ +N                E R+ L C+E+  +  H V    S     + 
Sbjct: 909  EGDNNTTSSSHENTESSITSDSEDASRQSELRNNLDCVETVLSHCHDVSIVNSQNANGEG 968

Query: 1254 NSQEPAS-------GAANSSCSLPEV-TSIYCESGKANVGVNAQPQSRLPAMMHNQSIQY 1099
             ++ P+S       G  N +   P V T+   ++  +   V +Q QS LP +  NQ+I +
Sbjct: 969  LTRNPSSLISSSLDGTRNYALGNPIVETAQNFDNCFSTTNVCSQSQSMLPPVS-NQNIHF 1027

Query: 1098 PIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQYGG 925
            P++QAP+ MGY+ Q+PVSWPA PT+GL+ F +SN YL+A     GY LN + +  +QYG 
Sbjct: 1028 PVFQAPSAMGYFHQNPVSWPAAPTNGLIPFPHSNPYLYA--GPLGYGLNEDHRFCLQYGA 1085

Query: 924  LQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDH-----------------ATAASPPGVV 796
            LQ       L NP  +PV  PV   NV+  ++                  A    P G +
Sbjct: 1086 LQ---QPTSLFNP-GVPVYQPVASANVLNAEERTRVSKTASLPEHLNGSFAERVFPAGPI 1141

Query: 795  VKEVVHHGIQTGEGHSAGQTDEGNNDFSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGDG 616
             K+   HG +    +SA ++ E NNDFSLFHFGGPVALST  K                 
Sbjct: 1142 SKKPASHG-EVRHDNSA-KSLENNNDFSLFHFGGPVALSTGCKSAFTSLNGDTVGDFSSK 1199

Query: 615  NGNGN----HPCNRKD--SIEEYNLFAASNGIKFSIF 523
            +   +    H CN+K+  ++EEYNLFA SN ++FSIF
Sbjct: 1200 SSADHVEKVHNCNKKETPAMEEYNLFATSNNLRFSIF 1236


>gb|KRG95422.1| hypothetical protein GLYMA_19G149800 [Glycine max]
          Length = 1242

 Score =  149 bits (376), Expect = 8e-33
 Identities = 118/337 (35%), Positives = 168/337 (49%), Gaps = 33/337 (9%)
 Frame = -1

Query: 1434 EGDSNTFSSNGQNLXXXXXXXXXXXXXXXEGRDVLHCLESRFAERHKVVDDRSTTRRQDA 1255
            EGD+NT SS+ +N                E R+ L C+E+  +  H V    S     + 
Sbjct: 915  EGDNNTTSSSHENTESSITSDSEDASRQSELRNNLDCVETVLSHCHDVSIVNSQNANGEG 974

Query: 1254 NSQEPAS-------GAANSSCSLPEV-TSIYCESGKANVGVNAQPQSRLPAMMHNQSIQY 1099
             ++ P+S       G  N +   P V T+   ++  +   V +Q QS LP +  NQ+I +
Sbjct: 975  LTRNPSSLISSSLDGTRNYALGNPIVETAQNFDNCFSTTNVCSQSQSMLPPVS-NQNIHF 1033

Query: 1098 PIYQAPT-MGYYRQSPVSWPAGPTDGLMSFHYSNHYLFANTNSFGYDLNTNAQL-MQYGG 925
            P++QAP+ MGY+ Q+PVSWPA PT+GL+ F +SN YL+A     GY LN + +  +QYG 
Sbjct: 1034 PVFQAPSAMGYFHQNPVSWPAAPTNGLIPFPHSNPYLYA--GPLGYGLNEDHRFCLQYGA 1091

Query: 924  LQXXXXXXXLINPVHMPVLPPVTQGNVVYPKDH-----------------ATAASPPGVV 796
            LQ       L NP  +PV  PV   NV+  ++                  A    P G +
Sbjct: 1092 LQ---QPTSLFNP-GVPVYQPVASANVLNAEERTRVSKTASLPEHLNGSFAERVFPAGPI 1147

Query: 795  VKEVVHHGIQTGEGHSAGQTDEGNNDFSLFHFGGPVALSTRFKXXXXXXXXXXXXXXGDG 616
             K+   HG +    +SA ++ E NNDFSLFHFGGPVALST  K                 
Sbjct: 1148 SKKPASHG-EVRHDNSA-KSLENNNDFSLFHFGGPVALSTGCKSAFTSLNGDTVGDFSSK 1205

Query: 615  NGNGN----HPCNRKD--SIEEYNLFAASNGIKFSIF 523
            +   +    H CN+K+  ++EEYNLFA SN ++FSIF
Sbjct: 1206 SSADHVEKVHNCNKKETPAMEEYNLFATSNNLRFSIF 1242

