BLASTX nr result

ID: Astragalus23_contig00020242 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00020242
         (1182 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAP90379.1| Hyp O-arabinosyltransferase homolog [Lotus japon...   634   0.0  
ref|XP_004511039.1| PREDICTED: uncharacterized protein LOC101490...   627   0.0  
ref|XP_020240277.1| hydroxyproline O-arabinosyltransferase 3-lik...   623   0.0  
ref|XP_003627860.1| NOD3, putative [Medicago truncatula] >gi|355...   622   0.0  
ref|XP_006593477.1| PREDICTED: uncharacterized protein LOC100820...   622   0.0  
ref|XP_019453517.1| PREDICTED: hydroxyproline O-arabinosyltransf...   621   0.0  
ref|XP_013445092.1| NOD3, putative [Medicago truncatula] >gi|657...   619   0.0  
ref|XP_003546712.1| PREDICTED: uncharacterized protein LOC100787...   618   0.0  
gb|KHN13066.1| hypothetical protein glysoja_043756 [Glycine soja...   614   0.0  
ref|XP_015936987.1| hydroxyproline O-arabinosyltransferase 3 [Ar...   613   0.0  
ref|XP_014522808.1| hydroxyproline O-arabinosyltransferase 3 [Vi...   613   0.0  
ref|XP_007133751.1| hypothetical protein PHAVU_011G206200g [Phas...   612   0.0  
ref|XP_007133750.1| hypothetical protein PHAVU_011G206200g [Phas...   612   0.0  
ref|XP_017432158.1| PREDICTED: uncharacterized protein LOC108339...   611   0.0  
ref|NP_001241917.1| uncharacterized protein LOC100820233 [Glycin...   582   0.0  
ref|XP_017985179.1| PREDICTED: uncharacterized protein LOC185872...   574   0.0  
gb|EOY19771.1| Uncharacterized protein TCM_045112 isoform 1 [The...   574   0.0  
ref|XP_021297242.1| hydroxyproline O-arabinosyltransferase 3-lik...   573   0.0  
ref|XP_022764737.1| hydroxyproline O-arabinosyltransferase 3-lik...   570   0.0  
gb|OMO73416.1| hypothetical protein CCACVL1_17273 [Corchorus cap...   570   0.0  

>dbj|BAP90379.1| Hyp O-arabinosyltransferase homolog [Lotus japonicus]
          Length = 360

 Score =  634 bits (1634), Expect = 0.0
 Identities = 299/344 (86%), Positives = 317/344 (92%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            M R SPLL+I LV GSSFATYNLVTMI+ YGSSESVAT DGALFFDPIIEMP+HVKNRKT
Sbjct: 1    MARASPLLIIFLVFGSSFATYNLVTMIIRYGSSESVATDDGALFFDPIIEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YNQWQCRIMYYWYK+Q+N PGSEMGGFTRILHSGKPDNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNQWQCRIMYYWYKRQRNSPGSEMGGFTRILHSGKPDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDH+FVRPLPNLA+GE+PA
Sbjct: 121  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHVFVRPLPNLAYGENPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI+PDQNEK++RK+YPEEKGPVTNIDPIGNSPVII+ D+IAKIAPTWMN+SLKMK
Sbjct: 181  AFPFFYIRPDQNEKIIRKYYPEEKGPVTNIDPIGNSPVIIKTDVIAKIAPTWMNVSLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYAIASALH VRHILRKDFMLQPPWDLET+NK+IIHYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAIASALHGVRHILRKDFMLQPPWDLETHNKYIIHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            LKGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  LKGELTYGKIGEWRFDKRSHLRGPPPRNLPLPPPGVPESVVTLV 344


>ref|XP_004511039.1| PREDICTED: uncharacterized protein LOC101490765 [Cicer arietinum]
          Length = 361

 Score =  627 bits (1618), Expect = 0.0
 Identities = 297/345 (86%), Positives = 315/345 (91%), Gaps = 1/345 (0%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGA-LFFDPIIEMPEHVKNRK 326
            M R SP+L+ICLVLG+SFATYNLVTMI+HYGSSE+VAT DG  LFFDPIIEMPEHVKNRK
Sbjct: 1    MARASPILLICLVLGTSFATYNLVTMIIHYGSSENVATDDGGGLFFDPIIEMPEHVKNRK 60

Query: 327  TSKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIP 506
            TSK  FHVALTATDA YN+WQCRIMYYWYKKQ+ LPGSEMGGFTRILHSGK DNLMDEIP
Sbjct: 61   TSKVLFHVALTATDAPYNKWQCRIMYYWYKKQRGLPGSEMGGFTRILHSGKADNLMDEIP 120

Query: 507  TVVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHP 686
            T VVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEY+LMAEPDH+FVRPLPNLAFGEHP
Sbjct: 121  TAVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYILMAEPDHVFVRPLPNLAFGEHP 180

Query: 687  AAFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKM 866
            AAFPFFYIKP++NEK+VRKFYPEE GPVTNIDPIGNSPVIIRKDLI+KIAPTWMN+SLKM
Sbjct: 181  AAFPFFYIKPNENEKIVRKFYPEENGPVTNIDPIGNSPVIIRKDLISKIAPTWMNVSLKM 240

Query: 867  KEDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDY 1046
            K+DPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET+NKFIIHYTYGCDY
Sbjct: 241  KQDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETHNKFIIHYTYGCDY 300

Query: 1047 NLKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            NLKGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  NLKGELTYGKIGEWRFDKRSHLQGPPPKNLPLPPPGVPESVVTLV 345


>ref|XP_020240277.1| hydroxyproline O-arabinosyltransferase 3-like [Cajanus cajan]
          Length = 360

 Score =  623 bits (1606), Expect = 0.0
 Identities = 292/344 (84%), Positives = 312/344 (90%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR SPLL+I LVLGSSFATYNLVTM+MHYGSSE VA  DGALFFDPIIEMP+HVKNRKT
Sbjct: 1    MGRASPLLLIFLVLGSSFATYNLVTMLMHYGSSEGVAIDDGALFFDPIIEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            S+ PFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSG PDNLMDEIPT
Sbjct: 61   SRTPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGNPDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAGLDRGYIVLNRPWAFVQWL+KAKIEEEYVLMAEPDHIF+RPLPNLA+G HPA
Sbjct: 121  VVVDPLPAGLDRGYIVLNRPWAFVQWLQKAKIEEEYVLMAEPDHIFLRPLPNLAYGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI+PDQNEK++RKFYPEE GPVT +DPIGNSPVIIRKDLI+KIAPTWMNISLKMK
Sbjct: 181  AFPFFYIRPDQNEKIIRKFYPEELGPVTKVDPIGNSPVIIRKDLISKIAPTWMNISLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETNKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>ref|XP_003627860.1| NOD3, putative [Medicago truncatula]
 gb|AET02336.1| NOD3, putative [Medicago truncatula]
          Length = 360

 Score =  622 bits (1605), Expect = 0.0
 Identities = 289/344 (84%), Positives = 312/344 (90%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            M R SPLL+ICLVLGSSFATYNLVTMI+HYGS++S+AT DG LFFDPI+EMPEHVKN KT
Sbjct: 1    MARASPLLMICLVLGSSFATYNLVTMIIHYGSADSLATEDGGLFFDPIVEMPEHVKNTKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFH+ALTATDA YN+WQCRIMYYWYKKQ++LPGSEMGGFTRILHSGK DNLMDEIPT
Sbjct: 61   SKAPFHIALTATDAIYNKWQCRIMYYWYKKQRSLPGSEMGGFTRILHSGKADNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLP GLDRGY+VLNRPWAFVQWLEKA IEEEY+LMAEPDH+FVRPLPNLAFGE+PA
Sbjct: 121  VVVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHVFVRPLPNLAFGENPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYIKP +NEK+VRK+YPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNIS+KMK
Sbjct: 181  AFPFFYIKPKENEKIVRKYYPEENGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISMKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMY YA+ASALH VRHILRKDFMLQPPWD ET+NK+IIHYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYGYAVASALHGVRHILRKDFMLQPPWDTETFNKYIIHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            LKGELTYGK+GEWRFDKRSH                 ESV TLV
Sbjct: 301  LKGELTYGKIGEWRFDKRSHLRGPPPRNLPLPPPGVPESVATLV 344


>ref|XP_006593477.1| PREDICTED: uncharacterized protein LOC100820233 isoform X1 [Glycine
            max]
 gb|KHN37679.1| hypothetical protein glysoja_007382 [Glycine soja]
 gb|KRH20715.1| hypothetical protein GLYMA_13G196300 [Glycine max]
          Length = 360

 Score =  622 bits (1604), Expect = 0.0
 Identities = 294/344 (85%), Positives = 309/344 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S LL+I LVLGSSFATYN+VTMI HYGSSE VA  DGALFFDPI EMP+HVKNRKT
Sbjct: 1    MGRASSLLIIFLVLGSSFATYNVVTMIRHYGSSEGVAVNDGALFFDPITEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SK PFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSG PDNLMDEIPT
Sbjct: 61   SKVPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGNPDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAGLDRGYIVLNRPWAFVQWLEK KIEEEYVLMAEPDHIFVRPLPNLA+G HPA
Sbjct: 121  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKTKIEEEYVLMAEPDHIFVRPLPNLAYGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI+PD+NEK++RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNISLKMK
Sbjct: 181  AFPFFYIRPDENEKIIRKFYPEELGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET  K+IIHYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETNKKYIIHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGKVGEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKVGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>ref|XP_019453517.1| PREDICTED: hydroxyproline O-arabinosyltransferase 3-like [Lupinus
            angustifolius]
 gb|OIW06137.1| hypothetical protein TanjilG_22359 [Lupinus angustifolius]
          Length = 360

 Score =  621 bits (1601), Expect = 0.0
 Identities = 295/344 (85%), Positives = 309/344 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            M R SPLL+I LVLGSSFATYNLVTMI+HYGSSESVA   GAL  DPI EMP HVKNRKT
Sbjct: 1    MARASPLLLIFLVLGSSFATYNLVTMIIHYGSSESVAIDGGALLLDPITEMPAHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAG+DRGY+VLNRPWAFVQWLE+  IEEEYVLMAEPDH+FVRPLPNLA G HPA
Sbjct: 121  VVVDPLPAGVDRGYVVLNRPWAFVQWLERTTIEEEYVLMAEPDHVFVRPLPNLAHGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI+PDQNEKV+RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMN+SLKMK
Sbjct: 181  AFPFFYIRPDQNEKVIRKFYPEEYGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNVSLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET NK+IIHYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETTNKYIIHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            LKGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  LKGELTYGKIGEWRFDKRSHLRGPPPRNLPLPPPGVPESVVTLV 344


>ref|XP_013445092.1| NOD3, putative [Medicago truncatula]
 gb|KEH19118.1| NOD3, putative [Medicago truncatula]
          Length = 359

 Score =  619 bits (1596), Expect = 0.0
 Identities = 283/320 (88%), Positives = 306/320 (95%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            M R SPLL+ICLVLGSSFATYNLVTMI+HYGS++S+AT DG LFFDPI+EMPEHVKN KT
Sbjct: 1    MARASPLLMICLVLGSSFATYNLVTMIIHYGSADSLATEDGGLFFDPIVEMPEHVKNTKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFH+ALTATDA YN+WQCRIMYYWYKKQ++LPGSEMGGFTRILHSGK DNLMDEIPT
Sbjct: 61   SKAPFHIALTATDAIYNKWQCRIMYYWYKKQRSLPGSEMGGFTRILHSGKADNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLP GLDRGY+VLNRPWAFVQWLEKA IEEEY+LMAEPDH+FVRPLPNLAFGE+PA
Sbjct: 121  VVVDPLPEGLDRGYVVLNRPWAFVQWLEKANIEEEYILMAEPDHVFVRPLPNLAFGENPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYIKP +NEK+VRK+YPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNIS+KMK
Sbjct: 181  AFPFFYIKPKENEKIVRKYYPEENGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISMKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMY YA+ASALH VRHILRKDFMLQPPWD ET+NK+IIHYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYGYAVASALHGVRHILRKDFMLQPPWDTETFNKYIIHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSH 1109
            LKGELTYGK+GEWRFDKRSH
Sbjct: 301  LKGELTYGKIGEWRFDKRSH 320


>ref|XP_003546712.1| PREDICTED: uncharacterized protein LOC100787652 [Glycine max]
 ref|XP_006598107.1| PREDICTED: uncharacterized protein LOC100787652 [Glycine max]
 ref|XP_006598108.1| PREDICTED: uncharacterized protein LOC100787652 [Glycine max]
 ref|XP_014623714.1| PREDICTED: uncharacterized protein LOC100787652 [Glycine max]
 gb|KRH13369.1| hypothetical protein GLYMA_15G234500 [Glycine max]
 gb|KRH13370.1| hypothetical protein GLYMA_15G234500 [Glycine max]
          Length = 364

 Score =  618 bits (1593), Expect = 0.0
 Identities = 291/344 (84%), Positives = 309/344 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S LL++ LVLGSSFATYN+VTMI HYGSSE VA  DGALFFDPI EMP+HVKNRKT
Sbjct: 1    MGRASLLLIVFLVLGSSFATYNVVTMIRHYGSSEGVAVDDGALFFDPITEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSG PDNLM+EIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGNPDNLMNEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAGLDRGYIVLNRPWAFVQWLEK KIEEEYVLMAEPDHIF+RPLPNLAFG HPA
Sbjct: 121  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKTKIEEEYVLMAEPDHIFLRPLPNLAFGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI+PDQNEK +RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNISLKMK
Sbjct: 181  AFPFFYIRPDQNEKTIRKFYPEELGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASA+H VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASAVHGVRHILRKDFMLQPPWDLETNKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>gb|KHN13066.1| hypothetical protein glysoja_043756 [Glycine soja]
 gb|KRH13371.1| hypothetical protein GLYMA_15G234500 [Glycine max]
          Length = 347

 Score =  614 bits (1583), Expect = 0.0
 Identities = 289/343 (84%), Positives = 307/343 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S LL++ LVLGSSFATYN+VTMI HYGSSE VA  DGALFFDPI EMP+HVKNRKT
Sbjct: 1    MGRASLLLIVFLVLGSSFATYNVVTMIRHYGSSEGVAVDDGALFFDPITEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSG PDNLM+EIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGNPDNLMNEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAGLDRGYIVLNRPWAFVQWLEK KIEEEYVLMAEPDHIF+RPLPNLAFG HPA
Sbjct: 121  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKTKIEEEYVLMAEPDHIFLRPLPNLAFGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI+PDQNEK +RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNISLKMK
Sbjct: 181  AFPFFYIRPDQNEKTIRKFYPEELGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASA+H VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASAVHGVRHILRKDFMLQPPWDLETNKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTL 1178
            +KGELTYGK+GEWRFDKRSH                 ESVV L
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVCL 343


>ref|XP_015936987.1| hydroxyproline O-arabinosyltransferase 3 [Arachis duranensis]
 ref|XP_015936989.1| hydroxyproline O-arabinosyltransferase 3 [Arachis duranensis]
 ref|XP_015936990.1| hydroxyproline O-arabinosyltransferase 3 [Arachis duranensis]
 ref|XP_016171430.1| hydroxyproline O-arabinosyltransferase 3 [Arachis ipaensis]
 ref|XP_016171431.1| hydroxyproline O-arabinosyltransferase 3 [Arachis ipaensis]
 ref|XP_016171432.1| hydroxyproline O-arabinosyltransferase 3 [Arachis ipaensis]
 ref|XP_016171433.1| hydroxyproline O-arabinosyltransferase 3 [Arachis ipaensis]
 ref|XP_020965456.1| hydroxyproline O-arabinosyltransferase 3 [Arachis ipaensis]
 ref|XP_020965457.1| hydroxyproline O-arabinosyltransferase 3 [Arachis ipaensis]
          Length = 360

 Score =  613 bits (1581), Expect = 0.0
 Identities = 288/344 (83%), Positives = 307/344 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR SPLL+I LVLGSSFATYN+VTM++ YGSSE VA  DG L FDPIIEMP H KNRK 
Sbjct: 1    MGRTSPLLIIFLVLGSSFATYNVVTMLIRYGSSEGVAFSDGLLLFDPIIEMPAHAKNRKV 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCRIMYYWYK+QKN+PGSEMGGFTRILHSGKPDNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRIMYYWYKRQKNMPGSEMGGFTRILHSGKPDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPLPAG+DRGYIVLNRPWAFVQWLEKA IEEEYVLMAEPDHIFVRPLPNLA+G HPA
Sbjct: 121  VVVDPLPAGMDRGYIVLNRPWAFVQWLEKATIEEEYVLMAEPDHIFVRPLPNLAYGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYI P++NEK++RKF+PEE GPVTN+DPIGNSPVIIRKDLI KIAPTWMN+SLKMK
Sbjct: 181  AFPFFYITPEKNEKIIRKFFPEEHGPVTNVDPIGNSPVIIRKDLIEKIAPTWMNVSLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDL T NKFIIHYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLSTENKFIIHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPRNLPLPPPGVPESVVTLV 344


>ref|XP_014522808.1| hydroxyproline O-arabinosyltransferase 3 [Vigna radiata var. radiata]
 ref|XP_014522809.1| hydroxyproline O-arabinosyltransferase 3 [Vigna radiata var. radiata]
 ref|XP_022632246.1| hydroxyproline O-arabinosyltransferase 3 [Vigna radiata var. radiata]
          Length = 360

 Score =  613 bits (1581), Expect = 0.0
 Identities = 288/344 (83%), Positives = 307/344 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S  L+I +VLGSSFATYNLVTM+  YGSSE VA  DG LFFDPI EMP+HVKNR+T
Sbjct: 1    MGRASTFLIIFIVLGSSFATYNLVTMLRRYGSSEGVAVSDGGLFFDPIFEMPDHVKNRRT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSGK DNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGKSDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPL AGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIF+RPLPNLA+G HPA
Sbjct: 121  VVVDPLAAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFLRPLPNLAYGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYIKPDQNEK++RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMN+SLKMK
Sbjct: 181  AFPFFYIKPDQNEKIIRKFYPEEYGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNVSLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETRKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>ref|XP_007133751.1| hypothetical protein PHAVU_011G206200g [Phaseolus vulgaris]
 gb|ESW05745.1| hypothetical protein PHAVU_011G206200g [Phaseolus vulgaris]
          Length = 360

 Score =  612 bits (1579), Expect = 0.0
 Identities = 289/344 (84%), Positives = 306/344 (88%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S LL+I LVLGSSFATYNLVTM+ HYGSSE V   DG LFFDPI EMP+HVKNRKT
Sbjct: 1    MGRASTLLIIFLVLGSSFATYNLVTMLRHYGSSEGVTVADGGLFFDPITEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSGK DNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGKSDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPL  G+DRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIF+RPLPNLA+  HPA
Sbjct: 121  VVVDPLAEGMDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFLRPLPNLAYQGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYIKPDQNEK++RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNISLKMK
Sbjct: 181  AFPFFYIKPDQNEKIIRKFYPEEYGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETNKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>ref|XP_007133750.1| hypothetical protein PHAVU_011G206200g [Phaseolus vulgaris]
 gb|ESW05744.1| hypothetical protein PHAVU_011G206200g [Phaseolus vulgaris]
          Length = 365

 Score =  612 bits (1579), Expect = 0.0
 Identities = 289/344 (84%), Positives = 306/344 (88%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S LL+I LVLGSSFATYNLVTM+ HYGSSE V   DG LFFDPI EMP+HVKNRKT
Sbjct: 1    MGRASTLLIIFLVLGSSFATYNLVTMLRHYGSSEGVTVADGGLFFDPITEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSGK DNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGKSDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPL  G+DRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIF+RPLPNLA+  HPA
Sbjct: 121  VVVDPLAEGMDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFLRPLPNLAYQGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYIKPDQNEK++RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMNISLKMK
Sbjct: 181  AFPFFYIKPDQNEKIIRKFYPEEYGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETNKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>ref|XP_017432158.1| PREDICTED: uncharacterized protein LOC108339536 [Vigna angularis]
 ref|XP_017432159.1| PREDICTED: uncharacterized protein LOC108339536 [Vigna angularis]
 gb|KOM49208.1| hypothetical protein LR48_Vigan08g003500 [Vigna angularis]
 dbj|BAT89281.1| hypothetical protein VIGAN_06019900 [Vigna angularis var. angularis]
          Length = 360

 Score =  611 bits (1576), Expect = 0.0
 Identities = 287/344 (83%), Positives = 307/344 (89%)
 Frame = +3

Query: 150  MGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKT 329
            MGR S  L+I +VLGSSFATYNLVTM+  YGSSE VA  DG L FDPI+EMP+HVKNRKT
Sbjct: 1    MGRASTFLIIFIVLGSSFATYNLVTMLRRYGSSEGVAVPDGRLLFDPILEMPDHVKNRKT 60

Query: 330  SKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPT 509
            SKAPFHVALTATDA YN+WQCR+MYYWYK+QK LPGSEMGGFTRILHSGK DNLMDEIPT
Sbjct: 61   SKAPFHVALTATDAPYNKWQCRVMYYWYKQQKKLPGSEMGGFTRILHSGKSDNLMDEIPT 120

Query: 510  VVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPA 689
            VVVDPL AGLDRGY+VLNRPWAFVQWLEKAKIEEEYVLMAEPDHIF+RPLPNLA+G HPA
Sbjct: 121  VVVDPLAAGLDRGYVVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFLRPLPNLAYGGHPA 180

Query: 690  AFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMK 869
            AFPFFYIKPDQNEK++RKFYPEE GPVTN+DPIGNSPVIIRKDLIAKIAPTWMN+SLKMK
Sbjct: 181  AFPFFYIKPDQNEKIIRKFYPEEYGPVTNVDPIGNSPVIIRKDLIAKIAPTWMNVSLKMK 240

Query: 870  EDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYN 1049
            EDPETDKAFGWVLEMYAYA+ASALH VRHILRKDFMLQPPWDLET  K+I+HYTYGCDYN
Sbjct: 241  EDPETDKAFGWVLEMYAYAVASALHGVRHILRKDFMLQPPWDLETRKKYILHYTYGCDYN 300

Query: 1050 LKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            +KGELTYGK+GEWRFDKRSH                 ESVVTLV
Sbjct: 301  MKGELTYGKIGEWRFDKRSHLRGPPPKNLPLPPPGVPESVVTLV 344


>ref|NP_001241917.1| uncharacterized protein LOC100820233 [Glycine max]
 gb|ACU21224.1| unknown [Glycine max]
          Length = 335

 Score =  582 bits (1500), Expect = 0.0
 Identities = 272/319 (85%), Positives = 285/319 (89%)
 Frame = +3

Query: 225  MIMHYGSSESVATGDGALFFDPIIEMPEHVKNRKTSKAPFHVALTATDATYNQWQCRIMY 404
            MI HYGSSE V   DGALFFDPI EMP+HVKNRKTSK PFHVALTATDA YN+WQCR+MY
Sbjct: 1    MIRHYGSSEGVVVNDGALFFDPITEMPDHVKNRKTSKVPFHVALTATDAPYNKWQCRVMY 60

Query: 405  YWYKKQKNLPGSEMGGFTRILHSGKPDNLMDEIPTVVVDPLPAGLDRGYIVLNRPWAFVQ 584
            YWYK+QK LPGSEMGGFTRILHSG PDNLMDEIPTVVVDPLP GLDRGYIVLNRPWAFVQ
Sbjct: 61   YWYKQQKKLPGSEMGGFTRILHSGNPDNLMDEIPTVVVDPLPVGLDRGYIVLNRPWAFVQ 120

Query: 585  WLEKAKIEEEYVLMAEPDHIFVRPLPNLAFGEHPAAFPFFYIKPDQNEKVVRKFYPEEKG 764
            WLEK KIEEEYVLMAEPDHIFVRPLPNLA+G HPAAFPFFYI+PD+NEK++RKFYPEE G
Sbjct: 121  WLEKTKIEEEYVLMAEPDHIFVRPLPNLAYGGHPAAFPFFYIRPDENEKIIRKFYPEELG 180

Query: 765  PVTNIDPIGNSPVIIRKDLIAKIAPTWMNISLKMKEDPETDKAFGWVLEMYAYAIASALH 944
            PVTN+DPIGNSPVIIRKDLIAKIAPTWMNISLKMKEDPETDKAFGWVLEMYAYA+ASALH
Sbjct: 181  PVTNVDPIGNSPVIIRKDLIAKIAPTWMNISLKMKEDPETDKAFGWVLEMYAYAVASALH 240

Query: 945  DVRHILRKDFMLQPPWDLETYNKFIIHYTYGCDYNLKGELTYGKVGEWRFDKRSHXXXXX 1124
             VRHILRKDFMLQPPWDLET  K+IIHYTYGCDYN+KGELTYGKVGEWRFDKRSH     
Sbjct: 241  GVRHILRKDFMLQPPWDLETNKKYIIHYTYGCDYNMKGELTYGKVGEWRFDKRSHLRGPP 300

Query: 1125 XXXXXXXXXXXXESVVTLV 1181
                        ESVVTLV
Sbjct: 301  PKNLPLPPPGVPESVVTLV 319


>ref|XP_017985179.1| PREDICTED: uncharacterized protein LOC18587202 [Theobroma cacao]
 ref|XP_017985180.1| PREDICTED: uncharacterized protein LOC18587202 [Theobroma cacao]
 ref|XP_017985181.1| PREDICTED: uncharacterized protein LOC18587202 [Theobroma cacao]
 ref|XP_007010962.2| PREDICTED: uncharacterized protein LOC18587202 [Theobroma cacao]
 ref|XP_017985182.1| PREDICTED: uncharacterized protein LOC18587202 [Theobroma cacao]
          Length = 368

 Score =  574 bits (1479), Expect = 0.0
 Identities = 270/349 (77%), Positives = 298/349 (85%), Gaps = 2/349 (0%)
 Frame = +3

Query: 141  RKTMGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGD--GALFFDPIIEMPEHV 314
            RK MGR+SPLL++ LVLG  FATYNLVTM+MH  S       D  G +FFDP+IEMPE+V
Sbjct: 4    RKNMGRLSPLLLVTLVLGFCFATYNLVTMVMHNRSISKWKVNDSNGGIFFDPVIEMPENV 63

Query: 315  KNRKTSKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLM 494
            K    +K PFHVALTATDA Y++WQCR+MYYWYKKQK+LPGSEMGGFTR+LHSG PDNL+
Sbjct: 64   KKPNNAKQPFHVALTATDAPYSKWQCRVMYYWYKKQKDLPGSEMGGFTRVLHSGSPDNLV 123

Query: 495  DEIPTVVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAF 674
            DEIPTV+VDPLPAGLDRGYIVLNRPWAFVQWLEKA IEEEY+LMAEPDHIF+RPLPNL  
Sbjct: 124  DEIPTVIVDPLPAGLDRGYIVLNRPWAFVQWLEKATIEEEYILMAEPDHIFIRPLPNLGH 183

Query: 675  GEHPAAFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNI 854
            G +PAAFPFFYIKP QNEK++RKFYPEE GPVTNIDPIGNSPVII+KDL+ KIAPTWMN+
Sbjct: 184  GGYPAAFPFFYIKPAQNEKLLRKFYPEEMGPVTNIDPIGNSPVIIKKDLLEKIAPTWMNV 243

Query: 855  SLKMKEDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTY 1034
            SLKMK DPETDKAFGWVLEMYAYA+ASALH V+HILRKDFMLQPPWDLET  KFIIHYTY
Sbjct: 244  SLKMKNDPETDKAFGWVLEMYAYAVASALHGVQHILRKDFMLQPPWDLETGKKFIIHYTY 303

Query: 1035 GCDYNLKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            GCDYN+KGELTYGK+GEWRFDKRS+                 ESVVTLV
Sbjct: 304  GCDYNMKGELTYGKIGEWRFDKRSYLRGPPPRNLSLPPPGVPESVVTLV 352


>gb|EOY19771.1| Uncharacterized protein TCM_045112 isoform 1 [Theobroma cacao]
 gb|EOY19772.1| Uncharacterized protein TCM_045112 isoform 1 [Theobroma cacao]
          Length = 368

 Score =  574 bits (1479), Expect = 0.0
 Identities = 270/349 (77%), Positives = 298/349 (85%), Gaps = 2/349 (0%)
 Frame = +3

Query: 141  RKTMGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGD--GALFFDPIIEMPEHV 314
            RK MGR+SPLL++ LVLG  FATYNLVTM+MH  S       D  G +FFDP+IEMPE+V
Sbjct: 4    RKNMGRLSPLLLVTLVLGFCFATYNLVTMVMHNRSISKWKVNDSNGGIFFDPVIEMPENV 63

Query: 315  KNRKTSKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLM 494
            K    +K PFHVALTATDA Y++WQCR+MYYWYKKQK+LPGSEMGGFTR+LHSG PDNL+
Sbjct: 64   KKPNNAKQPFHVALTATDAPYSKWQCRVMYYWYKKQKDLPGSEMGGFTRVLHSGSPDNLV 123

Query: 495  DEIPTVVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAF 674
            DEIPTV+VDPLPAGLDRGYIVLNRPWAFVQWLEKA IEEEY+LMAEPDHIF+RPLPNL  
Sbjct: 124  DEIPTVIVDPLPAGLDRGYIVLNRPWAFVQWLEKATIEEEYILMAEPDHIFIRPLPNLGH 183

Query: 675  GEHPAAFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNI 854
            G +PAAFPFFYIKP QNEK++RKFYPEE GPVTNIDPIGNSPVII+KDL+ KIAPTWMN+
Sbjct: 184  GGYPAAFPFFYIKPAQNEKLLRKFYPEEMGPVTNIDPIGNSPVIIKKDLLEKIAPTWMNV 243

Query: 855  SLKMKEDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTY 1034
            SLKMK DPETDKAFGWVLEMYAYA+ASALH V+HILRKDFMLQPPWDLET  KFIIHYTY
Sbjct: 244  SLKMKNDPETDKAFGWVLEMYAYAVASALHGVQHILRKDFMLQPPWDLETGKKFIIHYTY 303

Query: 1035 GCDYNLKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            GCDYN+KGELTYGK+GEWRFDKRS+                 ESVVTLV
Sbjct: 304  GCDYNMKGELTYGKIGEWRFDKRSYLRGPPPRNLSLPPPGVPESVVTLV 352


>ref|XP_021297242.1| hydroxyproline O-arabinosyltransferase 3-like [Herrania umbratica]
 ref|XP_021297243.1| hydroxyproline O-arabinosyltransferase 3-like [Herrania umbratica]
          Length = 368

 Score =  573 bits (1477), Expect = 0.0
 Identities = 271/349 (77%), Positives = 298/349 (85%), Gaps = 2/349 (0%)
 Frame = +3

Query: 141  RKTMGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGD--GALFFDPIIEMPEHV 314
            RK MGR+SPLL++ LVLG  FATYNLVTM+MH  S  +    D  G +FFDP+IEMPE+V
Sbjct: 4    RKNMGRLSPLLLLTLVLGFCFATYNLVTMVMHNRSISNWIVNDSNGGIFFDPVIEMPENV 63

Query: 315  KNRKTSKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLM 494
            K    +K PFHVALTATDA Y++WQCRIMYYWYKKQK+LPGSEMGGFTR+LHSG PDNLM
Sbjct: 64   KKPNNAKLPFHVALTATDAPYSKWQCRIMYYWYKKQKDLPGSEMGGFTRVLHSGSPDNLM 123

Query: 495  DEIPTVVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAF 674
            DEIPTV+VDPLPAGLDRGYIVLNRPWAFVQWLEKA IEEEY+LMAEPDHIF+RPLPNL  
Sbjct: 124  DEIPTVIVDPLPAGLDRGYIVLNRPWAFVQWLEKATIEEEYILMAEPDHIFIRPLPNLGH 183

Query: 675  GEHPAAFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNI 854
            G +PAAFPFFYIKP  NEK++RKFYPEE GPVTNIDPIGNSPVII+KDL+ KIAPTWMN+
Sbjct: 184  GGYPAAFPFFYIKPALNEKLLRKFYPEEMGPVTNIDPIGNSPVIIKKDLLEKIAPTWMNV 243

Query: 855  SLKMKEDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTY 1034
            SLKMK DPETDKAFGWVLEMYAYA+ASALH V+HILRKDFMLQPPWDLET  KFIIHYTY
Sbjct: 244  SLKMKNDPETDKAFGWVLEMYAYAVASALHGVQHILRKDFMLQPPWDLETGKKFIIHYTY 303

Query: 1035 GCDYNLKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            GCDYN+KGELTYGK+GEWRFDKRS+                 ESVVTLV
Sbjct: 304  GCDYNMKGELTYGKIGEWRFDKRSYLRGPPPQNLSLPPPGVPESVVTLV 352


>ref|XP_022764737.1| hydroxyproline O-arabinosyltransferase 3-like [Durio zibethinus]
          Length = 368

 Score =  570 bits (1470), Expect = 0.0
 Identities = 270/349 (77%), Positives = 296/349 (84%), Gaps = 2/349 (0%)
 Frame = +3

Query: 141  RKTMGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSESVATGDG--ALFFDPIIEMPEHV 314
            RK MGR SPLL+I LVLG  FATYNLVTM+MH  S       D    +FFDP++EMPE+V
Sbjct: 4    RKNMGRASPLLLITLVLGFCFATYNLVTMVMHNRSISKWVADDSNDGIFFDPVVEMPENV 63

Query: 315  KNRKTSKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLM 494
            +  K  K PFHVALTATDA Y++WQCRIMYYWYKKQK+LPGSEMGGFTRILHSG PDNLM
Sbjct: 64   RKPKNIKLPFHVALTATDAPYSKWQCRIMYYWYKKQKDLPGSEMGGFTRILHSGSPDNLM 123

Query: 495  DEIPTVVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAF 674
            DEIPTV+VDPLPAGLDRGYIVLNRPWAFVQWLEKA IEEEY+LMAEPDHIF+ PLPNLA 
Sbjct: 124  DEIPTVIVDPLPAGLDRGYIVLNRPWAFVQWLEKATIEEEYILMAEPDHIFITPLPNLAH 183

Query: 675  GEHPAAFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNI 854
            G +PAAFPFFYIKP QNEK++RKFYPEE GPVTNIDPIGNSPVII+KDL+ KIAPTWMN+
Sbjct: 184  GGYPAAFPFFYIKPAQNEKLLRKFYPEEMGPVTNIDPIGNSPVIIKKDLLEKIAPTWMNV 243

Query: 855  SLKMKEDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTY 1034
            SLKMK+DPETDKAFGWVLEMYAYA+ASALH V+H+LRKDFMLQPPWDLE   KFIIHYTY
Sbjct: 244  SLKMKDDPETDKAFGWVLEMYAYAVASALHGVQHVLRKDFMLQPPWDLEIGKKFIIHYTY 303

Query: 1035 GCDYNLKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            GCDYN+KGELTYGK+GEWRFDKRS+                 ESVVTLV
Sbjct: 304  GCDYNMKGELTYGKIGEWRFDKRSYLRGPIARNLSLPPPGVPESVVTLV 352


>gb|OMO73416.1| hypothetical protein CCACVL1_17273 [Corchorus capsularis]
          Length = 368

 Score =  570 bits (1468), Expect = 0.0
 Identities = 269/349 (77%), Positives = 297/349 (85%), Gaps = 2/349 (0%)
 Frame = +3

Query: 141  RKTMGRVSPLLVICLVLGSSFATYNLVTMIMHYGSSES--VATGDGALFFDPIIEMPEHV 314
            RK M R SPLL++ LVLG  FATYNLVTM++H  S     V   +G +FFDP+IEMPE+V
Sbjct: 4    RKNMARASPLLLVTLVLGFCFATYNLVTMVIHNRSISKWIVDDSNGGIFFDPVIEMPENV 63

Query: 315  KNRKTSKAPFHVALTATDATYNQWQCRIMYYWYKKQKNLPGSEMGGFTRILHSGKPDNLM 494
            +  K +K PFHVALTATDA Y++WQCRIMYYWYKK K+LPGSEMGGFTRILHSG PDNLM
Sbjct: 64   RKPKNTKMPFHVALTATDAPYSKWQCRIMYYWYKKHKDLPGSEMGGFTRILHSGSPDNLM 123

Query: 495  DEIPTVVVDPLPAGLDRGYIVLNRPWAFVQWLEKAKIEEEYVLMAEPDHIFVRPLPNLAF 674
            DEIPTV+VDPLPAGLDRGYIVLNRPWAFVQWLEK  IEEEYVLMAEPDH+F+RPLPNLA 
Sbjct: 124  DEIPTVIVDPLPAGLDRGYIVLNRPWAFVQWLEKTTIEEEYVLMAEPDHVFIRPLPNLAS 183

Query: 675  GEHPAAFPFFYIKPDQNEKVVRKFYPEEKGPVTNIDPIGNSPVIIRKDLIAKIAPTWMNI 854
            G  PAAFPFFYIKPDQNEK++RKFYPEE GPVTNIDPIGNSPVII+KDL+ KIAPTWMN+
Sbjct: 184  GGFPAAFPFFYIKPDQNEKLLRKFYPEEMGPVTNIDPIGNSPVIIKKDLLEKIAPTWMNV 243

Query: 855  SLKMKEDPETDKAFGWVLEMYAYAIASALHDVRHILRKDFMLQPPWDLETYNKFIIHYTY 1034
            SLKMK+DPETDKAFGWVLEMYAYA+ASALH V+HILRKDFMLQPPWDLET  +FIIHYTY
Sbjct: 244  SLKMKDDPETDKAFGWVLEMYAYAVASALHGVQHILRKDFMLQPPWDLETGKRFIIHYTY 303

Query: 1035 GCDYNLKGELTYGKVGEWRFDKRSHXXXXXXXXXXXXXXXXXESVVTLV 1181
            GCDYN+KGELTYGK+GEWRFDKRS+                 ESV TLV
Sbjct: 304  GCDYNMKGELTYGKIGEWRFDKRSYLRGPPPRNLSLPPPGVPESVETLV 352


Top