BLASTX nr result

ID: Phellodendron21_contig00007893 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00007893
         (1649 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006487482.1 PREDICTED: uncharacterized protein LOC102618081 [...   449   e-152
XP_006423726.1 hypothetical protein CICLE_v10028677mg [Citrus cl...   446   e-151
XP_017972018.1 PREDICTED: uncharacterized protein LOC18608347 [T...   200   9e-55
EOX98872.1 Uncharacterized protein TCM_007548 isoform 1 [Theobro...   199   2e-54
KCW57034.1 hypothetical protein EUGRSUZ_I02697 [Eucalyptus grandis]   194   3e-52
KCW57035.1 hypothetical protein EUGRSUZ_I02697 [Eucalyptus grandis]   194   8e-52
OAY52103.1 hypothetical protein MANES_04G057900 [Manihot esculenta]   193   8e-52
XP_012080849.1 PREDICTED: uncharacterized protein LOC105641013 [...   190   1e-50
OMO99105.1 hypothetical protein CCACVL1_03928, partial [Corchoru...   185   2e-50
OMO86884.1 hypothetical protein COLO4_20877 [Corchorus olitorius]     186   7e-50
XP_018842140.1 PREDICTED: uncharacterized protein LOC109007073 [...   187   1e-49
XP_004139809.2 PREDICTED: uncharacterized protein LOC101208216 i...   188   1e-49
XP_011659008.1 PREDICTED: uncharacterized protein LOC101208216 i...   188   2e-49
XP_003593131.1 hypothetical protein MTR_2g008130 [Medicago trunc...   184   6e-49
KHN33496.1 hypothetical protein glysoja_005538 [Glycine soja]         185   1e-48
KRH10490.1 hypothetical protein GLYMA_15G050700 [Glycine max]         185   1e-48
XP_008447174.1 PREDICTED: uncharacterized protein LOC103489686 i...   185   2e-48
XP_016900347.1 PREDICTED: uncharacterized protein LOC103489686 i...   185   2e-48
XP_008447173.1 PREDICTED: uncharacterized protein LOC103489686 i...   185   3e-48
GAU43067.1 hypothetical protein TSUD_194210 [Trifolium subterran...   179   6e-47

>XP_006487482.1 PREDICTED: uncharacterized protein LOC102618081 [Citrus sinensis]
            XP_006487483.1 PREDICTED: uncharacterized protein
            LOC102618081 [Citrus sinensis] XP_015388509.1 PREDICTED:
            uncharacterized protein LOC102618081 [Citrus sinensis]
            XP_015388510.1 PREDICTED: uncharacterized protein
            LOC102618081 [Citrus sinensis]
          Length = 373

 Score =  449 bits (1154), Expect = e-152
 Identities = 246/374 (65%), Positives = 269/374 (71%), Gaps = 29/374 (7%)
 Frame = +3

Query: 48   MDKGDEQESTN-KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRI 224
            MDKGDEQESTN KKTRDLPNLSEC ACGFRIDSCTGN+KIQILYSEWRIVLLC KCL RI
Sbjct: 1    MDKGDEQESTNNKKTRDLPNLSECQACGFRIDSCTGNDKIQILYSEWRIVLLCCKCLDRI 60

Query: 225  ESSQICSYCFKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLESLICVDCWVPKSLV 404
            ESS+ICSYC+KET EDFLTCSQCKRSVH+NCFLKCK + S++S LESLICVDCWVPKSLV
Sbjct: 61   ESSKICSYCYKETIEDFLTCSQCKRSVHRNCFLKCKAIDSMSS-LESLICVDCWVPKSLV 119

Query: 405  KRREFFTFRKNC---TDLGNSNSRVLNGGGNGA-VERKIVFALMASGMMERKSLVP-RSN 569
            KRRE  T RK C    DLG SNSRV NGGG+ A VERKIVFALMA+ M+ RK  VP +SN
Sbjct: 120  KRRELLTCRKICNSSADLGISNSRVSNGGGSCAVVERKIVFALMATEMIGRKPFVPKKSN 179

Query: 570  ALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSNVSQVPKKWEC 749
            ALD EVKR+ GGE  K+ A DDDAE AFQLHR+MNSSPRISKNLCVVNS+ S VPKK EC
Sbjct: 180  ALDLEVKREEGGEIHKKVASDDDAELAFQLHRSMNSSPRISKNLCVVNSSDSHVPKKQEC 239

Query: 750  DGALMLXXXXXXXXXNNALKSSGDETSTNFDSRSSYNNHYESTSCKVEICNRKPDXXXXX 929
            DG L+L         +NALKSSGDETSTNFDSR SY+   ES S K+ +CN++PD     
Sbjct: 240  DGVLILGGSGSGSCSSNALKSSGDETSTNFDSRPSYDKRCESASYKLAVCNKQPDRFFFK 299

Query: 930  XXXXXXXXXXXXXXXXXXXXXHVL-----------------------DNKSDIEISNRKP 1040
                                  VL                       DNKSDIEI N+KP
Sbjct: 300  YRKRGSRRFLLKYRRRSSSSKPVLDNKSDIFLLKYRRRRSAGSKPVPDNKSDIEICNQKP 359

Query: 1041 DRYLLKYRRRDTSS 1082
            DRYL KYRRRD SS
Sbjct: 360  DRYLFKYRRRDKSS 373


>XP_006423726.1 hypothetical protein CICLE_v10028677mg [Citrus clementina]
            XP_006423727.1 hypothetical protein CICLE_v10028677mg
            [Citrus clementina] ESR36966.1 hypothetical protein
            CICLE_v10028677mg [Citrus clementina] ESR36967.1
            hypothetical protein CICLE_v10028677mg [Citrus
            clementina] KDO50070.1 hypothetical protein
            CISIN_1g017357mg [Citrus sinensis]
          Length = 373

 Score =  446 bits (1148), Expect = e-151
 Identities = 246/374 (65%), Positives = 267/374 (71%), Gaps = 29/374 (7%)
 Frame = +3

Query: 48   MDKGDEQESTN-KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRI 224
            MDKG EQESTN KKTRDLPNLSEC ACGFRIDSCTGN+KIQILYSEWRIVLLC KCL RI
Sbjct: 1    MDKGHEQESTNNKKTRDLPNLSECQACGFRIDSCTGNDKIQILYSEWRIVLLCCKCLDRI 60

Query: 225  ESSQICSYCFKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLESLICVDCWVPKSLV 404
            ESS+ICSYC+KET EDFLTCSQCKRSVH+NCFLKCK + S++S LESLICVDCWVPKSLV
Sbjct: 61   ESSKICSYCYKETIEDFLTCSQCKRSVHRNCFLKCKAIDSMSS-LESLICVDCWVPKSLV 119

Query: 405  KRREFFTFRKNC---TDLGNSNSRVLNGGGNGA-VERKIVFALMASGMMERKSLVP-RSN 569
            KRRE  T RK C    DLG SNSRV NGGG+ A VERKIVFALMAS M+ RK  VP +SN
Sbjct: 120  KRRELLTCRKICNSSADLGISNSRVSNGGGSCAVVERKIVFALMASEMIGRKPFVPKKSN 179

Query: 570  ALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSNVSQVPKKWEC 749
            ALD EVKRD GGE  K+ A DDDAE AFQLHR+MNSSPRISKNLCVVNS+ S VPKK EC
Sbjct: 180  ALDLEVKRDEGGEIHKKVASDDDAELAFQLHRSMNSSPRISKNLCVVNSSDSHVPKKQEC 239

Query: 750  DGALMLXXXXXXXXXNNALKSSGDETSTNFDSRSSYNNHYESTSCKVEICNRKPDXXXXX 929
            DG L+L         +NALKSSGDETSTNFDSR SY+   ES S K+ +CN++PD     
Sbjct: 240  DGVLILGGSGSGSCSSNALKSSGDETSTNFDSRPSYDKRCESASYKLAVCNKQPDRFFFK 299

Query: 930  XXXXXXXXXXXXXXXXXXXXXHVLDNKS-----------------------DIEISNRKP 1040
                                  VLDNKS                       DIEI N+KP
Sbjct: 300  YRKRGSRRFLLKYRRRSSSSKPVLDNKSDIFLLKYRRRRSAGSKPVPDNKLDIEICNQKP 359

Query: 1041 DRYLLKYRRRDTSS 1082
            DRY  KYRRRD SS
Sbjct: 360  DRYSFKYRRRDKSS 373


>XP_017972018.1 PREDICTED: uncharacterized protein LOC18608347 [Theobroma cacao]
          Length = 442

 Score =  200 bits (509), Expect = 9e-55
 Identities = 104/220 (47%), Positives = 138/220 (62%), Gaps = 4/220 (1%)
 Frame = +3

Query: 72  STNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYC 251
           S+ KKTRDLPNL+EC ACG R D+  G N+IQ LYSEWRIVLLCS+C +R++SS+ICSYC
Sbjct: 35  SSPKKTRDLPNLTECQACGSRTDTANGKNRIQTLYSEWRIVLLCSRCYHRVDSSEICSYC 94

Query: 252 FKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYL---ESLICVDCWVPKSLVKRREFF 422
           FKE +ED+ +C QCKRS+HK CFL CK+V   +  +   E  +C+DCWVPK + ++R   
Sbjct: 95  FKEASEDYFSCGQCKRSLHKTCFLNCKSVPPWSFSICGSEFTVCIDCWVPKQIARKRG-- 152

Query: 423 TFRKNCTDLGNSNSRVLNGGGNGAVERKIVFALMASGMMERKSLVPRSNALDSEVKRDMG 602
            FR+N     +S     +GGG   +E  +  A  A G     ++  R  A+   +     
Sbjct: 153 NFRRNKKAKNSSILDNRDGGGAKLLESVVKDANYAMGKKVEAAVKAREMAVKKAIVAKRA 212

Query: 603 GENQKRDAVD-DDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
            E       + DDAE AF+LHR MNSSPRISKN  + + N
Sbjct: 213 VELASNALEECDDAELAFRLHRAMNSSPRISKNRILGDQN 252


>EOX98872.1 Uncharacterized protein TCM_007548 isoform 1 [Theobroma cacao]
           EOX98873.1 Uncharacterized protein TCM_007548 isoform 1
           [Theobroma cacao]
          Length = 442

 Score =  199 bits (506), Expect = 2e-54
 Identities = 108/236 (45%), Positives = 144/236 (61%), Gaps = 20/236 (8%)
 Frame = +3

Query: 72  STNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYC 251
           S+ KKTRDLPNL+EC ACG R D+  G N+IQ LYSEWRIVLLCS+C +R++SS+ICSYC
Sbjct: 35  SSPKKTRDLPNLTECQACGSRTDTANGKNRIQTLYSEWRIVLLCSRCYHRVDSSEICSYC 94

Query: 252 FKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYL---ESLICVDCWVPKSLVKRREFF 422
           FKE +ED  +C QCKRS+HK CFL CK+V   +  +   E  +C+DCWVPK + ++R  F
Sbjct: 95  FKEASEDCFSCGQCKRSLHKTCFLNCKSVPPWSFSICGSEFTVCIDCWVPKQIARKRGNF 154

Query: 423 TFRKNCTDLGNSNSRVLNGGG-----------NGAVERKIVFALMASGMMERKSLVPR-- 563
              K   +    ++R  +GGG           N A+ +K+  A+ A  M  +K++V +  
Sbjct: 155 RHNKKAKNSSILDNR--DGGGAKLLESVVKDANYAMGKKVEAAVKAREMAVKKAIVAKRA 212

Query: 564 ----SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
               SNAL+                  DDAE AF+LHR MNSSPRISKN  + + N
Sbjct: 213 VELASNALEEY----------------DDAELAFRLHRAMNSSPRISKNRIMGDQN 252


>KCW57034.1 hypothetical protein EUGRSUZ_I02697 [Eucalyptus grandis]
          Length = 485

 Score =  194 bits (494), Expect = 3e-52
 Identities = 116/262 (44%), Positives = 143/262 (54%), Gaps = 38/262 (14%)
 Frame = +3

Query: 72  STNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYC 251
           S  KKTRDLPNLS+C ACG RID   G N++Q L+SEWRIVLLC KCL R+ESS +CSYC
Sbjct: 46  SVPKKTRDLPNLSQCHACGSRIDLVDGKNRLQPLHSEWRIVLLCKKCLIRVESSVVCSYC 105

Query: 252 FKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLESL-----ICVDCWVPKSLVKRRE 416
           F ET+++   C  CKR VHKNCFL+ K  A  +     L     +CVDCW+P+S+ K   
Sbjct: 106 FSETSDECFRCCACKRRVHKNCFLEYKNAAPWSYSCSGLGSEFSVCVDCWLPRSMAKLNG 165

Query: 417 FFTFRKN--CTD----LGNSNSRVLNGG--------------GNGAVERKIVFALMASGM 536
               R++   TD     G  +SR+L  G               N  VE K   A  A   
Sbjct: 166 ASKRRRSGGKTDGRAVCGLGHSRLLEDGDCCVMKCLEDVVKDANCVVELKNAAACKAKEK 225

Query: 537 MERKSLVPR------SNALDSEVKRDMG-------GENQKRDAVDDDAEFAFQLHRTMNS 677
             +K++V +      S ALD     D         G     D V DD E AFQLHR+MNS
Sbjct: 226 AHKKAVVAKRAVELASEALDMVANTDESSLLACEEGGGDCDDKVVDDEELAFQLHRSMNS 285

Query: 678 SPRISKNLCVVNSNVSQVPKKW 743
           SPRISKN C VN + + VP+ W
Sbjct: 286 SPRISKNFCSVNKSCADVPRMW 307


>KCW57035.1 hypothetical protein EUGRSUZ_I02697 [Eucalyptus grandis]
          Length = 528

 Score =  194 bits (494), Expect = 8e-52
 Identities = 116/262 (44%), Positives = 143/262 (54%), Gaps = 38/262 (14%)
 Frame = +3

Query: 72  STNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYC 251
           S  KKTRDLPNLS+C ACG RID   G N++Q L+SEWRIVLLC KCL R+ESS +CSYC
Sbjct: 46  SVPKKTRDLPNLSQCHACGSRIDLVDGKNRLQPLHSEWRIVLLCKKCLIRVESSVVCSYC 105

Query: 252 FKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLESL-----ICVDCWVPKSLVKRRE 416
           F ET+++   C  CKR VHKNCFL+ K  A  +     L     +CVDCW+P+S+ K   
Sbjct: 106 FSETSDECFRCCACKRRVHKNCFLEYKNAAPWSYSCSGLGSEFSVCVDCWLPRSMAKLNG 165

Query: 417 FFTFRKN--CTD----LGNSNSRVLNGG--------------GNGAVERKIVFALMASGM 536
               R++   TD     G  +SR+L  G               N  VE K   A  A   
Sbjct: 166 ASKRRRSGGKTDGRAVCGLGHSRLLEDGDCCVMKCLEDVVKDANCVVELKNAAACKAKEK 225

Query: 537 MERKSLVPR------SNALDSEVKRDMG-------GENQKRDAVDDDAEFAFQLHRTMNS 677
             +K++V +      S ALD     D         G     D V DD E AFQLHR+MNS
Sbjct: 226 AHKKAVVAKRAVELASEALDMVANTDESSLLACEEGGGDCDDKVVDDEELAFQLHRSMNS 285

Query: 678 SPRISKNLCVVNSNVSQVPKKW 743
           SPRISKN C VN + + VP+ W
Sbjct: 286 SPRISKNFCSVNKSCADVPRMW 307


>OAY52103.1 hypothetical protein MANES_04G057900 [Manihot esculenta]
          Length = 468

 Score =  193 bits (490), Expect = 8e-52
 Identities = 111/248 (44%), Positives = 146/248 (58%), Gaps = 12/248 (4%)
 Frame = +3

Query: 48  MDKGDEQESTNKK-TRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRI 224
           M+   +Q S+NKK TRDLPNLSEC +CGF+ DSC G N++Q LYSEWRIVLLC  C  R+
Sbjct: 1   MESQQQQGSSNKKKTRDLPNLSECHSCGFQFDSCAGKNRLQTLYSEWRIVLLCKICFARV 60

Query: 225 ESSQICSYCFKETTEDFLTCSQCKRSVHKNCFLKCKTVASIN-SYLESLICVDCWVPKSL 401
           ESSQ+CSYCFK ++++   C +CKR +HK+CFL   +VA  + S     +CVDCWVPKS+
Sbjct: 61  ESSQLCSYCFKGSSDNCFHCCECKRIIHKDCFLDYASVAPCSFSSSNFSVCVDCWVPKSV 120

Query: 402 VKRREFF--TFRKNCTDLGNSNSRVLN-----GGGNGAVERKIVFALMASGMMERKSLVP 560
             +R     + RK    LG  + ++ +        N AV RKI     A  + E K+L  
Sbjct: 121 AAKRASLRPSNRKKSAVLGFGDCQIKSPEDVVREANSAVHRKIEADAKARELAEEKALAA 180

Query: 561 RSNALDSEVKRD---MGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSNVSQV 731
           R  A  ++V  D   +G +N    A  DD E A QLH+ +NSS  I KN+C VNS    V
Sbjct: 181 RRAAELTKVALDSMSLGDDNGSPAAGIDDVELALQLHQAVNSSSSILKNMCSVNSCCLAV 240

Query: 732 PKKWECDG 755
            K     G
Sbjct: 241 QKSLVSSG 248


>XP_012080849.1 PREDICTED: uncharacterized protein LOC105641013 [Jatropha curcas]
           KDP45544.1 hypothetical protein JCGZ_18781 [Jatropha
           curcas]
          Length = 480

 Score =  190 bits (482), Expect = 1e-50
 Identities = 113/284 (39%), Positives = 151/284 (53%), Gaps = 21/284 (7%)
 Frame = +3

Query: 66  QESTNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICS 245
           + +  KKTRDLPNLSEC +CGFRIDSCTGNN++  LYSEWRIVLLC  C  R+ESSQ+CS
Sbjct: 6   KNNKRKKTRDLPNLSECHSCGFRIDSCTGNNRLHTLYSEWRIVLLCKICFIRVESSQLCS 65

Query: 246 YCFKETTEDFLTCSQCKRSVHKNCFLKCKTV---------ASINSYLESLICVDCWVPKS 398
           YCFK ++++   C QCKR +HK+C     TV         +S +   +  +CVDCWVPK 
Sbjct: 66  YCFKGSSDNCFNCLQCKRIIHKSCLFDYATVSPWSFSSSSSSSSRASQFSVCVDCWVPKY 125

Query: 399 LVKRREFFTFRKNCTDLGNSNSRVLNG---GGNGAVERKIVFALMASGMMERKSLVPR-- 563
           +  +R        C       +++L G     N +V+R +  A  A  +  +K+L  R  
Sbjct: 126 IADKRA-------CYFRPIKRNKLLEGVDRDANCSVQRTVA-AARARELAVKKALAARQA 177

Query: 564 ----SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSNVSQV 731
                NALD   +R   G       + DD E + Q H  MNSSPRI KN C V+S    V
Sbjct: 178 AELAQNALDLVAER--YGSGCAGGGIHDDMELSLQFHGVMNSSPRILKNFCFVSSRCLDV 235

Query: 732 PKKWECDGA---LMLXXXXXXXXXNNALKSSGDETSTNFDSRSS 854
           P  W   G    L +         +  + +SG ++S N DS  S
Sbjct: 236 PNPWVRAGVCRKLEVSNEKSVSDPSVCVTTSGYDSSVNMDSLGS 279


>OMO99105.1 hypothetical protein CCACVL1_03928, partial [Corchorus capsularis]
          Length = 322

 Score =  185 bits (470), Expect = 2e-50
 Identities = 103/233 (44%), Positives = 137/233 (58%), Gaps = 17/233 (7%)
 Frame = +3

Query: 72  STNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYC 251
           S+ KKTRDLPNL+EC ACG R D+  G N+IQ LYSEWRIVLLC +C +R+ SSQICSYC
Sbjct: 34  SSLKKTRDLPNLTECQACGSRTDTAKGKNRIQTLYSEWRIVLLCPRCYHRVVSSQICSYC 93

Query: 252 FKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYL---ESLICVDCWVPKSLVKRREFF 422
           FK  ++D  +CSQCKRS+HK CFL  K++   +  +   E  +C+DCWVPK + ++R   
Sbjct: 94  FKAASDDCFSCSQCKRSIHKTCFLNYKSIPPWSYSIRGSEFTVCIDCWVPKQIARKRGIL 153

Query: 423 TFRKNCTDLGNSNSRVLNG--------------GGNGAVERKIVFALMASGMMERKSLVP 560
              +        NS VL+G                N A+E+K+  A+ A     +K++V 
Sbjct: 154 RRNRKA-----KNSSVLDGRDDEGAKLLEDVVKDANCAMEKKVEVAVKARETAVKKAVVA 208

Query: 561 RSNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
           +      E+ RD   E        DDAE AF+LHR MNSSPRI KN  + + N
Sbjct: 209 KRAV---ELARDALEE-------CDDAELAFRLHRAMNSSPRILKNRILGDQN 251


>OMO86884.1 hypothetical protein COLO4_20877 [Corchorus olitorius]
          Length = 418

 Score =  186 bits (473), Expect = 7e-50
 Identities = 102/224 (45%), Positives = 138/224 (61%), Gaps = 8/224 (3%)
 Frame = +3

Query: 72  STNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYC 251
           S+ KKTRDLPNL+EC ACG R D+  G N+IQ LYSEWRIVLLC +C +R+ SSQICSYC
Sbjct: 34  SSLKKTRDLPNLTECQACGSRTDTAKGKNRIQTLYSEWRIVLLCPRCYHRVVSSQICSYC 93

Query: 252 FKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYL---ESLICVDCWVPKSLVKRREFF 422
           FKE ++D  +CSQCKRS+HK CFL  K++   +  +   E  +C+DCW+PK + ++R   
Sbjct: 94  FKEASDDCFSCSQCKRSIHKTCFLNYKSIPPWSYSIRGSEFTVCIDCWLPKQIARKRGIL 153

Query: 423 TFRKNCTDLGNSNSRVLNGGGNGAVERKIVFALMASGMMERK---SLVPRSNALDSEVKR 593
              +        NS VL+G  +   +        A+  ME+K   ++  R  A+   V  
Sbjct: 154 RRNRKA-----KNSSVLDGRDDEGAKLLEDVVKDANCAMEKKVEVAVKARETAVKKAVVA 208

Query: 594 DMGGENQKRDAVD--DDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
           +   E   RDA++  DDAE AF+LHR MNSSPRI KN  + + N
Sbjct: 209 NRAVE-LARDALEECDDAELAFRLHRAMNSSPRILKNRILGDQN 251


>XP_018842140.1 PREDICTED: uncharacterized protein LOC109007073 [Juglans regia]
            XP_018842141.1 PREDICTED: uncharacterized protein
            LOC109007073 [Juglans regia] XP_018842142.1 PREDICTED:
            uncharacterized protein LOC109007073 [Juglans regia]
            XP_018842143.1 PREDICTED: uncharacterized protein
            LOC109007073 [Juglans regia] XP_018842144.1 PREDICTED:
            uncharacterized protein LOC109007073 [Juglans regia]
          Length = 448

 Score =  187 bits (474), Expect = 1e-49
 Identities = 146/396 (36%), Positives = 185/396 (46%), Gaps = 49/396 (12%)
 Frame = +3

Query: 60   DEQESTNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQI 239
            D      KKTRDLPNLS+C ACGFR D+  G  ++Q LYSEWRIVLLC KC  R+ESS+I
Sbjct: 68   DHPHPALKKTRDLPNLSQCHACGFRTDTGNGKQRLQTLYSEWRIVLLCKKCAVRVESSEI 127

Query: 240  CSYCFKET-TEDFLTCSQCKRSVHKNCFLKCKTVASIN---SYLESLICVDCWVPKSLVK 407
            CSYCF+ET   +   C +C R VH++CF K ++VA  +   S  E  +CVDCWVPK +  
Sbjct: 128  CSYCFQETLAAECFCCGECNRRVHRDCFKKYRSVAPWSYSCSGEEFSVCVDCWVPKPIAL 187

Query: 408  RREFFTFRK----NCTDLGNSNSRVLNG---------GGNGAVERKIVFALMASGMMERK 548
             R     RK    + +     +SR+L+G           N   ++KI  A  A     RK
Sbjct: 188  SRALLGGRKIRRRDRSGGKARDSRILDGAKSLKDVAENANSVAKKKIEEAAKARVEAVRK 247

Query: 549  SLVPR------SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNL--- 701
            ++V R      +NALD    RD  GE  +     DDAE AFQLHR MNSSPRIS+NL   
Sbjct: 248  AVVARRAVELATNALDFVASRDENGEKVQN---VDDAELAFQLHRAMNSSPRISRNLWEN 304

Query: 702  ---CVVNSNVSQVP---KKWECDGALMLXXXXXXXXXN------NALKSSGD-ETSTNFD 842
                +V  + S  P    K E D +++                 N LKS G  ETS   D
Sbjct: 305  KGDSLVGGSRSGYPGVCGKLEPDRSVLEPVVCAGSLDGGSSMNVNCLKSVGKIETSGVKD 364

Query: 843  SRSSYNNHYE----------STSCKVEICNRKPDXXXXXXXXXXXXXXXXXXXXXXXXXX 992
                     E            SC +++ N   D                          
Sbjct: 365  GECQMRCDSEFGILGVRMEGEGSCSIKVSNSNGD------------DNSMDSANRSSHQQ 412

Query: 993  HVLDNKSDIEISNRKPDRYLLKYRRRDTSSKQIHDD 1100
            H L    D +  N +PDRY LKY RR+  SK I  D
Sbjct: 413  HKLTMPKD-KRYNGEPDRYPLKYCRRNRKSKAISCD 447


>XP_004139809.2 PREDICTED: uncharacterized protein LOC101208216 isoform X2 [Cucumis
           sativus] KGN44222.1 hypothetical protein Csa_7G230970
           [Cucumis sativus]
          Length = 524

 Score =  188 bits (478), Expect = 1e-49
 Identities = 111/238 (46%), Positives = 137/238 (57%), Gaps = 25/238 (10%)
 Frame = +3

Query: 81  KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFKE 260
           KKTRDLPN SEC ACGFRID+  G +++  LYSEWRIVLLC+KC   +ESSQ+CSYCF +
Sbjct: 53  KKTRDLPNFSECHACGFRIDTVDGRSRLNSLYSEWRIVLLCNKCFSLVESSQVCSYCFAD 112

Query: 261 TTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLES----LICVDCWVPKSLVKRREFFTF 428
           TT D   C +C R VH+ CF +   VA   SY  S     +C+DCWVPK +V  R     
Sbjct: 113 TTGDSFICCECNRRVHRECFSQYSRVAPW-SYSSSGSVFSVCIDCWVPKPIVTARAVLRS 171

Query: 429 RK------NCTDL--------GNSNS-RVLNGGGNGAVERKIVFALMASGMMERKSLVPR 563
           RK      N +DL        GN  S   L    N  VE+K+  A+ A     +K+ V R
Sbjct: 172 RKIRRKNVNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVAR 231

Query: 564 ------SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
                 S+AL+   +RD     +  D+  +DAE A QLHR MNSSPR SKNLC  NSN
Sbjct: 232 RASALASDALNLVAQRDESAAKESGDSA-EDAELAIQLHRAMNSSPRFSKNLCSTNSN 288


>XP_011659008.1 PREDICTED: uncharacterized protein LOC101208216 isoform X1 [Cucumis
           sativus]
          Length = 541

 Score =  188 bits (478), Expect = 2e-49
 Identities = 111/238 (46%), Positives = 137/238 (57%), Gaps = 25/238 (10%)
 Frame = +3

Query: 81  KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFKE 260
           KKTRDLPN SEC ACGFRID+  G +++  LYSEWRIVLLC+KC   +ESSQ+CSYCF +
Sbjct: 53  KKTRDLPNFSECHACGFRIDTVDGRSRLNSLYSEWRIVLLCNKCFSLVESSQVCSYCFAD 112

Query: 261 TTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLES----LICVDCWVPKSLVKRREFFTF 428
           TT D   C +C R VH+ CF +   VA   SY  S     +C+DCWVPK +V  R     
Sbjct: 113 TTGDSFICCECNRRVHRECFSQYSRVAPW-SYSSSGSVFSVCIDCWVPKPIVTARAVLRS 171

Query: 429 RK------NCTDL--------GNSNS-RVLNGGGNGAVERKIVFALMASGMMERKSLVPR 563
           RK      N +DL        GN  S   L    N  VE+K+  A+ A     +K+ V R
Sbjct: 172 RKIRRKNVNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVAR 231

Query: 564 ------SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
                 S+AL+   +RD     +  D+  +DAE A QLHR MNSSPR SKNLC  NSN
Sbjct: 232 RASALASDALNLVAQRDESAAKESGDSA-EDAELAIQLHRAMNSSPRFSKNLCSTNSN 288


>XP_003593131.1 hypothetical protein MTR_2g008130 [Medicago truncatula] AES63382.1
            hypothetical protein MTR_2g008130 [Medicago truncatula]
          Length = 420

 Score =  184 bits (467), Expect = 6e-49
 Identities = 129/374 (34%), Positives = 173/374 (46%), Gaps = 44/374 (11%)
 Frame = +3

Query: 81   KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFKE 260
            KKTRDLPNL+EC ACGF+ID CTG NK+Q LYSEWR+VLLC KC   ++SSQICSYCF E
Sbjct: 30   KKTRDLPNLTECHACGFKIDVCTGKNKLQTLYSEWRVVLLCKKCFSCVKSSQICSYCFSE 89

Query: 261  TTEDFLTCSQCKRSVHKNCFLKCKTVASINSY----LESLICVDCWVPK----------- 395
            ++ D L C +CK SVHKNCFLK K VA   SY     E  +CVDCWVPK           
Sbjct: 90   SSSDSLRCVKCKHSVHKNCFLKNKNVAPW-SYSCVGSEFSVCVDCWVPKHVEISRRRTIR 148

Query: 396  SLVKRREFFTFRKNCTDLGNSNSRVLNGG------------GNGAVERKIVFALMASGMM 539
            SL K +     +K   DL   +SRVL GG                 ++K+  A MA  + 
Sbjct: 149  SLRKVKSGVIVKKGRVDLVKESSRVLKGGNLTRSMEDVVKDAKQKAKKKVEAAAMARRVA 208

Query: 540  ERKSLVPR------SNALDSEVKRDMGGEN--QKRDAVDDDAEFAFQLHRTMNSSPRISK 695
             +K++  R      +  L+    R+ G  N   K D V             +N+SP ISK
Sbjct: 209  SKKAVAARRAVELANKTLNIAANREEGTLNLPSKMDPVKVVGCSCLAFDLCLNNSPMISK 268

Query: 696  NLCVVNSNVSQVPKKW--ECDGALMLXXXXXXXXXNNALKSSGDETSTNFD----SRSSY 857
            + C++++N    PK+W    D +            + +L+S   ++ST+       R   
Sbjct: 269  SRCLLDTNNLDAPKRWTFSVDSS---GKTSNSRSASGSLRSLDSDSSTDLSCPCIGRCDM 325

Query: 858  NNHYESTSCKVEICNRKPDXXXXXXXXXXXXXXXXXXXXXXXXXXHVLDNKSD---IEIS 1028
                +   C  E+   +                              +  KSD    + S
Sbjct: 326  ITSPKDGECTAELKEGEGSCSDRLINFSGENSALHGEERSDRYFFKYVRRKSDRYFFKYS 385

Query: 1029 NRKPDRYLLKYRRR 1070
             R+ DRY  KY RR
Sbjct: 386  RRRSDRYFFKYSRR 399


>KHN33496.1 hypothetical protein glysoja_005538 [Glycine soja]
          Length = 478

 Score =  185 bits (469), Expect = 1e-48
 Identities = 110/258 (42%), Positives = 143/258 (55%), Gaps = 31/258 (12%)
 Frame = +3

Query: 63  EQESTNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQIC 242
           + +  +KKTRDLPNL+EC ACGF++D CTG N+++ LYSEWR+VLLC KC   +ESSQIC
Sbjct: 22  DTDPPHKKTRDLPNLTECHACGFKVDVCTGKNRLRTLYSEWRVVLLCKKCFSSVESSQIC 81

Query: 243 SYCFKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYL----ESLICVDCWVPKSLV-- 404
           SYCF   + +   C+QC  SVHK+CFLK K  A   SY     E  +CVDCW+PK L   
Sbjct: 82  SYCFSGASPESFRCNQCLHSVHKSCFLKYKNAAPW-SYACLGSEFSVCVDCWIPKHLAIS 140

Query: 405 KRREFFTFR--KNCTDLGNSNSRVLNGGGN-------------GAVERKIVFALMASGMM 539
           +RR     +  KN   +    S  + GGGN              AV  K+  A  A    
Sbjct: 141 RRRNKIGVKNGKNGRVMPEKGSPRVFGGGNLVRSMEDLVEDAKRAVGEKVEAAARARDEA 200

Query: 540 ERKSLVPRS------NALDSEVKRDMGGEN--QKRDAVD--DDAEFAFQLHRTMNSSPRI 689
            +K++V RS      NAL     R+    N   K DAV   D +E  F+LH   NS PRI
Sbjct: 201 MQKAMVARSALEIANNALSLVANREESSLNLPPKMDAVKVLDGSELTFELHPRFNSLPRI 260

Query: 690 SKNLCVVNSNVSQVPKKW 743
           SK+ C++N +    PK+W
Sbjct: 261 SKSCCLLNVSYLDTPKRW 278


>KRH10490.1 hypothetical protein GLYMA_15G050700 [Glycine max]
          Length = 490

 Score =  185 bits (469), Expect = 1e-48
 Identities = 110/258 (42%), Positives = 143/258 (55%), Gaps = 31/258 (12%)
 Frame = +3

Query: 63  EQESTNKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQIC 242
           + +  +KKTRDLPNL+EC ACGF++D CTG N+++ LYSEWR+VLLC KC   +ESSQIC
Sbjct: 22  DTDPPHKKTRDLPNLTECHACGFKVDVCTGKNRLRTLYSEWRVVLLCKKCFSSVESSQIC 81

Query: 243 SYCFKETTEDFLTCSQCKRSVHKNCFLKCKTVASINSYL----ESLICVDCWVPKSLV-- 404
           SYCF   + +   C+QC  SVHK+CFLK K  A   SY     E  +CVDCW+PK L   
Sbjct: 82  SYCFSGASPESFRCNQCLHSVHKSCFLKYKNAAPW-SYACLGSEFSVCVDCWIPKHLAIS 140

Query: 405 KRREFFTFR--KNCTDLGNSNSRVLNGGGN-------------GAVERKIVFALMASGMM 539
           +RR     +  KN   +    S  + GGGN              AV  K+  A  A    
Sbjct: 141 RRRNKIGVKNGKNGRVMPEKGSPRVFGGGNLVRSMEDLVEDAKRAVGEKVEAAARARDEA 200

Query: 540 ERKSLVPRS------NALDSEVKRDMGGEN--QKRDAVD--DDAEFAFQLHRTMNSSPRI 689
            +K++V RS      NAL     R+    N   K DAV   D +E  F+LH   NS PRI
Sbjct: 201 MQKAMVARSALEIANNALSLVANREESSLNLPPKMDAVKVLDGSELTFELHPRFNSLPRI 260

Query: 690 SKNLCVVNSNVSQVPKKW 743
           SK+ C++N +    PK+W
Sbjct: 261 SKSCCLLNVSYLDTPKRW 278


>XP_008447174.1 PREDICTED: uncharacterized protein LOC103489686 isoform X3 [Cucumis
           melo] XP_008447175.1 PREDICTED: uncharacterized protein
           LOC103489686 isoform X4 [Cucumis melo]
          Length = 524

 Score =  185 bits (470), Expect = 2e-48
 Identities = 109/238 (45%), Positives = 136/238 (57%), Gaps = 25/238 (10%)
 Frame = +3

Query: 81  KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFKE 260
           KKTRDLPN SEC +CGFRID+  G +++  LYSEWRIVLLC KC   +ESSQ+CSYCF +
Sbjct: 53  KKTRDLPNFSECHSCGFRIDTVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFAD 112

Query: 261 TTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLES----LICVDCWVPKSLVKRREFFTF 428
           +T D   C +C R VH+ CF +   VA   SY  S     +C+DCWVPK +V  R     
Sbjct: 113 STGDSFICCECNRRVHRECFSQYSRVAPW-SYSSSGSVFSVCIDCWVPKPIVTARAVLRS 171

Query: 429 RK------NCTDL--------GNSNS-RVLNGGGNGAVERKIVFALMASGMMERKSLVPR 563
           RK      N +DL        GN  S   L    N  VE+K+  A+ A     +K+ V R
Sbjct: 172 RKIRRKNVNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVAR 231

Query: 564 ------SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
                 S+AL+   +RD     +  D+  +DAE A QLHR MNSSPR SKNLC  NSN
Sbjct: 232 RASALASDALNLVAQRDESAAKESGDSA-EDAELAIQLHRAMNSSPRFSKNLCSTNSN 288


>XP_016900347.1 PREDICTED: uncharacterized protein LOC103489686 isoform X2 [Cucumis
           melo]
          Length = 525

 Score =  185 bits (470), Expect = 2e-48
 Identities = 109/238 (45%), Positives = 136/238 (57%), Gaps = 25/238 (10%)
 Frame = +3

Query: 81  KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFKE 260
           KKTRDLPN SEC +CGFRID+  G +++  LYSEWRIVLLC KC   +ESSQ+CSYCF +
Sbjct: 53  KKTRDLPNFSECHSCGFRIDTVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFAD 112

Query: 261 TTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLES----LICVDCWVPKSLVKRREFFTF 428
           +T D   C +C R VH+ CF +   VA   SY  S     +C+DCWVPK +V  R     
Sbjct: 113 STGDSFICCECNRRVHRECFSQYSRVAPW-SYSSSGSVFSVCIDCWVPKPIVTARAVLRS 171

Query: 429 RK------NCTDL--------GNSNS-RVLNGGGNGAVERKIVFALMASGMMERKSLVPR 563
           RK      N +DL        GN  S   L    N  VE+K+  A+ A     +K+ V R
Sbjct: 172 RKIRRKNVNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVAR 231

Query: 564 ------SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
                 S+AL+   +RD     +  D+  +DAE A QLHR MNSSPR SKNLC  NSN
Sbjct: 232 RASALASDALNLVAQRDESAAKESGDSA-EDAELAIQLHRAMNSSPRFSKNLCSTNSN 288


>XP_008447173.1 PREDICTED: uncharacterized protein LOC103489686 isoform X1 [Cucumis
           melo]
          Length = 548

 Score =  185 bits (470), Expect = 3e-48
 Identities = 109/238 (45%), Positives = 136/238 (57%), Gaps = 25/238 (10%)
 Frame = +3

Query: 81  KKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFKE 260
           KKTRDLPN SEC +CGFRID+  G +++  LYSEWRIVLLC KC   +ESSQ+CSYCF +
Sbjct: 53  KKTRDLPNFSECHSCGFRIDTVDGRSRLNSLYSEWRIVLLCKKCFSLVESSQVCSYCFAD 112

Query: 261 TTEDFLTCSQCKRSVHKNCFLKCKTVASINSYLES----LICVDCWVPKSLVKRREFFTF 428
           +T D   C +C R VH+ CF +   VA   SY  S     +C+DCWVPK +V  R     
Sbjct: 113 STGDSFICCECNRRVHRECFSQYSRVAPW-SYSSSGSVFSVCIDCWVPKPIVTARAVLRS 171

Query: 429 RK------NCTDL--------GNSNS-RVLNGGGNGAVERKIVFALMASGMMERKSLVPR 563
           RK      N +DL        GN  S   L    N  VE+K+  A+ A     +K+ V R
Sbjct: 172 RKIRRKNVNVSDLRSSKVSTSGNCKSLSALVKDANCLVEKKVDAAVRAREHALKKAAVAR 231

Query: 564 ------SNALDSEVKRDMGGENQKRDAVDDDAEFAFQLHRTMNSSPRISKNLCVVNSN 719
                 S+AL+   +RD     +  D+  +DAE A QLHR MNSSPR SKNLC  NSN
Sbjct: 232 RASALASDALNLVAQRDESAAKESGDSA-EDAELAIQLHRAMNSSPRFSKNLCSTNSN 288


>GAU43067.1 hypothetical protein TSUD_194210 [Trifolium subterraneum]
          Length = 425

 Score =  179 bits (453), Expect = 6e-47
 Identities = 130/371 (35%), Positives = 176/371 (47%), Gaps = 40/371 (10%)
 Frame = +3

Query: 78   NKKTRDLPNLSECLACGFRIDSCTGNNKIQILYSEWRIVLLCSKCLYRIESSQICSYCFK 257
            +KKTRDLPNL+EC ACGF+ID CT  NK+Q+LYSEWR+VLLC KC   +ESSQICSYCF 
Sbjct: 57   HKKTRDLPNLTECHACGFKIDVCTAKNKLQLLYSEWRVVLLCKKCFSSVESSQICSYCFS 116

Query: 258  ETTEDFLTCSQCKRSVHKNCFLKCKTVASINSY----LESLICVDCWVPKS--LVKRREF 419
             T+ D L C++CK SVHK+CFLK K VA   SY     E  +CVDCWVPK+  + +RR  
Sbjct: 117  GTSSDSLCCTKCKHSVHKSCFLKYKNVAPW-SYSCVGSEFSVCVDCWVPKNVEISRRRRL 175

Query: 420  FTFR---------KNCTDLGNSNSRVLNGG------------GNGAVERKIVFALMASGM 536
             + R         K   D    +SRVL GG                V++K+  A  A  +
Sbjct: 176  RSLRKIKSGMIEKKGRVDFVKKSSRVLRGGNLIRSVEDVVKEAKNEVKKKVEAAARAREL 235

Query: 537  MERKSLVPR------SNALDSEVKRDMGGEN--QKRDAVD--DDAEFAFQLHRTMNSSPR 686
              ++++  R      +NAL     R+    N   K D V     +  AF LH  +NS  +
Sbjct: 236  ATKRAVAARRAVELANNALSLVANREESTPNLSPKIDPVKVVHRSYLAFDLH--LNSPTK 293

Query: 687  ISKNLCVVNSNVSQVPKKWECDGALMLXXXXXXXXXNNALKSSGDETSTNFDSRSSYNNH 866
            ISK  C++ ++    PKKW     + +          NA       +  + DS SS   +
Sbjct: 294  ISKTRCLLKTS---FPKKW----TVSIDSSCKRSKSRNASGFDNKLSLGSSDSDSSTAEY 346

Query: 867  YESTSCKVEICNRKPDXXXXXXXXXXXXXXXXXXXXXXXXXXHVLDNKSD---IEISNRK 1037
                 C +E+   + D                               +SD   ++ S RK
Sbjct: 347  GVEEDCGLELDCEQADSALNEEERSLAIKDFISSSSGDCCQLK-YSRRSDRYFLKYSRRK 405

Query: 1038 PDRYLLKYRRR 1070
             DRY  KY R+
Sbjct: 406  SDRYFFKYSRK 416

