BLASTX nr result

ID: Rehmannia28_contig00017142 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00017142
         (1297 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011075586.1| PREDICTED: uncharacterized protein LOC105160...   356   e-117
ref|XP_011070952.1| PREDICTED: uncharacterized protein LOC105156...   299   3e-94
ref|XP_012855239.1| PREDICTED: uncharacterized protein LOC105974...   276   5e-86
ref|XP_012847724.1| PREDICTED: uncharacterized protein LOC105967...   242   6e-73
emb|CDP00415.1| unnamed protein product [Coffea canephora]            238   4e-70
ref|XP_010258118.1| PREDICTED: uncharacterized protein LOC104597...   213   3e-60
ref|XP_007045912.1| Uncharacterized protein isoform 1 [Theobroma...   212   8e-60
ref|XP_012478745.1| PREDICTED: uncharacterized protein LOC105794...   210   8e-59
ref|XP_007045913.1| Uncharacterized protein isoform 2 [Theobroma...   209   8e-59
gb|KHG09453.1| AT-rich interactive domain-containing 2 -like pro...   204   1e-56
ref|XP_006483051.1| PREDICTED: uncharacterized protein LOC102614...   201   2e-55
ref|XP_006438810.1| hypothetical protein CICLE_v10031172mg [Citr...   201   2e-55
ref|XP_011012203.1| PREDICTED: uncharacterized protein LOC105116...   200   3e-55
gb|KDO83102.1| hypothetical protein CISIN_1g009163mg [Citrus sin...   198   2e-54
ref|XP_004499488.1| PREDICTED: uncharacterized protein LOC101494...   197   4e-54
gb|KJB19398.1| hypothetical protein B456_003G100500, partial [Go...   197   6e-54
ref|XP_002316094.2| hypothetical protein POPTR_0010s16720g [Popu...   193   9e-54
gb|KHN17805.1| AT-rich interactive domain-containing protein 2 [...   195   2e-53
ref|XP_012464094.1| PREDICTED: uncharacterized protein LOC105783...   195   2e-53
ref|XP_012464093.1| PREDICTED: uncharacterized protein LOC105783...   194   3e-53

>ref|XP_011075586.1| PREDICTED: uncharacterized protein LOC105160027 [Sesamum indicum]
            gi|747058492|ref|XP_011075587.1| PREDICTED:
            uncharacterized protein LOC105160027 [Sesamum indicum]
          Length = 423

 Score =  356 bits (914), Expect = e-117
 Identities = 186/330 (56%), Positives = 224/330 (67%), Gaps = 1/330 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 487
            MGVKRPL +E++PELSFKQPKQ D+N K L F  +D  +      +DSPGE KSN CK  
Sbjct: 1    MGVKRPLEQEDLPELSFKQPKQLDNNRK-LTFTAEDFPTHRTTLEVDSPGEVKSNFCKIH 59

Query: 488  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
             DGMLENG+T+GAS+A  ELE +                             PLS VT+S
Sbjct: 60   SDGMLENGDTNGASLADKELEASA----------------------------PLSLVTSS 91

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S EE+AGNED S L+ FPGY D  IP W+P +Q E+P+I +LN  PRKEVPVGPD+Q EV
Sbjct: 92   SSEEDAGNEDTSILYNFPGYIDFSIP-WRPPQQYEDPYISLLNSSPRKEVPVGPDYQAEV 150

Query: 848  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCPDV 1027
            P WDPS+  K    SNNF  ++ ++ L G C+IP PG+  SS+D   VGRGRTDCSC D+
Sbjct: 151  PEWDPSSSAKDSLGSNNFVDNE-EQRLMGACVIPMPGLNGSSVDGVTVGRGRTDCSCLDM 209

Query: 1028 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1207
            GS+RCVQQHVKEARE LR  IGE  F  LGFY+MGEEVA  W+ EDE +F+ V+F+NPA 
Sbjct: 210  GSMRCVQQHVKEAREKLRETIGEEAFANLGFYDMGEEVAWRWSAEDEHIFHNVIFSNPAS 269

Query: 1208 SGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
             GRNFWK L   FP RTK+EL+SYYFNVFM
Sbjct: 270  HGRNFWKHLSVMFPTRTKKELISYYFNVFM 299


>ref|XP_011070952.1| PREDICTED: uncharacterized protein LOC105156500 [Sesamum indicum]
          Length = 428

 Score =  299 bits (765), Expect = 3e-94
 Identities = 172/330 (52%), Positives = 203/330 (61%), Gaps = 1/330 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKP- 487
            MG+KRPL EE+ PE SFKQPKQ D N K L   T++  S      +DSPG  KS  CK  
Sbjct: 2    MGMKRPLEEEDFPEPSFKQPKQLDYNKK-LTLNTEE--SHLTTLTVDSPGRTKSIFCKSQ 58

Query: 488  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
             DG LENG+ + AS+AG E E +                             PLS VT+S
Sbjct: 59   FDGRLENGDLYSASLAGKEFEPSA----------------------------PLSLVTSS 90

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S EE+  N D S    FP +TD G+P   P +Q E+P+I +LN  PRKEVP+GPDHQ  V
Sbjct: 91   SREEDVVNGDTSVWSNFPAFTDFGLPRRLP-QQFEDPYISLLNSSPRKEVPIGPDHQAGV 149

Query: 848  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCPDV 1027
            P WDP+A     S  NN      +E L GTCII  P    S+I +F VGRGRTDC C DV
Sbjct: 150  PLWDPNA-----SRDNNR-----EEELMGTCIISMPDANDSTIGEFRVGRGRTDCDCLDV 199

Query: 1028 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPAY 1207
            GS+RCVQQHV EARE LR  IG+  F+ELGF NMG+EVA  WT E+E++F+EVVF+NP  
Sbjct: 200  GSMRCVQQHVTEAREKLRETIGDENFEELGFSNMGDEVACKWTPEEEQVFHEVVFSNPVS 259

Query: 1208 SGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
             GR FWK L  AFP RTK ELVSYYFNVFM
Sbjct: 260  HGRKFWKHLRVAFPARTKRELVSYYFNVFM 289


>ref|XP_012855239.1| PREDICTED: uncharacterized protein LOC105974668 [Erythranthe guttata]
            gi|848914755|ref|XP_012855240.1| PREDICTED:
            uncharacterized protein LOC105974668 [Erythranthe
            guttata]
          Length = 399

 Score =  276 bits (707), Expect = 5e-86
 Identities = 159/331 (48%), Positives = 200/331 (60%), Gaps = 2/331 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCK-P 487
            MG+KRPL E + PE SF +P    D +K L+  T+D    T  PR DS G  KSN C+  
Sbjct: 1    MGIKRPLEEADFPEASFGKPI---DYNKKLISCTEDFHITT--PRFDSLGGPKSNICELQ 55

Query: 488  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
             DG LE+  T+ AS A  E E +                             PLS VT+S
Sbjct: 56   FDGRLEDSETYSASAADKEFEASA----------------------------PLSLVTSS 87

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S EE+AGN D SF   FPGY D+ IPP +P EQ ++P+I +LN  P+KEVP+GPD+Q EV
Sbjct: 88   S-EEDAGNGDTSFWSYFPGYIDISIPPRRPPEQFDDPYISLLNSSPKKEVPLGPDYQAEV 146

Query: 848  PTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTD-CSCPD 1024
            P W+ +          N+   + ++ L GTC+IP P +  S+ D   VG GRT  CSC D
Sbjct: 147  PLWEGA----------NYFTDEREQQLMGTCVIPMPDLNDSTSDGVRVGHGRTVVCSCLD 196

Query: 1025 VGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPA 1204
            VGS+RCVQQHVKEARE L   IGE  F +LGF +MG+EVA  WT  DE +F+E+V +NP 
Sbjct: 197  VGSMRCVQQHVKEAREKLLETIGENNFIDLGFCHMGDEVACKWTPADEHVFHEIVLSNPV 256

Query: 1205 YSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
              GRNFWK L  AFP+RTK+ELVSYYFNVF+
Sbjct: 257  SHGRNFWKLLRSAFPSRTKKELVSYYFNVFV 287


>ref|XP_012847724.1| PREDICTED: uncharacterized protein LOC105967657 [Erythranthe guttata]
            gi|848895363|ref|XP_012847725.1| PREDICTED:
            uncharacterized protein LOC105967657 [Erythranthe
            guttata]
          Length = 381

 Score =  242 bits (618), Expect = 6e-73
 Identities = 150/331 (45%), Positives = 190/331 (57%), Gaps = 2/331 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MGVKRP  EEN+PELSF+Q +  D N+K L F  +D  S TA PR   PGE         
Sbjct: 1    MGVKRPFEEENLPELSFEQ-RIEDHNNKKLSFTPEDSPSTTA-PRFHYPGE--------- 49

Query: 491  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTSS 670
                EN  T GA I         D+E   S                     PLS   ++ 
Sbjct: 50   ---FENCVTDGACIV--------DKESTPSA--------------------PLSLAASNG 78

Query: 671  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 850
             +EE   E+  +    P   D   P   P  Q E+P+IY+LN  PRKE+P+GPDHQ +VP
Sbjct: 79   KQEEE-EEEEEYAGNIPDI-DFRTPSMPPPLQFEDPYIYLLNTPPRKEIPIGPDHQADVP 136

Query: 851  TWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHS-SIDQFMVGRGRTDCSCPDV 1027
             WDP A  K FS        + ++ L G+CI+  PG+  S S D F  GRGRTDCSC DV
Sbjct: 137  EWDPFARRKDFS-------DEREQELMGSCIVRPPGLNRSGSTDPFAAGRGRTDCSCMDV 189

Query: 1028 GSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVA-LNWTFEDERLFYEVVFANPA 1204
            GS+RCVQQHV EARE L+  +G+ +F +LGFY+MGEE A   WT ++E+LF+EVVF+NP 
Sbjct: 190  GSMRCVQQHVHEAREKLQETMGDEVFVKLGFYDMGEEAASRKWTPDEEQLFHEVVFSNP- 248

Query: 1205 YSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
              G +FWK LG  FP+RT+++ VSYYFNVFM
Sbjct: 249  --GGDFWKVLGSVFPSRTRKDFVSYYFNVFM 277


>emb|CDP00415.1| unnamed protein product [Coffea canephora]
          Length = 467

 Score =  238 bits (606), Expect = 4e-70
 Identities = 141/332 (42%), Positives = 186/332 (56%), Gaps = 3/332 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLF---MTQDVTSQTAVPRIDSPGEAKSNDC 481
            MGVKRP  EE+    S KQ KQ + ++K   F    + D  SQ +  R          D 
Sbjct: 1    MGVKRPFDEEDFQVSSVKQAKQLEFDNKQTSFSKAFSSDDVSQNSGSR---------GDF 51

Query: 482  KPCDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVT 661
              C    E GN    S + +                           K LE + PLS+VT
Sbjct: 52   DKCQLFKELGNEDSRSASSSA-------------------------EKELETSAPLSWVT 86

Query: 662  TSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQC 841
            +SS EE+AG+ +  ++ LFP Y +   P  +    LE+ +  ++N  PRK++P+GP+HQ 
Sbjct: 87   SSSGEEDAGSGEPFYVSLFPEYFEFNFPR-RTVVHLEDSYSSLINSSPRKQIPIGPNHQA 145

Query: 842  EVPTWDPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCP 1021
            E+P WDP AV       NN    D +E + GTCII      +SS D+  +G+GR DC C 
Sbjct: 146  EIPPWDPQAVETDPLTPNNCVRDDNEEAV-GTCIISASLSSYSSRDEVKIGQGRKDCVCL 204

Query: 1022 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1201
            D GSVRCV+QH+KEARE LR  IG+  F ELGFY+MGEEVA  WT E+ER+F+EVV+ NP
Sbjct: 205  DRGSVRCVRQHIKEAREKLREVIGDEKFLELGFYDMGEEVAAKWTEEEERVFHEVVYYNP 264

Query: 1202 AYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
               G+NFWK L  AFP+RT  +LVS+YFNVFM
Sbjct: 265  VSLGKNFWKQLAVAFPSRTSRDLVSFYFNVFM 296


>ref|XP_010258118.1| PREDICTED: uncharacterized protein LOC104597984 [Nelumbo nucifera]
          Length = 516

 Score =  213 bits (542), Expect = 3e-60
 Identities = 140/341 (41%), Positives = 179/341 (52%), Gaps = 12/341 (3%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPR--IDSPGEAKSNDCK 484
            M  KRP G+E   EL+ K P+Q + +++   F   D+TS   +P+  +   GE+   DC 
Sbjct: 1    MVYKRPFGDEESCELACKHPRQLEYSNQLASFA--DITSYNDMPQNPLSLVGES---DCS 55

Query: 485  P--CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLP---- 646
               CD  LE+G     SI      G G                     K LE+T P    
Sbjct: 56   KGQCDERLESGTITELSI------GAG---------------------KDLEITAPVGIS 88

Query: 647  -LSFVTTSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPV 823
             LS+ T+S+ EE++ +E    +  FPGY +   P      Q E  +   L++ PRK V V
Sbjct: 89   SLSWATSSTSEEDSRSEATDRVPFFPGYYEPDYPA-TVLAQSEEIYSSPLDYHPRKLVAV 147

Query: 824  GPDHQCEVPTW---DPSAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVG 994
            GPDHQ  VP W   D    G    V          + L GTCIIP P +  S       G
Sbjct: 148  GPDHQANVPAWGFQDTHCFGAEVMVPETTD-----DKLMGTCIIPMPDLEQSVYSSDNFG 202

Query: 995  RGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERL 1174
             GRT CSCPD GS+RCV+QH+ E RE LR  +G+  F ELGF +MGEEVA NW  E+E+ 
Sbjct: 203  CGRTICSCPDGGSIRCVKQHIMETREKLRETLGQEKFAELGFCDMGEEVARNWNEEEEQS 262

Query: 1175 FYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            F+EVVF+NPA  G+NFW  L   FP+RTK ELVSYYFNVFM
Sbjct: 263  FHEVVFSNPASLGKNFWDHLSVVFPSRTKSELVSYYFNVFM 303


>ref|XP_007045912.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508709847|gb|EOY01744.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 526

 Score =  212 bits (540), Expect = 8e-60
 Identities = 134/332 (40%), Positives = 183/332 (55%), Gaps = 3/332 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E + EL FK  +Q D ++K    MTQ   +    PR ++P        KP 
Sbjct: 1    MGFKRPFDDEELQELPFKNLRQFDYSNK----MTQFADT---FPRSNTPQ-------KPH 46

Query: 491  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTSS 670
               +E+G          E +   D    +               K  E + PLS VT+ S
Sbjct: 47   ISEVEDGFRKYQWDEVFETDALNDVTHFVD--------------KDFETSAPLSLVTSPS 92

Query: 671  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 850
             EE+ G   A+ L + P Y D  +P  +    +E+ +   L+  PR++V +GP+HQ  VP
Sbjct: 93   SEEDTGTGAAAILPVSPEYFDFDLPR-RTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVP 151

Query: 851  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCP 1021
            +W    V KY    ++ S S   D +E + GTC+IP P  Y S+ +   VG GRTDCSC 
Sbjct: 152  SWGRH-VKKYEFAQSDASDSTDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCL 210

Query: 1022 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1201
            D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV++NP
Sbjct: 211  DRGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNP 270

Query: 1202 AYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            +  G+ FWK L   FP+R+K ELVSYYFNVF+
Sbjct: 271  SSLGKKFWKDLSVVFPSRSKRELVSYYFNVFI 302


>ref|XP_012478745.1| PREDICTED: uncharacterized protein LOC105794229 [Gossypium raimondii]
            gi|823157725|ref|XP_012478746.1| PREDICTED:
            uncharacterized protein LOC105794229 [Gossypium
            raimondii] gi|823157727|ref|XP_012478747.1| PREDICTED:
            uncharacterized protein LOC105794229 [Gossypium
            raimondii] gi|763763195|gb|KJB30449.1| hypothetical
            protein B456_005G144600 [Gossypium raimondii]
            gi|763763196|gb|KJB30450.1| hypothetical protein
            B456_005G144600 [Gossypium raimondii]
          Length = 542

 Score =  210 bits (534), Expect = 8e-59
 Identities = 128/332 (38%), Positives = 176/332 (53%), Gaps = 3/332 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAV-PRIDSPGEAKSNDCKP 487
            MG KRP   E + EL FK P+Q D+N+K   F      S T   P I    E     C+ 
Sbjct: 1    MGFKRPFDSEELQELPFKHPRQFDNNNKLTQFANTISHSYTHQNPHISVDVEGGFCKCQ- 59

Query: 488  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
             D   E G             G  D   ++               K  E + PLS +T+ 
Sbjct: 60   WDEAFETG-------------GLNDERPSVD--------------KDFETSAPLSLITSI 92

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S EE+     A+   + P Y D   P  +    +E+ +  +L+  PRK+VP+GP+HQ  V
Sbjct: 93   SSEEDVDTGPAAISPISPEYFDFDFPR-RTLGPVEDAYSLLLDRSPRKQVPLGPNHQANV 151

Query: 848  PTWDPSAVGKYF--SVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCP 1021
            P+         F  + +++ +   ++E + GTC+IP P    S+ D   VG GRTDCSC 
Sbjct: 152  PSLGRHIKKDKFVQNCASDTNDIGYEEIMMGTCVIPMPDSDLSANDSGKVGAGRTDCSCL 211

Query: 1022 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1201
            D GS+RCV+QHV EARE LR  +G   F +LGFY+MGE+VA  W+ E+E +F EVV++NP
Sbjct: 212  DGGSLRCVRQHVMEAREKLRKSLGHEKFVKLGFYDMGEDVAYKWSEEEEEIFREVVYSNP 271

Query: 1202 AYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            A  G+NFWK     FP+R+K ELVSYYFNVF+
Sbjct: 272  ASLGKNFWKHFSMVFPSRSKSELVSYYFNVFI 303


>ref|XP_007045913.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508709848|gb|EOY01745.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 527

 Score =  209 bits (533), Expect = 8e-59
 Identities = 132/332 (39%), Positives = 183/332 (55%), Gaps = 3/332 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E + EL FK  +Q D ++K    MTQ   +    PR ++P +   +     
Sbjct: 1    MGFKRPFDDEELQELPFKNLRQFDYSNK----MTQFADT---FPRSNTPQKPHIS----- 48

Query: 491  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTSS 670
               +E+G          E +   D    +               K  E + PLS VT+ S
Sbjct: 49   -AEVEDGFRKYQWDEVFETDALNDVTHFVD--------------KDFETSAPLSLVTSPS 93

Query: 671  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 850
             EE+ G   A+ L + P Y D  +P  +    +E+ +   L+  PR++V +GP+HQ  VP
Sbjct: 94   SEEDTGTGAAAILPVSPEYFDFDLPR-RTFAPVEDAYSLFLDRSPRRQVLLGPNHQANVP 152

Query: 851  TWDPSAVGKYFSVSNNFSGS---DWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCP 1021
            +W    V KY    ++ S S   D +E + GTC+IP P  Y S+ +   VG GRTDCSC 
Sbjct: 153  SWGRH-VKKYEFAQSDASDSTDNDKEEMMMGTCVIPMPESYLSANNSGKVGAGRTDCSCL 211

Query: 1022 DVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANP 1201
            D GS+RCVQQHV EARE LR  +G   F +LGFY+MGE+VA  W+ EDE +F EVV++NP
Sbjct: 212  DRGSLRCVQQHVMEARERLRKSLGHEKFVKLGFYDMGEDVAYKWSEEDEEIFREVVYSNP 271

Query: 1202 AYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            +  G+ FWK L   FP+R+K ELVSYYFNVF+
Sbjct: 272  SSLGKKFWKDLSVVFPSRSKRELVSYYFNVFI 303


>gb|KHG09453.1| AT-rich interactive domain-containing 2 -like protein [Gossypium
            arboreum]
          Length = 539

 Score =  204 bits (519), Expect = 1e-56
 Identities = 126/331 (38%), Positives = 173/331 (52%), Gaps = 2/331 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAV-PRIDSPGEAKSNDCKP 487
            MG KRP   E + EL FK P+Q D+N+K   F      S T   P I    E     C+ 
Sbjct: 1    MGFKRPFDSEELQELPFKHPRQFDNNNKLTQFADTISHSYTHQDPHISVDVEGGFRKCQ- 59

Query: 488  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
             D   E G             G  D    +               K  E + PLS +T+ 
Sbjct: 60   WDEAFETG-------------GLNDERPLVD--------------KDFETSAPLSLITSI 92

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S EE+     A+   + P Y D   P  +    +E+ +  +L+  PRK+V +GP+HQ  V
Sbjct: 93   SSEEDVDTGPAAISPISPEYFDFDFPR-RMLGPVEDAYSLLLDRSPRKQVLLGPNHQANV 151

Query: 848  PTWDPSAVGKYF-SVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCPD 1024
            P+       K+  + +++ +   ++E + GTC+IP P    S+ D   VG GRTDCSC D
Sbjct: 152  PSLGRHIKDKFVQNCASDTNDIGYEEIMMGTCVIPMPDSDLSANDSGKVGAGRTDCSCLD 211

Query: 1025 VGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPA 1204
             GS RCV+QHV EARE LR  +G   F +LGFY+MGE+VA  W+ E+E +F EVV++NPA
Sbjct: 212  GGSFRCVRQHVMEAREKLRKSLGHEKFVKLGFYDMGEDVAYKWSEEEEEIFREVVYSNPA 271

Query: 1205 YSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
              G+ FWK     FP+R+K ELVSYYFNVF+
Sbjct: 272  SLGKKFWKHFSMVFPSRSKRELVSYYFNVFI 302


>ref|XP_006483051.1| PREDICTED: uncharacterized protein LOC102614272 [Citrus sinensis]
          Length = 541

 Score =  201 bits (511), Expect = 2e-55
 Identities = 131/349 (37%), Positives = 182/349 (52%), Gaps = 20/349 (5%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E   EL +K  +Q D N+K + F                      ++  PC
Sbjct: 1    MGFKRPFDDEEFQELPYKHSRQLDINNKMIRF----------------------SEFGPC 38

Query: 491  DGMLENGNTHGASIAGNELEGTGDREEN---ISELSCKXXXXXXXXXKRLEVTLPLSFVT 661
            D   +  +T G        +G+G  E      SE             K  E + PLS+VT
Sbjct: 39   DAASQKHDTSGE-------DGSGFYEHQWHEASENGTVANELMNLVDKDFETSAPLSWVT 91

Query: 662  TSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQC 841
            +SS EE+AG+   +   L   + +   P  +     E+ +  +L+  PRK+VP+GP+HQ 
Sbjct: 92   SSSCEEDAGSGSTTHAPLSLEHIEYDYPR-RTFVPFEDSYSSLLDRSPRKQVPLGPNHQA 150

Query: 842  EVPTWDPSAVGKYFSV---------------SNNFSGSDWKENLRGTCIIPRP--GVYHS 970
             +P+WD S +GK                   S+N   +D +E   GTCIIP P    +  
Sbjct: 151  ILPSWDRS-MGKNILDGKATLRGNNSLVHLGSHNVVDNDNEEKWMGTCIIPMPDSNSFAH 209

Query: 971  SIDQFMVGRGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALN 1150
            +IDQ  VGRG  DC C D GS+RCVQQHV EARE L   +G   F +LG  +MGEEV+  
Sbjct: 210  NIDQ--VGRGIMDCDCLDEGSIRCVQQHVMEAREKLLKSLGHEKFVKLGLCDMGEEVSCK 267

Query: 1151 WTFEDERLFYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            W+ E+E++F+EVV++NP   GRNFWK L   FP+RTK+E+VSYYFNVF+
Sbjct: 268  WSEEEEQVFHEVVYSNPFSLGRNFWKQLSAVFPSRTKKEIVSYYFNVFV 316


>ref|XP_006438810.1| hypothetical protein CICLE_v10031172mg [Citrus clementina]
            gi|557541006|gb|ESR52050.1| hypothetical protein
            CICLE_v10031172mg [Citrus clementina]
          Length = 541

 Score =  201 bits (511), Expect = 2e-55
 Identities = 131/349 (37%), Positives = 182/349 (52%), Gaps = 20/349 (5%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E   EL +K  +Q D N+K + F                      ++  PC
Sbjct: 1    MGFKRPFDDEEFQELPYKHSRQLDINNKMIRF----------------------SEFGPC 38

Query: 491  DGMLENGNTHGASIAGNELEGTGDREEN---ISELSCKXXXXXXXXXKRLEVTLPLSFVT 661
            D   +  +T G        +G+G  E      SE             K  E + PLS+VT
Sbjct: 39   DAASQKHDTSGE-------DGSGFYEHQWHEASENGTVANELTNLVDKDFETSAPLSWVT 91

Query: 662  TSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQC 841
            +SS EE+AG+   +   L   + +   P  +     E+ +  +L+  PRK+VP+GP+HQ 
Sbjct: 92   SSSCEEDAGSGSTTHAPLSLEHIEYDYPR-RTFVPFEDSYSSLLDRSPRKQVPLGPNHQA 150

Query: 842  EVPTWDPSAVGKYFSV---------------SNNFSGSDWKENLRGTCIIPRP--GVYHS 970
             +P+WD S +GK                   S+N   +D +E   GTCIIP P    +  
Sbjct: 151  ILPSWDRS-MGKNILDGKATLRGNNSLDHLGSHNVVDNDNEEKWMGTCIIPMPDSNSFAH 209

Query: 971  SIDQFMVGRGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALN 1150
            +IDQ  VGRG  DC C D GS+RCVQQHV EARE L   +G   F +LG  +MGEEV+  
Sbjct: 210  NIDQ--VGRGIMDCDCLDEGSIRCVQQHVMEAREKLLKSLGHEKFVKLGLCDMGEEVSCK 267

Query: 1151 WTFEDERLFYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            W+ E+E++F+EVV++NP   GRNFWK L   FP+RTK+E+VSYYFNVF+
Sbjct: 268  WSEEEEQVFHEVVYSNPFSLGRNFWKQLSAVFPSRTKKEIVSYYFNVFV 316


>ref|XP_011012203.1| PREDICTED: uncharacterized protein LOC105116503 [Populus euphratica]
            gi|743799031|ref|XP_011012211.1| PREDICTED:
            uncharacterized protein LOC105116503 [Populus euphratica]
          Length = 533

 Score =  200 bits (509), Expect = 3e-55
 Identities = 130/341 (38%), Positives = 174/341 (51%), Gaps = 12/341 (3%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E   +  FKQ +Q D  +K   F   D  S    P+ D   +  S+  KP 
Sbjct: 1    MGFKRPFDDEEFQDHPFKQARQVDYCNKLTQFSETDAHSYMP-PKPDITDDCGSSVVKPL 59

Query: 491  -DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
                 EN                 D+   +S L+           K  + + PLS VT S
Sbjct: 60   WHETFEN-----------------DKVIEVSNLA-----------KDSDSSAPLSLVTCS 91

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S +E  G+  A+     P Y     P  +    L++   + L+ FPRK+VP+GP+HQ  +
Sbjct: 92   SSDENFGSGMAAS----PEYCQFEFPR-KMSMPLKDAHSFYLDDFPRKQVPLGPNHQASI 146

Query: 848  PTWDP-----------SAVGKYFSVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVG 994
            P WD            +  G   S S++   +D +E L GTCIIP P        ++  G
Sbjct: 147  PLWDNHIKKDKLVQFINPNGSSLSESDHHIYNDNEEKLMGTCIIPMPDTELQLCSRYEAG 206

Query: 995  RGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERL 1174
             GR+DC C D GS RCV+QH+ EARE L   IG   F  LGFY+MGEEVA  WT E+ER+
Sbjct: 207  CGRSDCGCLDEGSFRCVRQHIMEAREELIKSIGHEKFVNLGFYDMGEEVACKWTKEEERV 266

Query: 1175 FYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            F+EVV++ PA  G+NFWK L   FP+RT +E+VSYYFNVFM
Sbjct: 267  FHEVVYSRPASLGQNFWKHLAQVFPDRTTKEIVSYYFNVFM 307


>gb|KDO83102.1| hypothetical protein CISIN_1g009163mg [Citrus sinensis]
          Length = 541

 Score =  198 bits (503), Expect = 2e-54
 Identities = 131/349 (37%), Positives = 181/349 (51%), Gaps = 20/349 (5%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E   EL +K  +Q D N+K + F                      ++  PC
Sbjct: 1    MGFKRPFDDEEFQELPYKHSRQLDINNKMIRF----------------------SEFGPC 38

Query: 491  DGMLENGNTHGASIAGNELEGTGDREEN---ISELSCKXXXXXXXXXKRLEVTLPLSFVT 661
            D   E  +T G        +G+G  E      SE             K  E + PLS+VT
Sbjct: 39   DAASEKHDTSGE-------DGSGFYEHQWHEASENGTVANELTNLVDKDFETSGPLSWVT 91

Query: 662  TSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQC 841
            +SS EE+AG+   +   L   + +   P  +     E+ +  +L+  PRK+VP+GP+HQ 
Sbjct: 92   SSSCEEDAGSGSTTHAPLSLEHFEYDYPR-RTFVPFEDSYSSLLDRSPRKQVPLGPNHQA 150

Query: 842  EVPTWDPSAVGKYFSV---------------SNNFSGSDWKENLRGTCIIPRP--GVYHS 970
             +P+WD S +GK                   S+N   +D +E   GTCIIP P    +  
Sbjct: 151  ILPSWDRS-MGKNILDGKAMLRGNNSLDHLGSHNVVDNDNEEKWMGTCIIPMPDSNSFAH 209

Query: 971  SIDQFMVGRGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALN 1150
            +IDQ  VGR   DC C D GS+RCVQQHV EARE L   +G   F +LG  +MGEEV+  
Sbjct: 210  NIDQ--VGRDIMDCDCLDEGSIRCVQQHVMEAREKLLKSLGHEKFVKLGLCDMGEEVSCK 267

Query: 1151 WTFEDERLFYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            W+ E+E++F+EVV++NP   GRNFWK L   FP+RTK+E+VSYYFNVF+
Sbjct: 268  WSEEEEQVFHEVVYSNPFSLGRNFWKQLSSVFPSRTKKEIVSYYFNVFV 316


>ref|XP_004499488.1| PREDICTED: uncharacterized protein LOC101494171 [Cicer arietinum]
            gi|502126914|ref|XP_004499489.1| PREDICTED:
            uncharacterized protein LOC101494171 [Cicer arietinum]
          Length = 533

 Score =  197 bits (501), Expect = 4e-54
 Identities = 130/342 (38%), Positives = 170/342 (49%), Gaps = 16/342 (4%)
 Frame = +2

Query: 320  KRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPCD-G 496
            KRP   E + E+SFK PK    N   +        S++  P  D   +        CD G
Sbjct: 4    KRPFDAEEVLEVSFKHPKHSSPNDLLVPL------SESVFPNDDRHTQLPKTSEGGCDQG 57

Query: 497  MLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTSSHE 676
              E        I     +G GD E +                    V +P  + T+S+  
Sbjct: 58   SCECNEKLAGEICDELPKGAGDSEASFPV-----------------VGIPAPWATSSA-T 99

Query: 677  EEAGNEDASFLHLFPGYTDVGIP-------PWQPREQLENPFIYMLNHFPRKEVPVGPDH 835
            E+  +E    L LFP Y     P       P +   + E+ +  +L H PRK V VG +H
Sbjct: 100  EDLRSEQPIHLSLFPEYFSPERPIYFSPERPIRTLTRYEDIYSILLEHSPRKPVSVGANH 159

Query: 836  QCEVPTWDPSAVGKYFSVSNNFSGSDW--------KENLRGTCIIPRPGVYHSSIDQFMV 991
            Q +VP W  S        S   S S++        ++ L GTCIIP P +  +SIDQ  V
Sbjct: 160  QADVPPWGFSRASYVPHASGTVSDSNFTAWNRDEAEKRLMGTCIIPMPEMELTSIDQ-KV 218

Query: 992  GRGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDER 1171
            G+GRTDCSC D  S+RCV+QH+ E RE L   IG   F ELGF +MGE+VA  W+ EDE 
Sbjct: 219  GKGRTDCSCVDRESMRCVRQHIMEEREKLLKSIGFEKFTELGFADMGEQVAEKWSAEDEH 278

Query: 1172 LFYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            LF++VVF NPA   RNFW +L   FP+RTK+E+VSYYFNVFM
Sbjct: 279  LFHKVVFNNPASLNRNFWNYLSIVFPSRTKKEIVSYYFNVFM 320


>gb|KJB19398.1| hypothetical protein B456_003G100500, partial [Gossypium raimondii]
          Length = 551

 Score =  197 bits (501), Expect = 6e-54
 Identities = 140/364 (38%), Positives = 200/364 (54%), Gaps = 10/364 (2%)
 Frame = +2

Query: 236  FQSSDLCDFLLKFDNFPYFSINYLRMGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQ 415
            FQ SD+ D L K         N   M  KRP  E+ + E+S KQP+Q + +++ +L  ++
Sbjct: 55   FQFSDISDLLRK---------NKFNMVHKRPFIED-VFEVSCKQPRQAEHSNQWVL-SSE 103

Query: 416  DVTSQTAVPRIDSPGEAK-SNDCKPCDGMLENGNTHGASIAGNELEGTGDREENISELSC 592
             +  + A P  ++ GE + +N    CD  L N           + E  G+ E+       
Sbjct: 104  PLFPEDAAPFSNASGEGRFTNVNTKCDEKLANAI---------DTEHQGNPED------- 147

Query: 593  KXXXXXXXXXKRLEVTLP----LSFVTTSS-HEEEAGNEDASFLHLFPGYTDVGIP--PW 751
                        LE  +P    +SF+ TSS HEE+   ++   LH+ P + +   P  P 
Sbjct: 148  ------------LEANIPGCIAISFLGTSSTHEEDLWPDEP--LHM-PSFAECFNPERPV 192

Query: 752  QPREQLENPFIYMLNHFPRKEVPVGPDHQCEVPTWDPSAVGKYFSVSNNFS--GSDWKEN 925
            +   +LE+ +  +L + PRK V VGP++Q ++P WD S V +  S     S   S ++  
Sbjct: 193  RTVARLEDIYSILLQYPPRKPVLVGPNYQADIPEWD-SQVTRNASNCEEVSETASRYERE 251

Query: 926  LRGTCIIPRPGVYHSSIDQFMVGRGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLF 1105
            + GTCIIP P +  S+ D+  VG GRT+CSC D  SVRCV+QH+ EARE LR  +G   F
Sbjct: 252  MVGTCIIPIPALESSAYDE-KVGHGRTNCSCEDKDSVRCVRQHILEAREELRKSLGHERF 310

Query: 1106 KELGFYNMGEEVALNWTFEDERLFYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYF 1285
             ELGFY+MGE VA  W+  +E+LF++VVF NPA  GRNFW  L   FP+RTK ++VSYYF
Sbjct: 311  MELGFYDMGEVVAEKWSEHEEQLFHKVVFYNPASLGRNFWGSLASVFPHRTKADIVSYYF 370

Query: 1286 NVFM 1297
            NVFM
Sbjct: 371  NVFM 374


>ref|XP_002316094.2| hypothetical protein POPTR_0010s16720g [Populus trichocarpa]
            gi|550329966|gb|EEF02265.2| hypothetical protein
            POPTR_0010s16720g [Populus trichocarpa]
          Length = 402

 Score =  193 bits (490), Expect = 9e-54
 Identities = 129/341 (37%), Positives = 176/341 (51%), Gaps = 12/341 (3%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAV-PRIDSPGEAKSNDCKP 487
            MG KRP   E   +L FKQ +Q D  +K   F      S   + P I       ++DC  
Sbjct: 1    MGFKRPFDYEEFQDLPFKQARQVDYCNKLTQFSETGAHSYMPLKPDI-------TDDC-- 51

Query: 488  CDGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTS 667
                   GN+    +     E   D+   +S L+           K  + + PLS VT S
Sbjct: 52   -------GNSFVKPLWHETFEN--DKVIEVSNLA-----------KDSDFSAPLSLVTCS 91

Query: 668  SHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEV 847
            S +E   +  A+     P Y     P  +    L++   + L+ FPRK+VP+GP+HQ  +
Sbjct: 92   SSDENFESRMATS----PEYFQFEFPR-KMSMPLKDAHSFYLDDFPRKQVPLGPNHQASI 146

Query: 848  PTWD----PSAVGKYF-------SVSNNFSGSDWKENLRGTCIIPRPGVYHSSIDQFMVG 994
            P WD       + ++F       S S++   +D +E L GTCIIP P        ++  G
Sbjct: 147  PLWDNHIKKDKLVQFFNPNSSSLSESDHHIYNDNEEKLMGTCIIPMPDTELQLCSRYEAG 206

Query: 995  RGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERL 1174
             GR+DC C D GS RCV+QH+ EARE L   IG      LGFY+MGEEVA NWT E+ER+
Sbjct: 207  CGRSDCGCLDEGSFRCVRQHIMEAREELIKSIGHEKCVNLGFYDMGEEVACNWTKEEERV 266

Query: 1175 FYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            F+EVV++ PA  G+NFWK L   FP+RT +E+VSYYFNVFM
Sbjct: 267  FHEVVYSRPASLGQNFWKHLAQVFPDRTTKEIVSYYFNVFM 307


>gb|KHN17805.1| AT-rich interactive domain-containing protein 2 [Glycine soja]
            gi|947106276|gb|KRH54659.1| hypothetical protein
            GLYMA_06G201500 [Glycine max] gi|947106277|gb|KRH54660.1|
            hypothetical protein GLYMA_06G201500 [Glycine max]
          Length = 522

 Score =  195 bits (496), Expect = 2e-53
 Identities = 110/232 (47%), Positives = 140/232 (60%), Gaps = 11/232 (4%)
 Frame = +2

Query: 635  VTLPLSFVTTSSHEEEAGNEDASFLHLFPGYTDVGIPPWQPREQL---ENPFIYMLNHFP 805
            + +P S   TSS  ++   E    L LFP Y      P +P   L   E+ +  +L H P
Sbjct: 87   IDIPASSWATSSTTQDLHLEPPLHLSLFPEY----FSPERPIRTLTRYEDIYSILLEHSP 142

Query: 806  RKEVPVGPDHQCEVPTWDPSAVGKYFSVSNNFSGSDW--------KENLRGTCIIPRPGV 961
            RK V VGPDHQ +VP WD S      + S+  S SD+        ++ L GTC+IP P +
Sbjct: 143  RKPVSVGPDHQADVPAWDISGATNRPNASDAVSVSDFTVGDIDGTEKRLMGTCVIPMPQM 202

Query: 962  YHSSIDQFMVGRGRTDCSCPDVGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEV 1141
              SS D   VG+GRTDCSC D GS+RCV+QH+ E RE LR   G   F ELGF NMGE+V
Sbjct: 203  ELSSNDD-EVGKGRTDCSCEDQGSMRCVRQHIAEEREKLRKLFGPKKFTELGFTNMGEQV 261

Query: 1142 ALNWTFEDERLFYEVVFANPAYSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
            A +W+ EDE+LF+EVVF NP    +NFW +L   FP+ TK+E+VSYYFNVFM
Sbjct: 262  AESWSAEDEQLFHEVVFNNPVSLDKNFWNYLSIVFPSLTKKEIVSYYFNVFM 313


>ref|XP_012464094.1| PREDICTED: uncharacterized protein LOC105783280 isoform X2 [Gossypium
            raimondii] gi|763813483|gb|KJB80335.1| hypothetical
            protein B456_013G092500 [Gossypium raimondii]
          Length = 520

 Score =  195 bits (495), Expect = 2e-53
 Identities = 125/331 (37%), Positives = 172/331 (51%), Gaps = 2/331 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E + EL  K P+Q   ++K   F   D T     P  +SP        KP 
Sbjct: 1    MGFKRPFDDEELQELPIKNPRQFGYDNKLTRFA--DTT-----PHGNSPQ-------KP- 45

Query: 491  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTSS 670
                     H + + G   +   D E     L+           +  E + PLS VT++S
Sbjct: 46   ---------HISEVEGGFHKHQWDEEFKSDALN----DVAHLVDEDFETSAPLSLVTSTS 92

Query: 671  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 850
             EE+      +   +   Y D   P  +    +E+ +  +L+  PRK+VP+GP+HQ  VP
Sbjct: 93   SEEDISTGVTAISPVSSEYFDFDFPR-RTFSPVEDDYYLLLDRSPRKQVPLGPNHQANVP 151

Query: 851  TWDPSAVGKYFSVSNNFSGSDWK-ENLR-GTCIIPRPGVYHSSIDQFMVGRGRTDCSCPD 1024
            +W        F+ +     +D   E+++ GTCIIP P    S+     VG GR DCSC D
Sbjct: 152  SWGRHIKNIKFAQNEASKAADIDHEDIKMGTCIIPLPDSDLSANSSDKVGAGRFDCSCLD 211

Query: 1025 VGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPA 1204
             GS+RCVQQHV EAR+ LR  +G   F +LGFYNMGE+VA  W+ EDE +F EVV+ NP 
Sbjct: 212  RGSLRCVQQHVTEARKLLRKSLGHEKFVKLGFYNMGEDVAYKWSEEDEEIFREVVYTNPV 271

Query: 1205 YSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
              G+ FWK L   FP+R+K E+VSYYFNVF+
Sbjct: 272  SLGKKFWKHLSVVFPSRSKREIVSYYFNVFI 302


>ref|XP_012464093.1| PREDICTED: uncharacterized protein LOC105783280 isoform X1 [Gossypium
            raimondii] gi|763813484|gb|KJB80336.1| hypothetical
            protein B456_013G092500 [Gossypium raimondii]
          Length = 521

 Score =  194 bits (494), Expect = 3e-53
 Identities = 125/331 (37%), Positives = 173/331 (52%), Gaps = 2/331 (0%)
 Frame = +2

Query: 311  MGVKRPLGEENIPELSFKQPKQPDDNSKSLLFMTQDVTSQTAVPRIDSPGEAKSNDCKPC 490
            MG KRP  +E + EL  K P+Q   ++K   F   D T     P  +SP        KP 
Sbjct: 1    MGFKRPFDDEELQELPIKNPRQFGYDNKLTRFA--DTT-----PHGNSPQ-------KPH 46

Query: 491  DGMLENGNTHGASIAGNELEGTGDREENISELSCKXXXXXXXXXKRLEVTLPLSFVTTSS 670
              +   G  H       + E   D   +++ L            +  E + PLS VT++S
Sbjct: 47   ISVEVEGGFHKHQW---DEEFKSDALNDVAHL----------VDEDFETSAPLSLVTSTS 93

Query: 671  HEEEAGNEDASFLHLFPGYTDVGIPPWQPREQLENPFIYMLNHFPRKEVPVGPDHQCEVP 850
             EE+      +   +   Y D   P  +    +E+ +  +L+  PRK+VP+GP+HQ  VP
Sbjct: 94   SEEDISTGVTAISPVSSEYFDFDFPR-RTFSPVEDDYYLLLDRSPRKQVPLGPNHQANVP 152

Query: 851  TWDPSAVGKYFSVSNNFSGSDWK-ENLR-GTCIIPRPGVYHSSIDQFMVGRGRTDCSCPD 1024
            +W        F+ +     +D   E+++ GTCIIP P    S+     VG GR DCSC D
Sbjct: 153  SWGRHIKNIKFAQNEASKAADIDHEDIKMGTCIIPLPDSDLSANSSDKVGAGRFDCSCLD 212

Query: 1025 VGSVRCVQQHVKEAREFLRVDIGEWLFKELGFYNMGEEVALNWTFEDERLFYEVVFANPA 1204
             GS+RCVQQHV EAR+ LR  +G   F +LGFYNMGE+VA  W+ EDE +F EVV+ NP 
Sbjct: 213  RGSLRCVQQHVTEARKLLRKSLGHEKFVKLGFYNMGEDVAYKWSEEDEEIFREVVYTNPV 272

Query: 1205 YSGRNFWKFLGFAFPNRTKEELVSYYFNVFM 1297
              G+ FWK L   FP+R+K E+VSYYFNVF+
Sbjct: 273  SLGKKFWKHLSVVFPSRSKREIVSYYFNVFI 303

