BLASTX nr result

ID: Rehmannia26_contig00007647 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00007647
         (1931 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004241776.1| PREDICTED: uncharacterized protein LOC101261...   357   1e-95
ref|XP_006364924.1| PREDICTED: micronuclear linker histone polyp...   349   3e-93
ref|XP_004245024.1| PREDICTED: uncharacterized protein LOC101258...   339   2e-90
gb|EMJ24332.1| hypothetical protein PRUPE_ppa005440mg [Prunus pe...   325   6e-86
ref|XP_006483365.1| PREDICTED: enolase-phosphatase E1-like isofo...   323   1e-85
ref|XP_006450433.1| hypothetical protein CICLE_v10008326mg [Citr...   321   6e-85
ref|XP_002309627.2| hypothetical protein POPTR_0006s27050g [Popu...   320   1e-84
ref|XP_002285138.1| PREDICTED: uncharacterized protein LOC100244...   316   2e-83
ref|XP_002515513.1| conserved hypothetical protein [Ricinus comm...   309   3e-81
gb|EXC46039.1| hypothetical protein L484_000806 [Morus notabilis]     307   1e-80
gb|EOY29480.1| TPX2 family protein, putative [Theobroma cacao]        304   8e-80
ref|XP_004291456.1| PREDICTED: uncharacterized protein LOC101310...   299   3e-78
ref|XP_004173325.1| PREDICTED: uncharacterized protein LOC101231...   296   2e-77
ref|XP_004136350.1| PREDICTED: uncharacterized protein LOC101207...   295   5e-77
ref|XP_002324860.2| hypothetical protein POPTR_0018s01730g [Popu...   292   4e-76
gb|ABK95344.1| unknown [Populus trichocarpa]                          290   1e-75
emb|CAN82789.1| hypothetical protein VITISV_030600 [Vitis vinifera]   280   2e-72
ref|XP_003549281.1| PREDICTED: protein gar2-like isoform X1 [Gly...   276   2e-71
ref|XP_006601154.1| PREDICTED: protein gar2-like isoform X3 [Gly...   273   3e-70
ref|XP_003588767.1| Seed specific protein Bn15D14A [Medicago tru...   272   3e-70

>ref|XP_004241776.1| PREDICTED: uncharacterized protein LOC101261927 [Solanum
            lycopersicum]
          Length = 476

 Score =  357 bits (915), Expect = 1e-95
 Identities = 225/485 (46%), Positives = 286/485 (58%), Gaps = 24/485 (4%)
 Frame = -2

Query: 1720 VMDADTTIVVSGNG-DFENGLHQQLPISTTI-NGTSNGSLEVEGLGENLEDASNVNDQKA 1547
            +MD    I VSGNG  F NG+HQ   +   I NG  + S   E L  + +    + D + 
Sbjct: 1    MMDVVNNIAVSGNGLGFGNGVHQLPEVVAGISNGMPHASPSNEELERSFQSTVILTDSET 60

Query: 1546 LGST-QDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAA 1370
            +GS+ Q+++ E  +  ES++  + +E   KES D+ N K  KT  RA+N K   P+   A
Sbjct: 61   VGSSVQEVSNETTITVESNAGVSSEEHEAKESDDATNSKEQKTPPRARNAKNSGPQNGVA 120

Query: 1369 TGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQN-- 1196
                KSKDGKE      ASNG +AS+ RP+Q+S+  AK KS ++K+  +     ++ +  
Sbjct: 121  ---KKSKDGKE------ASNGTLASKPRPKQSSSLDAKGKSFSDKKTVEYYSKPALAHLN 171

Query: 1195 ----KQQHGLPD-ATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAK 1031
                KQQ G  + A S+S S  QSE L EKTKL  LKK                 A DAK
Sbjct: 172  VDRAKQQPGHAEVAASASPSAAQSEGLKEKTKLMPLKKVPPAKADGSAESSSPTAASDAK 231

Query: 1030 ALRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLR 851
              ++GTLPTY  SFKC+ RAEKRKEFYSKLEEK QAKE EK+N+QAK+KETQEAE+KMLR
Sbjct: 232  PRKVGTLPTYNISFKCDARAEKRKEFYSKLEEKTQAKEVEKSNMQAKTKETQEAEIKMLR 291

Query: 850  KSLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPAR 671
            KSL FKATPMPSFYQEP PPK+ELKKIP TRAKSPKLGR+KSSPT +         RP R
Sbjct: 292  KSLKFKATPMPSFYQEPAPPKMELKKIPPTRAKSPKLGRRKSSPTKERINESVM--RPGR 349

Query: 670  LSLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITT------- 512
            LSLDE  SQNN  +      VKKPQRKSLPKLP E TNLS E +K S RK ++       
Sbjct: 350  LSLDENASQNNPVKGHSPLIVKKPQRKSLPKLPSEKTNLSNETRKLSIRKSSSSKESAEA 409

Query: 511  -------PKETGESEVQLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQ 353
                   PKET E   Q NN  ++ ++  +D +  EV    EP ++E+ V  +      Q
Sbjct: 410  ASLPNALPKETSEVSSQPNNQHKQATEFDADGRECEVVSVVEPSQTETGVKAQIETNLVQ 469

Query: 352  EAIAV 338
            E + +
Sbjct: 470  EHVTI 474


>ref|XP_006364924.1| PREDICTED: micronuclear linker histone polyprotein-like isoform X1
            [Solanum tuberosum] gi|565398738|ref|XP_006364925.1|
            PREDICTED: micronuclear linker histone polyprotein-like
            isoform X2 [Solanum tuberosum]
          Length = 474

 Score =  349 bits (895), Expect = 3e-93
 Identities = 216/472 (45%), Positives = 286/472 (60%), Gaps = 16/472 (3%)
 Frame = -2

Query: 1720 VMDADTTIVVSGNGD-FENGLHQQLPIST-TINGTSNGSLEVEGLGENLEDASNVNDQKA 1547
            +M+ D  I VS NG  F NG+HQ   + T  ++   NGS  V+GL  +L+ A  VND + 
Sbjct: 1    MMEVDNNISVSVNGSSFGNGIHQLPEVVTGKLDDVPNGSFSVQGLERSLQSAVMVNDSET 60

Query: 1546 LGST-QDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAA 1370
            +GST  ++  E     E++  ++ +    KES +SKN K  K  G+ KN         A+
Sbjct: 61   VGSTAHEVAHESTTTIENNPCASSEGHEAKESRESKNSKQPKAPGKGKN--------TAS 112

Query: 1369 TGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASV---- 1202
             G+ K+KDGK+    SV SNG++AS+ R +Q S+   KSKS N+++ ADN+   +V    
Sbjct: 113  IGVKKTKDGKDASAGSVVSNGSLASQQRSKQASSLGVKSKSFNDRKTADNNLKPAVARIN 172

Query: 1201 -QNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKAL 1025
              + +Q G PDATS S +   ++ L EKT   +LKK                 A DAK+ 
Sbjct: 173  ASHAKQSGQPDATSPSPN---ADGLKEKTNPISLKKAAPNKADGNAESPLSP-AADAKSR 228

Query: 1024 RLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKS 845
            ++G LPTY  SFKCNERAEKR+EFYSKLEEKI AKE E++NLQAK+KETQEAE+KMLRKS
Sbjct: 229  KVGALPTYNMSFKCNERAEKRREFYSKLEEKIHAKEVEQSNLQAKTKETQEAEIKMLRKS 288

Query: 844  LAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLS 665
            L FKATPMPSFYQEPPPP+VELKKIPTTRAKSPKLGR+KSSPT ++      T   +RLS
Sbjct: 289  LKFKATPMPSFYQEPPPPQVELKKIPTTRAKSPKLGRRKSSPTKEANHTNMHT---SRLS 345

Query: 664  LDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGES-- 491
            LD+  SQN  A+    D+VKKP RKSLPKLP +  NL    KK S  K +  +ET E+  
Sbjct: 346  LDKSASQNP-AKGHPPDNVKKPTRKSLPKLPSQKINLLSNTKKPSLIKTSKCQETNEAAS 404

Query: 490  ------EVQLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQ 353
                    Q NN  E+T++  +  Q  E     E  +S++ V+ ++     Q
Sbjct: 405  NNMSAVASQPNNAPEQTNEIETFVQEHEATSVVETSQSKTFVEAQSETNVVQ 456


>ref|XP_004245024.1| PREDICTED: uncharacterized protein LOC101258086 [Solanum
            lycopersicum]
          Length = 460

 Score =  339 bits (870), Expect = 2e-90
 Identities = 213/478 (44%), Positives = 288/478 (60%), Gaps = 17/478 (3%)
 Frame = -2

Query: 1720 VMDADTTIVVSGNGDF-ENGLHQQLP--ISTTINGTSNGSLEVEGLGENLEDASNVNDQK 1550
            +M+ D  I VS NG    NG+HQ LP  ++  ++   NG   V+GL  + + A  VND +
Sbjct: 1    MMEIDNNISVSVNGSSCGNGIHQ-LPEVVAAKLDDVPNGGFSVQGLERSFQSAVMVNDSE 59

Query: 1549 ALGST-QDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAA 1373
             +GST  ++T E     E++  ++ +    KES +SKN K  K  G+ KN          
Sbjct: 60   TVGSTVHEVTHESTTTIENNPCASSEGHEAKESRESKNSKQSKAPGKGKN--------TV 111

Query: 1372 ATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNS-----GAA 1208
            + G+ K+KD       SV  NG++AS+ R +QTS+   KSKS N+++ ADN+        
Sbjct: 112  SIGVKKTKDAST---GSVVLNGSLASQQRSKQTSSLGVKSKSFNDRKTADNNLKPPVARI 168

Query: 1207 SVQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKA 1028
            +V + +Q G PDATS S +   ++ L EKT   +LKK                 A DAK+
Sbjct: 169  NVSHAKQSGQPDATSPSPN---ADSLREKTNPISLKKAAPNNADGNAESPLSP-AADAKS 224

Query: 1027 LRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRK 848
             ++G LPTY  SFKC+ERAEKR+EFYSKLEEKI AKE EK+NLQAK+KETQEAE+KMLRK
Sbjct: 225  RKVGALPTYNMSFKCDERAEKRREFYSKLEEKIHAKEVEKSNLQAKTKETQEAEIKMLRK 284

Query: 847  SLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARL 668
            SL FKATPMPSFYQEPPPP+VELKKIPTTRAKSPKLGR+KSSPT ++      T   +RL
Sbjct: 285  SLKFKATPMPSFYQEPPPPQVELKKIPTTRAKSPKLGRRKSSPTKEADHTSMHT---SRL 341

Query: 667  SLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGES- 491
            SLD+ +SQN  A+    ++VKKP R+SLPKLP +  NL    KK S  K +  +ET E+ 
Sbjct: 342  SLDKNVSQNP-AKGHPPENVKKPTRRSLPKLPSQKINLLSNTKKPSPIKTSISQETNEAA 400

Query: 490  -------EVQLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQEAIAV 338
                     Q NN+SE+T++  +  Q  +     E  +S++ V+ ++     Q  IAV
Sbjct: 401  SNNMSAVASQPNNVSEQTNEIVTFVQKHDATSVVETSQSKTFVEAQSETNVVQPPIAV 458


>gb|EMJ24332.1| hypothetical protein PRUPE_ppa005440mg [Prunus persica]
          Length = 461

 Score =  325 bits (832), Expect = 6e-86
 Identities = 217/477 (45%), Positives = 277/477 (58%), Gaps = 17/477 (3%)
 Frame = -2

Query: 1717 MDADTTIVVSG-NGDFENGLHQQLPI-STTINGTSNGSLEVEGLGEN--LEDASNVNDQK 1550
            MD+D  +   G     +NG+H Q  + S  INGT + +   E    N  +E+   ++D  
Sbjct: 1    MDSDNLVATYGLEVAHQNGVHGQPGVVSDNINGTVSETTTTETAAPNGKIENVVKLDDGV 60

Query: 1549 ALGS-TQDITEEPALPPESH----STSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSP 1385
               S T +  EE  + PE +    + +  KE  VK S  SK  K  K  G++KN KP  P
Sbjct: 61   TNNSSTGEAKEESTVNPERNGLTIALTIAKEGEVKGSLHSKQTKVQKGQGKSKNEKPSGP 120

Query: 1384 KRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAAS 1205
            K  +   + KSKDG +   ++  SNG+ A+ SRP+Q +    K++S N +Q         
Sbjct: 121  KNVSPVWMKKSKDGNDGEVTAAVSNGSAATTSRPKQPN----KTRSFNGRQ--------- 167

Query: 1204 VQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKAL 1025
            VQ+  Q    D        E SE   EKTKLK LKK                T GD K  
Sbjct: 168  VQSSNQLEKSDT-------ELSEGTVEKTKLKPLKKDSLNKAEGESQSSLSPTEGDMKPP 220

Query: 1024 RLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKS 845
            R+ TLP YGFSF+C+ERAEKR+EFY+KLEEKI AKE EKNNLQAKSKET EAE++MLRK 
Sbjct: 221  RVSTLPNYGFSFRCDERAEKRREFYTKLEEKIHAKEMEKNNLQAKSKETLEAEIRMLRKK 280

Query: 844  LAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLS 665
            L FKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGR+KS P A S+ N     R +RLS
Sbjct: 281  LTFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRRKSLPPAVSEGNSNTNDRSSRLS 340

Query: 664  LDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEV 485
            LDEK+ QN+ A+     H KKPQRKSLP+LP E T L       + RKIT+ K T E + 
Sbjct: 341  LDEKVPQNS-AKGPSPVHPKKPQRKSLPRLPSEKTTL---PNAGNERKITS-KATNEGKN 395

Query: 484  QL-NNLSEET-------SKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQEAIAV 338
             L + ++EE        S++ SDTQ QE  P AE  +++ + D+E  +E Q + I V
Sbjct: 396  NLIDAMNEENATLPNAKSEAGSDTQEQEAVPKAETSEAQPHTDDETVVEEQHDPIYV 452


>ref|XP_006483365.1| PREDICTED: enolase-phosphatase E1-like isoform X1 [Citrus sinensis]
          Length = 439

 Score =  323 bits (829), Expect = 1e-85
 Identities = 211/468 (45%), Positives = 265/468 (56%), Gaps = 8/468 (1%)
 Frame = -2

Query: 1717 MDADTTIVVSGNG-DFENGLHQQLP-------ISTTINGTSNGSLEVEGLGENLEDASNV 1562
            MD+D   V  G+    +NG H+QL        I+  +N T   +    G      D+  V
Sbjct: 1    MDSDDLKVAEGDEVALQNGAHKQLVASGEDGVIADDVNQTITETARPNG------DSETV 54

Query: 1561 NDQKALGSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPK 1382
            +     G+T ++ E  +   ES+     K    K +  SK   PLK  G++K+ KPL+PK
Sbjct: 55   DKLDESGTTGEVMEGESDNVESNGLVVAKTGKGKAADTSKQSIPLKGHGKSKSEKPLNPK 114

Query: 1381 RAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASV 1202
              ++TG+ KSKDGK    +S  SNG++   S  +Q+     KS + NE+Q          
Sbjct: 115  NVSSTGVKKSKDGKNDDGTSTISNGSVGLNSHSKQSF----KSMTFNERQA--------- 161

Query: 1201 QNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALR 1022
            Q  +Q G  D  SS       E L EKTK K LKKG                + DAK  R
Sbjct: 162  QFSKQSGKSDTPSS-------EGLAEKTKSKPLKKGPPEKAGKDLDYK----SDDAKPRR 210

Query: 1021 LGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSL 842
            +G LP YGFSF+C+ERAEKR+EFYSKLEEKI AKE EK+ LQAKSKETQEAE+KMLRKSL
Sbjct: 211  VGALPNYGFSFRCDERAEKRREFYSKLEEKIHAKEVEKSTLQAKSKETQEAEIKMLRKSL 270

Query: 841  AFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSL 662
             FKATPMP+FYQEPPPPKVELKKIPTTRAKSPKLGR+KSS  ADS E+   + RP RLSL
Sbjct: 271  NFKATPMPTFYQEPPPPKVELKKIPTTRAKSPKLGRRKSSTPADSVEDST-SCRPGRLSL 329

Query: 661  DEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEVQ 482
            D K   +N AR     H KKPQRKSLPKLP E   +    K+ ++     P E   +   
Sbjct: 330  DAKGPPSNSARGISPVHPKKPQRKSLPKLPSEKATILNSMKEENTTSSKAPNEENTTS-- 387

Query: 481  LNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQEAIAV 338
                S  T + AS T+ QE  PTAEP +++ + DE    E Q + I V
Sbjct: 388  ----SNATKEVASPTEEQEQIPTAEPEETQFHKDEGLVAEEQAQPILV 431


>ref|XP_006450433.1| hypothetical protein CICLE_v10008326mg [Citrus clementina]
            gi|557553659|gb|ESR63673.1| hypothetical protein
            CICLE_v10008326mg [Citrus clementina]
          Length = 439

 Score =  321 bits (823), Expect = 6e-85
 Identities = 210/468 (44%), Positives = 264/468 (56%), Gaps = 8/468 (1%)
 Frame = -2

Query: 1717 MDADTTIVVSGNG-DFENGLHQQLP-------ISTTINGTSNGSLEVEGLGENLEDASNV 1562
            MD+D   V  G+    +NG H+QL        I+  +N T   +    G      D+  V
Sbjct: 1    MDSDDLKVAEGDEVALQNGAHKQLVASGEDGVIADDVNQTITETARPNG------DSETV 54

Query: 1561 NDQKALGSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPK 1382
            +     G+T ++ E  +   ES+          K +  SK   PLK  G++K+ KPL+PK
Sbjct: 55   DKLDESGTTGEVMEGESDNVESNGLVVATTGKGKAADTSKQSIPLKGHGKSKSEKPLNPK 114

Query: 1381 RAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASV 1202
              ++TG+ KSKDGK    +S  SNG++   S  +Q+     KS + NE+Q          
Sbjct: 115  NVSSTGVKKSKDGKNDDGTSTISNGSVGLNSHSKQSF----KSMTFNERQS--------- 161

Query: 1201 QNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALR 1022
            Q  +Q G  D  SS       E L EKTK K LKKG                + DAK  R
Sbjct: 162  QFSKQSGKSDTPSS-------EGLAEKTKSKPLKKGPPEKAGKDLDYK----SDDAKPRR 210

Query: 1021 LGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSL 842
            +G LP YGFSF+C+ERAEKR+EFYSKLEEKI AKE EK+ LQAKSKETQEAE+KMLRKSL
Sbjct: 211  VGALPNYGFSFRCDERAEKRREFYSKLEEKIHAKEVEKSTLQAKSKETQEAEIKMLRKSL 270

Query: 841  AFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSL 662
             FKATPMP+FYQEPPPPKVELKKIPTTRAKSPKLGR+KSS  ADS E+   + RP RLSL
Sbjct: 271  NFKATPMPTFYQEPPPPKVELKKIPTTRAKSPKLGRRKSSTPADSVEDST-SCRPGRLSL 329

Query: 661  DEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEVQ 482
            D K   +N AR     H KKPQRKSLPKLP E   +    K+ ++     P E   +   
Sbjct: 330  DAKGPPSNSARGISPVHPKKPQRKSLPKLPSEKATILNSMKEENTTSSKAPNEENTTS-- 387

Query: 481  LNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQEAIAV 338
                S  T + AS T+ QE  PTAEP +++ + DE    E Q + I V
Sbjct: 388  ----SNATKEVASPTEEQEQIPTAEPEETQFHKDEGLVAEEQAQPILV 431


>ref|XP_002309627.2| hypothetical protein POPTR_0006s27050g [Populus trichocarpa]
            gi|550337170|gb|EEE93150.2| hypothetical protein
            POPTR_0006s27050g [Populus trichocarpa]
          Length = 436

 Score =  320 bits (821), Expect = 1e-84
 Identities = 215/475 (45%), Positives = 266/475 (56%), Gaps = 15/475 (3%)
 Frame = -2

Query: 1717 MDADTTIVVSGNGD--FENGLHQQLP-------ISTTINGTSNGSLEVE-GLGENLEDAS 1568
            MD+D  ++  G  +   +NG HQQ P       +S  +NG+   + +++ G  +NL    
Sbjct: 1    MDSDNHLLPDGGLEAAHQNGGHQQSPAAGEDGVVSNNLNGSVGNTFKLDDGTTDNLSTGE 60

Query: 1567 NVNDQKA-LGSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPL 1391
              ++ KA +GS         LP          E+ VK++ +S+N K  K  G+    KP 
Sbjct: 61   VEDELKAYVGSN-------GLPVFKEG-----EVKVKDADNSENAKSQKGPGKRGTAKPS 108

Query: 1390 SPKRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGA 1211
              K A+AT + K KDG++       SNG++A  S+ +Q      KSKS NE+QG      
Sbjct: 109  HLKNASATQVKKGKDGRDAEVQLTVSNGSVAVNSQLKQ----HLKSKSFNERQG------ 158

Query: 1210 ASVQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAK 1031
               Q  +Q G  DA          E + EKTKLK LKKG               T  DAK
Sbjct: 159  ---QASKQSGTSDAGPP-------EGIVEKTKLKPLKKGPVDKAEADTDSTSSPTVEDAK 208

Query: 1030 ALRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLR 851
              ++G LP YGFSFKC+ERAEKRKEFYSKLEEKI AKE EK  LQAKSKET EAE+KMLR
Sbjct: 209  PRKVGALPNYGFSFKCDERAEKRKEFYSKLEEKIHAKEVEKTTLQAKSKETHEAEIKMLR 268

Query: 850  KSLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPAR 671
            KSL FKATPMPSFYQEP PPKVELKKIPTTRAKSPKLGR+KSS  AD++ N + + RP R
Sbjct: 269  KSLGFKATPMPSFYQEPAPPKVELKKIPTTRAKSPKLGRRKSSSPADTEGNNSQSYRPGR 328

Query: 670  LSLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGES 491
            LSLDEK+S N   +     H KKPQRKSLPKLP E T LS      S  K   PK + E 
Sbjct: 329  LSLDEKVSSNIPIKGLSPAHPKKPQRKSLPKLPSEKTKLS------SDEKTKLPKASNEE 382

Query: 490  EVQLNNLSEETSKSASDTQAQEVAPTAE----PIKSESNVDEENALEAQQEAIAV 338
               L+N S E S   S TQ QE     E    P K E+ V EE      ++ +A+
Sbjct: 383  NPTLSNQSNEGS---SPTQEQEAVSKNESEFLPGKDETAVKEEAQATLAKDPVAL 434


>ref|XP_002285138.1| PREDICTED: uncharacterized protein LOC100244101 [Vitis vinifera]
            gi|296082039|emb|CBI21044.3| unnamed protein product
            [Vitis vinifera]
          Length = 439

 Score =  316 bits (810), Expect = 2e-83
 Identities = 202/466 (43%), Positives = 273/466 (58%), Gaps = 9/466 (1%)
 Frame = -2

Query: 1720 VMDADTTIVVSGNGD-FENGLHQQLP-------ISTTINGTSNGSLEVEGLGENLEDASN 1565
            VMD D  + V+G  +  +NG+H+QL        I   +NG  + S E  G+  N E+   
Sbjct: 8    VMDVDDLLPVNGLEEGHQNGIHEQLSAAGGEGVIPEKVNGNLDLSTESAGMNGNAENVGM 67

Query: 1564 VNDQKALG-STQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLS 1388
             +D   +  ST ++ E   +    +  +  ++L V+++  SK+ KP K  G++   K  S
Sbjct: 68   WDDNGIINASTAEVGEGSHIRARVNGLTISEDLEVEDADPSKHSKPQKGQGKSSKEKLSS 127

Query: 1387 PKRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAA 1208
            PK A  T + K KDGK+ + +S ++NG++AS SRP+QT     KS+S ++KQ        
Sbjct: 128  PKHAGTTWVKK-KDGKDEIVTSASTNGSLASISRPKQT----LKSRSFSDKQDH-----L 177

Query: 1207 SVQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKA 1028
            S Q+K      +A SS+++  Q E   EKT+LK +K G                  D K 
Sbjct: 178  SKQSKNS----EAASSTSNMIQPEGRAEKTRLKPVKLGAPTVSDVNTKSPSPTE--DTKP 231

Query: 1027 LRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRK 848
             R+  LP+Y FSF+C+ERAEKR+EFY+KLEEK  AKE E+ NLQAKSKETQEAE+KMLRK
Sbjct: 232  RRVAALPSYNFSFRCDERAEKRREFYTKLEEKTHAKEIERTNLQAKSKETQEAEIKMLRK 291

Query: 847  SLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARL 668
            SL FKATPMPSFYQEPPPPKVELKKIP TRAKSPKLGRKKSSP  +S+  G+ + R  RL
Sbjct: 292  SLTFKATPMPSFYQEPPPPKVELKKIPPTRAKSPKLGRKKSSPAPESE--GSSSHRSGRL 349

Query: 667  SLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESE 488
            SLDEK+SQNN A+     H KKP RKSLPKLP E T                        
Sbjct: 350  SLDEKVSQNNPAKGISPGHPKKPLRKSLPKLPSERT------------------------ 385

Query: 487  VQLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQE 350
                NLS+ T+++A  +Q QE     +P KS+ + D+++ +E Q +
Sbjct: 386  ----NLSKSTNEAAFLSQQQEPVQVPDPSKSQPDADDKSEVEEQAQ 427


>ref|XP_002515513.1| conserved hypothetical protein [Ricinus communis]
            gi|223545457|gb|EEF46962.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 426

 Score =  309 bits (791), Expect = 3e-81
 Identities = 206/462 (44%), Positives = 256/462 (55%), Gaps = 2/462 (0%)
 Frame = -2

Query: 1717 MDADTTIVVSG--NGDFENGLHQQLPISTTINGTSNGSLEVEGLGENLEDASNVNDQKAL 1544
            M+ D T+ + G       NG+H+Q   S      SNG+LE       LED+   N   A 
Sbjct: 1    MEFDDTVPIDGLVETSHRNGIHEQSLASMDDGVVSNGNLEN---ASKLEDSITSNTSSA- 56

Query: 1543 GSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAATG 1364
                ++ E   +   S+  +  KE  VK  G S++ K LK  G++K+ K  +PK  +AT 
Sbjct: 57   ---GEVCERSNVHVGSNGLTGCKEGNVKNEGHSEHAKSLKGPGKSKSEKSSNPKNTSATQ 113

Query: 1363 LSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQNKQQH 1184
            L K KDGK    +   SNG+  S S+ +Q      KSKS +E+          VQ  +  
Sbjct: 114  LKKRKDGKVAGAAPTVSNGSATSNSQSKQP----LKSKSFSERL---------VQTAKHP 160

Query: 1183 GLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRLGTLPT 1004
               D TSS       E L E  KLK  K                 TA DAK  R+  LP 
Sbjct: 161  AKCDVTSS-------EGLMETLKLKTSK--GPAKAEEIAQASLSPTAEDAKPRRVAALPN 211

Query: 1003 YGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLAFKATP 824
            YGFSFKC+ERAEKR+EFYSKLEEKI AKE E NNLQAKSKETQEAE+KMLRKSLAFKATP
Sbjct: 212  YGFSFKCDERAEKRREFYSKLEEKIHAKELEMNNLQAKSKETQEAEIKMLRKSLAFKATP 271

Query: 823  MPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKLSQ 644
            MPSFYQEPPPPK+ELKKIPTTR KSPKLGRKKSS   DS+EN   + R ARLSLD+K+S 
Sbjct: 272  MPSFYQEPPPPKMELKKIPTTRPKSPKLGRKKSSSPVDSEENDDQSRRLARLSLDQKVSH 331

Query: 643  NNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEVQLNNLSE 464
            NN A+       KKPQRKSLPKLP + T+LS      +  K+ + + T E  V     S 
Sbjct: 332  NNAAKGPSPIRSKKPQRKSLPKLPSQKTSLS---SAVNDEKVISSEATNEENV----TSN 384

Query: 463  ETSKSASDTQAQEVAPTAEPIKSESNVDEENALEAQQEAIAV 338
            +T++ +S  + Q    TA   +     D E  +  Q +   V
Sbjct: 385  QTNEGSSPAEEQNAILTAVAGEVHFQTDGEFVVGEQAQPTVV 426


>gb|EXC46039.1| hypothetical protein L484_000806 [Morus notabilis]
          Length = 462

 Score =  307 bits (786), Expect = 1e-80
 Identities = 206/475 (43%), Positives = 272/475 (57%), Gaps = 27/475 (5%)
 Frame = -2

Query: 1717 MDADTTIVVSG-NGDFENGLHQQLP-------ISTTINGTSNGSLEVEGLGENLEDASNV 1562
            MD+D  ++  G     +NG ++Q P       IS  +N  S  S E      N +  +++
Sbjct: 1    MDSDNVVLTDGYEVAHQNGAYEQTPATVEDFVISDNVNVPSIKSNETAVPNGNAKVVAHL 60

Query: 1561 NDQKALGST-QDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSP 1385
            +D  A  S+ +++ EE      S+  +  KE   K S  SK  K  K +G +KNGKP S 
Sbjct: 61   DDGIAKNSSSEEVKEESINSIASNGLTVAKEGEAKVSDQSKQPKSQKGLGTSKNGKPSST 120

Query: 1384 KRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAAS 1205
            K    + L K+KDGK V   S   NG++AS S+PR+T     KSKS N+++      AA 
Sbjct: 121  KNDLGSSLKKNKDGKAVEAISTIPNGSVASNSQPRKT----IKSKSFNDRKQPVKPEAA- 175

Query: 1204 VQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXT-AGDAKA 1028
                       AT  +T         +K KLK LKK                  AG+AK 
Sbjct: 176  -----------ATEGNT---------DKLKLKPLKKEPVNKAEVEADTKSSSPTAGEAKP 215

Query: 1027 LRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRK 848
             R+  LP YGFSFKC+ERAEKR+EFY+KL EKI AKE E+ NLQAKSKETQEAE+K+LRK
Sbjct: 216  PRVAMLPNYGFSFKCDERAEKRREFYTKLGEKIHAKEMEQTNLQAKSKETQEAEIKLLRK 275

Query: 847  SLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARL 668
            SLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGR+KS P  +S+ +   T +  RL
Sbjct: 276  SLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRRKSLPPTESEGSSNPTNQSGRL 335

Query: 667  SLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESE 488
            SLDEK+S+N+ A+   V   +KP+RKSLP LP E  +L+     T  RK T+ K T E +
Sbjct: 336  SLDEKVSKNS-AKGPAV-QPRKPERKSLPTLPSEKASLA---NATKGRKTTSSKATNEEK 390

Query: 487  VQLNNLSEE-----------------TSKSASDTQAQEVAPTAEPIKSESNVDEE 374
              L+N ++E                  S++ S TQ +EV P AEP ++++N D++
Sbjct: 391  PSLSNANQEQPVVSDGTNEEKKTSNANSENGSCTQ-EEVVPKAEPSEAQTNTDDD 444


>gb|EOY29480.1| TPX2 family protein, putative [Theobroma cacao]
          Length = 457

 Score =  304 bits (779), Expect = 8e-80
 Identities = 212/487 (43%), Positives = 275/487 (56%), Gaps = 27/487 (5%)
 Frame = -2

Query: 1717 MDADTTIVVSG-NGDFENGLHQQLPIS---TTINGTSNGSLE--VEGLGENLEDASNVND 1556
            MD+D  +   G      NG++ QL +S   + I+   NG++E   +   +N  D +    
Sbjct: 1    MDSDNLLSAGGLEIAHRNGVYPQLRVSGDDSEISDNVNGNVEKAAKSYVQNGMDDNGATG 60

Query: 1555 QKALGSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRA 1376
            +   GS   +        E++   + KE  +K++  +K  KP K  G+ KN KP  PK  
Sbjct: 61   EAREGSNDFV--------ENNGLIDSKEGELKDN--AKQSKPQKVQGKTKNEKPSGPKNV 110

Query: 1375 AATGLSKSKDGKEVMKSSVASNG-NIASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQ 1199
            ++T + KSKDGK        SNG ++A+ SR +Q      KS S NE+Q       AS Q
Sbjct: 111  SSTLVKKSKDGKSADVMLTTSNGGSVATNSRLKQP----LKSMSFNERQAN-----ASKQ 161

Query: 1198 NKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRL 1019
            +++    PDA  S       E   EK KLK LKKG                A DAK  R+
Sbjct: 162  SEK----PDAAFS-------EGTMEKPKLKPLKKGPVNKAEGDTESFPT--AADAKPRRV 208

Query: 1018 GTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLA 839
            GTLP YGFSFKC+ERAEKR+EFY+KL EKI A+E EK+NLQAKSKETQEAE+KMLRKSL 
Sbjct: 209  GTLPNYGFSFKCDERAEKRREFYTKLGEKIHAREVEKSNLQAKSKETQEAEIKMLRKSLN 268

Query: 838  FKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSLD 659
            FKATPMPSFYQEPPPPKVELKKIP TRAKSPKLGRKK S  ++S  N     +  RLSLD
Sbjct: 269  FKATPMPSFYQEPPPPKVELKKIPPTRAKSPKLGRKKGSTPSESDGNSNSGHQSGRLSLD 328

Query: 658  EKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNLS--FEKKKTS----SRKITTPKETG 497
            EK SQ+   +     H +KPQRKSLPKLP + T+LS    ++KTS      K+T  K T 
Sbjct: 329  EKASQSISGKVISPVHARKPQRKSLPKLPSQKTSLSSAANEEKTSKGSNQEKVTASKATT 388

Query: 496  ESEV--------QLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVD------EENALEA 359
            E ++        +   LS+ T++  S  Q QE   TA+  +S+  +D      EE  L+ 
Sbjct: 389  EGKIASSKATNEENTTLSDVTNEELSPVQQQEAVSTADSGESQLYMDQAPVIGEEGQLDL 448

Query: 358  QQEAIAV 338
             QE IA+
Sbjct: 449  VQEPIAL 455


>ref|XP_004291456.1| PREDICTED: uncharacterized protein LOC101310775 [Fragaria vesca
            subsp. vesca]
          Length = 470

 Score =  299 bits (766), Expect = 3e-78
 Identities = 205/448 (45%), Positives = 252/448 (56%), Gaps = 11/448 (2%)
 Frame = -2

Query: 1717 MDADTTIVVSG-NGDFENGLHQQLPIST-TINGTSNGSLEVEGLGEN--LEDASNVNDQK 1550
            MD+D +    G     ENG H QL +    ING+++ +   E    N  +E+  N +D  
Sbjct: 1    MDSDNSEAAYGLQVALENGDHGQLNVGPDAINGSASETALTESAALNGKMENVVNSDDGV 60

Query: 1549 ALGSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAA 1370
            +  S+    +E +    S+     KE G K S  SK  K  K  G++K  KP SPK A  
Sbjct: 61   SNNSSAGEVKEESRVNSSNGLKIAKERGPKVSVQSKQFKVQKGQGKSKIEKPPSPKIALP 120

Query: 1369 TGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQNKQ 1190
            T + KSKDG +   ++  SN   A  SR +Q +    KS+SSN  Q         VQ   
Sbjct: 121  TSMKKSKDGNDAEATATVSNDLAAPISRAKQPN----KSRSSNGPQ---------VQ--- 164

Query: 1189 QHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRLGTL 1010
               L D     +    +E L EKT LK L KG                 GD K  R+GTL
Sbjct: 165  ---LSDQQPKQSEAPSTEGLVEKTDLKPLIKGSYKADGDSQSSLSPTE-GD-KPPRVGTL 219

Query: 1009 PTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLAFKA 830
            P YGFSF+C+ERAEKR+EFY+KLEEKI AKE EKNNLQAKSKET EAE+KMLRK L FKA
Sbjct: 220  PNYGFSFRCDERAEKRREFYTKLEEKIHAKEMEKNNLQAKSKETLEAEIKMLRKKLTFKA 279

Query: 829  TPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKL 650
            TPMPSFYQEPPPPKVELKK+PTTRAKSPKLGRKKS P ADS+ N    ++  RLSL EK+
Sbjct: 280  TPMPSFYQEPPPPKVELKKLPTTRAKSPKLGRKKSLPAADSEGNSTTKSQSGRLSLGEKV 339

Query: 649  SQNNLARATRVDHVKKPQRKSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEVQLNNL 470
             QN+ A+       KKPQRKSLP+LP E T LS  K   + RK+T+     E    +N +
Sbjct: 340  PQNS-AKGPSPVLPKKPQRKSLPRLPSETTTLSGVK---NVRKVTSKAIKEEKNNLINAM 395

Query: 469  SEE-------TSKSASDTQAQEVAPTAE 407
            +EE        S +    Q QEV P AE
Sbjct: 396  NEEAATLPSAASGAGPHIQEQEVVPRAE 423


>ref|XP_004173325.1| PREDICTED: uncharacterized protein LOC101231649 [Cucumis sativus]
          Length = 509

 Score =  296 bits (759), Expect = 2e-77
 Identities = 196/472 (41%), Positives = 253/472 (53%), Gaps = 25/472 (5%)
 Frame = -2

Query: 1708 DTTIVVSGNG---DFENGLHQQLP----------ISTTINGTSNGSLEVEGLGENLEDAS 1568
            ++ I+V  +G     +NG H+ +           +S  I+  +   ++ E + +++ D S
Sbjct: 3    ESEILVPADGLKLTLQNGFHEHVSAAEEIVPKVIVSEDIDKDTGSPMQQENIEDDINDGS 62

Query: 1567 NVNDQKALGSTQDITEEPALPPESHSTSNPKELGVKESGDS-KNMKPLKTMGRAKNGKPL 1391
              N+     +T+++TE    P ES  ++   E G ++SGD  K +KP K   ++KN K  
Sbjct: 63   ATNES----TTRELTEGSNFPEESDISTLSME-GEEKSGDPPKKVKPEKGQIKSKNEKSS 117

Query: 1390 SPKRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGA 1211
            S K+ ++TG+ K+KDGKE  +  +  +G  AS   P+Q S    KS+S NE+Q       
Sbjct: 118  SLKQISSTGVKKNKDGKEA-EHLLNGSGTGASHPHPKQPS----KSRSFNERQAQ----- 167

Query: 1210 ASVQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAK 1031
                      +P  T  S      E   E T LK LKKG                AGD K
Sbjct: 168  ----------VPKQTEKSDG--DGEGSKENTNLKPLKKGQPSKSEGESESSLSPRAGDEK 215

Query: 1030 ALRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLR 851
              R+G LP YGFSF+CNERAEKRKEFYSKLEEKIQAKE EKN LQAKSKETQEAE+KMLR
Sbjct: 216  PNRVGRLPNYGFSFRCNERAEKRKEFYSKLEEKIQAKEVEKNTLQAKSKETQEAEIKMLR 275

Query: 850  KSLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPAR 671
            KSL FKATPMPSFYQEPPPPKVELKKIP TRAKSPKLGRKKSS  ADS  N     R AR
Sbjct: 276  KSLNFKATPMPSFYQEPPPPKVELKKIPPTRAKSPKLGRKKSSTLADSSSNDGGDVRSAR 335

Query: 670  LSLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNL-----------SFEKKKTSSR 524
            LSLDE ++ NN ++        KP+R+SLP LP E   +           S  K K   +
Sbjct: 336  LSLDENVALNNNSKGVYPVRSDKPKRRSLPNLPSEKIVIHGVVANAGGKSSATKVKIVEK 395

Query: 523  KITTPKETGESEVQLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENA 368
            +   P     +    N   EE   S+    A     T +     S  +E+ A
Sbjct: 396  EKEKPAAASATSTTTNGKKEEKRTSSEAAAAAATTSTKKSASLRSTNEEKTA 447


>ref|XP_004136350.1| PREDICTED: uncharacterized protein LOC101207396 [Cucumis sativus]
          Length = 509

 Score =  295 bits (755), Expect = 5e-77
 Identities = 195/472 (41%), Positives = 253/472 (53%), Gaps = 25/472 (5%)
 Frame = -2

Query: 1708 DTTIVVSGNG---DFENGLHQ----------QLPISTTINGTSNGSLEVEGLGENLEDAS 1568
            ++ I+V  +G     +NG H+          ++ +S  I+  +   ++ E + +++ D S
Sbjct: 3    ESEILVPADGLKLTLQNGFHEHVSAAEEIVPKVTVSEDIDKDTGSPMQQENIEDDINDGS 62

Query: 1567 NVNDQKALGSTQDITEEPALPPESHSTSNPKELGVKESGDS-KNMKPLKTMGRAKNGKPL 1391
              N+     +T+++TE    P ES  ++   E G ++ GD  K +KP K   ++KN K  
Sbjct: 63   ATNES----TTRELTEGSNFPEESDISTLSME-GEEKCGDPPKKVKPEKGQIKSKNEKSS 117

Query: 1390 SPKRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGA 1211
            S K+ ++TG+ K+KDGKE  +  +  +G  AS   P+Q S    KS+S NE+Q       
Sbjct: 118  SLKQISSTGVKKNKDGKEA-EHLLNGSGTGASHPHPKQPS----KSRSFNERQAQ----- 167

Query: 1210 ASVQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAK 1031
                      +P  T  S      E   E T LK LKKG                AGD K
Sbjct: 168  ----------VPKQTEKSDG--DGEGSKENTNLKPLKKGQPSKSEGESESSLSPRAGDEK 215

Query: 1030 ALRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLR 851
              R+G LP YGFSF+CNERAEKRKEFYSKLEEKIQAKE EKN LQAKSKETQEAE+KMLR
Sbjct: 216  PNRVGRLPNYGFSFRCNERAEKRKEFYSKLEEKIQAKEVEKNTLQAKSKETQEAEIKMLR 275

Query: 850  KSLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPAR 671
            KSL FKATPMPSFYQEPPPPKVELKKIP TRAKSPKLGRKKSS  ADS  N     R AR
Sbjct: 276  KSLNFKATPMPSFYQEPPPPKVELKKIPPTRAKSPKLGRKKSSTLADSSSNDGGDVRSAR 335

Query: 670  LSLDEKLSQNNLARATRVDHVKKPQRKSLPKLPLEDTNL-----------SFEKKKTSSR 524
            LSLDE ++ NN ++        KP+R+SLP LP E   +           S  K K   +
Sbjct: 336  LSLDENVALNNNSKGVYPVRSDKPKRRSLPNLPSEKIVIPGVVANAGGKSSATKVKIVEK 395

Query: 523  KITTPKETGESEVQLNNLSEETSKSASDTQAQEVAPTAEPIKSESNVDEENA 368
            +   P     +    N   EE   S+    A     T +     S  +E+ A
Sbjct: 396  EKEKPAAASATSTTTNGKKEEKRTSSEAAAAAATTSTKKSASLRSTNEEKTA 447


>ref|XP_002324860.2| hypothetical protein POPTR_0018s01730g [Populus trichocarpa]
            gi|550317812|gb|EEF03425.2| hypothetical protein
            POPTR_0018s01730g [Populus trichocarpa]
          Length = 422

 Score =  292 bits (747), Expect = 4e-76
 Identities = 195/440 (44%), Positives = 243/440 (55%), Gaps = 5/440 (1%)
 Frame = -2

Query: 1672 ENGLHQQLPISTTINGTSNGSLEVEGLGENLEDASNVNDQKALGSTQDITEEPALPPESH 1493
            +NG+H+Q   +      SN      G    ++D +N N      ST+++  E        
Sbjct: 18   QNGVHEQSAAAREDGVVSNNLSGSMGNTFEVDDCTNDNL-----STREVEGE-------- 64

Query: 1492 STSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAATGLSKSKDGKEVMKSSVAS 1313
                  E  VK++ +S+  +  K  G+  N KP +PK  +AT + K KDG++ +  +  S
Sbjct: 65   --LKEGEAKVKDADNSEKARSQKGSGKGGNAKPSNPKNVSATQV-KGKDGRDAVARTAVS 121

Query: 1312 NGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQNKQQHGLPDATSSSTSGEQSED 1133
            NG++A  S+ +Q      KS S NE+QG         Q  +Q G  DA  S+   E    
Sbjct: 122  NGSVAVNSQLKQP----LKSNSFNERQG---------QASKQSGKSDAVLSAGLVE---- 164

Query: 1132 LPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRLGTLPTYGFSFKCNERAEKRKEF 953
                 K K LKKG               TA DAK+ + GTLP YGFSFKC+ERAEKRKEF
Sbjct: 165  -----KAKPLKKGPVVKAEGETESTSSPTAEDAKSRKFGTLPNYGFSFKCDERAEKRKEF 219

Query: 952  YSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLAFKATPMPSFYQEPPPPKVELKK 773
            Y+KLEEKI AKE EK+ LQAKSKETQEAE+K+ RKSLAFKATPMPSFYQEP P KVELKK
Sbjct: 220  YTKLEEKIHAKEVEKSTLQAKSKETQEAEIKLFRKSLAFKATPMPSFYQEPAPLKVELKK 279

Query: 772  IPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKLSQNNLARATRVDHVKKPQR 593
            IPTTRAKSPKLGRKKS   ADS+ N + + R  RLSLDEK+S     R     H KKPQR
Sbjct: 280  IPTTRAKSPKLGRKKSPSPADSEGNNSQSNRSGRLSLDEKISSKIPIRGLSPAHPKKPQR 339

Query: 592  KSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEVQLNNLSEETSKSASDTQAQEVAPT 413
            KSLPKLP E  NL    +K    K +  + T         LS++T++  S  Q QE    
Sbjct: 340  KSLPKLPSEKINLYANDEKGKLPKASNEENT--------TLSDQTNEGVSANQEQEAVSK 391

Query: 412  AE-----PIKSESNVDEENA 368
             E     P K E  V EE A
Sbjct: 392  NEASEFLPPKEEVVVQEEAA 411


>gb|ABK95344.1| unknown [Populus trichocarpa]
          Length = 422

 Score =  290 bits (743), Expect = 1e-75
 Identities = 194/440 (44%), Positives = 243/440 (55%), Gaps = 5/440 (1%)
 Frame = -2

Query: 1672 ENGLHQQLPISTTINGTSNGSLEVEGLGENLEDASNVNDQKALGSTQDITEEPALPPESH 1493
            +NG+H+Q   +      SN      G    ++D +N N      ST+++  E        
Sbjct: 18   QNGVHEQSAAAREDGVVSNNLSGSMGNTFEVDDCTNDNL-----STREVEGE-------- 64

Query: 1492 STSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAATGLSKSKDGKEVMKSSVAS 1313
                  E  VK++ +S+  +  K  G+  N KP +PK  +AT + K KDG++ +  +  S
Sbjct: 65   --LKEGEAKVKDADNSEKARSQKGSGKGGNAKPSNPKNVSATQV-KGKDGRDAVARTAVS 121

Query: 1312 NGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQNKQQHGLPDATSSSTSGEQSED 1133
            NG++A  S+ +Q      KS S NE+QG         Q  +Q G  DA  S+   E    
Sbjct: 122  NGSVAVNSQLKQP----LKSNSFNERQG---------QASKQSGKSDAVLSAGLVE---- 164

Query: 1132 LPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRLGTLPTYGFSFKCNERAEKRKEF 953
                 K K LKKG               TA DAK+ + GTLP YGFSFKC+ERAEKRKEF
Sbjct: 165  -----KAKPLKKGPVVKAEGETESTSSPTAEDAKSRKFGTLPNYGFSFKCDERAEKRKEF 219

Query: 952  YSKLEEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLAFKATPMPSFYQEPPPPKVELKK 773
            Y+KLEEKI AKE EK+ LQAKSKETQEAE+K+ RKSLAFKATPMPSFYQEP P KVELKK
Sbjct: 220  YTKLEEKIHAKEVEKSTLQAKSKETQEAEIKLFRKSLAFKATPMPSFYQEPAPLKVELKK 279

Query: 772  IPTTRAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKLSQNNLARATRVDHVKKPQR 593
            IPTTRAKSPKLGRKKS   ADS+ N + + R  RLSLDEK+S     R     H KKPQR
Sbjct: 280  IPTTRAKSPKLGRKKSPSPADSEGNNSQSNRSGRLSLDEKISSKIPIRGLSPAHPKKPQR 339

Query: 592  KSLPKLPLEDTNLSFEKKKTSSRKITTPKETGESEVQLNNLSEETSKSASDTQAQEVAPT 413
            KSLP+LP E  NL    +K    K +  + T         LS++T++  S  Q QE    
Sbjct: 340  KSLPELPSEKINLYANDEKGKLPKASNEENT--------TLSDQTNEGVSANQEQEAVSK 391

Query: 412  AE-----PIKSESNVDEENA 368
             E     P K E  V EE A
Sbjct: 392  NEASEFLPPKEEVVVQEEAA 411


>emb|CAN82789.1| hypothetical protein VITISV_030600 [Vitis vinifera]
          Length = 440

 Score =  280 bits (715), Expect = 2e-72
 Identities = 195/497 (39%), Positives = 265/497 (53%), Gaps = 41/497 (8%)
 Frame = -2

Query: 1717 MDADTTIVVSGNGD-FENGLHQQLP-------ISTTINGTSNGSLEVEGLGENLEDASNV 1562
            MD D  + V+G  +  +NG+H+QL        I   +NG  + S E  G+  N E+    
Sbjct: 1    MDVDDLLPVNGLEEGHQNGIHEQLSAAGGEGVIPEKVNGNLDLSTESAGMNGNAENVGMW 60

Query: 1561 NDQKALG-STQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKPLSP 1385
            +D   +  ST ++ E   +    +  +  ++L V+++  SK+ KP K  G++   K  SP
Sbjct: 61   DDNGIINASTAEVGEGSHIRARVNGLTISEDLEVEDADPSKHSKPQKGQGKSSKEKLSSP 120

Query: 1384 KRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSGAAS 1205
            K A  T + K KDGK+ + +S ++NG++AS SRP+QT     KS+S ++KQ        S
Sbjct: 121  KHAGTTWVKK-KDGKDEIVTSASTNGSLASISRPKQT----LKSRSFSDKQDH-----LS 170

Query: 1204 VQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDAKAL 1025
             Q+K      +A SS+++  Q E     T+                         D K  
Sbjct: 171  KQSKNS----EAASSTSNMIQPEGRASPTE-------------------------DTKPR 201

Query: 1024 RLGTLPTYGFSFKCNERAEKRKE-----------------FYSKLEEKIQAKEAEKNNLQ 896
            R+  LP+Y FSF+C+ERAEKR+E                 FY+KLEEK  AKE E+ NLQ
Sbjct: 202  RVAALPSYNFSFRCDERAEKRREQHFCFSTEDNVYHFVGQFYTKLEEKTHAKEIERTNLQ 261

Query: 895  AKSKETQEAELKMLRKSLAFKATPMPSFYQEPPPPKVELK---------------KIPTT 761
            AKSKETQEAE+KMLRKSL FKATPMPSFYQEPPPPKVELK               KIP T
Sbjct: 262  AKSKETQEAEIKMLRKSLTFKATPMPSFYQEPPPPKVELKKLCHVFGNENGNLMQKIPPT 321

Query: 760  RAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKLSQNNLARATRVDHVKKPQRKSLP 581
            RAKSPKLGRKKSSP  +S+  G+ + R  RLSLDEK+SQNN A+     H KKP RKSLP
Sbjct: 322  RAKSPKLGRKKSSPAPESE--GSSSHRSGRLSLDEKVSQNNPAKGISPGHPKKPLRKSLP 379

Query: 580  KLPLEDTNLSFEKKKTSSRKITTPKETGESEVQLNNLSEETSKSASDTQAQEVAPTAEPI 401
            KLP E T                            NLS+ T+++A  +Q QE     +P 
Sbjct: 380  KLPSERT----------------------------NLSKSTNEAAFLSQQQEPVQVPDPS 411

Query: 400  KSESNVDEENALEAQQE 350
            KS+ + D+++ +E Q +
Sbjct: 412  KSQPDADDKSEVEEQAQ 428


>ref|XP_003549281.1| PREDICTED: protein gar2-like isoform X1 [Glycine max]
            gi|571538444|ref|XP_006601153.1| PREDICTED: protein
            gar2-like isoform X2 [Glycine max]
          Length = 481

 Score =  276 bits (707), Expect = 2e-71
 Identities = 182/395 (46%), Positives = 234/395 (59%), Gaps = 22/395 (5%)
 Frame = -2

Query: 1480 PKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAATGLSKSKDGKEVMKSSVASNGNI 1301
            PKE  VK S  +K  +  K + + KN K  S     A+ ++KSK GK+   SS  SNG  
Sbjct: 88   PKEEEVKISDQTKQSRAPKGLVKNKNAKAPSSSGVHASLVNKSKIGKDKEASSSVSNGTS 147

Query: 1300 ASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQNKQQHGLPDATSSSTSGEQSEDLPEK 1121
            A +SRPRQ++     S+S N++Q         +   +     DATSS  S        EK
Sbjct: 148  ALDSRPRQSTK---SSRSFNDRQ-------TQLSKPKHPSKSDATSSEVS-------VEK 190

Query: 1120 TKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRLGTLPTYGFSFKCNERAEKRKEFYSKL 941
            TK K+ +K                   DAK  R+GTLP YGFSFKC ERAE+R+EFY+KL
Sbjct: 191  TKPKSSRKEPIDKVQGEAESSLSSNTEDAKPQRVGTLPNYGFSFKCGERAERRREFYNKL 250

Query: 940  EEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLAFKATPMPSFYQEPPPPKVELKKIPTT 761
            EE+IQAKE EK+NLQAKSKETQEAE+KMLRKSL FKATPMPSFYQEP P K ELKKIPTT
Sbjct: 251  EERIQAKEVEKSNLQAKSKETQEAEIKMLRKSLNFKATPMPSFYQEPAPAKAELKKIPTT 310

Query: 760  RAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKLSQNNLARA-TRVDHVKKPQRKSL 584
            RAKSPKLGRKKSS  ++S  N + ++R ARLSLDEK+S++NL +  T   H KKPQR+SL
Sbjct: 311  RAKSPKLGRKKSSANSESDGNNSSSSRLARLSLDEKVSESNLTKGPTPPVHQKKPQRRSL 370

Query: 583  P-KLPLEDTNLSFEK-KKTSSRKITTPKE---------------TGESEVQLNNLSEETS 455
            P +L  E  ++S  +   TSS+ I   K                TGE + +    +EE S
Sbjct: 371  PARLAPERNSVSNSRTAPTSSKAIKDEKSSLSSAAKKHTNLSNATGEEKAKTIAANEEKS 430

Query: 454  KSASDTQ----AQEVAPTAEPIKSESNVDEENALE 362
              +S+T        V P+ +P +  S+V+ + A+E
Sbjct: 431  TLSSETSDAVLLNVVLPSDKPSEEVSHVNGDIAVE 465


>ref|XP_006601154.1| PREDICTED: protein gar2-like isoform X3 [Glycine max]
          Length = 480

 Score =  273 bits (697), Expect = 3e-70
 Identities = 182/395 (46%), Positives = 234/395 (59%), Gaps = 22/395 (5%)
 Frame = -2

Query: 1480 PKELGVKESGDSKNMKPLKTMGRAKNGKPLSPKRAAATGLSKSKDGKEVMKSSVASNGNI 1301
            PKE  VK S  +K  +  K + + KN K  S     A+ ++KSK GK+   SS  SNG  
Sbjct: 88   PKEEEVKISDQTKQSRAPKGLVKNKNAKAPSSSGVHASLVNKSKIGKDKEASSSVSNGTS 147

Query: 1300 ASESRPRQTSAFRAKSKSSNEKQGADNSGAASVQNKQQHGLPDATSSSTSGEQSEDLPEK 1121
            A +SRPRQ++     S+S N++Q         +   +     DATSS  S        EK
Sbjct: 148  ALDSRPRQSTK---SSRSFNDRQ-------TQLSKPKHPSKSDATSSEVS-------VEK 190

Query: 1120 TKLKALKKGXXXXXXXXXXXXXXXTAGDAKALRLGTLPTYGFSFKCNERAEKRKEFYSKL 941
            TK K+ +K                   DAK  R+GTLP YGFSFKC ERAE+R+EFY+KL
Sbjct: 191  TKPKSSRK-EPIDKVQGEAESSFSNTEDAKPQRVGTLPNYGFSFKCGERAERRREFYNKL 249

Query: 940  EEKIQAKEAEKNNLQAKSKETQEAELKMLRKSLAFKATPMPSFYQEPPPPKVELKKIPTT 761
            EE+IQAKE EK+NLQAKSKETQEAE+KMLRKSL FKATPMPSFYQEP P K ELKKIPTT
Sbjct: 250  EERIQAKEVEKSNLQAKSKETQEAEIKMLRKSLNFKATPMPSFYQEPAPAKAELKKIPTT 309

Query: 760  RAKSPKLGRKKSSPTADSKENGAFTARPARLSLDEKLSQNNLARA-TRVDHVKKPQRKSL 584
            RAKSPKLGRKKSS  ++S  N + ++R ARLSLDEK+S++NL +  T   H KKPQR+SL
Sbjct: 310  RAKSPKLGRKKSSANSESDGNNSSSSRLARLSLDEKVSESNLTKGPTPPVHQKKPQRRSL 369

Query: 583  P-KLPLEDTNLSFEK-KKTSSRKITTPKE---------------TGESEVQLNNLSEETS 455
            P +L  E  ++S  +   TSS+ I   K                TGE + +    +EE S
Sbjct: 370  PARLAPERNSVSNSRTAPTSSKAIKDEKSSLSSAAKKHTNLSNATGEEKAKTIAANEEKS 429

Query: 454  KSASDTQ----AQEVAPTAEPIKSESNVDEENALE 362
              +S+T        V P+ +P +  S+V+ + A+E
Sbjct: 430  TLSSETSDAVLLNVVLPSDKPSEEVSHVNGDIAVE 464


>ref|XP_003588767.1| Seed specific protein Bn15D14A [Medicago truncatula]
            gi|355477815|gb|AES59018.1| Seed specific protein
            Bn15D14A [Medicago truncatula]
          Length = 458

 Score =  272 bits (696), Expect = 3e-70
 Identities = 193/480 (40%), Positives = 262/480 (54%), Gaps = 11/480 (2%)
 Frame = -2

Query: 1750 PSNALKDRMSVMDADTTIVVSGNGDFENGLHQQL-PISTTINGTSNGSLEVEGLGENLED 1574
            PSN+ +D +S  D D  + V+      +G  + +  + +T  G S    E+EG  +N+ D
Sbjct: 24   PSNSGEDAVS-NDLDPHVTVNTETFVPDGNSENINQLESTATGNS-AMKEIEGSNDNV-D 80

Query: 1573 ASNVNDQKALGSTQDITEEPALPPESHSTSNPKELGVKESGDSKNMKPLKTMGRAKNGKP 1394
             SN+                       + S  KE+ +K S +    +  K   + KN K 
Sbjct: 81   GSNL-----------------------TVSKEKEVKIKVSTEQSRAQ--KGPVKNKNAKV 115

Query: 1393 LSPKRAAATGLSKSKDGKEVMKSSVASNGNIASESRPRQTSAFRAKSKSSNEKQGADNSG 1214
             S     A+ +  SK GK+   S   SNG  A +SRPRQ      K++SSN++Q      
Sbjct: 116  GSSSGVNASLVKNSKIGKDKQASPAVSNGTSALDSRPRQP----IKNRSSNDRQS----- 166

Query: 1213 AASVQNKQQHGLPDATSSSTSGEQSEDLPEKTKLKALKKGXXXXXXXXXXXXXXXTAGDA 1034
                Q  +Q    +A SS  +        EK K K+LKKG                  D 
Sbjct: 167  ----QLSKQPSKSEAASSDVA-------VEKKKPKSLKKGPLDKVQGEGESSLTNRE-DT 214

Query: 1033 KALRLGTLPTYGFSFKCNERAEKRKEFYSKLEEKIQAKEAEKNNLQAKSKETQEAELKML 854
            K  R+GTLP YGFSF+C ERAEKR+EF +K+EEKIQAKE EK++LQAKSKE+QEAE+K L
Sbjct: 215  KPRRVGTLPNYGFSFRCGERAEKRREFLTKVEEKIQAKEEEKSSLQAKSKESQEAEIKKL 274

Query: 853  RKSLAFKATPMPSFYQEPPPPKVELKKIPTTRAKSPKLGRKKSSPTADSKENGAFTARPA 674
            RKSL FKATP+P+FYQEP PPKVELKKIPTTRAKSPKLGRKK+S  ++S  NG+ ++R  
Sbjct: 275  RKSLTFKATPLPTFYQEPAPPKVELKKIPTTRAKSPKLGRKKTSTNSESDGNGSCSSRQG 334

Query: 673  RLSLDEKLSQNNLARATRVDHVKKPQRKSLP-KLPLEDTNLSFEKKKTSSRKITT-PKET 500
            RLSL+EK+SQ+N      + H KKP RKSLP +L  E TN +      +++K T+  K T
Sbjct: 335  RLSLNEKVSQSNSPTGVTLAHQKKPLRKSLPTRLASERTNSAAAPTSKATKKDTSLSKGT 394

Query: 499  GESEVQLNNLSEETSKSASDTQA---QEVAPTAEP-----IKSESNVDEENALEAQQEAI 344
            GE + ++   +EE S  +SDT     Q   P+ +P     +  +  V+E   L   QE I
Sbjct: 395  GEEKTEIVTANEENSTLSSDTNVALPQNAVPSDKPSEEFHVNGDIVVEENPQLVLSQEPI 454


Top