BLASTX nr result

ID: Akebia24_contig00020104 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00020104
         (769 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006485885.1| PREDICTED: uncharacterized protein LOC102608...   150   6e-34
ref|XP_006485884.1| PREDICTED: uncharacterized protein LOC102608...   150   6e-34
ref|XP_006436269.1| hypothetical protein CICLE_v10030482mg [Citr...   150   6e-34
ref|XP_002316354.2| hypothetical protein POPTR_0010s22670g [Popu...   149   1e-33
ref|XP_002311103.2| myb family transcription factor family prote...   148   2e-33
ref|XP_002274774.2| PREDICTED: uncharacterized protein LOC100240...   140   7e-31
emb|CAN62996.1| hypothetical protein VITISV_026902 [Vitis vinifera]   140   7e-31
ref|XP_007220311.1| hypothetical protein PRUPE_ppa000126mg [Prun...   139   1e-30
ref|XP_002534495.1| conserved hypothetical protein [Ricinus comm...   124   3e-26
gb|EXB80104.1| Nuclear receptor corepressor 1 [Morus notabilis]       121   3e-25
ref|XP_004307402.1| PREDICTED: uncharacterized protein LOC101302...   120   5e-25
ref|XP_007009786.1| Duplicated homeodomain-like superfamily prot...   107   6e-21
ref|XP_004237681.1| PREDICTED: uncharacterized protein LOC101263...   102   2e-19
ref|XP_006340031.1| PREDICTED: uncharacterized protein LOC102602...   101   3e-19
ref|XP_006589438.1| PREDICTED: uncharacterized protein LOC100806...   100   7e-19
ref|XP_006589437.1| PREDICTED: uncharacterized protein LOC100806...   100   7e-19
ref|XP_006589436.1| PREDICTED: uncharacterized protein LOC100806...   100   7e-19
ref|XP_006589435.1| PREDICTED: uncharacterized protein LOC100806...   100   7e-19
ref|XP_006589434.1| PREDICTED: uncharacterized protein LOC100806...   100   7e-19
ref|XP_006606235.1| PREDICTED: uncharacterized protein LOC100810...    98   4e-18

>ref|XP_006485885.1| PREDICTED: uncharacterized protein LOC102608361 isoform X4 [Citrus
            sinensis]
          Length = 1730

 Score =  150 bits (378), Expect = 6e-34
 Identities = 109/281 (38%), Positives = 144/281 (51%), Gaps = 30/281 (10%)
 Frame = +1

Query: 1    YKQYLLHCS--NQAESSQILRGYPLHELNKKEMNGH------ADLIGCEKHGTLPPNQFL 156
            Y+Q+L   S  N  ES QIL GYPL    KKEMNG       +++    K        +L
Sbjct: 1361 YRQHLSVHSIVNHIESPQILNGYPLPISTKKEMNGDINCRQLSEVQSISKSDRNIDEPYL 1420

Query: 157  LPDFYQEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFG 336
              D Y    N S P H+  + LP L+   EQ+S    ++HS   SDTE     GD KLFG
Sbjct: 1421 AQDCYLRKCNSSMP-HSSVTELPFLAENIEQTS-DRRRAHSCSFSDTEKPSKNGDVKLFG 1478

Query: 337  KKIISQPL--QKSITTTQETNDTVXXXXXXXXXXXFKLT--------------NDVNYSS 468
            K I+S P   QKS  ++ +  +              K T              +  NY  
Sbjct: 1479 K-ILSHPSSSQKSAFSSHDNGENGHHHKQSSKASNLKFTAHHPPDGGAALLKFDRNNYVG 1537

Query: 469  LKELPTRSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSLGL------PAVIKR 630
            L+  P RSYGFWDG++IQTGFSSLPDSAIL++KYP  FG  PASS  +       AV+K 
Sbjct: 1538 LENGPARSYGFWDGSKIQTGFSSLPDSAILLAKYPAAFGGYPASSSKMEQQSLQAAVVKS 1597

Query: 631  NDRNMGCVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            N+R++  V+V P  ++S +  + DYQVY RS +   VQP +
Sbjct: 1598 NERHLNGVAVVPPREISSSNGVVDYQVY-RSREGNKVQPFS 1637


>ref|XP_006485884.1| PREDICTED: uncharacterized protein LOC102608361 isoform X3 [Citrus
            sinensis]
          Length = 1763

 Score =  150 bits (378), Expect = 6e-34
 Identities = 109/281 (38%), Positives = 144/281 (51%), Gaps = 30/281 (10%)
 Frame = +1

Query: 1    YKQYLLHCS--NQAESSQILRGYPLHELNKKEMNGH------ADLIGCEKHGTLPPNQFL 156
            Y+Q+L   S  N  ES QIL GYPL    KKEMNG       +++    K        +L
Sbjct: 1394 YRQHLSVHSIVNHIESPQILNGYPLPISTKKEMNGDINCRQLSEVQSISKSDRNIDEPYL 1453

Query: 157  LPDFYQEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFG 336
              D Y    N S P H+  + LP L+   EQ+S    ++HS   SDTE     GD KLFG
Sbjct: 1454 AQDCYLRKCNSSMP-HSSVTELPFLAENIEQTS-DRRRAHSCSFSDTEKPSKNGDVKLFG 1511

Query: 337  KKIISQPL--QKSITTTQETNDTVXXXXXXXXXXXFKLT--------------NDVNYSS 468
            K I+S P   QKS  ++ +  +              K T              +  NY  
Sbjct: 1512 K-ILSHPSSSQKSAFSSHDNGENGHHHKQSSKASNLKFTAHHPPDGGAALLKFDRNNYVG 1570

Query: 469  LKELPTRSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSLGL------PAVIKR 630
            L+  P RSYGFWDG++IQTGFSSLPDSAIL++KYP  FG  PASS  +       AV+K 
Sbjct: 1571 LENGPARSYGFWDGSKIQTGFSSLPDSAILLAKYPAAFGGYPASSSKMEQQSLQAAVVKS 1630

Query: 631  NDRNMGCVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            N+R++  V+V P  ++S +  + DYQVY RS +   VQP +
Sbjct: 1631 NERHLNGVAVVPPREISSSNGVVDYQVY-RSREGNKVQPFS 1670


>ref|XP_006436269.1| hypothetical protein CICLE_v10030482mg [Citrus clementina]
            gi|567887496|ref|XP_006436270.1| hypothetical protein
            CICLE_v10030482mg [Citrus clementina]
            gi|568865020|ref|XP_006485882.1| PREDICTED:
            uncharacterized protein LOC102608361 isoform X1 [Citrus
            sinensis] gi|568865022|ref|XP_006485883.1| PREDICTED:
            uncharacterized protein LOC102608361 isoform X2 [Citrus
            sinensis] gi|557538465|gb|ESR49509.1| hypothetical
            protein CICLE_v10030482mg [Citrus clementina]
            gi|557538466|gb|ESR49510.1| hypothetical protein
            CICLE_v10030482mg [Citrus clementina]
          Length = 1764

 Score =  150 bits (378), Expect = 6e-34
 Identities = 109/281 (38%), Positives = 144/281 (51%), Gaps = 30/281 (10%)
 Frame = +1

Query: 1    YKQYLLHCS--NQAESSQILRGYPLHELNKKEMNGH------ADLIGCEKHGTLPPNQFL 156
            Y+Q+L   S  N  ES QIL GYPL    KKEMNG       +++    K        +L
Sbjct: 1395 YRQHLSVHSIVNHIESPQILNGYPLPISTKKEMNGDINCRQLSEVQSISKSDRNIDEPYL 1454

Query: 157  LPDFYQEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFG 336
              D Y    N S P H+  + LP L+   EQ+S    ++HS   SDTE     GD KLFG
Sbjct: 1455 AQDCYLRKCNSSMP-HSSVTELPFLAENIEQTS-DRRRAHSCSFSDTEKPSKNGDVKLFG 1512

Query: 337  KKIISQPL--QKSITTTQETNDTVXXXXXXXXXXXFKLT--------------NDVNYSS 468
            K I+S P   QKS  ++ +  +              K T              +  NY  
Sbjct: 1513 K-ILSHPSSSQKSAFSSHDNGENGHHHKQSSKASNLKFTAHHPPDGGAALLKFDRNNYVG 1571

Query: 469  LKELPTRSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSLGL------PAVIKR 630
            L+  P RSYGFWDG++IQTGFSSLPDSAIL++KYP  FG  PASS  +       AV+K 
Sbjct: 1572 LENGPARSYGFWDGSKIQTGFSSLPDSAILLAKYPAAFGGYPASSSKMEQQSLQAAVVKS 1631

Query: 631  NDRNMGCVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            N+R++  V+V P  ++S +  + DYQVY RS +   VQP +
Sbjct: 1632 NERHLNGVAVVPPREISSSNGVVDYQVY-RSREGNKVQPFS 1671


>ref|XP_002316354.2| hypothetical protein POPTR_0010s22670g [Populus trichocarpa]
            gi|550330381|gb|EEF02525.2| hypothetical protein
            POPTR_0010s22670g [Populus trichocarpa]
          Length = 1721

 Score =  149 bits (375), Expect = 1e-33
 Identities = 106/271 (39%), Positives = 138/271 (50%), Gaps = 29/271 (10%)
 Frame = +1

Query: 28   NQAESSQILRGYPLHELNKKEMNG---------HADLIGCEKHGTLPPNQFLLPDFYQEI 180
            N  ESSQI RGY L    KKEMNG            L   EK+ T   +Q    + Y + 
Sbjct: 1375 NHNESSQIPRGYSLQIPTKKEMNGVISGRLLSGAQSLPNSEKNVT---SQSEAQECYLQK 1431

Query: 181  YNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIISQPL 360
             +  K  H+V   LP +S    + S  H + HS+ SSD E  C  GD KLFGK I+S PL
Sbjct: 1432 CSSLKAQHSVPE-LPFISQRRGRGS-DHLRDHSRRSSDVEKPCRNGDVKLFGK-ILSNPL 1488

Query: 361  QKSITTTQETNDT-VXXXXXXXXXXXFKLT--------------NDVNYSSLKELPTRSY 495
            QK  ++ +E  +              FK T              +  N   L+ +P RSY
Sbjct: 1489 QKQNSSARENGEKEAQHLKPTSKSSTFKFTGHHPTEGNMTLSKCDPNNQPGLENVPMRSY 1548

Query: 496  GFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSLGLP-----AVIKRNDRNMGCVSV 660
            GFWDGNRIQTGF S+PDSA L+ KYP  F N   SS  +P     A +K N+ N+  +SV
Sbjct: 1549 GFWDGNRIQTGFPSMPDSATLLVKYPAAFSNYHVSSSKMPQQTLQAAVKSNECNLNGISV 1608

Query: 661  FPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            FPS +++G+  + DYQ+Y RS+D T V   T
Sbjct: 1609 FPSREITGSNGVVDYQMY-RSHDSTGVPSFT 1638


>ref|XP_002311103.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550332397|gb|EEE88470.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 1716

 Score =  148 bits (373), Expect = 2e-33
 Identities = 116/278 (41%), Positives = 142/278 (51%), Gaps = 31/278 (11%)
 Frame = +1

Query: 28   NQAESSQILRGYPLHELNKKEMNGH---------ADLIGCEKHGTLPPN---QFLLPDFY 171
            +Q +SSQILRGYPL    KKEMNG                EK+ T   N   QF   D Y
Sbjct: 1368 SQNDSSQILRGYPLQIPTKKEMNGDNYARPLSEARSFPNSEKNVTSEKNVTSQFEAEDCY 1427

Query: 172  QEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIIS 351
             +  +GSK  H+V S LP LS   E  S    + HS+ SSD E  C  GD KLFGK I+S
Sbjct: 1428 LQKCSGSKSQHSV-SELPFLSQRFEHGS-DCPRDHSRRSSDMEKPCRNGDVKLFGK-ILS 1484

Query: 352  QPLQKSITTTQETNDT-VXXXXXXXXXXXFKLTN----DVNYSSLK---------ELPTR 489
             PLQK  +   E  +              FKLT     + N + LK         E    
Sbjct: 1485 NPLQKQNSIAHENGEKEAPHLKPAGKSATFKLTGHHPTEGNMAFLKCDRNNQLGPENFPL 1544

Query: 490  SYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSLGLP-----AVIKRNDRNMGCV 654
            S+GFWD NR QTG   LPDSA L++KYP  F N P  S  +P     +V+K N+ N   +
Sbjct: 1545 SHGFWDENRTQTG---LPDSAALLAKYPAAFSNYPVPSSKMPQQTLQSVVKSNECNQSGL 1601

Query: 655  SVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            SVFPS D+SG   + DYQ+Y RS+D T VQP  A D+K
Sbjct: 1602 SVFPSRDVSGTNGVVDYQLY-RSHDSTGVQPF-AVDMK 1637


>ref|XP_002274774.2| PREDICTED: uncharacterized protein LOC100240985 [Vitis vinifera]
          Length = 1940

 Score =  140 bits (352), Expect = 7e-31
 Identities = 102/281 (36%), Positives = 142/281 (50%), Gaps = 34/281 (12%)
 Frame = +1

Query: 13   LLHCSNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGT-----------LPPNQFLL 159
            LL+ +  AE SQ + G PL    K++MN     + C+   +           +  +  L 
Sbjct: 1501 LLNNAVNAELSQKVGGCPLQTPPKEDMNRD---LSCKNPSSAAERLSKLDRDIQSSHSLA 1557

Query: 160  PDFYQEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGK 339
             D Y +  NGSK  H++ + LP LS   E++S   +++H +  SDTE +   GDFKLFG+
Sbjct: 1558 QDCYLQKCNGSKS-HSLGTELPFLSQSLERTS-NQTRAHGRSLSDTEKTSRNGDFKLFGQ 1615

Query: 340  KIISQP--LQKSITTTQETNDT-VXXXXXXXXXXXFKLTNDV--------------NYSS 468
             I+S P  LQ   + + E +D               K T                 NY  
Sbjct: 1616 -ILSHPPSLQNPNSCSNENDDKGAHNPKLSSKSVNLKFTGHHCIDGNLGASKVDRNNYLG 1674

Query: 469  LKELPTRSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASS------LGLPAVIKR 630
            L+ LP  SYGFWDGNRIQTGFSSLPDS +L++KYP  F N P SS        L  V+K 
Sbjct: 1675 LENLPM-SYGFWDGNRIQTGFSSLPDSTLLLAKYPAAFSNYPMSSSTKIEQQSLQTVVKS 1733

Query: 631  NDRNMGCVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            N+RN+  +SVFP+ D+S +  ++DY    R  D T +QP T
Sbjct: 1734 NERNLNGISVFPTRDMSSSNGVADYHQVFRGRDCTKLQPFT 1774


>emb|CAN62996.1| hypothetical protein VITISV_026902 [Vitis vinifera]
          Length = 1971

 Score =  140 bits (352), Expect = 7e-31
 Identities = 102/281 (36%), Positives = 142/281 (50%), Gaps = 34/281 (12%)
 Frame = +1

Query: 13   LLHCSNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGT-----------LPPNQFLL 159
            LL+ +  AE SQ + G PL    K++MN     + C+   +           +  +  L 
Sbjct: 1392 LLNNAVNAELSQKVGGCPLQTPPKEDMNRD---LSCKNPSSAAERLSKLDRDIQSSHSLA 1448

Query: 160  PDFYQEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGK 339
             D Y +  NGSK  H++ + LP LS   E++S   +++H +  SDTE +   GDFKLFG+
Sbjct: 1449 QDCYLQKCNGSKS-HSLGTELPFLSQSLERTS-NQTRAHGRSLSDTEKTSRNGDFKLFGQ 1506

Query: 340  KIISQP--LQKSITTTQETNDT-VXXXXXXXXXXXFKLTNDV--------------NYSS 468
             I+S P  LQ   + + E +D               K T                 NY  
Sbjct: 1507 -ILSHPPSLQNPNSCSNENDDKGAHNPKLSSKSVNLKFTGHHCIDGNLGASKVDRNNYLG 1565

Query: 469  LKELPTRSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASS------LGLPAVIKR 630
            L+ LP  SYGFWDGNRIQTGFSSLPDS +L++KYP  F N P SS        L  V+K 
Sbjct: 1566 LENLPM-SYGFWDGNRIQTGFSSLPDSTLLLAKYPAAFSNYPMSSSTKIEQQSLQTVVKS 1624

Query: 631  NDRNMGCVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            N+RN+  +SVFP+ D+S +  ++DY    R  D T +QP T
Sbjct: 1625 NERNLNGISVFPTRDMSSSNGVADYHQVFRGRDCTKLQPFT 1665


>ref|XP_007220311.1| hypothetical protein PRUPE_ppa000126mg [Prunus persica]
            gi|462416773|gb|EMJ21510.1| hypothetical protein
            PRUPE_ppa000126mg [Prunus persica]
          Length = 1721

 Score =  139 bits (350), Expect = 1e-30
 Identities = 98/272 (36%), Positives = 135/272 (49%), Gaps = 28/272 (10%)
 Frame = +1

Query: 37   ESSQILRGYPLHELNKKEMNGH------ADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKP 198
            ESSQ+L+GYPL    KK+ NG       +++    K        ++  D + +   G+  
Sbjct: 1366 ESSQVLKGYPLQMPTKKDTNGDVTSGNLSEVQNFSKPDRKINGHYMTKDGFLQF--GNCK 1423

Query: 199  PHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIISQP--LQKSI 372
            P       P+     EQ  +G  K+HS  SSD++     GD KLFGK I+S P  L KS 
Sbjct: 1424 PQCSEVDFPLAPRKVEQP-VGPPKAHSWSSSDSDKPSRNGDVKLFGK-ILSNPSSLSKSS 1481

Query: 373  TTTQETNDT-VXXXXXXXXXXXFKLTNDVN--------------YSSLKELPTRSYGFWD 507
            +   E  +               K T   N              Y  ++++P RSYGFW+
Sbjct: 1482 SNIHENEEKGAHNHKLSNTSSNLKFTGHHNADGNSSLLKFDCSSYVGIEKVPRRSYGFWE 1541

Query: 508  GNRIQTGFSSLPDSAILMSKYPTDFGNLPASS-----LGLPAVIKRNDRNMGCVSVFPST 672
            GN++  G+ S  DSAIL++KYP  FGN P +S       L AV+K NDRN+  VSVFPS 
Sbjct: 1542 GNKVHAGYPSFSDSAILLAKYPAAFGNFPTTSSKMEQQPLQAVVKNNDRNINGVSVFPSR 1601

Query: 673  DLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            ++SG+  + DY V+SRS D   V P T  DVK
Sbjct: 1602 EISGSNGVVDYPVFSRSRDGAKVPPFT-VDVK 1632


>ref|XP_002534495.1| conserved hypothetical protein [Ricinus communis]
            gi|223525187|gb|EEF27889.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1651

 Score =  124 bits (312), Expect = 3e-26
 Identities = 95/271 (35%), Positives = 132/271 (48%), Gaps = 22/271 (8%)
 Frame = +1

Query: 7    QYLLHCSNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLP---------PNQFLL 159
            Q L++C    ES QIL GYP+    K+EMNG    I C  H  +           NQF+ 
Sbjct: 1291 QALVNC---IESQQILGGYPVQIPMKREMNGD---ISCRSHSEVQRGLTSESNGANQFVA 1344

Query: 160  PDFYQEIYNGSKPPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGK 339
             D Y +  N +K   +V   LP+L    EQ      K +S+ SSDTE     GD KLFGK
Sbjct: 1345 QDCYLQKCNNTKIQCSVPE-LPLLPQHAEQC-----KDNSRSSSDTEKPSRNGDVKLFGK 1398

Query: 340  KIISQPLQK-----------SITTTQETNDTVXXXXXXXXXXXFKLTNDVNYSSLKELPT 486
             + +   QK            +  T  T+               K  ++ NY  L+ +P 
Sbjct: 1399 ILSNSSSQKMENGDHGTHCPKLGNTSSTSKFSGHQTTDGSTSVLKFDHN-NYLGLENVPV 1457

Query: 487  RSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGN--LPASSLGLPAVIKRNDRNMGCVSV 660
            +SYG+WDGN+IQTGF S+P    L +KYP  F N  + AS +   A  K ND ++  VSV
Sbjct: 1458 KSYGYWDGNKIQTGFPSIPPEYFL-AKYPAAFSNYHISASKVEQQAAGKCNDHSLNSVSV 1516

Query: 661  FPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
             P  ++SG+  + DYQ++ RS   + VQP +
Sbjct: 1517 LPPREISGSNGVVDYQMF-RSNGSSKVQPFS 1546


>gb|EXB80104.1| Nuclear receptor corepressor 1 [Morus notabilis]
          Length = 1731

 Score =  121 bits (303), Expect = 3e-25
 Identities = 101/270 (37%), Positives = 123/270 (45%), Gaps = 19/270 (7%)
 Frame = +1

Query: 16   LHCSNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSK 195
            L  S+ +ESS +LR Y L    KKEMNG        +   LP +            +GS 
Sbjct: 1400 LPLSSNSESSHVLRAYSLQLPVKKEMNGEVRCRNLSEVQNLPNS------------DGSS 1447

Query: 196  PPHTVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIISQPLQKSIT 375
              H V+        C+ Q     S     CS  TE+    GD KLFGK I+S PL     
Sbjct: 1448 SNHFVSQG------CYLQKC---STLKPPCSV-TENG---GDVKLFGK-ILSNPLSVHNH 1493

Query: 376  TTQETNDTVXXXXXXXXXXXFKLTN--------------DVNYSSLKELPTRSYGFWDGN 513
               E N+              K  N                NY  L  +  RSY +WDGN
Sbjct: 1494 CENEENEGSHEHNSSNKPSNTKFINLHNLDGSSAILKFDRNNYLGLDNVQMRSYTYWDGN 1553

Query: 514  RIQTGFSSLPDSAILMSKYPTDFGNLPASS-----LGLPAVIKRNDRNMGCVSVFPSTDL 678
            R+Q  F SLPDSAIL++KYP  F N P SS       L AV K N+RN+  VSVFP+ D+
Sbjct: 1554 RLQAAFPSLPDSAILLAKYPAAFSNFPTSSKMEQQQQLQAVAKSNERNVNGVSVFPTRDI 1613

Query: 679  SGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            S +  + DYQVY RS D   VQP T  DVK
Sbjct: 1614 SSSNGMVDYQVY-RSRDAPMVQPFT-VDVK 1641


>ref|XP_004307402.1| PREDICTED: uncharacterized protein LOC101302495 [Fragaria vesca
            subsp. vesca]
          Length = 1703

 Score =  120 bits (301), Expect = 5e-25
 Identities = 87/268 (32%), Positives = 127/268 (47%), Gaps = 24/268 (8%)
 Frame = +1

Query: 37   ESSQILRGYPLHELNKKEMNGHADL--IGCEKHGTLPPNQFLLPDFYQEIYN-GSKPPHT 207
            + + +L+GYPLH    KE+NGH     +   KH + P            I   G+  P +
Sbjct: 1350 DPAHVLKGYPLHMAMGKEINGHTSCGNLSEVKHLSKPDGDLTGHKPKDCILQFGNCKPRS 1409

Query: 208  VASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIISQPLQ-KSITTTQ 384
                 P++    E+ S   +K+HS  SSDT+     GD KLFGK + S      SI   +
Sbjct: 1410 SQVDFPLVHQKTERRS-DTTKAHSWSSSDTDKPSRNGDVKLFGKILTSTSKSGSSIHENE 1468

Query: 385  ETNDTVXXXXXXXXXXXFKLTNDV------------NYSSLKELPTRSYGFWDGNRIQTG 528
            E                F   +++            NY+ ++ +P R+Y FW+GN++Q G
Sbjct: 1469 EKGSHTHNLSNKASNLKFSGHHNLDGNSGVLKFDSSNYAGIENVPRRNYSFWEGNKVQNG 1528

Query: 529  FSSLPDSAILMSKYPTDFGNLPASSLGL---PAVIKRNDRNMGCVSVFPSTDL-----SG 684
              S PDSA+L++KYP  FGN P SS  L   P  + RND ++   SVFPS ++     SG
Sbjct: 1529 HPSFPDSALLLAKYPAAFGNFPTSSSKLEQQPLAVVRNDGHVNGASVFPSREISSSSSSG 1588

Query: 685  NGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            +G +  +QV+SR  D     P    DVK
Sbjct: 1589 SGIVDYHQVFSRHRDGGAKVPPFTVDVK 1616


>ref|XP_007009786.1| Duplicated homeodomain-like superfamily protein isoform 2 [Theobroma
            cacao] gi|508726699|gb|EOY18596.1| Duplicated
            homeodomain-like superfamily protein isoform 2 [Theobroma
            cacao]
          Length = 1384

 Score =  107 bits (266), Expect = 6e-21
 Identities = 55/101 (54%), Positives = 70/101 (69%), Gaps = 5/101 (4%)
 Frame = +1

Query: 466  SLKELPTRSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASS-----LGLPAVIKR 630
            +++ +P RSYGFWDGNRIQTG SSLPDSAIL++KYP  F N P+SS       L  V++ 
Sbjct: 1195 NVENVPKRSYGFWDGNRIQTGLSSLPDSAILVAKYPAAFVNYPSSSSQMEQQALQTVVRS 1254

Query: 631  NDRNMGCVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLT 753
            N+RN+  VSV+PS ++S N  + DYQVY R  D T V P T
Sbjct: 1255 NERNLNGVSVYPSREISSNNGVVDYQVY-RGRDCTKVAPFT 1294


>ref|XP_004237681.1| PREDICTED: uncharacterized protein LOC101263808 [Solanum
            lycopersicum]
          Length = 1677

 Score =  102 bits (254), Expect = 2e-19
 Identities = 91/270 (33%), Positives = 117/270 (43%), Gaps = 27/270 (10%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S Q ES QIL  Y L E    E NG     GC     L           QE+  G     
Sbjct: 1343 SAQVESCQILGSYLLGESTLTE-NGDP---GCRASAAL-----------QEVQVGRNLQL 1387

Query: 205  TVASSLPVLSTCHEQSSMGHSKSH--------SQCSSDTEHSCPTGDFKLFGKKIISQPL 360
               S+   L  C+  +  G S S            SS  E  C  GD KLFG+ I+S+P 
Sbjct: 1388 DTFSTTCFLQKCNGTNRGGCSVSDLVPNREQTGSSSSVVEKPCRNGDVKLFGQ-ILSKPC 1446

Query: 361  QKSITTTQETNDTVXXXXXXXXXXXFKLTNDV------------NYSSLKELPTRSYGFW 504
             K+  ++                  F  ++ +            N+   +  P RS+GFW
Sbjct: 1447 PKANPSSNAEPIDGSNQMLKVGSNSFSASHSLEGNSATAKFERNNFLGSENHPLRSFGFW 1506

Query: 505  DGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPS 669
            DG+RIQTGFSSLPDSAIL++KYP  FG+   SS       L  V+K  +RN+    VF +
Sbjct: 1507 DGSRIQTGFSSLPDSAILLAKYPAAFGSYGLSSTKMEQPSLHGVVKTTERNLNSPPVFAA 1566

Query: 670  TDLSGNGHL--SDYQVYSRSYDRTNVQPLT 753
             D S N  +  SDYQVY       +VQP T
Sbjct: 1567 RDSSSNSAVAGSDYQVYR----NRDVQPFT 1592


>ref|XP_006340031.1| PREDICTED: uncharacterized protein LOC102602320 [Solanum tuberosum]
          Length = 1677

 Score =  101 bits (252), Expect = 3e-19
 Identities = 68/191 (35%), Positives = 92/191 (48%), Gaps = 19/191 (9%)
 Frame = +1

Query: 238  CHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIISQPLQKSITTTQETNDTVXXXXX 417
            C     + + +     SS  E  C  GD KLFG+ I+S+P  K+  ++            
Sbjct: 1406 CSVSDLIPNREQTGSSSSIVEKPCRNGDVKLFGQ-ILSKPCPKANPSSNAERSDGSNQKL 1464

Query: 418  XXXXXXFKLTNDV------------NYSSLKELPTRSYGFWDGNRIQTGFSSLPDSAILM 561
                  F  ++ +            N+   +  P RS+GFWDGNRIQTGFSSLPDSAIL+
Sbjct: 1465 KVGSDSFSASHSLEGNSATAKFERNNFLGSENHPVRSFGFWDGNRIQTGFSSLPDSAILL 1524

Query: 562  SKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPSTDLSGNGHL--SDYQVYSR 720
            +KYP  FGN   +S       L  V+K  +RN+    VF + D S N  +  SDYQVY  
Sbjct: 1525 AKYPAAFGNYAIASTKMEQPSLHGVVKTAERNLNSPPVFAARDSSSNNGVAGSDYQVYR- 1583

Query: 721  SYDRTNVQPLT 753
                 +VQP T
Sbjct: 1584 ---NRDVQPFT 1591


>ref|XP_006589438.1| PREDICTED: uncharacterized protein LOC100806246 isoform X5 [Glycine
            max]
          Length = 1651

 Score =  100 bits (248), Expect = 7e-19
 Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 24/272 (8%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S+  ++  IL+GYPL    KKEM+     + C    T  P   LLP              
Sbjct: 1338 SDHVDAVSILQGYPLQVPVKKEMDSD---MNCTSSATELP---LLPQ------------- 1378

Query: 205  TVASSLPVLSTCHEQSSMGHSKSHSQC--SSDTEHSCPTGDFKLFGKKIISQP--LQKSI 372
                             + H   H +   SSD++ +   GD KLFGK I++ P   QK  
Sbjct: 1379 ----------------KIEHDDDHIKAFQSSDSDKTFRNGDVKLFGK-ILTNPSTTQKPN 1421

Query: 373  T---------------TTQETNDTVXXXXXXXXXXXFKLTNDVNYSSLKELPTRSYGFWD 507
                            +++ +N  +               +  +Y  L+ +P RSYG+WD
Sbjct: 1422 VGAKGSEENGTHHPKLSSKSSNPKITGHHSADGNLKILKFDHNDYVGLENVPMRSYGYWD 1481

Query: 508  GNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPST 672
            GNRIQTG S+LPDSAIL++KYP  F N   SS       L    K N+R +   S F + 
Sbjct: 1482 GNRIQTGLSTLPDSAILLAKYPAAFSNYLTSSAKLEQPSLQTYSKNNERLLNGASTFTTR 1541

Query: 673  DLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            D++G+  L DYQ++ R  D   VQP    DVK
Sbjct: 1542 DINGSNALIDYQMFRR--DGPKVQPF-MVDVK 1570


>ref|XP_006589437.1| PREDICTED: uncharacterized protein LOC100806246 isoform X4 [Glycine
            max]
          Length = 1652

 Score =  100 bits (248), Expect = 7e-19
 Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 24/272 (8%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S+  ++  IL+GYPL    KKEM+     + C    T  P   LLP              
Sbjct: 1339 SDHVDAVSILQGYPLQVPVKKEMDSD---MNCTSSATELP---LLPQ------------- 1379

Query: 205  TVASSLPVLSTCHEQSSMGHSKSHSQC--SSDTEHSCPTGDFKLFGKKIISQP--LQKSI 372
                             + H   H +   SSD++ +   GD KLFGK I++ P   QK  
Sbjct: 1380 ----------------KIEHDDDHIKAFQSSDSDKTFRNGDVKLFGK-ILTNPSTTQKPN 1422

Query: 373  T---------------TTQETNDTVXXXXXXXXXXXFKLTNDVNYSSLKELPTRSYGFWD 507
                            +++ +N  +               +  +Y  L+ +P RSYG+WD
Sbjct: 1423 VGAKGSEENGTHHPKLSSKSSNPKITGHHSADGNLKILKFDHNDYVGLENVPMRSYGYWD 1482

Query: 508  GNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPST 672
            GNRIQTG S+LPDSAIL++KYP  F N   SS       L    K N+R +   S F + 
Sbjct: 1483 GNRIQTGLSTLPDSAILLAKYPAAFSNYLTSSAKLEQPSLQTYSKNNERLLNGASTFTTR 1542

Query: 673  DLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            D++G+  L DYQ++ R  D   VQP    DVK
Sbjct: 1543 DINGSNALIDYQMFRR--DGPKVQPF-MVDVK 1571


>ref|XP_006589436.1| PREDICTED: uncharacterized protein LOC100806246 isoform X3 [Glycine
            max]
          Length = 1678

 Score =  100 bits (248), Expect = 7e-19
 Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 24/272 (8%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S+  ++  IL+GYPL    KKEM+     + C    T  P   LLP              
Sbjct: 1365 SDHVDAVSILQGYPLQVPVKKEMDSD---MNCTSSATELP---LLPQ------------- 1405

Query: 205  TVASSLPVLSTCHEQSSMGHSKSHSQC--SSDTEHSCPTGDFKLFGKKIISQP--LQKSI 372
                             + H   H +   SSD++ +   GD KLFGK I++ P   QK  
Sbjct: 1406 ----------------KIEHDDDHIKAFQSSDSDKTFRNGDVKLFGK-ILTNPSTTQKPN 1448

Query: 373  T---------------TTQETNDTVXXXXXXXXXXXFKLTNDVNYSSLKELPTRSYGFWD 507
                            +++ +N  +               +  +Y  L+ +P RSYG+WD
Sbjct: 1449 VGAKGSEENGTHHPKLSSKSSNPKITGHHSADGNLKILKFDHNDYVGLENVPMRSYGYWD 1508

Query: 508  GNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPST 672
            GNRIQTG S+LPDSAIL++KYP  F N   SS       L    K N+R +   S F + 
Sbjct: 1509 GNRIQTGLSTLPDSAILLAKYPAAFSNYLTSSAKLEQPSLQTYSKNNERLLNGASTFTTR 1568

Query: 673  DLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            D++G+  L DYQ++ R  D   VQP    DVK
Sbjct: 1569 DINGSNALIDYQMFRR--DGPKVQPF-MVDVK 1597


>ref|XP_006589435.1| PREDICTED: uncharacterized protein LOC100806246 isoform X2 [Glycine
            max]
          Length = 1678

 Score =  100 bits (248), Expect = 7e-19
 Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 24/272 (8%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S+  ++  IL+GYPL    KKEM+     + C    T  P   LLP              
Sbjct: 1365 SDHVDAVSILQGYPLQVPVKKEMDSD---MNCTSSATELP---LLPQ------------- 1405

Query: 205  TVASSLPVLSTCHEQSSMGHSKSHSQC--SSDTEHSCPTGDFKLFGKKIISQP--LQKSI 372
                             + H   H +   SSD++ +   GD KLFGK I++ P   QK  
Sbjct: 1406 ----------------KIEHDDDHIKAFQSSDSDKTFRNGDVKLFGK-ILTNPSTTQKPN 1448

Query: 373  T---------------TTQETNDTVXXXXXXXXXXXFKLTNDVNYSSLKELPTRSYGFWD 507
                            +++ +N  +               +  +Y  L+ +P RSYG+WD
Sbjct: 1449 VGAKGSEENGTHHPKLSSKSSNPKITGHHSADGNLKILKFDHNDYVGLENVPMRSYGYWD 1508

Query: 508  GNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPST 672
            GNRIQTG S+LPDSAIL++KYP  F N   SS       L    K N+R +   S F + 
Sbjct: 1509 GNRIQTGLSTLPDSAILLAKYPAAFSNYLTSSAKLEQPSLQTYSKNNERLLNGASTFTTR 1568

Query: 673  DLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            D++G+  L DYQ++ R  D   VQP    DVK
Sbjct: 1569 DINGSNALIDYQMFRR--DGPKVQPF-MVDVK 1597


>ref|XP_006589434.1| PREDICTED: uncharacterized protein LOC100806246 isoform X1 [Glycine
            max]
          Length = 1679

 Score =  100 bits (248), Expect = 7e-19
 Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 24/272 (8%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S+  ++  IL+GYPL    KKEM+     + C    T  P   LLP              
Sbjct: 1366 SDHVDAVSILQGYPLQVPVKKEMDSD---MNCTSSATELP---LLPQ------------- 1406

Query: 205  TVASSLPVLSTCHEQSSMGHSKSHSQC--SSDTEHSCPTGDFKLFGKKIISQP--LQKSI 372
                             + H   H +   SSD++ +   GD KLFGK I++ P   QK  
Sbjct: 1407 ----------------KIEHDDDHIKAFQSSDSDKTFRNGDVKLFGK-ILTNPSTTQKPN 1449

Query: 373  T---------------TTQETNDTVXXXXXXXXXXXFKLTNDVNYSSLKELPTRSYGFWD 507
                            +++ +N  +               +  +Y  L+ +P RSYG+WD
Sbjct: 1450 VGAKGSEENGTHHPKLSSKSSNPKITGHHSADGNLKILKFDHNDYVGLENVPMRSYGYWD 1509

Query: 508  GNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNMGCVSVFPST 672
            GNRIQTG S+LPDSAIL++KYP  F N   SS       L    K N+R +   S F + 
Sbjct: 1510 GNRIQTGLSTLPDSAILLAKYPAAFSNYLTSSAKLEQPSLQTYSKNNERLLNGASTFTTR 1569

Query: 673  DLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
            D++G+  L DYQ++ R  D   VQP    DVK
Sbjct: 1570 DINGSNALIDYQMFRR--DGPKVQPF-MVDVK 1598


>ref|XP_006606235.1| PREDICTED: uncharacterized protein LOC100810588 isoform X5 [Glycine
            max]
          Length = 1664

 Score = 97.8 bits (242), Expect = 4e-18
 Identities = 91/280 (32%), Positives = 123/280 (43%), Gaps = 32/280 (11%)
 Frame = +1

Query: 25   SNQAESSQILRGYPLHELNKKEMNGHADLIGCEKHGTLPPNQFLLPDFYQEIYNGSKPPH 204
            S+  ++  IL+GYP     KKEMNG    + C    T  P  FL              PH
Sbjct: 1343 SDHVDAVSILQGYPFQVPLKKEMNGD---MNCSSSATELP--FL--------------PH 1383

Query: 205  TVASSLPVLSTCHEQSSMGHSKSHSQCSSDTEHSCPTGDFKLFGKKIISQPLQKSITTTQ 384
             +            +    H K+    SSD++ +   GD KLFGK I++ P     +TTQ
Sbjct: 1384 KI------------EQDDDHIKTFQ--SSDSDKTSRNGDVKLFGK-ILTNP-----STTQ 1423

Query: 385  ETN--------DTVXXXXXXXXXXXFKLTN----DVNYSSLK--------------ELPT 486
            + N        +              K T     D N   LK               +P 
Sbjct: 1424 KPNVGAKGSEENGTHHPKLSSKSSNLKFTGHHSADGNLKILKFDHNDYVGLENVLENVPM 1483

Query: 487  RSYGFWDGNRIQTGFSSLPDSAILMSKYPTDFGNLPASSL-----GLPAVIKRNDRNM-G 648
            RSYG+WDGNRIQTG S+LPDSAIL++KYP  F N P SS       L    K N+R + G
Sbjct: 1484 RSYGYWDGNRIQTGLSTLPDSAILLAKYPAAFSNYPTSSAKLEQPSLQTYSKNNERLLNG 1543

Query: 649  CVSVFPSTDLSGNGHLSDYQVYSRSYDRTNVQPLTAADVK 768
              ++  + D++G+  + DYQ++ R  D   VQP    DVK
Sbjct: 1544 APTLTTTRDINGSNAVIDYQLFRR--DGPKVQPF-MVDVK 1580


Top