BLASTX nr result

ID: Akebia23_contig00017764 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00017764
         (1322 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265699.1| PREDICTED: general transcription factor 3C p...   295   4e-77
emb|CBI24131.3| unnamed protein product [Vitis vinifera]              288   4e-75
gb|EXC33654.1| General transcription factor 3C polypeptide 3 [Mo...   258   5e-66
ref|XP_004241851.1| PREDICTED: uncharacterized protein LOC101258...   252   3e-64
emb|CAN60433.1| hypothetical protein VITISV_020389 [Vitis vinifera]   247   9e-63
ref|XP_007019760.1| Tetratricopeptide repeat-containing protein,...   246   1e-62
ref|XP_007019759.1| Tetratricopeptide repeat-containing protein,...   246   1e-62
ref|XP_004146849.1| PREDICTED: transcription factor tau subunit ...   246   2e-62
ref|XP_006356573.1| PREDICTED: general transcription factor 3C p...   239   2e-60
ref|XP_002978980.1| hypothetical protein SELMODRAFT_444100 [Sela...   233   1e-58
ref|XP_002994584.1| hypothetical protein SELMODRAFT_432497 [Sela...   233   1e-58
ref|XP_007200319.1| hypothetical protein PRUPE_ppa001046mg [Prun...   231   5e-58
ref|XP_006836911.1| hypothetical protein AMTR_s00099p00131860 [A...   226   2e-56
ref|XP_006441797.1| hypothetical protein CICLE_v10020572mg [Citr...   218   4e-54
ref|XP_006376472.1| hypothetical protein POPTR_0013s13300g [Popu...   218   4e-54
ref|XP_006478352.1| PREDICTED: general transcription factor 3C p...   217   1e-53
ref|XP_001753353.1| predicted protein [Physcomitrella patens] gi...   209   3e-51
ref|XP_006590810.1| PREDICTED: general transcription factor 3C p...   206   2e-50
ref|XP_007131656.1| hypothetical protein PHAVU_011G031000g [Phas...   205   4e-50
ref|XP_006592051.1| PREDICTED: general transcription factor 3C p...   202   3e-49

>ref|XP_002265699.1| PREDICTED: general transcription factor 3C polypeptide 3-like [Vitis
            vinifera]
          Length = 1110

 Score =  295 bits (754), Expect = 4e-77
 Identities = 165/326 (50%), Positives = 202/326 (61%), Gaps = 34/326 (10%)
 Frame = +2

Query: 446  LRFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAF 625
            LRFE  MNPLDF E++A G+QPY+QFERLEYEALAE+KRKALS  Q +G AKK+R ED  
Sbjct: 35   LRFEDGMNPLDFTENDASGLQPYEQFERLEYEALAEKKRKALSQCQFEGLAKKARHEDDS 94

Query: 626  TTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIR 805
               FDEIME M+                        P V+RKLG+AN+ YA G+YEEAI 
Sbjct: 95   QAIFDEIMETMNHRRRRKSRKRKKSGRRKGLKNKLSPEVTRKLGEANLHYAHGRYEEAIL 154

Query: 806  VLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSRE 985
            VL EVV LAPNL + YHT GLVY   GDKK+A++FYMLAAHL PKD   LW  L   S E
Sbjct: 155  VLKEVVRLAPNLPDAYHTFGLVYNAFGDKKRALNFYMLAAHLTPKDS-SLWKLLVTWSIE 213

Query: 986  EGSIRQAAYCLSKAISAD----------------------------------PENVEVRK 1063
            +G+  QA YCLSKAI+AD                                  PENVE  K
Sbjct: 214  QGNTGQARYCLSKAITADPEDISLRFHRASLYVELGEYQKAAESYEQISQLFPENVEAPK 273

Query: 1064 KTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARS 1243
              AK+Y +C QVERSV ILE YIKD+ T+ADLS+V +L  + M+++ + +ALQ IE A+ 
Sbjct: 274  TGAKLYKKCGQVERSVSILEDYIKDHPTKADLSIVDMLAAVCMENNVHDRALQHIEHAQL 333

Query: 1244 VHCSGEHWPFSLTVMEGICQVQLGNI 1321
            ++CSG+  P  LT+  GIC + LGNI
Sbjct: 334  LYCSGKDLPLHLTIKAGICHIHLGNI 359


>emb|CBI24131.3| unnamed protein product [Vitis vinifera]
          Length = 915

 Score =  288 bits (737), Expect = 4e-75
 Identities = 161/320 (50%), Positives = 198/320 (61%), Gaps = 34/320 (10%)
 Frame = +2

Query: 464  MNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFTTSFDE 643
            MNPLDF E++A G+QPY+QFERLEYEALAE+KRKALS  Q +G AKK+R ED     FDE
Sbjct: 1    MNPLDFTENDASGLQPYEQFERLEYEALAEKKRKALSQCQFEGLAKKARHEDDSQAIFDE 60

Query: 644  IMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIRVLNEVV 823
            IME M+                        P V+RKLG+AN+ YA G+YEEAI VL EVV
Sbjct: 61   IMETMNHRRRRKSRKRKKSGRRKGLKNKLSPEVTRKLGEANLHYAHGRYEEAILVLKEVV 120

Query: 824  MLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSREEGSIRQ 1003
             LAPNL + YHT GLVY   GDKK+A++FYMLAAHL PKD   LW  L   S E+G+  Q
Sbjct: 121  RLAPNLPDAYHTFGLVYNAFGDKKRALNFYMLAAHLTPKDS-SLWKLLVTWSIEQGNTGQ 179

Query: 1004 AAYCLSKAISAD----------------------------------PENVEVRKKTAKMY 1081
            A YCLSKAI+AD                                  PENVE  K  AK+Y
Sbjct: 180  ARYCLSKAITADPEDISLRFHRASLYVELGEYQKAAESYEQISQLFPENVEAPKTGAKLY 239

Query: 1082 LECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARSVHCSGE 1261
             +C QVERSV ILE YIKD+ T+ADLS+V +L  + M+++ + +ALQ IE A+ ++CSG+
Sbjct: 240  KKCGQVERSVSILEDYIKDHPTKADLSIVDMLAAVCMENNVHDRALQHIEHAQLLYCSGK 299

Query: 1262 HWPFSLTVMEGICQVQLGNI 1321
              P  LT+  GIC + LGNI
Sbjct: 300  DLPLHLTIKAGICHIHLGNI 319


>gb|EXC33654.1| General transcription factor 3C polypeptide 3 [Morus notabilis]
          Length = 534

 Score =  258 bits (658), Expect = 5e-66
 Identities = 151/340 (44%), Positives = 197/340 (57%), Gaps = 49/340 (14%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSD-RQSDGSAKKSRKEDAF 625
            RF+  +NPLDF+EDNAL  QPY+QFERLEYEALAE+KRKAL+D  + +GS KK+R EDA 
Sbjct: 67   RFKEGVNPLDFVEDNALSGQPYKQFERLEYEALAEKKRKALADSHRREGSIKKARVEDAS 126

Query: 626  TTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEE--- 796
              + +EIM+ M++                         V +KLG+A ++YA G+YEE   
Sbjct: 127  RATMEEIMKAMNYGVRRKSREPKKRGRQKGSKNKPNREVVQKLGEATLYYAHGRYEEIFI 186

Query: 797  -----------AIRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKD 943
                       AI VL+++V+ AP+L + YHTLGLV+    D ++A+ FYMLAAHL PKD
Sbjct: 187  SHLGLALWSYSAISVLHQIVLKAPHLPDAYHTLGLVHTAMDDTERALGFYMLAAHLMPKD 246

Query: 944  HFPLWNRLADLSREEGSIRQAAYCLSKAISAD---------------------------- 1039
               LW  L   S E+G I QA YCL+KAI+AD                            
Sbjct: 247  S-SLWKLLVSWSTEKGDIAQANYCLTKAITADPNDIQLRLLRASLYLELRDAQKAAESYD 305

Query: 1040 ------PENVEVRKKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHH 1201
                  PENVE  K  AK+Y  C  VERS+ ILE Y+ ++ TEADLSV+ LL  I M+ +
Sbjct: 306  QIYQLCPENVEALKTGAKLYKRCGLVERSIHILEDYLNNHPTEADLSVIDLLASILMETN 365

Query: 1202 AYIKALQQIEQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
             + KALQ IE A  V+CSG+  PF+LTV   IC + LGN+
Sbjct: 366  EHSKALQHIEHALLVYCSGKELPFNLTVKAAICHIYLGNM 405


>ref|XP_004241851.1| PREDICTED: uncharacterized protein LOC101258763 [Solanum
            lycopersicum]
          Length = 943

 Score =  252 bits (643), Expect = 3e-64
 Identities = 147/331 (44%), Positives = 195/331 (58%), Gaps = 40/331 (12%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLE--YEALAERKRKALSDRQSDGSAKKSRKEDA 622
            +F  EM+PL F E++A G QPYQQFE LE  YEALA +KRK  +   S+  AKKSR+ED 
Sbjct: 64   QFGAEMDPLAFTEEDAFGRQPYQQFEHLEHQYEALAAKKRKVQALPPSEIPAKKSRQEDR 123

Query: 623  FT----TSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQY 790
                   S+DEI+E M++                       P ++RKLG A + YA G+Y
Sbjct: 124  QEDGPGASYDEILEAMNYGMRKKSRKLKKRGRRKGSKSKVSPELTRKLGDATLHYAHGRY 183

Query: 791  EEAIRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLA 970
            EEA  VL EV+ L+PNL + YHTLGL+Y   GDKK+AM+FYMLAAHL+PKD   LWN L 
Sbjct: 184  EEAKLVLREVIRLSPNLPDPYHTLGLIYNAMGDKKRAMNFYMLAAHLSPKD-ASLWNLLV 242

Query: 971  DLSREEGSIRQAAYCLSKAISADPENVEVRKKTAKMYLE--------------------- 1087
              S E+G  +Q  YCLSKAI ADPE++ +R + A +Y+E                     
Sbjct: 243  AWSTEQGDRKQTRYCLSKAIKADPEDLSLRFQRASIYIELGDYQKAAEQYEQIARLCPND 302

Query: 1088 -------------CNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQI 1228
                         C + E SVGILE Y+K++ TEADLSV+ LL  IHM+ +A++KAL  I
Sbjct: 303  VGVLKTAVQFYSKCGKHECSVGILEDYLKNHPTEADLSVIHLLAVIHMEDNAHLKALDLI 362

Query: 1229 EQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            E A+  + +G+  PF+L +  GIC + LG+I
Sbjct: 363  EWAKQRYFTGKQMPFNLNIKAGICHLHLGHI 393


>emb|CAN60433.1| hypothetical protein VITISV_020389 [Vitis vinifera]
          Length = 1463

 Score =  247 bits (630), Expect = 9e-63
 Identities = 152/354 (42%), Positives = 194/354 (54%), Gaps = 62/354 (17%)
 Frame = +2

Query: 446  LRFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAF 625
            LRFE  MNPLDF E++A G+QPY+QFERLEYEALAE+KRKALS  Q +G AKK+R ED  
Sbjct: 35   LRFEDGMNPLDFTENDASGLQPYEQFERLEYEALAEKKRKALSQCQFEGLAKKARHEDDS 94

Query: 626  TTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIR 805
               FDEIME M+                        P V+RKLG+AN+ YA G+YEEAI 
Sbjct: 95   QAIFDEIMETMNHRRRRKSRKRKKSGRRKGLKNKLSPEVTRKLGEANLHYAHGRYEEAIL 154

Query: 806  VLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLA----- 970
            VL EVV LAPNL + YHT GLVY   GDKK+A++FYMLAAHL PKD   LW  L      
Sbjct: 155  VLKEVVRLAPNLPDAYHTFGLVYNAFGDKKRALNFYMLAAHLTPKDS-SLWKLLVTWSIK 213

Query: 971  --------------DLSRE----------------EGSIRQAAYCLSKAISADPENVEVR 1060
                          DL+ E                + SI ++     K    +   VE R
Sbjct: 214  KDLTEKIPEEDKGQDLASETEGDRCKPSSDRMRSSKSSISESERLWRKLKFQEKGRVEPR 273

Query: 1061 KKTAKMYL---------------------------ECNQVERSVGILEKYIKDYSTEADL 1159
             +  +++L                           +C QVERSV ILE YIKD+ T+ADL
Sbjct: 274  IEEIRIFLSETTLNNPGYWDPKGKVIFCLPFQLYKKCGQVERSVSILEDYIKDHPTKADL 333

Query: 1160 SVVVLLVDIHMKHHAYIKALQQIEQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            S+V +L  + M+++ + +ALQ IE A+ ++CSG+  P  LT+  GIC + LGNI
Sbjct: 334  SIVDMLAAVCMENNVHDRALQHIEHAQLLYCSGKDLPLHLTIKAGICHIHLGNI 387


>ref|XP_007019760.1| Tetratricopeptide repeat-containing protein, putative isoform 2
            [Theobroma cacao] gi|508725088|gb|EOY16985.1|
            Tetratricopeptide repeat-containing protein, putative
            isoform 2 [Theobroma cacao]
          Length = 807

 Score =  246 bits (629), Expect = 1e-62
 Identities = 141/325 (43%), Positives = 192/325 (59%), Gaps = 35/325 (10%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQ-SDGSAKKSRKEDAF 625
            RF+  +NPL+F+ +NA G+Q YQQFERLEYEALAE+KRKAL+D   S+G AKK+R+ED  
Sbjct: 48   RFKSGINPLEFVGENASGLQIYQQFERLEYEALAEKKRKALADTHLSEGPAKKARQEDIS 107

Query: 626  TTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIR 805
              + DEIM+ ++F                       P +   LG A + YA G+Y+EAI 
Sbjct: 108  EATMDEIMQVINFGARRKSKKRKKRGRRKGSRNKLSPEILGMLGDATLHYANGRYKEAIS 167

Query: 806  VLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSRE 985
            VLNEVV LAPNL ++YHTLGLV++  G+ K A  FYMLA  L PKD   LW +L   S E
Sbjct: 168  VLNEVVRLAPNLPDSYHTLGLVHKALGNNKIAFEFYMLAGILKPKDS-SLWQQLFTWSIE 226

Query: 986  EGSIRQAAYCLSKAISAD----------------------------------PENVEVRK 1063
            +G++ Q  YCLSKAI+AD                                  P NVE  K
Sbjct: 227  QGNVSQTCYCLSKAITADPTDISLRFHQASLYVELGDHQRAAESYEQIQRLSPANVEALK 286

Query: 1064 KTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARS 1243
              AK+Y +C Q ER+V ILE Y++ + +E DLSV+ LLV + MK +AY +A+ +IE+A+ 
Sbjct: 287  SGAKLYQKCGQTERAVAILEDYLRGHPSEVDLSVIDLLVAMLMKINAYKRAILKIEEAQI 346

Query: 1244 VHCSGEHWPFSLTVMEGICQVQLGN 1318
            ++ S +  P +L +  GIC + LG+
Sbjct: 347  IYYSEKELPLNLKIKAGICHIHLGD 371


>ref|XP_007019759.1| Tetratricopeptide repeat-containing protein, putative isoform 1
            [Theobroma cacao] gi|590602468|ref|XP_007019761.1|
            Tetratricopeptide repeat-containing protein, putative
            isoform 1 [Theobroma cacao]
            gi|590602472|ref|XP_007019762.1| Tetratricopeptide
            repeat-containing protein, putative isoform 1 [Theobroma
            cacao] gi|508725087|gb|EOY16984.1| Tetratricopeptide
            repeat-containing protein, putative isoform 1 [Theobroma
            cacao] gi|508725089|gb|EOY16986.1| Tetratricopeptide
            repeat-containing protein, putative isoform 1 [Theobroma
            cacao] gi|508725090|gb|EOY16987.1| Tetratricopeptide
            repeat-containing protein, putative isoform 1 [Theobroma
            cacao]
          Length = 923

 Score =  246 bits (629), Expect = 1e-62
 Identities = 141/325 (43%), Positives = 192/325 (59%), Gaps = 35/325 (10%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQ-SDGSAKKSRKEDAF 625
            RF+  +NPL+F+ +NA G+Q YQQFERLEYEALAE+KRKAL+D   S+G AKK+R+ED  
Sbjct: 48   RFKSGINPLEFVGENASGLQIYQQFERLEYEALAEKKRKALADTHLSEGPAKKARQEDIS 107

Query: 626  TTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIR 805
              + DEIM+ ++F                       P +   LG A + YA G+Y+EAI 
Sbjct: 108  EATMDEIMQVINFGARRKSKKRKKRGRRKGSRNKLSPEILGMLGDATLHYANGRYKEAIS 167

Query: 806  VLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSRE 985
            VLNEVV LAPNL ++YHTLGLV++  G+ K A  FYMLA  L PKD   LW +L   S E
Sbjct: 168  VLNEVVRLAPNLPDSYHTLGLVHKALGNNKIAFEFYMLAGILKPKDS-SLWQQLFTWSIE 226

Query: 986  EGSIRQAAYCLSKAISAD----------------------------------PENVEVRK 1063
            +G++ Q  YCLSKAI+AD                                  P NVE  K
Sbjct: 227  QGNVSQTCYCLSKAITADPTDISLRFHQASLYVELGDHQRAAESYEQIQRLSPANVEALK 286

Query: 1064 KTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARS 1243
              AK+Y +C Q ER+V ILE Y++ + +E DLSV+ LLV + MK +AY +A+ +IE+A+ 
Sbjct: 287  SGAKLYQKCGQTERAVAILEDYLRGHPSEVDLSVIDLLVAMLMKINAYKRAILKIEEAQI 346

Query: 1244 VHCSGEHWPFSLTVMEGICQVQLGN 1318
            ++ S +  P +L +  GIC + LG+
Sbjct: 347  IYYSEKELPLNLKIKAGICHIHLGD 371


>ref|XP_004146849.1| PREDICTED: transcription factor tau subunit sfc4-like [Cucumis
            sativus]
          Length = 927

 Score =  246 bits (628), Expect = 2e-62
 Identities = 138/325 (42%), Positives = 186/325 (57%), Gaps = 34/325 (10%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFT 628
            +F+   NP DF+E     VQPY++FERLEYEALAE+KRKAL++ QS+ +AK+ R ED   
Sbjct: 60   KFKAGENPFDFVEGTDFSVQPYKKFERLEYEALAEKKRKALANGQSERAAKRGRVEDISG 119

Query: 629  TSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIRV 808
             SFDEI+E M++                         V++ LG A + YA+G++E+AI +
Sbjct: 120  ASFDEILEAMNYGSRRKLKEPKKRGRRKGSKKKLNRDVTKLLGDATLCYAQGEHEKAISL 179

Query: 809  LNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSREE 988
            L +VV+ AP+L ++YHTLGLVY   GD  KAM FYMLAAHL PKD   LW  L   S + 
Sbjct: 180  LRQVVLRAPDLPDSYHTLGLVYNAIGDDVKAMGFYMLAAHLMPKDS-SLWKLLFSWSIDR 238

Query: 989  GSIRQAAYCLSKAISADPE----------------------------------NVEVRKK 1066
            G I QA+YCLSKAI A+P+                                  NVE    
Sbjct: 239  GDIDQASYCLSKAIKAEPDDINLLFHRASLYLERGDCEKAAETYDQIHQQCLGNVEALMT 298

Query: 1067 TAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARSV 1246
             AK+Y +C  +ER++ ILE YIK + +EADL VV LL  ++M    + KAL++IE A  V
Sbjct: 299  GAKLYQKCGHLERAICILEDYIKGHPSEADLDVVDLLASLYMGSKEFSKALERIEHADRV 358

Query: 1247 HCSGEHWPFSLTVMEGICQVQLGNI 1321
            +C+G   P +LT   GIC   LG++
Sbjct: 359  YCAGNELPLNLTTKAGICHAHLGDL 383


>ref|XP_006356573.1| PREDICTED: general transcription factor 3C polypeptide 3-like
            [Solanum tuberosum]
          Length = 955

 Score =  239 bits (610), Expect = 2e-60
 Identities = 146/337 (43%), Positives = 191/337 (56%), Gaps = 46/337 (13%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLE--YEALAERKRKALS------DRQSDGSAKK 604
            +F  EM+PL F E +A G QPYQQFE LE  YEALA +KRKA +         S+  AKK
Sbjct: 63   QFGAEMDPLAFTEVDAFGRQPYQQFEHLEHQYEALAAKKRKAQALPPRCVSECSEIPAKK 122

Query: 605  SRKEDAFT----TSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMF 772
            SR+ED        S+DEI+E M++                         + RKLG A + 
Sbjct: 123  SRQEDRQEDGPGASYDEILEAMNYGMRRKSRKLKKRGRRKGSKSKVSSELKRKLGDATLH 182

Query: 773  YAKGQYEEAIRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFP 952
            YA G+YEEA  VL EVV L+PNL + YHTLGL+Y   GDKK+AM+FYMLAAHL+PKD   
Sbjct: 183  YAHGRYEEAKLVLREVVRLSPNLPDPYHTLGLIYNAMGDKKRAMNFYMLAAHLSPKD-AS 241

Query: 953  LWNRLADLSREEGSIRQAAYCLSKAISADPENVEVRKKTAKMYLE--------------- 1087
            LWN L   S ++G  +Q  YCLSKAI ADPE++ +R   A +Y+E               
Sbjct: 242  LWNLLVAWSTDQGDRKQTRYCLSKAIKADPEDLSLRFHRASIYIELGDYQKAAEQYEQIA 301

Query: 1088 -------------------CNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYI 1210
                               C + E SVGILE Y+K++ TEADLSV+ LL  IHM+ +A++
Sbjct: 302  RLCPNDVGVLKTAVQFYSKCGKHECSVGILEDYLKNHPTEADLSVIHLLAVIHMEDNAHL 361

Query: 1211 KALQQIEQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            KAL  IE A+  + +G+  P +L +  GIC + LG+I
Sbjct: 362  KALDLIEWAKQRYFTGKQMPLNLNIKAGICHLHLGHI 398


>ref|XP_002978980.1| hypothetical protein SELMODRAFT_444100 [Selaginella moellendorffii]
            gi|300153298|gb|EFJ19937.1| hypothetical protein
            SELMODRAFT_444100 [Selaginella moellendorffii]
          Length = 1047

 Score =  233 bits (595), Expect = 1e-58
 Identities = 136/331 (41%), Positives = 184/331 (55%), Gaps = 39/331 (11%)
 Frame = +2

Query: 446  LRFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALS-DRQSDGSAKKSRKEDA 622
            LRFEG+M+PL F++ +  G  PYQQFERLEYEALAERKRKAL+  R+ +    K  ++D 
Sbjct: 168  LRFEGDMDPLAFVDVDQNGDLPYQQFERLEYEALAERKRKALAKKREEEEMNAKESQQDI 227

Query: 623  FTTSFDEIMEGMS----FXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQY 790
            F    D+I                                 P VSRKLG+AN+ YA  + 
Sbjct: 228  FGADIDDIWNAFGPKRRRRAGEAKRKGRKKVPGIPGASRLPPEVSRKLGEANLLYATRKN 287

Query: 791  EEAIRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLA 970
            +EAI +L EVV LAPN  + YHTLGL+Y+  GD+KKA++FYM+ AHL PKD   LW RLA
Sbjct: 288  DEAIALLKEVVRLAPNAPDAYHTLGLLYDAMGDRKKALNFYMICAHLKPKD-AALWKRLA 346

Query: 971  DLSREEGSIRQAAYCLSKAISADPE----------------------------------N 1048
              S E G+  Q  +CL+KAI ADP+                                  +
Sbjct: 347  SWSTELGNTGQVIHCLTKAIRADPDDIDAKWDRASLYAEILDFQKAADAFEQMLVLRSSD 406

Query: 1049 VEVRKKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQI 1228
            VEV K  AKM  +   ++R+  +LEK+I ++S EAD + V LL ++HM +  Y  AL QI
Sbjct: 407  VEVCKMVAKMQHKNGNIQRATEVLEKFIDEHSAEADFAAVNLLAELHMGNRNYAAALSQI 466

Query: 1229 EQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            ++AR ++C G+  P  L++  GIC V LGN+
Sbjct: 467  DRARQMYCHGQALPLDLSIKSGICHVHLGNL 497


>ref|XP_002994584.1| hypothetical protein SELMODRAFT_432497 [Selaginella moellendorffii]
            gi|300137377|gb|EFJ04351.1| hypothetical protein
            SELMODRAFT_432497 [Selaginella moellendorffii]
          Length = 1006

 Score =  233 bits (595), Expect = 1e-58
 Identities = 136/331 (41%), Positives = 184/331 (55%), Gaps = 39/331 (11%)
 Frame = +2

Query: 446  LRFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALS-DRQSDGSAKKSRKEDA 622
            LRFEG+M+PL F++ +  G  PYQQFERLEYEALAERKRKAL+  R+ +    K  ++D 
Sbjct: 127  LRFEGDMDPLAFVDVDQNGDLPYQQFERLEYEALAERKRKALAKKREEEEMNAKESQQDI 186

Query: 623  FTTSFDEIMEGMS----FXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQY 790
            F    D+I                                 P VSRKLG+AN+ YA  + 
Sbjct: 187  FGADIDDIWNAFGPKRRRRAGEAKRKGRKKVPGIPGASRLPPEVSRKLGEANLLYATRKN 246

Query: 791  EEAIRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLA 970
            +EAI +L EVV LAPN  + YHTLGL+Y+  GD+KKA++FYM+ AHL PKD   LW RLA
Sbjct: 247  DEAIALLKEVVRLAPNAPDAYHTLGLLYDAMGDRKKALNFYMICAHLKPKD-AALWKRLA 305

Query: 971  DLSREEGSIRQAAYCLSKAISADPE----------------------------------N 1048
              S E G+  Q  +CL+KAI ADP+                                  +
Sbjct: 306  SWSTELGNTGQVIHCLTKAIRADPDDIDAKWDRASLYAEILDFQKAADAFEQMLVLRSSD 365

Query: 1049 VEVRKKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQI 1228
            VEV K  AKM  +   ++R+  +LEK+I ++S EAD + V LL ++HM +  Y  AL QI
Sbjct: 366  VEVCKMVAKMQHKNGNIQRATEVLEKFIDEHSAEADFAAVNLLAELHMGNRNYAAALSQI 425

Query: 1229 EQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            ++AR ++C G+  P  L++  GIC V LGN+
Sbjct: 426  DRARQMYCHGQALPLDLSIKSGICHVHLGNL 456


>ref|XP_007200319.1| hypothetical protein PRUPE_ppa001046mg [Prunus persica]
            gi|462395719|gb|EMJ01518.1| hypothetical protein
            PRUPE_ppa001046mg [Prunus persica]
          Length = 924

 Score =  231 bits (589), Expect = 5e-58
 Identities = 132/324 (40%), Positives = 182/324 (56%), Gaps = 34/324 (10%)
 Frame = +2

Query: 452  FEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFTT 631
            F+  +NPLDF+ED+A G Q Y+QF  + YEALAERKRK L D + +GS KK+R ED    
Sbjct: 57   FKDGVNPLDFVEDDAFGDQVYEQFVGMGYEALAERKRKTLEDSRPEGSVKKARHEDVTGA 116

Query: 632  SFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIRVL 811
            S +EIME M++                       P ++R+LG+A + Y  G+YEEAI +L
Sbjct: 117  SMEEIMEAMNYGMQRRTRKPKKKGRRKGSKKKLTPEITRRLGEATLHYVHGRYEEAIPIL 176

Query: 812  NEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSREEG 991
             E+V  AP+LSETYHTLGLV++  G++ KA++ + +AA LAPK+   LW  L       G
Sbjct: 177  AEIVKQAPDLSETYHTLGLVHDNLGNELKALNCFTIAALLAPKNP-ALWELLFGWFNRRG 235

Query: 992  SIRQAAYCLSKAISAD----------------------------------PENVEVRKKT 1069
               +A YCLS+AISAD                                  P+NVE  K  
Sbjct: 236  DAHKAIYCLSRAISADPKNIDLKLGRASLYVKLGDYHKAAASYEQIVQACPDNVEALKTA 295

Query: 1070 AKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARSVH 1249
            A MY    Q E S+ ILE Y++D+ TEAD SV+ LL  I M+++A+ +A+Q IE A+ V 
Sbjct: 296  AVMYDRSGQHEHSIHILEAYLRDHPTEADPSVIDLLASILMENNAHNEAIQHIEHAQLVF 355

Query: 1250 CSGEHWPFSLTVMEGICQVQLGNI 1321
            CS +  P ++ +  GIC   LGN+
Sbjct: 356  CSNKAMPLTMKIKAGICHAYLGNM 379


>ref|XP_006836911.1| hypothetical protein AMTR_s00099p00131860 [Amborella trichopoda]
            gi|548839475|gb|ERM99764.1| hypothetical protein
            AMTR_s00099p00131860 [Amborella trichopoda]
          Length = 914

 Score =  226 bits (575), Expect = 2e-56
 Identities = 126/324 (38%), Positives = 189/324 (58%), Gaps = 34/324 (10%)
 Frame = +2

Query: 452  FEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFTT 631
            FEG+M+PL F + N   ++ +++FERLEYEALAERKRK ++D   + S+K+ R+E+ F  
Sbjct: 58   FEGDMDPLGFADANESVIETFEKFERLEYEALAERKRKGITDSSGE-SSKRPRQENLFGA 116

Query: 632  SFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXXPGVSRKLGKANMFYAKGQYEEAIRVL 811
            + +EI E M +                       P V+RK+  AN+ Y  G+Y+EA+ +L
Sbjct: 117  NIEEIEEVMRYGSGRKSSEPKKRGRKKGSKKKLSPEVTRKIMDANIHYTFGRYDEAVSLL 176

Query: 812  NEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSREEG 991
             E+V LAP+++++Y TLG++Y + GDKK+A +FY LAA+L PK+   LW  L  LS+E G
Sbjct: 177  TEIVRLAPSVADSYITLGIIYNDRGDKKRAKNFYSLAAYLTPKNP-KLWECLITLSKELG 235

Query: 992  SIRQAAYCLSKAISADPE----------------------------------NVEVRKKT 1069
             + Q  YC S+AI A+PE                                  NVE  K  
Sbjct: 236  DMVQVNYCFSRAIKANPEDLSLAFLLASHYSELGDYPKAAESYDRILKLCPGNVEACKLA 295

Query: 1070 AKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARSVH 1249
            A+MY  C QVER++ ILE +IK++  +ADL+VV  +  ++M++  Y+KAL+ IE A+ V+
Sbjct: 296  AEMYHSCGQVERAISILENFIKEHPEDADLTVVRSVAALNMENKDYLKALKHIEAAKVVY 355

Query: 1250 CSGEHWPFSLTVMEGICQVQLGNI 1321
            CSG+  P  L V  GIC+  LG++
Sbjct: 356  CSGKVLPCDLIVKAGICEAYLGHM 379


>ref|XP_006441797.1| hypothetical protein CICLE_v10020572mg [Citrus clementina]
            gi|557544059|gb|ESR55037.1| hypothetical protein
            CICLE_v10020572mg [Citrus clementina]
          Length = 384

 Score =  218 bits (556), Expect = 4e-54
 Identities = 130/327 (39%), Positives = 187/327 (57%), Gaps = 36/327 (11%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFT 628
            RF+  +NPL++ E+   G++ YQQFERLEYEALA+RKRKA+        A  + +ED   
Sbjct: 54   RFKSGVNPLEWTENETSGLEAYQQFERLEYEALADRKRKAI--------AATNTEEDVAG 105

Query: 629  TSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX--PGVSRKLGKANMFYAKGQYEEAI 802
            TS D IME +++                         PGV++ LG+A++ YA G +E+AI
Sbjct: 106  TSVDAIMELINYGGYRRKTRKLNKKRGRRKGSKNKLSPGVTKLLGEASLQYAYGNFEQAI 165

Query: 803  RVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSR 982
             +L EVV L+PNL ETY+TLGL +   G+ K A +FY++AAHL+PKD   LW +L   + 
Sbjct: 166  SLLKEVVRLSPNLPETYNTLGLAHSALGNHKSAFAFYVIAAHLSPKDS-ALWKQLLTFAV 224

Query: 983  EEGSIRQAAYCLSKAISAD----------------------------------PENVEVR 1060
            ++G   QA Y L +AI A+                                  P+NV+  
Sbjct: 225  QKGDTAQAMYYLRQAIRAEPKDISLRIHLASFYVEIGDYEKAAESYEQIQKSFPDNVDAT 284

Query: 1061 KKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQAR 1240
            K  A+++L+C Q  RS+GILE+Y+K + ++ADLSV+ LLV I M+++AY K LQ IE A+
Sbjct: 285  KTGAQLFLKCGQTARSIGILEEYLKVHPSDADLSVIDLLVAILMENNAYEKTLQHIEHAQ 344

Query: 1241 SVHCSGEHWPFSLTVMEGICQVQLGNI 1321
             V  SG+  P  L V  GIC ++LGN+
Sbjct: 345  IVRFSGKELPLKLKVKAGICYLRLGNM 371


>ref|XP_006376472.1| hypothetical protein POPTR_0013s13300g [Populus trichocarpa]
            gi|550325748|gb|ERP54269.1| hypothetical protein
            POPTR_0013s13300g [Populus trichocarpa]
          Length = 478

 Score =  218 bits (556), Expect = 4e-54
 Identities = 138/325 (42%), Positives = 179/325 (55%), Gaps = 36/325 (11%)
 Frame = +2

Query: 455  EGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFTTS 634
            E E N LD +E N       QQFE   YEALAE+KRK L+D + +GSAKK+R+ED    S
Sbjct: 54   EEEENALDSMEQN-------QQFE---YEALAEKKRKTLADAKGEGSAKKARQEDMTGAS 103

Query: 635  FDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX--PGVSRKLGKANMFYAKGQYEEAIRV 808
              EI E M+F                         P ++R LG A + YA G YEEA+ V
Sbjct: 104  LAEIEEIMNFGMRKKRRRRMPKRRGRRKGSKNKLSPEITRMLGDATLHYAHGNYEEALTV 163

Query: 809  LNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSREE 988
            L+EVV  AP ++++YHTLGLV++  G+ +KAM FY +AA L PKD   LW  L     E+
Sbjct: 164  LSEVVKRAPLVADSYHTLGLVHKALGNTEKAMKFYRIAAFLRPKDS-SLWKLLFSWHVEQ 222

Query: 989  GSIRQAAYCLSKAISAD----------------------------------PENVEVRKK 1066
            G I +A  CLSKAISAD                                  PE+VE  K 
Sbjct: 223  GDIARAWKCLSKAISADPDDISLRSLHALFYDELGDHQRAAESYEQIVRICPEDVEAIKT 282

Query: 1067 TAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQARSV 1246
             AKM+L C Q++R VGILE Y+K + +EADLSV++LL D+ M+  A+  ALQ IE A+ +
Sbjct: 283  AAKMHLNCGQIKRCVGILEDYLKGHPSEADLSVIILLADVFMEIDAHNNALQHIEHAQMI 342

Query: 1247 HCSGEHWPFSLTVMEGICQVQLGNI 1321
            + SG+  P  L +  GIC V LGNI
Sbjct: 343  YYSGKELPLELMIKAGICHVFLGNI 367


>ref|XP_006478352.1| PREDICTED: general transcription factor 3C polypeptide 3-like [Citrus
            sinensis]
          Length = 922

 Score =  217 bits (552), Expect = 1e-53
 Identities = 129/327 (39%), Positives = 186/327 (56%), Gaps = 36/327 (11%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAFT 628
            RF+  +NPL++ E+   G++ YQQFERLEYEALA+RKRKA+        A  + +ED   
Sbjct: 54   RFKSGVNPLEWTENETSGLEAYQQFERLEYEALADRKRKAI--------AATNTEEDVAG 105

Query: 629  TSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX--PGVSRKLGKANMFYAKGQYEEAI 802
            TS D IME +++                         PGV++ LG+A++ YA G +E+AI
Sbjct: 106  TSVDAIMELINYGGYRKKTRKLNKKRGRRKGSKNKLSPGVTKMLGEASLQYAYGNFEQAI 165

Query: 803  RVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSR 982
             +L EVV L+PNL ETY+TLGL +   G+ K A  FY++AAHL+PKD   LW +L   + 
Sbjct: 166  SLLKEVVRLSPNLPETYNTLGLAHSALGNHKSAFDFYVIAAHLSPKDS-ALWKQLLTFAV 224

Query: 983  EEGSIRQAAYCLSKAISAD----------------------------------PENVEVR 1060
            ++G   QA Y + +AI A+                                  P+NV+  
Sbjct: 225  QKGDTAQAMYYIRQAIRAEPKDISLRIHLASFYVEIGDYEKAAESYEQIQKLFPDNVDAT 284

Query: 1061 KKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQAR 1240
            K  A+++L+C Q  RS+GILE+Y+K + ++ADLSV+ LLV I M+++AY K LQ IE A+
Sbjct: 285  KTGAQLFLKCGQTARSMGILEEYLKVHPSDADLSVIDLLVAILMENNAYEKTLQHIEHAQ 344

Query: 1241 SVHCSGEHWPFSLTVMEGICQVQLGNI 1321
             V  SG+  P  L V  GIC ++LGN+
Sbjct: 345  IVRFSGKELPLKLKVKAGICYLRLGNM 371


>ref|XP_001753353.1| predicted protein [Physcomitrella patens] gi|162695639|gb|EDQ81982.1|
            predicted protein [Physcomitrella patens]
          Length = 926

 Score =  209 bits (531), Expect = 3e-51
 Identities = 124/334 (37%), Positives = 181/334 (54%), Gaps = 48/334 (14%)
 Frame = +2

Query: 464  MNPLDFIEDNALGVQPYQQFERLEYEALAERKRKALSDRQSDG-SAKKSRKEDAFTTSFD 640
            M+PL F E++  G  PY+QF+RLEYEALA RKRK L+ R ++   AK ++++D F  S D
Sbjct: 1    MDPLRFAEEDENGKLPYEQFQRLEYEALAARKRKNLATRSTETVQAKITKQQDIFGASVD 60

Query: 641  EIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX--PGVSRKLGKANMFYAKGQYEE------ 796
            EI +   F                         P +++KLG+AN+ YA GQ++E      
Sbjct: 61   EIWDAAGFGAPGRRRRKGPKRKGRRRKAPGGLTPEINKKLGEANLLYATGQFDEVITTLM 120

Query: 797  -----AIRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWN 961
                 A+ +L EVV +APN++++YHTLGL+Y+  GD+K+A++FYM+AAHL PKD   LW 
Sbjct: 121  CGYSQAVEILKEVVRIAPNVADSYHTLGLLYDAKGDRKRALNFYMIAAHLTPKD-IVLWK 179

Query: 962  RLADLSREEGSIRQAAYCLSKAISADPENVEVRKKTAKMYLECN---------------- 1093
            RLA  S E G+  Q  YCL KA+ ADP +V+ R   A +Y E N                
Sbjct: 180  RLASWSMELGNPGQVIYCLQKAMRADPTDVDARWDCASLYAELNEFPKAIDCLEQLLALR 239

Query: 1094 ------------------QVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKAL 1219
                              Q E++  +LE  I+ Y  EADLS V LL ++HM + A+   +
Sbjct: 240  PGDVEICKMVAKMRQKNGQSEQATQLLEHLIETYPYEADLSAVNLLAELHMANGAFAITI 299

Query: 1220 QQIEQARSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
              I++AR ++ + +  P  L+V  GIC   LG++
Sbjct: 300  SWIDRARELYSADQPLPLDLSVKAGICHAYLGDL 333


>ref|XP_006590810.1| PREDICTED: general transcription factor 3C polypeptide 3-like
            [Glycine max]
          Length = 914

 Score =  206 bits (523), Expect = 2e-50
 Identities = 132/328 (40%), Positives = 174/328 (53%), Gaps = 37/328 (11%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNA-LGVQPYQQFERLEYEALAERKRKALSDRQSDGS-AKKSRKEDA 622
            RF+  MNPLDF++DN   G+QPYQ+F RLE EALA++KRKA+    S+   +K +R+ D 
Sbjct: 42   RFKNGMNPLDFVDDNDDSGIQPYQRFVRLEREALADKKRKAIEQCHSEEPPSKMAREGDV 101

Query: 623  FTTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX-PGVSRKLGKANMFYAKGQYEEA 799
                  EIME M +                        P +++ LG A   YA+G Y++A
Sbjct: 102  SGAKIAEIMEAMDYYGVRKRSRKPKKRGRRKGSKNKDDPKLTQMLGDATFHYARGDYDQA 161

Query: 800  IRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLS 979
              VL EV+ LAPNL E+YHTLGLVY    D K+AM+ Y++AAHL  K+   LW  +   S
Sbjct: 162  KAVLREVIRLAPNLHESYHTLGLVYTSLQDYKRAMALYLIAAHLDAKES-SLWKTIFTWS 220

Query: 980  REEGSIRQAAYCLSKAISADP----------------------------------ENVEV 1057
             E+G + QA YCL KAI ADP                                  EN++ 
Sbjct: 221  IEQGYVDQAGYCLLKAIKADPKDVTLRCHLARLYAELGHYQKAAVTYEQVHKLCCENIDA 280

Query: 1058 RKKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQA 1237
             K  AK Y +C QVE SV ILE YIK     A+ SVV LL  I M+  A+ +ALQ IE A
Sbjct: 281  LKAAAKFYKKCGQVEYSVRILEDYIKSQPDVANASVVDLLGTILMETKAHDRALQHIEHA 340

Query: 1238 RSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            ++V+   E  P +L +  GIC   LGN+
Sbjct: 341  QAVNARKE-LPLNLKIKAGICHAHLGNL 367


>ref|XP_007131656.1| hypothetical protein PHAVU_011G031000g [Phaseolus vulgaris]
            gi|561004656|gb|ESW03650.1| hypothetical protein
            PHAVU_011G031000g [Phaseolus vulgaris]
          Length = 917

 Score =  205 bits (521), Expect = 4e-50
 Identities = 130/327 (39%), Positives = 178/327 (54%), Gaps = 36/327 (11%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNA-LGVQPYQQFERLEYEALAERKRKALSDRQSDGSAKKSRKEDAF 625
            RF+  M+PLDFI++N   G+QPY++FERLE EALA++KRKA      +  +K  R+ D  
Sbjct: 46   RFQNGMDPLDFIDNNDDSGLQPYERFERLEQEALADKKRKATECHSEEPPSKMIRESDIS 105

Query: 626  TTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX-PGVSRKLGKANMFYAKGQYEEAI 802
             +   EIME M++                        P ++R LG A + YA G Y++A 
Sbjct: 106  GSKIAEIMEAMNYHGVRKRSRKPKKRGRRKGSKNKMDPRLTRMLGDATLHYACGHYDKAK 165

Query: 803  RVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLSR 982
             VL EV+ LAPNL ++YHTLGLV     D K+AMSFY++AAHL PKD   LW R+   S 
Sbjct: 166  AVLLEVIKLAPNLPDSYHTLGLVCSSLQDYKRAMSFYLIAAHLTPKDS-SLWKRIFTWSI 224

Query: 983  EEGSIRQAAYCLSKAISADP----------------------------------ENVEVR 1060
            E+G I QA +CL +AI+ADP                                  ENV+  
Sbjct: 225  EQGYIDQARHCLLRAITADPQDVTLRGLLARLYVELGDYQKAAVTYEQVHQLCYENVDPL 284

Query: 1061 KKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQAR 1240
            K  AK+Y +C QVE +V ILE Y+K     A+ SVV LL  I M+  A+ +ALQ IE A+
Sbjct: 285  KAAAKLYKKCGQVEHAVRILEDYLKSQPDGANASVVDLLCTILMETKAHDRALQYIEHAQ 344

Query: 1241 SVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            +V+ + +  P +L +  GIC   LG +
Sbjct: 345  AVN-AWKELPLNLKIKAGICHAHLGKM 370


>ref|XP_006592051.1| PREDICTED: general transcription factor 3C polypeptide 3-like isoform
            X1 [Glycine max] gi|571491818|ref|XP_006592052.1|
            PREDICTED: general transcription factor 3C polypeptide
            3-like isoform X2 [Glycine max]
          Length = 918

 Score =  202 bits (514), Expect = 3e-49
 Identities = 131/328 (39%), Positives = 173/328 (52%), Gaps = 37/328 (11%)
 Frame = +2

Query: 449  RFEGEMNPLDFIEDNA-LGVQPYQQFERLEYEALAERKRKALSDRQSDGS-AKKSRKEDA 622
            RF+  MNPLDF++DN   G+QPYQ+F RLE EALA++KRKA     S+   +K +R+ D 
Sbjct: 46   RFKTGMNPLDFVDDNDDSGIQPYQRFVRLEREALADKKRKAPEQCHSEEPPSKMAREGDI 105

Query: 623  FTTSFDEIMEGMSFXXXXXXXXXXXXXXXXXXXXXXX-PGVSRKLGKANMFYAKGQYEEA 799
                  EIME M +                        P +++  G A   YA G Y+ A
Sbjct: 106  SGAKIAEIMEAMDYYGMRKRSRKPKKRGRRKGSKNRVDPKLTQMQGDATFHYACGDYDRA 165

Query: 800  IRVLNEVVMLAPNLSETYHTLGLVYEETGDKKKAMSFYMLAAHLAPKDHFPLWNRLADLS 979
              VL EV+ LAPNL E+YHTLGLVY    D K+AM+ Y++AAHL PK+  PLW  +   S
Sbjct: 166  KAVLCEVIRLAPNLHESYHTLGLVYTSLQDYKRAMALYLIAAHLDPKES-PLWKTIFTWS 224

Query: 980  REEGSIRQAAYCLSKAISADP----------------------------------ENVEV 1057
             E+G + QA YCL KAI ADP                                  EN++ 
Sbjct: 225  IEQGYVDQAGYCLLKAIKADPKDVTLRFHLARLYAELGHYQKAAVTYEQVHKLCCENIDA 284

Query: 1058 RKKTAKMYLECNQVERSVGILEKYIKDYSTEADLSVVVLLVDIHMKHHAYIKALQQIEQA 1237
             K  AK Y +C QVE S+ ILE YIK     A++SVV LL  + M+  A+ +ALQ IE A
Sbjct: 285  LKAAAKFYKKCGQVEYSIQILEDYIKSQPDGANVSVVDLLGTVLMETKAHDRALQHIEHA 344

Query: 1238 RSVHCSGEHWPFSLTVMEGICQVQLGNI 1321
            ++V+   E  P +L +  GIC   LGN+
Sbjct: 345  QTVNARKE-LPLNLKIKAGICHAHLGNM 371

