BLASTX nr result

ID: Akebia25_contig00005311 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00005311
         (1276 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007203792.1| hypothetical protein PRUPE_ppa001273mg [Prun...   249   2e-63
ref|XP_006430297.1| hypothetical protein CICLE_v10010952mg [Citr...   229   1e-57
ref|XP_006430296.1| hypothetical protein CICLE_v10010952mg [Citr...   229   1e-57
ref|XP_006430295.1| hypothetical protein CICLE_v10010952mg [Citr...   229   1e-57
ref|XP_006481887.1| PREDICTED: ubiquitin-associated protein 2-li...   229   2e-57
ref|XP_006481885.1| PREDICTED: ubiquitin-associated protein 2-li...   229   2e-57
emb|CBI30249.3| unnamed protein product [Vitis vinifera]              227   1e-56
ref|XP_006851712.1| hypothetical protein AMTR_s00040p00210200 [A...   218   3e-54
ref|XP_004303026.1| PREDICTED: uncharacterized protein LOC101305...   218   4e-54
gb|EYU38613.1| hypothetical protein MIMGU_mgv1a002674mg [Mimulus...   217   8e-54
ref|XP_003632752.1| PREDICTED: uncharacterized protein C4G9.04c-...   209   2e-51
ref|XP_006381311.1| hypothetical protein POPTR_0006s11660g [Popu...   205   4e-50
ref|XP_006341164.1| PREDICTED: uncharacterized protein LOC102593...   204   7e-50
gb|EXB37772.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Morus ...   203   1e-49
ref|XP_002528590.1| conserved hypothetical protein [Ricinus comm...   202   3e-49
ref|XP_007027622.1| ENTH/VHS family protein, putative isoform 3 ...   199   2e-48
ref|XP_007027621.1| ENTH/VHS family protein, putative isoform 2 ...   199   2e-48
ref|XP_007027620.1| ENTH/VHS family protein, putative isoform 1 ...   199   2e-48
ref|XP_004246564.1| PREDICTED: uncharacterized protein LOC101244...   199   3e-48
ref|XP_003625749.1| Pre-mRNA cleavage complex 2 protein Pcf11 [M...   182   3e-43

>ref|XP_007203792.1| hypothetical protein PRUPE_ppa001273mg [Prunus persica]
            gi|462399323|gb|EMJ04991.1| hypothetical protein
            PRUPE_ppa001273mg [Prunus persica]
          Length = 866

 Score =  249 bits (636), Expect = 2e-63
 Identities = 133/276 (48%), Positives = 168/276 (60%), Gaps = 20/276 (7%)
 Frame = -2

Query: 1260 NPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXS 1081
            +P+S+LLS+LVAKGLIS+   E PT  S Q+ + L  Q                      
Sbjct: 589  DPISNLLSSLVAKGLISASKSESPTPVSSQMPNELQNQSVSTPVTSSVSVSPVSASPSLP 648

Query: 1080 ----GNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHK 913
                 +D+      AK ++ + +  K+E KN +GIEFKP+ IRE HPSVI +LFDDLPHK
Sbjct: 649  VSSRTDDVSLAEPLAKTSAALPQSSKIETKNPIGIEFKPDKIREFHPSVIEELFDDLPHK 708

Query: 912  CSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSGPS--- 766
            CSICG RLK +E+L+ HLEWHA KT         SRRWY     WVAG  G   GP    
Sbjct: 709  CSICGLRLKLKERLERHLEWHALKTPEFNGSVKASRRWYADSTNWVAGKAGPPLGPEDNM 768

Query: 765  -----VETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDI 601
                  ET +  EP+VPADESQCVC++CG  FED Y  +RDEWM+KGA+Y+S+P   GD+
Sbjct: 769  SIDKPSETMDNGEPMVPADESQCVCVICGYIFEDLYCQERDEWMFKGASYLSIPYGVGDL 828

Query: 600  GTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
            GTT+     GPIVHANC +  S+SDLGL+  IKLE+
Sbjct: 829  GTTEESVVKGPIVHANCIAENSLSDLGLASRIKLEK 864


>ref|XP_006430297.1| hypothetical protein CICLE_v10010952mg [Citrus clementina]
            gi|557532354|gb|ESR43537.1| hypothetical protein
            CICLE_v10010952mg [Citrus clementina]
          Length = 906

 Score =  229 bits (585), Expect = 1e-57
 Identities = 122/280 (43%), Positives = 167/280 (59%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            S   NP+S+LLSTLVAKGLIS+   E P+ T+PQV SR+  +                  
Sbjct: 625  SKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPATVSSVPNL 684

Query: 1092 XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 925
                 +  + + S    A + +  +S+   VE +NL+G++FKP++IRE H SVI  LFD 
Sbjct: 685  LPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDG 744

Query: 924  LPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSG- 772
             PH CSICG RLK QEQLD HLEWHA +         +SRRWY +   WVAG  G   G 
Sbjct: 745  FPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKISRRWYANSDDWVAGKAGLPLGL 804

Query: 771  -------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 613
                    S +T ++ EP+VPAD++QC C++CGE FED Y+  R EWM+K A YM +P+ 
Sbjct: 805  ESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSG 864

Query: 612  DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
            +G++GTT+  ++ GPIVH NC S  SV DL +   +K+E+
Sbjct: 865  NGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 904


>ref|XP_006430296.1| hypothetical protein CICLE_v10010952mg [Citrus clementina]
            gi|557532353|gb|ESR43536.1| hypothetical protein
            CICLE_v10010952mg [Citrus clementina]
          Length = 1073

 Score =  229 bits (585), Expect = 1e-57
 Identities = 122/280 (43%), Positives = 167/280 (59%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            S   NP+S+LLSTLVAKGLIS+   E P+ T+PQV SR+  +                  
Sbjct: 792  SKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPATVSSVPNL 851

Query: 1092 XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 925
                 +  + + S    A + +  +S+   VE +NL+G++FKP++IRE H SVI  LFD 
Sbjct: 852  LPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDG 911

Query: 924  LPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSG- 772
             PH CSICG RLK QEQLD HLEWHA +         +SRRWY +   WVAG  G   G 
Sbjct: 912  FPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKISRRWYANSDDWVAGKAGLPLGL 971

Query: 771  -------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 613
                    S +T ++ EP+VPAD++QC C++CGE FED Y+  R EWM+K A YM +P+ 
Sbjct: 972  ESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSG 1031

Query: 612  DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
            +G++GTT+  ++ GPIVH NC S  SV DL +   +K+E+
Sbjct: 1032 NGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 1071


>ref|XP_006430295.1| hypothetical protein CICLE_v10010952mg [Citrus clementina]
            gi|557532352|gb|ESR43535.1| hypothetical protein
            CICLE_v10010952mg [Citrus clementina]
          Length = 829

 Score =  229 bits (585), Expect = 1e-57
 Identities = 122/280 (43%), Positives = 167/280 (59%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            S   NP+S+LLSTLVAKGLIS+   E P+ T+PQV SR+  +                  
Sbjct: 548  SKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPATVSSVPNL 607

Query: 1092 XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 925
                 +  + + S    A + +  +S+   VE +NL+G++FKP++IRE H SVI  LFD 
Sbjct: 608  LPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDG 667

Query: 924  LPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSG- 772
             PH CSICG RLK QEQLD HLEWHA +         +SRRWY +   WVAG  G   G 
Sbjct: 668  FPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKISRRWYANSDDWVAGKAGLPLGL 727

Query: 771  -------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 613
                    S +T ++ EP+VPAD++QC C++CGE FED Y+  R EWM+K A YM +P+ 
Sbjct: 728  ESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSG 787

Query: 612  DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
            +G++GTT+  ++ GPIVH NC S  SV DL +   +K+E+
Sbjct: 788  NGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 827


>ref|XP_006481887.1| PREDICTED: ubiquitin-associated protein 2-like isoform X3 [Citrus
            sinensis]
          Length = 1070

 Score =  229 bits (584), Expect = 2e-57
 Identities = 122/280 (43%), Positives = 167/280 (59%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            S   NP+S+LLSTLVAKGLIS+   E P+ T+PQV SR+  +                  
Sbjct: 789  SKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPAAVSSVPNL 848

Query: 1092 XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 925
                 +  + + S    A + +  +S+   VE +NL+G++FKP++IRE H SVI  LFD 
Sbjct: 849  LPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDG 908

Query: 924  LPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSG- 772
             PH CSICG RLK QEQLD HLEWHA +         +SRRWY +   WVAG  G   G 
Sbjct: 909  FPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKVSRRWYANSDDWVAGKAGLPLGL 968

Query: 771  -------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 613
                    S +T ++ EP+VPAD++QC C++CGE FED Y+  R EWM+K A YM +P+ 
Sbjct: 969  ESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSG 1028

Query: 612  DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
            +G++GTT+  ++ GPIVH NC S  SV DL +   +K+E+
Sbjct: 1029 NGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 1068


>ref|XP_006481885.1| PREDICTED: ubiquitin-associated protein 2-like isoform X1 [Citrus
            sinensis] gi|568856635|ref|XP_006481886.1| PREDICTED:
            ubiquitin-associated protein 2-like isoform X2 [Citrus
            sinensis]
          Length = 1073

 Score =  229 bits (584), Expect = 2e-57
 Identities = 122/280 (43%), Positives = 167/280 (59%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            S   NP+S+LLSTLVAKGLIS+   E P+ T+PQV SR+  +                  
Sbjct: 792  SKTSNPISNLLSTLVAKGLISASKTEPPSHTTPQVTSRMQNESPGISSSSPAAVSSVPNL 851

Query: 1092 XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 925
                 +  + + S    A + +  +S+   VE +NL+G++FKP++IRE H SVI  LFD 
Sbjct: 852  LPIPPSSTVDETSLPAPAGESSFALSESTTVETQNLIGLKFKPDVIREFHESVIKRLFDG 911

Query: 924  LPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSG- 772
             PH CSICG RLK QEQLD HLEWHA +         +SRRWY +   WVAG  G   G 
Sbjct: 912  FPHLCSICGLRLKLQEQLDRHLEWHALRKPGLDDVDKVSRRWYANSDDWVAGKAGLPLGL 971

Query: 771  -------PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 613
                    S +T ++ EP+VPAD++QC C++CGE FED Y+  R EWM+K A YM +P+ 
Sbjct: 972  ESISCMEDSGKTIDEGEPMVPADDNQCACVMCGELFEDCYNQARGEWMFKAAVYMMIPSG 1031

Query: 612  DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
            +G++GTT+  ++ GPIVH NC S  SV DL +   +K+E+
Sbjct: 1032 NGEVGTTNESSAKGPIVHGNCISENSVHDLRVISKVKVEK 1071


>emb|CBI30249.3| unnamed protein product [Vitis vinifera]
          Length = 1049

 Score =  227 bits (578), Expect = 1e-56
 Identities = 128/280 (45%), Positives = 171/280 (61%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            S   NP+++LLS+LVAKGLIS+   E  T    Q+ +RL  Q                  
Sbjct: 772  SNASNPIANLLSSLVAKGLISASKTESSTHVPTQMPARLQNQSAGISTISPIPVSSVSVA 831

Query: 1092 XXXSGNDLLFKGS----AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDD 925
                 +  +   S    AAK +  V++   VE KNL+G EFK +IIRESHPSVIS+LFDD
Sbjct: 832  SSVPLSSTMDAVSHTEPAAKASVAVTQSTSVEVKNLIGFEFKSDIIRESHPSVISELFDD 891

Query: 924  LPHKCSICGHRLKFQEQLDLHLEWHASK--------TLSRRWYPSLGVWVAGNEG----- 784
            LPH+CSICG RLK +E+LD HLEWHA K          SR W+ + G W+A   G     
Sbjct: 892  LPHQCSICGLRLKLRERLDRHLEWHALKKSEPNGLNRASRSWFVNSGEWIAEVAGFPTEA 951

Query: 783  ---SSSGPSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAV 613
               S +G S +  E SE +VPADE+QCVC+LCGE FEDFYS + D+WM++GA  M++P+ 
Sbjct: 952  KSTSPAGESGKPLETSEQMVPADENQCVCVLCGEVFEDFYSQEMDKWMFRGAVKMTVPSQ 1011

Query: 612  DGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
             G++GT     + GPIVHA+C + +SV DLGL+ +IK+E+
Sbjct: 1012 GGELGT----KNQGPIVHADCITESSVHDLGLACDIKVEK 1047


>ref|XP_006851712.1| hypothetical protein AMTR_s00040p00210200 [Amborella trichopoda]
            gi|548855292|gb|ERN13179.1| hypothetical protein
            AMTR_s00040p00210200 [Amborella trichopoda]
          Length = 1173

 Score =  218 bits (556), Expect = 3e-54
 Identities = 128/291 (43%), Positives = 162/291 (55%), Gaps = 27/291 (9%)
 Frame = -2

Query: 1275 ASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXX 1096
            AS V N LS LLS+LVAKGLIS+P+ E          + +  Q                 
Sbjct: 891  ASGVSNQLSGLLSSLVAKGLISAPTSESSNPPVSHAPTEVQHQTAVVATSATSMLSSRSL 950

Query: 1095 XXXXSGNDLLFKGSAAKITSTVSK------------PMKVERKNLLGIEFKPEIIRESHP 952
                    +        +++++S             P+ +E  NL+GIEFKPE+IRE HP
Sbjct: 951  VSSTPPTSIPIDEPELWVSTSISSAPPQAPRVDTKDPIAIE-PNLIGIEFKPEVIRERHP 1009

Query: 951  SVISDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVA 796
            SVIS LFD +PH+CS CG R   QE+L  HLEWHASK         + R WY SL  WV 
Sbjct: 1010 SVISGLFDAMPHRCSACGLRFNRQEELSKHLEWHASKNHEQSSGKRVLRNWYVSLRNWVE 1069

Query: 795  GNEGSSSGPS-------VETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGA 637
            G+ G S+G +       +   EK EPVVPADESQC+CILCGEPFED+YSH+RDEWMYKGA
Sbjct: 1070 GDVGPSTGDASFPLDEKLSNVEKEEPVVPADESQCICILCGEPFEDYYSHERDEWMYKGA 1129

Query: 636  TYMSLPAVDGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLEQMDG 484
            TYMS     G+ G  DG +S   IVH NC S  +  DL  ++N  +++ DG
Sbjct: 1130 TYMS-----GNGG--DGSSSPVSIVHVNCISKGAADDLLEAENDNVDKADG 1173


>ref|XP_004303026.1| PREDICTED: uncharacterized protein LOC101305191 [Fragaria vesca
            subsp. vesca]
          Length = 1110

 Score =  218 bits (555), Expect = 4e-54
 Identities = 120/281 (42%), Positives = 157/281 (55%), Gaps = 19/281 (6%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKEM--PTLTSPQVASRLPKQXXXXXXXXXXXXXXXX 1099
            S   +P+S+LLS+LVAKGLIS+   E   P  +      ++ K                 
Sbjct: 830  SNAKDPISNLLSSLVAKGLISASKSESTTPLPSHKPTEVQIQKLPTTTVSSISPGSASSI 889

Query: 1098 XXXXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLP 919
                   ++        K ++ +++  K E+KN +G EFKP+ IRE HPSVI +LFDDL 
Sbjct: 890  VPGSSRRDNAPLAEQVVKPSAALAQSTKTEKKNPIGFEFKPDKIRELHPSVIDELFDDLQ 949

Query: 918  HKCSICGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEGSSSGPSV 763
            HKC +CG RLK +E+LD HLEWHA KT         SR WY +   WV G  GSSS    
Sbjct: 950  HKCILCGLRLKLKERLDRHLEWHALKTPEADGSIKASRGWYANSANWVTGKAGSSSDLDS 1009

Query: 762  E--------TAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDG 607
                     T   +EP VPADESQC CI+CG  FEDFY  + D+WM+KGA YM++PA DG
Sbjct: 1010 NNSNDMTGMTVASNEPTVPADESQCACIICGNTFEDFYCQESDDWMFKGAVYMTVPAGDG 1069

Query: 606  DIGTTDGCASLGPIVHANCASPTSVSDLGL-SKNIKLEQMD 487
            ++GT  G    GPIVHA C    S+ +LGL +  +KLE+ D
Sbjct: 1070 ELGTAGGSVLKGPIVHATCIDENSLEELGLAATRVKLEKDD 1110


>gb|EYU38613.1| hypothetical protein MIMGU_mgv1a002674mg [Mimulus guttatus]
          Length = 648

 Score =  217 bits (553), Expect = 8e-54
 Identities = 133/287 (46%), Positives = 163/287 (56%), Gaps = 28/287 (9%)
 Frame = -2

Query: 1272 SAVPNPLSSLLSTLVAKGLISSPSKE---MPTLTSPQVA-------SRLPKQXXXXXXXX 1123
            S+  NP SSLLS+LVAKGLISS   +   +P    P VA       S +P          
Sbjct: 373  SSSSNPFSSLLSSLVAKGLISSSKSDSLMVPVDKVPAVATSSSSPVSSVPFTIPKPLVSI 432

Query: 1122 XXXXXXXXXXXXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVI 943
                         + NDLL   S  KI            K L+G EFKP+++R SHP VI
Sbjct: 433  TDIPSSSLEPAVKASNDLL--QSTEKI------------KQLIGFEFKPDVVRNSHPDVI 478

Query: 942  SDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASK--------TLSRRWYPSLGVWVAG-- 793
            SDL  DLPH+C+ICG R K QE+L  H+EWHASK         +SR+WY S+  WVAG  
Sbjct: 479  SDLVSDLPHECTICGLRFKLQERLGRHMEWHASKFSDYNPNSNMSRKWYASVVDWVAGIG 538

Query: 792  ---NEGSSSG---PSVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATY 631
                +GS S     S E  E  E +VPADESQC CILCGE FEDFYS +RDEWMYK A Y
Sbjct: 539  LLHLQGSPSDMLEASGEMLETCEQMVPADESQCACILCGELFEDFYSQERDEWMYKAAVY 598

Query: 630  MSLPAVDG--DIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIKLE 496
            +++P+ +    I T++  A LGPIVHANC S  S+ DLGL  ++KLE
Sbjct: 599  LTIPSSESVERIATSNDSAILGPIVHANCVSKDSIHDLGLVSDVKLE 645


>ref|XP_003632752.1| PREDICTED: uncharacterized protein C4G9.04c-like [Vitis vinifera]
          Length = 244

 Score =  209 bits (533), Expect = 2e-51
 Identities = 107/203 (52%), Positives = 141/203 (69%), Gaps = 16/203 (7%)
 Frame = -2

Query: 1053 AAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQ 874
            AAK +  V++   VE KNL+G EFK +IIRESHPSVIS+LFDDLPH+CSICG RLK +E+
Sbjct: 44   AAKASVAVTQSTSVEVKNLIGFEFKSDIIRESHPSVISELFDDLPHQCSICGLRLKLRER 103

Query: 873  LDLHLEWHASK--------TLSRRWYPSLGVWVAGNEG--------SSSGPSVETAEKSE 742
            LD HLEWHA K          SR W+ + G W+A   G        S +G S +  E SE
Sbjct: 104  LDRHLEWHALKKSEPNGLNRASRSWFVNSGEWIAEVAGFPTEAKSTSPAGESGKPLETSE 163

Query: 741  PVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIV 562
             +VPADE+QCVC+LCGE FEDFYS + D+WM++GA  M++P+  G++GT     + GPIV
Sbjct: 164  QMVPADENQCVCVLCGEVFEDFYSQEMDKWMFRGAVKMTVPSQGGELGT----KNQGPIV 219

Query: 561  HANCASPTSVSDLGLSKNIKLEQ 493
            HA+C + +SV DLGL+ +IK+E+
Sbjct: 220  HADCITESSVHDLGLACDIKVEK 242


>ref|XP_006381311.1| hypothetical protein POPTR_0006s11660g [Populus trichocarpa]
            gi|550336013|gb|ERP59108.1| hypothetical protein
            POPTR_0006s11660g [Populus trichocarpa]
          Length = 908

 Score =  205 bits (521), Expect = 4e-50
 Identities = 116/273 (42%), Positives = 158/273 (57%), Gaps = 16/273 (5%)
 Frame = -2

Query: 1263 PNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXX 1084
            PNP+S+LLS+LVAKGLIS+   E  +    QV S+L K+                     
Sbjct: 632  PNPISNLLSSLVAKGLISTSKSETSSPLPTQVPSQLQKKNPSITSPSSEPISSATLHSST 691

Query: 1083 SGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSI 904
             G   + +    K +  +S+  KVE  +L+G+EFKPE+IRE HP VIS LF+DLPH+CS+
Sbjct: 692  VGEASIPEPDT-KCSVALSQTTKVEIDDLIGLEFKPEVIRELHPPVISSLFEDLPHRCSL 750

Query: 903  CGHRLKFQEQLDLHLEWHASKT--------LSRRWYPSLGVWVAGNEG-----SSSGPS- 766
            CG +LK +E+L  HLEWH  +          +R WY  LG W+  N+G      SS P  
Sbjct: 751  CGLQLKLKERLHRHLEWHNQRKPESDGINGPTRGWYADLGHWLTVNDGLPLGVESSCPMD 810

Query: 765  --VETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTT 592
               ET E  +  V A E  CVC+LCG+ FED+Y  +R++WM+KGA  M+LP+ DG +GT 
Sbjct: 811  DFEETTECDDKTVLAHEDHCVCVLCGKLFEDYYCEERNKWMFKGAVRMTLPSGDGQMGTA 870

Query: 591  DGCASLGPIVHANCASPTSVSDLGLSKNIKLEQ 493
               A  GP VH NC S +S+ DL L+  IK+E+
Sbjct: 871  KESAK-GPTVHVNCISESSLCDLVLASGIKMEK 902


>ref|XP_006341164.1| PREDICTED: uncharacterized protein LOC102593629 [Solanum tuberosum]
          Length = 1046

 Score =  204 bits (519), Expect = 7e-50
 Identities = 117/268 (43%), Positives = 154/268 (57%), Gaps = 19/268 (7%)
 Frame = -2

Query: 1260 NPLSSLLSTLVAKGLISSPSKEMPTLTS----PQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            +PLSS+LSTLVAKGLIS+  K+ P  T     PQ  + +P                    
Sbjct: 789  SPLSSILSTLVAKGLISASKKDPPIYTPSDTPPQTQNLIPPASSISTPALSAPISASVPS 848

Query: 1092 XXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHK 913
                 ++L     +AK    + +    E K+L+G+ FKP++IR SHP+VISDL DD+PH+
Sbjct: 849  SAPK-DELSHSKPSAKTLEVLLQSTNEEAKSLIGLVFKPDVIRNSHPAVISDLLDDVPHQ 907

Query: 912  CSICGHRLKFQEQLDLHLEWHASK-------TLSRRWYPSLGVWVAGNEG-----SSSGP 769
            C ICG  LK QE+LD HLEWH+ +         SR+WY + G W+A   G      S GP
Sbjct: 908  CGICGFGLKLQEKLDRHLEWHSLRNPDVKLLNNSRKWYLNSGEWIAAFGGLPCGDKSKGP 967

Query: 768  ---SVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIG 598
               S ET+E +E +VPADE QCVC+LCGE FEDFY+ + DEWM+K A YMS+P       
Sbjct: 968  AGGSSETSECTETMVPADECQCVCVLCGEFFEDFYNEESDEWMFKDAVYMSIP------- 1020

Query: 597  TTDGCASLGPIVHANCASPTSVSDLGLS 514
            +   C   GPIVH NC S +S  +LGL+
Sbjct: 1021 SESDCQ--GPIVHKNCISESSCQELGLA 1046


>gb|EXB37772.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Morus notabilis]
          Length = 1101

 Score =  203 bits (516), Expect = 1e-49
 Identities = 119/298 (39%), Positives = 161/298 (54%), Gaps = 40/298 (13%)
 Frame = -2

Query: 1275 ASAVPNPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXX 1096
            A+  P+P+S+LLS+LVAKGLIS+  KE P    P V +   K+                 
Sbjct: 799  ANNTPDPISNLLSSLVAKGLISASKKESPQAIPPVVPTETQKKSPSITGTGSVPVSLVSG 858

Query: 1095 XXXXSGND----------------LLFKGSAAKITSTV---------SKPMKVERKNLLG 991
                S  D                   K +  +I + +         +K   +E KNL+G
Sbjct: 859  STVSSTRDDSSISEPTADSPVSLPESTKSTNLEIKNLIGFDFKPDESTKSTNLEIKNLIG 918

Query: 990  IEFKPEIIRESHPSVISDLFDDLPHKCSICGHRLKFQEQLDLHLEWHASKTL-------- 835
             +FKP+++RE HPSV+SDL D   H+C++CG +LK +E+L  HLEWH +K L        
Sbjct: 919  FDFKPDVVREFHPSVVSDLLDGFEHQCNMCGLQLKLKERLTRHLEWHNTKKLDANGPTKA 978

Query: 834  SRRWYPSLGVWVAGNEGSSSG----PSVET---AEKSEPVVPADESQCVCILCGEPFEDF 676
            SR WY +   W+ G  G SSG     SV+     +K E +V ADESQCVC+LCGE FEDF
Sbjct: 979  SRMWYANPSDWINGVAGFSSGLESAKSVDKPGKTDKGESMVVADESQCVCVLCGEIFEDF 1038

Query: 675  YSHDRDEWMYKGATYMSLPAVDGDIGTTDGCASLGPIVHANCASPTSVSDLGLSKNIK 502
            Y  +RDEWM+KGA +M +P+  G+ G+    +  GPIVHANC S  S+ DLGL   IK
Sbjct: 1039 YCQERDEWMFKGAMHMIIPSATGETGSNGEGSRKGPIVHANCISECSLQDLGLVSRIK 1096


>ref|XP_002528590.1| conserved hypothetical protein [Ricinus communis]
            gi|223531986|gb|EEF33798.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1123

 Score =  202 bits (514), Expect = 3e-49
 Identities = 116/273 (42%), Positives = 156/273 (57%), Gaps = 19/273 (6%)
 Frame = -2

Query: 1260 NPLSSLLSTLVAKGLISSPSKEMPTLTSPQVA----SRLPKQXXXXXXXXXXXXXXXXXX 1093
            NP+S+LLS+LVAKGLIS+   E  +   P+      S+ P                    
Sbjct: 815  NPISNLLSSLVAKGLISASKSETSSPLPPESPTPSQSQNPTITNSSSKPASSVPASSATS 874

Query: 1092 XXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHK 913
               + ++  F     K ++ V +P   E ++L+G+EFK ++IRESHP VI  LFDD PH+
Sbjct: 875  LSSTKDEASFPKPDVKSSAAVPQPTAPEIESLIGLEFKSDVIRESHPHVIGALFDDFPHQ 934

Query: 912  CSICGHRLKFQEQLDLHLEWHA-------SKTLSRRWYPSLGVWVAGNE----GSSSGPS 766
            CSICG +LK +E+LD HLEWH             RRWY  LG WVAG      G  S  S
Sbjct: 935  CSICGLQLKLKERLDRHLEWHIWSKPEPDGLNRVRRWYADLGNWVAGKAEIPFGIESSVS 994

Query: 765  VE----TAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIG 598
            ++    T ++ EP+V ADE+QCVC+LCGE FED+YS  R +WM+K A +++L    GDIG
Sbjct: 995  MDEFGRTVDEDEPMVLADENQCVCVLCGELFEDYYSQQRKKWMFKAAMHLTLSLKGGDIG 1054

Query: 597  TTDGCASLGPIVHANCASPTSVSDLGLSKNIKL 499
            T +   S GPIVH NC S +SV DL L+   K+
Sbjct: 1055 TANE-NSKGPIVHVNCMSESSVHDLELTSGTKM 1086


>ref|XP_007027622.1| ENTH/VHS family protein, putative isoform 3 [Theobroma cacao]
            gi|508716227|gb|EOY08124.1| ENTH/VHS family protein,
            putative isoform 3 [Theobroma cacao]
          Length = 1091

 Score =  199 bits (507), Expect = 2e-48
 Identities = 100/209 (47%), Positives = 134/209 (64%), Gaps = 15/209 (7%)
 Frame = -2

Query: 1077 NDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICG 898
            +++ F   A K +  + +   +E +NL+G+EF+P++IRE H SVIS L DDLPH CS+CG
Sbjct: 880  DEVSFAEPATKSSVALHQSAAMEEENLIGLEFRPDVIREFHSSVISKLLDDLPHCCSLCG 939

Query: 897  HRLKFQEQLDLHLEWHASKTLS--------RRWYPSLGVWVAGNEGSSSGPSV------- 763
             RLK QE+LD HLE HA K           R WY     W+ G  G  +  S        
Sbjct: 940  LRLKLQERLDRHLECHAMKKTESEGSNRALRGWYARSDDWIGGKPGQFAFESTGSVNQLE 999

Query: 762  ETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGC 583
            +T  KSE +VPADE+Q  C+LCGE FED++   R EWM+KGA Y+++P+ DG++GTT+G 
Sbjct: 1000 KTTAKSELMVPADENQYACMLCGELFEDYFCQIRGEWMFKGAVYLTIPSKDGEVGTTNGS 1059

Query: 582  ASLGPIVHANCASPTSVSDLGLSKNIKLE 496
            A  GPIVHANC S +SV DLGL+  +KLE
Sbjct: 1060 AGNGPIVHANCISESSVHDLGLAGGVKLE 1088


>ref|XP_007027621.1| ENTH/VHS family protein, putative isoform 2 [Theobroma cacao]
            gi|508716226|gb|EOY08123.1| ENTH/VHS family protein,
            putative isoform 2 [Theobroma cacao]
          Length = 1091

 Score =  199 bits (507), Expect = 2e-48
 Identities = 100/209 (47%), Positives = 134/209 (64%), Gaps = 15/209 (7%)
 Frame = -2

Query: 1077 NDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICG 898
            +++ F   A K +  + +   +E +NL+G+EF+P++IRE H SVIS L DDLPH CS+CG
Sbjct: 880  DEVSFAEPATKSSVALHQSAAMEEENLIGLEFRPDVIREFHSSVISKLLDDLPHCCSLCG 939

Query: 897  HRLKFQEQLDLHLEWHASKTLS--------RRWYPSLGVWVAGNEGSSSGPSV------- 763
             RLK QE+LD HLE HA K           R WY     W+ G  G  +  S        
Sbjct: 940  LRLKLQERLDRHLECHAMKKTESEGSNRALRGWYARSDDWIGGKPGQFAFESTGSVNQLE 999

Query: 762  ETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGC 583
            +T  KSE +VPADE+Q  C+LCGE FED++   R EWM+KGA Y+++P+ DG++GTT+G 
Sbjct: 1000 KTTAKSELMVPADENQYACMLCGELFEDYFCQIRGEWMFKGAVYLTIPSKDGEVGTTNGS 1059

Query: 582  ASLGPIVHANCASPTSVSDLGLSKNIKLE 496
            A  GPIVHANC S +SV DLGL+  +KLE
Sbjct: 1060 AGNGPIVHANCISESSVHDLGLAGGVKLE 1088


>ref|XP_007027620.1| ENTH/VHS family protein, putative isoform 1 [Theobroma cacao]
            gi|508716225|gb|EOY08122.1| ENTH/VHS family protein,
            putative isoform 1 [Theobroma cacao]
          Length = 1125

 Score =  199 bits (507), Expect = 2e-48
 Identities = 100/209 (47%), Positives = 134/209 (64%), Gaps = 15/209 (7%)
 Frame = -2

Query: 1077 NDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSICG 898
            +++ F   A K +  + +   +E +NL+G+EF+P++IRE H SVIS L DDLPH CS+CG
Sbjct: 914  DEVSFAEPATKSSVALHQSAAMEEENLIGLEFRPDVIREFHSSVISKLLDDLPHCCSLCG 973

Query: 897  HRLKFQEQLDLHLEWHASKTLS--------RRWYPSLGVWVAGNEGSSSGPSV------- 763
             RLK QE+LD HLE HA K           R WY     W+ G  G  +  S        
Sbjct: 974  LRLKLQERLDRHLECHAMKKTESEGSNRALRGWYARSDDWIGGKPGQFAFESTGSVNQLE 1033

Query: 762  ETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIGTTDGC 583
            +T  KSE +VPADE+Q  C+LCGE FED++   R EWM+KGA Y+++P+ DG++GTT+G 
Sbjct: 1034 KTTAKSELMVPADENQYACMLCGELFEDYFCQIRGEWMFKGAVYLTIPSKDGEVGTTNGS 1093

Query: 582  ASLGPIVHANCASPTSVSDLGLSKNIKLE 496
            A  GPIVHANC S +SV DLGL+  +KLE
Sbjct: 1094 AGNGPIVHANCISESSVHDLGLAGGVKLE 1122


>ref|XP_004246564.1| PREDICTED: uncharacterized protein LOC101244024 [Solanum
            lycopersicum]
          Length = 1040

 Score =  199 bits (505), Expect = 3e-48
 Identities = 115/268 (42%), Positives = 153/268 (57%), Gaps = 19/268 (7%)
 Frame = -2

Query: 1260 NPLSSLLSTLVAKGLISSPSKEMPTLTS----PQVASRLPKQXXXXXXXXXXXXXXXXXX 1093
            +PLSS+LSTLVAKGLIS+  K+ P  T     PQ  + +P                    
Sbjct: 783  SPLSSILSTLVAKGLISASKKDPPIYTPSDTPPQTQNLIPPASSISTPALSAPTSSSVPS 842

Query: 1092 XXXSGNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHK 913
                 ++L     +A+    + + MK E K+L+G+ FKP++IR SHP+VISDL DD+P +
Sbjct: 843  SAHK-DELSHSKPSAETPEVLLQSMKEEAKSLIGLVFKPDVIRNSHPAVISDLVDDVPLQ 901

Query: 912  CSICGHRLKFQEQLDLHLEWHASK-------TLSRRWYPSLGVWVAGNEG-----SSSGP 769
            C ICG   KFQ +LD HLEWH+ +         SR+WY + G W+A   G      S GP
Sbjct: 902  CGICGFGFKFQVKLDRHLEWHSLRNPDVKLLNNSRKWYLNSGEWIAAFGGLPCGDKSEGP 961

Query: 768  ---SVETAEKSEPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIG 598
               S ET+E +E +VPADE QCVC+LCGE FEDFY+ + DEWM+K A YMS+P       
Sbjct: 962  AGGSSETSECTETMVPADECQCVCVLCGEFFEDFYNEESDEWMFKDAVYMSIP------- 1014

Query: 597  TTDGCASLGPIVHANCASPTSVSDLGLS 514
            +   C   GPIVH NC S +S  +LG +
Sbjct: 1015 SESDCQ--GPIVHKNCISESSCQELGFA 1040


>ref|XP_003625749.1| Pre-mRNA cleavage complex 2 protein Pcf11 [Medicago truncatula]
            gi|355500764|gb|AES81967.1| Pre-mRNA cleavage complex 2
            protein Pcf11 [Medicago truncatula]
          Length = 1039

 Score =  182 bits (462), Expect = 3e-43
 Identities = 103/273 (37%), Positives = 150/273 (54%), Gaps = 19/273 (6%)
 Frame = -2

Query: 1260 NPLSSLLSTLVAKGLISSPSKEMPTLTSPQVASRLPKQXXXXXXXXXXXXXXXXXXXXXS 1081
            NP+S+LLS+LVAKGLIS+ ++   T+ S  V     +                       
Sbjct: 770  NPISNLLSSLVAKGLISAGTESATTVRSETVMRSKDQTESIAVSSSLPVASVPVSSAVPV 829

Query: 1080 GNDLLFKGSAAKITSTVSKPMKVERKNLLGIEFKPEIIRESHPSVISDLFDDLPHKCSIC 901
             +  +    AAK +  +S+    E +NL+G +FKP++IRE HP VI +L D+LPH C  C
Sbjct: 830  KSSRIEADDAAKASLALSQSTSTEIRNLIGFDFKPDVIREMHPHVIEELLDELPHHCGDC 889

Query: 900  GHRLKFQEQLDLHLEWHASK--------TLSRRWYPSLGVWVAGN----EGSSSGPSVET 757
            G RLK QEQ + HLEWHA+K          SRRWY +   W+A        S    SV+ 
Sbjct: 890  GIRLKQQEQFNRHLEWHATKEREQNGLTVASRRWYVTSDDWIASKAECLSESEFTDSVDE 949

Query: 756  AEKS-------EPVVPADESQCVCILCGEPFEDFYSHDRDEWMYKGATYMSLPAVDGDIG 598
             + +       + +V ADE+QC+C+LCGE FED Y  +RDEWM+KGA Y++ P  D ++ 
Sbjct: 950  YDDNKTDGSQLDTMVVADENQCLCVLCGELFEDVYCQERDEWMFKGAVYLNNPDSDSEME 1009

Query: 597  TTDGCASLGPIVHANCASPTSVSDLGLSKNIKL 499
            +     ++GPI+HA C S  S+  LG++  ++L
Sbjct: 1010 S----RNVGPIIHARCLSDNSI--LGVTNTVRL 1036

