BLASTX nr result

ID: Forsythia21_contig00022238 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00022238
         (1904 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011074351.1| PREDICTED: polyadenylation and cleavage fact...   502   e-139
ref|XP_011074350.1| PREDICTED: polyadenylation and cleavage fact...   502   e-139
ref|XP_011074352.1| PREDICTED: polyadenylation and cleavage fact...   493   e-136
ref|XP_010655357.1| PREDICTED: polyadenylation and cleavage fact...   431   e-117
ref|XP_002518518.1| conserved hypothetical protein [Ricinus comm...   407   e-110
ref|XP_007026009.1| PCF11P-similar protein 4, putative isoform 2...   406   e-110
ref|XP_007026008.1| PCF11P-similar protein 4, putative isoform 1...   406   e-110
ref|XP_012091393.1| PREDICTED: polyadenylation and cleavage fact...   401   e-108
ref|XP_011037705.1| PREDICTED: polyadenylation and cleavage fact...   397   e-107
gb|EYU36382.1| hypothetical protein MIMGU_mgv1a0020322mg, partia...   397   e-107
ref|XP_011000684.1| PREDICTED: polyadenylation and cleavage fact...   396   e-107
ref|XP_012838214.1| PREDICTED: polyadenylation and cleavage fact...   394   e-106
ref|XP_009601448.1| PREDICTED: uncharacterized protein LOC104096...   393   e-106
gb|KJB67159.1| hypothetical protein B456_010G178200 [Gossypium r...   392   e-106
ref|XP_012450328.1| PREDICTED: polyadenylation and cleavage fact...   392   e-106
ref|XP_011037707.1| PREDICTED: polyadenylation and cleavage fact...   392   e-106
ref|XP_011037706.1| PREDICTED: polyadenylation and cleavage fact...   392   e-106
ref|XP_011037702.1| PREDICTED: polyadenylation and cleavage fact...   392   e-106
ref|XP_010275999.1| PREDICTED: uncharacterized protein LOC104610...   391   e-105
ref|XP_010275998.1| PREDICTED: uncharacterized protein LOC104610...   391   e-105

>ref|XP_011074351.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X2
            [Sesamum indicum]
          Length = 964

 Score =  502 bits (1292), Expect = e-139
 Identities = 285/524 (54%), Positives = 334/524 (63%), Gaps = 11/524 (2%)
 Frame = -1

Query: 1697 GLGVFNKIT----GLRTPTS-QITASSSARESWKFP-----DHLNXXXXXXXXXXXXXXS 1548
            G G  NKI      +  P+   I    S RES   P      HLN              +
Sbjct: 450  GRGSINKIVEVFPNVAGPSDLPIQIPPSFRESLILPHLQSQSHLNVKGGGSFSESRSSLT 509

Query: 1547 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQXX 1368
              E K  +I NF + DGK        ST SS +D+   +I +A   A T++W PAK Q  
Sbjct: 510  GGEQKLPLIDNFSNTDGKLGGPSSTASTFSSTYDTPISDIRTAHDAALTKAWRPAKFQTP 569

Query: 1367 XXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPGQ 1188
                       QM +RGQ+   ++ N++ DQGLNK+I+S+Q  G T +M Q  LP  P Q
Sbjct: 570  HMPSLSALPP-QMHIRGQYGMKTAPNIVADQGLNKTIYSEQHLGTTRNMPQVTLPLIPSQ 628

Query: 1187 RPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGS 1008
            RP L+P+NLQ +AQ +  Q    M+QG              S+  + P ++ Y A   G 
Sbjct: 629  RPSLIPINLQGTAQPSLAQS---MAQGA---GQLPSSVPAPSNTMVPPKSYGYLAHAQGP 682

Query: 1007 P-SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLA 831
            P   T+L NIV GVQSSLP+ NAPN S H               GT+Q++P  Q +G++A
Sbjct: 683  PIGTTSLSNIVPGVQSSLPVLNAPNMSFHVPGAALQPLPGVPLPGTSQALPSGQTVGRVA 742

Query: 830  PSPPAGGALSGLISSLVAQGLITLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTT 651
            P+PP GGALSGLISSLVAQGLI+LTKQDS+GVEFDQD LKVRHES ITALYADLPRQC T
Sbjct: 743  PNPPGGGALSGLISSLVAQGLISLTKQDSVGVEFDQDSLKVRHESTITALYADLPRQCKT 802

Query: 650  CGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLP 471
            CGLRFKSQEEHSKHMDWHV           KPSP WFVS+SMWL GAEALGTEAVPGFLP
Sbjct: 803  CGLRFKSQEEHSKHMDWHVNKNRTLKTRKTKPSPKWFVSVSMWLSGAEALGTEAVPGFLP 862

Query: 470  AENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGM 291
            AEN VEK + EEMAVPADE+Q  CALCGEPFDDFYSDEM+EWMYKGAVYM +PAGS  GM
Sbjct: 863  AENTVEKPEDEEMAVPADEDQNTCALCGEPFDDFYSDEMEEWMYKGAVYMYAPAGSIVGM 922

Query: 290  NRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            +RSQLGPIVHAKCRSDSH IP E+  KDE E TEEG QRKR+RS
Sbjct: 923  DRSQLGPIVHAKCRSDSHGIPPEE--KDERESTEEGSQRKRLRS 964


>ref|XP_011074350.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1
            [Sesamum indicum]
          Length = 967

 Score =  502 bits (1292), Expect = e-139
 Identities = 285/524 (54%), Positives = 334/524 (63%), Gaps = 11/524 (2%)
 Frame = -1

Query: 1697 GLGVFNKIT----GLRTPTS-QITASSSARESWKFP-----DHLNXXXXXXXXXXXXXXS 1548
            G G  NKI      +  P+   I    S RES   P      HLN              +
Sbjct: 453  GRGSINKIVEVFPNVAGPSDLPIQIPPSFRESLILPHLQSQSHLNVKGGGSFSESRSSLT 512

Query: 1547 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQXX 1368
              E K  +I NF + DGK        ST SS +D+   +I +A   A T++W PAK Q  
Sbjct: 513  GGEQKLPLIDNFSNTDGKLGGPSSTASTFSSTYDTPISDIRTAHDAALTKAWRPAKFQTP 572

Query: 1367 XXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPGQ 1188
                       QM +RGQ+   ++ N++ DQGLNK+I+S+Q  G T +M Q  LP  P Q
Sbjct: 573  HMPSLSALPP-QMHIRGQYGMKTAPNIVADQGLNKTIYSEQHLGTTRNMPQVTLPLIPSQ 631

Query: 1187 RPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGS 1008
            RP L+P+NLQ +AQ +  Q    M+QG              S+  + P ++ Y A   G 
Sbjct: 632  RPSLIPINLQGTAQPSLAQS---MAQGA---GQLPSSVPAPSNTMVPPKSYGYLAHAQGP 685

Query: 1007 P-SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLA 831
            P   T+L NIV GVQSSLP+ NAPN S H               GT+Q++P  Q +G++A
Sbjct: 686  PIGTTSLSNIVPGVQSSLPVLNAPNMSFHVPGAALQPLPGVPLPGTSQALPSGQTVGRVA 745

Query: 830  PSPPAGGALSGLISSLVAQGLITLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTT 651
            P+PP GGALSGLISSLVAQGLI+LTKQDS+GVEFDQD LKVRHES ITALYADLPRQC T
Sbjct: 746  PNPPGGGALSGLISSLVAQGLISLTKQDSVGVEFDQDSLKVRHESTITALYADLPRQCKT 805

Query: 650  CGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLP 471
            CGLRFKSQEEHSKHMDWHV           KPSP WFVS+SMWL GAEALGTEAVPGFLP
Sbjct: 806  CGLRFKSQEEHSKHMDWHVNKNRTLKTRKTKPSPKWFVSVSMWLSGAEALGTEAVPGFLP 865

Query: 470  AENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGM 291
            AEN VEK + EEMAVPADE+Q  CALCGEPFDDFYSDEM+EWMYKGAVYM +PAGS  GM
Sbjct: 866  AENTVEKPEDEEMAVPADEDQNTCALCGEPFDDFYSDEMEEWMYKGAVYMYAPAGSIVGM 925

Query: 290  NRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            +RSQLGPIVHAKCRSDSH IP E+  KDE E TEEG QRKR+RS
Sbjct: 926  DRSQLGPIVHAKCRSDSHGIPPEE--KDERESTEEGSQRKRLRS 967


>ref|XP_011074352.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X3
            [Sesamum indicum]
          Length = 940

 Score =  493 bits (1268), Expect = e-136
 Identities = 265/449 (59%), Positives = 309/449 (68%), Gaps = 1/449 (0%)
 Frame = -1

Query: 1502 DGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQXXXXXXXXXXXXSQMQM 1323
            DGK        ST SS +D+   +I +A   A T++W PAK Q             QM +
Sbjct: 501  DGKLGGPSSTASTFSSTYDTPISDIRTAHDAALTKAWRPAKFQTPHMPSLSALPP-QMHI 559

Query: 1322 RGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPGQRPGLVPLNLQSSAQA 1143
            RGQ+   ++ N++ DQGLNK+I+S+Q  G T +M Q  LP  P QRP L+P+NLQ +AQ 
Sbjct: 560  RGQYGMKTAPNIVADQGLNKTIYSEQHLGTTRNMPQVTLPLIPSQRPSLIPINLQGTAQP 619

Query: 1142 ARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSP-SDTALHNIVLGVQ 966
            +  Q    M+QG              S+  + P ++ Y A   G P   T+L NIV GVQ
Sbjct: 620  SLAQS---MAQGA---GQLPSSVPAPSNTMVPPKSYGYLAHAQGPPIGTTSLSNIVPGVQ 673

Query: 965  SSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPAGGALSGLISS 786
            SSLP+ NAPN S H               GT+Q++P  Q +G++AP+PP GGALSGLISS
Sbjct: 674  SSLPVLNAPNMSFHVPGAALQPLPGVPLPGTSQALPSGQTVGRVAPNPPGGGALSGLISS 733

Query: 785  LVAQGLITLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCGLRFKSQEEHSKHM 606
            LVAQGLI+LTKQDS+GVEFDQD LKVRHES ITALYADLPRQC TCGLRFKSQEEHSKHM
Sbjct: 734  LVAQGLISLTKQDSVGVEFDQDSLKVRHESTITALYADLPRQCKTCGLRFKSQEEHSKHM 793

Query: 605  DWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAENIVEKKKYEEMAV 426
            DWHV           KPSP WFVS+SMWL GAEALGTEAVPGFLPAEN VEK + EEMAV
Sbjct: 794  DWHVNKNRTLKTRKTKPSPKWFVSVSMWLSGAEALGTEAVPGFLPAENTVEKPEDEEMAV 853

Query: 425  PADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNRSQLGPIVHAKCRS 246
            PADE+Q  CALCGEPFDDFYSDEM+EWMYKGAVYM +PAGS  GM+RSQLGPIVHAKCRS
Sbjct: 854  PADEDQNTCALCGEPFDDFYSDEMEEWMYKGAVYMYAPAGSIVGMDRSQLGPIVHAKCRS 913

Query: 245  DSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            DSH IP E+  KDE E TEEG QRKR+RS
Sbjct: 914  DSHGIPPEE--KDERESTEEGSQRKRLRS 940


>ref|XP_010655357.1| PREDICTED: polyadenylation and cleavage factor homolog 4 [Vitis
            vinifera]
          Length = 1046

 Score =  431 bits (1107), Expect = e-117
 Identities = 237/467 (50%), Positives = 289/467 (61%), Gaps = 5/467 (1%)
 Frame = -1

Query: 1547 ACEVKPTIIGNFPSADGKFCRRPDVVSTI-SSMFDSLSPEIPSADAPASTESWLPAKLQX 1371
            A E    +I N P AD +  R P V S + SS  +S++ E+ SA APAST  W P  +  
Sbjct: 588  AAETISPLISNIPDADAQLRRLPTVASRMGSSSLNSMNVEVQSAAAPASTGMWPPVNVHK 647

Query: 1370 XXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPG 1191
                          Q+R QFN M++   +V+Q  NKS+   +          S+LPQ   
Sbjct: 648  THLPPLLSNLPQTKQIRNQFNLMNATTAVVNQDPNKSLFLPE--------LDSKLPQMAN 699

Query: 1190 QRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHG 1011
            ++ G +PLN ++  Q  R+QP  L  Q    N          S++   PLN  Y  Q H 
Sbjct: 700  RQAGSIPLNGKNQTQVTRLQPQFL-PQETHGNFVPSTTAPVSSYSVAPPLNPGYTPQGHA 758

Query: 1010 SPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLA 831
            + + T L N V GV SS+PI+N  N+S+H                T+Q I IPQN G + 
Sbjct: 759  AATSTILLNPVPGVHSSIPIHNISNSSVHFQGGALPPLPPGPPPATSQMINIPQNTGPIV 818

Query: 830  PSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALYADLPR 663
             +   G ALSGLISSL+AQGLI+L KQ    DS+G+EF+ DLLKVRHESAI+ALY D+ R
Sbjct: 819  SNQQPGSALSGLISSLMAQGLISLAKQPTVQDSVGIEFNVDLLKVRHESAISALYGDMSR 878

Query: 662  QCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVP 483
            QCTTCGLRFK QEEHS HMDWHV           KPS  WFVS SMWL  AEALGT+AVP
Sbjct: 879  QCTTCGLRFKCQEEHSSHMDWHVTKNRISKNRKQKPSRKWFVSASMWLSSAEALGTDAVP 938

Query: 482  GFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGS 303
            GFLP E I EKK  EE+AVPADE+Q  CALCGEPFDDFYSDE +EWMYKGAVY+N+P GS
Sbjct: 939  GFLPTETIAEKKDDEELAVPADEDQNVCALCGEPFDDFYSDETEEWMYKGAVYLNAPEGS 998

Query: 302  FEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMR 162
              GM+RSQLGPIVHAKCRS+S+V+  EDF +DEG   EEG +RKRMR
Sbjct: 999  AAGMDRSQLGPIVHAKCRSESNVVSPEDFGQDEGGNMEEGSKRKRMR 1045


>ref|XP_002518518.1| conserved hypothetical protein [Ricinus communis]
            gi|223542363|gb|EEF43905.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1023

 Score =  407 bits (1046), Expect = e-110
 Identities = 241/531 (45%), Positives = 290/531 (54%), Gaps = 18/531 (3%)
 Frame = -1

Query: 1697 GLGVFNKITGLRTPTSQITASSSARESWKFPDHL------------NXXXXXXXXXXXXX 1554
            G G   K++G +T  +Q   S   RE+WK P H             N             
Sbjct: 517  GRGSGGKLSGFQTDRNQTMGSRYPREAWKSPHHFSQSADLINAKGRNRDLQMPFSGSGIS 576

Query: 1553 XSACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQ 1374
             S  E+  +++   P AD +  R P + S +SS           + A +ST  W    + 
Sbjct: 577  SSGSEILASLVDQLPDADAQIIRPPTLPSRMSS-----------STALSSTGVWPLVNVH 625

Query: 1373 XXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIH-SKQRFGGTNSMAQS--ELP 1203
                         QMQ R   +P ++ N  V+QG  KS   S+Q+  G  S   S  + P
Sbjct: 626  KSHQPPLRPIFPPQMQSRSLLDPRNASNTAVNQGFQKSSFLSEQQLNGLESKEHSLTKQP 685

Query: 1202 QFPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAA 1023
              P Q   +   N Q+  Q    QP        Q             H      +HRY  
Sbjct: 686  LLPSQHAAM---NQQNQGQVNPFQP--------QRENFPPSVASLPPHPLAPTFDHRYVT 734

Query: 1022 QWHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNI 843
            Q HGS       N+V  +   LP+NN PNT                    +  IPIPQN 
Sbjct: 735  QAHGSAMSRIHSNLVSSMPLPLPVNNIPNTM--HLQVGVRPPLPPGPPPASHMIPIPQNA 792

Query: 842  GQLAPSPPAGGALSGLISSLVAQGLITLTK---QDSLGVEFDQDLLKVRHESAITALYAD 672
            G +A + PAGGA SGLI+SLVAQGLI+L +   QDS+G+EF+ DLLKVRHESAI+ALYAD
Sbjct: 793  GPVASNQPAGGAFSGLINSLVAQGLISLKQTPVQDSVGLEFNADLLKVRHESAISALYAD 852

Query: 671  LPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 492
            LPRQCTTCGLRFK QE+HS HMDWHV           KPS  WFVS +MWL GAEALGT+
Sbjct: 853  LPRQCTTCGLRFKCQEDHSSHMDWHVTRNRMSKNRKQKPSRKWFVSATMWLRGAEALGTD 912

Query: 491  AVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 312
            AVPGFLP E +VEKK  EEMAVPADE Q ACALCGEPFDDFYSDE +EWMYKGAVY+N+P
Sbjct: 913  AVPGFLPTEAVVEKKDDEEMAVPADEEQNACALCGEPFDDFYSDETEEWMYKGAVYLNAP 972

Query: 311  AGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            +GS   M+RSQLGPIVHAKCRS+S V P ED   +EG  TEE  QRKRMRS
Sbjct: 973  SGSTASMDRSQLGPIVHAKCRSESSVAPPEDIRSNEGPDTEEASQRKRMRS 1023


>ref|XP_007026009.1| PCF11P-similar protein 4, putative isoform 2 [Theobroma cacao]
            gi|508781375|gb|EOY28631.1| PCF11P-similar protein 4,
            putative isoform 2 [Theobroma cacao]
          Length = 733

 Score =  406 bits (1043), Expect = e-110
 Identities = 225/463 (48%), Positives = 279/463 (60%), Gaps = 7/463 (1%)
 Frame = -1

Query: 1526 IIGNFPSADGKFCRRPDVVS-TISSMFDSLSPEIPSADAPASTESWLPAKLQXXXXXXXX 1350
            +I   P    +F R P VV  T SS  DS++     A  P++T  W P  +         
Sbjct: 275  LIDKLPDGGSQFLRPPAVVPRTGSSSLDSVTVGARPAIIPSTTGVWPPVNVHKSQPPAMH 334

Query: 1349 XXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQS--ELPQFPGQRPGL 1176
                 Q   R QF+ ++  N+++++G NK  +  ++F    S  QS   +PQ P QR  L
Sbjct: 335  SNYSLQQHSRSQFDSINPINMVMNEGPNKRSYMAEQFDRFESKEQSLTRVPQLPDQRAAL 394

Query: 1175 VPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSPSDT 996
               + ++  Q   +QPH L SQ ++ N                 LNH Y  Q HG+    
Sbjct: 395  ---HQRNQMQVTSLQPHFLPSQDLRENFLSSATAPLPPRLLAPSLNHGYTPQMHGAVISM 451

Query: 995  ALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPA 816
               N +   Q  LPI N P  S+                  +Q IP  QN G L P+   
Sbjct: 452  VPSNPIHVAQPPLPIPNMPTVSLQLQGGALPPLPPGPPP-ASQMIPATQNAGPLLPNQAQ 510

Query: 815  GGALSGLISSLVAQGLITLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTC 648
             G  SGLISSL+AQGLI+LTK    QD +G+EF+ DLLKVRHES+I+ALYADLPRQCTTC
Sbjct: 511  SGPYSGLISSLMAQGLISLTKPTPIQDPVGLEFNADLLKVRHESSISALYADLPRQCTTC 570

Query: 647  GLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPA 468
            GLRFK QEEHS HMDWHV           KPS  WFVS SMWL GAEALGT+AVPGFLP 
Sbjct: 571  GLRFKFQEEHSTHMDWHVTRNRMSKNRKQKPSRKWFVSASMWLSGAEALGTDAVPGFLPT 630

Query: 467  ENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMN 288
            EN+VEKK  EE+AVPADE+Q+ CALCGEPFDDFYSDE +EWMY+GAVYMN+P GS EGM+
Sbjct: 631  ENVVEKKDDEELAVPADEDQSVCALCGEPFDDFYSDETEEWMYRGAVYMNAPNGSIEGMD 690

Query: 287  RSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            RSQLGPIVHAKCRS+S V+P+EDF + +G  +E+  QRKR+RS
Sbjct: 691  RSQLGPIVHAKCRSESSVVPSEDFVRCDGGNSEDSSQRKRLRS 733


>ref|XP_007026008.1| PCF11P-similar protein 4, putative isoform 1 [Theobroma cacao]
            gi|508781374|gb|EOY28630.1| PCF11P-similar protein 4,
            putative isoform 1 [Theobroma cacao]
          Length = 1004

 Score =  406 bits (1043), Expect = e-110
 Identities = 225/463 (48%), Positives = 279/463 (60%), Gaps = 7/463 (1%)
 Frame = -1

Query: 1526 IIGNFPSADGKFCRRPDVVS-TISSMFDSLSPEIPSADAPASTESWLPAKLQXXXXXXXX 1350
            +I   P    +F R P VV  T SS  DS++     A  P++T  W P  +         
Sbjct: 546  LIDKLPDGGSQFLRPPAVVPRTGSSSLDSVTVGARPAIIPSTTGVWPPVNVHKSQPPAMH 605

Query: 1349 XXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQS--ELPQFPGQRPGL 1176
                 Q   R QF+ ++  N+++++G NK  +  ++F    S  QS   +PQ P QR  L
Sbjct: 606  SNYSLQQHSRSQFDSINPINMVMNEGPNKRSYMAEQFDRFESKEQSLTRVPQLPDQRAAL 665

Query: 1175 VPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSPSDT 996
               + ++  Q   +QPH L SQ ++ N                 LNH Y  Q HG+    
Sbjct: 666  ---HQRNQMQVTSLQPHFLPSQDLRENFLSSATAPLPPRLLAPSLNHGYTPQMHGAVISM 722

Query: 995  ALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPA 816
               N +   Q  LPI N P  S+                  +Q IP  QN G L P+   
Sbjct: 723  VPSNPIHVAQPPLPIPNMPTVSLQLQGGALPPLPPGPPP-ASQMIPATQNAGPLLPNQAQ 781

Query: 815  GGALSGLISSLVAQGLITLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTC 648
             G  SGLISSL+AQGLI+LTK    QD +G+EF+ DLLKVRHES+I+ALYADLPRQCTTC
Sbjct: 782  SGPYSGLISSLMAQGLISLTKPTPIQDPVGLEFNADLLKVRHESSISALYADLPRQCTTC 841

Query: 647  GLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPA 468
            GLRFK QEEHS HMDWHV           KPS  WFVS SMWL GAEALGT+AVPGFLP 
Sbjct: 842  GLRFKFQEEHSTHMDWHVTRNRMSKNRKQKPSRKWFVSASMWLSGAEALGTDAVPGFLPT 901

Query: 467  ENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMN 288
            EN+VEKK  EE+AVPADE+Q+ CALCGEPFDDFYSDE +EWMY+GAVYMN+P GS EGM+
Sbjct: 902  ENVVEKKDDEELAVPADEDQSVCALCGEPFDDFYSDETEEWMYRGAVYMNAPNGSIEGMD 961

Query: 287  RSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            RSQLGPIVHAKCRS+S V+P+EDF + +G  +E+  QRKR+RS
Sbjct: 962  RSQLGPIVHAKCRSESSVVPSEDFVRCDGGNSEDSSQRKRLRS 1004


>ref|XP_012091393.1| PREDICTED: polyadenylation and cleavage factor homolog 4 [Jatropha
            curcas] gi|643703717|gb|KDP20781.1| hypothetical protein
            JCGZ_21252 [Jatropha curcas]
          Length = 1029

 Score =  401 bits (1030), Expect = e-108
 Identities = 242/533 (45%), Positives = 294/533 (55%), Gaps = 19/533 (3%)
 Frame = -1

Query: 1700 SGLGVFNKITGLRTPTSQITASSSARESWKFPDHL-----------NXXXXXXXXXXXXX 1554
            SG G   K+ G +   +QI AS   RE+WK  +H            N             
Sbjct: 517  SGRGSTAKLPGFQPERNQIMASHYPREAWKLLNHYPQSTDLNAKGRNREFRMPFSRSVIS 576

Query: 1553 XSACEVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQ 1374
             S  +    ++   P  DG++ R P + S + S             AP++   W    + 
Sbjct: 577  SSVSDSLAPLVDKLPDTDGQYVRPPTLPSRVGSSI-----------APSTAGVWPLVNVH 625

Query: 1373 XXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKS-IHSKQRFGGTNSMAQS--ELP 1203
                         Q Q R QF+  ++ N +V+QGL +S   S+Q+F G  SM  S  + P
Sbjct: 626  KSHPPPVHPIFPPQKQSRSQFDSTNARNTVVNQGLQQSTFSSEQQFNGFESMEPSLTKQP 685

Query: 1202 QFPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHAT-IQPLNHRYA 1026
              P +      LN Q+ AQ    QP  L S   + N           H T +  L+  +A
Sbjct: 686  LLPSRH---ATLNQQNQAQVNHFQPQFLPSNEARENFPLSISSLP--HQTRVSTLDPVHA 740

Query: 1025 AQWHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQN 846
             Q HG+       N V      LP+NN PNT                     Q I +PQN
Sbjct: 741  TQGHGAAMSMVRSNPV-PFMLPLPVNNIPNT---LQPHAGTRPPLPPGPHPAQMIHVPQN 796

Query: 845  IGQLAPSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALY 678
            +G +AP+ P G A SGLI SL+AQGLI+LTKQ    DS+G+EF+ DL+KVRHESAI+ALY
Sbjct: 797  VGPVAPNQPPGSAFSGLIGSLMAQGLISLTKQTPGQDSVGLEFNADLIKVRHESAISALY 856

Query: 677  ADLPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALG 498
            ADLPRQCTTCGLRFK QEEHS HMDWHV           KPS  WFV  SMWL GAEALG
Sbjct: 857  ADLPRQCTTCGLRFKCQEEHSSHMDWHVTKNRMSKNRKHKPSRKWFVDTSMWLSGAEALG 916

Query: 497  TEAVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMN 318
            T+AVPGFLP E++VEKK  EEMAVPADE Q ACALCGEPFDDFYSDE +EWMYKGAVYMN
Sbjct: 917  TDAVPGFLPTESVVEKKDDEEMAVPADEEQNACALCGEPFDDFYSDETEEWMYKGAVYMN 976

Query: 317  SPAGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            +P GS  GM RSQLGPIVHAKCRS+S V P EDF  D+G  +EE   RKR+RS
Sbjct: 977  APNGSTAGMERSQLGPIVHAKCRSESSVAPPEDFRCDDGGDSEETSHRKRLRS 1029


>ref|XP_011037705.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform
            X4 [Populus euphratica]
          Length = 1051

 Score =  397 bits (1019), Expect = e-107
 Identities = 232/531 (43%), Positives = 287/531 (54%), Gaps = 17/531 (3%)
 Frame = -1

Query: 1700 SGLGVFNKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1521
            SG G  +KI G RT  +QI  S   +E+W FP H++                  +  + +
Sbjct: 526  SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 585

Query: 1520 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1377
             +             P  D +  R P + S   S  DS S    S+  P S+  W P   
Sbjct: 586  SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 645

Query: 1376 QXXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSELPQ 1200
            +               Q R QF+P+++ + +++Q L K S   +Q F G  +   + +  
Sbjct: 646  RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 705

Query: 1199 FPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 1020
             P        LN Q+ A     QP  L S   + N               QPLNH Y   
Sbjct: 706  TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 764

Query: 1019 WHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 840
             H +       N +  VQ  LP+NN PN  +H                  Q++P  QN+ 
Sbjct: 765  GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 820

Query: 839  QLAPSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 672
               P  P+G A SGL +SL+AQGLI+LTKQ    DS+G+EF+ DLLK+R+ESAI+ALY D
Sbjct: 821  SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 880

Query: 671  LPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 492
            LPRQCTTCGLRFK QEEHS HMDWHV           K S +WFVS SMWL GAEALGT+
Sbjct: 881  LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 940

Query: 491  AVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 312
            A PGFLP E  VEKK    MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS 
Sbjct: 941  AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 1000

Query: 311  AGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
             GS  GM+RSQLGPIVHAKCRSDS V+P EDF  DEG  +EEG QRKRMRS
Sbjct: 1001 NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDEGVNSEEGNQRKRMRS 1051


>gb|EYU36382.1| hypothetical protein MIMGU_mgv1a0020322mg, partial [Erythranthe
            guttata]
          Length = 571

 Score =  397 bits (1019), Expect = e-107
 Identities = 230/464 (49%), Positives = 288/464 (62%), Gaps = 3/464 (0%)
 Frame = -1

Query: 1541 EVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPAS-TESWLPAKLQXXX 1365
            E+ P + GNF + DGKF R P         +DS +PEI SADA A  T++W P+K Q   
Sbjct: 160  ELNPALTGNFSNTDGKF-RLP---------YDSTAPEIQSADAAAPLTKAWHPSKFQNSH 209

Query: 1364 XXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPGQR 1185
                     SQMQ+RGQF      N  VDQ     +HS+Q+ G     +Q+ LP     R
Sbjct: 210  IRPSLSALPSQMQIRGQFGM----NNAVDQ-----LHSEQQLG----RSQANLPHISSIR 256

Query: 1184 PGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSP 1005
            PG VP NLQ +AQ     P+L +      +           +A++ P+N+RY       P
Sbjct: 257  PGPVPANLQHTAQ-----PNLYLPSPYSEHIPS--------NASVPPMNYRYFG-----P 298

Query: 1004 SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPS 825
            S T   N+V G  S            H               GT Q +PI  N  Q+A +
Sbjct: 299  SGTTSSNLVPGFPS-----------FHVPRPTLQSLPRGPFPGTAQPLPIGSNANQVAQN 347

Query: 824  PPAGGALSGLISSLVAQGLITLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCG 645
            P AG ALSGLI+SL+AQGLI+L+ QDS+GVEFD D+LKVRHESAIT+LYA+LPRQC TCG
Sbjct: 348  PSAGPALSGLINSLMAQGLISLSNQDSVGVEFDPDILKVRHESAITSLYAELPRQCKTCG 407

Query: 644  LRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAE 465
            LRFKSQEEHS HMDWHV           KPSP WFV+ +MWL G EA+GTEAVPGF+PAE
Sbjct: 408  LRFKSQEEHSSHMDWHVNKNRTLRNRKAKPSPKWFVNAAMWLSGTEAMGTEAVPGFMPAE 467

Query: 464  NIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNR 285
            N  EK++ EEMAVPADE+Q +CALCGEPF+D+YSD+++EWMYKGAVYM++P G+  GM+R
Sbjct: 468  NSAEKEEDEEMAVPADEDQNSCALCGEPFEDYYSDDLEEWMYKGAVYMHAPTGATVGMDR 527

Query: 284  SQLGPIVHAKCRSDSHVIPTEDFTKDEGEL--TEEGIQRKRMRS 159
            SQLGPIVHAKC SDSH + +E+  KDE      ++ +QRKR RS
Sbjct: 528  SQLGPIVHAKCMSDSHAVSSENNKKDEEVRFNLKKVVQRKRFRS 571


>ref|XP_011000684.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like
            [Populus euphratica]
          Length = 980

 Score =  396 bits (1018), Expect = e-107
 Identities = 232/532 (43%), Positives = 293/532 (55%), Gaps = 19/532 (3%)
 Frame = -1

Query: 1697 GLGVFNKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPT--- 1527
            G G  NK+ GL T  + I+ S  ++E+W FP H+                   +  +   
Sbjct: 464  GHGSTNKMPGLLTERNHISGSRYSQEAWNFPPHIRQPSHLLNAKGRGRDFQMPLSGSGVS 523

Query: 1526 ---------IIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQ 1374
                     ++   P  D +  R P + S   S  DS S    S+  P  + +W P  + 
Sbjct: 524  SMGGENFNPLVDKLPDMDAQLVRPPAIASRFGSSIDSNSSGTWSSAVPPISGAWPPVNVH 583

Query: 1373 XXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSM--AQSELP 1203
                         + Q RGQF+P+++ + + +Q L K S+  +Q F    S      +  
Sbjct: 584  KSLPPPVHSSFPPEKQGRGQFDPVNTNSTVTNQALQKASVMPEQSFNSFESKDYVLMKPT 643

Query: 1202 QFPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAA 1023
              P Q  GL   N Q+ A     QP  L S   + N               +P+NH Y  
Sbjct: 644  PLPNQHAGL---NQQNQAHFNPFQPKFLPSHEARENFHPSGIALLPPRRLARPMNHGYTT 700

Query: 1022 QWHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNI 843
              H S       N++  VQ  L ++N PNT +H                 +Q+IP PQN 
Sbjct: 701  HGHSSS------NVLPAVQLPLAVSNVPNT-LHSQVGVRPTLPQGP----SQTIPFPQNA 749

Query: 842  GQLAPSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALYA 675
               A + P+G A SGLI+SL+AQGLIT+TKQ    DS+G+EF+ DLLK+R+ESAI+ALY+
Sbjct: 750  SSGALAQPSGSAFSGLINSLMAQGLITMTKQTPLQDSVGLEFNADLLKLRYESAISALYS 809

Query: 674  DLPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGT 495
            DLPRQCTTCGLR K QEEHS HMDWHV            PS  WFVS SMWL GAEALGT
Sbjct: 810  DLPRQCTTCGLRLKCQEEHSSHMDWHVTKNRMSKNRKQNPSRKWFVSASMWLSGAEALGT 869

Query: 494  EAVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNS 315
            +AVPGFLP E IVEKK  +EMAVPADE Q+ CALCGEPFDDFYSDE +EWMYKGAVY+N+
Sbjct: 870  DAVPGFLPTETIVEKKDDDEMAVPADEEQSTCALCGEPFDDFYSDETEEWMYKGAVYLNA 929

Query: 314  PAGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
              GS   M+RSQLGPIVHAKCRSDS  +P+EDF  +EG  TEEG  RKRMRS
Sbjct: 930  SDGSTADMDRSQLGPIVHAKCRSDSSGVPSEDFGHEEGGNTEEG-SRKRMRS 980


>ref|XP_012838214.1| PREDICTED: polyadenylation and cleavage factor homolog 4 [Erythranthe
            guttatus]
          Length = 865

 Score =  394 bits (1011), Expect = e-106
 Identities = 225/451 (49%), Positives = 281/451 (62%), Gaps = 1/451 (0%)
 Frame = -1

Query: 1541 EVKPTIIGNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPAS-TESWLPAKLQXXX 1365
            E+ P + GNF + DGKF R P         +DS +PEI SADA A  T++W P+K Q   
Sbjct: 467  ELNPALTGNFSNTDGKF-RLP---------YDSTAPEIQSADAAAPLTKAWHPSKFQNSH 516

Query: 1364 XXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPGQR 1185
                     SQMQ+RGQF      N  VDQ     +HS+Q+ G     +Q+ LP     R
Sbjct: 517  IRPSLSALPSQMQIRGQFGM----NNAVDQ-----LHSEQQLG----RSQANLPHISSIR 563

Query: 1184 PGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSP 1005
            PG VP NLQ +AQ     P+L +      +           +A++ P+N+RY       P
Sbjct: 564  PGPVPANLQHTAQ-----PNLYLPSPYSEHIPS--------NASVPPMNYRYFG-----P 605

Query: 1004 SDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPS 825
            S T   N+V G  S            H               GT Q +PI  N  Q+A +
Sbjct: 606  SGTTSSNLVPGFPS-----------FHVPRPTLQSLPRGPFPGTAQPLPIGSNANQVAQN 654

Query: 824  PPAGGALSGLISSLVAQGLITLTKQDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCG 645
            P AG ALSGLI+SL+AQGLI+L+ QDS+GVEFD D+LKVRHESAIT+LYA+LPRQC TCG
Sbjct: 655  PSAGPALSGLINSLMAQGLISLSNQDSVGVEFDPDILKVRHESAITSLYAELPRQCKTCG 714

Query: 644  LRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAE 465
            LRFKSQEEHS HMDWHV           KPSP WFV+ +MWL G EA+GTEAVPGF+PAE
Sbjct: 715  LRFKSQEEHSSHMDWHVNKNRTLRNRKAKPSPKWFVNAAMWLSGTEAMGTEAVPGFMPAE 774

Query: 464  NIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNR 285
            N  EK++ EEMAVPADE+Q +CALCGEPF+D+YSD+++EWMYKGAVYM++P G+  GM+R
Sbjct: 775  NSAEKEEDEEMAVPADEDQNSCALCGEPFEDYYSDDLEEWMYKGAVYMHAPTGATVGMDR 834

Query: 284  SQLGPIVHAKCRSDSHVIPTEDFTKDEGELT 192
            SQLGPIVHAKC SDSH + +E+  KDE + T
Sbjct: 835  SQLGPIVHAKCMSDSHAVSSENNKKDEEDST 865


>ref|XP_009601448.1| PREDICTED: uncharacterized protein LOC104096744 isoform X3 [Nicotiana
            tomentosiformis]
          Length = 980

 Score =  393 bits (1010), Expect = e-106
 Identities = 229/519 (44%), Positives = 289/519 (55%), Gaps = 5/519 (0%)
 Frame = -1

Query: 1700 SGLGVFNKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1521
            SG G  NKITG    TS I+ S   +   K P+++                  E K  +I
Sbjct: 470  SGRGARNKITGYCDETSLISGSPYLQ---KLPENVPLLHQRHLKVEGSGIVTGEPKHPLI 526

Query: 1520 GNFPSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKLQXXXXXXXXXXX 1341
             N   ADG   R P +   ++  F+S   +I +    A    W P  +            
Sbjct: 527  SNLV-ADGHTWRPPYIPPRMNPTFESSVQDIRAVTGRAPIVPWPPTDVHNPQSLTSKPFV 585

Query: 1340 XSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQSELPQFPGQRPGLVPLNL 1161
                 +R  F   +  N + +  L+K +   Q+   + S +  + PQFP Q P     +L
Sbjct: 586  LPHQHIRSPFEVKNGSNSVANHNLDKPVLPGQQIDNSKSNSYIKFPQFPSQHPASFSASL 645

Query: 1160 QSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSPSDTALHNI 981
            Q+  Q A  +  LL SQ +             +H  + P+ + Y  Q  GS   T L   
Sbjct: 646  QNPEQVASAESQLLFSQRMHQTTVPSASLPASNHFLLPPI-YGYNPQGPGSSVGTLLPLP 704

Query: 980  VLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPAGGALS 801
            V G Q SLP+ N PNTS                  ++Q  P  QN+GQ+ P+PPAGG  S
Sbjct: 705  VSGPQVSLPLVNIPNTSSQFSSGALPPLPRGPLPMSSQFTPTSQNLGQVTPNPPAGG-FS 763

Query: 800  GLISSLVAQGLITLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTCGLRFK 633
             LISSL+AQGLI+LT     QDS+G++F+ DLLKVRH+SA+TALYADLPRQCTTCGLRFK
Sbjct: 764  SLISSLMAQGLISLTNEAPPQDSVGLDFNPDLLKVRHDSAVTALYADLPRQCTTCGLRFK 823

Query: 632  SQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPAENIVE 453
             QE HS HMDWHV           K S  WFVS++MW  G EALG++A PGFLPAE +VE
Sbjct: 824  CQEAHSSHMDWHVTKNRVSKNRKQKSSRKWFVSVNMWFSGTEALGSDAAPGFLPAEQVVE 883

Query: 452  KKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMNRSQLG 273
            KK  EE+AVPAD+ Q  CALCGEPFDDFYSDE +EWMYKGAVYMN+P+GS  GM +SQLG
Sbjct: 884  KKDDEELAVPADDEQNVCALCGEPFDDFYSDETEEWMYKGAVYMNAPSGSTAGMEKSQLG 943

Query: 272  PIVHAKCRSDSHVIPTEDFTK-DEGELTEEGIQRKRMRS 159
            PI+HAKCRS+S   P ED  + DEG   E+G QRKRMRS
Sbjct: 944  PIIHAKCRSESSATPQEDSRRVDEG--LEDGSQRKRMRS 980


>gb|KJB67159.1| hypothetical protein B456_010G178200 [Gossypium raimondii]
          Length = 980

 Score =  392 bits (1006), Expect = e-106
 Identities = 221/463 (47%), Positives = 273/463 (58%), Gaps = 7/463 (1%)
 Frame = -1

Query: 1526 IIGNFPSADGKFCRRPDVVSTI-SSMFDSLSPEIPSADAPASTESWLPAKLQXXXXXXXX 1350
            +I   P    +F R P +V    SS  D+++     A  P +  +W P  +         
Sbjct: 524  LIEKLPEGGSQFVRPPALVPRSGSSSLDTVTVVTQPAMLPLTAGAWPPVNVPKSQPPNAH 583

Query: 1349 XXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQS--ELPQFPGQRPGL 1176
                 Q   R  F+ ++  N  ++QG NK  +  ++F    S  QS   +PQ PGQRP L
Sbjct: 584  TNYSLQQHGRSHFDSLNPINAAMNQGQNKHPYMPEQFDNFESKEQSLKTVPQLPGQRPAL 643

Query: 1175 VPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSPSDT 996
                 Q ++    +QPH   +     +                 +NH Y+ Q HG+    
Sbjct: 644  Q----QRNSLHGSLQPHFPPNDARD-SFLSSATGPLPPRLLAPSMNHGYSPQMHGAGISM 698

Query: 995  ALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPA 816
               N +   Q  L I N P  S+H                T+Q +P  QN G L P+ P 
Sbjct: 699  VPSNPIPVAQPPLSIPNMPTGSLHLQGGAMPPLPPGPRP-TSQMMPAAQNAGPLLPNQPQ 757

Query: 815  GGALSGLISSLVAQGLITLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTC 648
            GG  +GLISSL+AQGLI+LTK    QDS+G+EFD DLLKVRHESAI+ALYADLPRQCTTC
Sbjct: 758  GGPFTGLISSLMAQGLISLTKPTPIQDSVGLEFDADLLKVRHESAISALYADLPRQCTTC 817

Query: 647  GLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPA 468
            GLRFK QEEHS HMDWHV           KPS  WFVS SMWL GAEALGT+AVPGFLP 
Sbjct: 818  GLRFKFQEEHSTHMDWHVTRNRMSKNRKQKPSRKWFVSASMWLSGAEALGTDAVPGFLPT 877

Query: 467  ENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMN 288
            E+IVEKK  EE+AVPADE+Q  CALCGEPFDDFYSDE +EWMY+GAVYMN+P GS EG++
Sbjct: 878  EDIVEKKDDEELAVPADEDQNLCALCGEPFDDFYSDETEEWMYRGAVYMNAPNGSVEGID 937

Query: 287  RSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            RSQLGPIVHAKCRS+S V+P EDF + +G   E+  QRKR+RS
Sbjct: 938  RSQLGPIVHAKCRSESSVVPPEDFVRYDGGNPEDSSQRKRLRS 980


>ref|XP_012450328.1| PREDICTED: polyadenylation and cleavage factor homolog 4 isoform X1
            [Gossypium raimondii] gi|763800201|gb|KJB67156.1|
            hypothetical protein B456_010G178200 [Gossypium
            raimondii]
          Length = 1004

 Score =  392 bits (1006), Expect = e-106
 Identities = 221/463 (47%), Positives = 273/463 (58%), Gaps = 7/463 (1%)
 Frame = -1

Query: 1526 IIGNFPSADGKFCRRPDVVSTI-SSMFDSLSPEIPSADAPASTESWLPAKLQXXXXXXXX 1350
            +I   P    +F R P +V    SS  D+++     A  P +  +W P  +         
Sbjct: 548  LIEKLPEGGSQFVRPPALVPRSGSSSLDTVTVVTQPAMLPLTAGAWPPVNVPKSQPPNAH 607

Query: 1349 XXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGGTNSMAQS--ELPQFPGQRPGL 1176
                 Q   R  F+ ++  N  ++QG NK  +  ++F    S  QS   +PQ PGQRP L
Sbjct: 608  TNYSLQQHGRSHFDSLNPINAAMNQGQNKHPYMPEQFDNFESKEQSLKTVPQLPGQRPAL 667

Query: 1175 VPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQWHGSPSDT 996
                 Q ++    +QPH   +     +                 +NH Y+ Q HG+    
Sbjct: 668  Q----QRNSLHGSLQPHFPPNDARD-SFLSSATGPLPPRLLAPSMNHGYSPQMHGAGISM 722

Query: 995  ALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIGQLAPSPPA 816
               N +   Q  L I N P  S+H                T+Q +P  QN G L P+ P 
Sbjct: 723  VPSNPIPVAQPPLSIPNMPTGSLHLQGGAMPPLPPGPRP-TSQMMPAAQNAGPLLPNQPQ 781

Query: 815  GGALSGLISSLVAQGLITLTK----QDSLGVEFDQDLLKVRHESAITALYADLPRQCTTC 648
            GG  +GLISSL+AQGLI+LTK    QDS+G+EFD DLLKVRHESAI+ALYADLPRQCTTC
Sbjct: 782  GGPFTGLISSLMAQGLISLTKPTPIQDSVGLEFDADLLKVRHESAISALYADLPRQCTTC 841

Query: 647  GLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTEAVPGFLPA 468
            GLRFK QEEHS HMDWHV           KPS  WFVS SMWL GAEALGT+AVPGFLP 
Sbjct: 842  GLRFKFQEEHSTHMDWHVTRNRMSKNRKQKPSRKWFVSASMWLSGAEALGTDAVPGFLPT 901

Query: 467  ENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSPAGSFEGMN 288
            E+IVEKK  EE+AVPADE+Q  CALCGEPFDDFYSDE +EWMY+GAVYMN+P GS EG++
Sbjct: 902  EDIVEKKDDEELAVPADEDQNLCALCGEPFDDFYSDETEEWMYRGAVYMNAPNGSVEGID 961

Query: 287  RSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQRKRMRS 159
            RSQLGPIVHAKCRS+S V+P EDF + +G   E+  QRKR+RS
Sbjct: 962  RSQLGPIVHAKCRSESSVVPPEDFVRYDGGNPEDSSQRKRLRS 1004


>ref|XP_011037707.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform
            X6 [Populus euphratica]
          Length = 886

 Score =  392 bits (1006), Expect = e-106
 Identities = 232/533 (43%), Positives = 287/533 (53%), Gaps = 19/533 (3%)
 Frame = -1

Query: 1700 SGLGVFNKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1521
            SG G  +KI G RT  +QI  S   +E+W FP H++                  +  + +
Sbjct: 359  SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 418

Query: 1520 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1377
             +             P  D +  R P + S   S  DS S    S+  P S+  W P   
Sbjct: 419  SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 478

Query: 1376 QXXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSELPQ 1200
            +               Q R QF+P+++ + +++Q L K S   +Q F G  +   + +  
Sbjct: 479  RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 538

Query: 1199 FPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 1020
             P        LN Q+ A     QP  L S   + N               QPLNH Y   
Sbjct: 539  TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 597

Query: 1019 WHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 840
             H +       N +  VQ  LP+NN PN  +H                  Q++P  QN+ 
Sbjct: 598  GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 653

Query: 839  QLAPSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 672
               P  P+G A SGL +SL+AQGLI+LTKQ    DS+G+EF+ DLLK+R+ESAI+ALY D
Sbjct: 654  SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 713

Query: 671  LPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 492
            LPRQCTTCGLRFK QEEHS HMDWHV           K S +WFVS SMWL GAEALGT+
Sbjct: 714  LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 773

Query: 491  AVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 312
            A PGFLP E  VEKK    MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS 
Sbjct: 774  AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 833

Query: 311  AGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEG--ELTEEGIQRKRMRS 159
             GS  GM+RSQLGPIVHAKCRSDS V+P EDF  DEG    +EEG QRKRMRS
Sbjct: 834  NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDEGLQVNSEEGNQRKRMRS 886


>ref|XP_011037706.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform
            X5 [Populus euphratica]
          Length = 1035

 Score =  392 bits (1006), Expect = e-106
 Identities = 232/533 (43%), Positives = 287/533 (53%), Gaps = 19/533 (3%)
 Frame = -1

Query: 1700 SGLGVFNKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1521
            SG G  +KI G RT  +QI  S   +E+W FP H++                  +  + +
Sbjct: 508  SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 567

Query: 1520 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1377
             +             P  D +  R P + S   S  DS S    S+  P S+  W P   
Sbjct: 568  SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 627

Query: 1376 QXXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSELPQ 1200
            +               Q R QF+P+++ + +++Q L K S   +Q F G  +   + +  
Sbjct: 628  RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 687

Query: 1199 FPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 1020
             P        LN Q+ A     QP  L S   + N               QPLNH Y   
Sbjct: 688  TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 746

Query: 1019 WHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 840
             H +       N +  VQ  LP+NN PN  +H                  Q++P  QN+ 
Sbjct: 747  GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 802

Query: 839  QLAPSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 672
               P  P+G A SGL +SL+AQGLI+LTKQ    DS+G+EF+ DLLK+R+ESAI+ALY D
Sbjct: 803  SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 862

Query: 671  LPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 492
            LPRQCTTCGLRFK QEEHS HMDWHV           K S +WFVS SMWL GAEALGT+
Sbjct: 863  LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 922

Query: 491  AVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 312
            A PGFLP E  VEKK    MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS 
Sbjct: 923  AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 982

Query: 311  AGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEG--ELTEEGIQRKRMRS 159
             GS  GM+RSQLGPIVHAKCRSDS V+P EDF  DEG    +EEG QRKRMRS
Sbjct: 983  NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDEGLQVNSEEGNQRKRMRS 1035


>ref|XP_011037702.1| PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform
            X1 [Populus euphratica] gi|743885952|ref|XP_011037703.1|
            PREDICTED: polyadenylation and cleavage factor homolog
            4-like isoform X2 [Populus euphratica]
            gi|743885954|ref|XP_011037704.1| PREDICTED:
            polyadenylation and cleavage factor homolog 4-like
            isoform X3 [Populus euphratica]
          Length = 1053

 Score =  392 bits (1006), Expect = e-106
 Identities = 232/533 (43%), Positives = 287/533 (53%), Gaps = 19/533 (3%)
 Frame = -1

Query: 1700 SGLGVFNKITGLRTPTSQITASSSARESWKFPDHLNXXXXXXXXXXXXXXSACEVKPTII 1521
            SG G  +KI G RT  +QI  S   +E+W FP H++                  +  + +
Sbjct: 526  SGRGSTSKIPGFRTERNQILGSRHHQEAWNFPPHIHQSAHLLNSKGRGRDFQMPLSGSGV 585

Query: 1520 GNF------------PSADGKFCRRPDVVSTISSMFDSLSPEIPSADAPASTESWLPAKL 1377
             +             P  D +  R P + S   S  DS S    S+  P S+  W P   
Sbjct: 586  SSLGGENYSPLAEKLPDIDAQLNRSPAIASRWGSNIDSTSSGTWSSVVPPSSGVWPPVNA 645

Query: 1376 QXXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNK-SIHSKQRFGGTNSMAQSELPQ 1200
            +               Q R QF+P+++ + +++Q L K S   +Q F G  +   + +  
Sbjct: 646  RKSLPPPVHRIFPPPEQSRSQFDPINASSTVINQVLQKGSAMPEQPFNGFENKDYNSMKP 705

Query: 1199 FPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXSHATIQPLNHRYAAQ 1020
             P        LN Q+ A     QP  L S   + N               QPLNH Y   
Sbjct: 706  TPMSNQHAA-LNQQNQAHVNPFQPQQLPSHETRENFHPSGVTSMPPRPLGQPLNHGYNTH 764

Query: 1019 WHGSPSDTALHNIVLGVQSSLPINNAPNTSIHXXXXXXXXXXXXXXXGTTQSIPIPQNIG 840
             H +       N +  VQ  LP+NN PN  +H                  Q++P  QN+ 
Sbjct: 765  GHSTAISMVPSNALPAVQLPLPVNNIPNM-LHSQVGLRPPLPPGPPP---QTMPFSQNVS 820

Query: 839  QLAPSPPAGGALSGLISSLVAQGLITLTKQ----DSLGVEFDQDLLKVRHESAITALYAD 672
               P  P+G A SGL +SL+AQGLI+LTKQ    DS+G+EF+ DLLK+R+ESAI+ALY D
Sbjct: 821  SSVPGQPSGSAFSGLFNSLMAQGLISLTKQSPVQDSVGLEFNADLLKLRYESAISALYGD 880

Query: 671  LPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFVSISMWLHGAEALGTE 492
            LPRQCTTCGLRFK QEEHS HMDWHV           K S +WFVS SMWL GAEALGT+
Sbjct: 881  LPRQCTTCGLRFKCQEEHSTHMDWHVTKNRMSKNRKQKSSRNWFVSASMWLSGAEALGTD 940

Query: 491  AVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDEMDEWMYKGAVYMNSP 312
            A PGFLP E  VEKK    MAVPADE Q+ CALCGEPFDDFYSDE +EWMY+GAVY+NS 
Sbjct: 941  AAPGFLPTETTVEKKDDHGMAVPADEEQSTCALCGEPFDDFYSDETEEWMYRGAVYLNSS 1000

Query: 311  AGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEG--ELTEEGIQRKRMRS 159
             GS  GM+RSQLGPIVHAKCRSDS V+P EDF  DEG    +EEG QRKRMRS
Sbjct: 1001 NGSTAGMDRSQLGPIVHAKCRSDSSVVPPEDFGHDEGLQVNSEEGNQRKRMRS 1053


>ref|XP_010275999.1| PREDICTED: uncharacterized protein LOC104610875 isoform X2 [Nelumbo
            nucifera]
          Length = 895

 Score =  391 bits (1004), Expect = e-105
 Identities = 231/486 (47%), Positives = 282/486 (58%), Gaps = 23/486 (4%)
 Frame = -1

Query: 1547 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSM------FDSLSPEIPSADAPASTES--- 1395
            A +  P+ + NF   D +F R   VVS + S        ++LS  +P A A         
Sbjct: 416  AIKKMPSQVDNFLDTDAQFQRFSGVVSRMGSSNRDTMNVEALSTMMPPASALQKHRGQRP 475

Query: 1394 ------WLPAKLQXXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGG 1233
                  W P  +              Q Q++ Q N M    +      NKS+    +  G
Sbjct: 476  SLAPLVWPPVNVPKSHPPPPLSVLPQQNQIKSQSNIMDISRIP-----NKSLTLPGQHLG 530

Query: 1232 T---NSMAQSELPQFPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXS 1062
                N++  ++L QFP Q+ GL+ LN +S  QA+ +    LMSQ  Q N          +
Sbjct: 531  VIERNTLTPTKLLQFPNQQAGLISLNQRSQGQASHLPAQPLMSQNAQENFVPSAVAQMST 590

Query: 1061 HATIQPLNHRYAAQWHGSPSDTALHNIVLGV-QSSLPINNAPNTSIHXXXXXXXXXXXXX 885
            H   QPLNH +  Q H S + + L N + G+  SS+ I+   NT  H             
Sbjct: 591  HKMEQPLNHGHIPQGHLSVTSSILPNPIPGLASSSVTIHGLSNTPFHLPGRALPPLPPGP 650

Query: 884  XXGTTQSIPIPQNIGQLAPSPPAGGALSGLISSLVAQGLITLTK----QDSLGVEFDQDL 717
               ++Q  PI QN+G +A    +G A SGLISSL+AQGLI+LT     QDS+GVEF+ DL
Sbjct: 651  PPVSSQIEPISQNVGPIATHASSGSAFSGLISSLMAQGLISLTTPASVQDSIGVEFNLDL 710

Query: 716  LKVRHESAITALYADLPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFV 537
            LKVRHESAI ALYADLPRQCTTCGLRFK QEEHS HMDWHV           KPS  WFV
Sbjct: 711  LKVRHESAIKALYADLPRQCTTCGLRFKCQEEHSSHMDWHVTKNRISKSRKQKPSRKWFV 770

Query: 536  SISMWLHGAEALGTEAVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDE 357
            S ++WL GAEALG +AVPGFLP E + EK   +EMAVPADENQ  CALCGEPFDDFYSDE
Sbjct: 771  STNVWLSGAEALGVDAVPGFLPTEAVAEKDD-QEMAVPADENQNVCALCGEPFDDFYSDE 829

Query: 356  MDEWMYKGAVYMNSPAGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQ 177
             +EWMYKGAVY+N+P G    M+RSQLGPIVHAKCRS+S V+P EDF  DEG  TEEG Q
Sbjct: 830  TEEWMYKGAVYLNAPDGPPADMDRSQLGPIVHAKCRSESTVVPPEDFQLDEGGTTEEGNQ 889

Query: 176  RKRMRS 159
            RKRMRS
Sbjct: 890  RKRMRS 895


>ref|XP_010275998.1| PREDICTED: uncharacterized protein LOC104610875 isoform X1 [Nelumbo
            nucifera]
          Length = 1071

 Score =  391 bits (1004), Expect = e-105
 Identities = 231/486 (47%), Positives = 282/486 (58%), Gaps = 23/486 (4%)
 Frame = -1

Query: 1547 ACEVKPTIIGNFPSADGKFCRRPDVVSTISSM------FDSLSPEIPSADAPASTES--- 1395
            A +  P+ + NF   D +F R   VVS + S        ++LS  +P A A         
Sbjct: 592  AIKKMPSQVDNFLDTDAQFQRFSGVVSRMGSSNRDTMNVEALSTMMPPASALQKHRGQRP 651

Query: 1394 ------WLPAKLQXXXXXXXXXXXXSQMQMRGQFNPMSSGNVIVDQGLNKSIHSKQRFGG 1233
                  W P  +              Q Q++ Q N M    +      NKS+    +  G
Sbjct: 652  SLAPLVWPPVNVPKSHPPPPLSVLPQQNQIKSQSNIMDISRIP-----NKSLTLPGQHLG 706

Query: 1232 T---NSMAQSELPQFPGQRPGLVPLNLQSSAQAARVQPHLLMSQGVQLNXXXXXXXXXXS 1062
                N++  ++L QFP Q+ GL+ LN +S  QA+ +    LMSQ  Q N          +
Sbjct: 707  VIERNTLTPTKLLQFPNQQAGLISLNQRSQGQASHLPAQPLMSQNAQENFVPSAVAQMST 766

Query: 1061 HATIQPLNHRYAAQWHGSPSDTALHNIVLGV-QSSLPINNAPNTSIHXXXXXXXXXXXXX 885
            H   QPLNH +  Q H S + + L N + G+  SS+ I+   NT  H             
Sbjct: 767  HKMEQPLNHGHIPQGHLSVTSSILPNPIPGLASSSVTIHGLSNTPFHLPGRALPPLPPGP 826

Query: 884  XXGTTQSIPIPQNIGQLAPSPPAGGALSGLISSLVAQGLITLTK----QDSLGVEFDQDL 717
               ++Q  PI QN+G +A    +G A SGLISSL+AQGLI+LT     QDS+GVEF+ DL
Sbjct: 827  PPVSSQIEPISQNVGPIATHASSGSAFSGLISSLMAQGLISLTTPASVQDSIGVEFNLDL 886

Query: 716  LKVRHESAITALYADLPRQCTTCGLRFKSQEEHSKHMDWHVXXXXXXXXXXXKPSPSWFV 537
            LKVRHESAI ALYADLPRQCTTCGLRFK QEEHS HMDWHV           KPS  WFV
Sbjct: 887  LKVRHESAIKALYADLPRQCTTCGLRFKCQEEHSSHMDWHVTKNRISKSRKQKPSRKWFV 946

Query: 536  SISMWLHGAEALGTEAVPGFLPAENIVEKKKYEEMAVPADENQTACALCGEPFDDFYSDE 357
            S ++WL GAEALG +AVPGFLP E + EK   +EMAVPADENQ  CALCGEPFDDFYSDE
Sbjct: 947  STNVWLSGAEALGVDAVPGFLPTEAVAEKDD-QEMAVPADENQNVCALCGEPFDDFYSDE 1005

Query: 356  MDEWMYKGAVYMNSPAGSFEGMNRSQLGPIVHAKCRSDSHVIPTEDFTKDEGELTEEGIQ 177
             +EWMYKGAVY+N+P G    M+RSQLGPIVHAKCRS+S V+P EDF  DEG  TEEG Q
Sbjct: 1006 TEEWMYKGAVYLNAPDGPPADMDRSQLGPIVHAKCRSESTVVPPEDFQLDEGGTTEEGNQ 1065

Query: 176  RKRMRS 159
            RKRMRS
Sbjct: 1066 RKRMRS 1071

