BLASTX nr result

ID: Forsythia22_contig00000654

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00000654
         (2698 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011083480.1| PREDICTED: uncharacterized protein LOC105166...   226   1e-55
emb|CDP07372.1| unnamed protein product [Coffea canephora]            179   1e-41
ref|XP_011074998.1| PREDICTED: uncharacterized protein LOC105159...   178   2e-41
ref|XP_009772830.1| PREDICTED: glutamic acid-rich protein-like [...   134   5e-28
ref|XP_010265994.1| PREDICTED: protein FAM133 isoform X2 [Nelumb...   132   1e-27
ref|XP_010265992.1| PREDICTED: splicing regulatory glutamine/lys...   132   1e-27
ref|XP_012469303.1| PREDICTED: splicing regulatory glutamine/lys...   125   2e-25
gb|KJB17613.1| hypothetical protein B456_003G007900 [Gossypium r...   125   2e-25
ref|XP_012835702.1| PREDICTED: uncharacterized protein LOC105956...   125   2e-25
ref|XP_009589462.1| PREDICTED: uncharacterized protein LOC104086...   124   4e-25
ref|XP_010266830.1| PREDICTED: glutamic acid-rich protein-like [...   122   2e-24
gb|KHG14510.1| Heat shock factor 4 [Gossypium arboreum]               121   3e-24
ref|XP_006342067.1| PREDICTED: muscle M-line assembly protein un...   119   1e-23
ref|XP_004238373.1| PREDICTED: exocyst complex component 6 [Sola...   111   3e-21
ref|XP_002300694.1| hypothetical protein POPTR_0002s02070g [Popu...   110   5e-21
ref|XP_006386168.1| hypothetical protein POPTR_0002s02070g [Popu...   110   5e-21
ref|XP_012073759.1| PREDICTED: uncharacterized protein LOC105635...   110   8e-21
ref|XP_010029657.1| PREDICTED: uncharacterized protein LOC104419...   109   1e-20
ref|XP_011034990.1| PREDICTED: DNA ligase 1 [Populus euphratica]      107   7e-20
ref|XP_007017860.1| JHL20J20.12 protein, putative [Theobroma cac...   103   1e-18

>ref|XP_011083480.1| PREDICTED: uncharacterized protein LOC105166005 [Sesamum indicum]
          Length = 750

 Score =  226 bits (575), Expect = 1e-55
 Identities = 126/278 (45%), Positives = 167/278 (60%), Gaps = 10/278 (3%)
 Frame = -2

Query: 1314 TNQNSGKSHNVDNVCVGENIWVDSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXX 1135
            T+QN  KS  V    +G+ IW D++  LL++G +AE EQLERSSLTEEH QPVC   P  
Sbjct: 477  THQNLDKSGIVQGH-IGKKIWPDAKVELLYRGSQAEPEQLERSSLTEEHGQPVCLCAPST 535

Query: 1134 XXXSTENSNKRKRHTSSIDGSNSLGSII---------KIRLPSKKKNVPDTSSNKPQLCS 982
               STENSNKRKR +S  D +     I+         +IRLP KK   P+ S +  ++CS
Sbjct: 536  SSDSTENSNKRKRQSSPADVARGHSKIVFPRLPLINYRIRLPVKK---PNESGDTDRICS 592

Query: 981  TSGRVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRTDQELVFSASRQMESAAQG 802
            TSG   FP Q KD+I+L+ ++G VC T Q T N+ QG   RTD+E + S S Q+      
Sbjct: 593  TSGSTPFPSQNKDDISLKYNRGNVCCTLQETSNIAQGLSRRTDREQICSTSGQINPVTAV 652

Query: 801  KLGTSSVANAVLTPTQRLELQYNNLVENWVPSE-LHTLVNDKDDQEWLFQSKDKGVHPEK 625
            K G  S +N V+TP QR+ELQY NL+ENW+P + L + +N  D+Q+WLF  K++G   EK
Sbjct: 653  KTGIPSASNTVMTPMQRMELQYKNLIENWIPPKWLDSSLNSDDEQDWLFLGKNEGQRAEK 712

Query: 624  XXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
                         S A+WP ++YL D D+ ALPF VPF
Sbjct: 713  RQKAGNDSLPCSSSSAIWPHAQYLQDVDVYALPFTVPF 750


>emb|CDP07372.1| unnamed protein product [Coffea canephora]
          Length = 319

 Score =  179 bits (454), Expect = 1e-41
 Identities = 104/262 (39%), Positives = 141/262 (53%)
 Frame = -2

Query: 1296 KSHNVDNVCVGENIWVDSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTE 1117
            K+   D+   G   W + +GG L K +K + EQLERSS+TEEHEQPVCS+ P     ST+
Sbjct: 62   KASQFDHDACGGESWENVKGGFLQKERKDDSEQLERSSITEEHEQPVCSQNPSYSSDSTQ 121

Query: 1116 NSNKRKRHTSSIDGSNSLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEI 937
            NSNKRKRH   ++ +   G+I++IRLPS+K    D+      LCSTSGR D P + KD  
Sbjct: 122  NSNKRKRHDPPLNATRVQGNILRIRLPSQKHIQHDSKDRDELLCSTSGRTDIPAEHKD-- 179

Query: 936  ALRASKGTVCPTSQGTHNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPT 757
              RA     C TS G+  ++ G P R+DQ L    S Q       ++  +   +      
Sbjct: 180  -ARADPDKSCSTSLGSDLILHGLPLRSDQGLARGNSSQQPDVTSQEIVHTDSGSKRHRKL 238

Query: 756  QRLELQYNNLVENWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLA 577
            +R   +Y +L+ENW P    +   + DD+ WLF SK     PEK             SL 
Sbjct: 239  KRAVKRYTDLIENWTPPSRLSEHTEIDDEGWLFGSKHAEKQPEKKVRCSSDISCSSSSL- 297

Query: 576  LWPQSKYLPDADICALPFVVPF 511
            LWP++ +L DADI ALP+ VPF
Sbjct: 298  LWPRACHLHDADIYALPYTVPF 319


>ref|XP_011074998.1| PREDICTED: uncharacterized protein LOC105159585, partial [Sesamum
            indicum]
          Length = 310

 Score =  178 bits (451), Expect = 2e-41
 Identities = 120/296 (40%), Positives = 152/296 (51%), Gaps = 28/296 (9%)
 Frame = -2

Query: 1314 TNQNSGKSHNVDNVCVGENIWVDSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXX 1135
            TN+N  KS ++  VC     W D +G + H  ++AE EQLERSSLTE+H QP+   +P  
Sbjct: 22   TNKNLEKS-DITKVC-----W-DGKGKVHHYSEEAETEQLERSSLTEDHGQPISFPVPSS 74

Query: 1134 XXXSTENSNKRKRHTS--SIDGSNSLGSIIKIRLPSKKKNVPDT---------------- 1009
               STEN+NKRKRH+S   +DGS S G +I IRLPSKK+N  D                 
Sbjct: 75   SSDSTENTNKRKRHSSVSPMDGSRSHGKVILIRLPSKKQNEFDALKKVGSASLEKDLSVQ 134

Query: 1008 ---SSNKPQLCSTSG----RVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRTDQ 850
                +        SG    + DFP Q KD+I  R        T +GT N+ QG   RT+ 
Sbjct: 135  SKDDAGLKDRSENSGCAFFQTDFPAQSKDDIGRRNRLENTHSTMKGTSNIKQGITLRTNS 194

Query: 849  ELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWV--PSELHTLVNDKD 676
            E V S S Q+E+ A GK G  SV  AVL   Q+ ELQY NL+E WV    E   L  D  
Sbjct: 195  EQVCSTSGQIEAVAPGKTGIKSVNKAVLKSVQKRELQYKNLLEKWVAPQPEDGCLYADDP 254

Query: 675  DQEWLFQSKDK-GVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
            D +WLF  KDK   H +K             S   WP ++YL + D+ ALPF VP+
Sbjct: 255  DSDWLFDCKDKNNTHAKKRQRRGSESISCSRSSTWWPHTEYLHEIDVYALPFTVPY 310


>ref|XP_009772830.1| PREDICTED: glutamic acid-rich protein-like [Nicotiana sylvestris]
          Length = 358

 Score =  134 bits (336), Expect = 5e-28
 Identities = 98/249 (39%), Positives = 132/249 (53%), Gaps = 3/249 (1%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGS- 1072
            +S+G  L K  + E EQLERS+LTEEH Q VCS+       ST+NSNKRKR  S   G+ 
Sbjct: 122  ESKGMYLFKCLEDEAEQLERSNLTEEHGQAVCSQNSSCSSDSTQNSNKRKRPASPSHGNI 181

Query: 1071 NSLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQG 892
             + GSII+IRL SKK     TS+ K +      ++  P Q+  E+ +R S     P  + 
Sbjct: 182  QAHGSIIRIRL-SKKDGQGKTSTAKEK------QLRKPAQKDVEVTVRTSVERANPLLKA 234

Query: 891  THNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWV 712
            T+N  QG P+ +  E   S S   +  A  K  T+S + A       +E QY NL+ENW+
Sbjct: 235  TNN--QGCPSPSVLEPSPSTSGWRDCVAVDKAATASCSKA---HENSIEFQYRNLIENWL 289

Query: 711  PSELHTLVNDKDDQEWLFQSKDK--GVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADI 538
            P  L T   D D + WLFQ K K   V  +              S ALWP+++Y+ DA++
Sbjct: 290  PPSLQTEHLDVDGEAWLFQRKPKHTRVGEKSAVSKEVSNDSTCGSSALWPRAQYIHDAEL 349

Query: 537  CALPFVVPF 511
             ALPF VPF
Sbjct: 350  YALPFTVPF 358


>ref|XP_010265994.1| PREDICTED: protein FAM133 isoform X2 [Nelumbo nucifera]
          Length = 325

 Score =  132 bits (332), Expect = 1e-27
 Identities = 87/252 (34%), Positives = 130/252 (51%), Gaps = 5/252 (1%)
 Frame = -2

Query: 1251 VDSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGS 1072
            VD++GG   +    E  QLE+S LTEEH Q   S+ P     ST+NS+KRK+H++S D S
Sbjct: 78   VDNKGGEHPRNSHDESGQLEKSGLTEEHGQAAVSQNPDDSSDSTQNSHKRKKHSTSSDVS 137

Query: 1071 NSLGSIIKIRLPSKKKNVPD-TSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQ 895
            ++  SI++IRLP  K   P+   S +   CS+SGR+    Q K E      +  VC TS 
Sbjct: 138  HNHASILRIRLPLVKHKDPEMLPSKEVAACSSSGRIVISSQGKCEATPEPKREEVCSTSD 197

Query: 894  GTHNLVQGFPTRTDQELV---FSASRQMESAAQGKLGTSSVANAVLTPTQRLEL-QYNNL 727
             +   +Q          +    S SR+ E  A+ + GT + +    +   ++EL +Y +L
Sbjct: 198  RSEIAIQDKHANVPCSSIGVRCSTSRRNEVVAEDRTGTCTSSFPAESDEMKIELRKYRDL 257

Query: 726  VENWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPD 547
            ++NWVP  + +  N+ D+Q+WLF+ +    H  K             S   WP+  YL +
Sbjct: 258  IQNWVPPAIQSEYNEFDNQDWLFEVR----HESKKVKVDGGSSSHGTSSDPWPRCCYLRE 313

Query: 546  ADICALPFVVPF 511
             DI ALPF VPF
Sbjct: 314  VDIYALPFTVPF 325


>ref|XP_010265992.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1
            isoform X1 [Nelumbo nucifera]
            gi|720032033|ref|XP_010265993.1| PREDICTED: splicing
            regulatory glutamine/lysine-rich protein 1 isoform X1
            [Nelumbo nucifera]
          Length = 327

 Score =  132 bits (332), Expect = 1e-27
 Identities = 87/252 (34%), Positives = 130/252 (51%), Gaps = 5/252 (1%)
 Frame = -2

Query: 1251 VDSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGS 1072
            VD++GG   +    E  QLE+S LTEEH Q   S+ P     ST+NS+KRK+H++S D S
Sbjct: 80   VDNKGGEHPRNSHDESGQLEKSGLTEEHGQAAVSQNPDDSSDSTQNSHKRKKHSTSSDVS 139

Query: 1071 NSLGSIIKIRLPSKKKNVPD-TSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQ 895
            ++  SI++IRLP  K   P+   S +   CS+SGR+    Q K E      +  VC TS 
Sbjct: 140  HNHASILRIRLPLVKHKDPEMLPSKEVAACSSSGRIVISSQGKCEATPEPKREEVCSTSD 199

Query: 894  GTHNLVQGFPTRTDQELV---FSASRQMESAAQGKLGTSSVANAVLTPTQRLEL-QYNNL 727
             +   +Q          +    S SR+ E  A+ + GT + +    +   ++EL +Y +L
Sbjct: 200  RSEIAIQDKHANVPCSSIGVRCSTSRRNEVVAEDRTGTCTSSFPAESDEMKIELRKYRDL 259

Query: 726  VENWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPD 547
            ++NWVP  + +  N+ D+Q+WLF+ +    H  K             S   WP+  YL +
Sbjct: 260  IQNWVPPAIQSEYNEFDNQDWLFEVR----HESKKVKVDGGSSSHGTSSDPWPRCCYLRE 315

Query: 546  ADICALPFVVPF 511
             DI ALPF VPF
Sbjct: 316  VDIYALPFTVPF 327


>ref|XP_012469303.1| PREDICTED: splicing regulatory glutamine/lysine-rich protein 1
            [Gossypium raimondii] gi|763750226|gb|KJB17614.1|
            hypothetical protein B456_003G007900 [Gossypium
            raimondii]
          Length = 325

 Score =  125 bits (313), Expect = 2e-25
 Identities = 92/250 (36%), Positives = 128/250 (51%), Gaps = 4/250 (1%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSN 1069
            D +GG   K ++ E+E  E+S+LTEEH Q V    P     ST NS+KR++ +S  D   
Sbjct: 84   DQKGGDRQKKREYEVECFEKSTLTEEHGQAVG---PQNSSDSTLNSSKRQKLSSPPDSGQ 140

Query: 1068 SLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFP-VQRKDEIALRASK---GTVCPT 901
            + GSII+IRLPS++   P+   +K Q CSTSG  D   VQR  E A R  K      C T
Sbjct: 141  NPGSIIRIRLPSQRHKDPEVLPSKEQPCSTSGNTDEAFVQRVHEHAPRPGKELEEQPCST 200

Query: 900  SQGTHNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVE 721
            S            +  +E   S+SR  E+ A       +++N   T   +L LQ+ NLVE
Sbjct: 201  SDIKR---PELTFKLGKEKACSSSRTSETLAH-NTKAPTLSNLCTTCPPKLALQFKNLVE 256

Query: 720  NWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDAD 541
            +WV     + +    D +WLFQ K + ++ E              S A WP++ +LP+AD
Sbjct: 257  DWVMPTPQSELTSSGDDDWLFQKK-QNLNTEVKTHKDGNLNSNQMSSATWPRACFLPEAD 315

Query: 540  ICALPFVVPF 511
            I ALPF VPF
Sbjct: 316  IYALPFTVPF 325


>gb|KJB17613.1| hypothetical protein B456_003G007900 [Gossypium raimondii]
          Length = 323

 Score =  125 bits (313), Expect = 2e-25
 Identities = 92/250 (36%), Positives = 128/250 (51%), Gaps = 4/250 (1%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSN 1069
            D +GG   K ++ E+E  E+S+LTEEH Q V    P     ST NS+KR++ +S  D   
Sbjct: 82   DQKGGDRQKKREYEVECFEKSTLTEEHGQAVG---PQNSSDSTLNSSKRQKLSSPPDSGQ 138

Query: 1068 SLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFP-VQRKDEIALRASK---GTVCPT 901
            + GSII+IRLPS++   P+   +K Q CSTSG  D   VQR  E A R  K      C T
Sbjct: 139  NPGSIIRIRLPSQRHKDPEVLPSKEQPCSTSGNTDEAFVQRVHEHAPRPGKELEEQPCST 198

Query: 900  SQGTHNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVE 721
            S            +  +E   S+SR  E+ A       +++N   T   +L LQ+ NLVE
Sbjct: 199  SDIKR---PELTFKLGKEKACSSSRTSETLAH-NTKAPTLSNLCTTCPPKLALQFKNLVE 254

Query: 720  NWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDAD 541
            +WV     + +    D +WLFQ K + ++ E              S A WP++ +LP+AD
Sbjct: 255  DWVMPTPQSELTSSGDDDWLFQKK-QNLNTEVKTHKDGNLNSNQMSSATWPRACFLPEAD 313

Query: 540  ICALPFVVPF 511
            I ALPF VPF
Sbjct: 314  IYALPFTVPF 323


>ref|XP_012835702.1| PREDICTED: uncharacterized protein LOC105956404 [Erythranthe
            guttatus] gi|604334711|gb|EYU38783.1| hypothetical
            protein MIMGU_mgv1a008728mg [Erythranthe guttata]
          Length = 364

 Score =  125 bits (313), Expect = 2e-25
 Identities = 99/274 (36%), Positives = 132/274 (48%), Gaps = 33/274 (12%)
 Frame = -2

Query: 1233 LLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSNSLGSI 1054
            LL K  KAE +QLERSSLTEE  +PV   +P     STENSNKRKR +S +D + + G I
Sbjct: 93   LLQKVSKAEADQLERSSLTEERGKPVV--LPSTSADSTENSNKRKRQSSPLDCARAPGKI 150

Query: 1053 IKIRLPSKKKNVP--DTSSN---KPQLCSTSGRVDFPVQRKDEIALR------------- 928
            I+I+L SK +N    D S N   + Q CSTSGR  FP   KDE+  R             
Sbjct: 151  IRIKLSSKNQNPSPIDASVNEQQQTQTCSTSGRPSFPSFNKDEVVFRQRTEDLSSCTLKA 210

Query: 927  ----ASKGTVCPTSQG-THNLVQGFP----TRTDQELVFSASRQMESAAQGKLGTSSVAN 775
                  +  +C +SQ   H  VQ  P    T   Q       + + S  +          
Sbjct: 211  QIPVIGRDPICSSSQQIEHVPVQKMPVPSVTTPMQRSALVTGKDICSIPKPIEPVQKTPA 270

Query: 774  AVLTPTQRLELQYNNLVENWVPSELH-TLVNDKDDQEWLFQSK--DKGVHPEKXXXXXXX 604
              L+  QR  L+Y NL E W P +L   L  D DD +WLF+ K   +G+  EK       
Sbjct: 271  PHLSRVQRNALRYKNLTEMWAPPQLEFALPEDTDDVDWLFKGKKNQEGISSEKRCCSTSV 330

Query: 603  XXXXXXSLA--LW-PQSKYLPDADICALPFVVPF 511
                  S +  +W P+++YL + DI ALP+ +PF
Sbjct: 331  NDAKSCSSSSIMWPPRAQYLQEVDIYALPYTIPF 364


>ref|XP_009589462.1| PREDICTED: uncharacterized protein LOC104086823 [Nicotiana
            tomentosiformis]
          Length = 311

 Score =  124 bits (311), Expect = 4e-25
 Identities = 93/249 (37%), Positives = 128/249 (51%), Gaps = 3/249 (1%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGS- 1072
            +S+G  L K  + E EQLERS+LTEEH Q VCS+       ST+NSNKRKR  S   G+ 
Sbjct: 75   ESKGMYLFKCLEDEAEQLERSNLTEEHGQAVCSQNSSCSSDSTQNSNKRKRPASPSHGNI 134

Query: 1071 NSLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQG 892
             + GSII+IRL SKK    + S+ K +      ++  P Q+  E+ +R       P  + 
Sbjct: 135  QAHGSIIRIRL-SKKGGQGEMSTAKEK------QLRKPAQKDAEVTVRTIAERANPLQKA 187

Query: 891  THNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWV 712
            T+N  Q  P+ +  E   S S   +  A  K  T+  +         +E QY NL+ENW+
Sbjct: 188  TNN--QCCPSLSVLEPSPSTSGWTDCVAVDKTATALCSK---EHENSIEFQYKNLIENWL 242

Query: 711  PSELHTLVNDKDDQEWLFQSKDK--GVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADI 538
            P  L T     DD+ WLFQ K K   V  +              S ALWP+++Y+ DA++
Sbjct: 243  PPSLQTEHLGVDDESWLFQRKPKHTRVGEKSVVSKEVSNDSTCGSSALWPRAQYIHDAEL 302

Query: 537  CALPFVVPF 511
             ALPF VPF
Sbjct: 303  YALPFTVPF 311


>ref|XP_010266830.1| PREDICTED: glutamic acid-rich protein-like [Nelumbo nucifera]
            gi|720034790|ref|XP_010266831.1| PREDICTED: glutamic
            acid-rich protein-like [Nelumbo nucifera]
          Length = 323

 Score =  122 bits (305), Expect = 2e-24
 Identities = 93/268 (34%), Positives = 128/268 (47%), Gaps = 5/268 (1%)
 Frame = -2

Query: 1299 GKSHNVDNVCVGENIWVDSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXST 1120
            GK HN +     +   VD +GG   K  + + EQLE+S LTE+H Q   S+       ST
Sbjct: 60   GKKHNDEKK--HKKSKVDQKGGEHPKNIQDDSEQLEKSVLTEDHGQAAVSQNVYDSSDST 117

Query: 1119 ENSNKRKRHTSSIDGSNSLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDE 940
             NS+KRKR +S  DGS + GSI++IRLP  K   P+   +K      S   D   Q + E
Sbjct: 118  GNSHKRKRLSSPSDGSQNNGSILRIRLPLMKHKEPEALPSKAVSSCNSIGSDVVAQGRCE 177

Query: 939  IALRASKGTVCPTS--QGTHNLVQGFPTRTDQ---ELVFSASRQMESAAQGKLGTSSVAN 775
            +  R+     C +S    T   VQ    +      EL  S S  +   AQ K  +S+   
Sbjct: 178  VTHRSGNEQACSSSCRIETATAVQNSHVKASSSSIELPCSTS-SIGIVAQDKAPSSTCGV 236

Query: 774  AVLTPTQRLELQYNNLVENWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXX 595
                  ++   +Y +LVENWVP  +     + DDQ+WLF+ K  G H  K          
Sbjct: 237  PKRDKIKKELQKYRDLVENWVPPPIQREYAEFDDQDWLFEVKPHGRHEAKRVKVDSDSLC 296

Query: 594  XXXSLALWPQSKYLPDADICALPFVVPF 511
               S  LWPQ+ YLP+ D+ ALP+ VPF
Sbjct: 297  RGSS-DLWPQACYLPEVDVYALPYTVPF 323


>gb|KHG14510.1| Heat shock factor 4 [Gossypium arboreum]
          Length = 325

 Score =  121 bits (303), Expect = 3e-24
 Identities = 91/250 (36%), Positives = 126/250 (50%), Gaps = 4/250 (1%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSN 1069
            D +GG   K ++ E+E  E+S+LTEEH Q V    P     ST NS+KR++ +S  D   
Sbjct: 84   DQKGGDRQKKRENEVECFEKSTLTEEHGQAVG---PQNSSDSTLNSSKRQKLSSPPDSGQ 140

Query: 1068 SLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFP-VQRKDEIALRASK---GTVCPT 901
            + GSII+IRLPS++   P+   +K Q CSTSG  D   VQR  E A R  K      C T
Sbjct: 141  NPGSIIRIRLPSQRHKDPEVLPSKEQPCSTSGNTDEAFVQRVHEHAPRPGKELEEQPCST 200

Query: 900  SQGTHNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVE 721
            S            +  +E   S+S   E+ A       +++N   T   +L LQ+ NLVE
Sbjct: 201  SDIKR---PELTFKLGKEKACSSSLTSETLAH-NAKVPTLSNLCTTCPPKLALQFKNLVE 256

Query: 720  NWVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDAD 541
            +WV   L +      D +WL Q K + ++ E              S A WP++ +LP+AD
Sbjct: 257  DWVMPTLQSESTSSGDDDWLVQKK-QNLNTEVKTHKDGNLNSNQMSSATWPRACFLPEAD 315

Query: 540  ICALPFVVPF 511
            I ALPF VPF
Sbjct: 316  IYALPFTVPF 325


>ref|XP_006342067.1| PREDICTED: muscle M-line assembly protein unc-89-like [Solanum
            tuberosum]
          Length = 308

 Score =  119 bits (299), Expect = 1e-23
 Identities = 90/255 (35%), Positives = 128/255 (50%), Gaps = 9/255 (3%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTS-SIDGS 1072
            +S+G  L K  + E EQLERS+LTEEHE  VCS+       ST+NSNKRKR  S S  G 
Sbjct: 71   ESKGKYLFKCLEDEAEQLERSNLTEEHEPAVCSQNSSCSSDSTQNSNKRKRPASPSRGGI 130

Query: 1071 NSLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQG 892
             + GSII+IRL  K      ++S+K +       +  P Q+  E+ +RAS     P  + 
Sbjct: 131  QAHGSIIRIRLSKKGMQGEISASSKEK------HLPKPAQQVAEVTVRASAERANPLLKT 184

Query: 891  THN------LVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNN 730
            T+       +V   P+ ++   V   +    + +  K+  +S+           E QY N
Sbjct: 185  TNKRSCPPPVVVSEPSTSNCGWVDRVAVDNATPSCSKVHENSI-----------EFQYKN 233

Query: 729  LVENWVPSELHTLVND-KDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALW-PQSKY 556
            L+ENW+P  L +   D  DDQ WLFQ K K    E+               +LW P+++Y
Sbjct: 234  LIENWLPPSLPSDNLDLDDDQSWLFQRKPKQARVEEKNVGSSNDKTCGSCSSLWQPRAQY 293

Query: 555  LPDADICALPFVVPF 511
            LPD D+ ALP+ VPF
Sbjct: 294  LPDVDLYALPYTVPF 308


>ref|XP_004238373.1| PREDICTED: exocyst complex component 6 [Solanum lycopersicum]
          Length = 309

 Score =  111 bits (278), Expect = 3e-21
 Identities = 90/252 (35%), Positives = 124/252 (49%), Gaps = 6/252 (2%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTS---SID 1078
            +S+G  L K  + E EQLERS+LTEEHE  VCS+       ST+NSNKRKR TS   S  
Sbjct: 71   ESKGKYLFKCFEDEPEQLERSNLTEEHEPAVCSQNSSCSSDSTQNSNKRKRPTSPSPSRG 130

Query: 1077 GSNSLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTS 898
            G  + GSII+IRL SKK    + S +K +       +  P Q+  E+ +R S     P  
Sbjct: 131  GIQAHGSIIRIRL-SKKGVQGEISVSKEK------HLPKPAQQVAEVTVRTSAERANPLL 183

Query: 897  QGTHNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVEN 718
            + T+      P    +    +       A      + S  +        +E QY NL+EN
Sbjct: 184  KTTNKRSCPPPVAVSEPSTSNCGWVDRVAEDNATPSCSKVH-----ENSIEFQYKNLIEN 238

Query: 717  WVPSELHTLVND-KDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALW--PQSKYLPD 547
            W+P  L +   D +DDQ WLFQ K K    E+             S +LW  P+++YLPD
Sbjct: 239  WLPPSLPSDNLDLEDDQSWLFQRKPKQARVEEKNLGGGDKTCGSCS-SLWQQPRAQYLPD 297

Query: 546  ADICALPFVVPF 511
             ++ ALP+ VPF
Sbjct: 298  VELYALPYTVPF 309


>ref|XP_002300694.1| hypothetical protein POPTR_0002s02070g [Populus trichocarpa]
            gi|222842420|gb|EEE79967.1| hypothetical protein
            POPTR_0002s02070g [Populus trichocarpa]
          Length = 284

 Score =  110 bits (276), Expect = 5e-21
 Identities = 76/235 (32%), Positives = 113/235 (48%)
 Frame = -2

Query: 1215 KAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSNSLGSIIKIRLP 1036
            K + E+ E+S LTEEH +PVC +           SNKR++  SS    +   ++ +IRLP
Sbjct: 63   KEKREEAEKSGLTEEHNEPVCLQNVCYLSDDGIRSNKRRKLDSSTTTDDKPRNVFRIRLP 122

Query: 1035 SKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRT 856
              +   PD S N   LCSTSG  D  V  + EI   + + TV   +    +  +  P   
Sbjct: 123  LTRHKEPDVSLNSKGLCSTSGGAD-SVSGQSEIVRLSDQETVNSKAGELASPPENIPCS- 180

Query: 855  DQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWVPSELHTLVNDKD 676
                  S S ++ES+    +  +S        T + + QY  LVE+WVP  L   + D D
Sbjct: 181  ------SVSDKLESS----VSETSWFRFHDRKTLKADSQYKGLVEDWVPPPLQFELKDSD 230

Query: 675  DQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
            D+EWLF +  +  H  K             S  LWP++ YLP++D+ ALP+ +PF
Sbjct: 231  DEEWLFGTLKQERHGNKRLNARHDISCRESS-TLWPRAHYLPESDVYALPYTIPF 284


>ref|XP_006386168.1| hypothetical protein POPTR_0002s02070g [Populus trichocarpa]
            gi|550344098|gb|ERP63965.1| hypothetical protein
            POPTR_0002s02070g [Populus trichocarpa]
          Length = 290

 Score =  110 bits (276), Expect = 5e-21
 Identities = 76/235 (32%), Positives = 113/235 (48%)
 Frame = -2

Query: 1215 KAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSNSLGSIIKIRLP 1036
            K + E+ E+S LTEEH +PVC +           SNKR++  SS    +   ++ +IRLP
Sbjct: 69   KEKREEAEKSGLTEEHNEPVCLQNVCYLSDDGIRSNKRRKLDSSTTTDDKPRNVFRIRLP 128

Query: 1035 SKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRT 856
              +   PD S N   LCSTSG  D  V  + EI   + + TV   +    +  +  P   
Sbjct: 129  LTRHKEPDVSLNSKGLCSTSGGAD-SVSGQSEIVRLSDQETVNSKAGELASPPENIPCS- 186

Query: 855  DQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWVPSELHTLVNDKD 676
                  S S ++ES+    +  +S        T + + QY  LVE+WVP  L   + D D
Sbjct: 187  ------SVSDKLESS----VSETSWFRFHDRKTLKADSQYKGLVEDWVPPPLQFELKDSD 236

Query: 675  DQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
            D+EWLF +  +  H  K             S  LWP++ YLP++D+ ALP+ +PF
Sbjct: 237  DEEWLFGTLKQERHGNKRLNARHDISCRESS-TLWPRAHYLPESDVYALPYTIPF 290


>ref|XP_012073759.1| PREDICTED: uncharacterized protein LOC105635312 [Jatropha curcas]
            gi|317106597|dbj|BAJ53105.1| JHL20J20.12 [Jatropha
            curcas] gi|643728958|gb|KDP36895.1| hypothetical protein
            JCGZ_08186 [Jatropha curcas]
          Length = 307

 Score =  110 bits (274), Expect = 8e-21
 Identities = 79/236 (33%), Positives = 124/236 (52%), Gaps = 1/236 (0%)
 Frame = -2

Query: 1215 KAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSNSLGSIIKIRLP 1036
            K + E+ ERS LTEEH+QPVCS+       ST +S+KRKR   S + + S G+II+IRLP
Sbjct: 83   KVQEEEAERSGLTEEHDQPVCSQSLCYSPDSTRSSDKRKRDDLSYNITKSSGNIIRIRLP 142

Query: 1035 SKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRT 856
             +K    D S++   + S+S + DF  Q K  I +   +      S+   N+       +
Sbjct: 143  LQKHREVDASTSGEHVRSSSRKSDFLAQ-KQIITVPDKEQPSSINSKTGINI-------S 194

Query: 855  DQELVFSASRQME-SAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWVPSELHTLVNDK 679
            D  +   A+ + +  + + ++ T+S  ++ +   Q  E  Y +L+E+WVP  L    N+ 
Sbjct: 195  DPIVTPCANLEADKDSVRKRVITASGVSSRVRGVQNAESLYKDLLEDWVPLPLGCDQNNI 254

Query: 678  DDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
             DQEWLF +K +  H                S  LWP ++YLP+A++ ALP+ VPF
Sbjct: 255  GDQEWLFGTKKQEKHKR---LKSQCDEPCHGSSTLWPCARYLPEAEVYALPYTVPF 307


>ref|XP_010029657.1| PREDICTED: uncharacterized protein LOC104419639 [Eucalyptus grandis]
            gi|629090353|gb|KCW56606.1| hypothetical protein
            EUGRSUZ_I02328 [Eucalyptus grandis]
          Length = 315

 Score =  109 bits (272), Expect = 1e-20
 Identities = 82/249 (32%), Positives = 124/249 (49%), Gaps = 3/249 (1%)
 Frame = -2

Query: 1248 DSRGGLLHKGKKAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSN 1069
            D + G   K +K + E LE+S+LTEEH +PV S        ST NS+K+++     DG  
Sbjct: 76   DQKAGDHRKKRKHDTEHLEKSNLTEEHGKPVNS---LNSTDSTMNSSKKQKQILPPDGGL 132

Query: 1068 SLGSIIKIRLPSKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRAS---KGTVCPTS 898
            +  SII+IRLP ++    +   +  Q CS   R D     K E A R+S   +  +C TS
Sbjct: 133  NPASIIRIRLPLQRHKDLEMLPSGEQPCSAPVRTDVVDHEKHEHAPRSSTDRREHLCSTS 192

Query: 897  QGTHNLVQGFPTRTDQELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVEN 718
                   +G  ++        +S   E+++Q K GTSS+ +       R E++Y NL+EN
Sbjct: 193  SSAG---EGTASKLGLMEQCPSSGVAEASSQ-KNGTSSLPSLDDRGLSRSEIKYRNLIEN 248

Query: 717  WVPSELHTLVNDKDDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADI 538
            WV    H+   D DDQ+WLF  K   ++ +              S + WP+  YLP+ D+
Sbjct: 249  WVAPSFHSGCADLDDQDWLFGRKQ--LNCDAGNCKADYDGSTYGSPSPWPRMHYLPEVDM 306

Query: 537  CALPFVVPF 511
             ALP+ VP+
Sbjct: 307  YALPYTVPY 315


>ref|XP_011034990.1| PREDICTED: DNA ligase 1 [Populus euphratica]
          Length = 295

 Score =  107 bits (266), Expect = 7e-20
 Identities = 74/236 (31%), Positives = 113/236 (47%), Gaps = 1/236 (0%)
 Frame = -2

Query: 1215 KAEIEQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSNSLGSIIKIRLP 1036
            K + E+ E+S LTEEH +PVC +           SNKR++   S    +   ++ +IRLP
Sbjct: 71   KEKREETEKSGLTEEHNEPVCLQNVCYLSDDGIRSNKRRKLDPSTTTDDKPRNVFRIRLP 130

Query: 1035 SKKKNVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRT 856
              +   PD S N   LCSTSG  D  V  + EI   + + TV   +    +L +  P  +
Sbjct: 131  LTRHKEPDVSLNSKGLCSTSGGAD-SVSGQSEIVRLSDQETVNSKAGELASLPKNIPCSS 189

Query: 855  D-QELVFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWVPSELHTLVNDK 679
               +L  S S + E++  G              T + + QY  L+E+WVP  L   + D 
Sbjct: 190  VLDKLESSISHESETSRFGFHDHK---------TLKADSQYKGLIEDWVPPPLQFELKDS 240

Query: 678  DDQEWLFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
            DD+EWLF +  +  H  K             S +L P++ YLP++D+ ALP+ +PF
Sbjct: 241  DDEEWLFGTLKQESHGNKRLNARHDILCRESSTSL-PRAHYLPESDVYALPYTIPF 295


>ref|XP_007017860.1| JHL20J20.12 protein, putative [Theobroma cacao]
            gi|508723188|gb|EOY15085.1| JHL20J20.12 protein, putative
            [Theobroma cacao]
          Length = 289

 Score =  103 bits (256), Expect = 1e-18
 Identities = 81/231 (35%), Positives = 110/231 (47%)
 Frame = -2

Query: 1203 EQLERSSLTEEHEQPVCSRIPXXXXXSTENSNKRKRHTSSIDGSNSLGSIIKIRLPSKKK 1024
            EQL  S LTEEHE PVC          T+NSNKRKR T S       GSI KIR   KK 
Sbjct: 76   EQLGNSDLTEEHEPPVC-----YLSDGTQNSNKRKRETPSSSECRVNGSI-KIRFSFKKP 129

Query: 1023 NVPDTSSNKPQLCSTSGRVDFPVQRKDEIALRASKGTVCPTSQGTHNLVQGFPTRTDQEL 844
               D S  + ++CSTSGR D   Q    IA      +    +  TH   Q   T  +Q+L
Sbjct: 130  RESDASLCEERVCSTSGRADCSTQ---PIAQEQPDPSNQKENIITHVPEQKITTVLEQKL 186

Query: 843  VFSASRQMESAAQGKLGTSSVANAVLTPTQRLELQYNNLVENWVPSELHTLVNDKDDQEW 664
                 R+ +  + G   TS   N +    ++  LQY  L+E+ +P  L    +D  D +W
Sbjct: 187  WRDNERKQQIPSSG---TSVFGNKM----KKAALQYKTLLEDLMPLPLQLQNHDDYDDDW 239

Query: 663  LFQSKDKGVHPEKXXXXXXXXXXXXXSLALWPQSKYLPDADICALPFVVPF 511
            LF+SK +G H  +             + +  P++ +LPD +I ALP+ VPF
Sbjct: 240  LFKSKQQGKHAGERSKVDDDVRCPTIATSC-PRAHFLPDVEIYALPYTVPF 289

