BLASTX nr result

ID: Papaver29_contig00000095 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver29_contig00000095
         (1973 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010652946.1| PREDICTED: filament-like plant protein [Viti...   572   e-160
ref|XP_010242807.1| PREDICTED: filament-like plant protein isofo...   570   e-159
ref|XP_010242801.1| PREDICTED: filament-like plant protein isofo...   570   e-159
ref|XP_010243856.1| PREDICTED: filament-like plant protein [Nelu...   569   e-159
emb|CAN83687.1| hypothetical protein VITISV_031800 [Vitis vinifera]   566   e-158
ref|XP_007019074.1| Filament-like plant protein, putative isofor...   536   e-149
ref|XP_007048554.1| Uncharacterized protein isoform 2, partial [...   527   e-146
ref|XP_007048553.1| Uncharacterized protein isoform 1 [Theobroma...   527   e-146
gb|KHG29921.1| Filament-like plant protein [Gossypium arboreum]       516   e-143
ref|XP_012455840.1| PREDICTED: filament-like plant protein isofo...   516   e-143
ref|XP_010664206.1| PREDICTED: filament-like plant protein [Viti...   513   e-142
ref|XP_010025716.1| PREDICTED: filament-like plant protein 3 [Eu...   502   e-139
ref|XP_012078241.1| PREDICTED: filament-like plant protein isofo...   499   e-138
ref|XP_012078245.1| PREDICTED: filament-like plant protein isofo...   499   e-138
ref|XP_011031090.1| PREDICTED: filament-like plant protein 3 [Po...   498   e-138
ref|XP_010242808.1| PREDICTED: filament-like plant protein isofo...   495   e-137
gb|KHF97687.1| Filament-like plant protein [Gossypium arboreum] ...   485   e-134
ref|XP_012455846.1| PREDICTED: filament-like plant protein isofo...   484   e-133
ref|XP_010093113.1| hypothetical protein L484_007922 [Morus nota...   481   e-132
ref|XP_012446394.1| PREDICTED: filament-like plant protein isofo...   478   e-132

>ref|XP_010652946.1| PREDICTED: filament-like plant protein [Vitis vinifera]
            gi|731397640|ref|XP_010652947.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
            gi|731397642|ref|XP_010652948.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
            gi|731397644|ref|XP_010652949.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
          Length = 672

 Score =  572 bits (1475), Expect = e-160
 Identities = 334/583 (57%), Positives = 412/583 (70%), Gaps = 4/583 (0%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTSKSA   EEVND++++LTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENE 
Sbjct: 47   PEVTSKSAPVDEEVNDSVKSLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEV 106

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
             + KQQLE A QKNSALEDRVGHLDGALKEC+RQLRQA+EEQEQKIHEAVV++THEWEST
Sbjct: 107  FSLKQQLEAAAQKNSALEDRVGHLDGALKECLRQLRQAREEQEQKIHEAVVKRTHEWEST 166

Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTD--LYPKLEAAEKENSTLKLELSAQAEELEIMT 1439
            K+ELESQ+ E++ +L  AKAE  +++ D  L  KL AAEKEN+ LKL+L ++ EELEI T
Sbjct: 167  KSELESQIVEIQAQLQTAKAE-TVATVDPGLELKLGAAEKENAALKLQLLSREEELEIRT 225

Query: 1438 LERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICV 1262
            +E++LS QAAE+A+KQ LESI+KVAKLEAECRRL+A ARKA++ ND KS T   +SS+CV
Sbjct: 226  IEQELSTQAAETASKQNLESIKKVAKLEAECRRLKAMARKASSANDHKSIT---ASSVCV 282

Query: 1261 ESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSK 1082
            ES TDSQSD G+RLLA+E      +D      ++ NE EP  SDSWAS LI ELD+FK++
Sbjct: 283  ESLTDSQSDSGERLLALE------IDTRKMTGLDTNECEPSRSDSWASGLIQELDRFKNE 336

Query: 1081 EKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEK-GQDRDQLK 905
            + +            ++  MDDFLEMERLAALPETE  S        +D+  G     LK
Sbjct: 337  KPLVKN---LMAPSVELDLMDDFLEMERLAALPETENRSRCLESGAISDKHIGGSESPLK 393

Query: 904  ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725
            A L+AMI RT                      E QN+L TS+ +L+E EEKL+ELQ ++ 
Sbjct: 394  AQLEAMIDRTAELEEKLEKMEAEKMELDMALSECQNQLETSQGRLKEVEEKLVELQTQLA 453

Query: 724  LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545
            L + +K+  E EI++ N KR+  ES+++ +E E+ T+ +KV SLE +VE ERA+SA    
Sbjct: 454  LASESKRNAEEEIQTTNAKREVAESRLIAVEAEIKTMLSKVLSLEEEVEKERALSAEAAS 513

Query: 544  XXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSL 365
                      + + ETE R   +SNGELKIKQ+KELAVAA KLAECQKTIASLG+QLKSL
Sbjct: 514  KCRKFEDELSRMKRETELRNLASSNGELKIKQEKELAVAASKLAECQKTIASLGRQLKSL 573

Query: 364  ATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY 236
            ATLED ++DS EKPL    EG   L   K   E W LH  ++Y
Sbjct: 574  ATLEDLLLDS-EKPLQPMSEG---LHHPKDGAEQWTLHPGNSY 612


>ref|XP_010242807.1| PREDICTED: filament-like plant protein isoform X2 [Nelumbo nucifera]
          Length = 678

 Score =  570 bits (1470), Expect = e-159
 Identities = 338/601 (56%), Positives = 419/601 (69%), Gaps = 6/601 (0%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PE+TSK   S EEVNDN+++LT+KL+AAL NISAKEDLVKQHAKVAEEAVSGWEKAENE 
Sbjct: 51   PEITSKVTASSEEVNDNVKSLTDKLAAALSNISAKEDLVKQHAKVAEEAVSGWEKAENEV 110

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
             A KQ+LE+A QKNS LEDRV HLDGALKECVRQLRQA+EEQEQKIHEAVV+KT EWES 
Sbjct: 111  VALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAREEQEQKIHEAVVEKTKEWESV 170

Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            K ELESQ+  L+ +++AAK E   +S DL  KLE+AEK+N+ LKLEL ++ EELEI TLE
Sbjct: 171  KLELESQVVNLQSQVEAAKLEAAANS-DLCSKLESAEKKNAALKLELLSRVEELEIRTLE 229

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKA-NNNDQKSFTIASSSSICVES 1256
            RDLS Q AE+A+KQ LESI+KVAKLEAECRRLRA +RKA + ND +S T   +SS  VES
Sbjct: 230  RDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPSANDHRSVT---ASSFYVES 286

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
             TDSQSD G+RLL +E D  HK+     +++ELN+ E   SDSWASALI ELDQFK  + 
Sbjct: 287  LTDSQSDSGERLLGMEID-THKM-----SSMELNDGEASYSDSWASALIAELDQFKQDKA 340

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
            +  GR L      +I  MDDFLEMERLAALPETE+            ++G+    LKA+L
Sbjct: 341  I--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGDPEPVAVPDQIDRGE--SSLKAEL 395

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            + MIQR+                      E Q++L  S +QL+  EEKL+ELQ  + L  
Sbjct: 396  ETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVELQRCLDLAN 455

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536
              KQ TE ++E+ NT+++ +ES+++  + E+  LR KV SLE ++E ER +S        
Sbjct: 456  NLKQTTEEKLETINTQKEVIESRLVGADAEIRALRGKVGSLESEIEKERTLSEEIVVKCR 515

Query: 535  XXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLATL 356
                   K++ E E  RA+ SNGELKIKQ+KELAVAAGKL ECQKTIASLG+QLKSLATL
Sbjct: 516  KLEDELTKKKHEAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLGRQLKSLATL 575

Query: 355  EDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLSR 191
            EDF++D  EKPLD+ V      +P    G+ WKLHSNDA+     + +++ A DGSG S 
Sbjct: 576  EDFLIDY-EKPLDLTVG-----SPIPKGGDLWKLHSNDAHLPKAEAYSSKIAGDGSGPST 629

Query: 190  N 188
            N
Sbjct: 630  N 630


>ref|XP_010242801.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera]
            gi|720083139|ref|XP_010242802.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083142|ref|XP_010242803.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083146|ref|XP_010242804.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083149|ref|XP_010242805.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083152|ref|XP_010242806.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
          Length = 679

 Score =  570 bits (1470), Expect = e-159
 Identities = 338/601 (56%), Positives = 419/601 (69%), Gaps = 6/601 (0%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PE+TSK   S EEVNDN+++LT+KL+AAL NISAKEDLVKQHAKVAEEAVSGWEKAENE 
Sbjct: 52   PEITSKVTASSEEVNDNVKSLTDKLAAALSNISAKEDLVKQHAKVAEEAVSGWEKAENEV 111

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
             A KQ+LE+A QKNS LEDRV HLDGALKECVRQLRQA+EEQEQKIHEAVV+KT EWES 
Sbjct: 112  VALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAREEQEQKIHEAVVEKTKEWESV 171

Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            K ELESQ+  L+ +++AAK E   +S DL  KLE+AEK+N+ LKLEL ++ EELEI TLE
Sbjct: 172  KLELESQVVNLQSQVEAAKLEAAANS-DLCSKLESAEKKNAALKLELLSRVEELEIRTLE 230

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKA-NNNDQKSFTIASSSSICVES 1256
            RDLS Q AE+A+KQ LESI+KVAKLEAECRRLRA +RKA + ND +S T   +SS  VES
Sbjct: 231  RDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPSANDHRSVT---ASSFYVES 287

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
             TDSQSD G+RLL +E D  HK+     +++ELN+ E   SDSWASALI ELDQFK  + 
Sbjct: 288  LTDSQSDSGERLLGMEID-THKM-----SSMELNDGEASYSDSWASALIAELDQFKQDKA 341

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
            +  GR L      +I  MDDFLEMERLAALPETE+            ++G+    LKA+L
Sbjct: 342  I--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGDPEPVAVPDQIDRGE--SSLKAEL 396

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            + MIQR+                      E Q++L  S +QL+  EEKL+ELQ  + L  
Sbjct: 397  ETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVELQRCLDLAN 456

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536
              KQ TE ++E+ NT+++ +ES+++  + E+  LR KV SLE ++E ER +S        
Sbjct: 457  NLKQTTEEKLETINTQKEVIESRLVGADAEIRALRGKVGSLESEIEKERTLSEEIVVKCR 516

Query: 535  XXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLATL 356
                   K++ E E  RA+ SNGELKIKQ+KELAVAAGKL ECQKTIASLG+QLKSLATL
Sbjct: 517  KLEDELTKKKHEAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLGRQLKSLATL 576

Query: 355  EDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLSR 191
            EDF++D  EKPLD+ V      +P    G+ WKLHSNDA+     + +++ A DGSG S 
Sbjct: 577  EDFLIDY-EKPLDLTVG-----SPIPKGGDLWKLHSNDAHLPKAEAYSSKIAGDGSGPST 630

Query: 190  N 188
            N
Sbjct: 631  N 631


>ref|XP_010243856.1| PREDICTED: filament-like plant protein [Nelumbo nucifera]
            gi|720086488|ref|XP_010243857.1| PREDICTED: filament-like
            plant protein [Nelumbo nucifera]
            gi|720086491|ref|XP_010243858.1| PREDICTED: filament-like
            plant protein [Nelumbo nucifera]
          Length = 675

 Score =  569 bits (1467), Expect = e-159
 Identities = 344/585 (58%), Positives = 415/585 (70%), Gaps = 6/585 (1%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTSK     EEV+D++++LTEKL+AAL NISAKEDLVKQHAKVAEEAVSGWEKAE E 
Sbjct: 52   PEVTSKVTNRSEEVSDSVKSLTEKLAAALSNISAKEDLVKQHAKVAEEAVSGWEKAEKEV 111

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
             + KQQLE AVQKNS+LEDRVGHLDGALKECVRQLRQA+EEQEQKIHEAV +K  EWES 
Sbjct: 112  VSLKQQLEAAVQKNSSLEDRVGHLDGALKECVRQLRQAREEQEQKIHEAVAKKASEWESA 171

Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            K ELE+Q+ EL+ +++AAK E   S + +  KLEAAEKEN+ LKL+L A+ EELEI TLE
Sbjct: 172  KFELENQVVELQTQVEAAKLE-AASDSGIQLKLEAAEKENAALKLQLLARIEELEIRTLE 230

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256
            RDLS Q AESA+KQ LESI+KVA+LEAECRRLRA +RKA   ND KS    ++SSI VES
Sbjct: 231  RDLSTQTAESASKQHLESIKKVARLEAECRRLRAISRKAALANDHKS---VAASSIYVES 287

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
             TDSQSD G+RLL +E D       ISS  +ELN+ EP CSDSWASALI ELDQFK  + 
Sbjct: 288  LTDSQSDSGERLLGVETD----TRKISS--LELNDCEPSCSDSWASALIAELDQFKQDKA 341

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETET---ESAVASCQQHNDEKGQDRDQLK 905
            +  GR L      +I  MDDFLEMERLAALPETE+   E   AS     D+    ++ +K
Sbjct: 342  I--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGRPEPVAAS-----DQIDSGQNSIK 393

Query: 904  ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725
            A+L+AMI RT                      E+Q RL  S++QL E EEKL+ELQ ++ 
Sbjct: 394  AELEAMIHRTAELEEKLEKMEEEKAALDMALAESQGRLEMSQNQLWEAEEKLVELQRQLD 453

Query: 724  LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545
            L    KQA E++IE++NT+R+ +ES ++  + EV  LR KV SLE ++E ERA+SA    
Sbjct: 454  LANNLKQAAEVKIEASNTQRELVESHLVSADAEVWALRTKVCSLEAEIEKERALSAEVAA 513

Query: 544  XXXXXXXXXXKRRFETERRRA--TNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLK 371
                       +R E E RRA  + SN ELK KQ+KELAVAAGKL+ECQKTIASLG+QLK
Sbjct: 514  KCKKLEDELLGKRNEAELRRASISKSNDELKTKQEKELAVAAGKLSECQKTIASLGRQLK 573

Query: 370  SLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY 236
            +LATLEDF++DS EKPLD++   GS   P    GE WKLHSN+AY
Sbjct: 574  ALATLEDFLIDS-EKPLDLS---GS---PIPKIGESWKLHSNEAY 611


>emb|CAN83687.1| hypothetical protein VITISV_031800 [Vitis vinifera]
          Length = 749

 Score =  566 bits (1458), Expect = e-158
 Identities = 334/583 (57%), Positives = 410/583 (70%), Gaps = 4/583 (0%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTSK+A   EEVND++++LTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENE 
Sbjct: 24   PEVTSKAAPVDEEVNDSVKSLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEV 83

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
             + KQQLE   QKNS LEDRVGHLDGALKEC+RQLRQA+EEQEQKIHEAVV++THEWEST
Sbjct: 84   FSLKQQLEAXXQKNSXLEDRVGHLDGALKECLRQLRQAREEQEQKIHEAVVKRTHEWEST 143

Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTD--LYPKLEAAEKENSTLKLELSAQAEELEIMT 1439
            K+ELESQ+ E++ +L  AKAE  +++ D  L  KL AAEKEN+ LKL+L ++ EELEI T
Sbjct: 144  KSELESQIVEIQAQLQTAKAE-XVATVDPGLELKLGAAEKENAALKLQLLSREEELEIRT 202

Query: 1438 LERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICV 1262
            +E++LS QAAE+A+KQ LESI+KVAKLEAECRRL+A ARKA++ ND KS T   +SS+CV
Sbjct: 203  IEQELSTQAAETASKQNLESIKKVAKLEAECRRLKAMARKASSANDHKSXT---ASSVCV 259

Query: 1261 ESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSK 1082
            ES TDSQSD G+RLLA+E      +D      ++ NE EP  SDSWAS LI ELD+FK+ 
Sbjct: 260  ESLTDSQSDSGERLLALE------IDTRKMTGLDTNECEPSRSDSWASGLIQELDRFKN- 312

Query: 1081 EKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEK-GQDRDQLK 905
            EK      +      D+  MDDFLEMERLAALPETE  S        +D+  G     LK
Sbjct: 313  EKPLVKNLMAPSVEXDL--MDDFLEMERLAALPETENRSRCLESGAISDKHIGGSESPLK 370

Query: 904  ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725
            A L+AMI RT                      E QN+L TS+ +L+E EEKL+ELQ ++ 
Sbjct: 371  AQLEAMIDRTAELEEKLEKMEAEKMELDMALSECQNQLETSQGRLKEVEEKLVELQTQLA 430

Query: 724  LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545
            L + +K+  E EI++ N KR+  ES+++ +E E+ T+ +KV SLE +VE ERA+SA    
Sbjct: 431  LASESKRNAEEEIQATNAKREVAESRLIXVEAEIKTMLSKVLSLEEEVEKERALSAEAAS 490

Query: 544  XXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSL 365
                      + + ETE R   +SNGELKIKQ+KELAVAA KLAECQKTIASLG+QLKSL
Sbjct: 491  KCRKFEDELSRMKRETELRNLASSNGELKIKQEKELAVAASKLAECQKTIASLGRQLKSL 550

Query: 364  ATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY 236
            ATLED ++DS EKPL    EG   L   K   E W LH  ++Y
Sbjct: 551  ATLEDLLLDS-EKPLQPMSEG---LHHPKDGAEQWTLHPGNSY 589


>ref|XP_007019074.1| Filament-like plant protein, putative isoform 1 [Theobroma cacao]
            gi|508724402|gb|EOY16299.1| Filament-like plant protein,
            putative isoform 1 [Theobroma cacao]
          Length = 713

 Score =  536 bits (1382), Expect = e-149
 Identities = 322/614 (52%), Positives = 405/614 (65%), Gaps = 34/614 (5%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEV+SK++ + E+VND+++ LTEKLSAAL+N+SAKEDLVKQHAKVAEEA++GWEKAENE 
Sbjct: 51   PEVSSKASANCEDVNDSIKRLTEKLSAALVNVSAKEDLVKQHAKVAEEAIAGWEKAENEV 110

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
               KQ+LE AVQ+NSALEDRV HLDGALKECVRQLRQA+EEQEQKI+EAV + T +WE+T
Sbjct: 111  VLLKQKLEAAVQQNSALEDRVSHLDGALKECVRQLRQAREEQEQKINEAVAKTTRDWETT 170

Query: 1612 KTELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K ELESQ  EL+ K +A K+EP    S DL+ K+EA EKENS LKLELS+Q+EE EI T+
Sbjct: 171  KFELESQFLELQDKAEAVKSEPPPHFSPDLWHKIEALEKENSALKLELSSQSEEFEIRTI 230

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259
            ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A A K++  ND KS    ++SSI VE
Sbjct: 231  ERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIACKSSLVNDHKS---PAASSIYVE 287

Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079
            S TDSQSD G+RL  +E D  HK+  +     E N+ EP CSDSWASALI ELDQFK+++
Sbjct: 288  SVTDSQSDSGERLNVVEIDT-HKMSGL-----EANKGEPSCSDSWASALIAELDQFKNEK 341

Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899
             ++           +I  MDDFLEMERLAALPE ++E+     +    +       LKA+
Sbjct: 342  VISRN---LPSSSIEIDLMDDFLEMERLAALPEIKSENQFLESKATARQSNDGDSSLKAE 398

Query: 898  LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719
            L+AMI RT                      ++Q  L  S  QLR+TE KL EL+ +  + 
Sbjct: 399  LEAMIHRTAELEQKLEKIELEKAELEIALAKSQESLEASALQLRDTETKLEELEREFHMA 458

Query: 718  AVAKQATELEIESANTKRKAM--------------------------------ESQILVM 635
              AKQ  E ++ S  T  + M                                ESQ++ +
Sbjct: 459  NEAKQHLESQLSSMETDAETMSSKIDSLKAEIEKEMALSAEISVNATESKQLLESQLISI 518

Query: 634  EEEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELKI 455
            E E  T+ AK+DSLE +VE ERA+SA              ++R E E ++  NSN E+KI
Sbjct: 519  EAEARTMSAKIDSLETEVEKERALSAQITVKCQELEEELLRKRQEAELQQTANSNVEVKI 578

Query: 454  KQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKS 275
            KQ+ +LAVAAGKLAECQKTIASLGQQLKSLATLEDF++D+   P      GGSL+  +K+
Sbjct: 579  KQE-DLAVAAGKLAECQKTIASLGQQLKSLATLEDFLIDTTSIP--EFSRGGSLV--SKA 633

Query: 274  SGEYWKLHSNDAYS 233
             GE WKLHSN+ YS
Sbjct: 634  GGEPWKLHSNETYS 647


>ref|XP_007048554.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508700815|gb|EOX92711.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 649

 Score =  527 bits (1357), Expect = e-146
 Identities = 307/565 (54%), Positives = 396/565 (70%), Gaps = 8/565 (1%)
 Frame = -2

Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790
            EVTSK+    EEVNDN+++LTEKLSAAL+NISAKEDLVKQHAKVAEEAVSGWEKAE +  
Sbjct: 49   EVTSKAVPMDEEVNDNVKSLTEKLSAALINISAKEDLVKQHAKVAEEAVSGWEKAEKDVL 108

Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610
            A KQQL+ A++K +ALEDRVGHLDGALKECVRQLRQA+EEQE++IHEAV +K HEWES+K
Sbjct: 109  ALKQQLDAAIKKTAALEDRVGHLDGALKECVRQLRQAREEQERRIHEAVAKKCHEWESSK 168

Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            +ELESQL +L+ +L   K+E   S   DL+PKLEA EKENS LKL+L ++AEEL++  +E
Sbjct: 169  SELESQLVDLKAQLQTTKSETAASVDPDLHPKLEAFEKENSALKLQLLSRAEELQLRIIE 228

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256
            RDLS QAAE+A+KQ LESI+K+AKLEAECR+L+  ARKA+  NDQKS+   ++SSICV+S
Sbjct: 229  RDLSTQAAETASKQHLESIKKLAKLEAECRKLKVIARKASPANDQKSY---AASSICVDS 285

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
            FTDSQSD GDRLLA+E ++         + +E+NE E   S+SW SALITELDQF++++ 
Sbjct: 286  FTDSQSDSGDRLLAVETNMR------KMSGLEMNECETSRSESWTSALITELDQFRNEKA 339

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
            V  GR +      +I  MDDFLEMERLAALP+TE+ +        +D+     + LKA++
Sbjct: 340  V--GRNI-MAPSVEINLMDDFLEMERLAALPDTESATGFNEAGLVSDQTSTVENPLKAEV 396

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            +  I R                       E+Q +L T ++QLRE E KL +LQ ++ L  
Sbjct: 397  ETFIHRIAELEGKLAMTEAEKLELKLAFTESQKQLETLQNQLREAETKLADLQTQLALAD 456

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSA------X 554
             +KQA E E++ AN  R+  ES+    E EV TL +KV SLE +V  E+A+SA       
Sbjct: 457  NSKQAAEDEVKVANMNREVAESRFRDAEIEVKTLLSKVTSLEEEVGREQALSARNVSKCK 516

Query: 553  XXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374
                         + R + ER+   + N ELK +QDKELA+AA KLAECQKTIASLG+QL
Sbjct: 517  ELEDELSKLKREAELRLDAERQLVASYNEELKAQQDKELAIAASKLAECQKTIASLGRQL 576

Query: 373  KSLATLEDFMMDSDEKPLDINVEGG 299
            KSLATL+DF++D D KPL++ V+GG
Sbjct: 577  KSLATLDDFLIDPD-KPLEL-VDGG 599


>ref|XP_007048553.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700814|gb|EOX92710.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 675

 Score =  527 bits (1357), Expect = e-146
 Identities = 307/565 (54%), Positives = 396/565 (70%), Gaps = 8/565 (1%)
 Frame = -2

Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790
            EVTSK+    EEVNDN+++LTEKLSAAL+NISAKEDLVKQHAKVAEEAVSGWEKAE +  
Sbjct: 49   EVTSKAVPMDEEVNDNVKSLTEKLSAALINISAKEDLVKQHAKVAEEAVSGWEKAEKDVL 108

Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610
            A KQQL+ A++K +ALEDRVGHLDGALKECVRQLRQA+EEQE++IHEAV +K HEWES+K
Sbjct: 109  ALKQQLDAAIKKTAALEDRVGHLDGALKECVRQLRQAREEQERRIHEAVAKKCHEWESSK 168

Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            +ELESQL +L+ +L   K+E   S   DL+PKLEA EKENS LKL+L ++AEEL++  +E
Sbjct: 169  SELESQLVDLKAQLQTTKSETAASVDPDLHPKLEAFEKENSALKLQLLSRAEELQLRIIE 228

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256
            RDLS QAAE+A+KQ LESI+K+AKLEAECR+L+  ARKA+  NDQKS+   ++SSICV+S
Sbjct: 229  RDLSTQAAETASKQHLESIKKLAKLEAECRKLKVIARKASPANDQKSY---AASSICVDS 285

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
            FTDSQSD GDRLLA+E ++         + +E+NE E   S+SW SALITELDQF++++ 
Sbjct: 286  FTDSQSDSGDRLLAVETNMR------KMSGLEMNECETSRSESWTSALITELDQFRNEKA 339

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
            V  GR +      +I  MDDFLEMERLAALP+TE+ +        +D+     + LKA++
Sbjct: 340  V--GRNI-MAPSVEINLMDDFLEMERLAALPDTESATGFNEAGLVSDQTSTVENPLKAEV 396

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            +  I R                       E+Q +L T ++QLRE E KL +LQ ++ L  
Sbjct: 397  ETFIHRIAELEGKLAMTEAEKLELKLAFTESQKQLETLQNQLREAETKLADLQTQLALAD 456

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSA------X 554
             +KQA E E++ AN  R+  ES+    E EV TL +KV SLE +V  E+A+SA       
Sbjct: 457  NSKQAAEDEVKVANMNREVAESRFRDAEIEVKTLLSKVTSLEEEVGREQALSARNVSKCK 516

Query: 553  XXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374
                         + R + ER+   + N ELK +QDKELA+AA KLAECQKTIASLG+QL
Sbjct: 517  ELEDELSKLKREAELRLDAERQLVASYNEELKAQQDKELAIAASKLAECQKTIASLGRQL 576

Query: 373  KSLATLEDFMMDSDEKPLDINVEGG 299
            KSLATL+DF++D D KPL++ V+GG
Sbjct: 577  KSLATLDDFLIDPD-KPLEL-VDGG 599


>gb|KHG29921.1| Filament-like plant protein [Gossypium arboreum]
          Length = 679

 Score =  516 bits (1329), Expect = e-143
 Identities = 308/571 (53%), Positives = 392/571 (68%), Gaps = 8/571 (1%)
 Frame = -2

Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790
            EVTSK A  V+E ++N+R+LTEKLS AL+NISAKE+LVKQHAKVAEEAVSGWEKAE +  
Sbjct: 49   EVTSK-AVPVDEESNNVRSLTEKLSTALMNISAKEELVKQHAKVAEEAVSGWEKAEKDVV 107

Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610
            A KQQL+ A++KN+ALEDRVGHLDGALKECVRQLRQA+EEQE+KIHEAV +K HEWES+K
Sbjct: 108  ALKQQLDAAMKKNAALEDRVGHLDGALKECVRQLRQAREEQERKIHEAVSKKCHEWESSK 167

Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            +ELESQL  L+ +L+ AK++   S   DL  KL+A EKENS LKL+L ++AEELE   +E
Sbjct: 168  SELESQLLNLKAQLETAKSDAAASVDPDLQLKLDACEKENSALKLQLHSRAEELERRIIE 227

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256
            RDLS QAAE+A+KQ L+SI+K+AKLE ECRRL+A ARKA+  NDQKS+T   +SSICVES
Sbjct: 228  RDLSTQAAETASKQHLDSIKKLAKLEIECRRLKAIARKASPANDQKSYT---ASSICVES 284

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
            FTDSQSD G+RLLA+E D+         N +E+N  +   SD+WASALITELDQF+ KEK
Sbjct: 285  FTDSQSDSGERLLAVETDMQ------KMNGLEMNGCDRSRSDAWASALITELDQFR-KEK 337

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
              G   +      +I  MDDFLEMERLAALP+TE+ S        + +     + LKADL
Sbjct: 338  AVGRNIM--APSVEINLMDDFLEMERLAALPDTESGSGFNDAGPVSYQNSIVENPLKADL 395

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            + ++ R                       E+Q +L T ++QL E E +  ++Q ++ L  
Sbjct: 396  ETLVHRVAELEEKLALTEEEKSEMQIAFTESQKQLKTLQNQLSEAEIRFKDVQTQLALAD 455

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536
             +KQA E E++ AN  R+  ES++   E E+ TL +KV SLE     E+A+S        
Sbjct: 456  NSKQAAEKEVKVANMNRQVAESRLRDAETEIKTLMSKVTSLEEAFGKEQALSTENMNKCK 515

Query: 535  XXXXXXXKRRFETERRR------ATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374
                   K + ET+ RR      A   N ELK++QDKEL++AA K AECQKTIASLGQQL
Sbjct: 516  ELENELSKMKCETKLRREAELQHAAKYNEELKVQQDKELSIAARKFAECQKTIASLGQQL 575

Query: 373  KSLATLEDFMMDSDEKPLDINVEGGSLLTPN 281
            KSLATLEDF++DSD KPL++ V+GG   T N
Sbjct: 576  KSLATLEDFLIDSD-KPLEL-VDGGLKCTGN 604


>ref|XP_012455840.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246344|ref|XP_012455841.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246346|ref|XP_012455843.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246348|ref|XP_012455844.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246350|ref|XP_012455845.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|763804407|gb|KJB71345.1| hypothetical
            protein B456_011G117600 [Gossypium raimondii]
          Length = 679

 Score =  516 bits (1328), Expect = e-143
 Identities = 309/571 (54%), Positives = 392/571 (68%), Gaps = 8/571 (1%)
 Frame = -2

Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790
            EVTSK A  V+E N+N+R+LTEKLSAAL+NISAKE+LVKQHAKVAEEAVSGWEKAE +  
Sbjct: 49   EVTSK-AVPVDEENNNVRSLTEKLSAALMNISAKEELVKQHAKVAEEAVSGWEKAEKDVV 107

Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610
            A KQQL+ A++KN+ALEDRVGHLDGALKECVRQLRQA+EEQE+KIHEAV +K HEWES+K
Sbjct: 108  ALKQQLDAAMKKNAALEDRVGHLDGALKECVRQLRQAREEQERKIHEAVSKKCHEWESSK 167

Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
            +ELESQL  L+ +L+ AK +   S   DL  KL+A EKENS LKL+L ++AEELE   +E
Sbjct: 168  SELESQLLNLKAQLETAKNDTAASVDPDLQLKLDAFEKENSALKLQLHSRAEELERRIIE 227

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256
            RDLS QAAE+A+KQ LESI+K+AKLE ECRRL+A ARKA+  NDQKS+    +SSICVES
Sbjct: 228  RDLSTQAAETASKQHLESIKKLAKLEIECRRLKAIARKASPANDQKSY---PASSICVES 284

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
            FTDSQSD G+RLLA+E D+         N +E+N  +   SD+WASALITELDQF+ KEK
Sbjct: 285  FTDSQSDSGERLLAVETDMQ------KMNGLEMNGCDRSSSDAWASALITELDQFR-KEK 337

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
              G   +      +I  MDDFLEMERLAALP+TE+ S        + +     + LKADL
Sbjct: 338  AVGRNIM--APSVEINLMDDFLEMERLAALPDTESGSGFNDAGPVSYQTSIVENPLKADL 395

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            + ++ R                       E+Q +L T ++QL E E +  ++Q ++ L  
Sbjct: 396  ETLVHRVAELEEKLALTEEEKSEMQIAFTESQKQLKTLQNQLSEAEIRFKDVQTQLALAD 455

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536
             +KQA E E++ AN  R+  ES++   E E+ TL +KV SLE  +  E+A+S        
Sbjct: 456  NSKQAAEKEVKVANMNREVAESRLRDAETEIKTLMSKVTSLEEALGKEQALSTENMNKCK 515

Query: 535  XXXXXXXKRRFETERRR------ATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374
                   K + ET+ R+      A   N ELK++QDKEL++AA K AECQKTIASLGQQL
Sbjct: 516  ELENELSKMKCETKLRQEAELQHAAKYNEELKVQQDKELSIAACKFAECQKTIASLGQQL 575

Query: 373  KSLATLEDFMMDSDEKPLDINVEGGSLLTPN 281
            KSLATLEDF++DSD KPL++ V+GG   T N
Sbjct: 576  KSLATLEDFLIDSD-KPLEL-VDGGLKCTGN 604


>ref|XP_010664206.1| PREDICTED: filament-like plant protein [Vitis vinifera]
            gi|731428065|ref|XP_010664207.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
            gi|731428067|ref|XP_010664208.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
            gi|731428069|ref|XP_010664209.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
          Length = 689

 Score =  513 bits (1321), Expect = e-142
 Identities = 307/616 (49%), Positives = 398/616 (64%), Gaps = 23/616 (3%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTSK ATS +EVNDN+++LTEKLSAALLN+ AK+DLVKQHAKVAEEAV+GWEKAENE 
Sbjct: 51   PEVTSKVATSGDEVNDNVKSLTEKLSAALLNVGAKDDLVKQHAKVAEEAVAGWEKAENEV 110

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
               KQQLE AVQ+N  LEDRV  LDGA+KECVRQLRQA+EEQE+KI EAVV+KT EWEST
Sbjct: 111  VVLKQQLEAAVQENLVLEDRVSRLDGAIKECVRQLRQAREEQEEKISEAVVKKTREWEST 170

Query: 1612 KTELESQLDELRKKLDAAKAEP----------------------KISSTDLYPKLEAAEK 1499
            K ELESQL EL+ ++DAAKAEP                      K        +L+A EK
Sbjct: 171  KFELESQLLELQTQVDAAKAEPPVPFDPDLCHMLQALEKQNSALKYELLSQSEELQALEK 230

Query: 1498 ENSTLKLELSAQAEELEIMTLERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARK 1319
            ENSTLKLEL +Q+EELEI T+ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A ARK
Sbjct: 231  ENSTLKLELLSQSEELEIRTIERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAMARK 290

Query: 1318 ANN-NDQKSFTIASSSSICVESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEP 1142
            +++ +D +S    ++SS+ +ES TDSQSD G++L  ++  L        +++ ++N+ EP
Sbjct: 291  SSSIHDHRS---VAASSLHIESLTDSQSDNGEQLNMVDISLH------QTSSFDVNDCEP 341

Query: 1141 GCSDSWASALITELDQFKSKEKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESA 962
             CSDSWASALI ELDQFK+++ V+           +I  MDDFLEMERLAALP+ E  S 
Sbjct: 342  SCSDSWASALIAELDQFKNEKVVSRN---LPASSIEIDLMDDFLEMERLAALPQAEHGSR 398

Query: 961  VASCQQHNDEKGQDRDQLKADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTS 782
                Q   ++   +   L+A+L+ M  R                        +Q+ +  S
Sbjct: 399  SLESQAVTNQTSNEDSSLRAELETMTHRMAELEEKLEKMEAEKAELEIALTVSQDCIEAS 458

Query: 781  EDQLRETEEKLLELQNKMGLEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKV 602
            + QLRE E KL E+Q               E++ AN  ++A+ESQ++ ME E  T+ A+V
Sbjct: 459  KIQLREAEMKLEEMQK--------------ELDFANESKQALESQLIAMEAEARTMSARV 504

Query: 601  DSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAG 422
            DSLE +++ E AMSA              K++ E + ++A +SN E K+KQ+ ELA+AAG
Sbjct: 505  DSLEAEIKKEHAMSAEIGVKCQELEDELLKKKQELKFQQAASSNSERKVKQE-ELAIAAG 563

Query: 421  KLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSND 242
            KLAECQKTIASLG+QLKSLATLEDF+ D+     ++    G  +    + GE W+LHSND
Sbjct: 564  KLAECQKTIASLGKQLKSLATLEDFLTDAG----NLADFSGKSVISTAAGGETWQLHSND 619

Query: 241  AYSTANQAAEDGSGLS 194
             +     A  D S +S
Sbjct: 620  TFLPRRSA--DSSNMS 633


>ref|XP_010025716.1| PREDICTED: filament-like plant protein 3 [Eucalyptus grandis]
            gi|702451484|ref|XP_010025717.1| PREDICTED: filament-like
            plant protein 3 [Eucalyptus grandis]
            gi|702451488|ref|XP_010025718.1| PREDICTED: filament-like
            plant protein 3 [Eucalyptus grandis]
            gi|629096439|gb|KCW62434.1| hypothetical protein
            EUGRSUZ_H05077 [Eucalyptus grandis]
          Length = 621

 Score =  502 bits (1292), Expect = e-139
 Identities = 298/572 (52%), Positives = 379/572 (66%), Gaps = 8/572 (1%)
 Frame = -2

Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790
            EVTSK A + EEV + +R L++KLSAALLNISAKE+LVKQHAKVAEEAVSGWEKAENE +
Sbjct: 48   EVTSKVAVADEEVGEGVRTLSDKLSAALLNISAKEELVKQHAKVAEEAVSGWEKAENEVS 107

Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610
              K+QLE A Q+NS LEDR+ HLDGALKECVRQLRQ +EEQEQKI E VV+KTHEWESTK
Sbjct: 108  VLKKQLEVATQRNSTLEDRISHLDGALKECVRQLRQVREEQEQKIQETVVKKTHEWESTK 167

Query: 1609 TELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433
             ELE++L  +  +L AAK+E   +  +DL PKL+AAEKEN  LK ++ + +EELE+  +E
Sbjct: 168  AELETKLSNVHAQLQAAKSEASSVICSDLGPKLDAAEKENVALKAKVLSMSEELELRIIE 227

Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256
            RDLS QAAE+A+KQ LESI+KVA+LEAECRRLRA +RKA+  ND KSF   S+SS+CVES
Sbjct: 228  RDLSTQAAETASKQHLESIKKVARLEAECRRLRAMSRKASAANDLKSF---SASSVCVES 284

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
            F DSQSD GDRLLA+END+        ++ +E ++ EP  S+SWASALITELD FK KEK
Sbjct: 285  FADSQSDVGDRLLAVENDVQ------KASCLEPSDCEPCHSESWASALITELDHFK-KEK 337

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896
              G   +      D+  MDDFLEMERLAALP+ E+ S         D+   D   LK++L
Sbjct: 338  SFGKSLMVSSGELDL--MDDFLEMERLAALPDAESGSCSHGMGPSLDQNRSDEVTLKSEL 395

Query: 895  DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716
            +AMI RT                      + Q +L TS  QL E E +L EL NK+    
Sbjct: 396  EAMINRTAELEEELEKKEEEKEKLEMALSQCQKQLETSWSQLNEVEMRLTELNNKLSAAQ 455

Query: 715  VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536
             +KQA E E    + + +  ES+++ +EEEV  L   V  L+ +VE ERA+SA       
Sbjct: 456  KSKQAAEEEARITHERMEVTESRLMDVEEEVKNLLLNVKLLQEEVERERALSAENEAKCQ 515

Query: 535  XXXXXXXKRR------FETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374
                     +       E E +R   S+ +LK+KQ++ELAVAA + AECQKTI+SL QQL
Sbjct: 516  ELEDENLNMKRDAELQHEIELQRVAVSDEQLKVKQEQELAVAASRFAECQKTISSLAQQL 575

Query: 373  KSLATLEDFMMDSDEKPLDINVEGGSLLTPNK 278
            KSLA ++DF  DSD  P+  +++ G L   N+
Sbjct: 576  KSLAAVDDFFADSD--PMCNHMDEGLLSPENE 605


>ref|XP_012078241.1| PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas]
            gi|802635970|ref|XP_012078242.1| PREDICTED: filament-like
            plant protein isoform X1 [Jatropha curcas]
            gi|802636033|ref|XP_012078243.1| PREDICTED: filament-like
            plant protein isoform X1 [Jatropha curcas]
          Length = 682

 Score =  499 bits (1285), Expect = e-138
 Identities = 302/602 (50%), Positives = 403/602 (66%), Gaps = 7/602 (1%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTSK+    E+VND++R LTEKLSAAL+N+SAK+DLVKQH+KVAEEAV+GWEKAENE 
Sbjct: 52   PEVTSKTVVRDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENEV 111

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
            AA K+QLE A+Q+N ALEDRV HLDGALKECVRQLRQA+EE E+K++EAV +KT EWES 
Sbjct: 112  AALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWESV 171

Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K+ELE+QL EL+ K +A K+E P     DL+ KLE  EK+N++LKLE+ + +EELE+  +
Sbjct: 172  KSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRII 231

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259
            ERDLS QAAE+A+KQ L+SI+KVAKLEAECRRL+A A K+++ ND K+   + +SS+ VE
Sbjct: 232  ERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKT---SIASSMYVE 288

Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079
            S TDSQSD G+RL A+E D  HK+     + +E ++ EP CSDSWASALI ELDQFK+++
Sbjct: 289  SLTDSQSDSGERLNAVELD-AHKI-----SCLEPSKCEPSCSDSWASALIAELDQFKNEK 342

Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899
             V   R L      +I  MDDFLEMERLA+LPE E+ +  +  +    +       L+A+
Sbjct: 343  AV--NRNL-PASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAE 399

Query: 898  LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719
            L+ MI RT                      +        E  L  + EK  E Q ++G  
Sbjct: 400  LEIMIHRTAELEKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEA 459

Query: 718  AVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXX 539
             +  +    E+  AN  ++ +ESQ++ ME E  T+ +KVDSLE ++E E+ +SA      
Sbjct: 460  ELKMKQLHQELSIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKC 519

Query: 538  XXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLAT 359
                    ++  E E +++ +SNGELKIKQ+ +LAVAAGKLAECQKTIASLG+QLKSLAT
Sbjct: 520  RTLEEELSEKNKEVELQKSASSNGELKIKQE-DLAVAAGKLAECQKTIASLGKQLKSLAT 578

Query: 358  LEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLS 194
            LEDF++D+   P       G  L P K++ E WKLHS+D       S++++ A + SG S
Sbjct: 579  LEDFLIDTASLP---EFTAGGALMP-KATEEPWKLHSSDTLSPKRDSSSSRIASENSGPS 634

Query: 193  RN 188
             N
Sbjct: 635  VN 636


>ref|XP_012078245.1| PREDICTED: filament-like plant protein isoform X2 [Jatropha curcas]
            gi|802636039|ref|XP_012078246.1| PREDICTED: filament-like
            plant protein isoform X2 [Jatropha curcas]
            gi|643723198|gb|KDP32803.1| hypothetical protein
            JCGZ_12095 [Jatropha curcas]
          Length = 681

 Score =  499 bits (1285), Expect = e-138
 Identities = 302/602 (50%), Positives = 403/602 (66%), Gaps = 7/602 (1%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTSK+    E+VND++R LTEKLSAAL+N+SAK+DLVKQH+KVAEEAV+GWEKAENE 
Sbjct: 51   PEVTSKTVVRDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENEV 110

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
            AA K+QLE A+Q+N ALEDRV HLDGALKECVRQLRQA+EE E+K++EAV +KT EWES 
Sbjct: 111  AALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWESV 170

Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K+ELE+QL EL+ K +A K+E P     DL+ KLE  EK+N++LKLE+ + +EELE+  +
Sbjct: 171  KSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRII 230

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259
            ERDLS QAAE+A+KQ L+SI+KVAKLEAECRRL+A A K+++ ND K+   + +SS+ VE
Sbjct: 231  ERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKT---SIASSMYVE 287

Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079
            S TDSQSD G+RL A+E D  HK+     + +E ++ EP CSDSWASALI ELDQFK+++
Sbjct: 288  SLTDSQSDSGERLNAVELD-AHKI-----SCLEPSKCEPSCSDSWASALIAELDQFKNEK 341

Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899
             V   R L      +I  MDDFLEMERLA+LPE E+ +  +  +    +       L+A+
Sbjct: 342  AV--NRNL-PASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAE 398

Query: 898  LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719
            L+ MI RT                      +        E  L  + EK  E Q ++G  
Sbjct: 399  LEIMIHRTAELEKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEA 458

Query: 718  AVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXX 539
             +  +    E+  AN  ++ +ESQ++ ME E  T+ +KVDSLE ++E E+ +SA      
Sbjct: 459  ELKMKQLHQELSIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKC 518

Query: 538  XXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLAT 359
                    ++  E E +++ +SNGELKIKQ+ +LAVAAGKLAECQKTIASLG+QLKSLAT
Sbjct: 519  RTLEEELSEKNKEVELQKSASSNGELKIKQE-DLAVAAGKLAECQKTIASLGKQLKSLAT 577

Query: 358  LEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLS 194
            LEDF++D+   P       G  L P K++ E WKLHS+D       S++++ A + SG S
Sbjct: 578  LEDFLIDTASLP---EFTAGGALMP-KATEEPWKLHSSDTLSPKRDSSSSRIASENSGPS 633

Query: 193  RN 188
             N
Sbjct: 634  VN 635


>ref|XP_011031090.1| PREDICTED: filament-like plant protein 3 [Populus euphratica]
            gi|743861318|ref|XP_011031091.1| PREDICTED: filament-like
            plant protein 3 [Populus euphratica]
          Length = 672

 Score =  498 bits (1283), Expect = e-138
 Identities = 301/603 (49%), Positives = 403/603 (66%), Gaps = 9/603 (1%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEVTS++  + E++ DN+R LT+KLSAALLN+SAKE+LVKQHAKVAEEAVSGWEKAE E 
Sbjct: 47   PEVTSEAVLTDEDIRDNVRTLTDKLSAALLNLSAKEELVKQHAKVAEEAVSGWEKAEKEL 106

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
            +A K+Q+E A +KNS LEDRV HLD ALKECVRQLRQ++EEQE++I+EAV +K  EWEST
Sbjct: 107  SALKKQIEAATKKNSGLEDRVSHLDAALKECVRQLRQSREEQERRINEAVTKKICEWEST 166

Query: 1612 KTELESQLDELRKKLDAAKAEPKISS-TDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K+ELE+QL EL+ +L  AK++  +S+ ++L+ KL A EKEN +LK EL ++AEE+++  L
Sbjct: 167  KSELEAQLIELQARLQTAKSDATVSADSELWQKLNAVEKENLSLKHELFSRAEEIQVRIL 226

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVE 1259
            ERDLS QAAE+A+K +LES++K+AKLEAECR+L+A ARKA+  ND KS T   +SSIC E
Sbjct: 227  ERDLSTQAAETASKLQLESLKKLAKLEAECRKLKAMARKASAANDHKSLT---ASSICAE 283

Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079
            SFTDSQSD G+RLLA+E+      D+   + +E+NE E  CSDSWA A   ELDQ K+++
Sbjct: 284  SFTDSQSDNGERLLAVES------DSCKRSGLEMNECEQICSDSWACAHAIELDQSKNQK 337

Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899
             +  GR +      +I  MDDFLEMERLAALP+TE+  +       +D+     + LK +
Sbjct: 338  PI--GRNV-MVPSLEINLMDDFLEMERLAALPDTESGISYLEAGPVSDKGNGSGNPLKEE 394

Query: 898  LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719
            L+ MI RT                      E Q +L T   QL+E + K+ ELQ  + L 
Sbjct: 395  LECMINRTTELEEKLDKMEEEKFKSEMALTECQRQLETLRSQLKEADAKIGELQGLLTLA 454

Query: 718  AVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXX 539
              ++QA E EI+ ++++RK  ESQ+ + E E+ TL +K+ SL+ +VE ERA+SA      
Sbjct: 455  NESRQAREEEIKRSDSRRKETESQLRIAEAEIKTLLSKIVSLDAEVEKERALSAENAAKS 514

Query: 538  XXXXXXXXKRR------FETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQ 377
                    K +       E ER+R  + N ELKI Q+KELAVAA KLAECQKTI+SLG Q
Sbjct: 515  QELEDELSKMKCEVELQHEIERKRIASFNEELKITQEKELAVAASKLAECQKTISSLGLQ 574

Query: 376  LKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLH-SNDAYSTANQAAEDGSG 200
            LKSLATLED + DSD K  D++ E     + +  +GE W+L   N +    ++A E   G
Sbjct: 575  LKSLATLED-LFDSD-KSSDVSSEE----SKDHENGERWRLDLGNQSSGRESEAIEVTGG 628

Query: 199  LSR 191
              R
Sbjct: 629  ALR 631


>ref|XP_010242808.1| PREDICTED: filament-like plant protein isoform X3 [Nelumbo nucifera]
          Length = 599

 Score =  495 bits (1275), Expect = e-137
 Identities = 298/550 (54%), Positives = 373/550 (67%), Gaps = 6/550 (1%)
 Frame = -2

Query: 1819 GWEKAENEAAAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVV 1640
            GWEKAENE  A KQ+LE+A QKNS LEDRV HLDGALKECVRQLRQA+EEQEQKIHEAVV
Sbjct: 23   GWEKAENEVVALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAREEQEQKIHEAVV 82

Query: 1639 QKTHEWESTKTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQA 1460
            +KT EWES K ELESQ+  L+ +++AAK E   +S DL  KLE+AEK+N+ LKLEL ++ 
Sbjct: 83   EKTKEWESVKLELESQVVNLQSQVEAAKLEAAANS-DLCSKLESAEKKNAALKLELLSRV 141

Query: 1459 EELEIMTLERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKA-NNNDQKSFTIA 1283
            EELEI TLERDLS Q AE+A+KQ LESI+KVAKLEAECRRLRA +RKA + ND +S T  
Sbjct: 142  EELEIRTLERDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPSANDHRSVT-- 199

Query: 1282 SSSSICVESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITE 1103
             +SS  VES TDSQSD G+RLL +E D  HK+     +++ELN+ E   SDSWASALI E
Sbjct: 200  -ASSFYVESLTDSQSDSGERLLGMEID-THKM-----SSMELNDGEASYSDSWASALIAE 252

Query: 1102 LDQFKSKEKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQ 923
            LDQFK  + +  GR L      +I  MDDFLEMERLAALPETE+            ++G+
Sbjct: 253  LDQFKQDKAI--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGDPEPVAVPDQIDRGE 309

Query: 922  DRDQLKADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLE 743
                LKA+L+ MIQR+                      E Q++L  S +QL+  EEKL+E
Sbjct: 310  --SSLKAELETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVE 367

Query: 742  LQNKMGLEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAM 563
            LQ  + L    KQ TE ++E+ NT+++ +ES+++  + E+  LR KV SLE ++E ER +
Sbjct: 368  LQRCLDLANNLKQTTEEKLETINTQKEVIESRLVGADAEIRALRGKVGSLESEIEKERTL 427

Query: 562  SAXXXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLG 383
            S               K++ E E  RA+ SNGELKIKQ+KELAVAAGKL ECQKTIASLG
Sbjct: 428  SEEIVVKCRKLEDELTKKKHEAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLG 487

Query: 382  QQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQA 218
            +QLKSLATLEDF++D  EKPLD+ V      +P    G+ WKLHSNDA+     + +++ 
Sbjct: 488  RQLKSLATLEDFLIDY-EKPLDLTVG-----SPIPKGGDLWKLHSNDAHLPKAEAYSSKI 541

Query: 217  AEDGSGLSRN 188
            A DGSG S N
Sbjct: 542  AGDGSGPSTN 551


>gb|KHF97687.1| Filament-like plant protein [Gossypium arboreum]
            gi|728836668|gb|KHG16111.1| Filament-like plant protein
            [Gossypium arboreum]
          Length = 702

 Score =  485 bits (1249), Expect = e-134
 Identities = 304/630 (48%), Positives = 395/630 (62%), Gaps = 35/630 (5%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEV+SK++TS E+V D ++ LT+KLSAAL+NISAKE+LVKQH+KVAEEA++GWE AENE 
Sbjct: 53   PEVSSKASTSSEDVTDGVKILTQKLSAALVNISAKENLVKQHSKVAEEAIAGWENAENEV 112

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
               KQ+LE ++Q+N ALEDRV HLDGALKECVRQLRQ +EEQE+KI EA+ +   +WE+T
Sbjct: 113  VVLKQKLEASIQQNLALEDRVSHLDGALKECVRQLRQVREEQEEKISEAIAKAAQDWETT 172

Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K EL+S+L +L+ K +A  ++ P     ++  K+E  EK+N+ LKLELS+Q EE+EI T+
Sbjct: 173  KFELKSRLLDLQAKSEAINSKLPPQVGPEVRRKIEDLEKKNADLKLELSSQLEEMEIRTI 232

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANNNDQKSFTIASSSSICVES 1256
            ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A A K+ N           SSI VE 
Sbjct: 233  ERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIAGKSPN----------ISSIYVEL 282

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
             TDSQSD G+R+  +E D  HK+  I S   E N+ E  CSDSWASALI ELDQFK+++ 
Sbjct: 283  LTDSQSDSGERVNLVEID-THKM--ICS---EANKGELSCSDSWASALIAELDQFKNEKT 336

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQ--LKA 902
            V   R+L      +I  MDDFLEMERLAALPET++++     +        D D   LKA
Sbjct: 337  V--NRSL-PGSSIEIDIMDDFLEMERLAALPETKSKNQCLESKATAKVSNNDGDSLLLKA 393

Query: 901  DLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGL 722
            +L+AMI RT                      + Q  L  SE QLR+T  KL ELQ ++ +
Sbjct: 394  ELEAMIHRTTELEKKLEKIEVEKAELETALTKTQESLNESELQLRDTGLKLEELQRELSM 453

Query: 721  EAVAKQ---------------------ATELEIE-----------SANTKRKAMESQILV 638
               AKQ                     + E EIE           +AN  +K +ESQ++ 
Sbjct: 454  ANEAKQNLESQLRNMEADVETMSSKIESLEKEIEKESTLSAEVSVNANESKKMLESQLIS 513

Query: 637  MEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELK 458
            +E E  T+ AK+DSLE +VE ERA+SA              +++ ETE ++  NSN E+K
Sbjct: 514  IEVEARTMSAKIDSLETEVEKERALSAQITVKCQELEEELSRKKQETELQQTVNSNVEVK 573

Query: 457  IKQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNK 278
            IKQ+ +L  AAGKLAECQ+TIASLGQQLKSLATLEDF++DS   P       G  L P K
Sbjct: 574  IKQE-DLTAAAGKLAECQRTIASLGQQLKSLATLEDFLIDSASIP---EFPKGRSLIP-K 628

Query: 277  SSGEYWKLHSNDAYSTANQAAEDGSGLSRN 188
            + GE W LHSN+ +S         +   +N
Sbjct: 629  AGGEPWNLHSNETFSPKRDPESPRTSFDKN 658


>ref|XP_012455846.1| PREDICTED: filament-like plant protein isoform X2 [Gossypium
            raimondii]
          Length = 604

 Score =  484 bits (1246), Expect = e-133
 Identities = 289/543 (53%), Positives = 368/543 (67%), Gaps = 8/543 (1%)
 Frame = -2

Query: 1885 LNISAKEDLVKQHAKVAEEAVSGWEKAENEAAAFKQQLETAVQKNSALEDRVGHLDGALK 1706
            +NISAKE+LVKQHAKVAEEAVSGWEKAE +  A KQQL+ A++KN+ALEDRVGHLDGALK
Sbjct: 1    MNISAKEELVKQHAKVAEEAVSGWEKAEKDVVALKQQLDAAMKKNAALEDRVGHLDGALK 60

Query: 1705 ECVRQLRQAKEEQEQKIHEAVVQKTHEWESTKTELESQLDELRKKLDAAKAEPKIS-STD 1529
            ECVRQLRQA+EEQE+KIHEAV +K HEWES+K+ELESQL  L+ +L+ AK +   S   D
Sbjct: 61   ECVRQLRQAREEQERKIHEAVSKKCHEWESSKSELESQLLNLKAQLETAKNDTAASVDPD 120

Query: 1528 LYPKLEAAEKENSTLKLELSAQAEELEIMTLERDLSVQAAESAAKQRLESIRKVAKLEAE 1349
            L  KL+A EKENS LKL+L ++AEELE   +ERDLS QAAE+A+KQ LESI+K+AKLE E
Sbjct: 121  LQLKLDAFEKENSALKLQLHSRAEELERRIIERDLSTQAAETASKQHLESIKKLAKLEIE 180

Query: 1348 CRRLRASARKAN-NNDQKSFTIASSSSICVESFTDSQSDGGDRLLAIENDLPHKLDNISS 1172
            CRRL+A ARKA+  NDQKS+    +SSICVESFTDSQSD G+RLLA+E D+         
Sbjct: 181  CRRLKAIARKASPANDQKSY---PASSICVESFTDSQSDSGERLLAVETDMQ------KM 231

Query: 1171 NAVELNESEPGCSDSWASALITELDQFKSKEKVAGGRALXXXXXADICFMDDFLEMERLA 992
            N +E+N  +   SD+WASALITELDQF+ KEK  G   +      +I  MDDFLEMERLA
Sbjct: 232  NGLEMNGCDRSSSDAWASALITELDQFR-KEKAVGRNIM--APSVEINLMDDFLEMERLA 288

Query: 991  ALPETETESAVASCQQHNDEKGQDRDQLKADLDAMIQRTXXXXXXXXXXXXXXXXXXXXX 812
            ALP+TE+ S        + +     + LKADL+ ++ R                      
Sbjct: 289  ALPDTESGSGFNDAGPVSYQTSIVENPLKADLETLVHRVAELEEKLALTEEEKSEMQIAF 348

Query: 811  XENQNRLGTSEDQLRETEEKLLELQNKMGLEAVAKQATELEIESANTKRKAMESQILVME 632
             E+Q +L T ++QL E E +  ++Q ++ L   +KQA E E++ AN  R+  ES++   E
Sbjct: 349  TESQKQLKTLQNQLSEAEIRFKDVQTQLALADNSKQAAEKEVKVANMNREVAESRLRDAE 408

Query: 631  EEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRR------ATNSN 470
             E+ TL +KV SLE  +  E+A+S               K + ET+ R+      A   N
Sbjct: 409  TEIKTLMSKVTSLEEALGKEQALSTENMNKCKELENELSKMKCETKLRQEAELQHAAKYN 468

Query: 469  GELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLL 290
             ELK++QDKEL++AA K AECQKTIASLGQQLKSLATLEDF++DSD KPL++ V+GG   
Sbjct: 469  EELKVQQDKELSIAACKFAECQKTIASLGQQLKSLATLEDFLIDSD-KPLEL-VDGGLKC 526

Query: 289  TPN 281
            T N
Sbjct: 527  TGN 529


>ref|XP_010093113.1| hypothetical protein L484_007922 [Morus notabilis]
            gi|587863800|gb|EXB53551.1| hypothetical protein
            L484_007922 [Morus notabilis]
          Length = 643

 Score =  481 bits (1238), Expect = e-132
 Identities = 295/559 (52%), Positives = 371/559 (66%), Gaps = 10/559 (1%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEV SK+A + E  N++++ LT+KLSAAL +ISAKEDLVKQHAKVAEEAVSGWE AENE 
Sbjct: 47   PEVMSKAAPNDEYSNESVKTLTDKLSAALRSISAKEDLVKQHAKVAEEAVSGWENAENEV 106

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
               KQ+LE A QKNS LEDR+GHLDGALKECVRQLRQA+EEQEQKIH+AV +KTHEWES 
Sbjct: 107  LILKQKLEAANQKNSVLEDRLGHLDGALKECVRQLRQAREEQEQKIHDAVAKKTHEWESL 166

Query: 1612 KTELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K+ L+SQL EL+ +L   K E      +DL  KLEAAEK+NS LKLEL ++AEELEI  +
Sbjct: 167  KSLLQSQLLELQVELQNVKTEAAAPIDSDLQAKLEAAEKQNSALKLELLSKAEELEIRII 226

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259
            ERDLS +AAE+A+KQ LESI+KVAKLEAECRRL+A ARK +  N+QKS    SSSS+ VE
Sbjct: 227  ERDLSTKAAETASKQHLESIKKVAKLEAECRRLKAMARKVSQVNNQKS---GSSSSVYVE 283

Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079
            S TDSQSD G+RLL IE+        +   ++ELNE EP  S S AS+L+TE  QF++ E
Sbjct: 284  SLTDSQSDSGERLLTIES------GTLKMGSLELNECEPSDSGSCASSLVTE-HQFRN-E 335

Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETES--AVASCQQHNDEKGQDRDQLK 905
            K+ G   +      +I  MDDFLEMERLAALP  + ES   VA    H    G+ R   K
Sbjct: 336  KIIGKNRM--VPSIEINLMDDFLEMERLAALPVRDIESGFTVAGSASHQPIGGESR--FK 391

Query: 904  ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725
              LDAMIQR                         +  L TS+ QL   E++L ELQ ++ 
Sbjct: 392  TKLDAMIQRIAELEDKLEKIEMEKVELEVALSLCEKHLETSQSQLLVAEKRLKELQKQLV 451

Query: 724  LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545
            L   +K+A E E  +  TK++  ESQ+ V+E E+  L +K+ SLE +V+ ERA+SA    
Sbjct: 452  LANESKRAAEEEERATRTKQELAESQLRVVENEINALLSKIGSLEEEVQKERALSADNVA 511

Query: 544  XXXXXXXXXXKRRFETERR------RATNSNGELKIKQDKELAVAAGKLAECQKTIASLG 383
                        + E E +      R  ++N  LKIKQ+KEL++AA K AECQKTIASLG
Sbjct: 512  RCQKMENELLIVKREAENKQEAELERIQSANVNLKIKQEKELSLAADKFAECQKTIASLG 571

Query: 382  QQLKSLATLEDFMMDSDEK 326
            QQLKSLA+LED ++D +++
Sbjct: 572  QQLKSLASLEDVLLDPEKQ 590


>ref|XP_012446394.1| PREDICTED: filament-like plant protein isoform X3 [Gossypium
            raimondii] gi|763792239|gb|KJB59235.1| hypothetical
            protein B456_009G245900 [Gossypium raimondii]
          Length = 668

 Score =  478 bits (1231), Expect = e-132
 Identities = 299/615 (48%), Positives = 392/615 (63%), Gaps = 35/615 (5%)
 Frame = -2

Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793
            PEV+SK++T+ E+V D+++ LTEKLSAAL+NISAKEDLVKQH+KVAEEA++GWE AENE 
Sbjct: 19   PEVSSKASTNSEDVTDSVKILTEKLSAALVNISAKEDLVKQHSKVAEEAIAGWENAENEV 78

Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613
               KQ+LE ++Q+N  LEDRV HLDGALKECVRQLRQA+EEQE+KI EA+ +   +WE+T
Sbjct: 79   VVLKQKLEASIQQNLTLEDRVSHLDGALKECVRQLRQAREEQEEKIGEAIAKAAQDWETT 138

Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436
            K ELES+L +L+ K +A  ++ P     +++ K+E  EK+N+ LKLELS+Q EE+EI T+
Sbjct: 139  KLELESKLLDLQAKSEAINSKLPPQVGPEVWRKIEDLEKKNADLKLELSSQLEEMEIRTI 198

Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANNNDQKSFTIASSSSICVES 1256
            ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A A K+ N           SSI V  
Sbjct: 199  ERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIAGKSLN----------ISSIYVGP 248

Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076
             TDSQSD G+R+  +E D  HK+  I S   E N+ E  CSDSWASALI ELDQFK+++ 
Sbjct: 249  LTDSQSDSGERVNLVEID-THKM--ICS---EANKGELSCSDSWASALIAELDQFKNEKT 302

Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQ--LKA 902
            V   R+L      +I  MDDFLEMERLAALP T++++     +        D D   LKA
Sbjct: 303  V--NRSL-PGSSIEIDIMDDFLEMERLAALPATKSKNQCLELKATAKVSNNDGDSLLLKA 359

Query: 901  DLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGL 722
            +L+AMI RT                      + +  L  S+ QLR++  KL ELQ ++ +
Sbjct: 360  ELEAMIHRTTELEKKLEKIEVEKAELETALTKTRESLNESKLQLRDSGLKLEELQRELSM 419

Query: 721  EAVAKQ---------------------ATELEIE-----------SANTKRKAMESQILV 638
               AKQ                     + E EIE           +AN  +K +ESQ++ 
Sbjct: 420  VNEAKQNLESQLRNMEADVETMSSKIESLEKEIEKERTLSAEVSVNANESKKMLESQLIS 479

Query: 637  MEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELK 458
            +E E  T+ AK+DSLE +VE ERA+SA              +++ ETE ++  NSN E+K
Sbjct: 480  IEVEARTMSAKIDSLETEVEKERALSAQITVKCQELEEELSRKKQETELQQTVNSNVEVK 539

Query: 457  IKQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNK 278
            IKQ+ +L  AAGKLAECQ+TIASLGQQLKSLATLEDF++DS   P       G  L P +
Sbjct: 540  IKQE-DLTAAAGKLAECQRTIASLGQQLKSLATLEDFLIDSASIP---EFPKGRSLIP-E 594

Query: 277  SSGEYWKLHSNDAYS 233
            + GE W LHSN+ +S
Sbjct: 595  AGGEPWNLHSNETFS 609

