BLASTX nr result
ID: Papaver29_contig00000095
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Papaver29_contig00000095
         (1973 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_010652946.1| PREDICTED: filament-like plant protein [Viti...   572   e-160
ref|XP_010242807.1| PREDICTED: filament-like plant protein isofo...   570   e-159
ref|XP_010242801.1| PREDICTED: filament-like plant protein isofo...   570   e-159
ref|XP_010243856.1| PREDICTED: filament-like plant protein [Nelu...   569   e-159
emb|CAN83687.1| hypothetical protein VITISV_031800 [Vitis vinifera]   566   e-158
ref|XP_007019074.1| Filament-like plant protein, putative isofor...   536   e-149
ref|XP_007048554.1| Uncharacterized protein isoform 2, partial [...   527   e-146
ref|XP_007048553.1| Uncharacterized protein isoform 1 [Theobroma...   527   e-146
gb|KHG29921.1| Filament-like plant protein [Gossypium arboreum]       516   e-143
ref|XP_012455840.1| PREDICTED: filament-like plant protein isofo...   516   e-143
ref|XP_010664206.1| PREDICTED: filament-like plant protein [Viti...   513   e-142
ref|XP_010025716.1| PREDICTED: filament-like plant protein 3 [Eu...   502   e-139
ref|XP_012078241.1| PREDICTED: filament-like plant protein isofo...   499   e-138
ref|XP_012078245.1| PREDICTED: filament-like plant protein isofo...   499   e-138
ref|XP_011031090.1| PREDICTED: filament-like plant protein 3 [Po...   498   e-138
ref|XP_010242808.1| PREDICTED: filament-like plant protein isofo...   495   e-137
gb|KHF97687.1| Filament-like plant protein [Gossypium arboreum] ...   485   e-134
ref|XP_012455846.1| PREDICTED: filament-like plant protein isofo...   484   e-133
ref|XP_010093113.1| hypothetical protein L484_007922 [Morus nota...
481 e-132 ref|XP_012446394.1| PREDICTED: filament-like plant protein isofo... 478 e-132 >ref|XP_010652946.1| PREDICTED: filament-like plant protein [Vitis vinifera] gi|731397640|ref|XP_010652947.1| PREDICTED: filament-like plant protein [Vitis vinifera] gi|731397642|ref|XP_010652948.1| PREDICTED: filament-like plant protein [Vitis vinifera] gi|731397644|ref|XP_010652949.1| PREDICTED: filament-like plant protein [Vitis vinifera] Length = 672 Score = 572 bits (1475), Expect = e-160 Identities = 334/583 (57%), Positives = 412/583 (70%), Gaps = 4/583 (0%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTSKSA EEVND++++LTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENE Sbjct: 47 PEVTSKSAPVDEEVNDSVKSLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEV 106 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 + KQQLE A QKNSALEDRVGHLDGALKEC+RQLRQA+EEQEQKIHEAVV++THEWEST Sbjct: 107 FSLKQQLEAAAQKNSALEDRVGHLDGALKECLRQLRQAREEQEQKIHEAVVKRTHEWEST 166 Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTD--LYPKLEAAEKENSTLKLELSAQAEELEIMT 1439 K+ELESQ+ E++ +L AKAE +++ D L KL AAEKEN+ LKL+L ++ EELEI T Sbjct: 167 KSELESQIVEIQAQLQTAKAE-TVATVDPGLELKLGAAEKENAALKLQLLSREEELEIRT 225 Query: 1438 LERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICV 1262 +E++LS QAAE+A+KQ LESI+KVAKLEAECRRL+A ARKA++ ND KS T +SS+CV Sbjct: 226 IEQELSTQAAETASKQNLESIKKVAKLEAECRRLKAMARKASSANDHKSIT---ASSVCV 282 Query: 1261 ESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSK 1082 ES TDSQSD G+RLLA+E +D ++ NE EP SDSWAS LI ELD+FK++ Sbjct: 283 ESLTDSQSDSGERLLALE------IDTRKMTGLDTNECEPSRSDSWASGLIQELDRFKNE 336 Query: 1081 EKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEK-GQDRDQLK 905 + + ++ MDDFLEMERLAALPETE S +D+ G LK Sbjct: 337 KPLVKN---LMAPSVELDLMDDFLEMERLAALPETENRSRCLESGAISDKHIGGSESPLK 393 Query: 904 ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725 A L+AMI RT E QN+L TS+ +L+E EEKL+ELQ ++ Sbjct: 394 AQLEAMIDRTAELEEKLEKMEAEKMELDMALSECQNQLETSQGRLKEVEEKLVELQTQLA 
453 Query: 724 LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545 L + +K+ E EI++ N KR+ ES+++ +E E+ T+ +KV SLE +VE ERA+SA Sbjct: 454 LASESKRNAEEEIQTTNAKREVAESRLIAVEAEIKTMLSKVLSLEEEVEKERALSAEAAS 513 Query: 544 XXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSL 365 + + ETE R +SNGELKIKQ+KELAVAA KLAECQKTIASLG+QLKSL Sbjct: 514 KCRKFEDELSRMKRETELRNLASSNGELKIKQEKELAVAASKLAECQKTIASLGRQLKSL 573 Query: 364 ATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY 236 ATLED ++DS EKPL EG L K E W LH ++Y Sbjct: 574 ATLEDLLLDS-EKPLQPMSEG---LHHPKDGAEQWTLHPGNSY 612 >ref|XP_010242807.1| PREDICTED: filament-like plant protein isoform X2 [Nelumbo nucifera] Length = 678 Score = 570 bits (1470), Expect = e-159 Identities = 338/601 (56%), Positives = 419/601 (69%), Gaps = 6/601 (0%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PE+TSK S EEVNDN+++LT+KL+AAL NISAKEDLVKQHAKVAEEAVSGWEKAENE Sbjct: 51 PEITSKVTASSEEVNDNVKSLTDKLAAALSNISAKEDLVKQHAKVAEEAVSGWEKAENEV 110 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 A KQ+LE+A QKNS LEDRV HLDGALKECVRQLRQA+EEQEQKIHEAVV+KT EWES Sbjct: 111 VALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAREEQEQKIHEAVVEKTKEWESV 170 Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 K ELESQ+ L+ +++AAK E +S DL KLE+AEK+N+ LKLEL ++ EELEI TLE Sbjct: 171 KLELESQVVNLQSQVEAAKLEAAANS-DLCSKLESAEKKNAALKLELLSRVEELEIRTLE 229 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKA-NNNDQKSFTIASSSSICVES 1256 RDLS Q AE+A+KQ LESI+KVAKLEAECRRLRA +RKA + ND +S T +SS VES Sbjct: 230 RDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPSANDHRSVT---ASSFYVES 286 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 TDSQSD G+RLL +E D HK+ +++ELN+ E SDSWASALI ELDQFK + Sbjct: 287 LTDSQSDSGERLLGMEID-THKM-----SSMELNDGEASYSDSWASALIAELDQFKQDKA 340 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 + GR L +I MDDFLEMERLAALPETE+ ++G+ LKA+L Sbjct: 341 
I--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGDPEPVAVPDQIDRGE--SSLKAEL 395 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 + MIQR+ E Q++L S +QL+ EEKL+ELQ + L Sbjct: 396 ETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVELQRCLDLAN 455 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536 KQ TE ++E+ NT+++ +ES+++ + E+ LR KV SLE ++E ER +S Sbjct: 456 NLKQTTEEKLETINTQKEVIESRLVGADAEIRALRGKVGSLESEIEKERTLSEEIVVKCR 515 Query: 535 XXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLATL 356 K++ E E RA+ SNGELKIKQ+KELAVAAGKL ECQKTIASLG+QLKSLATL Sbjct: 516 KLEDELTKKKHEAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLGRQLKSLATL 575 Query: 355 EDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLSR 191 EDF++D EKPLD+ V +P G+ WKLHSNDA+ + +++ A DGSG S Sbjct: 576 EDFLIDY-EKPLDLTVG-----SPIPKGGDLWKLHSNDAHLPKAEAYSSKIAGDGSGPST 629 Query: 190 N 188 N Sbjct: 630 N 630 >ref|XP_010242801.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera] gi|720083139|ref|XP_010242802.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera] gi|720083142|ref|XP_010242803.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera] gi|720083146|ref|XP_010242804.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera] gi|720083149|ref|XP_010242805.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera] gi|720083152|ref|XP_010242806.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera] Length = 679 Score = 570 bits (1470), Expect = e-159 Identities = 338/601 (56%), Positives = 419/601 (69%), Gaps = 6/601 (0%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PE+TSK S EEVNDN+++LT+KL+AAL NISAKEDLVKQHAKVAEEAVSGWEKAENE Sbjct: 52 PEITSKVTASSEEVNDNVKSLTDKLAAALSNISAKEDLVKQHAKVAEEAVSGWEKAENEV 111 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 A KQ+LE+A QKNS LEDRV HLDGALKECVRQLRQA+EEQEQKIHEAVV+KT EWES Sbjct: 112 
VALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAREEQEQKIHEAVVEKTKEWESV 171 Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 K ELESQ+ L+ +++AAK E +S DL KLE+AEK+N+ LKLEL ++ EELEI TLE Sbjct: 172 KLELESQVVNLQSQVEAAKLEAAANS-DLCSKLESAEKKNAALKLELLSRVEELEIRTLE 230 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKA-NNNDQKSFTIASSSSICVES 1256 RDLS Q AE+A+KQ LESI+KVAKLEAECRRLRA +RKA + ND +S T +SS VES Sbjct: 231 RDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPSANDHRSVT---ASSFYVES 287 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 TDSQSD G+RLL +E D HK+ +++ELN+ E SDSWASALI ELDQFK + Sbjct: 288 LTDSQSDSGERLLGMEID-THKM-----SSMELNDGEASYSDSWASALIAELDQFKQDKA 341 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 + GR L +I MDDFLEMERLAALPETE+ ++G+ LKA+L Sbjct: 342 I--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGDPEPVAVPDQIDRGE--SSLKAEL 396 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 + MIQR+ E Q++L S +QL+ EEKL+ELQ + L Sbjct: 397 ETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVELQRCLDLAN 456 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536 KQ TE ++E+ NT+++ +ES+++ + E+ LR KV SLE ++E ER +S Sbjct: 457 NLKQTTEEKLETINTQKEVIESRLVGADAEIRALRGKVGSLESEIEKERTLSEEIVVKCR 516 Query: 535 XXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLATL 356 K++ E E RA+ SNGELKIKQ+KELAVAAGKL ECQKTIASLG+QLKSLATL Sbjct: 517 KLEDELTKKKHEAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLGRQLKSLATL 576 Query: 355 EDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLSR 191 EDF++D EKPLD+ V +P G+ WKLHSNDA+ + +++ A DGSG S Sbjct: 577 EDFLIDY-EKPLDLTVG-----SPIPKGGDLWKLHSNDAHLPKAEAYSSKIAGDGSGPST 630 Query: 190 N 188 N Sbjct: 631 N 631 >ref|XP_010243856.1| PREDICTED: filament-like plant protein [Nelumbo nucifera] gi|720086488|ref|XP_010243857.1| PREDICTED: filament-like plant protein [Nelumbo nucifera] gi|720086491|ref|XP_010243858.1| PREDICTED: filament-like plant protein [Nelumbo nucifera] Length = 675 Score = 569 
bits (1467), Expect = e-159 Identities = 344/585 (58%), Positives = 415/585 (70%), Gaps = 6/585 (1%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTSK EEV+D++++LTEKL+AAL NISAKEDLVKQHAKVAEEAVSGWEKAE E Sbjct: 52 PEVTSKVTNRSEEVSDSVKSLTEKLAAALSNISAKEDLVKQHAKVAEEAVSGWEKAEKEV 111 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 + KQQLE AVQKNS+LEDRVGHLDGALKECVRQLRQA+EEQEQKIHEAV +K EWES Sbjct: 112 VSLKQQLEAAVQKNSSLEDRVGHLDGALKECVRQLRQAREEQEQKIHEAVAKKASEWESA 171 Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 K ELE+Q+ EL+ +++AAK E S + + KLEAAEKEN+ LKL+L A+ EELEI TLE Sbjct: 172 KFELENQVVELQTQVEAAKLE-AASDSGIQLKLEAAEKENAALKLQLLARIEELEIRTLE 230 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256 RDLS Q AESA+KQ LESI+KVA+LEAECRRLRA +RKA ND KS ++SSI VES Sbjct: 231 RDLSTQTAESASKQHLESIKKVARLEAECRRLRAISRKAALANDHKS---VAASSIYVES 287 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 TDSQSD G+RLL +E D ISS +ELN+ EP CSDSWASALI ELDQFK + Sbjct: 288 LTDSQSDSGERLLGVETD----TRKISS--LELNDCEPSCSDSWASALIAELDQFKQDKA 341 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETET---ESAVASCQQHNDEKGQDRDQLK 905 + GR L +I MDDFLEMERLAALPETE+ E AS D+ ++ +K Sbjct: 342 I--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGRPEPVAAS-----DQIDSGQNSIK 393 Query: 904 ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725 A+L+AMI RT E+Q RL S++QL E EEKL+ELQ ++ Sbjct: 394 AELEAMIHRTAELEEKLEKMEEEKAALDMALAESQGRLEMSQNQLWEAEEKLVELQRQLD 453 Query: 724 LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545 L KQA E++IE++NT+R+ +ES ++ + EV LR KV SLE ++E ERA+SA Sbjct: 454 LANNLKQAAEVKIEASNTQRELVESHLVSADAEVWALRTKVCSLEAEIEKERALSAEVAA 513 Query: 544 XXXXXXXXXXKRRFETERRRA--TNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLK 371 +R E E RRA + SN ELK KQ+KELAVAAGKL+ECQKTIASLG+QLK Sbjct: 514 KCKKLEDELLGKRNEAELRRASISKSNDELKTKQEKELAVAAGKLSECQKTIASLGRQLK 573 Query: 370 
SLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY 236 +LATLEDF++DS EKPLD++ GS P GE WKLHSN+AY Sbjct: 574 ALATLEDFLIDS-EKPLDLS---GS---PIPKIGESWKLHSNEAY 611 >emb|CAN83687.1| hypothetical protein VITISV_031800 [Vitis vinifera] Length = 749 Score = 566 bits (1458), Expect = e-158 Identities = 334/583 (57%), Positives = 410/583 (70%), Gaps = 4/583 (0%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTSK+A EEVND++++LTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENE Sbjct: 24 PEVTSKAAPVDEEVNDSVKSLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEV 83 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 + KQQLE QKNS LEDRVGHLDGALKEC+RQLRQA+EEQEQKIHEAVV++THEWEST Sbjct: 84 FSLKQQLEAXXQKNSXLEDRVGHLDGALKECLRQLRQAREEQEQKIHEAVVKRTHEWEST 143 Query: 1612 KTELESQLDELRKKLDAAKAEPKISSTD--LYPKLEAAEKENSTLKLELSAQAEELEIMT 1439 K+ELESQ+ E++ +L AKAE +++ D L KL AAEKEN+ LKL+L ++ EELEI T Sbjct: 144 KSELESQIVEIQAQLQTAKAE-XVATVDPGLELKLGAAEKENAALKLQLLSREEELEIRT 202 Query: 1438 LERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICV 1262 +E++LS QAAE+A+KQ LESI+KVAKLEAECRRL+A ARKA++ ND KS T +SS+CV Sbjct: 203 IEQELSTQAAETASKQNLESIKKVAKLEAECRRLKAMARKASSANDHKSXT---ASSVCV 259 Query: 1261 ESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSK 1082 ES TDSQSD G+RLLA+E +D ++ NE EP SDSWAS LI ELD+FK+ Sbjct: 260 ESLTDSQSDSGERLLALE------IDTRKMTGLDTNECEPSRSDSWASGLIQELDRFKN- 312 Query: 1081 EKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEK-GQDRDQLK 905 EK + D+ MDDFLEMERLAALPETE S +D+ G LK Sbjct: 313 EKPLVKNLMAPSVEXDL--MDDFLEMERLAALPETENRSRCLESGAISDKHIGGSESPLK 370 Query: 904 ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725 A L+AMI RT E QN+L TS+ +L+E EEKL+ELQ ++ Sbjct: 371 AQLEAMIDRTAELEEKLEKMEAEKMELDMALSECQNQLETSQGRLKEVEEKLVELQTQLA 430 Query: 724 LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545 L + +K+ E EI++ N KR+ ES+++ +E E+ T+ +KV SLE +VE ERA+SA Sbjct: 431 LASESKRNAEEEIQATNAKREVAESRLIXVEAEIKTMLSKVLSLEEEVEKERALSAEAAS 490 Query: 
544 XXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSL 365 + + ETE R +SNGELKIKQ+KELAVAA KLAECQKTIASLG+QLKSL Sbjct: 491 KCRKFEDELSRMKRETELRNLASSNGELKIKQEKELAVAASKLAECQKTIASLGRQLKSL 550 Query: 364 ATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY 236 ATLED ++DS EKPL EG L K E W LH ++Y Sbjct: 551 ATLEDLLLDS-EKPLQPMSEG---LHHPKDGAEQWTLHPGNSY 589 >ref|XP_007019074.1| Filament-like plant protein, putative isoform 1 [Theobroma cacao] gi|508724402|gb|EOY16299.1| Filament-like plant protein, putative isoform 1 [Theobroma cacao] Length = 713 Score = 536 bits (1382), Expect = e-149 Identities = 322/614 (52%), Positives = 405/614 (65%), Gaps = 34/614 (5%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEV+SK++ + E+VND+++ LTEKLSAAL+N+SAKEDLVKQHAKVAEEA++GWEKAENE Sbjct: 51 PEVSSKASANCEDVNDSIKRLTEKLSAALVNVSAKEDLVKQHAKVAEEAIAGWEKAENEV 110 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 KQ+LE AVQ+NSALEDRV HLDGALKECVRQLRQA+EEQEQKI+EAV + T +WE+T Sbjct: 111 VLLKQKLEAAVQQNSALEDRVSHLDGALKECVRQLRQAREEQEQKINEAVAKTTRDWETT 170 Query: 1612 KTELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K ELESQ EL+ K +A K+EP S DL+ K+EA EKENS LKLELS+Q+EE EI T+ Sbjct: 171 KFELESQFLELQDKAEAVKSEPPPHFSPDLWHKIEALEKENSALKLELSSQSEEFEIRTI 230 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259 ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A A K++ ND KS ++SSI VE Sbjct: 231 ERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIACKSSLVNDHKS---PAASSIYVE 287 Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079 S TDSQSD G+RL +E D HK+ + E N+ EP CSDSWASALI ELDQFK+++ Sbjct: 288 SVTDSQSDSGERLNVVEIDT-HKMSGL-----EANKGEPSCSDSWASALIAELDQFKNEK 341 Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899 ++ +I MDDFLEMERLAALPE ++E+ + + LKA+ Sbjct: 342 VISRN---LPSSSIEIDLMDDFLEMERLAALPEIKSENQFLESKATARQSNDGDSSLKAE 398 Query: 898 LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719 L+AMI RT ++Q L 
S QLR+TE KL EL+ + + Sbjct: 399 LEAMIHRTAELEQKLEKIELEKAELEIALAKSQESLEASALQLRDTETKLEELEREFHMA 458 Query: 718 AVAKQATELEIESANTKRKAM--------------------------------ESQILVM 635 AKQ E ++ S T + M ESQ++ + Sbjct: 459 NEAKQHLESQLSSMETDAETMSSKIDSLKAEIEKEMALSAEISVNATESKQLLESQLISI 518 Query: 634 EEEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELKI 455 E E T+ AK+DSLE +VE ERA+SA ++R E E ++ NSN E+KI Sbjct: 519 EAEARTMSAKIDSLETEVEKERALSAQITVKCQELEEELLRKRQEAELQQTANSNVEVKI 578 Query: 454 KQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKS 275 KQ+ +LAVAAGKLAECQKTIASLGQQLKSLATLEDF++D+ P GGSL+ +K+ Sbjct: 579 KQE-DLAVAAGKLAECQKTIASLGQQLKSLATLEDFLIDTTSIP--EFSRGGSLV--SKA 633 Query: 274 SGEYWKLHSNDAYS 233 GE WKLHSN+ YS Sbjct: 634 GGEPWKLHSNETYS 647 >ref|XP_007048554.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508700815|gb|EOX92711.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 649 Score = 527 bits (1357), Expect = e-146 Identities = 307/565 (54%), Positives = 396/565 (70%), Gaps = 8/565 (1%) Frame = -2 Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790 EVTSK+ EEVNDN+++LTEKLSAAL+NISAKEDLVKQHAKVAEEAVSGWEKAE + Sbjct: 49 EVTSKAVPMDEEVNDNVKSLTEKLSAALINISAKEDLVKQHAKVAEEAVSGWEKAEKDVL 108 Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610 A KQQL+ A++K +ALEDRVGHLDGALKECVRQLRQA+EEQE++IHEAV +K HEWES+K Sbjct: 109 ALKQQLDAAIKKTAALEDRVGHLDGALKECVRQLRQAREEQERRIHEAVAKKCHEWESSK 168 Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 +ELESQL +L+ +L K+E S DL+PKLEA EKENS LKL+L ++AEEL++ +E Sbjct: 169 SELESQLVDLKAQLQTTKSETAASVDPDLHPKLEAFEKENSALKLQLLSRAEELQLRIIE 228 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256 RDLS QAAE+A+KQ LESI+K+AKLEAECR+L+ ARKA+ NDQKS+ ++SSICV+S Sbjct: 229 RDLSTQAAETASKQHLESIKKLAKLEAECRKLKVIARKASPANDQKSY---AASSICVDS 285 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 FTDSQSD GDRLLA+E ++ 
+ +E+NE E S+SW SALITELDQF++++ Sbjct: 286 FTDSQSDSGDRLLAVETNMR------KMSGLEMNECETSRSESWTSALITELDQFRNEKA 339 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 V GR + +I MDDFLEMERLAALP+TE+ + +D+ + LKA++ Sbjct: 340 V--GRNI-MAPSVEINLMDDFLEMERLAALPDTESATGFNEAGLVSDQTSTVENPLKAEV 396 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 + I R E+Q +L T ++QLRE E KL +LQ ++ L Sbjct: 397 ETFIHRIAELEGKLAMTEAEKLELKLAFTESQKQLETLQNQLREAETKLADLQTQLALAD 456 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSA------X 554 +KQA E E++ AN R+ ES+ E EV TL +KV SLE +V E+A+SA Sbjct: 457 NSKQAAEDEVKVANMNREVAESRFRDAEIEVKTLLSKVTSLEEEVGREQALSARNVSKCK 516 Query: 553 XXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374 + R + ER+ + N ELK +QDKELA+AA KLAECQKTIASLG+QL Sbjct: 517 ELEDELSKLKREAELRLDAERQLVASYNEELKAQQDKELAIAASKLAECQKTIASLGRQL 576 Query: 373 KSLATLEDFMMDSDEKPLDINVEGG 299 KSLATL+DF++D D KPL++ V+GG Sbjct: 577 KSLATLDDFLIDPD-KPLEL-VDGG 599 >ref|XP_007048553.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508700814|gb|EOX92710.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 675 Score = 527 bits (1357), Expect = e-146 Identities = 307/565 (54%), Positives = 396/565 (70%), Gaps = 8/565 (1%) Frame = -2 Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790 EVTSK+ EEVNDN+++LTEKLSAAL+NISAKEDLVKQHAKVAEEAVSGWEKAE + Sbjct: 49 EVTSKAVPMDEEVNDNVKSLTEKLSAALINISAKEDLVKQHAKVAEEAVSGWEKAEKDVL 108 Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610 A KQQL+ A++K +ALEDRVGHLDGALKECVRQLRQA+EEQE++IHEAV +K HEWES+K Sbjct: 109 ALKQQLDAAIKKTAALEDRVGHLDGALKECVRQLRQAREEQERRIHEAVAKKCHEWESSK 168 Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 +ELESQL +L+ +L K+E S DL+PKLEA EKENS LKL+L ++AEEL++ +E Sbjct: 169 SELESQLVDLKAQLQTTKSETAASVDPDLHPKLEAFEKENSALKLQLLSRAEELQLRIIE 228 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256 RDLS 
QAAE+A+KQ LESI+K+AKLEAECR+L+ ARKA+ NDQKS+ ++SSICV+S Sbjct: 229 RDLSTQAAETASKQHLESIKKLAKLEAECRKLKVIARKASPANDQKSY---AASSICVDS 285 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 FTDSQSD GDRLLA+E ++ + +E+NE E S+SW SALITELDQF++++ Sbjct: 286 FTDSQSDSGDRLLAVETNMR------KMSGLEMNECETSRSESWTSALITELDQFRNEKA 339 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 V GR + +I MDDFLEMERLAALP+TE+ + +D+ + LKA++ Sbjct: 340 V--GRNI-MAPSVEINLMDDFLEMERLAALPDTESATGFNEAGLVSDQTSTVENPLKAEV 396 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 + I R E+Q +L T ++QLRE E KL +LQ ++ L Sbjct: 397 ETFIHRIAELEGKLAMTEAEKLELKLAFTESQKQLETLQNQLREAETKLADLQTQLALAD 456 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSA------X 554 +KQA E E++ AN R+ ES+ E EV TL +KV SLE +V E+A+SA Sbjct: 457 NSKQAAEDEVKVANMNREVAESRFRDAEIEVKTLLSKVTSLEEEVGREQALSARNVSKCK 516 Query: 553 XXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374 + R + ER+ + N ELK +QDKELA+AA KLAECQKTIASLG+QL Sbjct: 517 ELEDELSKLKREAELRLDAERQLVASYNEELKAQQDKELAIAASKLAECQKTIASLGRQL 576 Query: 373 KSLATLEDFMMDSDEKPLDINVEGG 299 KSLATL+DF++D D KPL++ V+GG Sbjct: 577 KSLATLDDFLIDPD-KPLEL-VDGG 599 >gb|KHG29921.1| Filament-like plant protein [Gossypium arboreum] Length = 679 Score = 516 bits (1329), Expect = e-143 Identities = 308/571 (53%), Positives = 392/571 (68%), Gaps = 8/571 (1%) Frame = -2 Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790 EVTSK A V+E ++N+R+LTEKLS AL+NISAKE+LVKQHAKVAEEAVSGWEKAE + Sbjct: 49 EVTSK-AVPVDEESNNVRSLTEKLSTALMNISAKEELVKQHAKVAEEAVSGWEKAEKDVV 107 Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610 A KQQL+ A++KN+ALEDRVGHLDGALKECVRQLRQA+EEQE+KIHEAV +K HEWES+K Sbjct: 108 ALKQQLDAAMKKNAALEDRVGHLDGALKECVRQLRQAREEQERKIHEAVSKKCHEWESSK 167 Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 +ELESQL L+ +L+ AK++ S DL KL+A EKENS LKL+L ++AEELE +E Sbjct: 168 
SELESQLLNLKAQLETAKSDAAASVDPDLQLKLDACEKENSALKLQLHSRAEELERRIIE 227 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256 RDLS QAAE+A+KQ L+SI+K+AKLE ECRRL+A ARKA+ NDQKS+T +SSICVES Sbjct: 228 RDLSTQAAETASKQHLDSIKKLAKLEIECRRLKAIARKASPANDQKSYT---ASSICVES 284 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 FTDSQSD G+RLLA+E D+ N +E+N + SD+WASALITELDQF+ KEK Sbjct: 285 FTDSQSDSGERLLAVETDMQ------KMNGLEMNGCDRSRSDAWASALITELDQFR-KEK 337 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 G + +I MDDFLEMERLAALP+TE+ S + + + LKADL Sbjct: 338 AVGRNIM--APSVEINLMDDFLEMERLAALPDTESGSGFNDAGPVSYQNSIVENPLKADL 395 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 + ++ R E+Q +L T ++QL E E + ++Q ++ L Sbjct: 396 ETLVHRVAELEEKLALTEEEKSEMQIAFTESQKQLKTLQNQLSEAEIRFKDVQTQLALAD 455 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536 +KQA E E++ AN R+ ES++ E E+ TL +KV SLE E+A+S Sbjct: 456 NSKQAAEKEVKVANMNRQVAESRLRDAETEIKTLMSKVTSLEEAFGKEQALSTENMNKCK 515 Query: 535 XXXXXXXKRRFETERRR------ATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374 K + ET+ RR A N ELK++QDKEL++AA K AECQKTIASLGQQL Sbjct: 516 ELENELSKMKCETKLRREAELQHAAKYNEELKVQQDKELSIAARKFAECQKTIASLGQQL 575 Query: 373 KSLATLEDFMMDSDEKPLDINVEGGSLLTPN 281 KSLATLEDF++DSD KPL++ V+GG T N Sbjct: 576 KSLATLEDFLIDSD-KPLEL-VDGGLKCTGN 604 >ref|XP_012455840.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium raimondii] gi|823246344|ref|XP_012455841.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium raimondii] gi|823246346|ref|XP_012455843.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium raimondii] gi|823246348|ref|XP_012455844.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium raimondii] gi|823246350|ref|XP_012455845.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium raimondii] gi|763804407|gb|KJB71345.1| hypothetical protein B456_011G117600 [Gossypium raimondii] Length = 679 Score 
= 516 bits (1328), Expect = e-143 Identities = 309/571 (54%), Positives = 392/571 (68%), Gaps = 8/571 (1%) Frame = -2 Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790 EVTSK A V+E N+N+R+LTEKLSAAL+NISAKE+LVKQHAKVAEEAVSGWEKAE + Sbjct: 49 EVTSK-AVPVDEENNNVRSLTEKLSAALMNISAKEELVKQHAKVAEEAVSGWEKAEKDVV 107 Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610 A KQQL+ A++KN+ALEDRVGHLDGALKECVRQLRQA+EEQE+KIHEAV +K HEWES+K Sbjct: 108 ALKQQLDAAMKKNAALEDRVGHLDGALKECVRQLRQAREEQERKIHEAVSKKCHEWESSK 167 Query: 1609 TELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 +ELESQL L+ +L+ AK + S DL KL+A EKENS LKL+L ++AEELE +E Sbjct: 168 SELESQLLNLKAQLETAKNDTAASVDPDLQLKLDAFEKENSALKLQLHSRAEELERRIIE 227 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256 RDLS QAAE+A+KQ LESI+K+AKLE ECRRL+A ARKA+ NDQKS+ +SSICVES Sbjct: 228 RDLSTQAAETASKQHLESIKKLAKLEIECRRLKAIARKASPANDQKSY---PASSICVES 284 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 FTDSQSD G+RLLA+E D+ N +E+N + SD+WASALITELDQF+ KEK Sbjct: 285 FTDSQSDSGERLLAVETDMQ------KMNGLEMNGCDRSSSDAWASALITELDQFR-KEK 337 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 G + +I MDDFLEMERLAALP+TE+ S + + + LKADL Sbjct: 338 AVGRNIM--APSVEINLMDDFLEMERLAALPDTESGSGFNDAGPVSYQTSIVENPLKADL 395 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 + ++ R E+Q +L T ++QL E E + ++Q ++ L Sbjct: 396 ETLVHRVAELEEKLALTEEEKSEMQIAFTESQKQLKTLQNQLSEAEIRFKDVQTQLALAD 455 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536 +KQA E E++ AN R+ ES++ E E+ TL +KV SLE + E+A+S Sbjct: 456 NSKQAAEKEVKVANMNREVAESRLRDAETEIKTLMSKVTSLEEALGKEQALSTENMNKCK 515 Query: 535 XXXXXXXKRRFETERRR------ATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374 K + ET+ R+ A N ELK++QDKEL++AA K AECQKTIASLGQQL Sbjct: 516 ELENELSKMKCETKLRQEAELQHAAKYNEELKVQQDKELSIAACKFAECQKTIASLGQQL 575 Query: 373 KSLATLEDFMMDSDEKPLDINVEGGSLLTPN 281 
KSLATLEDF++DSD KPL++ V+GG T N Sbjct: 576 KSLATLEDFLIDSD-KPLEL-VDGGLKCTGN 604 >ref|XP_010664206.1| PREDICTED: filament-like plant protein [Vitis vinifera] gi|731428065|ref|XP_010664207.1| PREDICTED: filament-like plant protein [Vitis vinifera] gi|731428067|ref|XP_010664208.1| PREDICTED: filament-like plant protein [Vitis vinifera] gi|731428069|ref|XP_010664209.1| PREDICTED: filament-like plant protein [Vitis vinifera] Length = 689 Score = 513 bits (1321), Expect = e-142 Identities = 307/616 (49%), Positives = 398/616 (64%), Gaps = 23/616 (3%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTSK ATS +EVNDN+++LTEKLSAALLN+ AK+DLVKQHAKVAEEAV+GWEKAENE Sbjct: 51 PEVTSKVATSGDEVNDNVKSLTEKLSAALLNVGAKDDLVKQHAKVAEEAVAGWEKAENEV 110 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 KQQLE AVQ+N LEDRV LDGA+KECVRQLRQA+EEQE+KI EAVV+KT EWEST Sbjct: 111 VVLKQQLEAAVQENLVLEDRVSRLDGAIKECVRQLRQAREEQEEKISEAVVKKTREWEST 170 Query: 1612 KTELESQLDELRKKLDAAKAEP----------------------KISSTDLYPKLEAAEK 1499 K ELESQL EL+ ++DAAKAEP K +L+A EK Sbjct: 171 KFELESQLLELQTQVDAAKAEPPVPFDPDLCHMLQALEKQNSALKYELLSQSEELQALEK 230 Query: 1498 ENSTLKLELSAQAEELEIMTLERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARK 1319 ENSTLKLEL +Q+EELEI T+ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A ARK Sbjct: 231 ENSTLKLELLSQSEELEIRTIERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAMARK 290 Query: 1318 ANN-NDQKSFTIASSSSICVESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEP 1142 +++ +D +S ++SS+ +ES TDSQSD G++L ++ L +++ ++N+ EP Sbjct: 291 SSSIHDHRS---VAASSLHIESLTDSQSDNGEQLNMVDISLH------QTSSFDVNDCEP 341 Query: 1141 GCSDSWASALITELDQFKSKEKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESA 962 CSDSWASALI ELDQFK+++ V+ +I MDDFLEMERLAALP+ E S Sbjct: 342 SCSDSWASALIAELDQFKNEKVVSRN---LPASSIEIDLMDDFLEMERLAALPQAEHGSR 398 Query: 961 VASCQQHNDEKGQDRDQLKADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTS 782 Q ++ + L+A+L+ M R +Q+ + S Sbjct: 399 SLESQAVTNQTSNEDSSLRAELETMTHRMAELEEKLEKMEAEKAELEIALTVSQDCIEAS 458 Query: 781 
EDQLRETEEKLLELQNKMGLEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKV 602 + QLRE E KL E+Q E++ AN ++A+ESQ++ ME E T+ A+V Sbjct: 459 KIQLREAEMKLEEMQK--------------ELDFANESKQALESQLIAMEAEARTMSARV 504 Query: 601 DSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAG 422 DSLE +++ E AMSA K++ E + ++A +SN E K+KQ+ ELA+AAG Sbjct: 505 DSLEAEIKKEHAMSAEIGVKCQELEDELLKKKQELKFQQAASSNSERKVKQE-ELAIAAG 563 Query: 421 KLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSND 242 KLAECQKTIASLG+QLKSLATLEDF+ D+ ++ G + + GE W+LHSND Sbjct: 564 KLAECQKTIASLGKQLKSLATLEDFLTDAG----NLADFSGKSVISTAAGGETWQLHSND 619 Query: 241 AYSTANQAAEDGSGLS 194 + A D S +S Sbjct: 620 TFLPRRSA--DSSNMS 633 >ref|XP_010025716.1| PREDICTED: filament-like plant protein 3 [Eucalyptus grandis] gi|702451484|ref|XP_010025717.1| PREDICTED: filament-like plant protein 3 [Eucalyptus grandis] gi|702451488|ref|XP_010025718.1| PREDICTED: filament-like plant protein 3 [Eucalyptus grandis] gi|629096439|gb|KCW62434.1| hypothetical protein EUGRSUZ_H05077 [Eucalyptus grandis] Length = 621 Score = 502 bits (1292), Expect = e-139 Identities = 298/572 (52%), Positives = 379/572 (66%), Gaps = 8/572 (1%) Frame = -2 Query: 1969 EVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEAA 1790 EVTSK A + EEV + +R L++KLSAALLNISAKE+LVKQHAKVAEEAVSGWEKAENE + Sbjct: 48 EVTSKVAVADEEVGEGVRTLSDKLSAALLNISAKEELVKQHAKVAEEAVSGWEKAENEVS 107 Query: 1789 AFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWESTK 1610 K+QLE A Q+NS LEDR+ HLDGALKECVRQLRQ +EEQEQKI E VV+KTHEWESTK Sbjct: 108 VLKKQLEVATQRNSTLEDRISHLDGALKECVRQLRQVREEQEQKIQETVVKKTHEWESTK 167 Query: 1609 TELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTLE 1433 ELE++L + +L AAK+E + +DL PKL+AAEKEN LK ++ + +EELE+ +E Sbjct: 168 AELETKLSNVHAQLQAAKSEASSVICSDLGPKLDAAEKENVALKAKVLSMSEELELRIIE 227 Query: 1432 RDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVES 1256 RDLS QAAE+A+KQ LESI+KVA+LEAECRRLRA +RKA+ ND KSF S+SS+CVES Sbjct: 228 RDLSTQAAETASKQHLESIKKVARLEAECRRLRAMSRKASAANDLKSF---SASSVCVES 
284 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 F DSQSD GDRLLA+END+ ++ +E ++ EP S+SWASALITELD FK KEK Sbjct: 285 FADSQSDVGDRLLAVENDVQ------KASCLEPSDCEPCHSESWASALITELDHFK-KEK 337 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKADL 896 G + D+ MDDFLEMERLAALP+ E+ S D+ D LK++L Sbjct: 338 SFGKSLMVSSGELDL--MDDFLEMERLAALPDAESGSCSHGMGPSLDQNRSDEVTLKSEL 395 Query: 895 DAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLEA 716 +AMI RT + Q +L TS QL E E +L EL NK+ Sbjct: 396 EAMINRTAELEEELEKKEEEKEKLEMALSQCQKQLETSWSQLNEVEMRLTELNNKLSAAQ 455 Query: 715 VAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXX 536 +KQA E E + + + ES+++ +EEEV L V L+ +VE ERA+SA Sbjct: 456 KSKQAAEEEARITHERMEVTESRLMDVEEEVKNLLLNVKLLQEEVERERALSAENEAKCQ 515 Query: 535 XXXXXXXKRR------FETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQL 374 + E E +R S+ +LK+KQ++ELAVAA + AECQKTI+SL QQL Sbjct: 516 ELEDENLNMKRDAELQHEIELQRVAVSDEQLKVKQEQELAVAASRFAECQKTISSLAQQL 575 Query: 373 KSLATLEDFMMDSDEKPLDINVEGGSLLTPNK 278 KSLA ++DF DSD P+ +++ G L N+ Sbjct: 576 KSLAAVDDFFADSD--PMCNHMDEGLLSPENE 605 >ref|XP_012078241.1| PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas] gi|802635970|ref|XP_012078242.1| PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas] gi|802636033|ref|XP_012078243.1| PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas] Length = 682 Score = 499 bits (1285), Expect = e-138 Identities = 302/602 (50%), Positives = 403/602 (66%), Gaps = 7/602 (1%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTSK+ E+VND++R LTEKLSAAL+N+SAK+DLVKQH+KVAEEAV+GWEKAENE Sbjct: 52 PEVTSKTVVRDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENEV 111 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 AA K+QLE A+Q+N ALEDRV HLDGALKECVRQLRQA+EE E+K++EAV +KT EWES Sbjct: 112 AALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWESV 171 Query: 1612 
KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K+ELE+QL EL+ K +A K+E P DL+ KLE EK+N++LKLE+ + +EELE+ + Sbjct: 172 KSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRII 231 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259 ERDLS QAAE+A+KQ L+SI+KVAKLEAECRRL+A A K+++ ND K+ + +SS+ VE Sbjct: 232 ERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKT---SIASSMYVE 288 Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079 S TDSQSD G+RL A+E D HK+ + +E ++ EP CSDSWASALI ELDQFK+++ Sbjct: 289 SLTDSQSDSGERLNAVELD-AHKI-----SCLEPSKCEPSCSDSWASALIAELDQFKNEK 342 Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899 V R L +I MDDFLEMERLA+LPE E+ + + + + L+A+ Sbjct: 343 AV--NRNL-PASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAE 399 Query: 898 LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719 L+ MI RT + E L + EK E Q ++G Sbjct: 400 LEIMIHRTAELEKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEA 459 Query: 718 AVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXX 539 + + E+ AN ++ +ESQ++ ME E T+ +KVDSLE ++E E+ +SA Sbjct: 460 ELKMKQLHQELSIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKC 519 Query: 538 XXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLAT 359 ++ E E +++ +SNGELKIKQ+ +LAVAAGKLAECQKTIASLG+QLKSLAT Sbjct: 520 RTLEEELSEKNKEVELQKSASSNGELKIKQE-DLAVAAGKLAECQKTIASLGKQLKSLAT 578 Query: 358 LEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLS 194 LEDF++D+ P G L P K++ E WKLHS+D S++++ A + SG S Sbjct: 579 LEDFLIDTASLP---EFTAGGALMP-KATEEPWKLHSSDTLSPKRDSSSSRIASENSGPS 634 Query: 193 RN 188 N Sbjct: 635 VN 636 >ref|XP_012078245.1| PREDICTED: filament-like plant protein isoform X2 [Jatropha curcas] gi|802636039|ref|XP_012078246.1| PREDICTED: filament-like plant protein isoform X2 [Jatropha curcas] gi|643723198|gb|KDP32803.1| hypothetical protein JCGZ_12095 [Jatropha curcas] Length = 681 Score = 499 bits (1285), Expect = e-138 Identities = 302/602 (50%), Positives = 
403/602 (66%), Gaps = 7/602 (1%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTSK+ E+VND++R LTEKLSAAL+N+SAK+DLVKQH+KVAEEAV+GWEKAENE Sbjct: 51 PEVTSKTVVRDEDVNDSVRILTEKLSAALVNVSAKDDLVKQHSKVAEEAVAGWEKAENEV 110 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 AA K+QLE A+Q+N ALEDRV HLDGALKECVRQLRQA+EE E+K++EAV +KT EWES Sbjct: 111 AALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAREEHEEKVYEAVTKKTIEWESV 170 Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K+ELE+QL EL+ K +A K+E P DL+ KLE EK+N++LKLE+ + +EELE+ + Sbjct: 171 KSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKDNASLKLEILSLSEELELRII 230 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259 ERDLS QAAE+A+KQ L+SI+KVAKLEAECRRL+A A K+++ ND K+ + +SS+ VE Sbjct: 231 ERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKSSSLNDHKT---SIASSMYVE 287 Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079 S TDSQSD G+RL A+E D HK+ + +E ++ EP CSDSWASALI ELDQFK+++ Sbjct: 288 SLTDSQSDSGERLNAVELD-AHKI-----SCLEPSKCEPSCSDSWASALIAELDQFKNEK 341 Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899 V R L +I MDDFLEMERLA+LPE E+ + + + + L+A+ Sbjct: 342 AV--NRNL-PASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTDVESSLRAE 398 Query: 898 LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719 L+ MI RT + E L + EK E Q ++G Sbjct: 399 LEIMIHRTAELEKQLQKMEGEKVELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEA 458 Query: 718 AVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXX 539 + + E+ AN ++ +ESQ++ ME E T+ +KVDSLE ++E E+ +SA Sbjct: 459 ELKMKQLHQELSIANESKQQIESQLVSMEVEARTMASKVDSLEAELEKEKVLSAELAVKC 518 Query: 538 XXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLAT 359 ++ E E +++ +SNGELKIKQ+ +LAVAAGKLAECQKTIASLG+QLKSLAT Sbjct: 519 RTLEEELSEKNKEVELQKSASSNGELKIKQE-DLAVAAGKLAECQKTIASLGKQLKSLAT 577 Query: 358 LEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQAAEDGSGLS 194 LEDF++D+ P G L P K++ E WKLHS+D S++++ A + SG S 
Sbjct: 578 LEDFLIDTASLP---EFTAGGALMP-KATEEPWKLHSSDTLSPKRDSSSSRIASENSGPS 633 Query: 193 RN 188 N Sbjct: 634 VN 635 >ref|XP_011031090.1| PREDICTED: filament-like plant protein 3 [Populus euphratica] gi|743861318|ref|XP_011031091.1| PREDICTED: filament-like plant protein 3 [Populus euphratica] Length = 672 Score = 498 bits (1283), Expect = e-138 Identities = 301/603 (49%), Positives = 403/603 (66%), Gaps = 9/603 (1%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEVTS++ + E++ DN+R LT+KLSAALLN+SAKE+LVKQHAKVAEEAVSGWEKAE E Sbjct: 47 PEVTSEAVLTDEDIRDNVRTLTDKLSAALLNLSAKEELVKQHAKVAEEAVSGWEKAEKEL 106 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 +A K+Q+E A +KNS LEDRV HLD ALKECVRQLRQ++EEQE++I+EAV +K EWEST Sbjct: 107 SALKKQIEAATKKNSGLEDRVSHLDAALKECVRQLRQSREEQERRINEAVTKKICEWEST 166 Query: 1612 KTELESQLDELRKKLDAAKAEPKISS-TDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K+ELE+QL EL+ +L AK++ +S+ ++L+ KL A EKEN +LK EL ++AEE+++ L Sbjct: 167 KSELEAQLIELQARLQTAKSDATVSADSELWQKLNAVEKENLSLKHELFSRAEEIQVRIL 226 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKAN-NNDQKSFTIASSSSICVE 1259 ERDLS QAAE+A+K +LES++K+AKLEAECR+L+A ARKA+ ND KS T +SSIC E Sbjct: 227 ERDLSTQAAETASKLQLESLKKLAKLEAECRKLKAMARKASAANDHKSLT---ASSICAE 283 Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079 SFTDSQSD G+RLLA+E+ D+ + +E+NE E CSDSWA A ELDQ K+++ Sbjct: 284 SFTDSQSDNGERLLAVES------DSCKRSGLEMNECEQICSDSWACAHAIELDQSKNQK 337 Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQLKAD 899 + GR + +I MDDFLEMERLAALP+TE+ + +D+ + LK + Sbjct: 338 PI--GRNV-MVPSLEINLMDDFLEMERLAALPDTESGISYLEAGPVSDKGNGSGNPLKEE 394 Query: 898 LDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGLE 719 L+ MI RT E Q +L T QL+E + K+ ELQ + L Sbjct: 395 LECMINRTTELEEKLDKMEEEKFKSEMALTECQRQLETLRSQLKEADAKIGELQGLLTLA 454 Query: 718 AVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXXXX 539 ++QA E EI+ ++++RK ESQ+ + E E+ TL +K+ SL+ +VE ERA+SA 
Sbjct: 455 NESRQAREEEIKRSDSRRKETESQLRIAEAEIKTLLSKIVSLDAEVEKERALSAENAAKS 514 Query: 538 XXXXXXXXKRR------FETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLGQQ 377 K + E ER+R + N ELKI Q+KELAVAA KLAECQKTI+SLG Q Sbjct: 515 QELEDELSKMKCEVELQHEIERKRIASFNEELKITQEKELAVAASKLAECQKTISSLGLQ 574 Query: 376 LKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLH-SNDAYSTANQAAEDGSG 200 LKSLATLED + DSD K D++ E + + +GE W+L N + ++A E G Sbjct: 575 LKSLATLED-LFDSD-KSSDVSSEE----SKDHENGERWRLDLGNQSSGRESEAIEVTGG 628 Query: 199 LSR 191 R Sbjct: 629 ALR 631 >ref|XP_010242808.1| PREDICTED: filament-like plant protein isoform X3 [Nelumbo nucifera] Length = 599 Score = 495 bits (1275), Expect = e-137 Identities = 298/550 (54%), Positives = 373/550 (67%), Gaps = 6/550 (1%) Frame = -2 Query: 1819 GWEKAENEAAAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVV 1640 GWEKAENE A KQ+LE+A QKNS LEDRV HLDGALKECVRQLRQA+EEQEQKIHEAVV Sbjct: 23 GWEKAENEVVALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAREEQEQKIHEAVV 82 Query: 1639 QKTHEWESTKTELESQLDELRKKLDAAKAEPKISSTDLYPKLEAAEKENSTLKLELSAQA 1460 +KT EWES K ELESQ+ L+ +++AAK E +S DL KLE+AEK+N+ LKLEL ++ Sbjct: 83 EKTKEWESVKLELESQVVNLQSQVEAAKLEAAANS-DLCSKLESAEKKNAALKLELLSRV 141 Query: 1459 EELEIMTLERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKA-NNNDQKSFTIA 1283 EELEI TLERDLS Q AE+A+KQ LESI+KVAKLEAECRRLRA +RKA + ND +S T Sbjct: 142 EELEIRTLERDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPSANDHRSVT-- 199 Query: 1282 SSSSICVESFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITE 1103 +SS VES TDSQSD G+RLL +E D HK+ +++ELN+ E SDSWASALI E Sbjct: 200 -ASSFYVESLTDSQSDSGERLLGMEID-THKM-----SSMELNDGEASYSDSWASALIAE 252 Query: 1102 LDQFKSKEKVAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQ 923 LDQFK + + GR L +I MDDFLEMERLAALPETE+ ++G+ Sbjct: 253 LDQFKQDKAI--GRNL-TTSSVEIDLMDDFLEMERLAALPETESGDPEPVAVPDQIDRGE 309 Query: 922 DRDQLKADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLE 743 LKA+L+ MIQR+ E Q++L S +QL+ EEKL+E Sbjct: 310 --SSLKAELETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVE 367 Query: 742 
LQNKMGLEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAM 563 LQ + L KQ TE ++E+ NT+++ +ES+++ + E+ LR KV SLE ++E ER + Sbjct: 368 LQRCLDLANNLKQTTEEKLETINTQKEVIESRLVGADAEIRALRGKVGSLESEIEKERTL 427 Query: 562 SAXXXXXXXXXXXXXXKRRFETERRRATNSNGELKIKQDKELAVAAGKLAECQKTIASLG 383 S K++ E E RA+ SNGELKIKQ+KELAVAAGKL ECQKTIASLG Sbjct: 428 SEEIVVKCRKLEDELTKKKHEAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLG 487 Query: 382 QQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNKSSGEYWKLHSNDAY-----STANQA 218 +QLKSLATLEDF++D EKPLD+ V +P G+ WKLHSNDA+ + +++ Sbjct: 488 RQLKSLATLEDFLIDY-EKPLDLTVG-----SPIPKGGDLWKLHSNDAHLPKAEAYSSKI 541 Query: 217 AEDGSGLSRN 188 A DGSG S N Sbjct: 542 AGDGSGPSTN 551 >gb|KHF97687.1| Filament-like plant protein [Gossypium arboreum] gi|728836668|gb|KHG16111.1| Filament-like plant protein [Gossypium arboreum] Length = 702 Score = 485 bits (1249), Expect = e-134 Identities = 304/630 (48%), Positives = 395/630 (62%), Gaps = 35/630 (5%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEV+SK++TS E+V D ++ LT+KLSAAL+NISAKE+LVKQH+KVAEEA++GWE AENE Sbjct: 53 PEVSSKASTSSEDVTDGVKILTQKLSAALVNISAKENLVKQHSKVAEEAIAGWENAENEV 112 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 KQ+LE ++Q+N ALEDRV HLDGALKECVRQLRQ +EEQE+KI EA+ + +WE+T Sbjct: 113 VVLKQKLEASIQQNLALEDRVSHLDGALKECVRQLRQVREEQEEKISEAIAKAAQDWETT 172 Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K EL+S+L +L+ K +A ++ P ++ K+E EK+N+ LKLELS+Q EE+EI T+ Sbjct: 173 KFELKSRLLDLQAKSEAINSKLPPQVGPEVRRKIEDLEKKNADLKLELSSQLEEMEIRTI 232 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANNNDQKSFTIASSSSICVES 1256 ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A A K+ N SSI VE Sbjct: 233 ERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIAGKSPN----------ISSIYVEL 282 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 TDSQSD G+R+ +E D HK+ I S E N+ E CSDSWASALI ELDQFK+++ Sbjct: 283 LTDSQSDSGERVNLVEID-THKM--ICS---EANKGELSCSDSWASALIAELDQFKNEKT 336 Query: 1075 
VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQ--LKA 902 V R+L +I MDDFLEMERLAALPET++++ + D D LKA Sbjct: 337 V--NRSL-PGSSIEIDIMDDFLEMERLAALPETKSKNQCLESKATAKVSNNDGDSLLLKA 393 Query: 901 DLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGL 722 +L+AMI RT + Q L SE QLR+T KL ELQ ++ + Sbjct: 394 ELEAMIHRTTELEKKLEKIEVEKAELETALTKTQESLNESELQLRDTGLKLEELQRELSM 453 Query: 721 EAVAKQ---------------------ATELEIE-----------SANTKRKAMESQILV 638 AKQ + E EIE +AN +K +ESQ++ Sbjct: 454 ANEAKQNLESQLRNMEADVETMSSKIESLEKEIEKESTLSAEVSVNANESKKMLESQLIS 513 Query: 637 MEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELK 458 +E E T+ AK+DSLE +VE ERA+SA +++ ETE ++ NSN E+K Sbjct: 514 IEVEARTMSAKIDSLETEVEKERALSAQITVKCQELEEELSRKKQETELQQTVNSNVEVK 573 Query: 457 IKQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNK 278 IKQ+ +L AAGKLAECQ+TIASLGQQLKSLATLEDF++DS P G L P K Sbjct: 574 IKQE-DLTAAAGKLAECQRTIASLGQQLKSLATLEDFLIDSASIP---EFPKGRSLIP-K 628 Query: 277 SSGEYWKLHSNDAYSTANQAAEDGSGLSRN 188 + GE W LHSN+ +S + +N Sbjct: 629 AGGEPWNLHSNETFSPKRDPESPRTSFDKN 658 >ref|XP_012455846.1| PREDICTED: filament-like plant protein isoform X2 [Gossypium raimondii] Length = 604 Score = 484 bits (1246), Expect = e-133 Identities = 289/543 (53%), Positives = 368/543 (67%), Gaps = 8/543 (1%) Frame = -2 Query: 1885 LNISAKEDLVKQHAKVAEEAVSGWEKAENEAAAFKQQLETAVQKNSALEDRVGHLDGALK 1706 +NISAKE+LVKQHAKVAEEAVSGWEKAE + A KQQL+ A++KN+ALEDRVGHLDGALK Sbjct: 1 MNISAKEELVKQHAKVAEEAVSGWEKAEKDVVALKQQLDAAMKKNAALEDRVGHLDGALK 60 Query: 1705 ECVRQLRQAKEEQEQKIHEAVVQKTHEWESTKTELESQLDELRKKLDAAKAEPKIS-STD 1529 ECVRQLRQA+EEQE+KIHEAV +K HEWES+K+ELESQL L+ +L+ AK + S D Sbjct: 61 ECVRQLRQAREEQERKIHEAVSKKCHEWESSKSELESQLLNLKAQLETAKNDTAASVDPD 120 Query: 1528 LYPKLEAAEKENSTLKLELSAQAEELEIMTLERDLSVQAAESAAKQRLESIRKVAKLEAE 1349 L KL+A EKENS LKL+L ++AEELE +ERDLS QAAE+A+KQ LESI+K+AKLE E Sbjct: 121 LQLKLDAFEKENSALKLQLHSRAEELERRIIERDLSTQAAETASKQHLESIKKLAKLEIE 180 Query: 1348 CRRLRASARKAN-NNDQKSFTIASSSSICVESFTDSQSDGGDRLLAIENDLPHKLDNISS 
1172 CRRL+A ARKA+ NDQKS+ +SSICVESFTDSQSD G+RLLA+E D+ Sbjct: 181 CRRLKAIARKASPANDQKSY---PASSICVESFTDSQSDSGERLLAVETDMQ------KM 231 Query: 1171 NAVELNESEPGCSDSWASALITELDQFKSKEKVAGGRALXXXXXADICFMDDFLEMERLA 992 N +E+N + SD+WASALITELDQF+ KEK G + +I MDDFLEMERLA Sbjct: 232 NGLEMNGCDRSSSDAWASALITELDQFR-KEKAVGRNIM--APSVEINLMDDFLEMERLA 288 Query: 991 ALPETETESAVASCQQHNDEKGQDRDQLKADLDAMIQRTXXXXXXXXXXXXXXXXXXXXX 812 ALP+TE+ S + + + LKADL+ ++ R Sbjct: 289 ALPDTESGSGFNDAGPVSYQTSIVENPLKADLETLVHRVAELEEKLALTEEEKSEMQIAF 348 Query: 811 XENQNRLGTSEDQLRETEEKLLELQNKMGLEAVAKQATELEIESANTKRKAMESQILVME 632 E+Q +L T ++QL E E + ++Q ++ L +KQA E E++ AN R+ ES++ E Sbjct: 349 TESQKQLKTLQNQLSEAEIRFKDVQTQLALADNSKQAAEKEVKVANMNREVAESRLRDAE 408 Query: 631 EEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRR------ATNSN 470 E+ TL +KV SLE + E+A+S K + ET+ R+ A N Sbjct: 409 TEIKTLMSKVTSLEEALGKEQALSTENMNKCKELENELSKMKCETKLRQEAELQHAAKYN 468 Query: 469 GELKIKQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLL 290 ELK++QDKEL++AA K AECQKTIASLGQQLKSLATLEDF++DSD KPL++ V+GG Sbjct: 469 EELKVQQDKELSIAACKFAECQKTIASLGQQLKSLATLEDFLIDSD-KPLEL-VDGGLKC 526 Query: 289 TPN 281 T N Sbjct: 527 TGN 529 >ref|XP_010093113.1| hypothetical protein L484_007922 [Morus notabilis] gi|587863800|gb|EXB53551.1| hypothetical protein L484_007922 [Morus notabilis] Length = 643 Score = 481 bits (1238), Expect = e-132 Identities = 295/559 (52%), Positives = 371/559 (66%), Gaps = 10/559 (1%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEV SK+A + E N++++ LT+KLSAAL +ISAKEDLVKQHAKVAEEAVSGWE AENE Sbjct: 47 PEVMSKAAPNDEYSNESVKTLTDKLSAALRSISAKEDLVKQHAKVAEEAVSGWENAENEV 106 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 KQ+LE A QKNS LEDR+GHLDGALKECVRQLRQA+EEQEQKIH+AV +KTHEWES Sbjct: 107 LILKQKLEAANQKNSVLEDRLGHLDGALKECVRQLRQAREEQEQKIHDAVAKKTHEWESL 166 Query: 1612 KTELESQLDELRKKLDAAKAEPKIS-STDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K+ L+SQL EL+ +L K E +DL KLEAAEK+NS LKLEL ++AEELEI + 
Sbjct: 167 KSLLQSQLLELQVELQNVKTEAAAPIDSDLQAKLEAAEKQNSALKLELLSKAEELEIRII 226 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANN-NDQKSFTIASSSSICVE 1259 ERDLS +AAE+A+KQ LESI+KVAKLEAECRRL+A ARK + N+QKS SSSS+ VE Sbjct: 227 ERDLSTKAAETASKQHLESIKKVAKLEAECRRLKAMARKVSQVNNQKS---GSSSSVYVE 283 Query: 1258 SFTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKE 1079 S TDSQSD G+RLL IE+ + ++ELNE EP S S AS+L+TE QF++ E Sbjct: 284 SLTDSQSDSGERLLTIES------GTLKMGSLELNECEPSDSGSCASSLVTE-HQFRN-E 335 Query: 1078 KVAGGRALXXXXXADICFMDDFLEMERLAALPETETES--AVASCQQHNDEKGQDRDQLK 905 K+ G + +I MDDFLEMERLAALP + ES VA H G+ R K Sbjct: 336 KIIGKNRM--VPSIEINLMDDFLEMERLAALPVRDIESGFTVAGSASHQPIGGESR--FK 391 Query: 904 ADLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMG 725 LDAMIQR + L TS+ QL E++L ELQ ++ Sbjct: 392 TKLDAMIQRIAELEDKLEKIEMEKVELEVALSLCEKHLETSQSQLLVAEKRLKELQKQLV 451 Query: 724 LEAVAKQATELEIESANTKRKAMESQILVMEEEVATLRAKVDSLEMDVEDERAMSAXXXX 545 L +K+A E E + TK++ ESQ+ V+E E+ L +K+ SLE +V+ ERA+SA Sbjct: 452 LANESKRAAEEEERATRTKQELAESQLRVVENEINALLSKIGSLEEEVQKERALSADNVA 511 Query: 544 XXXXXXXXXXKRRFETERR------RATNSNGELKIKQDKELAVAAGKLAECQKTIASLG 383 + E E + R ++N LKIKQ+KEL++AA K AECQKTIASLG Sbjct: 512 RCQKMENELLIVKREAENKQEAELERIQSANVNLKIKQEKELSLAADKFAECQKTIASLG 571 Query: 382 QQLKSLATLEDFMMDSDEK 326 QQLKSLA+LED ++D +++ Sbjct: 572 QQLKSLASLEDVLLDPEKQ 590 >ref|XP_012446394.1| PREDICTED: filament-like plant protein isoform X3 [Gossypium raimondii] gi|763792239|gb|KJB59235.1| hypothetical protein B456_009G245900 [Gossypium raimondii] Length = 668 Score = 478 bits (1231), Expect = e-132 Identities = 299/615 (48%), Positives = 392/615 (63%), Gaps = 35/615 (5%) Frame = -2 Query: 1972 PEVTSKSATSVEEVNDNMRNLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWEKAENEA 1793 PEV+SK++T+ E+V D+++ LTEKLSAAL+NISAKEDLVKQH+KVAEEA++GWE AENE Sbjct: 19 PEVSSKASTNSEDVTDSVKILTEKLSAALVNISAKEDLVKQHSKVAEEAIAGWENAENEV 78 Query: 1792 AAFKQQLETAVQKNSALEDRVGHLDGALKECVRQLRQAKEEQEQKIHEAVVQKTHEWEST 1613 KQ+LE ++Q+N LEDRV 
HLDGALKECVRQLRQA+EEQE+KI EA+ + +WE+T Sbjct: 79 VVLKQKLEASIQQNLTLEDRVSHLDGALKECVRQLRQAREEQEEKIGEAIAKAAQDWETT 138 Query: 1612 KTELESQLDELRKKLDAAKAE-PKISSTDLYPKLEAAEKENSTLKLELSAQAEELEIMTL 1436 K ELES+L +L+ K +A ++ P +++ K+E EK+N+ LKLELS+Q EE+EI T+ Sbjct: 139 KLELESKLLDLQAKSEAINSKLPPQVGPEVWRKIEDLEKKNADLKLELSSQLEEMEIRTI 198 Query: 1435 ERDLSVQAAESAAKQRLESIRKVAKLEAECRRLRASARKANNNDQKSFTIASSSSICVES 1256 ERDLS QAAE+A+KQ LESI+KVAKLEAECRRL+A A K+ N SSI V Sbjct: 199 ERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIAGKSLN----------ISSIYVGP 248 Query: 1255 FTDSQSDGGDRLLAIENDLPHKLDNISSNAVELNESEPGCSDSWASALITELDQFKSKEK 1076 TDSQSD G+R+ +E D HK+ I S E N+ E CSDSWASALI ELDQFK+++ Sbjct: 249 LTDSQSDSGERVNLVEID-THKM--ICS---EANKGELSCSDSWASALIAELDQFKNEKT 302 Query: 1075 VAGGRALXXXXXADICFMDDFLEMERLAALPETETESAVASCQQHNDEKGQDRDQ--LKA 902 V R+L +I MDDFLEMERLAALP T++++ + D D LKA Sbjct: 303 V--NRSL-PGSSIEIDIMDDFLEMERLAALPATKSKNQCLELKATAKVSNNDGDSLLLKA 359 Query: 901 DLDAMIQRTXXXXXXXXXXXXXXXXXXXXXXENQNRLGTSEDQLRETEEKLLELQNKMGL 722 +L+AMI RT + + L S+ QLR++ KL ELQ ++ + Sbjct: 360 ELEAMIHRTTELEKKLEKIEVEKAELETALTKTRESLNESKLQLRDSGLKLEELQRELSM 419 Query: 721 EAVAKQ---------------------ATELEIE-----------SANTKRKAMESQILV 638 AKQ + E EIE +AN +K +ESQ++ Sbjct: 420 VNEAKQNLESQLRNMEADVETMSSKIESLEKEIEKERTLSAEVSVNANESKKMLESQLIS 479 Query: 637 MEEEVATLRAKVDSLEMDVEDERAMSAXXXXXXXXXXXXXXKRRFETERRRATNSNGELK 458 +E E T+ AK+DSLE +VE ERA+SA +++ ETE ++ NSN E+K Sbjct: 480 IEVEARTMSAKIDSLETEVEKERALSAQITVKCQELEEELSRKKQETELQQTVNSNVEVK 539 Query: 457 IKQDKELAVAAGKLAECQKTIASLGQQLKSLATLEDFMMDSDEKPLDINVEGGSLLTPNK 278 IKQ+ +L AAGKLAECQ+TIASLGQQLKSLATLEDF++DS P G L P + Sbjct: 540 IKQE-DLTAAAGKLAECQRTIASLGQQLKSLATLEDFLIDSASIP---EFPKGRSLIP-E 594 Query: 277 SSGEYWKLHSNDAYS 233 + GE W LHSN+ +S Sbjct: 595 AGGEPWNLHSNETFS 609