BLASTX nr result

ID: Forsythia22_contig00021870 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00021870
         (2078 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011097046.1| PREDICTED: filament-like plant protein 3 [Se...   627   e-177
ref|XP_010652946.1| PREDICTED: filament-like plant protein [Viti...   531   e-148
emb|CAN83687.1| hypothetical protein VITISV_031800 [Vitis vinifera]   518   e-144
emb|CDP04584.1| unnamed protein product [Coffea canephora]            516   e-143
ref|XP_010093113.1| hypothetical protein L484_007922 [Morus nota...   505   e-140
ref|XP_007048554.1| Uncharacterized protein isoform 2, partial [...   505   e-140
ref|XP_007048553.1| Uncharacterized protein isoform 1 [Theobroma...   505   e-140
gb|KDO52203.1| hypothetical protein CISIN_1g006305mg [Citrus sin...   502   e-139
ref|XP_010242807.1| PREDICTED: filament-like plant protein isofo...   491   e-136
gb|KHG29921.1| Filament-like plant protein [Gossypium arboreum]       491   e-136
ref|XP_010242801.1| PREDICTED: filament-like plant protein isofo...   491   e-136
ref|XP_006432073.1| hypothetical protein CICLE_v10000549mg [Citr...   491   e-136
ref|XP_012455840.1| PREDICTED: filament-like plant protein isofo...   486   e-134
ref|XP_009764915.1| PREDICTED: filament-like plant protein 3 iso...   481   e-133
ref|XP_008227685.1| PREDICTED: filament-like plant protein [Prun...   471   e-129
ref|XP_012078245.1| PREDICTED: filament-like plant protein isofo...   456   e-125
ref|XP_012078241.1| PREDICTED: filament-like plant protein isofo...   456   e-125
ref|XP_007019074.1| Filament-like plant protein, putative isofor...   455   e-125
ref|XP_012450685.1| PREDICTED: filament-like plant protein [Goss...   449   e-123
ref|XP_004503890.1| PREDICTED: filament-like plant protein [Cice...   447   e-122

>ref|XP_011097046.1| PREDICTED: filament-like plant protein 3 [Sesamum indicum]
            gi|747098110|ref|XP_011097047.1| PREDICTED: filament-like
            plant protein 3 [Sesamum indicum]
          Length = 619

 Score =  627 bits (1618), Expect = e-177
 Identities = 363/593 (61%), Positives = 427/593 (72%), Gaps = 10/593 (1%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSDDQ  S  NT SPEV SKAAP  ++LN+SVK LSEKLSEALLNIRAKEDLVKQHAK
Sbjct: 32   ERFSDDQTLSALNTQSPEVTSKAAPPGDELNDSVKALSEKLSEALLNIRAKEDLVKQHAK 91

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWERAENEV VLKKQ +AL QKN +LEERVGHLDGALK              +
Sbjct: 92   VAEEAVSGWERAENEVSVLKKQNDALAQKNLILEERVGHLDGALKECLRQLRQAREEQEE 151

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSA----KTEV----TKLETMEKENTILK 1563
            KIYDA ++K  EWES KSEL+N+LVEL +QLQSA    KT +    +KL+  EKEN+ILK
Sbjct: 152  KIYDAVAKKGCEWESKKSELENKLVELHAQLQSATDADKTMLAEVRSKLDAAEKENSILK 211

Query: 1562 RELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPANY 1383
             +L S+AEEL+L T ERDLS +AAE ASKQHLDSIK+ AKLEAECRRLKA+A K + AN 
Sbjct: 212  LKLLSKAEELELRTSERDLSIQAAETASKQHLDSIKRVAKLEAECRRLKALARKGTLAND 271

Query: 1382 LPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPE--SQVLVAELD 1209
              S T+S+ YVESF DSQSDS +R LVI  + CK  + EP      HP+  S  L  E+ 
Sbjct: 272  QRSGTASSFYVESFTDSQSDSAERALVIENDGCKSVDSEP-----RHPDSWSSALATEIG 326

Query: 1208 QFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALGGKQPLKAELQT 1029
             FK+ER LGR+ +V SVEIDLMDDFLEME++AALPET SGS P       +  L+ EL+ 
Sbjct: 327  HFKHERTLGRSLIVPSVEIDLMDDFLEMEKIAALPETNSGSDPTARVDDREASLQDELEA 386

Query: 1028 LINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQLAMVNEA 849
            +I RT             K+NLE ALSECQ QLK S +QLK TE+KLV+L  QLA+ NEA
Sbjct: 387  MIRRTTELEEKLKMMTAEKVNLEFALSECQIQLKASGDQLKATEVKLVELNKQLALANEA 446

Query: 848  KRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTSNVKKQXX 669
            +R AE EVE    KL  +T  L+EAEV+L++ Q +LI ANE KS+VE  L+  N+KK   
Sbjct: 447  RRDAEKEVENTTVKLKNATNLLDEAEVNLLKIQVQLIEANEAKSIVEVALEEGNLKKAEA 506

Query: 668  XXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDSQLQRSAV 489
                         LHSSI TLE+EV+KER +S EAVA+C  LE E+SRMK DSQ QRSA+
Sbjct: 507  ESEIKVMKLELETLHSSISTLEKEVEKERNLSREAVAKCDILEAELSRMKSDSQFQRSAI 566

Query: 488  IEGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDSEQPIELL 330
            IE F+INQDKELAVAASKF ECQKTIASL RQLKSLATL+DFLIDS+ P+ +L
Sbjct: 567  IEEFRINQDKELAVAASKFAECQKTIASLDRQLKSLATLEDFLIDSDSPVAVL 619


>ref|XP_010652946.1| PREDICTED: filament-like plant protein [Vitis vinifera]
            gi|731397640|ref|XP_010652947.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
            gi|731397642|ref|XP_010652948.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
            gi|731397644|ref|XP_010652949.1| PREDICTED: filament-like
            plant protein [Vitis vinifera]
          Length = 672

 Score =  531 bits (1369), Expect = e-148
 Identities = 316/600 (52%), Positives = 408/600 (68%), Gaps = 19/600 (3%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSDDQ   N N+ SPEV SK+AP DE++N+SVK+L+EKLS ALLNI AKEDLVKQHAK
Sbjct: 31   ERFSDDQVYPNQNSPSPEVTSKSAPVDEEVNDSVKSLTEKLSAALLNISAKEDLVKQHAK 90

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AENEV  LK+Q EA  QKNS LE+RVGHLDGALK              Q
Sbjct: 91   VAEEAVSGWEKAENEVFSLKQQLEAAAQKNSALEDRVGHLDGALKECLRQLRQAREEQEQ 150

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTIL 1566
            KI++A  +++ EWESTKSEL++Q+VE+++QLQ+AK E           KL   EKEN  L
Sbjct: 151  KIHEAVVKRTHEWESTKSELESQIVEIQAQLQTAKAETVATVDPGLELKLGAAEKENAAL 210

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K +L S+ EEL++ T E++LST+AAE ASKQ+L+SIKK AKLEAECRRLKA+A KAS AN
Sbjct: 211  KLQLLSREEELEIRTIEQELSTQAAETASKQNLESIKKVAKLEAECRRLKAMARKASSAN 270

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPESQV--LVAEL 1212
               S+T+S+V VES  DSQSDSG+RLL +  +  K+  L+  +C     +S    L+ EL
Sbjct: 271  DHKSITASSVCVESLTDSQSDSGERLLALEIDTRKMTGLDTNECEPSRSDSWASGLIQEL 330

Query: 1211 DQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGS-CPETGAL------GGKQ 1053
            D+FKNE+PL +N M  SVE+DLMDDFLEMERLAALPETE+ S C E+GA+      G + 
Sbjct: 331  DRFKNEKPLVKNLMAPSVELDLMDDFLEMERLAALPETENRSRCLESGAISDKHIGGSES 390

Query: 1052 PLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLET 873
            PLKA+L+ +I+RT             K+ L++ALSECQ QL+TS+ +LK  E KLV+L+T
Sbjct: 391  PLKAQLEAMIDRTAELEEKLEKMEAEKMELDMALSECQNQLETSQGRLKEVEEKLVELQT 450

Query: 872  QLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKT 693
            QLA+ +E+KR AE E++  N K        E AE  L+               VE+++KT
Sbjct: 451  QLALASESKRNAEEEIQTTNAK-------REVAESRLI--------------AVEAEIKT 489

Query: 692  SNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLD 513
                                 + S + +LEEEV+KER +S EA ++C+  E+E+SRMK +
Sbjct: 490  ---------------------MLSKVLSLEEEVEKERALSAEAASKCRKFEDELSRMKRE 528

Query: 512  SQLQRSAVIEG-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDSEQPIE 336
            ++L+  A   G  KI Q+KELAVAASK  ECQKTIASL RQLKSLATL+D L+DSE+P++
Sbjct: 529  TELRNLASSNGELKIKQEKELAVAASKLAECQKTIASLGRQLKSLATLEDLLLDSEKPLQ 588


>emb|CAN83687.1| hypothetical protein VITISV_031800 [Vitis vinifera]
          Length = 749

 Score =  518 bits (1333), Expect = e-144
 Identities = 311/590 (52%), Positives = 400/590 (67%), Gaps = 19/590 (3%)
 Frame = -1

Query: 2048 NHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAKVAEEAVSGWE 1869
            N N+ SPEV SKAAP DE++N+SVK+L+EKLS ALLNI AKEDLVKQHAKVAEEAVSGWE
Sbjct: 18   NQNSPSPEVTSKAAPVDEEVNDSVKSLTEKLSAALLNISAKEDLVKQHAKVAEEAVSGWE 77

Query: 1868 RAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQKIYDADSQKS 1689
            +AENEV  LK+Q EA  QKNS LE+RVGHLDGALK              QKI++A  +++
Sbjct: 78   KAENEVFSLKQQLEAXXQKNSXLEDRVGHLDGALKECLRQLRQAREEQEQKIHEAVVKRT 137

Query: 1688 DEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTILKRELHSQAEE 1536
             EWESTKSEL++Q+VE+++QLQ+AK E           KL   EKEN  LK +L S+ EE
Sbjct: 138  HEWESTKSELESQIVEIQAQLQTAKAEXVATVDPGLELKLGAAEKENAALKLQLLSREEE 197

Query: 1535 LKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPANYLPSVTSSTV 1356
            L++ T E++LST+AAE ASKQ+L+SIKK AKLEAECRRLKA+A KAS AN   S T+S+V
Sbjct: 198  LEIRTIEQELSTQAAETASKQNLESIKKVAKLEAECRRLKAMARKASSANDHKSXTASSV 257

Query: 1355 YVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPESQV--LVAELDQFKNERPLG 1182
             VES  DSQSDSG+RLL +  +  K+  L+  +C     +S    L+ ELD+FKNE+PL 
Sbjct: 258  CVESLTDSQSDSGERLLALEIDTRKMTGLDTNECEPSRSDSWASGLIQELDRFKNEKPLV 317

Query: 1181 RNQMVSSVEIDLMDDFLEMERLAALPETESGS-CPETGAL------GGKQPLKAELQTLI 1023
            +N M  SVE DLMDDFLEMERLAALPETE+ S C E+GA+      G + PLKA+L+ +I
Sbjct: 318  KNLMAPSVEXDLMDDFLEMERLAALPETENRSRCLESGAISDKHIGGSESPLKAQLEAMI 377

Query: 1022 NRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQLAMVNEAKR 843
            +RT             K+ L++ALSECQ QL+TS+ +LK  E KLV+L+TQLA+ +E+KR
Sbjct: 378  DRTAELEEKLEKMEAEKMELDMALSECQNQLETSQGRLKEVEEKLVELQTQLALASESKR 437

Query: 842  VAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTSNVKKQXXXX 663
             AE E++A N K        E AE  L+               VE+++KT          
Sbjct: 438  NAEEEIQATNAK-------REVAESRLI--------------XVEAEIKT---------- 466

Query: 662  XXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDSQLQRSAVIE 483
                       + S + +LEEEV+KER +S EA ++C+  E+E+SRMK +++L+  A   
Sbjct: 467  -----------MLSKVLSLEEEVEKERALSAEAASKCRKFEDELSRMKRETELRNLASSN 515

Query: 482  G-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDSEQPIE 336
            G  KI Q+KELAVAASK  ECQKTIASL RQLKSLATL+D L+DSE+P++
Sbjct: 516  GELKIKQEKELAVAASKLAECQKTIASLGRQLKSLATLEDLLLDSEKPLQ 565


>emb|CDP04584.1| unnamed protein product [Coffea canephora]
          Length = 661

 Score =  516 bits (1328), Expect = e-143
 Identities = 309/636 (48%), Positives = 400/636 (62%), Gaps = 54/636 (8%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAA-PSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHA 1902
            ERFSDDQA  N+N  SPEV SKA  PS E+L++++K LS+KLSEAL+N+RAKEDLVKQHA
Sbjct: 32   ERFSDDQALLNNNIQSPEVTSKATTPSIEELHDNMKALSDKLSEALVNLRAKEDLVKQHA 91

Query: 1901 KVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXX 1722
            KVAEEAVSGWE+AE EVLVLK++ EAL Q+N  LEER+G+LD ALK              
Sbjct: 92   KVAEEAVSGWEKAEAEVLVLKRRAEALTQENLALEERIGNLDSALKECLRQLRQAKEEQE 151

Query: 1721 QKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTI 1569
            QKI +A + K  EWES K+EL+NQLV L+++LQ+A+TE           KLE  E +N +
Sbjct: 152  QKINEAVAHKIVEWESAKTELENQLVNLQTKLQNAETEAVTSTFPDLCIKLEAAENKNAV 211

Query: 1568 LKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPA 1389
            LK EL S+ +ELKL T ERDL   AAE ASKQHL+SIKK  +LEAECRRLK +  K +  
Sbjct: 212  LKLELLSKDKELKLRTSERDLIVHAAETASKQHLESIKKVVRLEAECRRLKMLNRKGATV 271

Query: 1388 NYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKIN--ELEPVDCGFCHPESQVLVAE 1215
            N   S+ S       F DS SD G+RL  +  E+CK++  EL   + G     + + ++E
Sbjct: 272  NDHRSLAS-------FTDSLSDCGERLSAVDNESCKMSGLELNDYEPGLSDLSASISISE 324

Query: 1214 LDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALGGKQPLKAEL 1035
            LDQFKNE+PLGRN MV S E+ LM+DFLEMERLAALPE E  SCP++G+      LK EL
Sbjct: 325  LDQFKNEKPLGRNFMVPSDELHLMNDFLEMERLAALPEAEEESCPDSGSDNRVNILKTEL 384

Query: 1034 QTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKL----------- 888
            + +INRT             K+ L+++L+ECQ QL+ SR QL+ TE KL           
Sbjct: 385  EAMINRTAELEEKLEKMEEEKVQLKLSLTECQHQLEASRYQLEETETKLTELRIQLVMAN 444

Query: 887  -------------------------------VDLETQLAMVNEAKRVAELEVEAANEKLT 801
                                           +DL+++L+M NEAK  AE++V+A +EKL 
Sbjct: 445  EGRKTVEAEVESTNKQLEKFMEEIAKAEVTILDLKSELSMANEAKSAAEMDVKATSEKLM 504

Query: 800  KSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTSNVKKQXXXXXXXXXXXXXXXLHS 621
            KSTK LEE E++L E   +L  AN+     +++L+ +N+KK+               L S
Sbjct: 505  KSTKLLEETEINLSEVSAQLANANKSNKKRDAELEATNIKKEVAESRVKALELELQMLRS 564

Query: 620  SIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDSQLQRSAVIEGFKINQDKELAVAA 441
            SI  LEE+++KER +S EA A CQ L  EI ++K  SQL ++A     KINQ+KELA+AA
Sbjct: 565  SICNLEEDIQKERALSDEAFANCQKLNAEILQLKSKSQLWKAATTGEVKINQEKELALAA 624

Query: 440  SKFTECQKTIASLSRQLKSLATLDDFLIDSEQPIEL 333
            SKF ECQKTIASL +QL SLA  +DF IDS  PIE+
Sbjct: 625  SKFAECQKTIASLGQQLSSLAKFEDFFIDSGIPIEI 660


>ref|XP_010093113.1| hypothetical protein L484_007922 [Morus notabilis]
            gi|587863800|gb|EXB53551.1| hypothetical protein
            L484_007922 [Morus notabilis]
          Length = 643

 Score =  505 bits (1301), Expect = e-140
 Identities = 306/609 (50%), Positives = 396/609 (65%), Gaps = 27/609 (4%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSDDQA +  +T SPEV+SKAAP+DE  NESVKTL++KLS AL +I AKEDLVKQHAK
Sbjct: 31   ERFSDDQAYATQSTQSPEVMSKAAPNDEYSNESVKTLTDKLSAALRSISAKEDLVKQHAK 90

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE AENEVL+LK++ EA NQKNSVLE+R+GHLDGALK              Q
Sbjct: 91   VAEEAVSGWENAENEVLILKQKLEAANQKNSVLEDRLGHLDGALKECVRQLRQAREEQEQ 150

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEV---------TKLETMEKENTIL 1566
            KI+DA ++K+ EWES KS LQ+QL+EL+ +LQ+ KTE           KLE  EK+N+ L
Sbjct: 151  KIHDAVAKKTHEWESLKSLLQSQLLELQVELQNVKTEAAAPIDSDLQAKLEAAEKQNSAL 210

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K EL S+AEEL++   ERDLST+AAE ASKQHL+SIKK AKLEAECRRLKA+A K S  N
Sbjct: 211  KLELLSKAEELEIRIIERDLSTKAAETASKQHLESIKKVAKLEAECRRLKAMARKVSQVN 270

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCK-----INELEPVDCGFCHPESQVLV 1221
               S +SS+VYVES  DSQSDSG+RLL I     K     +NE EP D G C   +  LV
Sbjct: 271  NQKSGSSSSVYVESLTDSQSDSGERLLTIESGTLKMGSLELNECEPSDSGSC---ASSLV 327

Query: 1220 AELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALP--ETESGSCPETGA----LGG 1059
             E  QF+NE+ +G+N+MV S+EI+LMDDFLEMERLAALP  + ESG      A    +GG
Sbjct: 328  TE-HQFRNEKIIGKNRMVPSIEINLMDDFLEMERLAALPVRDIESGFTVAGSASHQPIGG 386

Query: 1058 KQPLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDL 879
            +   K +L  +I R              K+ LE+ALS C+K L+TS+ QL V E +L +L
Sbjct: 387  ESRFKTKLDAMIQRIAELEDKLEKIEMEKVELEVALSLCEKHLETSQSQLLVAEKRLKEL 446

Query: 878  ETQLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKL 699
            + QL + NE+KR AE E     E+ T++ + L E+++ +VE++   ++            
Sbjct: 447  QKQLVLANESKRAAEEE-----ERATRTKQELAESQLRVVENEINALL------------ 489

Query: 698  KTSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMK 519
                                     S IG+LEEEV+KER +S + VARCQ +ENE+  +K
Sbjct: 490  -------------------------SKIGSLEEEVQKERALSADNVARCQKMENELLIVK 524

Query: 518  LDSQLQRSAVIE-------GFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFL 360
             +++ ++ A +E         KI Q+KEL++AA KF ECQKTIASL +QLKSLA+L+D L
Sbjct: 525  REAENKQEAELERIQSANVNLKIKQEKELSLAADKFAECQKTIASLGQQLKSLASLEDVL 584

Query: 359  IDSEQPIEL 333
            +D E+  E+
Sbjct: 585  LDPEKQQEI 593


>ref|XP_007048554.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508700815|gb|EOX92711.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 649

 Score =  505 bits (1301), Expect = e-140
 Identities = 296/607 (48%), Positives = 395/607 (65%), Gaps = 24/607 (3%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSD+QA + H++ S EV SKA P DE++N++VK+L+EKLS AL+NI AKEDLVKQHAK
Sbjct: 32   ERFSDEQAGATHSSLSLEVTSKAVPMDEEVNDNVKSLTEKLSAALINISAKEDLVKQHAK 91

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AE +VL LK+Q +A  +K + LE+RVGHLDGALK              +
Sbjct: 92   VAEEAVSGWEKAEKDVLALKQQLDAAIKKTAALEDRVGHLDGALKECVRQLRQAREEQER 151

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTIL 1566
            +I++A ++K  EWES+KSEL++QLV+L++QLQ+ K+E           KLE  EKEN+ L
Sbjct: 152  RIHEAVAKKCHEWESSKSELESQLVDLKAQLQTTKSETAASVDPDLHPKLEAFEKENSAL 211

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K +L S+AEEL+L   ERDLST+AAE ASKQHL+SIKK AKLEAECR+LK +A KASPAN
Sbjct: 212  KLQLLSRAEELQLRIIERDLSTQAAETASKQHLESIKKLAKLEAECRKLKVIARKASPAN 271

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPES--QVLVAEL 1212
               S  +S++ V+SF DSQSDSGDRLL +     K++ LE  +C     ES    L+ EL
Sbjct: 272  DQKSYAASSICVDSFTDSQSDSGDRLLAVETNMRKMSGLEMNECETSRSESWTSALITEL 331

Query: 1211 DQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALGGKQ------P 1050
            DQF+NE+ +GRN M  SVEI+LMDDFLEMERLAALP+TES +      L   Q      P
Sbjct: 332  DQFRNEKAVGRNIMAPSVEINLMDDFLEMERLAALPDTESATGFNEAGLVSDQTSTVENP 391

Query: 1049 LKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQ 870
            LKAE++T I+R              K+ L++A +E QKQL+T + QL+  E KL DL+TQ
Sbjct: 392  LKAEVETFIHRIAELEGKLAMTEAEKLELKLAFTESQKQLETLQNQLREAETKLADLQTQ 451

Query: 869  LAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTS 690
            LA+ + +K+ AE EV+                            VAN  + V ES+ + +
Sbjct: 452  LALADNSKQAAEDEVK----------------------------VANMNREVAESRFRDA 483

Query: 689  NVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDS 510
             ++ +                 S + +LEEEV +E+ +S   V++C+ LE+E+S++K ++
Sbjct: 484  EIEVKTLL--------------SKVTSLEEEVGREQALSARNVSKCKELEDELSKLKREA 529

Query: 509  QLQRSA-------VIEGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
            +L+  A         E  K  QDKELA+AASK  ECQKTIASL RQLKSLATLDDFLID 
Sbjct: 530  ELRLDAERQLVASYNEELKAQQDKELAIAASKLAECQKTIASLGRQLKSLATLDDFLIDP 589

Query: 350  EQPIELL 330
            ++P+EL+
Sbjct: 590  DKPLELV 596


>ref|XP_007048553.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700814|gb|EOX92710.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 675

 Score =  505 bits (1301), Expect = e-140
 Identities = 296/607 (48%), Positives = 395/607 (65%), Gaps = 24/607 (3%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSD+QA + H++ S EV SKA P DE++N++VK+L+EKLS AL+NI AKEDLVKQHAK
Sbjct: 32   ERFSDEQAGATHSSLSLEVTSKAVPMDEEVNDNVKSLTEKLSAALINISAKEDLVKQHAK 91

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AE +VL LK+Q +A  +K + LE+RVGHLDGALK              +
Sbjct: 92   VAEEAVSGWEKAEKDVLALKQQLDAAIKKTAALEDRVGHLDGALKECVRQLRQAREEQER 151

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTIL 1566
            +I++A ++K  EWES+KSEL++QLV+L++QLQ+ K+E           KLE  EKEN+ L
Sbjct: 152  RIHEAVAKKCHEWESSKSELESQLVDLKAQLQTTKSETAASVDPDLHPKLEAFEKENSAL 211

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K +L S+AEEL+L   ERDLST+AAE ASKQHL+SIKK AKLEAECR+LK +A KASPAN
Sbjct: 212  KLQLLSRAEELQLRIIERDLSTQAAETASKQHLESIKKLAKLEAECRKLKVIARKASPAN 271

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPES--QVLVAEL 1212
               S  +S++ V+SF DSQSDSGDRLL +     K++ LE  +C     ES    L+ EL
Sbjct: 272  DQKSYAASSICVDSFTDSQSDSGDRLLAVETNMRKMSGLEMNECETSRSESWTSALITEL 331

Query: 1211 DQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALGGKQ------P 1050
            DQF+NE+ +GRN M  SVEI+LMDDFLEMERLAALP+TES +      L   Q      P
Sbjct: 332  DQFRNEKAVGRNIMAPSVEINLMDDFLEMERLAALPDTESATGFNEAGLVSDQTSTVENP 391

Query: 1049 LKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQ 870
            LKAE++T I+R              K+ L++A +E QKQL+T + QL+  E KL DL+TQ
Sbjct: 392  LKAEVETFIHRIAELEGKLAMTEAEKLELKLAFTESQKQLETLQNQLREAETKLADLQTQ 451

Query: 869  LAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTS 690
            LA+ + +K+ AE EV+                            VAN  + V ES+ + +
Sbjct: 452  LALADNSKQAAEDEVK----------------------------VANMNREVAESRFRDA 483

Query: 689  NVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDS 510
             ++ +                 S + +LEEEV +E+ +S   V++C+ LE+E+S++K ++
Sbjct: 484  EIEVKTLL--------------SKVTSLEEEVGREQALSARNVSKCKELEDELSKLKREA 529

Query: 509  QLQRSA-------VIEGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
            +L+  A         E  K  QDKELA+AASK  ECQKTIASL RQLKSLATLDDFLID 
Sbjct: 530  ELRLDAERQLVASYNEELKAQQDKELAIAASKLAECQKTIASLGRQLKSLATLDDFLIDP 589

Query: 350  EQPIELL 330
            ++P+EL+
Sbjct: 590  DKPLELV 596


>gb|KDO52203.1| hypothetical protein CISIN_1g006305mg [Citrus sinensis]
          Length = 651

 Score =  502 bits (1293), Expect = e-139
 Identities = 301/605 (49%), Positives = 400/605 (66%), Gaps = 24/605 (3%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSDDQ    H++ S E  SKA P DE +N+SVKTL+EKLS ALLN+ AKEDLVKQHAK
Sbjct: 32   ERFSDDQT---HSSQSSEATSKAPPLDEVVNDSVKTLTEKLSAALLNVSAKEDLVKQHAK 88

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AENE+  LK+Q +A +QKNS LE RV HLDGALK              Q
Sbjct: 89   VAEEAVSGWEKAENELSTLKQQLKAASQKNSALENRVSHLDGALKECVRQLRQAREEQEQ 148

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEV---------TKLETMEKENTIL 1566
            +I +  S+++ EWES KSEL+++LV+L+ +LQ+AK+E          +KLE  EK+N+ L
Sbjct: 149  RIQETVSKQNLEWESKKSELESKLVDLQKKLQTAKSEAAASADRDLCSKLEAAEKQNSAL 208

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K EL S  +EL+L   ERDLST+AAE ASKQHL+SIKK AK+EAEC RLKAV  KASP  
Sbjct: 209  KLELLSLVKELELRIVERDLSTKAAETASKQHLESIKKLAKVEAECLRLKAVVRKASPNT 268

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPESQVLVAELDQ 1206
               S T S++YV SF DSQSD+G+RLL    +NCKI++ E  +C      S      ++ 
Sbjct: 269  ENKSFTPSSIYVGSFTDSQSDNGERLLGNETDNCKISDSEVNECEPNSSTSWASALAIEP 328

Query: 1205 FKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGS-CPETGAL-----GGKQPLK 1044
             KN + +GRN MV SV+I+LMDDFLEMERLAALP+TES S C E G         +  +K
Sbjct: 329  DKNVKAVGRNVMVPSVDINLMDDFLEMERLAALPDTESRSFCVEVGPASDQPNADESSIK 388

Query: 1043 AELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQLA 864
            AEL+ LI+RT             K  LE+ L E Q++L+TS+ QLK  ELKL +LETQLA
Sbjct: 389  AELEVLIHRTAELEEELENMRAEKSELEMDLKESQRRLETSQNQLKEAELKLEELETQLA 448

Query: 863  MVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQ--NRLIVANEEKSVVESKLKTS 690
              N++K+  E+E++AA      + + + E+++S+VE +   +L +AN+ K   E ++K++
Sbjct: 449  FANKSKQAVEVEMKAA-----IAARGVAESKLSVVEAEMKTQLALANKSKQAAEEEVKSA 503

Query: 689  NVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDS 510
              KK+               L S + +LE+EV+KER +S E +A  Q  ++E+S++K + 
Sbjct: 504  KSKKEAAESRLRAVEAEMETLRSKVISLEDEVEKERALSEENIANFQKSKDELSKVKQEI 563

Query: 509  QLQRSAVI-------EGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
            +LQ    +       +  KINQ++ELAVAASKF ECQKTIASL RQL+SL TLDDFLIDS
Sbjct: 564  ELQHEVKLQYLAGSNQELKINQEEELAVAASKFAECQKTIASLGRQLRSLVTLDDFLIDS 623

Query: 350  EQPIE 336
            E+P+E
Sbjct: 624  EKPLE 628


>ref|XP_010242807.1| PREDICTED: filament-like plant protein isoform X2 [Nelumbo nucifera]
          Length = 678

 Score =  491 bits (1265), Expect = e-136
 Identities = 298/601 (49%), Positives = 389/601 (64%), Gaps = 19/601 (3%)
 Frame = -1

Query: 2078 ERFSDDQ----ASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVK 1911
            ER SDDQ    AS NHNT SPE+ SK   S E++N++VK+L++KL+ AL NI AKEDLVK
Sbjct: 31   ERCSDDQETSRASPNHNTLSPEITSKVTASSEEVNDNVKSLTDKLAAALSNISAKEDLVK 90

Query: 1910 QHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXX 1731
            QHAKVAEEAVSGWE+AENEV+ LK++ E+  QKNS LE+RV HLDGALK           
Sbjct: 91   QHAKVAEEAVSGWEKAENEVVALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQARE 150

Query: 1730 XXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEV-------TKLETMEKENT 1572
               QKI++A  +K+ EWES K EL++Q+V L+SQ+++AK E        +KLE+ EK+N 
Sbjct: 151  EQEQKIHEAVVEKTKEWESVKLELESQVVNLQSQVEAAKLEAAANSDLCSKLESAEKKNA 210

Query: 1571 ILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASP 1392
             LK EL S+ EEL++ T ERDLST+ AE ASKQHL+SIKK AKLEAECRRL+A++ KA  
Sbjct: 211  ALKLELLSRVEELEIRTLERDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAPS 270

Query: 1391 ANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPES--QVLVA 1218
            AN   SVT+S+ YVES  DSQSDSG+RLL +  +  K++ +E  D    + +S    L+A
Sbjct: 271  ANDHRSVTASSFYVESLTDSQSDSGERLLGMEIDTHKMSSMELNDGEASYSDSWASALIA 330

Query: 1217 ELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGAL-----GGKQ 1053
            ELDQFK ++ +GRN   SSVEIDLMDDFLEMERLAALPETESG  PE  A+      G+ 
Sbjct: 331  ELDQFKQDKAIGRNLTTSSVEIDLMDDFLEMERLAALPETESGD-PEPVAVPDQIDRGES 389

Query: 1052 PLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLET 873
             LKAEL+T+I R+             K  L IAL+E Q QL+ S  QLK  E KLV+L+ 
Sbjct: 390  SLKAELETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVELQR 449

Query: 872  QLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKT 693
             L + N  K+  E ++E                              N +K V+ES+L  
Sbjct: 450  CLDLANNLKQTTEEKLE----------------------------TINTQKEVIESRLVG 481

Query: 692  SNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLD 513
            ++ +                 L   +G+LE E++KER +S E V +C+ LE+E+++ K +
Sbjct: 482  ADAE--------------IRALRGKVGSLESEIEKERTLSEEIVVKCRKLEDELTKKKHE 527

Query: 512  SQLQRSAVIEG-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDSEQPIE 336
            ++L R++   G  KI Q+KELAVAA K TECQKTIASL RQLKSLATL+DFLID E+P++
Sbjct: 528  AELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLGRQLKSLATLEDFLIDYEKPLD 587

Query: 335  L 333
            L
Sbjct: 588  L 588


>gb|KHG29921.1| Filament-like plant protein [Gossypium arboreum]
          Length = 679

 Score =  491 bits (1264), Expect = e-136
 Identities = 291/607 (47%), Positives = 391/607 (64%), Gaps = 24/607 (3%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSD+QAS+  ++ S EV SKA P DE+ N +V++L+EKLS AL+NI AKE+LVKQHAK
Sbjct: 32   ERFSDEQASATISSQSLEVTSKAVPVDEESN-NVRSLTEKLSTALMNISAKEELVKQHAK 90

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AE +V+ LK+Q +A  +KN+ LE+RVGHLDGALK              +
Sbjct: 91   VAEEAVSGWEKAEKDVVALKQQLDAAMKKNAALEDRVGHLDGALKECVRQLRQAREEQER 150

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTIL 1566
            KI++A S+K  EWES+KSEL++QL+ L++QL++AK++           KL+  EKEN+ L
Sbjct: 151  KIHEAVSKKCHEWESSKSELESQLLNLKAQLETAKSDAAASVDPDLQLKLDACEKENSAL 210

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K +LHS+AEEL+    ERDLST+AAE ASKQHLDSIKK AKLE ECRRLKA+A KASPAN
Sbjct: 211  KLQLHSRAEELERRIIERDLSTQAAETASKQHLDSIKKLAKLEIECRRLKAIARKASPAN 270

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPE--SQVLVAEL 1212
               S T+S++ VESF DSQSDSG+RLL +  +  K+N LE   C     +  +  L+ EL
Sbjct: 271  DQKSYTASSICVESFTDSQSDSGERLLAVETDMQKMNGLEMNGCDRSRSDAWASALITEL 330

Query: 1211 DQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGS-CPETGALG-----GKQP 1050
            DQF+ E+ +GRN M  SVEI+LMDDFLEMERLAALP+TESGS   + G +       + P
Sbjct: 331  DQFRKEKAVGRNIMAPSVEINLMDDFLEMERLAALPDTESGSGFNDAGPVSYQNSIVENP 390

Query: 1049 LKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQ 870
            LKA+L+TL++R              K  ++IA +E QKQLKT + QL   E++  D++TQ
Sbjct: 391  LKADLETLVHRVAELEEKLALTEEEKSEMQIAFTESQKQLKTLQNQLSEAEIRFKDVQTQ 450

Query: 869  LAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTS 690
            LA+ + +K+ AE EV+ AN     +   L +AE  +    ++ + + EE    E  L T 
Sbjct: 451  LALADNSKQAAEKEVKVANMNRQVAESRLRDAETEIKTLMSK-VTSLEEAFGKEQALSTE 509

Query: 689  NVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDS 510
            N+ K                                         C+ LENE+S+MK ++
Sbjct: 510  NMNK-----------------------------------------CKELENELSKMKCET 528

Query: 509  QLQRSAVI-------EGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
            +L+R A +       E  K+ QDKEL++AA KF ECQKTIASL +QLKSLATL+DFLIDS
Sbjct: 529  KLRREAELQHAAKYNEELKVQQDKELSIAARKFAECQKTIASLGQQLKSLATLEDFLIDS 588

Query: 350  EQPIELL 330
            ++P+EL+
Sbjct: 589  DKPLELV 595


>ref|XP_010242801.1| PREDICTED: filament-like plant protein isoform X1 [Nelumbo nucifera]
            gi|720083139|ref|XP_010242802.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083142|ref|XP_010242803.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083146|ref|XP_010242804.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083149|ref|XP_010242805.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
            gi|720083152|ref|XP_010242806.1| PREDICTED: filament-like
            plant protein isoform X1 [Nelumbo nucifera]
          Length = 679

 Score =  491 bits (1264), Expect = e-136
 Identities = 298/602 (49%), Positives = 389/602 (64%), Gaps = 20/602 (3%)
 Frame = -1

Query: 2078 ERFSDDQ-----ASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLV 1914
            ER SDDQ     AS NHNT SPE+ SK   S E++N++VK+L++KL+ AL NI AKEDLV
Sbjct: 31   ERCSDDQQETSRASPNHNTLSPEITSKVTASSEEVNDNVKSLTDKLAAALSNISAKEDLV 90

Query: 1913 KQHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXX 1734
            KQHAKVAEEAVSGWE+AENEV+ LK++ E+  QKNS LE+RV HLDGALK          
Sbjct: 91   KQHAKVAEEAVSGWEKAENEVVALKQKLESATQKNSTLEDRVSHLDGALKECVRQLRQAR 150

Query: 1733 XXXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEV-------TKLETMEKEN 1575
                QKI++A  +K+ EWES K EL++Q+V L+SQ+++AK E        +KLE+ EK+N
Sbjct: 151  EEQEQKIHEAVVEKTKEWESVKLELESQVVNLQSQVEAAKLEAAANSDLCSKLESAEKKN 210

Query: 1574 TILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKAS 1395
              LK EL S+ EEL++ T ERDLST+ AE ASKQHL+SIKK AKLEAECRRL+A++ KA 
Sbjct: 211  AALKLELLSRVEELEIRTLERDLSTQTAETASKQHLESIKKVAKLEAECRRLRAMSRKAP 270

Query: 1394 PANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPES--QVLV 1221
             AN   SVT+S+ YVES  DSQSDSG+RLL +  +  K++ +E  D    + +S    L+
Sbjct: 271  SANDHRSVTASSFYVESLTDSQSDSGERLLGMEIDTHKMSSMELNDGEASYSDSWASALI 330

Query: 1220 AELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGAL-----GGK 1056
            AELDQFK ++ +GRN   SSVEIDLMDDFLEMERLAALPETESG  PE  A+      G+
Sbjct: 331  AELDQFKQDKAIGRNLTTSSVEIDLMDDFLEMERLAALPETESGD-PEPVAVPDQIDRGE 389

Query: 1055 QPLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLE 876
              LKAEL+T+I R+             K  L IAL+E Q QL+ S  QLK  E KLV+L+
Sbjct: 390  SSLKAELETMIQRSVELEEKLEKLEEEKAQLNIALAETQSQLEMSNNQLKTAEEKLVELQ 449

Query: 875  TQLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLK 696
              L + N  K+  E ++E                              N +K V+ES+L 
Sbjct: 450  RCLDLANNLKQTTEEKLE----------------------------TINTQKEVIESRLV 481

Query: 695  TSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKL 516
             ++ +                 L   +G+LE E++KER +S E V +C+ LE+E+++ K 
Sbjct: 482  GADAE--------------IRALRGKVGSLESEIEKERTLSEEIVVKCRKLEDELTKKKH 527

Query: 515  DSQLQRSAVIEG-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDSEQPI 339
            +++L R++   G  KI Q+KELAVAA K TECQKTIASL RQLKSLATL+DFLID E+P+
Sbjct: 528  EAELWRASRSNGELKIKQEKELAVAAGKLTECQKTIASLGRQLKSLATLEDFLIDYEKPL 587

Query: 338  EL 333
            +L
Sbjct: 588  DL 589


>ref|XP_006432073.1| hypothetical protein CICLE_v10000549mg [Citrus clementina]
            gi|568820911|ref|XP_006464943.1| PREDICTED: filament-like
            plant protein 3-like [Citrus sinensis]
            gi|557534195|gb|ESR45313.1| hypothetical protein
            CICLE_v10000549mg [Citrus clementina]
          Length = 647

 Score =  491 bits (1264), Expect = e-136
 Identities = 301/606 (49%), Positives = 399/606 (65%), Gaps = 25/606 (4%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSDDQ    H++ S E  SKA P DE +N+SVKTL+EKLS ALLN+ AKEDLVKQHAK
Sbjct: 32   ERFSDDQT---HSSQSSEATSKAPPLDEVVNDSVKTLTEKLSAALLNVSAKEDLVKQHAK 88

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AENE+  LK+Q +A +QKNS LE RV HLDGALK              Q
Sbjct: 89   VAEEAVSGWEKAENELSTLKQQLKAASQKNSALENRVSHLDGALKECVRQLRQAREEQEQ 148

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEV---------TKLETMEKENTIL 1566
            +I +  S+++ EWES KSEL+++LV+L+ +LQ+AK+E          +KLE  EK+N+ L
Sbjct: 149  RIQETVSKQNLEWESKKSELESKLVDLQKKLQTAKSEAAASADRDLRSKLEAAEKQNSAL 208

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K EL S+ +EL+L   ERDLST+AAE ASKQHL+SIKK AK+EAEC RLKAV  KASP  
Sbjct: 209  KLELLSRVKELELRIVERDLSTKAAETASKQHLESIKKLAKVEAECLRLKAVVRKASPNT 268

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPESQVLVAELDQ 1206
               S T S++YV SF DSQSD+G R L    +NCKI++ E  +   C P S    A    
Sbjct: 269  ENKSFTPSSIYVGSFTDSQSDNGKRPLGNETDNCKISDSEVNE---CEPNSSTSWASALA 325

Query: 1205 FKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGS-CPETGALGGKQP------L 1047
              + + +GRN MV SV+I+LMDDFLEMERLAALP+TES S C E G     QP      +
Sbjct: 326  I-DVKAVGRNVMVPSVDINLMDDFLEMERLAALPDTESRSFCVEVGP-ASDQPNADETSI 383

Query: 1046 KAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQL 867
            KAEL+ LI+RT             K  LE+ L E Q++L+TS+ QLK  ELKL +LETQL
Sbjct: 384  KAELEVLIHRTAELEEELENMREEKSELEMDLKESQRRLETSQNQLKEAELKLEELETQL 443

Query: 866  AMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQ--NRLIVANEEKSVVESKLKT 693
            A  N++K+  E++++AA      + + + E+++S+VE +   +L +AN+ K   E ++K+
Sbjct: 444  AFANKSKQAVEVKMKAA-----IAARGVAESKLSVVEAEMKTQLALANKSKQAAEEEVKS 498

Query: 692  SNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLD 513
            +  KK+               L S + +LE+EV+KER +S E +A  Q  ++E+S++K +
Sbjct: 499  AKSKKEAAESRLRAVEAEMETLRSKVISLEDEVEKERALSEENIANFQKSKDELSKVKQE 558

Query: 512  SQLQRSAVI-------EGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLID 354
             +LQ    +       +  KINQ++ELAVAASKF ECQKTIASL RQL+SL TLDDFLID
Sbjct: 559  IELQHEVKLQYLAGSNQELKINQEEELAVAASKFAECQKTIASLGRQLRSLVTLDDFLID 618

Query: 353  SEQPIE 336
            SE+P+E
Sbjct: 619  SEKPLE 624


>ref|XP_012455840.1| PREDICTED: filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246344|ref|XP_012455841.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246346|ref|XP_012455843.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246348|ref|XP_012455844.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|823246350|ref|XP_012455845.1| PREDICTED:
            filament-like plant protein isoform X1 [Gossypium
            raimondii] gi|763804407|gb|KJB71345.1| hypothetical
            protein B456_011G117600 [Gossypium raimondii]
          Length = 679

 Score =  486 bits (1251), Expect = e-134
 Identities = 288/607 (47%), Positives = 389/607 (64%), Gaps = 24/607 (3%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSD+QAS+  ++ S EV SKA P DE+ N +V++L+EKLS AL+NI AKE+LVKQHAK
Sbjct: 32   ERFSDEQASATISSQSLEVTSKAVPVDEE-NNNVRSLTEKLSAALMNISAKEELVKQHAK 90

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AE +V+ LK+Q +A  +KN+ LE+RVGHLDGALK              +
Sbjct: 91   VAEEAVSGWEKAEKDVVALKQQLDAAMKKNAALEDRVGHLDGALKECVRQLRQAREEQER 150

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKENTIL 1566
            KI++A S+K  EWES+KSEL++QL+ L++QL++AK +           KL+  EKEN+ L
Sbjct: 151  KIHEAVSKKCHEWESSKSELESQLLNLKAQLETAKNDTAASVDPDLQLKLDAFEKENSAL 210

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            K +LHS+AEEL+    ERDLST+AAE ASKQHL+SIKK AKLE ECRRLKA+A KASPAN
Sbjct: 211  KLQLHSRAEELERRIIERDLSTQAAETASKQHLESIKKLAKLEIECRRLKAIARKASPAN 270

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPE--SQVLVAEL 1212
               S  +S++ VESF DSQSDSG+RLL +  +  K+N LE   C     +  +  L+ EL
Sbjct: 271  DQKSYPASSICVESFTDSQSDSGERLLAVETDMQKMNGLEMNGCDRSSSDAWASALITEL 330

Query: 1211 DQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGS-CPETGALG-----GKQP 1050
            DQF+ E+ +GRN M  SVEI+LMDDFLEMERLAALP+TESGS   + G +       + P
Sbjct: 331  DQFRKEKAVGRNIMAPSVEINLMDDFLEMERLAALPDTESGSGFNDAGPVSYQTSIVENP 390

Query: 1049 LKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQ 870
            LKA+L+TL++R              K  ++IA +E QKQLKT + QL   E++  D++TQ
Sbjct: 391  LKADLETLVHRVAELEEKLALTEEEKSEMQIAFTESQKQLKTLQNQLSEAEIRFKDVQTQ 450

Query: 869  LAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTS 690
            LA+ + +K+ AE EV+ AN     +   L +AE  +    ++ + + EE    E  L T 
Sbjct: 451  LALADNSKQAAEKEVKVANMNREVAESRLRDAETEIKTLMSK-VTSLEEALGKEQALSTE 509

Query: 689  NVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDS 510
            N+ K                                         C+ LENE+S+MK ++
Sbjct: 510  NMNK-----------------------------------------CKELENELSKMKCET 528

Query: 509  QLQRSAVI-------EGFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
            +L++ A +       E  K+ QDKEL++AA KF ECQKTIASL +QLKSLATL+DFLIDS
Sbjct: 529  KLRQEAELQHAAKYNEELKVQQDKELSIAACKFAECQKTIASLGQQLKSLATLEDFLIDS 588

Query: 350  EQPIELL 330
            ++P+EL+
Sbjct: 589  DKPLELV 595


>ref|XP_009764915.1| PREDICTED: filament-like plant protein 3 isoform X2 [Nicotiana
            sylvestris] gi|698537731|ref|XP_009764916.1| PREDICTED:
            filament-like plant protein 3 isoform X2 [Nicotiana
            sylvestris]
          Length = 710

 Score =  481 bits (1239), Expect = e-133
 Identities = 301/673 (44%), Positives = 397/673 (58%), Gaps = 93/673 (13%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERF DDQA  NHNT SPEV SK APSDE+L+E+VKTLS KLSEAL+N+R KEDLVKQHAK
Sbjct: 36   ERFFDDQALQNHNTQSPEVTSKTAPSDEELSETVKTLSAKLSEALVNVREKEDLVKQHAK 95

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AE EVL+ K+  E  NQKNS+LEER+ HLDGALK              Q
Sbjct: 96   VAEEAVSGWEKAEGEVLIQKRLVETANQKNSILEERIKHLDGALKECLRQLRQAREEQAQ 155

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT-------KLETMEKENTILKR 1560
             +  A ++ S EWE  KSEL+N+LV+L+SQLQS+K E +       KLE  EK+N++LK 
Sbjct: 156  NVQVAVAKTSCEWEFKKSELENKLVQLQSQLQSSKAEDSNVQDLQHKLEYAEKQNSVLKL 215

Query: 1559 ELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPANYL 1380
            EL S +EELKLMT ERDLST AAE ASKQHL+SI K AKLEAECR LKA A K S  N  
Sbjct: 216  ELVSISEELKLMTSERDLSTHAAETASKQHLESITKVAKLEAECRMLKAFARKRSTVNDH 275

Query: 1379 PSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEP--VDCGFCHPESQVLVAELDQ 1206
             S  +S+ Y E  ADS SD+G+RL  +  ++CKI+ LEP   D    +  S  LV+EL+Q
Sbjct: 276  KSTAASSAYFEPSADSLSDTGERLSTVENDSCKISGLEPNNYDQNSSYFLSSALVSELNQ 335

Query: 1205 FKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALGGKQPLKAELQTL 1026
            +K E+P  R+ + SSVEI+LMDDFLEME+LAA P+T S      GA   +  LK EL+ +
Sbjct: 336  YKYEKPHRRDLIASSVEINLMDDFLEMEKLAARPDTVSEISNVRGAHISEPILKTELRAI 395

Query: 1025 INRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVT------------------ 900
            +++T             K+ LE  L+ECQ +LK S+EQLK T                  
Sbjct: 396  VSQT-AEAEKLAKMEVEKLKLEKELTECQDELKISKEQLKETKDNLIEVKAQLSMANEAR 454

Query: 899  ------------------------ELKLVDLETQLAMVNEAKRVAELEVEAANEKLTKST 792
                                    E ++V+L+ QL++ NE K+ AE EVE+AN +L    
Sbjct: 455  KKLEPEFKATITKLKDLTEQLQKMEAEIVELKAQLSVANEVKKKAEAEVESANTRLKNLV 514

Query: 791  KHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTSNVKKQXXXXXXXXXXXXXXXLHSSIG 612
            + LEEAEV   E Q +LI ANE K   E +++ +N+K +               L + + 
Sbjct: 515  ERLEEAEVDAAEFQAQLITANEAKRAAEVEVEATNLKLKKSEFRLEETEVKLLGLQTQLE 574

Query: 611  T---------------------LEEEVKKERGVSGEAVARCQTLENEISR---------- 525
            T                     +E ++K         V++  +L+ E+ +          
Sbjct: 575  TVKGMKSGVEAELEATNAKKDVVESQLKATELELQTLVSKVDSLQEELCKETALHQETAA 634

Query: 524  -----------MKLDSQLQRSAVIEGFKINQDKELAVAASKFTECQKTIASLSRQLKSLA 378
                       +K  SQL+++ ++E FKIN+DK++A+AAS+F ECQKTIAS+  QLKSLA
Sbjct: 635  KLQKLETDNSLIKSASQLRKATIVEEFKINKDKQMAIAASQFAECQKTIASIGWQLKSLA 694

Query: 377  TLDDFLIDSEQPI 339
            T+DDFL+DS +P+
Sbjct: 695  TMDDFLVDSGEPL 707


>ref|XP_008227685.1| PREDICTED: filament-like plant protein [Prunus mume]
            gi|645242775|ref|XP_008227686.1| PREDICTED: filament-like
            plant protein [Prunus mume]
            gi|645242777|ref|XP_008227687.1| PREDICTED: filament-like
            plant protein [Prunus mume]
          Length = 620

 Score =  471 bits (1212), Expect = e-129
 Identities = 291/619 (47%), Positives = 385/619 (62%), Gaps = 27/619 (4%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVKQHAK 1899
            ERFSDDQA+  H T  PEV SKA  +++  NESV+TL+EKLS AL N  AK+DLVKQHAK
Sbjct: 31   ERFSDDQANPTHTTLLPEVTSKAPCNEQKDNESVETLTEKLSAALRNSSAKDDLVKQHAK 90

Query: 1898 VAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXXXXXQ 1719
            VAEEAVSGWE+AENEVL LK+Q EA NQK S LE+RVGHLDGALK              Q
Sbjct: 91   VAEEAVSGWEKAENEVLGLKQQLEAANQKCSALEDRVGHLDGALKECVRQIRQAREEQDQ 150

Query: 1718 KIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEV---------TKLETMEKENTIL 1566
               +  + K+ EWES+KS LQ+QLV+L++QLQ+A TE          +KLE  EKEN+ L
Sbjct: 151  NTREVVAIKTREWESSKSMLQSQLVDLQAQLQTANTEAAASIDFDLRSKLEATEKENSAL 210

Query: 1565 KRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKASPAN 1386
            + +L S+ +EL++ T ERDLS +AAE ASKQ+L+SIK+ +KLEAECR LKA+  K  PAN
Sbjct: 211  QLKLLSRVKELEVRTIERDLSAQAAETASKQYLESIKRVSKLEAECRMLKALTRKTLPAN 270

Query: 1385 YLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPESQ--VLVAEL 1212
                 ++S+VY+ESF DSQSDSG+++L I  +  K++ L P        +SQ    + E 
Sbjct: 271  DHKPFSTSSVYIESFTDSQSDSGEKVLAIDPDPHKVSGLYPCQYDPSQSDSQASAQITEH 330

Query: 1211 DQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALG-----GKQPL 1047
             QFKNE+  G+N MV SVEI+LMDDFLEMERLAAL +TE+ SC     +G      + PL
Sbjct: 331  GQFKNEKDFGKNLMVPSVEINLMDDFLEMERLAALSDTENDSCHLESGIGYQPHTEENPL 390

Query: 1046 KAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVDLETQL 867
            K E +T+I R              K+ LE+ L+ECQKQL+TS+ QL   ++KL DL+ +L
Sbjct: 391  KTEFETMIQRATELERKLEKMAAEKVELEMTLTECQKQLETSQSQLVEADMKLEDLKREL 450

Query: 866  AMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESKLKTSN 687
            A+ N++   A+ EV                       +Q   +VA  +   V++K  +  
Sbjct: 451  ALANDSVYAADEEVRT---------------------YQTMRVVAESQLIAVQTKFNSLL 489

Query: 686  VKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRMKLDS- 510
            +K                     +G+LEEEV KER  S E VA+C  LENE+  MK ++ 
Sbjct: 490  LK---------------------VGSLEEEVWKERNFSAENVAKCLKLENELFSMKHEAE 528

Query: 509  -----QLQRSAVIEG-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDSE 348
                 +LQR A   G  KI Q+KELA+AA++F ECQKTIASL +QLKSL TL+D L+DSE
Sbjct: 529  HQREVELQRLASTNGELKIKQEKELALAANRFAECQKTIASLGQQLKSLTTLEDILVDSE 588

Query: 347  QPIELL*K----HISNLKP 303
            +P EL+ +    HI++ +P
Sbjct: 589  RPPELIEEGMQCHINSPEP 607


>ref|XP_012078245.1| PREDICTED: filament-like plant protein isoform X2 [Jatropha curcas]
            gi|802636039|ref|XP_012078246.1| PREDICTED: filament-like
            plant protein isoform X2 [Jatropha curcas]
            gi|643723198|gb|KDP32803.1| hypothetical protein
            JCGZ_12095 [Jatropha curcas]
          Length = 681

 Score =  456 bits (1174), Expect = e-125
 Identities = 284/598 (47%), Positives = 377/598 (63%), Gaps = 22/598 (3%)
 Frame = -1

Query: 2078 ERFSDDQ----ASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVK 1911
            ERFSD+Q    AS N+ T SPEV SK    DED+N+SV+ L+EKLS AL+N+ AK+DLVK
Sbjct: 31   ERFSDEQDNLKASPNNETQSPEVTSKTVVRDEDVNDSVRILTEKLSAALVNVSAKDDLVK 90

Query: 1910 QHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXX 1731
            QH+KVAEEAV+GWE+AENEV  LKKQ EA  Q+N  LE+RV HLDGALK           
Sbjct: 91   QHSKVAEEAVAGWEKAENEVAALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQARE 150

Query: 1730 XXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKE 1578
               +K+Y+A ++K+ EWES KSEL+NQL+EL+++ ++ K+E           KLE +EK+
Sbjct: 151  EHEEKVYEAVTKKTIEWESVKSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEKD 210

Query: 1577 NTILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKA 1398
            N  LK E+ S +EEL+L   ERDLST+AAE ASKQHLDSIKK AKLEAECRRLKAVA K+
Sbjct: 211  NASLKLEILSLSEELELRIIERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACKS 270

Query: 1397 SPANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPES--QVL 1224
            S  N   +  +S++YVES  DSQSDSG+RL  +  +  KI+ LEP  C     +S    L
Sbjct: 271  SSLNDHKTSIASSMYVESLTDSQSDSGERLNAVELDAHKISCLEPSKCEPSCSDSWASAL 330

Query: 1223 VAELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESG---SCPE---TGALG 1062
            +AELDQFKNE+ + RN   SS+EIDLMDDFLEMERLA+LPE ESG   S PE   T +  
Sbjct: 331  IAELDQFKNEKAVNRNLPASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQSTD 390

Query: 1061 GKQPLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVD 882
             +  L+AEL+ +I+RT                     +E +KQL+         E + V+
Sbjct: 391  VESSLRAELEIMIHRT---------------------AELEKQLQKM-------EGEKVE 422

Query: 881  LETQLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVESK 702
            LE +L  +   +   E+ +  + EK  +    L EAE+ + +    L +ANE K  +ES+
Sbjct: 423  LEEKLEKILVERTELEMSLTISREKNEEFQIQLGEAELKMKQLHQELSIANESKQQIESQ 482

Query: 701  LKTSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISRM 522
            L +  V+ +                 S + +LE E++KE+ +S E   +C+TLE E+S  
Sbjct: 483  LVSMEVEARTMA--------------SKVDSLEAELEKEKVLSAELAVKCRTLEEELSEK 528

Query: 521  KLDSQLQRSAVIEG-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
              + +LQ+SA   G  KI Q+ +LAVAA K  ECQKTIASL +QLKSLATL+DFLID+
Sbjct: 529  NKEVELQKSASSNGELKIKQE-DLAVAAGKLAECQKTIASLGKQLKSLATLEDFLIDT 585


>ref|XP_012078241.1| PREDICTED: filament-like plant protein isoform X1 [Jatropha curcas]
            gi|802635970|ref|XP_012078242.1| PREDICTED: filament-like
            plant protein isoform X1 [Jatropha curcas]
            gi|802636033|ref|XP_012078243.1| PREDICTED: filament-like
            plant protein isoform X1 [Jatropha curcas]
          Length = 682

 Score =  456 bits (1173), Expect = e-125
 Identities = 284/599 (47%), Positives = 377/599 (62%), Gaps = 23/599 (3%)
 Frame = -1

Query: 2078 ERFSDDQ-----ASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLV 1914
            ERFSD+Q     AS N+ T SPEV SK    DED+N+SV+ L+EKLS AL+N+ AK+DLV
Sbjct: 31   ERFSDEQQDNLKASPNNETQSPEVTSKTVVRDEDVNDSVRILTEKLSAALVNVSAKDDLV 90

Query: 1913 KQHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXX 1734
            KQH+KVAEEAV+GWE+AENEV  LKKQ EA  Q+N  LE+RV HLDGALK          
Sbjct: 91   KQHSKVAEEAVAGWEKAENEVAALKKQLEAAIQQNCALEDRVSHLDGALKECVRQLRQAR 150

Query: 1733 XXXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEK 1581
                +K+Y+A ++K+ EWES KSEL+NQL+EL+++ ++ K+E           KLE +EK
Sbjct: 151  EEHEEKVYEAVTKKTIEWESVKSELENQLLELKTKAEATKSESPPQIVPDLWHKLEYLEK 210

Query: 1580 ENTILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALK 1401
            +N  LK E+ S +EEL+L   ERDLST+AAE ASKQHLDSIKK AKLEAECRRLKAVA K
Sbjct: 211  DNASLKLEILSLSEELELRIIERDLSTQAAETASKQHLDSIKKVAKLEAECRRLKAVACK 270

Query: 1400 ASPANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELEPVDCGFCHPES--QV 1227
            +S  N   +  +S++YVES  DSQSDSG+RL  +  +  KI+ LEP  C     +S    
Sbjct: 271  SSSLNDHKTSIASSMYVESLTDSQSDSGERLNAVELDAHKISCLEPSKCEPSCSDSWASA 330

Query: 1226 LVAELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESG---SCPE---TGAL 1065
            L+AELDQFKNE+ + RN   SS+EIDLMDDFLEMERLA+LPE ESG   S PE   T + 
Sbjct: 331  LIAELDQFKNEKAVNRNLPASSIEIDLMDDFLEMERLASLPENESGTHQSEPEPVATQST 390

Query: 1064 GGKQPLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLV 885
              +  L+AEL+ +I+RT                     +E +KQL+         E + V
Sbjct: 391  DVESSLRAELEIMIHRT---------------------AELEKQLQKM-------EGEKV 422

Query: 884  DLETQLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEKSVVES 705
            +LE +L  +   +   E+ +  + EK  +    L EAE+ + +    L +ANE K  +ES
Sbjct: 423  ELEEKLEKILVERTELEMSLTISREKNEEFQIQLGEAELKMKQLHQELSIANESKQQIES 482

Query: 704  KLKTSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENEISR 525
            +L +  V+ +                 S + +LE E++KE+ +S E   +C+TLE E+S 
Sbjct: 483  QLVSMEVEARTMA--------------SKVDSLEAELEKEKVLSAELAVKCRTLEEELSE 528

Query: 524  MKLDSQLQRSAVIEG-FKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLIDS 351
               + +LQ+SA   G  KI Q+ +LAVAA K  ECQKTIASL +QLKSLATL+DFLID+
Sbjct: 529  KNKEVELQKSASSNGELKIKQE-DLAVAAGKLAECQKTIASLGKQLKSLATLEDFLIDT 586


>ref|XP_007019074.1| Filament-like plant protein, putative isoform 1 [Theobroma cacao]
            gi|508724402|gb|EOY16299.1| Filament-like plant protein,
            putative isoform 1 [Theobroma cacao]
          Length = 713

 Score =  455 bits (1171), Expect = e-125
 Identities = 282/602 (46%), Positives = 384/602 (63%), Gaps = 26/602 (4%)
 Frame = -1

Query: 2078 ERFSDDQ----ASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLVK 1911
            ER+SDDQ    AS N+N  SPEV SKA+ + ED+N+S+K L+EKLS AL+N+ AKEDLVK
Sbjct: 31   ERYSDDQEAFKASPNNNAQSPEVSSKASANCEDVNDSIKRLTEKLSAALVNVSAKEDLVK 90

Query: 1910 QHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXXX 1731
            QHAKVAEEA++GWE+AENEV++LK++ EA  Q+NS LE+RV HLDGALK           
Sbjct: 91   QHAKVAEEAIAGWEKAENEVVLLKQKLEAAVQQNSALEDRVSHLDGALKECVRQLRQARE 150

Query: 1730 XXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETMEKE 1578
               QKI +A ++ + +WE+TK EL++Q +EL+ + ++ K+E           K+E +EKE
Sbjct: 151  EQEQKINEAVAKTTRDWETTKFELESQFLELQDKAEAVKSEPPPHFSPDLWHKIEALEKE 210

Query: 1577 NTILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALKA 1398
            N+ LK EL SQ+EE ++ T ERDLST+AAE ASKQHL+SIKK AKLEAECRRLKA+A K+
Sbjct: 211  NSALKLELSSQSEEFEIRTIERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAIACKS 270

Query: 1397 SPANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELE--PVDCGFCHPESQVL 1224
            S  N   S  +S++YVES  DSQSDSG+RL V+  +  K++ LE    +       +  L
Sbjct: 271  SLVNDHKSPAASSIYVESVTDSQSDSGERLNVVEIDTHKMSGLEANKGEPSCSDSWASAL 330

Query: 1223 VAELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETES------GSCPETGALG 1062
            +AELDQFKNE+ + RN   SS+EIDLMDDFLEMERLAALPE +S             +  
Sbjct: 331  IAELDQFKNEKVISRNLPSSSIEIDLMDDFLEMERLAALPEIKSENQFLESKATARQSND 390

Query: 1061 GKQPLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLVD 882
            G   LKAEL+ +I+RT             K  LEIAL++ Q+ L+ S  QL+ TE KL +
Sbjct: 391  GDSSLKAELEAMIHRTAELEQKLEKIELEKAELEIALAKSQESLEASALQLRDTETKLEE 450

Query: 881  LETQLAMVNEAKRVAELE---VEAANEKLTKSTKHLE-EAEVSLVEHQNRLIVANEEKSV 714
            LE +  M NEAK+  E +   +E   E ++     L+ E E  +       + A E K +
Sbjct: 451  LEREFHMANEAKQHLESQLSSMETDAETMSSKIDSLKAEIEKEMALSAEISVNATESKQL 510

Query: 713  VESKLKTSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLENE 534
            +ES+L +   + +               + + I +LE EV+KER +S +   +CQ LE E
Sbjct: 511  LESQLISIEAEAR--------------TMSAKIDSLETEVEKERALSAQITVKCQELEEE 556

Query: 533  ISRMKLDSQLQRSAVIE-GFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLDDFLI 357
            + R + +++LQ++A      KI Q+ +LAVAA K  ECQKTIASL +QLKSLATL+DFLI
Sbjct: 557  LLRKRQEAELQQTANSNVEVKIKQE-DLAVAAGKLAECQKTIASLGQQLKSLATLEDFLI 615

Query: 356  DS 351
            D+
Sbjct: 616  DT 617


>ref|XP_012450685.1| PREDICTED: filament-like plant protein [Gossypium raimondii]
            gi|823236073|ref|XP_012450686.1| PREDICTED: filament-like
            plant protein [Gossypium raimondii]
            gi|823236075|ref|XP_012450687.1| PREDICTED: filament-like
            plant protein [Gossypium raimondii]
            gi|823236077|ref|XP_012450688.1| PREDICTED: filament-like
            plant protein [Gossypium raimondii]
            gi|823236079|ref|XP_012450689.1| PREDICTED: filament-like
            plant protein [Gossypium raimondii]
            gi|823236081|ref|XP_012450690.1| PREDICTED: filament-like
            plant protein [Gossypium raimondii]
            gi|763797513|gb|KJB64468.1| hypothetical protein
            B456_010G050400 [Gossypium raimondii]
            gi|763797514|gb|KJB64469.1| hypothetical protein
            B456_010G050400 [Gossypium raimondii]
            gi|763797515|gb|KJB64470.1| hypothetical protein
            B456_010G050400 [Gossypium raimondii]
            gi|763797516|gb|KJB64471.1| hypothetical protein
            B456_010G050400 [Gossypium raimondii]
            gi|763797517|gb|KJB64472.1| hypothetical protein
            B456_010G050400 [Gossypium raimondii]
          Length = 714

 Score =  449 bits (1154), Expect = e-123
 Identities = 276/606 (45%), Positives = 390/606 (64%), Gaps = 30/606 (4%)
 Frame = -1

Query: 2078 ERFSDDQ-----ASSNHNTHSPEVISKAAPSDEDLNESVKTLSEKLSEALLNIRAKEDLV 1914
            ERFSDDQ     +S N  T SPEV SKA+   E++N+S+++L+EKLS AL+N+ AKEDLV
Sbjct: 31   ERFSDDQEAFKASSPNDCTKSPEVSSKASAVPEEVNDSIRSLTEKLSAALVNVSAKEDLV 90

Query: 1913 KQHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXXXX 1734
            KQHAKVAEEA++GWE+AENEV+VLK++ E   Q+NS LE+RV HLDGALK          
Sbjct: 91   KQHAKVAEEAIAGWEKAENEVVVLKQKLETTVQQNSALEDRVTHLDGALKECVRQLRQAR 150

Query: 1733 XXXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTE---------VTKLETMEK 1581
                QKI +A ++ + +WE+T+ EL++QL+EL+++ +S K+E         + K+E ++K
Sbjct: 151  EEQEQKINEAVAKTTRDWETTQFELESQLLELQNKAESVKSEPPPPFSPDLLHKIEALKK 210

Query: 1580 ENTILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVALK 1401
            EN+ LK EL SQ EEL++ T ERDLST+AAE ASKQHL+SIK+  KLEAECRRLKA+  K
Sbjct: 211  ENSALKLELSSQLEELQIRTIERDLSTQAAETASKQHLESIKRATKLEAECRRLKAIGSK 270

Query: 1400 ASPANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKINELE--PVDCGFCHPESQV 1227
            +S  N   S  +S++YVESF  SQSDSG+RL V+  +  K++ LE    +       +  
Sbjct: 271  SSFTNDCKSPAASSIYVESFMGSQSDSGERLHVVDTDTQKMSGLEANKGEPSCSDSWASA 330

Query: 1226 LVAELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETES-GSCPETGAL----- 1065
            L+AELDQFKNE+ + RN   SS+EIDLMDDFLEME+LAALP+T++   C E+ A      
Sbjct: 331  LIAELDQFKNEKVINRNVPSSSIEIDLMDDFLEMEQLAALPDTKNENQCLESKATVKQSN 390

Query: 1064 GGKQPLKAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVTELKLV 885
             G   LKAEL+ +I RT             K  LEIAL++ ++ L+ S  +L+ +ELKL 
Sbjct: 391  DGDSSLKAELEAMILRTTELEEKLEKIEAEKAELEIALAKSKESLEASELELRDSELKLE 450

Query: 884  DLETQLAMVNEAKR-------VAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANE 726
            +L+ +L+  NEAK+       + E + E  + K+      +E+     V+       ANE
Sbjct: 451  ELQRELSKANEAKQHLESQLSIMETDAETMSAKIDALGAEIEKERALSVQIS---ADANE 507

Query: 725  EKSVVESKLKTSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQT 546
             K ++ES+L +   + +               + + +G+LE EV+KE+ +S +   +CQ 
Sbjct: 508  SKQLLESQLVSIEAEAR--------------MMSAKVGSLETEVEKEKALSAQITVKCQE 553

Query: 545  LENEISRMKLDSQLQRSAVIE-GFKINQDKELAVAASKFTECQKTIASLSRQLKSLATLD 369
            LE E+SR + +++LQ++A      KI Q+ +LAVAA K  ECQKTIASL +QLKSLATL+
Sbjct: 554  LEEELSRTRQEAELQQTANSNVEVKIKQE-DLAVAAGKLAECQKTIASLGQQLKSLATLE 612

Query: 368  DFLIDS 351
            DFLID+
Sbjct: 613  DFLIDT 618


>ref|XP_004503890.1| PREDICTED: filament-like plant protein [Cicer arietinum]
            gi|502139761|ref|XP_004503891.1| PREDICTED: filament-like
            plant protein [Cicer arietinum]
            gi|502139763|ref|XP_004503892.1| PREDICTED: filament-like
            plant protein [Cicer arietinum]
          Length = 660

 Score =  447 bits (1150), Expect = e-122
 Identities = 283/616 (45%), Positives = 377/616 (61%), Gaps = 34/616 (5%)
 Frame = -1

Query: 2078 ERFSDDQASSNHNTHSPEVISKAAPSDE-----DLNE--SVKTLSEKLSEALLNIRAKED 1920
            ERFSD+Q   +  T SPEV SKAAP++E     +  E   VKTL+ +L++ALL+I AKED
Sbjct: 31   ERFSDEQLYPSQATLSPEVTSKAAPNEEVNTPKNYKEVTDVKTLTNELAKALLDISAKED 90

Query: 1919 LVKQHAKVAEEAVSGWERAENEVLVLKKQTEALNQKNSVLEERVGHLDGALKXXXXXXXX 1740
            LVKQH+KVAEEAVSGWE+AENEVL LK+Q +A  QKNS LE+RV HLDGALK        
Sbjct: 91   LVKQHSKVAEEAVSGWEKAENEVLSLKQQLDAARQKNSGLEDRVSHLDGALKECMRQLRQ 150

Query: 1739 XXXXXXQKIYDADSQKSDEWESTKSELQNQLVELRSQLQSAKTEVT---------KLETM 1587
                  QKI++A +  S++ ES +SEL+ ++ EL +QLQ++K +           +LE +
Sbjct: 151  AREVQEQKIHEAVANDSNDRESRRSELERKVAELETQLQTSKADAAASIRSDLHRRLEAV 210

Query: 1586 EKENTILKRELHSQAEELKLMTCERDLSTRAAEAASKQHLDSIKKTAKLEAECRRLKAVA 1407
            EK+N  L+ EL S+ EEL+    ERDLST+AAE ASKQHL+SIKK AKLEAECRRLKA+ 
Sbjct: 211  EKKNLGLQLELQSRLEELEFRIAERDLSTQAAETASKQHLESIKKVAKLEAECRRLKAMT 270

Query: 1406 LKASPANYLPSVTSSTVYVESFADSQSDSGDRLLVIAKENCKI-----NELEPVDCGFCH 1242
             K    N   S+T+S+VYVESF DS SDSG+RLL +  +  K+     NE EP     C 
Sbjct: 271  RKTFNVNDNRSLTASSVYVESFTDSMSDSGERLLAVESDVHKLGGWEMNECEPSCSDSC- 329

Query: 1241 PESQVLVAELDQFKNERPLGRNQMVSSVEIDLMDDFLEMERLAALPETESGSCPETGALG 1062
              S  L+ ELDQFKN++  G+N   +S+EI+LMDDFLEMERLAALP+TESGS    G L 
Sbjct: 330  --SSALITELDQFKNKKTTGKNHTATSIEINLMDDFLEMERLAALPDTESGSRYAKGGLA 387

Query: 1061 GKQPL------KAELQTLINRTXXXXXXXXXXXXXKINLEIALSECQKQLKTSREQLKVT 900
              Q +      +AE++ +I +              K  +EI+L+ECQ QL+TS  ++   
Sbjct: 388  SDQSIVGQVTVEAEVEAMIQKNTELEKQLEKMVADKHEIEISLTECQMQLETSESRI--- 444

Query: 899  ELKLVDLETQLAMVNEAKRVAELEVEAANEKLTKSTKHLEEAEVSLVEHQNRLIVANEEK 720
                              R AEL+VE    +L+ + K  +EA   L E + +       K
Sbjct: 445  ------------------RAAELKVEELQTQLSLAKKSNQEAYEELKETRTK-------K 479

Query: 719  SVVESKLKTSNVKKQXXXXXXXXXXXXXXXLHSSIGTLEEEVKKERGVSGEAVARCQTLE 540
             +V+SKLK    + +                 S I +LEE+++KER +S   + + + LE
Sbjct: 480  EIVDSKLKLVQTEVEELI--------------SKIHSLEEQIQKERALSAVNLIKSKKLE 525

Query: 539  NEISRMKLDSQLQRSA-------VIEGFKINQDKELAVAASKFTECQKTIASLSRQLKSL 381
            +E+SRMK ++Q+Q+ A       V    K  QDKELA+A SKF ECQKTIASL +QLKSL
Sbjct: 526  DELSRMKHEAQVQQDADTLLKENVNRDLKSKQDKELALATSKFAECQKTIASLGKQLKSL 585

Query: 380  ATLDDFLIDSEQPIEL 333
            ATL+DFL+DS+ PIEL
Sbjct: 586  ATLEDFLLDSDNPIEL 601

