BLASTX nr result

ID: Sinomenium21_contig00026336 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00026336
         (1253 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007048292.1| Tetratricopeptide repeat-containing protein,...   347   5e-93
ref|XP_007048291.1| Tetratricopeptide repeat-containing protein,...   347   5e-93
ref|XP_002309890.1| hypothetical protein POPTR_0007s03710g [Popu...   340   1e-90
emb|CBI37575.3| unnamed protein product [Vitis vinifera]              338   3e-90
ref|XP_002275533.1| PREDICTED: protein TONSOKU-like [Vitis vinif...   338   3e-90
ref|XP_006464604.1| PREDICTED: protein TONSOKU-like isoform X2 [...   328   3e-87
ref|XP_006464603.1| PREDICTED: protein TONSOKU-like isoform X1 [...   328   3e-87
ref|XP_006427817.1| hypothetical protein CICLE_v10024723mg [Citr...   328   3e-87
gb|EXC02646.1| Protein TONSOKU [Morus notabilis]                      305   2e-80
ref|XP_002517217.1| brushy protein, putative [Ricinus communis] ...   296   1e-77
ref|XP_006406573.1| hypothetical protein EUTSA_v10019906mg [Eutr...   283   1e-73
gb|AAS67383.1| BRUSHY1 [Arabidopsis thaliana]                         279   2e-72
tpe|CAE30337.1| TPA: 3g18730 protein [Arabidopsis thaliana]           277   8e-72
ref|NP_188503.2| protein BRUSHY 1 [Arabidopsis thaliana] gi|5278...   277   8e-72
ref|XP_002885276.1| hypothetical protein ARALYDRAFT_898246 [Arab...   275   4e-71
ref|XP_006585323.1| PREDICTED: protein TONSOKU-like isoform X2 [...   272   2e-70
ref|XP_006299481.1| hypothetical protein CARUB_v10015646mg [Caps...   272   2e-70
ref|XP_003532859.1| PREDICTED: protein TONSOKU-like isoform X1 [...   272   2e-70
ref|XP_004955243.1| PREDICTED: protein TONSOKU-like [Setaria ita...   271   6e-70
ref|XP_002452932.1| hypothetical protein SORBIDRAFT_04g035180 [S...   270   1e-69

>ref|XP_007048292.1| Tetratricopeptide repeat-containing protein, putative isoform 2
            [Theobroma cacao] gi|508700553|gb|EOX92449.1|
            Tetratricopeptide repeat-containing protein, putative
            isoform 2 [Theobroma cacao]
          Length = 1278

 Score =  347 bits (891), Expect = 5e-93
 Identities = 186/357 (52%), Positives = 239/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            +G+GS L+QL +G+N P++GN + NLL KLA +KRFS+L+LNGLKLSKPVVD LC LAKT
Sbjct: 927  LGTGSALSQLLIGYNNPISGNAITNLLGKLAKMKRFSDLSLNGLKLSKPVVDGLCYLAKT 986

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCLS LM+ GT IG DGAL L  +L     E LK DLSYCG+ S     L TD+  + GI
Sbjct: 987  SCLSRLMLEGTGIGTDGALGLTQSLFSSTQEPLKLDLSYCGVTSTYVYQLNTDVTFISGI 1046

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNLGGN I  EG NALASLL N QC LK L+LNKC LG+AG+ +IIQALAEN  LEEL
Sbjct: 1047 LELNLGGNPIMLEGGNALASLLINPQCCLKALILNKCQLGMAGILQIIQALAENDSLEEL 1106

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NLA+NA+ +  +T   D   + SS+ LQP+  +S+   ++C S++  D++QG+CV+N++C
Sbjct: 1107 NLADNADTNKQLTIQCDKLTKESSEYLQPDHTISEPYLNQCVSKEC-DVEQGMCVINADC 1165

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSEDD ++ G  A   DD                 Q I+DL TAI M + LQ+L
Sbjct: 1166 SKLEVADSEDDEVRVGTAACEFDDSCASSCQRNSSME---CQFIQDLSTAIGMVKQLQVL 1222

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+VE +E+L+ AW            HI +Q IH SV+   CC +K CCK+D
Sbjct: 1223 DLSNNGFSVEASEALFNAW-SSGSRVGLAWRHIDNQTIHLSVEVNKCCRVKSCCKKD 1278


>ref|XP_007048291.1| Tetratricopeptide repeat-containing protein, putative isoform 1
            [Theobroma cacao] gi|508700552|gb|EOX92448.1|
            Tetratricopeptide repeat-containing protein, putative
            isoform 1 [Theobroma cacao]
          Length = 1294

 Score =  347 bits (891), Expect = 5e-93
 Identities = 186/357 (52%), Positives = 239/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            +G+GS L+QL +G+N P++GN + NLL KLA +KRFS+L+LNGLKLSKPVVD LC LAKT
Sbjct: 943  LGTGSALSQLLIGYNNPISGNAITNLLGKLAKMKRFSDLSLNGLKLSKPVVDGLCYLAKT 1002

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCLS LM+ GT IG DGAL L  +L     E LK DLSYCG+ S     L TD+  + GI
Sbjct: 1003 SCLSRLMLEGTGIGTDGALGLTQSLFSSTQEPLKLDLSYCGVTSTYVYQLNTDVTFISGI 1062

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNLGGN I  EG NALASLL N QC LK L+LNKC LG+AG+ +IIQALAEN  LEEL
Sbjct: 1063 LELNLGGNPIMLEGGNALASLLINPQCCLKALILNKCQLGMAGILQIIQALAENDSLEEL 1122

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NLA+NA+ +  +T   D   + SS+ LQP+  +S+   ++C S++  D++QG+CV+N++C
Sbjct: 1123 NLADNADTNKQLTIQCDKLTKESSEYLQPDHTISEPYLNQCVSKEC-DVEQGMCVINADC 1181

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSEDD ++ G  A   DD                 Q I+DL TAI M + LQ+L
Sbjct: 1182 SKLEVADSEDDEVRVGTAACEFDDSCASSCQRNSSME---CQFIQDLSTAIGMVKQLQVL 1238

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+VE +E+L+ AW            HI +Q IH SV+   CC +K CCK+D
Sbjct: 1239 DLSNNGFSVEASEALFNAW-SSGSRVGLAWRHIDNQTIHLSVEVNKCCRVKSCCKKD 1294


>ref|XP_002309890.1| hypothetical protein POPTR_0007s03710g [Populus trichocarpa]
            gi|222852793|gb|EEE90340.1| hypothetical protein
            POPTR_0007s03710g [Populus trichocarpa]
          Length = 1353

 Score =  340 bits (871), Expect = 1e-90
 Identities = 188/357 (52%), Positives = 232/357 (64%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + +  VLAQL +G+N PV+GN ++NLLAKLATLK F+ LNL+GLKL+KPVVDSLC LAKT
Sbjct: 1003 LNASLVLAQLSIGYNNPVSGNAIINLLAKLATLKSFAALNLSGLKLTKPVVDSLCQLAKT 1062

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCLS LM+G T IG DGAL L  +L  G  E +K DLSYCGL       L TD   + GI
Sbjct: 1063 SCLSRLMLGSTGIGTDGALQLTASLFEGSQESVKLDLSYCGLMPAYTHMLSTD-TLICGI 1121

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNL GN I  EG NA+ SLLTN QC LK LVLNKC LGL G+ ++IQALAEN  LEEL
Sbjct: 1122 LELNLAGNPIMQEGTNAMVSLLTNPQCCLKVLVLNKCQLGLTGILQMIQALAENDCLEEL 1181

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            +LA+NAN++      YDST+   S  LQPNLN S+ ++     E   D +QG+CV+N+EC
Sbjct: 1182 HLADNANLEKTYMIQYDSTKGSCSDILQPNLNKSESSKMSVPKESDSD-KQGVCVMNTEC 1240

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSED PI+     S  DD                 Q I++L TAI MA+ LQ +
Sbjct: 1241 NQLEVADSEDGPIRAEAAPSDFDDSCTSSCQKNSLLE---CQFIQELTTAISMAKQLQFM 1297

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            +L  NGFT ++AE+LY AW            HI+DQ IHFS++   CC  K CC+RD
Sbjct: 1298 ELGNNGFTTQVAEALYTAW-SSRLENGLAWRHIEDQTIHFSMETNKCCRAKPCCRRD 1353


>emb|CBI37575.3| unnamed protein product [Vitis vinifera]
          Length = 1342

 Score =  338 bits (867), Expect = 3e-90
 Identities = 186/357 (52%), Positives = 237/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S SVLAQL LGHN P++GN ++NL+ KL+TL+RFSELNLNGLKLSK VVDSLC L K+
Sbjct: 990  LDSQSVLAQLCLGHNNPISGNSIMNLMGKLSTLERFSELNLNGLKLSKTVVDSLCQLVKS 1049

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCLSGLM+GG+SIG DGAL L  +L  G  EL+K DLSYCGL S     L  ++  VGGI
Sbjct: 1050 SCLSGLMLGGSSIGTDGALQLTKSLFSGAQELVKLDLSYCGLTSEYITNLNAEVPMVGGI 1109

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LE+NLGGN +  +G +ALASLL N  C LK LVLN C LGLAGV +IIQAL+EN  LEEL
Sbjct: 1110 LEINLGGNPVMQKGGSALASLLMNPHCCLKVLVLNNCQLGLAGVLQIIQALSENDSLEEL 1169

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            N+A NA++D   T   +     SS++    LN+S  +   C  ++    Q+G C++N++ 
Sbjct: 1170 NVAGNADLDRHCTSQNNLKALESSETFPQILNISVSSPKVCVLKEVAAAQEGSCIMNTDY 1229

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSEDDPI   P A+S DD             F  S+ I+ L TAI MA+ LQ+L
Sbjct: 1230 NQLEVADSEDDPITAEP-AASYDD--SCTNSCKRMLQFSESEFIQGLSTAIGMAKKLQLL 1286

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+ +  E++Y AW            HIK+Q +H  V+G+ CCG+K CCKRD
Sbjct: 1287 DLSNNGFSTQDTETIYTAW-SLGSRSGLAQRHIKEQTVHLLVRGQKCCGVKPCCKRD 1342


>ref|XP_002275533.1| PREDICTED: protein TONSOKU-like [Vitis vinifera]
          Length = 1309

 Score =  338 bits (867), Expect = 3e-90
 Identities = 186/357 (52%), Positives = 237/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S SVLAQL LGHN P++GN ++NL+ KL+TL+RFSELNLNGLKLSK VVDSLC L K+
Sbjct: 957  LDSQSVLAQLCLGHNNPISGNSIMNLMGKLSTLERFSELNLNGLKLSKTVVDSLCQLVKS 1016

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCLSGLM+GG+SIG DGAL L  +L  G  EL+K DLSYCGL S     L  ++  VGGI
Sbjct: 1017 SCLSGLMLGGSSIGTDGALQLTKSLFSGAQELVKLDLSYCGLTSEYITNLNAEVPMVGGI 1076

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LE+NLGGN +  +G +ALASLL N  C LK LVLN C LGLAGV +IIQAL+EN  LEEL
Sbjct: 1077 LEINLGGNPVMQKGGSALASLLMNPHCCLKVLVLNNCQLGLAGVLQIIQALSENDSLEEL 1136

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            N+A NA++D   T   +     SS++    LN+S  +   C  ++    Q+G C++N++ 
Sbjct: 1137 NVAGNADLDRHCTSQNNLKALESSETFPQILNISVSSPKVCVLKEVAAAQEGSCIMNTDY 1196

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSEDDPI   P A+S DD             F  S+ I+ L TAI MA+ LQ+L
Sbjct: 1197 NQLEVADSEDDPITAEP-AASYDD--SCTNSCKRMLQFSESEFIQGLSTAIGMAKKLQLL 1253

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+ +  E++Y AW            HIK+Q +H  V+G+ CCG+K CCKRD
Sbjct: 1254 DLSNNGFSTQDTETIYTAW-SLGSRSGLAQRHIKEQTVHLLVRGQKCCGVKPCCKRD 1309


>ref|XP_006464604.1| PREDICTED: protein TONSOKU-like isoform X2 [Citrus sinensis]
          Length = 1296

 Score =  328 bits (841), Expect = 3e-87
 Identities = 181/357 (50%), Positives = 236/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            +G+ S LAQL +G+N PVTGN + NLL KL TLK FSELNLNGLKLSKPVVD LC LAKT
Sbjct: 952  LGAESTLAQLCIGYNSPVTGNAITNLLVKLDTLKSFSELNLNGLKLSKPVVDRLCQLAKT 1011

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCL+ LM+G T++G DG+L L+ +L     E +K DLSYCGL+S         ++ V GI
Sbjct: 1012 SCLTHLMLGCTNLGSDGSLQLVESLFSRAQESVKLDLSYCGLESTCIHKFTASVSLVHGI 1071

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNLGGN I  EG NALASLL N QC LK LVL+KC LGLAGV ++I+AL+EN  LEEL
Sbjct: 1072 LELNLGGNPIMKEGANALASLLMNPQCCLKVLVLSKCQLGLAGVLQLIKALSENDTLEEL 1131

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NLA+NA+ +  +  +  S   V+S++LQP L  SD     C S++ +  Q GL  +N++C
Sbjct: 1132 NLADNASKELTLQQNLSS---VNSENLQPALKTSD-----CVSKEVDTDQHGLFAMNTDC 1183

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
             DL+VADSEDD I+    AS  D+                 Q +++L +AI MA+ LQ+L
Sbjct: 1184 NDLEVADSEDDKIRVESAASGFDNSCTSSCQKNSSFE---CQFVQELSSAIGMAKPLQLL 1240

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+ +  ++LY AW            HIK+Q+IHFSV+G  CC +K CC+++
Sbjct: 1241 DLSNNGFSTQAVKTLYCAW-SSRSGAGPAWKHIKEQIIHFSVEGNKCCRVKPCCRKN 1296


>ref|XP_006464603.1| PREDICTED: protein TONSOKU-like isoform X1 [Citrus sinensis]
          Length = 1312

 Score =  328 bits (841), Expect = 3e-87
 Identities = 181/357 (50%), Positives = 236/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            +G+ S LAQL +G+N PVTGN + NLL KL TLK FSELNLNGLKLSKPVVD LC LAKT
Sbjct: 968  LGAESTLAQLCIGYNSPVTGNAITNLLVKLDTLKSFSELNLNGLKLSKPVVDRLCQLAKT 1027

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCL+ LM+G T++G DG+L L+ +L     E +K DLSYCGL+S         ++ V GI
Sbjct: 1028 SCLTHLMLGCTNLGSDGSLQLVESLFSRAQESVKLDLSYCGLESTCIHKFTASVSLVHGI 1087

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNLGGN I  EG NALASLL N QC LK LVL+KC LGLAGV ++I+AL+EN  LEEL
Sbjct: 1088 LELNLGGNPIMKEGANALASLLMNPQCCLKVLVLSKCQLGLAGVLQLIKALSENDTLEEL 1147

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NLA+NA+ +  +  +  S   V+S++LQP L  SD     C S++ +  Q GL  +N++C
Sbjct: 1148 NLADNASKELTLQQNLSS---VNSENLQPALKTSD-----CVSKEVDTDQHGLFAMNTDC 1199

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
             DL+VADSEDD I+    AS  D+                 Q +++L +AI MA+ LQ+L
Sbjct: 1200 NDLEVADSEDDKIRVESAASGFDNSCTSSCQKNSSFE---CQFVQELSSAIGMAKPLQLL 1256

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+ +  ++LY AW            HIK+Q+IHFSV+G  CC +K CC+++
Sbjct: 1257 DLSNNGFSTQAVKTLYCAW-SSRSGAGPAWKHIKEQIIHFSVEGNKCCRVKPCCRKN 1312


>ref|XP_006427817.1| hypothetical protein CICLE_v10024723mg [Citrus clementina]
            gi|557529807|gb|ESR41057.1| hypothetical protein
            CICLE_v10024723mg [Citrus clementina]
          Length = 1307

 Score =  328 bits (841), Expect = 3e-87
 Identities = 181/357 (50%), Positives = 236/357 (66%), Gaps = 1/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            +G+ S LAQL +G+N PVTGN + NLL KL TLK FSELNLNGLKLSKPVVD LC LAKT
Sbjct: 963  LGAESTLAQLCIGYNSPVTGNAITNLLVKLDTLKSFSELNLNGLKLSKPVVDRLCQLAKT 1022

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
            SCL+ LM+G T++G DG+L L+ +L     E +K DLSYCGL+S         ++ V GI
Sbjct: 1023 SCLTHLMLGCTNLGSDGSLQLVESLFSRAQESVKLDLSYCGLESTCIHKFTASVSLVHGI 1082

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNLGGN I  EG NALASLL N QC LK LVL+KC LGLAGV ++I+AL+EN  LEEL
Sbjct: 1083 LELNLGGNPIMKEGANALASLLMNPQCCLKVLVLSKCQLGLAGVLQLIKALSENDTLEEL 1142

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NLA+NA+ +  +  +  S   V+S++LQP L  SD     C S++ +  Q GL  +N++C
Sbjct: 1143 NLADNASKELTLQQNLSS---VNSENLQPALKTSD-----CVSKEVDTDQHGLFAMNTDC 1194

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
             DL+VADSEDD I+    AS  D+                 Q +++L +AI MA+ LQ+L
Sbjct: 1195 NDLEVADSEDDKIRVESAASGFDNSCTSSCQKNSSFE---CQFVQELSSAIGMAKPLQLL 1251

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            DLS NGF+ +  ++LY AW            HIK+Q+IHFSV+G  CC +K CC+++
Sbjct: 1252 DLSNNGFSTQAVKTLYCAW-SSRSGAGPAWKHIKEQIIHFSVEGNKCCRVKPCCRKN 1307


>gb|EXC02646.1| Protein TONSOKU [Morus notabilis]
          Length = 1323

 Score =  305 bits (782), Expect = 2e-80
 Identities = 170/357 (47%), Positives = 227/357 (63%), Gaps = 2/357 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + SGS+L QL +GHN P++G+ L++LLAKL TLKRFS+L+LNGLK  KP+++SLC LAK 
Sbjct: 967  LDSGSILEQLCIGHNNPISGDALISLLAKLTTLKRFSKLSLNGLKQKKPIINSLCELAKA 1026

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPH-ELLKFDLSYCGLKSHSFETLFTDIASVGG 354
            SCLS LM+G T IG +GAL +  +L +G   E LK DLSYC L S     L  D++ +  
Sbjct: 1027 SCLSALMLGETGIGTEGALLVTESLFIGTEPESLKLDLSYCELTSEYILRLNADVSLISR 1086

Query: 355  ILELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEE 534
            I ELNL GN IG EG NAL+SLL+N QC LK LVL KC LG+ G  +I++ALA+N+ LE+
Sbjct: 1087 ISELNLAGNPIGQEGGNALSSLLSNPQCGLKVLVLEKCQLGVVGTLQILKALADNESLED 1146

Query: 535  LNLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSE 714
            LNLA N +VD   TP  D T + S + LQP + +   ++     E+ E  QQGL   N++
Sbjct: 1147 LNLANNVDVDQHSTPRRDVTTKDSEELLQPEIGVPKSSRKASVPEEVEPAQQGLVPENND 1206

Query: 715  CKDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQI 894
               L+VADSE+DPI     AS +DD            + P  QL +++ TAI  A+ LQ+
Sbjct: 1207 LDQLEVADSEEDPIGGEAAASGIDD--SCASSSQRNSSSPEWQLAQEVSTAISKAKTLQL 1264

Query: 895  LDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKR 1065
            LDLS NG + +++E LY AW            HIKDQ IH   +G+ CC IK CC++
Sbjct: 1265 LDLSNNGLSTQVSEKLYTAW--ASSRPRPAYKHIKDQTIHLLTKGRKCC-IKPCCRK 1318


>ref|XP_002517217.1| brushy protein, putative [Ricinus communis]
            gi|223543588|gb|EEF45117.1| brushy protein, putative
            [Ricinus communis]
          Length = 1327

 Score =  296 bits (758), Expect = 1e-77
 Identities = 171/372 (45%), Positives = 228/372 (61%), Gaps = 18/372 (4%)
 Frame = +1

Query: 7    SGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKTSC 183
            SGSVL+QL +GHN  ++GN +VNLL KLA LK F+ELNL+G+K+++PV D+LC LAK SC
Sbjct: 963  SGSVLSQLSIGHNNQISGNAIVNLLTKLAALKSFAELNLSGIKINRPVTDNLCQLAKISC 1022

Query: 184  LSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGILE 363
            LS +M+G T IG DGA+ +  +L  G  E +K DLSYCGL +     L  +   V GILE
Sbjct: 1023 LSRVMLGSTGIGTDGAVQVTESLFSGSQEYVKLDLSYCGLTAAYAHQLNIEDTLVCGILE 1082

Query: 364  LNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEELNL 543
            LNL GN I  EGVNA+ SLL N +C LK LVLNKC LGL GV ++I+ L+EN  LEEL++
Sbjct: 1083 LNLEGNPIMQEGVNAITSLLVNPRCCLKVLVLNKCQLGLTGVLQVIKTLSENHHLEELHV 1142

Query: 544  AENANVDNDITPHYDSTQEVSSKSLQPNLNLSDI-----------TQSK----CTSEKTE 678
            A+N++ D      YDST   S+  LQPN + S+            T+++    C  EK +
Sbjct: 1143 ADNSSQDEKHMMRYDSTTRCSADLLQPNFSTSESSLKVCGPKKADTENEALKVCAPEKAD 1202

Query: 679  DIQQGLCVVNSECKDLQVADSEDDPIK--EGPVASSLDDXXXXXXXXXXXXAFPWSQLIE 852
               + LC VN++C  L+VADSED+ I+   GP     DD                 Q I+
Sbjct: 1203 INHEALCAVNTDCNQLEVADSEDNEIRVEAGP---EFDDSCTSSSQKNSSLE---CQFIQ 1256

Query: 853  DLVTAIRMARHLQILDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGK 1032
            +L  AI MA+ L++LDLS NGF+  +AE+L  AW            HIKDQ+IHFS+  +
Sbjct: 1257 ELSAAISMAKQLKLLDLSNNGFSNPVAETLSNAW-SSRFTTDVSWRHIKDQIIHFSMSDE 1315

Query: 1033 SCCGIKLCCKRD 1068
             CC  K CC++D
Sbjct: 1316 MCCRRKPCCRKD 1327


>ref|XP_006406573.1| hypothetical protein EUTSA_v10019906mg [Eutrema salsugineum]
            gi|557107719|gb|ESQ48026.1| hypothetical protein
            EUTSA_v10019906mg [Eutrema salsugineum]
          Length = 1326

 Score =  283 bits (723), Expect = 1e-73
 Identities = 161/361 (44%), Positives = 221/361 (61%), Gaps = 9/361 (2%)
 Frame = +1

Query: 13   SVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKTSCLS 189
            S L+QLY+G+N PV+G+ + NLLAKLATL  F+EL++NG+KLS  VVDSL +L KT  LS
Sbjct: 981  SGLSQLYIGYNNPVSGSAIQNLLAKLATLSSFAELSMNGIKLSSQVVDSLSVLVKTPSLS 1040

Query: 190  GLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGILELN 369
             L++G + IG DGA+ +  +L     E ++ DLS CGL S  F  L  DI     ILELN
Sbjct: 1041 KLLVGSSGIGTDGAIKITESLCYQKEETVRLDLSCCGLASPFFLRLIQDITLTSSILELN 1100

Query: 370  LGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEELNLAE 549
            +GGN I  EG++AL  LLTN    +K L ++KC+L L+G+  +IQAL+ENK LEELN++E
Sbjct: 1101 VGGNPITEEGISALGVLLTNPCSKIKVLTVSKCHLKLSGILCVIQALSENKNLEELNISE 1160

Query: 550  NANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSE-------KTEDIQQGLCVVN 708
            NA +D   T   +  +E S    Q +     IT    T         +  + +Q LC  +
Sbjct: 1161 NAKLDE--TVFGELVKESSEMGQQEHGTCESITAMDKTHRVESKNHCQNPEKEQELCETS 1218

Query: 709  SECKDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFP-WSQLIEDLVTAIRMARH 885
             EC +L+VADSEDD I+E   ASS               + P  + +IE+L TA+ MA  
Sbjct: 1219 MECDNLEVADSEDDQIEEQTAASS-------------SLSLPRKNHIIEELSTALAMANQ 1265

Query: 886  LQILDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKR 1065
            LQILDLS NGF+VE+ E+LY AW            H+KD+++H+ V+GK CCG+K CC++
Sbjct: 1266 LQILDLSNNGFSVEVLETLYMAWSSSGSRTGIAQRHVKDEIVHYYVEGKICCGVKSCCRK 1325

Query: 1066 D 1068
            D
Sbjct: 1326 D 1326


>gb|AAS67383.1| BRUSHY1 [Arabidopsis thaliana]
          Length = 1311

 Score =  279 bits (713), Expect = 2e-72
 Identities = 157/358 (43%), Positives = 220/358 (61%), Gaps = 2/358 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S S L+QL +G+N PV+G+ + NLLAKLATL  F+EL++NG+KLS  VVDSL  L KT
Sbjct: 976  LDSKSGLSQLCIGYNNPVSGSSIQNLLAKLATLSSFAELSMNGIKLSSQVVDSLYALVKT 1035

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
              LS L++G + IG DGA+ +  +L     E +K DLS CGL S  F  L  D+     I
Sbjct: 1036 PSLSKLLVGSSGIGTDGAIKVTESLCYQKEETVKLDLSCCGLASSFFIKLNQDVTLTSSI 1095

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LE N+GGN I  EG++AL  LL N   ++K L+LNKC+L LAG+  IIQAL++NK LEEL
Sbjct: 1096 LEFNVGGNPITEEGISALGELLRNPCSNIKVLILNKCHLKLAGLLCIIQALSDNKNLEEL 1155

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NL++NA ++++        Q V  +S+     + +     C S  + D +Q LC  N EC
Sbjct: 1156 NLSDNAKIEDETV----FGQPVKERSV-----MVEQEHGTCKSVTSMDKEQELCETNMEC 1206

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFP-WSQLIEDLVTAIRMARHLQI 894
             DL+VADSED+ I+EG   SS               + P  + ++++L TA+ MA  L+I
Sbjct: 1207 DDLEVADSEDEQIEEGTATSS-------------SLSLPRKNHIVKELSTALSMANQLKI 1253

Query: 895  LDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            LDLS NGF+VE  E+LY +W            H+K++ +HF V+GK CCG+K CC++D
Sbjct: 1254 LDLSNNGFSVEALETLYMSWSSSSSRTGIAQRHVKEETVHFYVEGKMCCGVKSCCRKD 1311


>tpe|CAE30337.1| TPA: 3g18730 protein [Arabidopsis thaliana]
          Length = 1311

 Score =  277 bits (708), Expect = 8e-72
 Identities = 156/358 (43%), Positives = 220/358 (61%), Gaps = 2/358 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S S L+QL +G+N PV+G+ + NLLAKLATL  F+EL++NG+KLS  VVDSL  L KT
Sbjct: 976  LDSKSGLSQLCIGYNNPVSGSSIQNLLAKLATLSSFAELSMNGIKLSSQVVDSLYALVKT 1035

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
              LS L++G + IG DGA+ +  +L     E +K DLS CGL S  F  L  D+     I
Sbjct: 1036 PSLSKLLVGSSGIGTDGAIKVTESLCYQKEETVKLDLSCCGLASSFFIKLNQDVTLTSSI 1095

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LE N+GGN I  EG++AL  LL N   ++K L+L+KC+L LAG+  IIQAL++NK LEEL
Sbjct: 1096 LEFNVGGNPITEEGISALGELLRNPCSNIKVLILSKCHLKLAGLLCIIQALSDNKNLEEL 1155

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NL++NA ++++        Q V  +S+     + +     C S  + D +Q LC  N EC
Sbjct: 1156 NLSDNAKIEDETV----FGQPVKERSV-----MVEQEHGTCKSVTSMDKEQELCETNMEC 1206

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFP-WSQLIEDLVTAIRMARHLQI 894
             DL+VADSED+ I+EG   SS               + P  + ++++L TA+ MA  L+I
Sbjct: 1207 DDLEVADSEDEQIEEGTATSS-------------SLSLPRKNHIVKELSTALSMANQLKI 1253

Query: 895  LDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            LDLS NGF+VE  E+LY +W            H+K++ +HF V+GK CCG+K CC++D
Sbjct: 1254 LDLSNNGFSVEALETLYMSWSSSSSRTGIAQRHVKEETVHFYVEGKMCCGVKSCCRKD 1311


>ref|NP_188503.2| protein BRUSHY 1 [Arabidopsis thaliana]
            gi|52782719|sp|Q6Q4D0.2|TONS_ARATH RecName: Full=Protein
            TONSOKU; AltName: Full=Protein BRUSHY 1; AltName:
            Full=Protein MGOUN 3 gi|38707436|dbj|BAD04041.1| TONSOKU
            protein [Arabidopsis thaliana]
            gi|332642616|gb|AEE76137.1| protein BRUSHY 1 [Arabidopsis
            thaliana]
          Length = 1311

 Score =  277 bits (708), Expect = 8e-72
 Identities = 156/358 (43%), Positives = 220/358 (61%), Gaps = 2/358 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S S L+QL +G+N PV+G+ + NLLAKLATL  F+EL++NG+KLS  VVDSL  L KT
Sbjct: 976  LDSKSGLSQLCIGYNNPVSGSSIQNLLAKLATLSSFAELSMNGIKLSSQVVDSLYALVKT 1035

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
              LS L++G + IG DGA+ +  +L     E +K DLS CGL S  F  L  D+     I
Sbjct: 1036 PSLSKLLVGSSGIGTDGAIKVTESLCYQKEETVKLDLSCCGLASSFFIKLNQDVTLTSSI 1095

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LE N+GGN I  EG++AL  LL N   ++K L+L+KC+L LAG+  IIQAL++NK LEEL
Sbjct: 1096 LEFNVGGNPITEEGISALGELLRNPCSNIKVLILSKCHLKLAGLLCIIQALSDNKNLEEL 1155

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NL++NA ++++        Q V  +S+     + +     C S  + D +Q LC  N EC
Sbjct: 1156 NLSDNAKIEDETV----FGQPVKERSV-----MVEQEHGTCKSVTSMDKEQELCETNMEC 1206

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFP-WSQLIEDLVTAIRMARHLQI 894
             DL+VADSED+ I+EG   SS               + P  + ++++L TA+ MA  L+I
Sbjct: 1207 DDLEVADSEDEQIEEGTATSS-------------SLSLPRKNHIVKELSTALSMANQLKI 1253

Query: 895  LDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            LDLS NGF+VE  E+LY +W            H+K++ +HF V+GK CCG+K CC++D
Sbjct: 1254 LDLSNNGFSVEALETLYMSWSSSSSRTGIAQRHVKEETVHFYVEGKMCCGVKSCCRKD 1311


>ref|XP_002885276.1| hypothetical protein ARALYDRAFT_898246 [Arabidopsis lyrata subsp.
            lyrata] gi|297331116|gb|EFH61535.1| hypothetical protein
            ARALYDRAFT_898246 [Arabidopsis lyrata subsp. lyrata]
          Length = 1295

 Score =  275 bits (702), Expect = 4e-71
 Identities = 158/358 (44%), Positives = 213/358 (59%), Gaps = 2/358 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S S L+QL +G+N PV+G+ + NLLAKLATL  F+EL++NG+KLS  VVDSL  L KT
Sbjct: 969  LDSESGLSQLCIGYNNPVSGSSIQNLLAKLATLSSFAELSMNGIKLSSQVVDSLSALVKT 1028

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
              LS L++G + IG DGA+ +  +L     E +K DLS CGL S  F  L  DI     I
Sbjct: 1029 PSLSKLLVGSSGIGTDGAIKVTESLCYQKEETVKLDLSCCGLASSFFIKLNQDITLTSSI 1088

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELN+GGN I  EG++AL  LL N   ++K L+LNKC+L L G+  IIQAL++NK LEEL
Sbjct: 1089 LELNVGGNPITEEGISALGELLRNPCSNIKALILNKCHLKLGGLVCIIQALSDNKNLEEL 1148

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            NL+ENA +D  +       Q V                  C S  + D +Q LC  N EC
Sbjct: 1149 NLSENAKIDETV-----FGQPVKE-------------HGTCESVTSMDKEQELCETNMEC 1190

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFP-WSQLIEDLVTAIRMARHLQI 894
             DL+VADSED+ I+E    SS               + P  + ++++L  A+ +A  LQI
Sbjct: 1191 DDLEVADSEDEQIEERTATSS-------------SLSLPRKNHIVKELSIALAVANQLQI 1237

Query: 895  LDLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            LDLS NGF+VE  E+LY +W            H+KD+++HF V+GK CCG+K CC++D
Sbjct: 1238 LDLSNNGFSVEALETLYMSWSSSSSRTGIAQRHVKDEIVHFYVEGKMCCGVKSCCRKD 1295


>ref|XP_006585323.1| PREDICTED: protein TONSOKU-like isoform X2 [Glycine max]
          Length = 1316

 Score =  272 bits (696), Expect = 2e-70
 Identities = 166/356 (46%), Positives = 213/356 (59%), Gaps = 1/356 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S SVLA L +G+N PV+GN +VNL++KL+TLKRFSELN++GLKL KPVVD+LC LA T
Sbjct: 979  LDSTSVLAHLCIGYNSPVSGNAIVNLVSKLSTLKRFSELNMSGLKLGKPVVDTLCKLAGT 1038

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
              LSGL++GGT +G +GA+ L  +L  G  EL+K DLSYCGL  +    L T +     I
Sbjct: 1039 LNLSGLILGGTGVGTEGAIKLAESLLQGTEELVKLDLSYCGLTFNF--VLNTSVNFFCSI 1096

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNL GN I PEG N L SLL N QC LK LVL KC LGLAG+  II+ALAEN  LEEL
Sbjct: 1097 LELNLEGNPIMPEGSNTLFSLLVNPQCCLKVLVLKKCQLGLAGILHIIEALAENSCLEEL 1156

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            N+A N+         YD    +S KS   N       + K  + K +D Q+ L  +NS  
Sbjct: 1157 NVANNSIPKEVSALQYD----LSVKSCSQN------QEQKLDTMKVDDNQEVLGSLNSAD 1206

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSED P++    AS  DD            + P     +    AI  A++LQ+L
Sbjct: 1207 HLLEVADSEDVPVE--TAASGFDD---SCASSCQRNSSPECHFTQQFSIAIGKAKNLQLL 1261

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKR 1065
            DLS NGF+ + AE+ Y +W            HI +Q+IHFS +   CC +K CCK+
Sbjct: 1262 DLSNNGFSAQAAEAFYGSW--ATLRPLSSQNHITEQIIHFSTRENKCCRVKPCCKK 1315


>ref|XP_006299481.1| hypothetical protein CARUB_v10015646mg [Capsella rubella]
            gi|482568190|gb|EOA32379.1| hypothetical protein
            CARUB_v10015646mg [Capsella rubella]
          Length = 1322

 Score =  272 bits (696), Expect = 2e-70
 Identities = 155/356 (43%), Positives = 214/356 (60%), Gaps = 4/356 (1%)
 Frame = +1

Query: 13   SVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKTSCLS 189
            S L+QL +G+N PV+G+ + NLLAKLATL  F+EL++NG+KLS  VVDSL    KT  LS
Sbjct: 980  SGLSQLCIGYNNPVSGSAIQNLLAKLATLSSFAELSMNGIKLSSQVVDSLSAFVKTPSLS 1039

Query: 190  GLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGILELN 369
             L++G + IG +GA+ +  +L     E +K DLS CGL S  F  L  DI     ILELN
Sbjct: 1040 KLLVGSSGIGTEGAIKVTKSLCYQKEETVKLDLSCCGLASPFFLKLTQDITLTSCILELN 1099

Query: 370  LGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEELNLAE 549
            +GGNSI  EG++AL  LL N   ++K L+LNKC+L L+G+  IIQ+L++NK LEELNL+E
Sbjct: 1100 VGGNSITEEGISALGMLLMNPCSNIKVLILNKCHLKLSGILCIIQSLSDNKNLEELNLSE 1159

Query: 550  NANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTED--IQQGLCVVNSECKD 723
            NA +D  +   +             ++   D TQ   T    ++   +Q LC  + EC D
Sbjct: 1160 NAKIDETVFGQHVKAMGQQEHGTCESVESVDKTQRVETKHHCQNPGKEQKLCETDMECDD 1219

Query: 724  LQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFP-WSQLIEDLVTAIRMARHLQILD 900
            L+VADSED+ I+E    S                + P  + ++E+L TA+ MA  LQILD
Sbjct: 1220 LEVADSEDEQIEEQTATS-------------RSLSLPRKNHIMEELSTALAMANQLQILD 1266

Query: 901  LSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            LS NGF+VE  E+LY +W            H+KD+ +HF V+GK CCG+K CC++D
Sbjct: 1267 LSNNGFSVEALETLYMSWSSSSLRSGIAQKHVKDETVHFYVEGKICCGVKSCCRKD 1322


>ref|XP_003532859.1| PREDICTED: protein TONSOKU-like isoform X1 [Glycine max]
          Length = 1318

 Score =  272 bits (696), Expect = 2e-70
 Identities = 166/356 (46%), Positives = 213/356 (59%), Gaps = 1/356 (0%)
 Frame = +1

Query: 1    IGSGSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKT 177
            + S SVLA L +G+N PV+GN +VNL++KL+TLKRFSELN++GLKL KPVVD+LC LA T
Sbjct: 981  LDSTSVLAHLCIGYNSPVSGNAIVNLVSKLSTLKRFSELNMSGLKLGKPVVDTLCKLAGT 1040

Query: 178  SCLSGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGI 357
              LSGL++GGT +G +GA+ L  +L  G  EL+K DLSYCGL  +    L T +     I
Sbjct: 1041 LNLSGLILGGTGVGTEGAIKLAESLLQGTEELVKLDLSYCGLTFNF--VLNTSVNFFCSI 1098

Query: 358  LELNLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEEL 537
            LELNL GN I PEG N L SLL N QC LK LVL KC LGLAG+  II+ALAEN  LEEL
Sbjct: 1099 LELNLEGNPIMPEGSNTLFSLLVNPQCCLKVLVLKKCQLGLAGILHIIEALAENSCLEEL 1158

Query: 538  NLAENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSEC 717
            N+A N+         YD    +S KS   N       + K  + K +D Q+ L  +NS  
Sbjct: 1159 NVANNSIPKEVSALQYD----LSVKSCSQN------QEQKLDTMKVDDNQEVLGSLNSAD 1208

Query: 718  KDLQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQIL 897
              L+VADSED P++    AS  DD            + P     +    AI  A++LQ+L
Sbjct: 1209 HLLEVADSEDVPVE--TAASGFDD---SCASSCQRNSSPECHFTQQFSIAIGKAKNLQLL 1263

Query: 898  DLSGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKR 1065
            DLS NGF+ + AE+ Y +W            HI +Q+IHFS +   CC +K CCK+
Sbjct: 1264 DLSNNGFSAQAAEAFYGSW--ATLRPLSSQNHITEQIIHFSTRENKCCRVKPCCKK 1317


>ref|XP_004955243.1| PREDICTED: protein TONSOKU-like [Setaria italica]
          Length = 1319

 Score =  271 bits (692), Expect = 6e-70
 Identities = 154/354 (43%), Positives = 223/354 (62%), Gaps = 1/354 (0%)
 Frame = +1

Query: 10   GSVLAQLYLG-HNPVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKTSCL 186
            GSVL+ L LG +NP++GN ++NLL+KLA+L RFSEL+L G+KL+K +VD LCLLA++SCL
Sbjct: 980  GSVLSHLSLGKNNPISGNAMLNLLSKLASLTRFSELSLTGIKLNKLMVDKLCLLAQSSCL 1039

Query: 187  SGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGILEL 366
            SGL++GGTSIG  G + L +ALS    +LL+ +LS CGL +  F  + T+++ +  IL+L
Sbjct: 1040 SGLLLGGTSIGPVGTIRLTDALSCTSQDLLRLELSNCGLTAPDFAQICTNLSCI-NILDL 1098

Query: 367  NLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEELNLA 546
            NLGGNSI  EG +A+ ++L N QCS+++L+L++CNLGLAG+  IIQAL+ N  LEEL LA
Sbjct: 1099 NLGGNSINLEGCDAIQAMLVNPQCSIRSLMLDRCNLGLAGIVCIIQALSGNDQLEELRLA 1158

Query: 547  ENANVDNDITPHYDSTQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSECKDL 726
            EN N   +    Y+  QEVS+ + +   N           E +  I QG    + + +++
Sbjct: 1159 ENTNSSLE-RMQYEDMQEVSTSNEKKQCN---------NPETSNAIAQG----SLDFENM 1204

Query: 727  QVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQILDLS 906
            QV DSED+   E     S+              ++   Q+I++L  A+  A+ L++LDLS
Sbjct: 1205 QVPDSEDEAENEN--HRSVSGPHRSCASSSQKNSYSNCQIIQELAEALISAKRLKVLDLS 1262

Query: 907  GNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
             NG + E  +SLY+AW            H+   V+HFSV G  CCG+K CC+RD
Sbjct: 1263 QNGLSDEAIQSLYSAWASVPRGDGMARKHVNKDVVHFSVDGMRCCGMKPCCRRD 1316


>ref|XP_002452932.1| hypothetical protein SORBIDRAFT_04g035180 [Sorghum bicolor]
            gi|241932763|gb|EES05908.1| hypothetical protein
            SORBIDRAFT_04g035180 [Sorghum bicolor]
          Length = 1292

 Score =  270 bits (690), Expect = 1e-69
 Identities = 151/355 (42%), Positives = 220/355 (61%), Gaps = 2/355 (0%)
 Frame = +1

Query: 10   GSVLAQLYLGHN-PVTGNVLVNLLAKLATLKRFSELNLNGLKLSKPVVDSLCLLAKTSCL 186
            GSVL+ L LG N P++ N ++NLL+KLA+L RFSEL+L G+KL+K +VD LCLLA++SCL
Sbjct: 951  GSVLSHLSLGKNHPISSNTMLNLLSKLASLTRFSELSLTGIKLNKLMVDKLCLLAQSSCL 1010

Query: 187  SGLMIGGTSIGDDGALNLINALSMGPHELLKFDLSYCGLKSHSFETLFTDIASVGGILEL 366
            SGL++GGTSIG    + L  ALS    ELL+ +LS CGL +     + T+++ +  ILEL
Sbjct: 1011 SGLLLGGTSIGPVETIKLTEALSCTSQELLRLELSNCGLTTPDLTQICTNLSRI-NILEL 1069

Query: 367  NLGGNSIGPEGVNALASLLTNLQCSLKTLVLNKCNLGLAGVTKIIQALAENKILEELNLA 546
            NLGGN I  EG +A+  +L N QCS+++L L+KCNLGLAG+ ++IQ+L+EN  LEEL ++
Sbjct: 1070 NLGGNPINLEGCDAIQGMLVNPQCSIRSLTLDKCNLGLAGIVRVIQSLSENSQLEELRMS 1129

Query: 547  ENANVDNDITPHYD-STQEVSSKSLQPNLNLSDITQSKCTSEKTEDIQQGLCVVNSECKD 723
            +N N++++ T  YD   QEVS+ + Q   N           E   DI  G    + +  +
Sbjct: 1130 KNTNLESERTIKYDEDMQEVSTTAEQKQCN---------NPETKNDIAPG----DIDFAN 1176

Query: 724  LQVADSEDDPIKEGPVASSLDDXXXXXXXXXXXXAFPWSQLIEDLVTAIRMARHLQILDL 903
            +QV DSED+   +     ++              ++   Q+IE+L  A+  A+ L++LDL
Sbjct: 1177 MQVPDSEDE--ADNDAHHAISGPHRSCASSSQKNSYSSCQIIEELAEALISAKQLKVLDL 1234

Query: 904  SGNGFTVEIAESLYAAWXXXXXXXXXXXXHIKDQVIHFSVQGKSCCGIKLCCKRD 1068
            S NG + E  +SLY+AW            H+  +V+HFSV G  CCG+K CC+RD
Sbjct: 1235 SCNGLSEEAIQSLYSAWASVPRGDGMARKHVNKEVVHFSVDGMRCCGLKPCCRRD 1289

