BLASTX nr result

ID: Sinomenium22_contig00035194 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00035194
         (753 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282785.1| PREDICTED: armadillo repeat-containing prote...   290   3e-76
emb|CAN74401.1| hypothetical protein VITISV_043630 [Vitis vinifera]   288   1e-75
ref|XP_006393063.1| hypothetical protein EUTSA_v10011291mg [Eutr...   276   6e-72
ref|XP_006306946.1| hypothetical protein CARUB_v10008512mg [Caps...   271   2e-70
ref|XP_002891633.1| armadillo/beta-catenin repeat family protein...   270   5e-70
gb|AAD30652.1|AC006085_25 Hypothetical protein [Arabidopsis thal...   268   1e-69
ref|NP_175546.1| ARM repeat superfamily protein [Arabidopsis tha...   268   1e-69
ref|XP_002313338.1| armadillo/beta-catenin repeat family protein...   259   7e-67
ref|XP_006575729.1| PREDICTED: armadillo repeat-containing prote...   259   9e-67
gb|EXB93306.1| Armadillo repeat-containing protein 8 [Morus nota...   257   3e-66
ref|XP_007142472.1| hypothetical protein PHAVU_008G283500g [Phas...   257   3e-66
ref|XP_004291788.1| PREDICTED: armadillo repeat-containing prote...   256   6e-66
ref|XP_006838345.1| hypothetical protein AMTR_s00103p00158520 [A...   255   1e-65
ref|XP_006470614.1| PREDICTED: armadillo repeat-containing prote...   254   2e-65
ref|XP_007015057.1| ARM repeat superfamily protein isoform 4 [Th...   253   5e-65
ref|XP_007015056.1| F11M15.21 protein isoform 3 [Theobroma cacao...   253   5e-65
ref|XP_007015055.1| ARM repeat superfamily protein isoform 2 [Th...   253   5e-65
ref|XP_007015054.1| ARM repeat superfamily protein isoform 1 [Th...   253   5e-65
ref|XP_003617912.1| Armadillo repeat-containing protein [Medicag...   252   9e-65
ref|XP_002513503.1| conserved hypothetical protein [Ricinus comm...   251   2e-64

>ref|XP_002282785.1| PREDICTED: armadillo repeat-containing protein 8 [Vitis vinifera]
           gi|297746314|emb|CBI16370.3| unnamed protein product
           [Vitis vinifera]
          Length = 655

 Score =  290 bits (743), Expect = 3e-76
 Identities = 156/242 (64%), Positives = 187/242 (77%), Gaps = 2/242 (0%)
 Frame = -3

Query: 721 MPASATANRPEDLIVRLRSG--DGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXX 548
           MP SA+ +RPEDL+ RLRS   D + KLKALRE+KNQIIGN+TKKLSYIKLGAVP VV  
Sbjct: 1   MPTSASTHRPEDLLTRLRSANADADAKLKALREVKNQIIGNRTKKLSYIKLGAVPAVVSV 60

Query: 547 XXXXXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGA 368
                               IGSFACG +AGV+AVL  GAFPHL R+LSN + KVVD GA
Sbjct: 61  IAATADDCSSVLVQSAA--AIGSFACGFEAGVQAVLRAGAFPHLLRLLSNSNGKVVDAGA 118

Query: 367 RSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCG 188
           RSLRMI+QS+LAPKYDF + +NME LLSLLNS+NENVTGLGASIITHSC+ SAEQ ALC 
Sbjct: 119 RSLRMIYQSKLAPKYDFLQEKNMEFLLSLLNSENENVTGLGASIITHSCETSAEQNALCD 178

Query: 187 AGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSP 8
           AGVL++LI LL GSL QR+ASL+++A V ++N EV+S F+  +NGRALS++TEL +D+ P
Sbjct: 179 AGVLKKLIGLLQGSLSQRDASLESIATVIKSNPEVVSKFVGPENGRALSAVTELTKDRYP 238

Query: 7   RT 2
           RT
Sbjct: 239 RT 240


>emb|CAN74401.1| hypothetical protein VITISV_043630 [Vitis vinifera]
          Length = 637

 Score =  288 bits (738), Expect = 1e-75
 Identities = 155/242 (64%), Positives = 186/242 (76%), Gaps = 2/242 (0%)
 Frame = -3

Query: 721 MPASATANRPEDLIVRLRSG--DGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXX 548
           MP SA+ +RPEDL+ RLRS   D + KLKALRE+KNQIIGN+TKKLSYIKLGAVP V   
Sbjct: 1   MPTSASTHRPEDLLTRLRSANADADAKLKALREVKNQIIGNRTKKLSYIKLGAVPAVXSV 60

Query: 547 XXXXXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGA 368
                               IGSFACG +AGV+AVL  GAFPHL R+LSN + KVVD GA
Sbjct: 61  IAATADDCSSVLVQSAA--AIGSFACGFEAGVQAVLRAGAFPHLLRLLSNSNGKVVDAGA 118

Query: 367 RSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCG 188
           RSLRMI+QS+LAPKYDF + +NME LLSLLNS+NENVTGLGASIITHSC+ SAEQ ALC 
Sbjct: 119 RSLRMIYQSKLAPKYDFLQEKNMEFLLSLLNSENENVTGLGASIITHSCETSAEQNALCD 178

Query: 187 AGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSP 8
           AGVL++LI LL GSL QR+ASL+++A V ++N EV+S F+  +NGRALS++TEL +D+ P
Sbjct: 179 AGVLKKLIGLLQGSLSQRDASLESIATVIKSNPEVVSKFVGPENGRALSAVTELTKDRYP 238

Query: 7   RT 2
           RT
Sbjct: 239 RT 240


>ref|XP_006393063.1| hypothetical protein EUTSA_v10011291mg [Eutrema salsugineum]
           gi|557089641|gb|ESQ30349.1| hypothetical protein
           EUTSA_v10011291mg [Eutrema salsugineum]
          Length = 666

 Score =  276 bits (706), Expect = 6e-72
 Identities = 143/243 (58%), Positives = 178/243 (73%)
 Frame = -3

Query: 730 SGEMPASATANRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVX 551
           S    +S++ NR  D+  RL S D EVKLKALRE+KNQIIGN+TKKLS++KLGAVP +  
Sbjct: 10  SAAASSSSSGNRQADVFSRLASSDPEVKLKALREVKNQIIGNRTKKLSFLKLGAVPAIAS 69

Query: 550 XXXXXXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTG 371
                              A +GSFACG +AGV+AVLD G FPHL R+L+NP +KVVD G
Sbjct: 70  ALSDADDSDKCNNILVQSAAALGSFACGFEAGVQAVLDAGVFPHLLRLLTNPDEKVVDAG 129

Query: 370 ARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALC 191
           ARSLRMIFQS LAPKYDF + +NME L SLLNS+NENV+GLGASII H+C  S EQ+ LC
Sbjct: 130 ARSLRMIFQSNLAPKYDFLQEKNMEFLFSLLNSENENVSGLGASIIAHACGTSVEQRLLC 189

Query: 190 GAGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKS 11
            AGVL++L+ILL+GSL QREA L++LA V +NN + +S F+ L+ GR LSS+TEL +D+ 
Sbjct: 190 DAGVLEKLVILLDGSLSQREACLESLATVLKNNPDAVSHFVGLEGGRYLSSVTELTKDRY 249

Query: 10  PRT 2
            RT
Sbjct: 250 TRT 252


>ref|XP_006306946.1| hypothetical protein CARUB_v10008512mg [Capsella rubella]
           gi|482575657|gb|EOA39844.1| hypothetical protein
           CARUB_v10008512mg [Capsella rubella]
          Length = 669

 Score =  271 bits (693), Expect = 2e-70
 Identities = 140/233 (60%), Positives = 171/233 (73%)
 Frame = -3

Query: 700 NRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXXXXXXXX 521
           NR  D+  RL S D EVKLKALRE+KNQIIGN+TKKLS++KLGAVP +            
Sbjct: 22  NRQADVFCRLASSDPEVKLKALREVKNQIIGNRTKKLSFLKLGAVPAIASALADADDSEK 81

Query: 520 XXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARSLRMIFQS 341
                    A +GSFACG +AGV AVLD G FPHL R+L+NP DKVVD GARSLRMIFQS
Sbjct: 82  CNNILVQSAAALGSFACGFEAGVHAVLDAGVFPHLLRLLTNPDDKVVDAGARSLRMIFQS 141

Query: 340 RLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAGVLQRLII 161
             APKYDF + +NME L SLLNS+NENV+GLGASII H+C    EQ+ LC AGVL++L+I
Sbjct: 142 NQAPKYDFLQEKNMEFLFSLLNSENENVSGLGASIIAHACGTVVEQRVLCDAGVLEKLVI 201

Query: 160 LLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           LL+GSL QREA L++LA V +NN E +S F+ L++GR L+S+ EL +D+ PRT
Sbjct: 202 LLDGSLSQREACLESLATVLKNNPEAVSDFVGLESGRYLNSVIELTKDRYPRT 254


>ref|XP_002891633.1| armadillo/beta-catenin repeat family protein [Arabidopsis lyrata
           subsp. lyrata] gi|297337475|gb|EFH67892.1|
           armadillo/beta-catenin repeat family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 666

 Score =  270 bits (689), Expect = 5e-70
 Identities = 137/233 (58%), Positives = 172/233 (73%)
 Frame = -3

Query: 700 NRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXXXXXXXX 521
           NR  D+  RL S D EVKLKALRE+KNQIIGN+TKKLS++KLGA+P +            
Sbjct: 19  NRQADVFSRLASSDPEVKLKALREVKNQIIGNRTKKLSFLKLGAIPAIASVLADADDSDK 78

Query: 520 XXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARSLRMIFQS 341
                    A +GSFACG +AGV+AVLD G FPHL R+L+NP +KVVD GARSLRMIFQS
Sbjct: 79  CNNILVQSAAALGSFACGFEAGVQAVLDAGVFPHLLRLLTNPDEKVVDAGARSLRMIFQS 138

Query: 340 RLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAGVLQRLII 161
             APKYDF + +NME L SLLNS+NENV+GLGASII H+C  S EQ+ LC AGVL++L+I
Sbjct: 139 NQAPKYDFLQEKNMEFLFSLLNSENENVSGLGASIIAHACGTSVEQRVLCEAGVLEKLVI 198

Query: 160 LLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           LL+GS  QREA L++LA V +NN E +S F+ L++G+  +S+TEL +D+ PRT
Sbjct: 199 LLDGSFSQREACLESLATVLKNNPEAVSDFVGLESGKYFNSVTELTKDRYPRT 251


>gb|AAD30652.1|AC006085_25 Hypothetical protein [Arabidopsis thaliana]
          Length = 688

 Score =  268 bits (686), Expect = 1e-69
 Identities = 137/233 (58%), Positives = 172/233 (73%)
 Frame = -3

Query: 700 NRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXXXXXXXX 521
           NR  D+  RL S D EVKLKALRE+KNQIIGN+TKKLS++KLGA+P +            
Sbjct: 19  NRQSDVFSRLASSDPEVKLKALREVKNQIIGNRTKKLSFLKLGAIPAIASVLADADDSDE 78

Query: 520 XXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARSLRMIFQS 341
                    A +GSFACG +AGV+AVLD G FPHL R+L+N  +KVVD GARSLRMIFQS
Sbjct: 79  CNNILVQSAAALGSFACGFEAGVQAVLDAGVFPHLLRLLTNTDEKVVDAGARSLRMIFQS 138

Query: 340 RLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAGVLQRLII 161
             APKYDF + +NME L SLLNS+NENV+GLGASII H+C  S EQ+ LC AGVL++L+I
Sbjct: 139 NQAPKYDFLQEKNMEFLFSLLNSENENVSGLGASIIAHACGTSVEQRVLCEAGVLEKLVI 198

Query: 160 LLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           LL+GSL QREA L++LA V +NN E +S F+ L++G+  +S+TEL +D+ PRT
Sbjct: 199 LLDGSLSQREACLESLATVLKNNPEAVSDFVGLESGKYFNSVTELTKDRYPRT 251


>ref|NP_175546.1| ARM repeat superfamily protein [Arabidopsis thaliana]
           gi|20260482|gb|AAM13139.1| unknown protein [Arabidopsis
           thaliana] gi|30725506|gb|AAP37775.1| At1g51350
           [Arabidopsis thaliana] gi|332194533|gb|AEE32654.1| ARM
           repeat superfamily protein [Arabidopsis thaliana]
          Length = 666

 Score =  268 bits (686), Expect = 1e-69
 Identities = 137/233 (58%), Positives = 172/233 (73%)
 Frame = -3

Query: 700 NRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXXXXXXXX 521
           NR  D+  RL S D EVKLKALRE+KNQIIGN+TKKLS++KLGA+P +            
Sbjct: 19  NRQSDVFSRLASSDPEVKLKALREVKNQIIGNRTKKLSFLKLGAIPAIASVLADADDSDE 78

Query: 520 XXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARSLRMIFQS 341
                    A +GSFACG +AGV+AVLD G FPHL R+L+N  +KVVD GARSLRMIFQS
Sbjct: 79  CNNILVQSAAALGSFACGFEAGVQAVLDAGVFPHLLRLLTNTDEKVVDAGARSLRMIFQS 138

Query: 340 RLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAGVLQRLII 161
             APKYDF + +NME L SLLNS+NENV+GLGASII H+C  S EQ+ LC AGVL++L+I
Sbjct: 139 NQAPKYDFLQEKNMEFLFSLLNSENENVSGLGASIIAHACGTSVEQRVLCEAGVLEKLVI 198

Query: 160 LLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           LL+GSL QREA L++LA V +NN E +S F+ L++G+  +S+TEL +D+ PRT
Sbjct: 199 LLDGSLSQREACLESLATVLKNNPEAVSDFVGLESGKYFNSVTELTKDRYPRT 251


>ref|XP_002313338.1| armadillo/beta-catenin repeat family protein [Populus trichocarpa]
           gi|222849746|gb|EEE87293.1| armadillo/beta-catenin
           repeat family protein [Populus trichocarpa]
          Length = 666

 Score =  259 bits (662), Expect = 7e-67
 Identities = 142/252 (56%), Positives = 177/252 (70%), Gaps = 12/252 (4%)
 Frame = -3

Query: 721 MPASATA---NRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVX 551
           MP + T    + P DL+ RL S D E KLKALRE+KNQIIGN+TKKLS++KLGAVP V  
Sbjct: 1   MPTANTTIPIHGPVDLLTRLDSPDPETKLKALREIKNQIIGNRTKKLSFLKLGAVPAVAS 60

Query: 550 XXXXXXXXXXXXXXXXXXXA---------VIGSFACGVDAGVEAVLDNGAFPHLFRILSN 398
                                        V+GSFACG DAGV AVLD G+FPHL R+L N
Sbjct: 61  ILSSYATEADSQLADADVSISNVIVQSAAVLGSFACGFDAGVRAVLDAGSFPHLIRLLFN 120

Query: 397 PHDKVVDTGARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCK 218
           P +KVVD  ARSLRMI+QS+LAPKY+F + +NME L+SL+NS+ ENVTGLGASIITHSC+
Sbjct: 121 PAEKVVDASARSLRMIYQSKLAPKYEFVQEKNMEFLISLINSECENVTGLGASIITHSCE 180

Query: 217 KSAEQKALCGAGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSS 38
            SAEQ+ALC AGVL++LI LL GSL QR++SL++L  V +NN E +S F+  ++G ALSS
Sbjct: 181 TSAEQRALCDAGVLKKLISLLEGSLSQRDSSLESLGTVLKNNPESVSKFVGPESGSALSS 240

Query: 37  LTELMEDKSPRT 2
           + EL +D+  RT
Sbjct: 241 IIELTKDRYART 252


>ref|XP_006575729.1| PREDICTED: armadillo repeat-containing protein 8-like isoform X1
           [Glycine max]
          Length = 652

 Score =  259 bits (661), Expect = 9e-67
 Identities = 144/242 (59%), Positives = 175/242 (72%), Gaps = 2/242 (0%)
 Frame = -3

Query: 721 MPASATANRPEDLIV-RLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXX 545
           MPA+  ++   DLI+ RL S D E+KLKA+RE+KNQIIGN+TKKLSYIKLGAVP +    
Sbjct: 1   MPATPPSS---DLILHRLTSSDCEIKLKAIREVKNQIIGNRTKKLSYIKLGAVPALAAAL 57

Query: 544 XXXXXXXXXXXXXXXXXAV-IGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGA 368
                            A  +GSFACGVDAGV AVLD GAFPHL R+LS   DKVVD  A
Sbjct: 58  AQADADSASGSTLIVQSAAALGSFACGVDAGVRAVLDAGAFPHLIRLLSAADDKVVDAAA 117

Query: 367 RSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCG 188
           RSLRMI+QS LAPKYDF K ++M+ LLSLL S NEN+TGLGASI+ HSC+K  EQ  LC 
Sbjct: 118 RSLRMIYQSNLAPKYDFFKEQDMQFLLSLLKSGNENLTGLGASIVIHSCEKRDEQNMLCC 177

Query: 187 AGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSP 8
           AG L+ LI LL+GSL QR++SL++LAA+ +NN EV+  F+DL NGR LSS+ EL +D+  
Sbjct: 178 AGALETLISLLDGSLSQRDSSLESLAAILKNNPEVVYKFVDLQNGRVLSSVIELTKDRYS 237

Query: 7   RT 2
           RT
Sbjct: 238 RT 239


>gb|EXB93306.1| Armadillo repeat-containing protein 8 [Morus notabilis]
          Length = 670

 Score =  257 bits (657), Expect = 3e-66
 Identities = 139/240 (57%), Positives = 177/240 (73%), Gaps = 2/240 (0%)
 Frame = -3

Query: 715 ASATANRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDV--VXXXX 542
           + ATA    +L+ RL S D +V++KALR+LKN+IIGN+TKKLS+IKLG VP V  +    
Sbjct: 16  SQATARHHSELLSRLSSPDAQVQVKALRDLKNRIIGNRTKKLSFIKLGLVPAVADILASA 75

Query: 541 XXXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARS 362
                           AV+GSFACG DAGV AVLD G FP L R+LS+P +KVVD+GARS
Sbjct: 76  ADAGADRDSNLLIQSAAVVGSFACGFDAGVLAVLDAGVFPSLVRLLSHPDEKVVDSGARS 135

Query: 361 LRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAG 182
           LRMI+QS+LAPKYDF + ++ME LLSLLNS+NENVTGLGASII HSC+   EQ AL  AG
Sbjct: 136 LRMIYQSKLAPKYDFLEQKSMEFLLSLLNSENENVTGLGASIIMHSCETVEEQNALSQAG 195

Query: 181 VLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           VL++LI LL GSL QR+ASL++LA + +NN E +S+F+  ++GRALSS+  L +D+ PRT
Sbjct: 196 VLKKLIGLLEGSLNQRDASLESLATIIKNNVEAVSMFVGSESGRALSSVVALTKDRYPRT 255


>ref|XP_007142472.1| hypothetical protein PHAVU_008G283500g [Phaseolus vulgaris]
           gi|561015605|gb|ESW14466.1| hypothetical protein
           PHAVU_008G283500g [Phaseolus vulgaris]
          Length = 652

 Score =  257 bits (657), Expect = 3e-66
 Identities = 143/237 (60%), Positives = 172/237 (72%), Gaps = 3/237 (1%)
 Frame = -3

Query: 703 ANRPE-DLIV-RLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXXXXX 530
           A RP  DLI+ RL S D E+KLKA+RE+KNQIIGN+TKKLSYIKLGAVP +         
Sbjct: 3   ATRPSSDLIIHRLASSDCEIKLKAIREVKNQIIGNRTKKLSYIKLGAVPVLAAVLAGADA 62

Query: 529 XXXXXXXXXXXXAV-IGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARSLRM 353
                       A  +GSFACGVDAGV AVLD GAFPHL R+LS   DKV D  ARSLRM
Sbjct: 63  DSAPGSSLIVQSAAALGSFACGVDAGVRAVLDAGAFPHLIRLLSAADDKVADAAARSLRM 122

Query: 352 IFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAGVLQ 173
           I+QS+LAPKYDF K ++M  LLSLL S NEN+TGLGASI+  SC+ S EQ  LC AG L+
Sbjct: 123 IYQSKLAPKYDFYKEQDMGFLLSLLKSGNENLTGLGASIVIQSCETSDEQNILCCAGALE 182

Query: 172 RLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           RLI LL+G+L QR+ SL+++AA+ +NN EV+S F+DL NGRALSS+ EL +D+  RT
Sbjct: 183 RLISLLDGTLSQRDFSLESIAAILKNNPEVVSKFVDLQNGRALSSIIELTKDRYSRT 239


>ref|XP_004291788.1| PREDICTED: armadillo repeat-containing protein 8-like [Fragaria
           vesca subsp. vesca]
          Length = 677

 Score =  256 bits (654), Expect = 6e-66
 Identities = 139/243 (57%), Positives = 174/243 (71%), Gaps = 5/243 (2%)
 Frame = -3

Query: 715 ASATANRPE-DLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXX 539
           A+AT   P  DL+ RL S D +V+LKALRELKN+IIGN+TKKL+++KLG VP V      
Sbjct: 16  ATATVTAPNSDLLTRLSSPDSQVQLKALRELKNRIIGNRTKKLAFVKLGLVPAVAAILAS 75

Query: 538 XXXXXXXXXXXXXXXA----VIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTG 371
                                +GSFACG DAGV AVLD GA P+L  +LS+P DKVVD G
Sbjct: 76  TAQAHTRPDHDSNLLVQSAAALGSFACGFDAGVRAVLDAGACPNLLLLLSHPDDKVVDAG 135

Query: 370 ARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALC 191
           ARSLRMI+QS LAPK DF +++NME LLSLLNS+NENV GLGASII HSC+  AE+KALC
Sbjct: 136 ARSLRMIYQSNLAPKCDFLQDKNMEFLLSLLNSENENVNGLGASIIIHSCETMAEKKALC 195

Query: 190 GAGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKS 11
            AGVL++L+ LL GS  QR+ SL++LA + +NN E +S F D ++GRALSS+ EL +D++
Sbjct: 196 QAGVLKKLVSLLEGSPSQRDNSLESLATIMKNNDEAVSEFADCESGRALSSVIELTKDRN 255

Query: 10  PRT 2
           PRT
Sbjct: 256 PRT 258


>ref|XP_006838345.1| hypothetical protein AMTR_s00103p00158520 [Amborella trichopoda]
           gi|548840813|gb|ERN00914.1| hypothetical protein
           AMTR_s00103p00158520 [Amborella trichopoda]
          Length = 649

 Score =  255 bits (652), Expect = 1e-65
 Identities = 135/240 (56%), Positives = 173/240 (72%)
 Frame = -3

Query: 721 MPASATANRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXX 542
           MP+SA   RPE+L+  L S D   +LKALR+LKNQIIGN+TKKL Y+KLGAVP V     
Sbjct: 1   MPSSAACKRPENLVEGLSSKDPGTRLKALRDLKNQIIGNRTKKLRYVKLGAVPLV----S 56

Query: 541 XXXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARS 362
                           A +GSF CG+DAGVEA++ +GA P+LF  L+NP +KVV+ GARS
Sbjct: 57  EMLESGSESVVLVQAIATLGSFGCGLDAGVEAIVKSGALPYLFSTLANPDEKVVEAGARS 116

Query: 361 LRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAG 182
           LRMIFQS+L PKY+  +++ M+ L SLLNS+NE VT L ASII HSC+ + EQKALCG+G
Sbjct: 117 LRMIFQSKLTPKYNILEDKEMDFLFSLLNSENETVTELAASIIMHSCETTEEQKALCGSG 176

Query: 181 VLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           VL +L +LL GS  QR+ASLD+LA+V +NN EV+S F+ L NG+ LSS+  L +D+SPRT
Sbjct: 177 VLDKLALLLEGSSNQRDASLDSLASVVKNNREVVSKFVGLHNGKVLSSVNGLTKDRSPRT 236


>ref|XP_006470614.1| PREDICTED: armadillo repeat-containing protein 8-like isoform X1
           [Citrus sinensis]
          Length = 669

 Score =  254 bits (649), Expect = 2e-65
 Identities = 137/239 (57%), Positives = 175/239 (73%), Gaps = 10/239 (4%)
 Frame = -3

Query: 688 DLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDV----------VXXXXX 539
           ++++RL SG+ +VKLKALRELKNQIIGN+TKKLS++KLGAVP V          V     
Sbjct: 16  NILLRLTSGERDVKLKALRELKNQIIGNRTKKLSFLKLGAVPAVAGILSDAVSAVDVADN 75

Query: 538 XXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGARSL 359
                          AV+GSFACG +AGV AVLD GAFP+L R+LS   +KVVD GARSL
Sbjct: 76  DENDRIVKDIIVQSAAVLGSFACGFEAGVRAVLDAGAFPNLSRLLSYTDEKVVDAGARSL 135

Query: 358 RMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCGAGV 179
           +MI+QS++APKYDF + ENME LLSLLN+++ENV+GLGASII+HSCK S EQK L  AGV
Sbjct: 136 KMIYQSKMAPKYDFLQEENMEFLLSLLNNESENVSGLGASIISHSCKTSLEQKLLFDAGV 195

Query: 178 LQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSPRT 2
           L+RL  LL GSL QR+ASL+++A +F+NN EV+S F+  D GR LS + E+++D+  RT
Sbjct: 196 LKRLTSLLGGSLIQRDASLESIATIFKNNPEVVSQFVGPDTGRTLSCIIEIVKDRFART 254


>ref|XP_007015057.1| ARM repeat superfamily protein isoform 4 [Theobroma cacao]
           gi|508785420|gb|EOY32676.1| ARM repeat superfamily
           protein isoform 4 [Theobroma cacao]
          Length = 693

 Score =  253 bits (646), Expect = 5e-65
 Identities = 141/252 (55%), Positives = 181/252 (71%), Gaps = 15/252 (5%)
 Frame = -3

Query: 712 SATAN--RPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXX 539
           SAT N  RP +L+ RL S + EVKL+ALRE+KNQIIGN+TKKLS++KLGAVP V      
Sbjct: 3   SATVNNHRPSELLSRLASSEPEVKLRALREVKNQIIGNRTKKLSFLKLGAVPAVAGILAD 62

Query: 538 XXXXXXXXXXXXXXXA------------VIGSFACGVDAGVEAVLDNGAFPHLFRILSNP 395
                                        +GSFACG DAGV+AVLD GAFP+L R+LSN 
Sbjct: 63  SADDVIDCNNNNCNVNNNINNILVQSAAALGSFACGFDAGVQAVLDAGAFPNLLRLLSNS 122

Query: 394 HDKVVDTGARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKK 215
           ++KVVD GAR+LRMI+QS+LAPKYDF + +NME L+SLLNS+NENV+GLGASIIT+SC+ 
Sbjct: 123 NEKVVDAGARALRMIYQSKLAPKYDFLQQKNMEFLISLLNSENENVSGLGASIITNSCET 182

Query: 214 SAEQKALCGAGVLQRLIILLN-GSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSS 38
           S EQKAL  AG+L+RL  LL  GSL Q++ASL++LA++F+NNSEV+S F   +  R L S
Sbjct: 183 SLEQKALFDAGILRRLNSLLECGSLSQKDASLESLASIFKNNSEVVSKFAGPEIERPLGS 242

Query: 37  LTELMEDKSPRT 2
           + +L++D+ PRT
Sbjct: 243 IIDLLKDRYPRT 254


>ref|XP_007015056.1| F11M15.21 protein isoform 3 [Theobroma cacao]
           gi|508785419|gb|EOY32675.1| F11M15.21 protein isoform 3
           [Theobroma cacao]
          Length = 626

 Score =  253 bits (646), Expect = 5e-65
 Identities = 141/252 (55%), Positives = 181/252 (71%), Gaps = 15/252 (5%)
 Frame = -3

Query: 712 SATAN--RPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXX 539
           SAT N  RP +L+ RL S + EVKL+ALRE+KNQIIGN+TKKLS++KLGAVP V      
Sbjct: 3   SATVNNHRPSELLSRLASSEPEVKLRALREVKNQIIGNRTKKLSFLKLGAVPAVAGILAD 62

Query: 538 XXXXXXXXXXXXXXXA------------VIGSFACGVDAGVEAVLDNGAFPHLFRILSNP 395
                                        +GSFACG DAGV+AVLD GAFP+L R+LSN 
Sbjct: 63  SADDVIDCNNNNCNVNNNINNILVQSAAALGSFACGFDAGVQAVLDAGAFPNLLRLLSNS 122

Query: 394 HDKVVDTGARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKK 215
           ++KVVD GAR+LRMI+QS+LAPKYDF + +NME L+SLLNS+NENV+GLGASIIT+SC+ 
Sbjct: 123 NEKVVDAGARALRMIYQSKLAPKYDFLQQKNMEFLISLLNSENENVSGLGASIITNSCET 182

Query: 214 SAEQKALCGAGVLQRLIILLN-GSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSS 38
           S EQKAL  AG+L+RL  LL  GSL Q++ASL++LA++F+NNSEV+S F   +  R L S
Sbjct: 183 SLEQKALFDAGILRRLNSLLECGSLSQKDASLESLASIFKNNSEVVSKFAGPEIERPLGS 242

Query: 37  LTELMEDKSPRT 2
           + +L++D+ PRT
Sbjct: 243 IIDLLKDRYPRT 254


>ref|XP_007015055.1| ARM repeat superfamily protein isoform 2 [Theobroma cacao]
           gi|508785418|gb|EOY32674.1| ARM repeat superfamily
           protein isoform 2 [Theobroma cacao]
          Length = 695

 Score =  253 bits (646), Expect = 5e-65
 Identities = 141/252 (55%), Positives = 181/252 (71%), Gaps = 15/252 (5%)
 Frame = -3

Query: 712 SATAN--RPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXX 539
           SAT N  RP +L+ RL S + EVKL+ALRE+KNQIIGN+TKKLS++KLGAVP V      
Sbjct: 3   SATVNNHRPSELLSRLASSEPEVKLRALREVKNQIIGNRTKKLSFLKLGAVPAVAGILAD 62

Query: 538 XXXXXXXXXXXXXXXA------------VIGSFACGVDAGVEAVLDNGAFPHLFRILSNP 395
                                        +GSFACG DAGV+AVLD GAFP+L R+LSN 
Sbjct: 63  SADDVIDCNNNNCNVNNNINNILVQSAAALGSFACGFDAGVQAVLDAGAFPNLLRLLSNS 122

Query: 394 HDKVVDTGARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKK 215
           ++KVVD GAR+LRMI+QS+LAPKYDF + +NME L+SLLNS+NENV+GLGASIIT+SC+ 
Sbjct: 123 NEKVVDAGARALRMIYQSKLAPKYDFLQQKNMEFLISLLNSENENVSGLGASIITNSCET 182

Query: 214 SAEQKALCGAGVLQRLIILLN-GSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSS 38
           S EQKAL  AG+L+RL  LL  GSL Q++ASL++LA++F+NNSEV+S F   +  R L S
Sbjct: 183 SLEQKALFDAGILRRLNSLLECGSLSQKDASLESLASIFKNNSEVVSKFAGPEIERPLGS 242

Query: 37  LTELMEDKSPRT 2
           + +L++D+ PRT
Sbjct: 243 IIDLLKDRYPRT 254


>ref|XP_007015054.1| ARM repeat superfamily protein isoform 1 [Theobroma cacao]
           gi|508785417|gb|EOY32673.1| ARM repeat superfamily
           protein isoform 1 [Theobroma cacao]
          Length = 667

 Score =  253 bits (646), Expect = 5e-65
 Identities = 141/252 (55%), Positives = 181/252 (71%), Gaps = 15/252 (5%)
 Frame = -3

Query: 712 SATAN--RPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXXXX 539
           SAT N  RP +L+ RL S + EVKL+ALRE+KNQIIGN+TKKLS++KLGAVP V      
Sbjct: 3   SATVNNHRPSELLSRLASSEPEVKLRALREVKNQIIGNRTKKLSFLKLGAVPAVAGILAD 62

Query: 538 XXXXXXXXXXXXXXXA------------VIGSFACGVDAGVEAVLDNGAFPHLFRILSNP 395
                                        +GSFACG DAGV+AVLD GAFP+L R+LSN 
Sbjct: 63  SADDVIDCNNNNCNVNNNINNILVQSAAALGSFACGFDAGVQAVLDAGAFPNLLRLLSNS 122

Query: 394 HDKVVDTGARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKK 215
           ++KVVD GAR+LRMI+QS+LAPKYDF + +NME L+SLLNS+NENV+GLGASIIT+SC+ 
Sbjct: 123 NEKVVDAGARALRMIYQSKLAPKYDFLQQKNMEFLISLLNSENENVSGLGASIITNSCET 182

Query: 214 SAEQKALCGAGVLQRLIILLN-GSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSS 38
           S EQKAL  AG+L+RL  LL  GSL Q++ASL++LA++F+NNSEV+S F   +  R L S
Sbjct: 183 SLEQKALFDAGILRRLNSLLECGSLSQKDASLESLASIFKNNSEVVSKFAGPEIERPLGS 242

Query: 37  LTELMEDKSPRT 2
           + +L++D+ PRT
Sbjct: 243 IIDLLKDRYPRT 254


>ref|XP_003617912.1| Armadillo repeat-containing protein [Medicago truncatula]
           gi|355519247|gb|AET00871.1| Armadillo repeat-containing
           protein [Medicago truncatula]
          Length = 789

 Score =  252 bits (644), Expect = 9e-65
 Identities = 140/242 (57%), Positives = 173/242 (71%), Gaps = 2/242 (0%)
 Frame = -3

Query: 721 MPASATANRPEDLIV-RLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVPDVVXXX 545
           MP+S  ++   DLI+ RL S D E+KLKA+RE+KNQIIGN+TKKLSYIKLGAVP V    
Sbjct: 1   MPSSHPSS---DLILHRLSSSDYEIKLKAIREIKNQIIGNRTKKLSYIKLGAVPSVADAL 57

Query: 544 XXXXXXXXXXXXXXXXXA-VIGSFACGVDAGVEAVLDNGAFPHLFRILSNPHDKVVDTGA 368
                            A V+GSFACGVD GV AVLD GAFPHL R LS   +KVVD  A
Sbjct: 58  ATANSDSDFGSNLIVQSAAVLGSFACGVDQGVRAVLDAGAFPHLIRFLSAADEKVVDAAA 117

Query: 367 RSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGASIITHSCKKSAEQKALCG 188
           RSLRMI+QS+LAPK+DF K +NM+ LLSLL S+NEN+TGLGA +I HSC+ S EQ  LC 
Sbjct: 118 RSLRMIYQSKLAPKFDFYKEQNMDFLLSLLRSENENLTGLGAGVIIHSCETSDEQNILCQ 177

Query: 187 AGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDLDNGRALSSLTELMEDKSP 8
           AG L++LI  L+GS+ QR+ASL++LA + +NN E +S F +L NGRAL S+ EL +D+  
Sbjct: 178 AGSLEKLISGLDGSINQRDASLESLATILKNNPEAVSKFAELQNGRALRSVIELTKDRYS 237

Query: 7   RT 2
           RT
Sbjct: 238 RT 239


>ref|XP_002513503.1| conserved hypothetical protein [Ricinus communis]
           gi|223547411|gb|EEF48906.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 675

 Score =  251 bits (641), Expect = 2e-64
 Identities = 141/260 (54%), Positives = 181/260 (69%), Gaps = 20/260 (7%)
 Frame = -3

Query: 721 MPASATA--NRPEDLIVRLRSGDGEVKLKALRELKNQIIGNKTKKLSYIKLGAVP----- 563
           MP ++TA  +RP DL+ RL S D +VKLKALRE+KNQIIGN+TKKL+++KLGA+P     
Sbjct: 1   MPTASTAVTHRPMDLLARLTSPDLDVKLKALREVKNQIIGNRTKKLAFLKLGAIPAVSSI 60

Query: 562 -------------DVVXXXXXXXXXXXXXXXXXXXXAVIGSFACGVDAGVEAVLDNGAFP 422
                        D                      A +GSFACG D+GV AV+D G+  
Sbjct: 61  LSAAIAESDSQLSDDEDNNNNKKNSYFNNNIIVQSAAALGSFACGFDSGVRAVIDAGSLL 120

Query: 421 HLFRILSNPHDKVVDTGARSLRMIFQSRLAPKYDFSKNENMECLLSLLNSDNENVTGLGA 242
            L ++LSNP +KVV+ GARSLRMI+QS+L PKY+F + + ME L+SLLNS++ENVTGLGA
Sbjct: 121 LLIQLLSNPDEKVVNAGARSLRMIYQSKLTPKYEFLQEKRMEFLISLLNSESENVTGLGA 180

Query: 241 SIITHSCKKSAEQKALCGAGVLQRLIILLNGSLYQREASLDALAAVFRNNSEVISIFIDL 62
           +IITHSC+ SAEQ ALC AGVL+RL+ LL GSL QR+ASL++LAAVFRNN +VIS  +  
Sbjct: 181 TIITHSCETSAEQIALCDAGVLKRLLSLLEGSLSQRDASLESLAAVFRNNPDVISESLRE 240

Query: 61  DNGRALSSLTELMEDKSPRT 2
           +NGRAL+S+  L  D+ PRT
Sbjct: 241 ENGRALTSIIGLTNDRYPRT 260