BLASTX nr result

ID: Catharanthus23_contig00016633 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00016633
         (1512 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006348213.1| PREDICTED: serine/arginine repetitive matrix...   349   2e-93
ref|XP_004233775.1| PREDICTED: uncharacterized protein LOC101251...   342   3e-91
ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Popu...   306   2e-80
gb|EOY03077.1| Uncharacterized protein isoform 2 [Theobroma cacao]    292   3e-76
ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261...   291   4e-76
gb|EOY03076.1| Uncharacterized protein isoform 1 [Theobroma cacao]    291   5e-76
ref|XP_002321932.2| hypothetical protein POPTR_0015s12740g [Popu...   287   7e-75
emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]   285   4e-74
ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus c...   280   9e-73
ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citr...   269   2e-69
ref|XP_006602044.1| PREDICTED: serine/arginine repetitive matrix...   264   7e-68
ref|XP_006591278.1| PREDICTED: serine/arginine repetitive matrix...   260   9e-67
ref|XP_003537379.1| PREDICTED: serine/arginine repetitive matrix...   260   1e-66
gb|ESW18934.1| hypothetical protein PHAVU_006G083400g [Phaseolus...   255   3e-65
ref|XP_004301811.1| PREDICTED: uncharacterized protein LOC101307...   240   1e-60
gb|ESW13982.1| hypothetical protein PHAVU_008G243000g [Phaseolus...   236   2e-59
gb|EXC20585.1| hypothetical protein L484_027140 [Morus notabilis]     236   3e-59
ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [...   228   4e-57
ref|XP_004155763.1| PREDICTED: uncharacterized protein LOC101224...   219   2e-54
ref|XP_004140353.1| PREDICTED: uncharacterized protein LOC101206...   219   2e-54

>ref|XP_006348213.1| PREDICTED: serine/arginine repetitive matrix protein 3-like, partial
            [Solanum tuberosum]
          Length = 652

 Score =  349 bits (896), Expect = 2e-93
 Identities = 218/430 (50%), Positives = 260/430 (60%), Gaps = 2/430 (0%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATVSS+ MDKS +A GG + IS AAVKRIQVKRNAG D             
Sbjct: 250  RPGKMVSVPATVSSMVMDKSIDA-GGTDNISAAAVKRIQVKRNAGGDGPRTAASPRARSP 308

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 388
                       ++N                 RSNSRK E SPYRRNPL EIDTN+V EQ+
Sbjct: 309  ARVNAKVLNERDNNAHSNQNQQQPMSLS---RSNSRKHEQSPYRRNPLSEIDTNVVLEQM 365

Query: 389  PLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNNV 568
            P  GLKV +  L+    S                N K KEQQ               +NV
Sbjct: 366  PAPGLKVPSQKLNAETVS----------------NGKVKEQQ---------------HNV 394

Query: 569  AVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQKNN- 745
            A++V+ SGPE  KP                INPEALSNP  SYTALLLEDIQNFHQK N 
Sbjct: 395  AMNVIVSGPESHKPQ---RSRSLRLSRDLDINPEALSNPPQSYTALLLEDIQNFHQKTNT 451

Query: 746  -TPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLYDNSSFGG 922
             TPAF+LP CV+KACSI++AVADL          AF DD+RR P ++QF++  DN+SF  
Sbjct: 452  TTPAFSLPPCVTKACSIVDAVADLNSTTSSNLSSAFSDDRRRNPTSEQFSQ-NDNASF-- 508

Query: 923  QLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNSIAGG 1102
               GKK L  K+P ++SEV    DLMEPS  KYVT RRGT    DMEEQESSGSNS+  G
Sbjct: 509  DPLGKKKLGIKDPFMESEVAVSGDLMEPSIQKYVTFRRGT----DMEEQESSGSNSVV-G 563

Query: 1103 GPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRRMS 1282
            G Q+W S SSWEPNSADS DCW  SS+S  R+D++SP+ FQR A+SE G+++ E +RR++
Sbjct: 564  GQQNWLSPSSWEPNSADSTDCW-PSSKSYSRDDNKSPLGFQRHAISEIGHDMEEGKRRVN 622

Query: 1283 VKKRDSDQQQ 1312
            VK+R+SD QQ
Sbjct: 623  VKRRESDNQQ 632


>ref|XP_004233775.1| PREDICTED: uncharacterized protein LOC101251847 [Solanum
            lycopersicum]
          Length = 690

 Score =  342 bits (876), Expect = 3e-91
 Identities = 213/430 (49%), Positives = 258/430 (60%), Gaps = 2/430 (0%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKM+SVPATVSS+ MDKS +A GG + IS AAVKRIQVKRNAG D             
Sbjct: 287  RPGKMISVPATVSSMVMDKSIDA-GGTDNISAAAVKRIQVKRNAGGDGPRTAASPRARSP 345

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 388
                       ++N                 RSNSRK E SPYRRNPL EID+N+V EQ+
Sbjct: 346  ARVNAKVLNERDNNTHSNQNQQQPMSLS---RSNSRKHEQSPYRRNPLSEIDSNVVLEQM 402

Query: 389  PLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNNV 568
            P  GLKV +  L+    S                N K KEQQ               +NV
Sbjct: 403  PAPGLKVPSQKLNAETVS----------------NGKVKEQQ--------------QHNV 432

Query: 569  AVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQKNN- 745
            A++V+ SGPE  KP                INPEALSNP  SYTALLLEDIQNFHQK N 
Sbjct: 433  AMNVIVSGPESHKPQ---RSRSLRLSRDLDINPEALSNPPQSYTALLLEDIQNFHQKTNT 489

Query: 746  -TPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLYDNSSFGG 922
             TPAF+LP CV+KACSI++AVADL          A  DD+RR   ++Q+++  DN+SF  
Sbjct: 490  TTPAFSLPPCVTKACSIVDAVADLNSTTSSNLSSALSDDRRRNATSEQYSQ-NDNASF-- 546

Query: 923  QLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNSIAGG 1102
               GKK L  K+P ++SEV   DDLMEPS  KYVT RRGT    DMEEQESSGSNS+  G
Sbjct: 547  DPLGKKKLGIKDPFMESEVTVSDDLMEPSIQKYVTFRRGT----DMEEQESSGSNSVV-G 601

Query: 1103 GPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRRMS 1282
            G Q+W S SSWEPNSADS DCW  SS+S  R+D++SP+ FQR A+SE  +++ E +RR++
Sbjct: 602  GQQNWLSPSSWEPNSADSTDCW-PSSKSYSRDDNKSPLGFQRHAISEISHDMEEGKRRVN 660

Query: 1283 VKKRDSDQQQ 1312
            VK+R+SD QQ
Sbjct: 661  VKRRESDNQQ 670


>ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Populus trichocarpa]
            gi|550327002|gb|EEE97021.2| hypothetical protein
            POPTR_0012s12820g [Populus trichocarpa]
          Length = 754

 Score =  306 bits (783), Expect = 2e-80
 Identities = 201/447 (44%), Positives = 242/447 (54%), Gaps = 29/447 (6%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATVSSL MDKSNN    P+  +G   KRI VKRN G               
Sbjct: 305  RPGKMVSVPATVSSLVMDKSNNIGVEPQATAGT--KRISVKRNVGEAAVAGSRTAASPRS 362

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 388
                       N+N                 RSNSRKA+ SPYRRNPL EID N +    
Sbjct: 363  QSPARANAKTSNENNQQPCLS----------RSNSRKADQSPYRRNPLSEIDPNSLQHSQ 412

Query: 389  PLSGLKVH--NNNLSQA----------------PASDSRISKGILDKN------IISINC 496
            P SG K    +NN SQ                 P + + + K   +KN      + +  C
Sbjct: 413  P-SGNKATCTSNNRSQIRNKDIEGQAVAKETFNPLNQTPMKKQNSEKNNRVNVQVANYRC 471

Query: 497  KEKEQ-QNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEA 673
                  +N +++E+++ +   +  V  +VV  G E LKP  +T            +NPE 
Sbjct: 472  SSMASLENKLSKEQQMEEAKGHPPVTTNVVDLGGESLKPQALTRSRSARRSRDLDLNPET 531

Query: 674  LSNPAPSYTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFP 853
            L NP PSYTALLLEDIQNFHQKN  P+F+LPACV+KACSILEAVADL          AF 
Sbjct: 532  LLNPTPSYTALLLEDIQNFHQKNTPPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFS 591

Query: 854  DDKRRTPIADQFNKLYDNSSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVR 1033
            DD+   P     N           L GKK  E K+P ++SE++  DDLMEPSFHKYVTVR
Sbjct: 592  DDRISPPAVAAVN-----------LVGKKLPEAKDPFVESEIIASDDLMEPSFHKYVTVR 640

Query: 1034 R--GTLSGEDMEEQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-ED 1201
            R  GTL GEDM+ QESSGSNS  GG  QH   S+SSWEPNSADS D W  SSRSN R ED
Sbjct: 641  RGGGTLCGEDMDGQESSGSNSFVGGSQQHLGLSTSSWEPNSADSTDRW--SSRSNTRDED 698

Query: 1202 SRSPVPFQRLALSEPGYEVGEARRRMS 1282
             +SP+ +Q+  L E G +V +ARR  S
Sbjct: 699  DKSPLGYQKHGLPETGRDVEQARRAFS 725


>gb|EOY03077.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 718

 Score =  292 bits (747), Expect = 3e-76
 Identities = 202/457 (44%), Positives = 241/457 (52%), Gaps = 30/457 (6%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPE--TISGAAVKRIQVKRNAGSDXXXXXXXXXXX 202
            RPGKMVSVPATVSSL MDKS N   G E  T +  A+KRI VKRN G             
Sbjct: 267  RPGKMVSVPATVSSLVMDKSTNGAAGVEAPTTTANAIKRISVKRNVGE------AAVGSR 320

Query: 203  XXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSE 382
                         N N                 RS+SRKAEHSPYRRNPL EID N ++ 
Sbjct: 321  GTASPRSQSPARTNPNANNPKGCNENQLQPTLSRSSSRKAEHSPYRRNPLSEIDPNSLAY 380

Query: 383  QIPLS------------GLKVHNNNLSQ-------APASDSRISKGILDKNIISINCKEK 505
                +            GLK + N L+           ++   S G  D  ++++N   K
Sbjct: 381  PQSAANKTSTCINKGQGGLKEYTNKLNVEMNNKVVVQGANKAGSIGTADNKVVNVNSTAK 440

Query: 506  EQQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNP 685
            EQ+             M   V  +    G E  KP  +T            +NPE L NP
Sbjct: 441  EQR-------------MVEEVKTEPPMPGAENPKPQTLTRSRSSRRSRDLDLNPETLLNP 487

Query: 686  APS-YTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDK 862
             PS YT LLLEDIQNFHQ NN P+F+LP+CVSKACSILEAVADL          AF +D+
Sbjct: 488  IPSSYTTLLLEDIQNFHQTNNPPSFSLPSCVSKACSILEAVADLNSTTSSNLSCAFSEDR 547

Query: 863  RRTPIADQFNKLYDNSSFGGQLA--GKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRR 1036
            +            D SS  G  A  G+K  ET++P ++SEVVG DDLMEPSFHKYVTVRR
Sbjct: 548  KGLST--------DESSKNGYNATVGRKMAETRDPFVESEVVGRDDLMEPSFHKYVTVRR 599

Query: 1037 G-TLSGEDMEEQESSGSNSIAGGG-PQHWA-SSSSWEPNSADSIDCWTSSSRSNCREDSR 1207
            G TL G DMEEQESSGSNS  G G  QHW  S SSWEPNSADS D WTS ++S   ED  
Sbjct: 600  GATLGGTDMEEQESSGSNSFVGSGQQQHWGFSPSSWEPNSADSTDRWTSRTKSR-EEDHS 658

Query: 1208 SPVPFQRLALSEP--GYEV-GEARRRMSVKKRDSDQQ 1309
            S +  QR AL+EP  G ++    R+ +S ++RD D Q
Sbjct: 659  SSLEPQRQALAEPQSGSDIKNSTRKGLSGRRRDVDLQ 695


>ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261489 [Vitis vinifera]
          Length = 710

 Score =  291 bits (746), Expect = 4e-76
 Identities = 196/433 (45%), Positives = 236/433 (54%), Gaps = 4/433 (0%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATV    +DK NN   G E+ +  AV+R+ VKRN+G               
Sbjct: 285  RPGKMVSVPATV----IDKGNNGSSGVESGNNGAVRRVLVKRNSGEVAASGSKTPRSRSP 340

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 388
                       N +                 R++SRKAE SPYRRNPL EID NI +   
Sbjct: 341  ANARVVSNDSQNQHPSLS-------------RNSSRKAEQSPYRRNPLSEIDPNINNRG- 386

Query: 389  PLSGLKVHNNNLSQAPASDSRISKGILD-KNIISINCKEKEQQNSITEEEKILQQAMNNN 565
             L   ++  +   +    D    K ++   N  S +  +  Q      E K LQ   N+ 
Sbjct: 387  -LKAREIEPDCQQKPNMKDMNNGKVVVHGTNNRSSSRGKVFQVVEEAGEPKGLQPRTNSI 445

Query: 566  VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQKN- 742
                VV SG E LKP  +T            +NPE L NP PSYT LLLEDIQNFHQKN 
Sbjct: 446  ETTIVVASGAESLKPQALTRTRSSRRSRDLDLNPETLLNPTPSYTTLLLEDIQNFHQKNT 505

Query: 743  NTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLYDNSSFGG 922
             TP+ +LPACVSKA SILEAVADL          AF DD+R       F + + NS    
Sbjct: 506  TTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDRR------NFTETHQNSMDDK 559

Query: 923  QLAGKKGLETKEP-CLQSEVVGGDDLMEPSFHKYVTVRRGTL-SGEDMEEQESSGSNSIA 1096
              AGKK LE K+P  ++SE+V  +DLMEPS HKYVTV+RGT+  G +MEEQESSGSNS  
Sbjct: 560  NPAGKKRLEAKDPFVVESEIVVCNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNSFV 619

Query: 1097 GGGPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRR 1276
            G    H     SWEPNSADS DCWT  SRSN RE+  SPV FQR ALSEPG E  E ++R
Sbjct: 620  GVSQLH-----SWEPNSADSTDCWT--SRSNTREEYPSPVCFQRHALSEPGRESEETQKR 672

Query: 1277 MSVKKRDSDQQQN 1315
            M  +K++ D QQN
Sbjct: 673  MGRRKKEIDHQQN 685


>gb|EOY03076.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 785

 Score =  291 bits (745), Expect = 5e-76
 Identities = 202/461 (43%), Positives = 242/461 (52%), Gaps = 34/461 (7%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPE--TISGAAVKRIQVKRNAGSDXXXXXXXXXXX 202
            RPGKMVSVPATVSSL MDKS N   G E  T +  A+KRI VKRN G             
Sbjct: 330  RPGKMVSVPATVSSLVMDKSTNGAAGVEAPTTTANAIKRISVKRNVGE------AAVGSR 383

Query: 203  XXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSE 382
                         N N                 RS+SRKAEHSPYRRNPL EID N ++ 
Sbjct: 384  GTASPRSQSPARTNPNANNPKGCNENQLQPTLSRSSSRKAEHSPYRRNPLSEIDPNSLAY 443

Query: 383  QIPLS------------GLKVHNNNLSQ-----------APASDSRISKGILDKNIISIN 493
                +            GLK + N ++Q              ++   S G  D  ++++N
Sbjct: 444  PQSAANKTSTCINKGQGGLKEYTNVINQKLNVEMNNKVVVQGANKAGSIGTADNKVVNVN 503

Query: 494  CKEKEQQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEA 673
               KEQ+             M   V  +    G E  KP  +T            +NPE 
Sbjct: 504  STAKEQR-------------MVEEVKTEPPMPGAENPKPQTLTRSRSSRRSRDLDLNPET 550

Query: 674  LSNPAPS-YTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAF 850
            L NP PS YT LLLEDIQNFHQ NN P+F+LP+CVSKACSILEAVADL          AF
Sbjct: 551  LLNPIPSSYTTLLLEDIQNFHQTNNPPSFSLPSCVSKACSILEAVADLNSTTSSNLSCAF 610

Query: 851  PDDKRRTPIADQFNKLYDNSSFGGQLA--GKKGLETKEPCLQSEVVGGDDLMEPSFHKYV 1024
             +D++            D SS  G  A  G+K  ET++P ++SEVVG DDLMEPSFHKYV
Sbjct: 611  SEDRKGLST--------DESSKNGYNATVGRKMAETRDPFVESEVVGRDDLMEPSFHKYV 662

Query: 1025 TVRRG-TLSGEDMEEQESSGSNSIAGGG-PQHWA-SSSSWEPNSADSIDCWTSSSRSNCR 1195
            TVRRG TL G DMEEQESSGSNS  G G  QHW  S SSWEPNSADS D WTS ++S   
Sbjct: 663  TVRRGATLGGTDMEEQESSGSNSFVGSGQQQHWGFSPSSWEPNSADSTDRWTSRTKSR-E 721

Query: 1196 EDSRSPVPFQRLALSEP--GYEV-GEARRRMSVKKRDSDQQ 1309
            ED  S +  QR AL+EP  G ++    R+ +S ++RD D Q
Sbjct: 722  EDHSSSLEPQRQALAEPQSGSDIKNSTRKGLSGRRRDVDLQ 762


>ref|XP_002321932.2| hypothetical protein POPTR_0015s12740g [Populus trichocarpa]
            gi|550322594|gb|EEF06059.2| hypothetical protein
            POPTR_0015s12740g [Populus trichocarpa]
          Length = 736

 Score =  287 bits (735), Expect = 7e-75
 Identities = 197/446 (44%), Positives = 236/446 (52%), Gaps = 28/446 (6%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGK+VSVPATVSSL +DKSNN   G E  + A ++RI VKRN G               
Sbjct: 290  RPGKLVSVPATVSSLVVDKSNN---GVEPQATAGIRRISVKRNVGEAALTCSRMVASPSS 346

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVS-EQ 385
                       N+N                 RSNSRKA+ SPYRRNPL EID N +   Q
Sbjct: 347  KSPARTNAKTSNENNQQPSLS----------RSNSRKADQSPYRRNPLSEIDLNSLQYSQ 396

Query: 386  IPLSGLKVHNNN---------------------LSQAPASDSRISKGILDKNIISINCKE 502
             P +     +NN                     L+Q P       K     N    NC+ 
Sbjct: 397  PPANKATCTSNNRARIRNKDIEGQVVVKESFNLLNQTPMKKQNSEKNNR-VNAQVTNCRG 455

Query: 503  KE---QQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEA 673
                  +N I++E+++ +          VV  G E LKP  +T            +NPE 
Sbjct: 456  SSIVSLENKISKEQQMEEAKGQPTDMTTVVDLGVESLKPQTLTRSRSARRSRDLDLNPET 515

Query: 674  LSNPAPSYTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFP 853
            L NP PSYTALLLEDIQNFH K NTP+F+LPACV+KACSILEAVADL          AF 
Sbjct: 516  LLNPTPSYTALLLEDIQNFHLK-NTPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFS 574

Query: 854  DDKRRTPIADQFNKLYDNSSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVR 1033
             D+R  P     N           L GKK  E K+P ++SEV+  DDL+EPSFHKYVTVR
Sbjct: 575  YDRRSPPTVAAAN-----------LVGKKPPEAKDPFVESEVLASDDLIEPSFHKYVTVR 623

Query: 1034 R-GTLSGEDMEEQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-EDS 1204
            R GTL GEDM+ QESSG +S+ GG  QH   S+SSWEPNSADSID WT  SRSN R ED 
Sbjct: 624  RAGTLCGEDMDGQESSGRDSVVGGSQQHLGFSTSSWEPNSADSIDHWT--SRSNWRDEDE 681

Query: 1205 RSPVPFQRLALSEPGYEVGEARRRMS 1282
            +SP+ FQ+  LSE   +V +ARR  S
Sbjct: 682  KSPLGFQKHELSETWRDVEQARRPFS 707


>emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]
          Length = 685

 Score =  285 bits (729), Expect = 4e-74
 Identities = 195/431 (45%), Positives = 234/431 (54%), Gaps = 4/431 (0%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATV    +DK NN   G E+ +  AV+R+ VKRN+G               
Sbjct: 285  RPGKMVSVPATV----IDKGNNGSSGVESGNNGAVRRVLVKRNSGEVAASGSKTPRSRSP 340

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 388
                       N +                 R++SRKAE SPYRRNPL EID NI +   
Sbjct: 341  ANARVVSNXNQNQHPSLS-------------RNSSRKAEQSPYRRNPLSEIDPNINNRG- 386

Query: 389  PLSGLKVHNNNLSQAPASDSRISKGILD-KNIISINCKEKEQQNSITEEEKILQQAMNNN 565
             L   ++  +   +    D    K ++   N  S +  +  Q      E K LQ   N+ 
Sbjct: 387  -LKAREIEPDCQQKPNMKDMNNGKVVVHGSNNRSSSRGKVFQVVEEAGEPKGLQPRTNSI 445

Query: 566  VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQKN- 742
                VV SG E LKP  +T            +NPE L N  PSYT LLLEDIQNFHQKN 
Sbjct: 446  ETTIVVASGAESLKPQALTRTRSSRRSRDLDLNPETLLNLTPSYTTLLLEDIQNFHQKNT 505

Query: 743  NTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLYDNSSFGG 922
             TP+ +LPACVSKA SILEAVADL          AF DD+R       F + + NS    
Sbjct: 506  TTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDRR------NFTETHQNSMDDK 559

Query: 923  QLAGKKGLETKEP-CLQSEVVGGDDLMEPSFHKYVTVRRGTL-SGEDMEEQESSGSNSIA 1096
              AGKK LE K+P  ++SE+V  +DLMEPS HKYVTV+RGT+  G +MEEQESSGSNS  
Sbjct: 560  NPAGKKRLEAKDPFVVESEIVVCNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNSFV 619

Query: 1097 GGGPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRR 1276
            G    H     SWEPNSADS DCWT  SRSN RE+  SPV FQR ALSEPG E  E ++R
Sbjct: 620  GVSQLH-----SWEPNSADSTDCWT--SRSNTREEYPSPVCFQRHALSEPGRESEETQKR 672

Query: 1277 MSVKKRDSDQQ 1309
            M  +KR+ D Q
Sbjct: 673  MGRRKREIDHQ 683


>ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus communis]
            gi|223529895|gb|EEF31825.1| hypothetical protein
            RCOM_0303940 [Ricinus communis]
          Length = 725

 Score =  280 bits (717), Expect = 9e-73
 Identities = 199/469 (42%), Positives = 243/469 (51%), Gaps = 17/469 (3%)
 Frame = +2

Query: 29   RPGK-MVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXX 205
            RPGK MVSVPATVSSL MDKSN    G E  +   VKRI VKRN G              
Sbjct: 289  RPGKKMVSVPATVSSLTMDKSNI---GVEPQAANGVKRISVKRNVGGGEAGSRSAASPRS 345

Query: 206  XXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTN-IVSE 382
                          N                 RS+SRKAE SPYRRNPL EIDTN +V  
Sbjct: 346  QSPA--------RTNAKGGGSNENNQQQPSLSRSSSRKAEQSPYRRNPLSEIDTNSLVYA 397

Query: 383  QIPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNN 562
            Q   +    +NN+ S+A   +  +   ++ K  +++  + +  + +     KI  Q  N 
Sbjct: 398  QATGNNTTANNNSNSRAQTRNKELEGKLMVKESVNVLNQAQMHKPNAEANSKINAQGSNK 457

Query: 563  NVAVDVV---GSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFH 733
             V    V    SG + LKP  V              NPE   NP PSYTALLLEDIQNFH
Sbjct: 458  GVKEQTVTAEASGAD-LKPQTVARSRSARRSRDLDFNPETSLNPNPSYTALLLEDIQNFH 516

Query: 734  QKN-----NTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKL 898
            QK+     NTP+F++PACV+KACSI+EAVADL          AF D+KR           
Sbjct: 517  QKSTNTNTNTPSFSVPACVTKACSIVEAVADLNSTTSSNLSCAFSDEKRSP--------- 567

Query: 899  YDNSSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRR-----GTLSGEDME 1063
               ++    L GKK  E K+P ++SEV+  DDLMEPSFHKYVTVRR     GT S EDM+
Sbjct: 568  ---TTVVSNLVGKKLEEGKDPFVESEVLVNDDLMEPSFHKYVTVRRGGNGKGTSSVEDMD 624

Query: 1064 EQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-EDSRSPVPFQRLAL 1237
             QESSGSNS  G   QHW  S+SSWEPNSADS D WT  SRSN R E+ +SP+ FQ+   
Sbjct: 625  GQESSGSNSFVGSSQQHWGYSTSSWEPNSADSTDRWT--SRSNTRDEEEKSPLGFQKHTS 682

Query: 1238 SEPGYEVGEARRRMSVKKRDSDQQQNXXXXXXXXXXXVQSLPTAAAAAS 1384
            SE G ++ EARR        S Q+             + S P  AAA++
Sbjct: 683  SESGRDMEEARRGF------SGQRNGIGRGRVGSSKNLNSTPIVAAAST 725


>ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citrus clementina]
            gi|568855457|ref|XP_006481321.1| PREDICTED:
            serine/arginine repetitive matrix protein 2-like [Citrus
            sinensis] gi|557531784|gb|ESR42967.1| hypothetical
            protein CICLE_v10011149mg [Citrus clementina]
          Length = 740

 Score =  269 bits (688), Expect = 2e-69
 Identities = 209/488 (42%), Positives = 245/488 (50%), Gaps = 59/488 (12%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRN------------AGSDX 172
            RPGKMVSVPATV+      SN++           VKRI VKRN            A S  
Sbjct: 258  RPGKMVSVPATVAVEPATASNSS----------GVKRISVKRNVGEAAGAVGSRMAASPR 307

Query: 173  XXXXXXXXXXXXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNS-RKAEHSPYRRNP 349
                                   +                     NS RKAEHSPYRRNP
Sbjct: 308  SKSPARVNGNNVKEQQHPSLSRSSSRKGEQHSPYRRNPSSEIDHPNSTRKAEHSPYRRNP 367

Query: 350  LGEIDTNIVSEQIPLSGL----------KVHN-----------------NNLSQAP---A 439
            L EID N  S Q P S            +V N                 N L QAP    
Sbjct: 368  LSEIDPN--SLQYPQSACNNKASNVITNRVRNKSRDFEGEGVFVRDSSANVLYQAPIHKP 425

Query: 440  SDSRISKG-----------ILDKNIISINCKEKEQQNSITEEEKILQQAMNNNVAVDVVG 586
            +   I++G            L+  +   N  EKEQ+  I EE+K  Q  M  N AV    
Sbjct: 426  NAENIAQGTNNHKSSCRGTTLNNKVTGANITEKEQR-QILEEDK-AQLPMTANAAVVTES 483

Query: 587  SGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQKNNTPAFTLP 766
              P+ L     T            +NPE L NP PSYTALLLEDIQNFHQK +TP+ +LP
Sbjct: 484  QKPQTLTR---TRSSRRSRDLDLDLNPETLLNPTPSYTALLLEDIQNFHQK-STPSVSLP 539

Query: 767  ACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQF-NKLYDNSSFGGQLAGKKG 943
            ACV+KACSILEAVADL          AF +D R+ P ADQ  NK   N S G  L GKK 
Sbjct: 540  ACVTKACSILEAVADLNSTTSSNLSCAFSED-RKPPSADQSNNKNAYNFSAGVNLVGKKM 598

Query: 944  LETKEPCLQSEVVGGDDLMEPSFHKYVTVRRG--TLSGEDMEEQESSGSNSIAG-GGPQH 1114
             E K+P ++SEV+  DDLMEPSFH+YVTVRRG   L G DM+ QESSGSNS  G    Q+
Sbjct: 599  TEAKDPFVESEVLADDDLMEPSFHRYVTVRRGGSELGGVDMDGQESSGSNSFVGCTTQQN 658

Query: 1115 WASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE-PGYEVGEARRRMSVKK 1291
            W SSSSWEPNSADS D WT  SRSN +E+ +SP+ FQR A+SE  G E  + R+  S K+
Sbjct: 659  WTSSSSWEPNSADSTDRWT--SRSNMKEEDQSPLGFQRQAMSEAAGCEATKNRKGFSGKR 716

Query: 1292 RDSDQQQN 1315
            RD+D QQN
Sbjct: 717  RDTDYQQN 724


>ref|XP_006602044.1| PREDICTED: serine/arginine repetitive matrix protein 2-like [Glycine
            max]
          Length = 725

 Score =  264 bits (675), Expect = 7e-68
 Identities = 185/444 (41%), Positives = 246/444 (55%), Gaps = 15/444 (3%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNN---AEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXX 199
            RPGKMVSVPATVSSL MDKSNN     GG E+ +   +KRI VKRN G+           
Sbjct: 279  RPGKMVSVPATVSSLVMDKSNNNGGGGGGGESGATTGIKRITVKRNVGA---ASPRSQSP 335

Query: 200  XXXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVS 379
                          N+N                 RSNSRKAE SPY+RNPL EI+ N ++
Sbjct: 336  ARANGNAASGNKAFNENQQQPSLS----------RSNSRKAEQSPYKRNPLSEIEPNSLA 385

Query: 380  ---EQIPLSGLKVHNNNLSQAPASDSRISKG---ILDKNIISINCKEKEQQNSITEEEKI 541
                    S  KV N    +     ++ + G    LDK + ++NCK K QQ    EE+  
Sbjct: 386  FPHSTANNSSSKVQNRPKKEFETEANQKTNGSRTALDKGM-NVNCKTKVQQ----EEDVK 440

Query: 542  LQQAMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLE 715
            +Q ++ +NV V  +V  G + LKP + +T            +NPEAL NP  SY +LLLE
Sbjct: 441  VQSSITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRQSRDLDLNPEALLNPPQSYASLLLE 500

Query: 716  DIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNK 895
            DIQNFHQK NTP  +LPACV+KACSILEAVADL          A   + RR+P+A Q ++
Sbjct: 501  DIQNFHQK-NTPPVSLPACVTKACSILEAVADLNSNAGLNFCGA---EDRRSPLAFQCSR 556

Query: 896  LYDNSSFGGQLAGKKGLETKEPCLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDMEEQ 1069
               N S      GK+  + ++P ++S ++   DD+ME S HKYVTV R G L G DM++Q
Sbjct: 557  NDYNVSLTTHDYGKREPDAEDPVVESMLLFNDDDVMEQSLHKYVTVNRGGLLGGVDMDDQ 616

Query: 1070 ESSGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE 1243
            ESSGSNS     G Q W  SSSSWEP+S +S DCWT  SRSN  ++    +  +    SE
Sbjct: 617  ESSGSNSFTVSSGQQRWGVSSSSWEPSSVESKDCWT--SRSNYSKEEGQKLGLEGRVASE 674

Query: 1244 PGYEVGEARRRMSVKKRDSDQQQN 1315
             G + GEA+++++ ++R+ D  Q+
Sbjct: 675  AGLDAGEAKKKLNSQRRECDHHQH 698


>ref|XP_006591278.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform
            X2 [Glycine max]
          Length = 733

 Score =  260 bits (665), Expect = 9e-67
 Identities = 189/472 (40%), Positives = 250/472 (52%), Gaps = 20/472 (4%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATVSSL MDKSNN  G  E+ +   +KRI VKRN G+              
Sbjct: 287  RPGKMVSVPATVSSLVMDKSNN-NGSGESGATTGIKRIAVKRNVGA-------------A 332

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEID-------- 364
                        +                  RSNSRKAE SPY+RNPL EI+        
Sbjct: 333  SPRSQSPARANGNGANGNKAFSENQQQPSLSRSNSRKAEQSPYKRNPLSEIEPNSLAFPH 392

Query: 365  --TNIVSEQIPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEK 538
              TN  S ++     K      +Q   + +R +    DK + +INCK K QQ    EE+ 
Sbjct: 393  STTNNSSSRVQNRPKKEFETEANQQKTNGNRTAS---DKGV-TINCKTKVQQ----EEDV 444

Query: 539  ILQQAMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXXINPEALSNPAP-SYTALL 709
             +Q ++ +NV V  +V  G + LKP + +T            IN EAL NP P SY +LL
Sbjct: 445  KVQSSITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRRSQELDINCEALLNPPPQSYASLL 504

Query: 710  LEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQF 889
            LEDIQNFHQKN TP  +LPACV+KACSILEAVADL              + RR+P+A Q 
Sbjct: 505  LEDIQNFHQKN-TPPVSLPACVTKACSILEAVADLNSNAGLNFCSG---EDRRSPLAFQC 560

Query: 890  NKLYDNSSFGGQLAGKKGLETKEPCLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDME 1063
            ++   N        GK+  + ++P ++S +V   DD+MEP+ HKYVTV R G+L G DM+
Sbjct: 561  SRNDYNVPLTTNDYGKREPDAEDPVVESMLVFNDDDVMEPNLHKYVTVNRGGSLGGADMD 620

Query: 1064 EQESSGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLAL 1237
            +QESSGSNS     G QHW  SSSSWEP+S +S DCWTS S  +  E  RSP+  +    
Sbjct: 621  DQESSGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEECQRSPLGLEGTVA 680

Query: 1238 SE-PGYEVGEARRRMSVKKRDSDQQ--QNXXXXXXXXXXXVQSLPTAAAAAS 1384
            SE  G + G A+++++ ++R+ D Q               + ++P   AAAS
Sbjct: 681  SEVAGRDAGGAKKKLNSQRRECDHQHGSGIGRGRLGANKVLHNIPVVTAAAS 732


>ref|XP_003537379.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform
            X1 [Glycine max]
          Length = 732

 Score =  260 bits (664), Expect = 1e-66
 Identities = 188/468 (40%), Positives = 250/468 (53%), Gaps = 16/468 (3%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATVSSL MDKSNN  G  E+ +   +KRI VKRN G+              
Sbjct: 287  RPGKMVSVPATVSSLVMDKSNN-NGSGESGATTGIKRIAVKRNVGA-------------A 332

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVS--- 379
                        +                  RSNSRKAE SPY+RNPL EI+ N ++   
Sbjct: 333  SPRSQSPARANGNGANGNKAFSENQQQPSLSRSNSRKAEQSPYKRNPLSEIEPNSLAFPH 392

Query: 380  EQIPLSGLKVHNNNLSQAPASDSRISKG---ILDKNIISINCKEKEQQNSITEEEKILQQ 550
                 S  +V N    +     ++ + G     DK + +INCK K QQ    EE+  +Q 
Sbjct: 393  STTNNSSSRVQNRPKKEFETEANQKTNGNRTASDKGV-TINCKTKVQQ----EEDVKVQS 447

Query: 551  AMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXXINPEALSNPAP-SYTALLLEDI 721
            ++ +NV V  +V  G + LKP + +T            IN EAL NP P SY +LLLEDI
Sbjct: 448  SITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRRSQELDINCEALLNPPPQSYASLLLEDI 507

Query: 722  QNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLY 901
            QNFHQKN TP  +LPACV+KACSILEAVADL              + RR+P+A Q ++  
Sbjct: 508  QNFHQKN-TPPVSLPACVTKACSILEAVADLNSNAGLNFCSG---EDRRSPLAFQCSRND 563

Query: 902  DNSSFGGQLAGKKGLETKEPCLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDMEEQES 1075
             N        GK+  + ++P ++S +V   DD+MEP+ HKYVTV R G+L G DM++QES
Sbjct: 564  YNVPLTTNDYGKREPDAEDPVVESMLVFNDDDVMEPNLHKYVTVNRGGSLGGADMDDQES 623

Query: 1076 SGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE-P 1246
            SGSNS     G QHW  SSSSWEP+S +S DCWTS S  +  E  RSP+  +    SE  
Sbjct: 624  SGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEECQRSPLGLEGTVASEVA 683

Query: 1247 GYEVGEARRRMSVKKRDSDQQ--QNXXXXXXXXXXXVQSLPTAAAAAS 1384
            G + G A+++++ ++R+ D Q               + ++P   AAAS
Sbjct: 684  GRDAGGAKKKLNSQRRECDHQHGSGIGRGRLGANKVLHNIPVVTAAAS 731


>gb|ESW18934.1| hypothetical protein PHAVU_006G083400g [Phaseolus vulgaris]
          Length = 718

 Score =  255 bits (652), Expect = 3e-65
 Identities = 174/440 (39%), Positives = 236/440 (53%), Gaps = 13/440 (2%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATVSSL MDKSNN  GG E+ +   +KRI VKRN G+              
Sbjct: 275  RPGKMVSVPATVSSLVMDKSNNNGGGGESAATTGIKRITVKRNVGA---ASPRSQSPARA 331

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 388
                       N+N                 RS+SRKAE SPY+RNPL EI+ N ++   
Sbjct: 332  NGNAANANKAFNENQPPPSLS----------RSSSRKAEQSPYKRNPLSEIEPNSLA--F 379

Query: 389  PLSGLKVHNNNLSQAPASD--------SRISKGILDKNI-ISINCKEKEQQNSITEEEKI 541
            P S    +++ +   P  +        +  S+  LDK + ++ N K + + +      K+
Sbjct: 380  PHSTANNNSSRVQNRPKKEFETEAIQRTNSSRTALDKGMTVTYNTKVQPEGDI-----KV 434

Query: 542  LQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDI 721
                 +N V   +V  G + LKPH +T            +NPEAL NP  SY +LLLEDI
Sbjct: 435  QSLITDNAVVKTMVPPGLDNLKPHKLTRSRSSRRSQDLDLNPEALLNPPQSYASLLLEDI 494

Query: 722  QNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLY 901
            QNFHQK+ TP  +LPACV+KACSILEAVA+L          A   + RR+P   Q ++  
Sbjct: 495  QNFHQKS-TPPVSLPACVTKACSILEAVAELNSNTNLNFGGA---EDRRSPPTFQCSRND 550

Query: 902  DNSSFGGQLAGKKGLETKEPCLQSEVV-GGDDLMEPSFHKYVTVRRG-TLSGEDMEEQES 1075
             N        GK+  + ++P ++S +V   DD++E S HKYVTV RG ++ G DME+QES
Sbjct: 551  YNVPLTANDYGKREPDAEDPVVESMLVFNDDDVLESSLHKYVTVNRGGSVGGVDMEDQES 610

Query: 1076 SGSNSIA-GGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPG 1249
            SGSNS   G G Q W  SSSSWEP+S +S DCWTS    +  E  +SP+  +    SE G
Sbjct: 611  SGSNSFTVGNGQQQWGISSSSWEPSSVESRDCWTSRLNYSREEGQKSPLGLEGSVSSETG 670

Query: 1250 YEVGEARRRMSVKKRDSDQQ 1309
             +V  AR++++   R+ D Q
Sbjct: 671  CDVDGARKKLNSNGRECDHQ 690


>ref|XP_004301811.1| PREDICTED: uncharacterized protein LOC101307599 [Fragaria vesca
            subsp. vesca]
          Length = 683

 Score =  240 bits (613), Expect = 1e-60
 Identities = 188/465 (40%), Positives = 232/465 (49%), Gaps = 13/465 (2%)
 Frame = +2

Query: 29   RPGKM--VSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXX 202
            RPGKM  VSVPATV    MDK++N E      +  ++KRI VKRNAG D           
Sbjct: 275  RPGKMKMVSVPATV----MDKNSNGESA----TTGSIKRISVKRNAG-DAVNVTVGSRTA 325

Query: 203  XXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSE 382
                         N                   RS+SRKAE SPYRRNPL E+D N    
Sbjct: 326  ASPRSQSPARGGANAKASNDSLQPSLS------RSSSRKAEQSPYRRNPLSELDPN---- 375

Query: 383  QIPLSGLKVHNNNLSQAPASDSRISKGILD--KNIISINCKEKEQQNSITEEEKILQQAM 556
               L+  + H NN      ++++ S  +L+  K  + I C +   Q  I         AM
Sbjct: 376  --SLAYPQAHINN------TNNKSSCNVLNQLKPNVEITCNKIITQG-INYRSSTASSAM 426

Query: 557  NNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQ 736
            +N V      SG + LK   +T            INP+ LSNP PSYT LLLEDIQNFHQ
Sbjct: 427  DNKVVEPAGASGVDCLKHQTLTRSRSSRRSRDLDINPQTLSNPPPSYTRLLLEDIQNFHQ 486

Query: 737  K-NNTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNK--LYDN 907
            + +N    +LP CV+KACSILEAVADL           F  D R++P  DQ NK   Y N
Sbjct: 487  QSSNAAVVSLPQCVTKACSILEAVADLNSTTN------FSAD-RKSPSIDQINKSSCYYN 539

Query: 908  SSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSN 1087
             S       +K +    P ++SEV+ GDDL+ PSFHKYVTVRRG   G DME+QESSGSN
Sbjct: 540  CSLDANPVPRKDI----PFVESEVLVGDDLVAPSFHKYVTVRRG---GTDMEDQESSGSN 592

Query: 1088 SIAGGGPQ-HWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGE 1264
            S   G  Q  W  SSSWEPNSADS DCWT  SRS+ RED ++             +++ E
Sbjct: 593  SFVSGSQQPQWGLSSSWEPNSADSTDCWT--SRSSTREDDQN-------------FDMDE 637

Query: 1265 -ARRRMSVKKRDSDQQQNXXXXXXXXXXXVQS----LPTAAAAAS 1384
             ARRR+S +K D    Q+                  +P  AAAAS
Sbjct: 638  AARRRLSRRKTDGQNTQSSCGIGRGKLAAASKGLPIMPVVAAAAS 682


>gb|ESW13982.1| hypothetical protein PHAVU_008G243000g [Phaseolus vulgaris]
          Length = 652

 Score =  236 bits (602), Expect = 2e-59
 Identities = 169/403 (41%), Positives = 210/403 (52%), Gaps = 10/403 (2%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVP TVSSLAMDKSNN  G   T      KRI VKRN G               
Sbjct: 247  RPGKMVSVPPTVSSLAMDKSNNCGGESGT------KRITVKRNVGD------VGSRGAAS 294

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNS-RKAEHSPYRRNPLGEIDTNIVSEQ 385
                       N                   R+NS RKAE SPYRRNPL E+D N   +Q
Sbjct: 295  PRTQSPARVNGNVASARVLSENQQHQQPSLSRNNSSRKAEQSPYRRNPLSEVDNNSKVQQ 354

Query: 386  IPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNN 565
                     N   ++A A      +  L+K + ++NCK KE    ++ +  +        
Sbjct: 355  ---------NKPKTEAEAMQKPNGRVALEKGV-TVNCKTKEHHEDVSLDSAV-------- 396

Query: 566  VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSN--PAPSYTALLLEDIQNFHQK 739
            V   V  SG + LKP G+T            INPE++ N  P  SY +LLLEDIQNFHQK
Sbjct: 397  VKTTVASSGVDNLKPQGLTRSRSSRRSRDLDINPESVVNVNPTHSYASLLLEDIQNFHQK 456

Query: 740  NNT--PAFT-LPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKLYDNS 910
            N    P+ T LPAC++KACSI+EAV DL          AF +D R++P   Q        
Sbjct: 457  NTPQQPSSTSLPACLTKACSIIEAVGDLSYTTSSNFSGAFSED-RKSPSTQQ-------- 507

Query: 911  SFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNS 1090
            SF     GKK   +K+P ++SEV  GDD+MEPS HKYVTV+RG+ +  DM++QESSGSNS
Sbjct: 508  SFRNGYYGKKVQGSKDPFVESEVDVGDDVMEPSLHKYVTVKRGS-AVVDMDDQESSGSNS 566

Query: 1091 --IAGGGPQHWA--SSSSWEPNSADSIDCWTSSSRSNCREDSR 1207
              ++  G  HW   S SSWEPNSADS D WT  SR + RE+ +
Sbjct: 567  FTVSSSGQHHWGAISCSSWEPNSADSTDSWT--SRLSSREEGQ 607


>gb|EXC20585.1| hypothetical protein L484_027140 [Morus notabilis]
          Length = 676

 Score =  236 bits (601), Expect = 3e-59
 Identities = 184/416 (44%), Positives = 206/416 (49%), Gaps = 25/416 (6%)
 Frame = +2

Query: 29   RPGKMVSVPATVSS-LAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXX 205
            RPGKMVSVPATVSS L MDKSNN +      S   +KRI VKRN G              
Sbjct: 252  RPGKMVSVPATVSSSLVMDKSNNMDSAANANS---IKRISVKRNVGEAGSRGAASPRSQS 308

Query: 206  XXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIVSEQ 385
                        N+                  R++SRKAE SPYRRNPL EID N +S  
Sbjct: 309  PARGGNGNAKSSNE----------PQAQPSLSRNSSRKAEQSPYRRNPLSEIDPNSLSYP 358

Query: 386  IPLSGLKVHNNNLSQA------------PASDSRISKGILDKNIISINCKEKEQQNSITE 529
             P      HNNN +              P  D  I    L       N +   + N    
Sbjct: 359  NP------HNNNGNNGRAQSKSKRETCVPEEDENILVKELPTQAQKPNAETNYRSNGRVS 412

Query: 530  EEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEAL--SNPAPSYTA 703
             E    Q     V   VV SG +      +T            INPE L   NP PSYT 
Sbjct: 413  AENKNSQPKQAMVETTVVISGADNKPQQTLTRSRSSRRSRDLDINPETLLNPNPTPSYTR 472

Query: 704  LLLEDIQNFHQKNN---TPAFTLPACVSKACSILEAVADL-XXXXXXXXXXAFPDDKRRT 871
            LLLEDIQNFHQKNN   T   +LP CVSKACSILEAVADL           AF +     
Sbjct: 473  LLLEDIQNFHQKNNNATTAVVSLPPCVSKACSILEAVADLNSATGSNLSCSAFSE----- 527

Query: 872  PIADQFNKLYDNSSFGGQLAGKKGLETKEPCLQSEV-VGGDDLMEPSFHKYVTVRRGTLS 1048
               DQFNK   N+++   L        KEP ++SEV VG DDL EPSFHKYVTVRRG  S
Sbjct: 528  ---DQFNK-GTNNAYSSLLG-----PAKEPFVESEVIVGSDDLTEPSFHKYVTVRRGGGS 578

Query: 1049 G---EDMEEQESSGSNSIAGGGP-QHWA-SSSSWEPNSADSIDCWTSSSRSNCRED 1201
            G    D E+QESSGSNSIAGG   Q+W  SSSSWEPNSADS DC  S+SRSN RE+
Sbjct: 579  GGLVVDAEDQESSGSNSIAGGSQIQNWVLSSSSWEPNSADSTDC--STSRSNNREE 632


>ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max]
          Length = 678

 Score =  228 bits (582), Expect = 4e-57
 Identities = 175/447 (39%), Positives = 227/447 (50%), Gaps = 18/447 (4%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 208
            RPGKMVSVPATVSSL MDKSN+  G   T      K   VKRN G               
Sbjct: 249  RPGKMVSVPATVSSLVMDKSNSCGGDSGT-----KKITTVKRNVGD---AGSKGAASPRA 300

Query: 209  XXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNS-RKAEHSPYRRNPLGEIDTNIV--S 379
                        D                  R+NS RK E SPYRRNP  E+D N    +
Sbjct: 301  QSPARVNGNVGRDKMLNENLQQQHQQQPSLSRNNSSRKVEQSPYRRNPQSEVDHNSSRKA 360

Query: 380  EQIPLSGLKVHNNNLS-QAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAM 556
            EQ P S  KV  N    +A A      +  L+K + S+NCK KEQ     EE  +   A+
Sbjct: 361  EQSPYSNSKVQQNKPKIEAEAIQKPNGRVALEKGV-SVNCKTKEQHEE--EESSVPISAV 417

Query: 557  NNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPAPSYTALLLEDIQNFHQ 736
               V    V SG + LKP G+T             + +  +N   SY +LLLEDIQNFHQ
Sbjct: 418  ---VKTTAVSSGVDNLKPQGLTRSRSSRR------SRDLDTNATNSYASLLLEDIQNFHQ 468

Query: 737  KNNTP------AFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTPIADQFNKL 898
            KN         + +LPAC++K CSILEAVADL           F +DKR +P   Q N +
Sbjct: 469  KNTQQQQQQPSSVSLPACLNKVCSILEAVADLNSTTSSN----FTEDKR-SPSTQQSN-I 522

Query: 899  YDNSSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRR-GTLSGEDMEEQES 1075
             ++  +G ++AG      K+P ++SEV   DD+MEPS HKYVTV+R G +  EDME+QES
Sbjct: 523  RNDEYYGKKVAGSN----KDPFVESEVAVSDDVMEPSLHKYVTVKRGGGVVVEDMEDQES 578

Query: 1076 SGSNSI---AGGGPQHWAS----SSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLA 1234
            SGSNS    +  G  HW +    SSSWEPNSADS DCWTSS  S+  E+++        +
Sbjct: 579  SGSNSFTVSSSSGQHHWGNNISCSSSWEPNSADSTDCWTSSRLSSREEEAQKTPLGLGCS 638

Query: 1235 LSEPGYEVGEARRRMSVKKRDSDQQQN 1315
            LS    E  + ++ ++ K+R+ D + +
Sbjct: 639  LSS---EAKKKKKGLNSKRRECDHEHS 662


>ref|XP_004155763.1| PREDICTED: uncharacterized protein LOC101224225 [Cucumis sativus]
          Length = 750

 Score =  219 bits (559), Expect = 2e-54
 Identities = 167/453 (36%), Positives = 221/453 (48%), Gaps = 25/453 (5%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAE----GGPETISGAAVKRIQVKRNAGSDXXXXXXXXX 196
            RP KMVSVPATVS    DK+N+A     GG ++ +   VKRI VKRN G           
Sbjct: 297  RPAKMVSVPATVSHAETDKNNSAANVGCGGNDSATVTGVKRISVKRNVGEATAMTGSRVA 356

Query: 197  XXXXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIV 376
                             N                 RS+SRKAE SPYRRNPLGEIDTN  
Sbjct: 357  SSPRSQSPAR-------NNGNVKASDENQQQPSLSRSSSRKAEQSPYRRNPLGEIDTNSQ 409

Query: 377  SEQIPLSGLKVHN---------NNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITE 529
                  +  K            N ++Q P +D +    ++   +  +N  +     + T 
Sbjct: 410  QHNRIQNRSKKETEEVIAKDSINGVNQRPKADPKSVNKVI---VSQVNGSKPSSTATATR 466

Query: 530  E--EKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPA--PSY 697
                 I      +N  V VV    E  KP G+             INPE L N +  PSY
Sbjct: 467  GVVNIITSTTPLSNTEVLVV----EHQKPQGLARSRSARHSRELDINPETLLNQSQTPSY 522

Query: 698  TALLLEDIQNFHQKN-NTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTP 874
            T +LL+DIQNFHQK+ NT   +LPACV+KACSI+EAVADL          AF +++   P
Sbjct: 523  TKMLLQDIQNFHQKSTNTNPVSLPACVTKACSIVEAVADLNSTTSSNFSSAFSENRSNPP 582

Query: 875  IADQFNKLYDNSSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRRG----T 1042
                    Y +  + G L G    E ++P ++SEV   DD++EPSFHKYVTVRRG     
Sbjct: 583  TYQSSRNEY-SVPYSGSLKGTA--ELRDPFVESEVAMDDDILEPSFHKYVTVRRGGPVVA 639

Query: 1043 LSGEDMEEQESSGSNSIAGGGPQ--HWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSP 1213
              G D ++QESSGSNS  G   Q   W  S++SWEPN+ADS D  + +SR   +E+    
Sbjct: 640  AGGGDTDDQESSGSNSYVGSVQQQHQWGISTASWEPNTADSND--SRTSRQITKEEGHPH 697

Query: 1214 VPFQRLALSEPGYEVGEARRRMSVKKRDSDQQQ 1312
            +       S+PG +  + RRR + ++RDSD Q+
Sbjct: 698  LQ------SKPGLDRDDNRRRTAERRRDSDAQR 724


>ref|XP_004140353.1| PREDICTED: uncharacterized protein LOC101206761 [Cucumis sativus]
          Length = 742

 Score =  219 bits (559), Expect = 2e-54
 Identities = 167/453 (36%), Positives = 221/453 (48%), Gaps = 25/453 (5%)
 Frame = +2

Query: 29   RPGKMVSVPATVSSLAMDKSNNAE----GGPETISGAAVKRIQVKRNAGSDXXXXXXXXX 196
            RP KMVSVPATVS    DK+N+A     GG ++ +   VKRI VKRN G           
Sbjct: 289  RPAKMVSVPATVSHAETDKNNSAANVGCGGNDSATVTGVKRISVKRNVGEATAMTGSRVA 348

Query: 197  XXXXXXXXXXXXXXXNDNXXXXXXXXXXXXXXXXXRSNSRKAEHSPYRRNPLGEIDTNIV 376
                             N                 RS+SRKAE SPYRRNPLGEIDTN  
Sbjct: 349  SSPRSQSPAR-------NNGNVKASDENQQQPSLSRSSSRKAEQSPYRRNPLGEIDTNSQ 401

Query: 377  SEQIPLSGLKVHN---------NNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITE 529
                  +  K            N ++Q P +D +    ++   +  +N  +     + T 
Sbjct: 402  QHNRIQNRSKKETEEVIAKDSINGVNQRPKADPKSVNKVI---VSQVNGSKPSSTATATR 458

Query: 530  E--EKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXXINPEALSNPA--PSY 697
                 I      +N  V VV    E  KP G+             INPE L N +  PSY
Sbjct: 459  GVVNIITSTTPLSNTEVLVV----EHQKPQGLARSRSARHSRELDINPETLLNQSQTPSY 514

Query: 698  TALLLEDIQNFHQKN-NTPAFTLPACVSKACSILEAVADLXXXXXXXXXXAFPDDKRRTP 874
            T +LL+DIQNFHQK+ NT   +LPACV+KACSI+EAVADL          AF +++   P
Sbjct: 515  TKMLLQDIQNFHQKSTNTNPVSLPACVTKACSIVEAVADLNSTTSSNFSSAFSENRSNPP 574

Query: 875  IADQFNKLYDNSSFGGQLAGKKGLETKEPCLQSEVVGGDDLMEPSFHKYVTVRRG----T 1042
                    Y +  + G L G    E ++P ++SEV   DD++EPSFHKYVTVRRG     
Sbjct: 575  TYQSSRNEY-SVPYSGSLKGTA--ELRDPFVESEVAMDDDILEPSFHKYVTVRRGGPVVA 631

Query: 1043 LSGEDMEEQESSGSNSIAGGGPQ--HWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSP 1213
              G D ++QESSGSNS  G   Q   W  S++SWEPN+ADS D  + +SR   +E+    
Sbjct: 632  AGGGDTDDQESSGSNSYVGSVQQQHQWGISTASWEPNTADSND--SRTSRQITKEEGHPH 689

Query: 1214 VPFQRLALSEPGYEVGEARRRMSVKKRDSDQQQ 1312
            +       S+PG +  + RRR + ++RDSD Q+
Sbjct: 690  LQ------SKPGLDRDDNRRRTAERRRDSDAQR 716

