BLASTX nr result

ID: Catharanthus22_contig00011364 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00011364
         (1913 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006348213.1| PREDICTED: serine/arginine repetitive matrix...   352   2e-94
ref|XP_004233775.1| PREDICTED: uncharacterized protein LOC101251...   345   5e-92
ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Popu...   309   3e-81
gb|EOY03077.1| Uncharacterized protein isoform 2 [Theobroma cacao]    295   5e-77
gb|EOY03076.1| Uncharacterized protein isoform 1 [Theobroma cacao]    294   8e-77
ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261...   294   8e-77
ref|XP_002321932.2| hypothetical protein POPTR_0015s12740g [Popu...   290   1e-75
emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]   288   8e-75
ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus c...   283   1e-73
ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citr...   272   3e-70
ref|XP_006602044.1| PREDICTED: serine/arginine repetitive matrix...   264   9e-68
ref|XP_006591278.1| PREDICTED: serine/arginine repetitive matrix...   260   1e-66
ref|XP_003537379.1| PREDICTED: serine/arginine repetitive matrix...   260   2e-66
gb|ESW18934.1| hypothetical protein PHAVU_006G083400g [Phaseolus...   255   4e-65
ref|XP_004301811.1| PREDICTED: uncharacterized protein LOC101307...   243   2e-61
gb|ESW13982.1| hypothetical protein PHAVU_008G243000g [Phaseolus...   239   3e-60
gb|EXC20585.1| hypothetical protein L484_027140 [Morus notabilis]     239   4e-60
ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [...   231   6e-58
ref|XP_004155763.1| PREDICTED: uncharacterized protein LOC101224...   223   3e-55
ref|XP_004140353.1| PREDICTED: uncharacterized protein LOC101206...   223   3e-55

>ref|XP_006348213.1| PREDICTED: serine/arginine repetitive matrix protein 3-like, partial
            [Solanum tuberosum]
          Length = 652

 Score =  352 bits (904), Expect = 2e-94
 Identities = 221/430 (51%), Positives = 263/430 (61%), Gaps = 2/430 (0%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATVSS+ MDKS +A GG + IS AAVKRIQVKRNAG D             
Sbjct: 250  RPGKMVSVPATVSSMVMDKSIDA-GGTDNISAAAVKRIQVKRNAGGDGPRTAASPRARSP 308

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154
                       ++N                 RSNSRK E SPYRRNPL EIDTN+V EQ+
Sbjct: 309  ARVNAKVLNERDNNAHSNQNQQQPMSLS---RSNSRKHEQSPYRRNPLSEIDTNVVLEQM 365

Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNNV 974
            P  GLKV +  L+    S                N K KEQQ               +NV
Sbjct: 366  PAPGLKVPSQKLNAETVS----------------NGKVKEQQ---------------HNV 394

Query: 973  AVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKNN- 797
            A++V+ SGPE  KP               DINPEALSNP  SYTALLLEDIQNFHQK N 
Sbjct: 395  AMNVIVSGPESHKPQ---RSRSLRLSRDLDINPEALSNPPQSYTALLLEDIQNFHQKTNT 451

Query: 796  -TPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620
             TPAF+LP CV+KACSI++AVADL         SAF DD+RR P ++QF++  DN+SF  
Sbjct: 452  TTPAFSLPPCVTKACSIVDAVADLNSTTSSNLSSAFSDDRRRNPTSEQFSQ-NDNASF-- 508

Query: 619  QLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNSIAGG 440
               GKK L  K+PF++SEV    DLMEPS  KYVT RRGT    DMEEQESSGSNS+  G
Sbjct: 509  DPLGKKKLGIKDPFMESEVAVSGDLMEPSIQKYVTFRRGT----DMEEQESSGSNSVV-G 563

Query: 439  GPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRRMS 260
            G Q+W S SSWEPNSADS DCW  SS+S  R+D++SP+ FQR A+SE G+++ E +RR++
Sbjct: 564  GQQNWLSPSSWEPNSADSTDCW-PSSKSYSRDDNKSPLGFQRHAISEIGHDMEEGKRRVN 622

Query: 259  VKKRDSDQQQ 230
            VK+R+SD QQ
Sbjct: 623  VKRRESDNQQ 632


>ref|XP_004233775.1| PREDICTED: uncharacterized protein LOC101251847 [Solanum
            lycopersicum]
          Length = 690

 Score =  345 bits (884), Expect = 5e-92
 Identities = 216/430 (50%), Positives = 261/430 (60%), Gaps = 2/430 (0%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKM+SVPATVSS+ MDKS +A GG + IS AAVKRIQVKRNAG D             
Sbjct: 287  RPGKMISVPATVSSMVMDKSIDA-GGTDNISAAAVKRIQVKRNAGGDGPRTAASPRARSP 345

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154
                       ++N                 RSNSRK E SPYRRNPL EID+N+V EQ+
Sbjct: 346  ARVNAKVLNERDNNTHSNQNQQQPMSLS---RSNSRKHEQSPYRRNPLSEIDSNVVLEQM 402

Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNNV 974
            P  GLKV +  L+    S                N K KEQQ               +NV
Sbjct: 403  PAPGLKVPSQKLNAETVS----------------NGKVKEQQ--------------QHNV 432

Query: 973  AVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKNN- 797
            A++V+ SGPE  KP               DINPEALSNP  SYTALLLEDIQNFHQK N 
Sbjct: 433  AMNVIVSGPESHKPQ---RSRSLRLSRDLDINPEALSNPPQSYTALLLEDIQNFHQKTNT 489

Query: 796  -TPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620
             TPAF+LP CV+KACSI++AVADL         SA  DD+RR   ++Q+++  DN+SF  
Sbjct: 490  TTPAFSLPPCVTKACSIVDAVADLNSTTSSNLSSALSDDRRRNATSEQYSQ-NDNASF-- 546

Query: 619  QLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNSIAGG 440
               GKK L  K+PF++SEV   DDLMEPS  KYVT RRGT    DMEEQESSGSNS+  G
Sbjct: 547  DPLGKKKLGIKDPFMESEVTVSDDLMEPSIQKYVTFRRGT----DMEEQESSGSNSVV-G 601

Query: 439  GPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRRMS 260
            G Q+W S SSWEPNSADS DCW  SS+S  R+D++SP+ FQR A+SE  +++ E +RR++
Sbjct: 602  GQQNWLSPSSWEPNSADSTDCW-PSSKSYSRDDNKSPLGFQRHAISEISHDMEEGKRRVN 660

Query: 259  VKKRDSDQQQ 230
            VK+R+SD QQ
Sbjct: 661  VKRRESDNQQ 670


>ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Populus trichocarpa]
            gi|550327002|gb|EEE97021.2| hypothetical protein
            POPTR_0012s12820g [Populus trichocarpa]
          Length = 754

 Score =  309 bits (791), Expect = 3e-81
 Identities = 204/447 (45%), Positives = 245/447 (54%), Gaps = 29/447 (6%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATVSSL MDKSNN    P+  +G   KRI VKRN G               
Sbjct: 305  RPGKMVSVPATVSSLVMDKSNNIGVEPQATAGT--KRISVKRNVGEAAVAGSRTAASPRS 362

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154
                      SN+N                 RSNSRKA+ SPYRRNPL EID N +    
Sbjct: 363  QSPARANAKTSNENNQQPCLS----------RSNSRKADQSPYRRNPLSEIDPNSLQHSQ 412

Query: 1153 PLSGLKVH--NNNLSQA----------------PASDSRISKGILDKN------IISINC 1046
            P SG K    +NN SQ                 P + + + K   +KN      + +  C
Sbjct: 413  P-SGNKATCTSNNRSQIRNKDIEGQAVAKETFNPLNQTPMKKQNSEKNNRVNVQVANYRC 471

Query: 1045 KEKEQ-QNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEA 869
                  +N +++E+++ +   +  V  +VV  G E LKP  +T           D+NPE 
Sbjct: 472  SSMASLENKLSKEQQMEEAKGHPPVTTNVVDLGGESLKPQALTRSRSARRSRDLDLNPET 531

Query: 868  LSNPAPSYTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFP 689
            L NP PSYTALLLEDIQNFHQKN  P+F+LPACV+KACSILEAVADL          AF 
Sbjct: 532  LLNPTPSYTALLLEDIQNFHQKNTPPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFS 591

Query: 688  DDKRRTPIADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVR 509
            DD+   P     N           L GKK  E K+PF++SE++  DDLMEPSFHKYVTVR
Sbjct: 592  DDRISPPAVAAVN-----------LVGKKLPEAKDPFVESEIIASDDLMEPSFHKYVTVR 640

Query: 508  R--GTLSGEDMEEQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-ED 341
            R  GTL GEDM+ QESSGSNS  GG  QH   S+SSWEPNSADS D W  SSRSN R ED
Sbjct: 641  RGGGTLCGEDMDGQESSGSNSFVGGSQQHLGLSTSSWEPNSADSTDRW--SSRSNTRDED 698

Query: 340  SRSPVPFQRLALSEPGYEVGEARRRMS 260
             +SP+ +Q+  L E G +V +ARR  S
Sbjct: 699  DKSPLGYQKHGLPETGRDVEQARRAFS 725


>gb|EOY03077.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 718

 Score =  295 bits (755), Expect = 5e-77
 Identities = 205/457 (44%), Positives = 245/457 (53%), Gaps = 30/457 (6%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPE--TISGAAVKRIQVKRNAGSDXXXXXXXXXXX 1340
            RPGKMVSVPATVSSL MDKS N   G E  T +  A+KRI VKRN G             
Sbjct: 267  RPGKMVSVPATVSSLVMDKSTNGAAGVEAPTTTANAIKRISVKRNVGE------AAVGSR 320

Query: 1339 XXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSE 1160
                        +N N                SRS+SRKAEHSPYRRNPL EID N ++ 
Sbjct: 321  GTASPRSQSPARTNPNANNPKGCNENQLQPTLSRSSSRKAEHSPYRRNPLSEIDPNSLAY 380

Query: 1159 QIPLS------------GLKVHNNNLSQ-------APASDSRISKGILDKNIISINCKEK 1037
                +            GLK + N L+           ++   S G  D  ++++N   K
Sbjct: 381  PQSAANKTSTCINKGQGGLKEYTNKLNVEMNNKVVVQGANKAGSIGTADNKVVNVNSTAK 440

Query: 1036 EQQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNP 857
            EQ+             M   V  +    G E  KP  +T           D+NPE L NP
Sbjct: 441  EQR-------------MVEEVKTEPPMPGAENPKPQTLTRSRSSRRSRDLDLNPETLLNP 487

Query: 856  APS-YTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDK 680
             PS YT LLLEDIQNFHQ NN P+F+LP+CVSKACSILEAVADL          AF +D+
Sbjct: 488  IPSSYTTLLLEDIQNFHQTNNPPSFSLPSCVSKACSILEAVADLNSTTSSNLSCAFSEDR 547

Query: 679  RRTPIADQFNKLYDNSSFGGQLA--GKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRR 506
            +            D SS  G  A  G+K  ET++PF++SEVVG DDLMEPSFHKYVTVRR
Sbjct: 548  KGLST--------DESSKNGYNATVGRKMAETRDPFVESEVVGRDDLMEPSFHKYVTVRR 599

Query: 505  G-TLSGEDMEEQESSGSNSIAGGG-PQHWA-SSSSWEPNSADSIDCWTSSSRSNCREDSR 335
            G TL G DMEEQESSGSNS  G G  QHW  S SSWEPNSADS D WTS ++S   ED  
Sbjct: 600  GATLGGTDMEEQESSGSNSFVGSGQQQHWGFSPSSWEPNSADSTDRWTSRTKSR-EEDHS 658

Query: 334  SPVPFQRLALSEP--GYEV-GEARRRMSVKKRDSDQQ 233
            S +  QR AL+EP  G ++    R+ +S ++RD D Q
Sbjct: 659  SSLEPQRQALAEPQSGSDIKNSTRKGLSGRRRDVDLQ 695


>gb|EOY03076.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 785

 Score =  294 bits (753), Expect = 8e-77
 Identities = 205/461 (44%), Positives = 246/461 (53%), Gaps = 34/461 (7%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPE--TISGAAVKRIQVKRNAGSDXXXXXXXXXXX 1340
            RPGKMVSVPATVSSL MDKS N   G E  T +  A+KRI VKRN G             
Sbjct: 330  RPGKMVSVPATVSSLVMDKSTNGAAGVEAPTTTANAIKRISVKRNVGE------AAVGSR 383

Query: 1339 XXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSE 1160
                        +N N                SRS+SRKAEHSPYRRNPL EID N ++ 
Sbjct: 384  GTASPRSQSPARTNPNANNPKGCNENQLQPTLSRSSSRKAEHSPYRRNPLSEIDPNSLAY 443

Query: 1159 QIPLS------------GLKVHNNNLSQ-----------APASDSRISKGILDKNIISIN 1049
                +            GLK + N ++Q              ++   S G  D  ++++N
Sbjct: 444  PQSAANKTSTCINKGQGGLKEYTNVINQKLNVEMNNKVVVQGANKAGSIGTADNKVVNVN 503

Query: 1048 CKEKEQQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEA 869
               KEQ+             M   V  +    G E  KP  +T           D+NPE 
Sbjct: 504  STAKEQR-------------MVEEVKTEPPMPGAENPKPQTLTRSRSSRRSRDLDLNPET 550

Query: 868  LSNPAPS-YTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAF 692
            L NP PS YT LLLEDIQNFHQ NN P+F+LP+CVSKACSILEAVADL          AF
Sbjct: 551  LLNPIPSSYTTLLLEDIQNFHQTNNPPSFSLPSCVSKACSILEAVADLNSTTSSNLSCAF 610

Query: 691  PDDKRRTPIADQFNKLYDNSSFGGQLA--GKKGLETKEPFLQSEVVGGDDLMEPSFHKYV 518
             +D++            D SS  G  A  G+K  ET++PF++SEVVG DDLMEPSFHKYV
Sbjct: 611  SEDRKGLST--------DESSKNGYNATVGRKMAETRDPFVESEVVGRDDLMEPSFHKYV 662

Query: 517  TVRRG-TLSGEDMEEQESSGSNSIAGGG-PQHWA-SSSSWEPNSADSIDCWTSSSRSNCR 347
            TVRRG TL G DMEEQESSGSNS  G G  QHW  S SSWEPNSADS D WTS ++S   
Sbjct: 663  TVRRGATLGGTDMEEQESSGSNSFVGSGQQQHWGFSPSSWEPNSADSTDRWTSRTKSR-E 721

Query: 346  EDSRSPVPFQRLALSEP--GYEV-GEARRRMSVKKRDSDQQ 233
            ED  S +  QR AL+EP  G ++    R+ +S ++RD D Q
Sbjct: 722  EDHSSSLEPQRQALAEPQSGSDIKNSTRKGLSGRRRDVDLQ 762


>ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261489 [Vitis vinifera]
          Length = 710

 Score =  294 bits (753), Expect = 8e-77
 Identities = 198/433 (45%), Positives = 238/433 (54%), Gaps = 4/433 (0%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATV    +DK NN   G E+ +  AV+R+ VKRN+G               
Sbjct: 285  RPGKMVSVPATV----IDKGNNGSSGVESGNNGAVRRVLVKRNSGEVAASGSKTPRSRSP 340

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154
                       N +                 R++SRKAE SPYRRNPL EID NI +   
Sbjct: 341  ANARVVSNDSQNQHPSLS-------------RNSSRKAEQSPYRRNPLSEIDPNINNRG- 386

Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILD-KNIISINCKEKEQQNSITEEEKILQQAMNNN 977
             L   ++  +   +    D    K ++   N  S +  +  Q      E K LQ   N+ 
Sbjct: 387  -LKAREIEPDCQQKPNMKDMNNGKVVVHGTNNRSSSRGKVFQVVEEAGEPKGLQPRTNSI 445

Query: 976  VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKN- 800
                VV SG E LKP  +T           D+NPE L NP PSYT LLLEDIQNFHQKN 
Sbjct: 446  ETTIVVASGAESLKPQALTRTRSSRRSRDLDLNPETLLNPTPSYTTLLLEDIQNFHQKNT 505

Query: 799  NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620
             TP+ +LPACVSKA SILEAVADL          AF DD+R       F + + NS    
Sbjct: 506  TTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDRR------NFTETHQNSMDDK 559

Query: 619  QLAGKKGLETKEPF-LQSEVVGGDDLMEPSFHKYVTVRRGTL-SGEDMEEQESSGSNSIA 446
              AGKK LE K+PF ++SE+V  +DLMEPS HKYVTV+RGT+  G +MEEQESSGSNS  
Sbjct: 560  NPAGKKRLEAKDPFVVESEIVVCNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNSFV 619

Query: 445  GGGPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRR 266
            G    H     SWEPNSADS DCWT  SRSN RE+  SPV FQR ALSEPG E  E ++R
Sbjct: 620  GVSQLH-----SWEPNSADSTDCWT--SRSNTREEYPSPVCFQRHALSEPGRESEETQKR 672

Query: 265  MSVKKRDSDQQQN 227
            M  +K++ D QQN
Sbjct: 673  MGRRKKEIDHQQN 685


>ref|XP_002321932.2| hypothetical protein POPTR_0015s12740g [Populus trichocarpa]
            gi|550322594|gb|EEF06059.2| hypothetical protein
            POPTR_0015s12740g [Populus trichocarpa]
          Length = 736

 Score =  290 bits (743), Expect = 1e-75
 Identities = 200/446 (44%), Positives = 239/446 (53%), Gaps = 28/446 (6%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGK+VSVPATVSSL +DKSNN   G E  + A ++RI VKRN G               
Sbjct: 290  RPGKLVSVPATVSSLVVDKSNN---GVEPQATAGIRRISVKRNVGEAALTCSRMVASPSS 346

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVS-EQ 1157
                      SN+N                 RSNSRKA+ SPYRRNPL EID N +   Q
Sbjct: 347  KSPARTNAKTSNENNQQPSLS----------RSNSRKADQSPYRRNPLSEIDLNSLQYSQ 396

Query: 1156 IPLSGLKVHNNN---------------------LSQAPASDSRISKGILDKNIISINCKE 1040
             P +     +NN                     L+Q P       K     N    NC+ 
Sbjct: 397  PPANKATCTSNNRARIRNKDIEGQVVVKESFNLLNQTPMKKQNSEKNNR-VNAQVTNCRG 455

Query: 1039 KE---QQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEA 869
                  +N I++E+++ +          VV  G E LKP  +T           D+NPE 
Sbjct: 456  SSIVSLENKISKEQQMEEAKGQPTDMTTVVDLGVESLKPQTLTRSRSARRSRDLDLNPET 515

Query: 868  LSNPAPSYTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFP 689
            L NP PSYTALLLEDIQNFH K NTP+F+LPACV+KACSILEAVADL          AF 
Sbjct: 516  LLNPTPSYTALLLEDIQNFHLK-NTPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFS 574

Query: 688  DDKRRTPIADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVR 509
             D+R  P     N           L GKK  E K+PF++SEV+  DDL+EPSFHKYVTVR
Sbjct: 575  YDRRSPPTVAAAN-----------LVGKKPPEAKDPFVESEVLASDDLIEPSFHKYVTVR 623

Query: 508  R-GTLSGEDMEEQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-EDS 338
            R GTL GEDM+ QESSG +S+ GG  QH   S+SSWEPNSADSID WT  SRSN R ED 
Sbjct: 624  RAGTLCGEDMDGQESSGRDSVVGGSQQHLGFSTSSWEPNSADSIDHWT--SRSNWRDEDE 681

Query: 337  RSPVPFQRLALSEPGYEVGEARRRMS 260
            +SP+ FQ+  LSE   +V +ARR  S
Sbjct: 682  KSPLGFQKHELSETWRDVEQARRPFS 707


>emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]
          Length = 685

 Score =  288 bits (736), Expect = 8e-75
 Identities = 197/431 (45%), Positives = 236/431 (54%), Gaps = 4/431 (0%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATV    +DK NN   G E+ +  AV+R+ VKRN+G               
Sbjct: 285  RPGKMVSVPATV----IDKGNNGSSGVESGNNGAVRRVLVKRNSGEVAASGSKTPRSRSP 340

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154
                       N +                 R++SRKAE SPYRRNPL EID NI +   
Sbjct: 341  ANARVVSNXNQNQHPSLS-------------RNSSRKAEQSPYRRNPLSEIDPNINNRG- 386

Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILD-KNIISINCKEKEQQNSITEEEKILQQAMNNN 977
             L   ++  +   +    D    K ++   N  S +  +  Q      E K LQ   N+ 
Sbjct: 387  -LKAREIEPDCQQKPNMKDMNNGKVVVHGSNNRSSSRGKVFQVVEEAGEPKGLQPRTNSI 445

Query: 976  VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKN- 800
                VV SG E LKP  +T           D+NPE L N  PSYT LLLEDIQNFHQKN 
Sbjct: 446  ETTIVVASGAESLKPQALTRTRSSRRSRDLDLNPETLLNLTPSYTTLLLEDIQNFHQKNT 505

Query: 799  NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620
             TP+ +LPACVSKA SILEAVADL          AF DD+R       F + + NS    
Sbjct: 506  TTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDRR------NFTETHQNSMDDK 559

Query: 619  QLAGKKGLETKEPF-LQSEVVGGDDLMEPSFHKYVTVRRGTL-SGEDMEEQESSGSNSIA 446
              AGKK LE K+PF ++SE+V  +DLMEPS HKYVTV+RGT+  G +MEEQESSGSNS  
Sbjct: 560  NPAGKKRLEAKDPFVVESEIVVCNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNSFV 619

Query: 445  GGGPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRR 266
            G    H     SWEPNSADS DCWT  SRSN RE+  SPV FQR ALSEPG E  E ++R
Sbjct: 620  GVSQLH-----SWEPNSADSTDCWT--SRSNTREEYPSPVCFQRHALSEPGRESEETQKR 672

Query: 265  MSVKKRDSDQQ 233
            M  +KR+ D Q
Sbjct: 673  MGRRKREIDHQ 683


>ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus communis]
            gi|223529895|gb|EEF31825.1| hypothetical protein
            RCOM_0303940 [Ricinus communis]
          Length = 725

 Score =  283 bits (725), Expect = 1e-73
 Identities = 202/469 (43%), Positives = 246/469 (52%), Gaps = 17/469 (3%)
 Frame = -2

Query: 1513 RPGK-MVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXX 1337
            RPGK MVSVPATVSSL MDKSN    G E  +   VKRI VKRN G              
Sbjct: 289  RPGKKMVSVPATVSSLTMDKSNI---GVEPQAANGVKRISVKRNVGGGEAGSRSAASPRS 345

Query: 1336 XXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTN-IVSE 1160
                          N                SRS+SRKAE SPYRRNPL EIDTN +V  
Sbjct: 346  QSPA--------RTNAKGGGSNENNQQQPSLSRSSSRKAEQSPYRRNPLSEIDTNSLVYA 397

Query: 1159 QIPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNN 980
            Q   +    +NN+ S+A   +  +   ++ K  +++  + +  + +     KI  Q  N 
Sbjct: 398  QATGNNTTANNNSNSRAQTRNKELEGKLMVKESVNVLNQAQMHKPNAEANSKINAQGSNK 457

Query: 979  NVAVDVV---GSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFH 809
             V    V    SG + LKP  V            D NPE   NP PSYTALLLEDIQNFH
Sbjct: 458  GVKEQTVTAEASGAD-LKPQTVARSRSARRSRDLDFNPETSLNPNPSYTALLLEDIQNFH 516

Query: 808  QKN-----NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKL 644
            QK+     NTP+F++PACV+KACSI+EAVADL          AF D+KR           
Sbjct: 517  QKSTNTNTNTPSFSVPACVTKACSIVEAVADLNSTTSSNLSCAFSDEKRSP--------- 567

Query: 643  YDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRR-----GTLSGEDME 479
               ++    L GKK  E K+PF++SEV+  DDLMEPSFHKYVTVRR     GT S EDM+
Sbjct: 568  ---TTVVSNLVGKKLEEGKDPFVESEVLVNDDLMEPSFHKYVTVRRGGNGKGTSSVEDMD 624

Query: 478  EQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-EDSRSPVPFQRLAL 305
             QESSGSNS  G   QHW  S+SSWEPNSADS D WT  SRSN R E+ +SP+ FQ+   
Sbjct: 625  GQESSGSNSFVGSSQQHWGYSTSSWEPNSADSTDRWT--SRSNTRDEEEKSPLGFQKHTS 682

Query: 304  SEPGYEVGEARRRMSVKKRDSDQQQNXXXXXXXXXXGVQSLPTAAAAAS 158
            SE G ++ EARR        S Q+             + S P  AAA++
Sbjct: 683  SESGRDMEEARRGF------SGQRNGIGRGRVGSSKNLNSTPIVAAAST 725


>ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citrus clementina]
            gi|568855457|ref|XP_006481321.1| PREDICTED:
            serine/arginine repetitive matrix protein 2-like [Citrus
            sinensis] gi|557531784|gb|ESR42967.1| hypothetical
            protein CICLE_v10011149mg [Citrus clementina]
          Length = 740

 Score =  272 bits (696), Expect = 3e-70
 Identities = 212/488 (43%), Positives = 248/488 (50%), Gaps = 59/488 (12%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRN------------AGSDX 1370
            RPGKMVSVPATV+      SN++           VKRI VKRN            A S  
Sbjct: 258  RPGKMVSVPATVAVEPATASNSS----------GVKRISVKRNVGEAAGAVGSRMAASPR 307

Query: 1369 XXXXXXXXXXXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNS-RKAEHSPYRRNP 1193
                                  S+                     NS RKAEHSPYRRNP
Sbjct: 308  SKSPARVNGNNVKEQQHPSLSRSSSRKGEQHSPYRRNPSSEIDHPNSTRKAEHSPYRRNP 367

Query: 1192 LGEIDTNIVSEQIPLSGL----------KVHN-----------------NNLSQAP---A 1103
            L EID N  S Q P S            +V N                 N L QAP    
Sbjct: 368  LSEIDPN--SLQYPQSACNNKASNVITNRVRNKSRDFEGEGVFVRDSSANVLYQAPIHKP 425

Query: 1102 SDSRISKG-----------ILDKNIISINCKEKEQQNSITEEEKILQQAMNNNVAVDVVG 956
            +   I++G            L+  +   N  EKEQ+  I EE+K  Q  M  N AV    
Sbjct: 426  NAENIAQGTNNHKSSCRGTTLNNKVTGANITEKEQR-QILEEDK-AQLPMTANAAVVTES 483

Query: 955  SGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKNNTPAFTLP 776
              P+ L     T           D+NPE L NP PSYTALLLEDIQNFHQK +TP+ +LP
Sbjct: 484  QKPQTLTR---TRSSRRSRDLDLDLNPETLLNPTPSYTALLLEDIQNFHQK-STPSVSLP 539

Query: 775  ACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQF-NKLYDNSSFGGQLAGKKG 599
            ACV+KACSILEAVADL          AF +D R+ P ADQ  NK   N S G  L GKK 
Sbjct: 540  ACVTKACSILEAVADLNSTTSSNLSCAFSED-RKPPSADQSNNKNAYNFSAGVNLVGKKM 598

Query: 598  LETKEPFLQSEVVGGDDLMEPSFHKYVTVRRG--TLSGEDMEEQESSGSNSIAG-GGPQH 428
             E K+PF++SEV+  DDLMEPSFH+YVTVRRG   L G DM+ QESSGSNS  G    Q+
Sbjct: 599  TEAKDPFVESEVLADDDLMEPSFHRYVTVRRGGSELGGVDMDGQESSGSNSFVGCTTQQN 658

Query: 427  WASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE-PGYEVGEARRRMSVKK 251
            W SSSSWEPNSADS D WT  SRSN +E+ +SP+ FQR A+SE  G E  + R+  S K+
Sbjct: 659  WTSSSSWEPNSADSTDRWT--SRSNMKEEDQSPLGFQRQAMSEAAGCEATKNRKGFSGKR 716

Query: 250  RDSDQQQN 227
            RD+D QQN
Sbjct: 717  RDTDYQQN 724


>ref|XP_006602044.1| PREDICTED: serine/arginine repetitive matrix protein 2-like [Glycine
            max]
          Length = 725

 Score =  264 bits (675), Expect = 9e-68
 Identities = 186/444 (41%), Positives = 247/444 (55%), Gaps = 15/444 (3%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNN---AEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXX 1343
            RPGKMVSVPATVSSL MDKSNN     GG E+ +   +KRI VKRN G+           
Sbjct: 279  RPGKMVSVPATVSSLVMDKSNNNGGGGGGGESGATTGIKRITVKRNVGA---ASPRSQSP 335

Query: 1342 XXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVS 1163
                          N+N                 RSNSRKAE SPY+RNPL EI+ N ++
Sbjct: 336  ARANGNAASGNKAFNENQQQPSLS----------RSNSRKAEQSPYKRNPLSEIEPNSLA 385

Query: 1162 ---EQIPLSGLKVHNNNLSQAPASDSRISKG---ILDKNIISINCKEKEQQNSITEEEKI 1001
                    S  KV N    +     ++ + G    LDK + ++NCK K QQ    EE+  
Sbjct: 386  FPHSTANNSSSKVQNRPKKEFETEANQKTNGSRTALDKGM-NVNCKTKVQQ----EEDVK 440

Query: 1000 LQQAMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLE 827
            +Q ++ +NV V  +V  G + LKP + +T           D+NPEAL NP  SY +LLLE
Sbjct: 441  VQSSITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRQSRDLDLNPEALLNPPQSYASLLLE 500

Query: 826  DIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNK 647
            DIQNFHQK NTP  +LPACV+KACSILEAVADL          A   + RR+P+A Q ++
Sbjct: 501  DIQNFHQK-NTPPVSLPACVTKACSILEAVADLNSNAGLNFCGA---EDRRSPLAFQCSR 556

Query: 646  LYDNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDMEEQ 473
               N S      GK+  + ++P ++S ++   DD+ME S HKYVTV R G L G DM++Q
Sbjct: 557  NDYNVSLTTHDYGKREPDAEDPVVESMLLFNDDDVMEQSLHKYVTVNRGGLLGGVDMDDQ 616

Query: 472  ESSGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE 299
            ESSGSNS     G Q W  SSSSWEP+S +S DCWT  SRSN  ++    +  +    SE
Sbjct: 617  ESSGSNSFTVSSGQQRWGVSSSSWEPSSVESKDCWT--SRSNYSKEEGQKLGLEGRVASE 674

Query: 298  PGYEVGEARRRMSVKKRDSDQQQN 227
             G + GEA+++++ ++R+ D  Q+
Sbjct: 675  AGLDAGEAKKKLNSQRRECDHHQH 698


>ref|XP_006591278.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform
            X2 [Glycine max]
          Length = 733

 Score =  260 bits (665), Expect = 1e-66
 Identities = 192/472 (40%), Positives = 254/472 (53%), Gaps = 20/472 (4%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATVSSL MDKSNN  G  E+ +   +KRI VKRN G+              
Sbjct: 287  RPGKMVSVPATVSSLVMDKSNN-NGSGESGATTGIKRIAVKRNVGA-------------A 332

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEID-------- 1178
                      + +                 SRSNSRKAE SPY+RNPL EI+        
Sbjct: 333  SPRSQSPARANGNGANGNKAFSENQQQPSLSRSNSRKAEQSPYKRNPLSEIEPNSLAFPH 392

Query: 1177 --TNIVSEQIPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEK 1004
              TN  S ++     K      +Q   + +R +    DK + +INCK K QQ    EE+ 
Sbjct: 393  STTNNSSSRVQNRPKKEFETEANQQKTNGNRTAS---DKGV-TINCKTKVQQ----EEDV 444

Query: 1003 ILQQAMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXDINPEALSNPAP-SYTALL 833
             +Q ++ +NV V  +V  G + LKP + +T           DIN EAL NP P SY +LL
Sbjct: 445  KVQSSITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRRSQELDINCEALLNPPPQSYASLL 504

Query: 832  LEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQF 653
            LEDIQNFHQKN TP  +LPACV+KACSILEAVADL         S    + RR+P+A Q 
Sbjct: 505  LEDIQNFHQKN-TPPVSLPACVTKACSILEAVADLNSNAGLNFCSG---EDRRSPLAFQC 560

Query: 652  NKLYDNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDME 479
            ++   N        GK+  + ++P ++S +V   DD+MEP+ HKYVTV R G+L G DM+
Sbjct: 561  SRNDYNVPLTTNDYGKREPDAEDPVVESMLVFNDDDVMEPNLHKYVTVNRGGSLGGADMD 620

Query: 478  EQESSGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLAL 305
            +QESSGSNS     G QHW  SSSSWEP+S +S DCWTS S  +  E  RSP+  +    
Sbjct: 621  DQESSGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEECQRSPLGLEGTVA 680

Query: 304  SE-PGYEVGEARRRMSVKKRDSDQQ--QNXXXXXXXXXXGVQSLPTAAAAAS 158
            SE  G + G A+++++ ++R+ D Q               + ++P   AAAS
Sbjct: 681  SEVAGRDAGGAKKKLNSQRRECDHQHGSGIGRGRLGANKVLHNIPVVTAAAS 732


>ref|XP_003537379.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform
            X1 [Glycine max]
          Length = 732

 Score =  260 bits (664), Expect = 2e-66
 Identities = 191/468 (40%), Positives = 254/468 (54%), Gaps = 16/468 (3%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATVSSL MDKSNN  G  E+ +   +KRI VKRN G+              
Sbjct: 287  RPGKMVSVPATVSSLVMDKSNN-NGSGESGATTGIKRIAVKRNVGA-------------A 332

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVS--- 1163
                      + +                 SRSNSRKAE SPY+RNPL EI+ N ++   
Sbjct: 333  SPRSQSPARANGNGANGNKAFSENQQQPSLSRSNSRKAEQSPYKRNPLSEIEPNSLAFPH 392

Query: 1162 EQIPLSGLKVHNNNLSQAPASDSRISKG---ILDKNIISINCKEKEQQNSITEEEKILQQ 992
                 S  +V N    +     ++ + G     DK + +INCK K QQ    EE+  +Q 
Sbjct: 393  STTNNSSSRVQNRPKKEFETEANQKTNGNRTASDKGV-TINCKTKVQQ----EEDVKVQS 447

Query: 991  AMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXDINPEALSNPAP-SYTALLLEDI 821
            ++ +NV V  +V  G + LKP + +T           DIN EAL NP P SY +LLLEDI
Sbjct: 448  SITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRRSQELDINCEALLNPPPQSYASLLLEDI 507

Query: 820  QNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLY 641
            QNFHQKN TP  +LPACV+KACSILEAVADL         S    + RR+P+A Q ++  
Sbjct: 508  QNFHQKN-TPPVSLPACVTKACSILEAVADLNSNAGLNFCSG---EDRRSPLAFQCSRND 563

Query: 640  DNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDMEEQES 467
             N        GK+  + ++P ++S +V   DD+MEP+ HKYVTV R G+L G DM++QES
Sbjct: 564  YNVPLTTNDYGKREPDAEDPVVESMLVFNDDDVMEPNLHKYVTVNRGGSLGGADMDDQES 623

Query: 466  SGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE-P 296
            SGSNS     G QHW  SSSSWEP+S +S DCWTS S  +  E  RSP+  +    SE  
Sbjct: 624  SGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEECQRSPLGLEGTVASEVA 683

Query: 295  GYEVGEARRRMSVKKRDSDQQ--QNXXXXXXXXXXGVQSLPTAAAAAS 158
            G + G A+++++ ++R+ D Q               + ++P   AAAS
Sbjct: 684  GRDAGGAKKKLNSQRRECDHQHGSGIGRGRLGANKVLHNIPVVTAAAS 731


>gb|ESW18934.1| hypothetical protein PHAVU_006G083400g [Phaseolus vulgaris]
          Length = 718

 Score =  255 bits (652), Expect = 4e-65
 Identities = 175/440 (39%), Positives = 237/440 (53%), Gaps = 13/440 (2%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATVSSL MDKSNN  GG E+ +   +KRI VKRN G+              
Sbjct: 275  RPGKMVSVPATVSSLVMDKSNNNGGGGESAATTGIKRITVKRNVGA---ASPRSQSPARA 331

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154
                       N+N                 RS+SRKAE SPY+RNPL EI+ N ++   
Sbjct: 332  NGNAANANKAFNENQPPPSLS----------RSSSRKAEQSPYKRNPLSEIEPNSLA--F 379

Query: 1153 PLSGLKVHNNNLSQAPASD--------SRISKGILDKNI-ISINCKEKEQQNSITEEEKI 1001
            P S    +++ +   P  +        +  S+  LDK + ++ N K + + +      K+
Sbjct: 380  PHSTANNNSSRVQNRPKKEFETEAIQRTNSSRTALDKGMTVTYNTKVQPEGDI-----KV 434

Query: 1000 LQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDI 821
                 +N V   +V  G + LKPH +T           D+NPEAL NP  SY +LLLEDI
Sbjct: 435  QSLITDNAVVKTMVPPGLDNLKPHKLTRSRSSRRSQDLDLNPEALLNPPQSYASLLLEDI 494

Query: 820  QNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLY 641
            QNFHQK+ TP  +LPACV+KACSILEAVA+L          A   + RR+P   Q ++  
Sbjct: 495  QNFHQKS-TPPVSLPACVTKACSILEAVAELNSNTNLNFGGA---EDRRSPPTFQCSRND 550

Query: 640  DNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRRG-TLSGEDMEEQES 467
             N        GK+  + ++P ++S +V   DD++E S HKYVTV RG ++ G DME+QES
Sbjct: 551  YNVPLTANDYGKREPDAEDPVVESMLVFNDDDVLESSLHKYVTVNRGGSVGGVDMEDQES 610

Query: 466  SGSNSIA-GGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPG 293
            SGSNS   G G Q W  SSSSWEP+S +S DCWTS    +  E  +SP+  +    SE G
Sbjct: 611  SGSNSFTVGNGQQQWGISSSSWEPSSVESRDCWTSRLNYSREEGQKSPLGLEGSVSSETG 670

Query: 292  YEVGEARRRMSVKKRDSDQQ 233
             +V  AR++++   R+ D Q
Sbjct: 671  CDVDGARKKLNSNGRECDHQ 690


>ref|XP_004301811.1| PREDICTED: uncharacterized protein LOC101307599 [Fragaria vesca
            subsp. vesca]
          Length = 683

 Score =  243 bits (621), Expect = 2e-61
 Identities = 190/465 (40%), Positives = 235/465 (50%), Gaps = 13/465 (2%)
 Frame = -2

Query: 1513 RPGKM--VSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXX 1340
            RPGKM  VSVPATV    MDK++N E      +  ++KRI VKRNAG D           
Sbjct: 275  RPGKMKMVSVPATV----MDKNSNGESA----TTGSIKRISVKRNAG-DAVNVTVGSRTA 325

Query: 1339 XXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSE 1160
                        +N                   RS+SRKAE SPYRRNPL E+D N    
Sbjct: 326  ASPRSQSPARGGANAKASNDSLQPSLS------RSSSRKAEQSPYRRNPLSELDPN---- 375

Query: 1159 QIPLSGLKVHNNNLSQAPASDSRISKGILD--KNIISINCKEKEQQNSITEEEKILQQAM 986
               L+  + H NN      ++++ S  +L+  K  + I C +   Q  I         AM
Sbjct: 376  --SLAYPQAHINN------TNNKSSCNVLNQLKPNVEITCNKIITQG-INYRSSTASSAM 426

Query: 985  NNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQ 806
            +N V      SG + LK   +T           DINP+ LSNP PSYT LLLEDIQNFHQ
Sbjct: 427  DNKVVEPAGASGVDCLKHQTLTRSRSSRRSRDLDINPQTLSNPPPSYTRLLLEDIQNFHQ 486

Query: 805  K-NNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNK--LYDN 635
            + +N    +LP CV+KACSILEAVADL           F  D R++P  DQ NK   Y N
Sbjct: 487  QSSNAAVVSLPQCVTKACSILEAVADLNSTTN------FSAD-RKSPSIDQINKSSCYYN 539

Query: 634  SSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSN 455
             S       +K +    PF++SEV+ GDDL+ PSFHKYVTVRRG   G DME+QESSGSN
Sbjct: 540  CSLDANPVPRKDI----PFVESEVLVGDDLVAPSFHKYVTVRRG---GTDMEDQESSGSN 592

Query: 454  SIAGGGPQ-HWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGE 278
            S   G  Q  W  SSSWEPNSADS DCWT  SRS+ RED ++             +++ E
Sbjct: 593  SFVSGSQQPQWGLSSSWEPNSADSTDCWT--SRSSTREDDQN-------------FDMDE 637

Query: 277  -ARRRMSVKKRDSDQQQNXXXXXXXXXXGVQS----LPTAAAAAS 158
             ARRR+S +K D    Q+                  +P  AAAAS
Sbjct: 638  AARRRLSRRKTDGQNTQSSCGIGRGKLAAASKGLPIMPVVAAAAS 682


>gb|ESW13982.1| hypothetical protein PHAVU_008G243000g [Phaseolus vulgaris]
          Length = 652

 Score =  239 bits (610), Expect = 3e-60
 Identities = 172/403 (42%), Positives = 213/403 (52%), Gaps = 10/403 (2%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVP TVSSLAMDKSNN  G   T      KRI VKRN G               
Sbjct: 247  RPGKMVSVPPTVSSLAMDKSNNCGGESGT------KRITVKRNVGD------VGSRGAAS 294

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNS-RKAEHSPYRRNPLGEIDTNIVSEQ 1157
                       N                  SR+NS RKAE SPYRRNPL E+D N   +Q
Sbjct: 295  PRTQSPARVNGNVASARVLSENQQHQQPSLSRNNSSRKAEQSPYRRNPLSEVDNNSKVQQ 354

Query: 1156 IPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNN 977
                     N   ++A A      +  L+K + ++NCK KE    ++ +  +        
Sbjct: 355  ---------NKPKTEAEAMQKPNGRVALEKGV-TVNCKTKEHHEDVSLDSAV-------- 396

Query: 976  VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSN--PAPSYTALLLEDIQNFHQK 803
            V   V  SG + LKP G+T           DINPE++ N  P  SY +LLLEDIQNFHQK
Sbjct: 397  VKTTVASSGVDNLKPQGLTRSRSSRRSRDLDINPESVVNVNPTHSYASLLLEDIQNFHQK 456

Query: 802  NNT--PAFT-LPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNS 632
            N    P+ T LPAC++KACSI+EAV DL          AF +D R++P   Q        
Sbjct: 457  NTPQQPSSTSLPACLTKACSIIEAVGDLSYTTSSNFSGAFSED-RKSPSTQQ-------- 507

Query: 631  SFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNS 452
            SF     GKK   +K+PF++SEV  GDD+MEPS HKYVTV+RG+ +  DM++QESSGSNS
Sbjct: 508  SFRNGYYGKKVQGSKDPFVESEVDVGDDVMEPSLHKYVTVKRGS-AVVDMDDQESSGSNS 566

Query: 451  --IAGGGPQHWA--SSSSWEPNSADSIDCWTSSSRSNCREDSR 335
              ++  G  HW   S SSWEPNSADS D WT  SR + RE+ +
Sbjct: 567  FTVSSSGQHHWGAISCSSWEPNSADSTDSWT--SRLSSREEGQ 607


>gb|EXC20585.1| hypothetical protein L484_027140 [Morus notabilis]
          Length = 676

 Score =  239 bits (609), Expect = 4e-60
 Identities = 189/416 (45%), Positives = 211/416 (50%), Gaps = 25/416 (6%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSS-LAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXX 1337
            RPGKMVSVPATVSS L MDKSNN +      S   +KRI VKRN G              
Sbjct: 252  RPGKMVSVPATVSSSLVMDKSNNMDSAANANS---IKRISVKRNVGEAGSRGAASPRSQS 308

Query: 1336 XXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQ 1157
                       SN+                 SR++SRKAE SPYRRNPL EID N +S  
Sbjct: 309  PARGGNGNAKSSNE----------PQAQPSLSRNSSRKAEQSPYRRNPLSEIDPNSLSYP 358

Query: 1156 IPLSGLKVHNNNLSQA------------PASDSRISKGILDKNIISINCKEKEQQNSITE 1013
             P      HNNN +              P  D  I    L       N +   + N    
Sbjct: 359  NP------HNNNGNNGRAQSKSKRETCVPEEDENILVKELPTQAQKPNAETNYRSNGRVS 412

Query: 1012 EEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEAL--SNPAPSYTA 839
             E    Q     V   VV SG +      +T           DINPE L   NP PSYT 
Sbjct: 413  AENKNSQPKQAMVETTVVISGADNKPQQTLTRSRSSRRSRDLDINPETLLNPNPTPSYTR 472

Query: 838  LLLEDIQNFHQKNN---TPAFTLPACVSKACSILEAVADL-XXXXXXXXXSAFPDDKRRT 671
            LLLEDIQNFHQKNN   T   +LP CVSKACSILEAVADL          SAF +     
Sbjct: 473  LLLEDIQNFHQKNNNATTAVVSLPPCVSKACSILEAVADLNSATGSNLSCSAFSE----- 527

Query: 670  PIADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEV-VGGDDLMEPSFHKYVTVRRGTLS 494
               DQFNK   N+++   L        KEPF++SEV VG DDL EPSFHKYVTVRRG  S
Sbjct: 528  ---DQFNK-GTNNAYSSLLG-----PAKEPFVESEVIVGSDDLTEPSFHKYVTVRRGGGS 578

Query: 493  G---EDMEEQESSGSNSIAGGGP-QHWA-SSSSWEPNSADSIDCWTSSSRSNCRED 341
            G    D E+QESSGSNSIAGG   Q+W  SSSSWEPNSADS DC  S+SRSN RE+
Sbjct: 579  GGLVVDAEDQESSGSNSIAGGSQIQNWVLSSSSWEPNSADSTDC--STSRSNNREE 632


>ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max]
          Length = 678

 Score =  231 bits (590), Expect = 6e-58
 Identities = 177/447 (39%), Positives = 229/447 (51%), Gaps = 18/447 (4%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334
            RPGKMVSVPATVSSL MDKSN+  G   T      K   VKRN G               
Sbjct: 249  RPGKMVSVPATVSSLVMDKSNSCGGDSGT-----KKITTVKRNVGD---AGSKGAASPRA 300

Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNS-RKAEHSPYRRNPLGEIDTNIV--S 1163
                        D                 SR+NS RK E SPYRRNP  E+D N    +
Sbjct: 301  QSPARVNGNVGRDKMLNENLQQQHQQQPSLSRNNSSRKVEQSPYRRNPQSEVDHNSSRKA 360

Query: 1162 EQIPLSGLKVHNNNLS-QAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAM 986
            EQ P S  KV  N    +A A      +  L+K + S+NCK KEQ     EE  +   A+
Sbjct: 361  EQSPYSNSKVQQNKPKIEAEAIQKPNGRVALEKGV-SVNCKTKEQHEE--EESSVPISAV 417

Query: 985  NNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQ 806
               V    V SG + LKP G+T             + +  +N   SY +LLLEDIQNFHQ
Sbjct: 418  ---VKTTAVSSGVDNLKPQGLTRSRSSRR------SRDLDTNATNSYASLLLEDIQNFHQ 468

Query: 805  KNNTP------AFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKL 644
            KN         + +LPAC++K CSILEAVADL           F +DKR +P   Q N +
Sbjct: 469  KNTQQQQQQPSSVSLPACLNKVCSILEAVADLNSTTSSN----FTEDKR-SPSTQQSN-I 522

Query: 643  YDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRR-GTLSGEDMEEQES 467
             ++  +G ++AG      K+PF++SEV   DD+MEPS HKYVTV+R G +  EDME+QES
Sbjct: 523  RNDEYYGKKVAGSN----KDPFVESEVAVSDDVMEPSLHKYVTVKRGGGVVVEDMEDQES 578

Query: 466  SGSNSI---AGGGPQHWAS----SSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLA 308
            SGSNS    +  G  HW +    SSSWEPNSADS DCWTSS  S+  E+++        +
Sbjct: 579  SGSNSFTVSSSSGQHHWGNNISCSSSWEPNSADSTDCWTSSRLSSREEEAQKTPLGLGCS 638

Query: 307  LSEPGYEVGEARRRMSVKKRDSDQQQN 227
            LS    E  + ++ ++ K+R+ D + +
Sbjct: 639  LSS---EAKKKKKGLNSKRRECDHEHS 662


>ref|XP_004155763.1| PREDICTED: uncharacterized protein LOC101224225 [Cucumis sativus]
          Length = 750

 Score =  223 bits (567), Expect = 3e-55
 Identities = 171/453 (37%), Positives = 225/453 (49%), Gaps = 25/453 (5%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAE----GGPETISGAAVKRIQVKRNAGSDXXXXXXXXX 1346
            RP KMVSVPATVS    DK+N+A     GG ++ +   VKRI VKRN G           
Sbjct: 297  RPAKMVSVPATVSHAETDKNNSAANVGCGGNDSATVTGVKRISVKRNVGEATAMTGSRVA 356

Query: 1345 XXXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIV 1166
                             N                SRS+SRKAE SPYRRNPLGEIDTN  
Sbjct: 357  SSPRSQSPAR-------NNGNVKASDENQQQPSLSRSSSRKAEQSPYRRNPLGEIDTNSQ 409

Query: 1165 SEQIPLSGLKVHN---------NNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITE 1013
                  +  K            N ++Q P +D +    ++   +  +N  +     + T 
Sbjct: 410  QHNRIQNRSKKETEEVIAKDSINGVNQRPKADPKSVNKVI---VSQVNGSKPSSTATATR 466

Query: 1012 E--EKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPA--PSY 845
                 I      +N  V VV    E  KP G+            DINPE L N +  PSY
Sbjct: 467  GVVNIITSTTPLSNTEVLVV----EHQKPQGLARSRSARHSRELDINPETLLNQSQTPSY 522

Query: 844  TALLLEDIQNFHQKN-NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTP 668
            T +LL+DIQNFHQK+ NT   +LPACV+KACSI+EAVADL         SAF +++   P
Sbjct: 523  TKMLLQDIQNFHQKSTNTNPVSLPACVTKACSIVEAVADLNSTTSSNFSSAFSENRSNPP 582

Query: 667  IADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRG----T 500
                    Y +  + G L G    E ++PF++SEV   DD++EPSFHKYVTVRRG     
Sbjct: 583  TYQSSRNEY-SVPYSGSLKGTA--ELRDPFVESEVAMDDDILEPSFHKYVTVRRGGPVVA 639

Query: 499  LSGEDMEEQESSGSNSIAGGGPQ--HWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSP 329
              G D ++QESSGSNS  G   Q   W  S++SWEPN+ADS D  + +SR   +E+    
Sbjct: 640  AGGGDTDDQESSGSNSYVGSVQQQHQWGISTASWEPNTADSND--SRTSRQITKEEGHPH 697

Query: 328  VPFQRLALSEPGYEVGEARRRMSVKKRDSDQQQ 230
            +       S+PG +  + RRR + ++RDSD Q+
Sbjct: 698  LQ------SKPGLDRDDNRRRTAERRRDSDAQR 724


>ref|XP_004140353.1| PREDICTED: uncharacterized protein LOC101206761 [Cucumis sativus]
          Length = 742

 Score =  223 bits (567), Expect = 3e-55
 Identities = 171/453 (37%), Positives = 225/453 (49%), Gaps = 25/453 (5%)
 Frame = -2

Query: 1513 RPGKMVSVPATVSSLAMDKSNNAE----GGPETISGAAVKRIQVKRNAGSDXXXXXXXXX 1346
            RP KMVSVPATVS    DK+N+A     GG ++ +   VKRI VKRN G           
Sbjct: 289  RPAKMVSVPATVSHAETDKNNSAANVGCGGNDSATVTGVKRISVKRNVGEATAMTGSRVA 348

Query: 1345 XXXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIV 1166
                             N                SRS+SRKAE SPYRRNPLGEIDTN  
Sbjct: 349  SSPRSQSPAR-------NNGNVKASDENQQQPSLSRSSSRKAEQSPYRRNPLGEIDTNSQ 401

Query: 1165 SEQIPLSGLKVHN---------NNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITE 1013
                  +  K            N ++Q P +D +    ++   +  +N  +     + T 
Sbjct: 402  QHNRIQNRSKKETEEVIAKDSINGVNQRPKADPKSVNKVI---VSQVNGSKPSSTATATR 458

Query: 1012 E--EKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPA--PSY 845
                 I      +N  V VV    E  KP G+            DINPE L N +  PSY
Sbjct: 459  GVVNIITSTTPLSNTEVLVV----EHQKPQGLARSRSARHSRELDINPETLLNQSQTPSY 514

Query: 844  TALLLEDIQNFHQKN-NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTP 668
            T +LL+DIQNFHQK+ NT   +LPACV+KACSI+EAVADL         SAF +++   P
Sbjct: 515  TKMLLQDIQNFHQKSTNTNPVSLPACVTKACSIVEAVADLNSTTSSNFSSAFSENRSNPP 574

Query: 667  IADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRG----T 500
                    Y +  + G L G    E ++PF++SEV   DD++EPSFHKYVTVRRG     
Sbjct: 575  TYQSSRNEY-SVPYSGSLKGTA--ELRDPFVESEVAMDDDILEPSFHKYVTVRRGGPVVA 631

Query: 499  LSGEDMEEQESSGSNSIAGGGPQ--HWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSP 329
              G D ++QESSGSNS  G   Q   W  S++SWEPN+ADS D  + +SR   +E+    
Sbjct: 632  AGGGDTDDQESSGSNSYVGSVQQQHQWGISTASWEPNTADSND--SRTSRQITKEEGHPH 689

Query: 328  VPFQRLALSEPGYEVGEARRRMSVKKRDSDQQQ 230
            +       S+PG +  + RRR + ++RDSD Q+
Sbjct: 690  LQ------SKPGLDRDDNRRRTAERRRDSDAQR 716


Top