BLASTX nr result
ID: Catharanthus22_contig00011364
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00011364 (1913 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006348213.1| PREDICTED: serine/arginine repetitive matrix...  352   2e-94
ref|XP_004233775.1| PREDICTED: uncharacterized protein LOC101251...  345   5e-92
ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Popu...  309   3e-81
gb|EOY03077.1| Uncharacterized protein isoform 2 [Theobroma cacao]   295   5e-77
gb|EOY03076.1| Uncharacterized protein isoform 1 [Theobroma cacao]   294   8e-77
ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261...  294   8e-77
ref|XP_002321932.2| hypothetical protein POPTR_0015s12740g [Popu...  290   1e-75
emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera]  288   8e-75
ref|XP_002530557.1| hypothetical protein RCOM_0303940 [Ricinus c...  283   1e-73
ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citr...  272   3e-70
ref|XP_006602044.1| PREDICTED: serine/arginine repetitive matrix...  264   9e-68
ref|XP_006591278.1| PREDICTED: serine/arginine repetitive matrix...  260   1e-66
ref|XP_003537379.1| PREDICTED: serine/arginine repetitive matrix...  260   2e-66
gb|ESW18934.1| hypothetical protein PHAVU_006G083400g [Phaseolus...  255   4e-65
ref|XP_004301811.1| PREDICTED: uncharacterized protein LOC101307...  243   2e-61
gb|ESW13982.1| hypothetical protein PHAVU_008G243000g [Phaseolus...  239   3e-60
gb|EXC20585.1| hypothetical protein L484_027140 [Morus notabilis]    239   4e-60
ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [...  231   6e-58
ref|XP_004155763.1| PREDICTED: uncharacterized protein LOC101224...  223   3e-55
ref|XP_004140353.1| PREDICTED: uncharacterized protein LOC101206...  223   3e-55

>ref|XP_006348213.1| PREDICTED: serine/arginine repetitive matrix protein 3-like, partial [Solanum tuberosum] Length = 652 Score = 352 bits (904), Expect = 2e-94 Identities = 221/430 (51%), Positives = 263/430 (61%), Gaps = 2/430 (0%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATVSS+ MDKS +A GG + IS AAVKRIQVKRNAG D Sbjct: 250 RPGKMVSVPATVSSMVMDKSIDA-GGTDNISAAAVKRIQVKRNAGGDGPRTAASPRARSP 308 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154 ++N RSNSRK E SPYRRNPL EIDTN+V EQ+ Sbjct: 309 ARVNAKVLNERDNNAHSNQNQQQPMSLS---RSNSRKHEQSPYRRNPLSEIDTNVVLEQM 365 Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNNV 974 P GLKV + L+ S N K KEQQ +NV Sbjct: 366 PAPGLKVPSQKLNAETVS----------------NGKVKEQQ---------------HNV 394 Query: 973 AVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKNN- 797 A++V+ SGPE KP DINPEALSNP SYTALLLEDIQNFHQK N Sbjct: 395 AMNVIVSGPESHKPQ---RSRSLRLSRDLDINPEALSNPPQSYTALLLEDIQNFHQKTNT 451 Query: 796 -TPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620 TPAF+LP CV+KACSI++AVADL SAF DD+RR P ++QF++ DN+SF Sbjct: 452 TTPAFSLPPCVTKACSIVDAVADLNSTTSSNLSSAFSDDRRRNPTSEQFSQ-NDNASF-- 508 Query: 619 QLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNSIAGG 440 GKK L K+PF++SEV DLMEPS KYVT RRGT DMEEQESSGSNS+ G Sbjct: 509 DPLGKKKLGIKDPFMESEVAVSGDLMEPSIQKYVTFRRGT----DMEEQESSGSNSVV-G 563 Query: 439 GPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRRMS 260 G Q+W S SSWEPNSADS DCW SS+S R+D++SP+ FQR A+SE G+++ E +RR++ Sbjct: 564 GQQNWLSPSSWEPNSADSTDCW-PSSKSYSRDDNKSPLGFQRHAISEIGHDMEEGKRRVN 622 Query: 259 VKKRDSDQQQ 230 VK+R+SD QQ Sbjct: 623 VKRRESDNQQ 632
>ref|XP_004233775.1| PREDICTED: uncharacterized protein LOC101251847 [Solanum lycopersicum] Length = 690 Score = 345 bits (884), Expect = 5e-92
Identities = 216/430 (50%), Positives = 261/430 (60%), Gaps = 2/430 (0%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKM+SVPATVSS+ MDKS +A GG + IS AAVKRIQVKRNAG D Sbjct: 287 RPGKMISVPATVSSMVMDKSIDA-GGTDNISAAAVKRIQVKRNAGGDGPRTAASPRARSP 345 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154 ++N RSNSRK E SPYRRNPL EID+N+V EQ+ Sbjct: 346 ARVNAKVLNERDNNTHSNQNQQQPMSLS---RSNSRKHEQSPYRRNPLSEIDSNVVLEQM 402 Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNNV 974 P GLKV + L+ S N K KEQQ +NV Sbjct: 403 PAPGLKVPSQKLNAETVS----------------NGKVKEQQ--------------QHNV 432 Query: 973 AVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKNN- 797 A++V+ SGPE KP DINPEALSNP SYTALLLEDIQNFHQK N Sbjct: 433 AMNVIVSGPESHKPQ---RSRSLRLSRDLDINPEALSNPPQSYTALLLEDIQNFHQKTNT 489 Query: 796 -TPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620 TPAF+LP CV+KACSI++AVADL SA DD+RR ++Q+++ DN+SF Sbjct: 490 TTPAFSLPPCVTKACSIVDAVADLNSTTSSNLSSALSDDRRRNATSEQYSQ-NDNASF-- 546 Query: 619 QLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNSIAGG 440 GKK L K+PF++SEV DDLMEPS KYVT RRGT DMEEQESSGSNS+ G Sbjct: 547 DPLGKKKLGIKDPFMESEVTVSDDLMEPSIQKYVTFRRGT----DMEEQESSGSNSVV-G 601 Query: 439 GPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRRMS 260 G Q+W S SSWEPNSADS DCW SS+S R+D++SP+ FQR A+SE +++ E +RR++ Sbjct: 602 GQQNWLSPSSWEPNSADSTDCW-PSSKSYSRDDNKSPLGFQRHAISEISHDMEEGKRRVN 660 Query: 259 VKKRDSDQQQ 230 VK+R+SD QQ Sbjct: 661 VKRRESDNQQ 670 >ref|XP_002318801.2| hypothetical protein POPTR_0012s12820g [Populus trichocarpa] gi|550327002|gb|EEE97021.2| hypothetical protein POPTR_0012s12820g [Populus trichocarpa] Length = 754 Score = 309 bits (791), Expect = 3e-81 Identities = 204/447 (45%), Positives = 245/447 (54%), Gaps = 29/447 (6%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATVSSL MDKSNN P+ +G KRI VKRN G Sbjct: 305 
RPGKMVSVPATVSSLVMDKSNNIGVEPQATAGT--KRISVKRNVGEAAVAGSRTAASPRS 362 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154 SN+N RSNSRKA+ SPYRRNPL EID N + Sbjct: 363 QSPARANAKTSNENNQQPCLS----------RSNSRKADQSPYRRNPLSEIDPNSLQHSQ 412 Query: 1153 PLSGLKVH--NNNLSQA----------------PASDSRISKGILDKN------IISINC 1046 P SG K +NN SQ P + + + K +KN + + C Sbjct: 413 P-SGNKATCTSNNRSQIRNKDIEGQAVAKETFNPLNQTPMKKQNSEKNNRVNVQVANYRC 471 Query: 1045 KEKEQ-QNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEA 869 +N +++E+++ + + V +VV G E LKP +T D+NPE Sbjct: 472 SSMASLENKLSKEQQMEEAKGHPPVTTNVVDLGGESLKPQALTRSRSARRSRDLDLNPET 531 Query: 868 LSNPAPSYTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFP 689 L NP PSYTALLLEDIQNFHQKN P+F+LPACV+KACSILEAVADL AF Sbjct: 532 LLNPTPSYTALLLEDIQNFHQKNTPPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFS 591 Query: 688 DDKRRTPIADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVR 509 DD+ P N L GKK E K+PF++SE++ DDLMEPSFHKYVTVR Sbjct: 592 DDRISPPAVAAVN-----------LVGKKLPEAKDPFVESEIIASDDLMEPSFHKYVTVR 640 Query: 508 R--GTLSGEDMEEQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-ED 341 R GTL GEDM+ QESSGSNS GG QH S+SSWEPNSADS D W SSRSN R ED Sbjct: 641 RGGGTLCGEDMDGQESSGSNSFVGGSQQHLGLSTSSWEPNSADSTDRW--SSRSNTRDED 698 Query: 340 SRSPVPFQRLALSEPGYEVGEARRRMS 260 +SP+ +Q+ L E G +V +ARR S Sbjct: 699 DKSPLGYQKHGLPETGRDVEQARRAFS 725 >gb|EOY03077.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 718 Score = 295 bits (755), Expect = 5e-77 Identities = 205/457 (44%), Positives = 245/457 (53%), Gaps = 30/457 (6%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPE--TISGAAVKRIQVKRNAGSDXXXXXXXXXXX 1340 RPGKMVSVPATVSSL MDKS N G E T + A+KRI VKRN G Sbjct: 267 RPGKMVSVPATVSSLVMDKSTNGAAGVEAPTTTANAIKRISVKRNVGE------AAVGSR 320 Query: 1339 XXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSE 1160 +N N SRS+SRKAEHSPYRRNPL EID N ++ Sbjct: 321 GTASPRSQSPARTNPNANNPKGCNENQLQPTLSRSSSRKAEHSPYRRNPLSEIDPNSLAY 380 Query: 1159 
QIPLS------------GLKVHNNNLSQ-------APASDSRISKGILDKNIISINCKEK 1037 + GLK + N L+ ++ S G D ++++N K Sbjct: 381 PQSAANKTSTCINKGQGGLKEYTNKLNVEMNNKVVVQGANKAGSIGTADNKVVNVNSTAK 440 Query: 1036 EQQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNP 857 EQ+ M V + G E KP +T D+NPE L NP Sbjct: 441 EQR-------------MVEEVKTEPPMPGAENPKPQTLTRSRSSRRSRDLDLNPETLLNP 487 Query: 856 APS-YTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDK 680 PS YT LLLEDIQNFHQ NN P+F+LP+CVSKACSILEAVADL AF +D+ Sbjct: 488 IPSSYTTLLLEDIQNFHQTNNPPSFSLPSCVSKACSILEAVADLNSTTSSNLSCAFSEDR 547 Query: 679 RRTPIADQFNKLYDNSSFGGQLA--GKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRR 506 + D SS G A G+K ET++PF++SEVVG DDLMEPSFHKYVTVRR Sbjct: 548 KGLST--------DESSKNGYNATVGRKMAETRDPFVESEVVGRDDLMEPSFHKYVTVRR 599 Query: 505 G-TLSGEDMEEQESSGSNSIAGGG-PQHWA-SSSSWEPNSADSIDCWTSSSRSNCREDSR 335 G TL G DMEEQESSGSNS G G QHW S SSWEPNSADS D WTS ++S ED Sbjct: 600 GATLGGTDMEEQESSGSNSFVGSGQQQHWGFSPSSWEPNSADSTDRWTSRTKSR-EEDHS 658 Query: 334 SPVPFQRLALSEP--GYEV-GEARRRMSVKKRDSDQQ 233 S + QR AL+EP G ++ R+ +S ++RD D Q Sbjct: 659 SSLEPQRQALAEPQSGSDIKNSTRKGLSGRRRDVDLQ 695 >gb|EOY03076.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 785 Score = 294 bits (753), Expect = 8e-77 Identities = 205/461 (44%), Positives = 246/461 (53%), Gaps = 34/461 (7%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPE--TISGAAVKRIQVKRNAGSDXXXXXXXXXXX 1340 RPGKMVSVPATVSSL MDKS N G E T + A+KRI VKRN G Sbjct: 330 RPGKMVSVPATVSSLVMDKSTNGAAGVEAPTTTANAIKRISVKRNVGE------AAVGSR 383 Query: 1339 XXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSE 1160 +N N SRS+SRKAEHSPYRRNPL EID N ++ Sbjct: 384 GTASPRSQSPARTNPNANNPKGCNENQLQPTLSRSSSRKAEHSPYRRNPLSEIDPNSLAY 443 Query: 1159 QIPLS------------GLKVHNNNLSQ-----------APASDSRISKGILDKNIISIN 1049 + GLK + N ++Q ++ S G D ++++N Sbjct: 444 PQSAANKTSTCINKGQGGLKEYTNVINQKLNVEMNNKVVVQGANKAGSIGTADNKVVNVN 503 Query: 1048 CKEKEQQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEA 869 KEQ+ M V + G E KP +T D+NPE Sbjct: 504 
STAKEQR-------------MVEEVKTEPPMPGAENPKPQTLTRSRSSRRSRDLDLNPET 550 Query: 868 LSNPAPS-YTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAF 692 L NP PS YT LLLEDIQNFHQ NN P+F+LP+CVSKACSILEAVADL AF Sbjct: 551 LLNPIPSSYTTLLLEDIQNFHQTNNPPSFSLPSCVSKACSILEAVADLNSTTSSNLSCAF 610 Query: 691 PDDKRRTPIADQFNKLYDNSSFGGQLA--GKKGLETKEPFLQSEVVGGDDLMEPSFHKYV 518 +D++ D SS G A G+K ET++PF++SEVVG DDLMEPSFHKYV Sbjct: 611 SEDRKGLST--------DESSKNGYNATVGRKMAETRDPFVESEVVGRDDLMEPSFHKYV 662 Query: 517 TVRRG-TLSGEDMEEQESSGSNSIAGGG-PQHWA-SSSSWEPNSADSIDCWTSSSRSNCR 347 TVRRG TL G DMEEQESSGSNS G G QHW S SSWEPNSADS D WTS ++S Sbjct: 663 TVRRGATLGGTDMEEQESSGSNSFVGSGQQQHWGFSPSSWEPNSADSTDRWTSRTKSR-E 721 Query: 346 EDSRSPVPFQRLALSEP--GYEV-GEARRRMSVKKRDSDQQ 233 ED S + QR AL+EP G ++ R+ +S ++RD D Q Sbjct: 722 EDHSSSLEPQRQALAEPQSGSDIKNSTRKGLSGRRRDVDLQ 762 >ref|XP_002263918.1| PREDICTED: uncharacterized protein LOC100261489 [Vitis vinifera] Length = 710 Score = 294 bits (753), Expect = 8e-77 Identities = 198/433 (45%), Positives = 238/433 (54%), Gaps = 4/433 (0%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATV +DK NN G E+ + AV+R+ VKRN+G Sbjct: 285 RPGKMVSVPATV----IDKGNNGSSGVESGNNGAVRRVLVKRNSGEVAASGSKTPRSRSP 340 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154 N + R++SRKAE SPYRRNPL EID NI + Sbjct: 341 ANARVVSNDSQNQHPSLS-------------RNSSRKAEQSPYRRNPLSEIDPNINNRG- 386 Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILD-KNIISINCKEKEQQNSITEEEKILQQAMNNN 977 L ++ + + D K ++ N S + + Q E K LQ N+ Sbjct: 387 -LKAREIEPDCQQKPNMKDMNNGKVVVHGTNNRSSSRGKVFQVVEEAGEPKGLQPRTNSI 445 Query: 976 VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKN- 800 VV SG E LKP +T D+NPE L NP PSYT LLLEDIQNFHQKN Sbjct: 446 ETTIVVASGAESLKPQALTRTRSSRRSRDLDLNPETLLNPTPSYTTLLLEDIQNFHQKNT 505 Query: 799 NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620 TP+ +LPACVSKA SILEAVADL AF DD+R F + + NS Sbjct: 506 TTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDRR------NFTETHQNSMDDK 
559 Query: 619 QLAGKKGLETKEPF-LQSEVVGGDDLMEPSFHKYVTVRRGTL-SGEDMEEQESSGSNSIA 446 AGKK LE K+PF ++SE+V +DLMEPS HKYVTV+RGT+ G +MEEQESSGSNS Sbjct: 560 NPAGKKRLEAKDPFVVESEIVVCNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNSFV 619 Query: 445 GGGPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRR 266 G H SWEPNSADS DCWT SRSN RE+ SPV FQR ALSEPG E E ++R Sbjct: 620 GVSQLH-----SWEPNSADSTDCWT--SRSNTREEYPSPVCFQRHALSEPGRESEETQKR 672 Query: 265 MSVKKRDSDQQQN 227 M +K++ D QQN Sbjct: 673 MGRRKKEIDHQQN 685 >ref|XP_002321932.2| hypothetical protein POPTR_0015s12740g [Populus trichocarpa] gi|550322594|gb|EEF06059.2| hypothetical protein POPTR_0015s12740g [Populus trichocarpa] Length = 736 Score = 290 bits (743), Expect = 1e-75 Identities = 200/446 (44%), Positives = 239/446 (53%), Gaps = 28/446 (6%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGK+VSVPATVSSL +DKSNN G E + A ++RI VKRN G Sbjct: 290 RPGKLVSVPATVSSLVVDKSNN---GVEPQATAGIRRISVKRNVGEAALTCSRMVASPSS 346 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVS-EQ 1157 SN+N RSNSRKA+ SPYRRNPL EID N + Q Sbjct: 347 KSPARTNAKTSNENNQQPSLS----------RSNSRKADQSPYRRNPLSEIDLNSLQYSQ 396 Query: 1156 IPLSGLKVHNNN---------------------LSQAPASDSRISKGILDKNIISINCKE 1040 P + +NN L+Q P K N NC+ Sbjct: 397 PPANKATCTSNNRARIRNKDIEGQVVVKESFNLLNQTPMKKQNSEKNNR-VNAQVTNCRG 455 Query: 1039 KE---QQNSITEEEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEA 869 +N I++E+++ + VV G E LKP +T D+NPE Sbjct: 456 SSIVSLENKISKEQQMEEAKGQPTDMTTVVDLGVESLKPQTLTRSRSARRSRDLDLNPET 515 Query: 868 LSNPAPSYTALLLEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFP 689 L NP PSYTALLLEDIQNFH K NTP+F+LPACV+KACSILEAVADL AF Sbjct: 516 LLNPTPSYTALLLEDIQNFHLK-NTPSFSLPACVTKACSILEAVADLNSTTSSNLSCAFS 574 Query: 688 DDKRRTPIADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVR 509 D+R P N L GKK E K+PF++SEV+ DDL+EPSFHKYVTVR Sbjct: 575 YDRRSPPTVAAAN-----------LVGKKPPEAKDPFVESEVLASDDLIEPSFHKYVTVR 623 Query: 508 
R-GTLSGEDMEEQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-EDS 338 R GTL GEDM+ QESSG +S+ GG QH S+SSWEPNSADSID WT SRSN R ED Sbjct: 624 RAGTLCGEDMDGQESSGRDSVVGGSQQHLGFSTSSWEPNSADSIDHWT--SRSNWRDEDE 681 Query: 337 RSPVPFQRLALSEPGYEVGEARRRMS 260 +SP+ FQ+ LSE +V +ARR S Sbjct: 682 KSPLGFQKHELSETWRDVEQARRPFS 707 >emb|CAN76723.1| hypothetical protein VITISV_042980 [Vitis vinifera] Length = 685 Score = 288 bits (736), Expect = 8e-75 Identities = 197/431 (45%), Positives = 236/431 (54%), Gaps = 4/431 (0%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATV +DK NN G E+ + AV+R+ VKRN+G Sbjct: 285 RPGKMVSVPATV----IDKGNNGSSGVESGNNGAVRRVLVKRNSGEVAASGSKTPRSRSP 340 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154 N + R++SRKAE SPYRRNPL EID NI + Sbjct: 341 ANARVVSNXNQNQHPSLS-------------RNSSRKAEQSPYRRNPLSEIDPNINNRG- 386 Query: 1153 PLSGLKVHNNNLSQAPASDSRISKGILD-KNIISINCKEKEQQNSITEEEKILQQAMNNN 977 L ++ + + D K ++ N S + + Q E K LQ N+ Sbjct: 387 -LKAREIEPDCQQKPNMKDMNNGKVVVHGSNNRSSSRGKVFQVVEEAGEPKGLQPRTNSI 445 Query: 976 VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKN- 800 VV SG E LKP +T D+NPE L N PSYT LLLEDIQNFHQKN Sbjct: 446 ETTIVVASGAESLKPQALTRTRSSRRSRDLDLNPETLLNLTPSYTTLLLEDIQNFHQKNT 505 Query: 799 NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNSSFGG 620 TP+ +LPACVSKA SILEAVADL AF DD+R F + + NS Sbjct: 506 TTPSISLPACVSKAHSILEAVADLNSCTSSNPSYAFSDDRR------NFTETHQNSMDDK 559 Query: 619 QLAGKKGLETKEPF-LQSEVVGGDDLMEPSFHKYVTVRRGTL-SGEDMEEQESSGSNSIA 446 AGKK LE K+PF ++SE+V +DLMEPS HKYVTV+RGT+ G +MEEQESSGSNS Sbjct: 560 NPAGKKRLEAKDPFVVESEIVVCNDLMEPSLHKYVTVKRGTIGGGGEMEEQESSGSNSFV 619 Query: 445 GGGPQHWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGEARRR 266 G H SWEPNSADS DCWT SRSN RE+ SPV FQR ALSEPG E E ++R Sbjct: 620 GVSQLH-----SWEPNSADSTDCWT--SRSNTREEYPSPVCFQRHALSEPGRESEETQKR 672 Query: 265 MSVKKRDSDQQ 233 M +KR+ D Q Sbjct: 673 MGRRKREIDHQ 683 >ref|XP_002530557.1| hypothetical protein RCOM_0303940 
[Ricinus communis] gi|223529895|gb|EEF31825.1| hypothetical protein RCOM_0303940 [Ricinus communis] Length = 725 Score = 283 bits (725), Expect = 1e-73 Identities = 202/469 (43%), Positives = 246/469 (52%), Gaps = 17/469 (3%) Frame = -2 Query: 1513 RPGK-MVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXX 1337 RPGK MVSVPATVSSL MDKSN G E + VKRI VKRN G Sbjct: 289 RPGKKMVSVPATVSSLTMDKSNI---GVEPQAANGVKRISVKRNVGGGEAGSRSAASPRS 345 Query: 1336 XXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTN-IVSE 1160 N SRS+SRKAE SPYRRNPL EIDTN +V Sbjct: 346 QSPA--------RTNAKGGGSNENNQQQPSLSRSSSRKAEQSPYRRNPLSEIDTNSLVYA 397 Query: 1159 QIPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNN 980 Q + +NN+ S+A + + ++ K +++ + + + + KI Q N Sbjct: 398 QATGNNTTANNNSNSRAQTRNKELEGKLMVKESVNVLNQAQMHKPNAEANSKINAQGSNK 457 Query: 979 NVAVDVV---GSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFH 809 V V SG + LKP V D NPE NP PSYTALLLEDIQNFH Sbjct: 458 GVKEQTVTAEASGAD-LKPQTVARSRSARRSRDLDFNPETSLNPNPSYTALLLEDIQNFH 516 Query: 808 QKN-----NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKL 644 QK+ NTP+F++PACV+KACSI+EAVADL AF D+KR Sbjct: 517 QKSTNTNTNTPSFSVPACVTKACSIVEAVADLNSTTSSNLSCAFSDEKRSP--------- 567 Query: 643 YDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRR-----GTLSGEDME 479 ++ L GKK E K+PF++SEV+ DDLMEPSFHKYVTVRR GT S EDM+ Sbjct: 568 ---TTVVSNLVGKKLEEGKDPFVESEVLVNDDLMEPSFHKYVTVRRGGNGKGTSSVEDMD 624 Query: 478 EQESSGSNSIAGGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCR-EDSRSPVPFQRLAL 305 QESSGSNS G QHW S+SSWEPNSADS D WT SRSN R E+ +SP+ FQ+ Sbjct: 625 GQESSGSNSFVGSSQQHWGYSTSSWEPNSADSTDRWT--SRSNTRDEEEKSPLGFQKHTS 682 Query: 304 SEPGYEVGEARRRMSVKKRDSDQQQNXXXXXXXXXXGVQSLPTAAAAAS 158 SE G ++ EARR S Q+ + S P AAA++ Sbjct: 683 SESGRDMEEARRGF------SGQRNGIGRGRVGSSKNLNSTPIVAAAST 725 >ref|XP_006429727.1| hypothetical protein CICLE_v10011149mg [Citrus clementina] gi|568855457|ref|XP_006481321.1| PREDICTED: serine/arginine repetitive matrix protein 2-like [Citrus sinensis] gi|557531784|gb|ESR42967.1| hypothetical protein 
CICLE_v10011149mg [Citrus clementina] Length = 740 Score = 272 bits (696), Expect = 3e-70 Identities = 212/488 (43%), Positives = 248/488 (50%), Gaps = 59/488 (12%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRN------------AGSDX 1370 RPGKMVSVPATV+ SN++ VKRI VKRN A S Sbjct: 258 RPGKMVSVPATVAVEPATASNSS----------GVKRISVKRNVGEAAGAVGSRMAASPR 307 Query: 1369 XXXXXXXXXXXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNS-RKAEHSPYRRNP 1193 S+ NS RKAEHSPYRRNP Sbjct: 308 SKSPARVNGNNVKEQQHPSLSRSSSRKGEQHSPYRRNPSSEIDHPNSTRKAEHSPYRRNP 367 Query: 1192 LGEIDTNIVSEQIPLSGL----------KVHN-----------------NNLSQAP---A 1103 L EID N S Q P S +V N N L QAP Sbjct: 368 LSEIDPN--SLQYPQSACNNKASNVITNRVRNKSRDFEGEGVFVRDSSANVLYQAPIHKP 425 Query: 1102 SDSRISKG-----------ILDKNIISINCKEKEQQNSITEEEKILQQAMNNNVAVDVVG 956 + I++G L+ + N EKEQ+ I EE+K Q M N AV Sbjct: 426 NAENIAQGTNNHKSSCRGTTLNNKVTGANITEKEQR-QILEEDK-AQLPMTANAAVVTES 483 Query: 955 SGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQKNNTPAFTLP 776 P+ L T D+NPE L NP PSYTALLLEDIQNFHQK +TP+ +LP Sbjct: 484 QKPQTLTR---TRSSRRSRDLDLDLNPETLLNPTPSYTALLLEDIQNFHQK-STPSVSLP 539 Query: 775 ACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQF-NKLYDNSSFGGQLAGKKG 599 ACV+KACSILEAVADL AF +D R+ P ADQ NK N S G L GKK Sbjct: 540 ACVTKACSILEAVADLNSTTSSNLSCAFSED-RKPPSADQSNNKNAYNFSAGVNLVGKKM 598 Query: 598 LETKEPFLQSEVVGGDDLMEPSFHKYVTVRRG--TLSGEDMEEQESSGSNSIAG-GGPQH 428 E K+PF++SEV+ DDLMEPSFH+YVTVRRG L G DM+ QESSGSNS G Q+ Sbjct: 599 TEAKDPFVESEVLADDDLMEPSFHRYVTVRRGGSELGGVDMDGQESSGSNSFVGCTTQQN 658 Query: 427 WASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE-PGYEVGEARRRMSVKK 251 W SSSSWEPNSADS D WT SRSN +E+ +SP+ FQR A+SE G E + R+ S K+ Sbjct: 659 WTSSSSWEPNSADSTDRWT--SRSNMKEEDQSPLGFQRQAMSEAAGCEATKNRKGFSGKR 716 Query: 250 RDSDQQQN 227 RD+D QQN Sbjct: 717 RDTDYQQN 724 >ref|XP_006602044.1| PREDICTED: serine/arginine repetitive matrix protein 2-like [Glycine max] Length = 725 Score = 264 bits (675), Expect = 9e-68 Identities = 186/444 (41%), Positives = 247/444 (55%), Gaps = 15/444 (3%) 
Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNN---AEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXX 1343 RPGKMVSVPATVSSL MDKSNN GG E+ + +KRI VKRN G+ Sbjct: 279 RPGKMVSVPATVSSLVMDKSNNNGGGGGGGESGATTGIKRITVKRNVGA---ASPRSQSP 335 Query: 1342 XXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVS 1163 N+N RSNSRKAE SPY+RNPL EI+ N ++ Sbjct: 336 ARANGNAASGNKAFNENQQQPSLS----------RSNSRKAEQSPYKRNPLSEIEPNSLA 385 Query: 1162 ---EQIPLSGLKVHNNNLSQAPASDSRISKG---ILDKNIISINCKEKEQQNSITEEEKI 1001 S KV N + ++ + G LDK + ++NCK K QQ EE+ Sbjct: 386 FPHSTANNSSSKVQNRPKKEFETEANQKTNGSRTALDKGM-NVNCKTKVQQ----EEDVK 440 Query: 1000 LQQAMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLE 827 +Q ++ +NV V +V G + LKP + +T D+NPEAL NP SY +LLLE Sbjct: 441 VQSSITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRQSRDLDLNPEALLNPPQSYASLLLE 500 Query: 826 DIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNK 647 DIQNFHQK NTP +LPACV+KACSILEAVADL A + RR+P+A Q ++ Sbjct: 501 DIQNFHQK-NTPPVSLPACVTKACSILEAVADLNSNAGLNFCGA---EDRRSPLAFQCSR 556 Query: 646 LYDNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDMEEQ 473 N S GK+ + ++P ++S ++ DD+ME S HKYVTV R G L G DM++Q Sbjct: 557 NDYNVSLTTHDYGKREPDAEDPVVESMLLFNDDDVMEQSLHKYVTVNRGGLLGGVDMDDQ 616 Query: 472 ESSGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE 299 ESSGSNS G Q W SSSSWEP+S +S DCWT SRSN ++ + + SE Sbjct: 617 ESSGSNSFTVSSGQQRWGVSSSSWEPSSVESKDCWT--SRSNYSKEEGQKLGLEGRVASE 674 Query: 298 PGYEVGEARRRMSVKKRDSDQQQN 227 G + GEA+++++ ++R+ D Q+ Sbjct: 675 AGLDAGEAKKKLNSQRRECDHHQH 698 >ref|XP_006591278.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X2 [Glycine max] Length = 733 Score = 260 bits (665), Expect = 1e-66 Identities = 192/472 (40%), Positives = 254/472 (53%), Gaps = 20/472 (4%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATVSSL MDKSNN G E+ + +KRI VKRN G+ Sbjct: 287 RPGKMVSVPATVSSLVMDKSNN-NGSGESGATTGIKRIAVKRNVGA-------------A 332 Query: 1333 
XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEID-------- 1178 + + SRSNSRKAE SPY+RNPL EI+ Sbjct: 333 SPRSQSPARANGNGANGNKAFSENQQQPSLSRSNSRKAEQSPYKRNPLSEIEPNSLAFPH 392 Query: 1177 --TNIVSEQIPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEK 1004 TN S ++ K +Q + +R + DK + +INCK K QQ EE+ Sbjct: 393 STTNNSSSRVQNRPKKEFETEANQQKTNGNRTAS---DKGV-TINCKTKVQQ----EEDV 444 Query: 1003 ILQQAMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXDINPEALSNPAP-SYTALL 833 +Q ++ +NV V +V G + LKP + +T DIN EAL NP P SY +LL Sbjct: 445 KVQSSITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRRSQELDINCEALLNPPPQSYASLL 504 Query: 832 LEDIQNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQF 653 LEDIQNFHQKN TP +LPACV+KACSILEAVADL S + RR+P+A Q Sbjct: 505 LEDIQNFHQKN-TPPVSLPACVTKACSILEAVADLNSNAGLNFCSG---EDRRSPLAFQC 560 Query: 652 NKLYDNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDME 479 ++ N GK+ + ++P ++S +V DD+MEP+ HKYVTV R G+L G DM+ Sbjct: 561 SRNDYNVPLTTNDYGKREPDAEDPVVESMLVFNDDDVMEPNLHKYVTVNRGGSLGGADMD 620 Query: 478 EQESSGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLAL 305 +QESSGSNS G QHW SSSSWEP+S +S DCWTS S + E RSP+ + Sbjct: 621 DQESSGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEECQRSPLGLEGTVA 680 Query: 304 SE-PGYEVGEARRRMSVKKRDSDQQ--QNXXXXXXXXXXGVQSLPTAAAAAS 158 SE G + G A+++++ ++R+ D Q + ++P AAAS Sbjct: 681 SEVAGRDAGGAKKKLNSQRRECDHQHGSGIGRGRLGANKVLHNIPVVTAAAS 732 >ref|XP_003537379.1| PREDICTED: serine/arginine repetitive matrix protein 2-like isoform X1 [Glycine max] Length = 732 Score = 260 bits (664), Expect = 2e-66 Identities = 191/468 (40%), Positives = 254/468 (54%), Gaps = 16/468 (3%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATVSSL MDKSNN G E+ + +KRI VKRN G+ Sbjct: 287 RPGKMVSVPATVSSLVMDKSNN-NGSGESGATTGIKRIAVKRNVGA-------------A 332 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVS--- 1163 + + SRSNSRKAE SPY+RNPL EI+ N ++ Sbjct: 333 SPRSQSPARANGNGANGNKAFSENQQQPSLSRSNSRKAEQSPYKRNPLSEIEPNSLAFPH 392 Query: 1162 
EQIPLSGLKVHNNNLSQAPASDSRISKG---ILDKNIISINCKEKEQQNSITEEEKILQQ 992 S +V N + ++ + G DK + +INCK K QQ EE+ +Q Sbjct: 393 STTNNSSSRVQNRPKKEFETEANQKTNGNRTASDKGV-TINCKTKVQQ----EEDVKVQS 447 Query: 991 AMNNNVAVD-VVGSGPEFLKP-HGVTXXXXXXXXXXXDINPEALSNPAP-SYTALLLEDI 821 ++ +NV V +V G + LKP + +T DIN EAL NP P SY +LLLEDI Sbjct: 448 SITDNVVVKTMVPPGVDNLKPPYTLTRSRSSRRSQELDINCEALLNPPPQSYASLLLEDI 507 Query: 820 QNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLY 641 QNFHQKN TP +LPACV+KACSILEAVADL S + RR+P+A Q ++ Sbjct: 508 QNFHQKN-TPPVSLPACVTKACSILEAVADLNSNAGLNFCSG---EDRRSPLAFQCSRND 563 Query: 640 DNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRR-GTLSGEDMEEQES 467 N GK+ + ++P ++S +V DD+MEP+ HKYVTV R G+L G DM++QES Sbjct: 564 YNVPLTTNDYGKREPDAEDPVVESMLVFNDDDVMEPNLHKYVTVNRGGSLGGADMDDQES 623 Query: 466 SGSNSI-AGGGPQHW-ASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSE-P 296 SGSNS G QHW SSSSWEP+S +S DCWTS S + E RSP+ + SE Sbjct: 624 SGSNSFTVSSGQQHWGVSSSSWEPSSVESKDCWTSRSNYSKEECQRSPLGLEGTVASEVA 683 Query: 295 GYEVGEARRRMSVKKRDSDQQ--QNXXXXXXXXXXGVQSLPTAAAAAS 158 G + G A+++++ ++R+ D Q + ++P AAAS Sbjct: 684 GRDAGGAKKKLNSQRRECDHQHGSGIGRGRLGANKVLHNIPVVTAAAS 731 >gb|ESW18934.1| hypothetical protein PHAVU_006G083400g [Phaseolus vulgaris] Length = 718 Score = 255 bits (652), Expect = 4e-65 Identities = 175/440 (39%), Positives = 237/440 (53%), Gaps = 13/440 (2%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATVSSL MDKSNN GG E+ + +KRI VKRN G+ Sbjct: 275 RPGKMVSVPATVSSLVMDKSNNNGGGGESAATTGIKRITVKRNVGA---ASPRSQSPARA 331 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQI 1154 N+N RS+SRKAE SPY+RNPL EI+ N ++ Sbjct: 332 NGNAANANKAFNENQPPPSLS----------RSSSRKAEQSPYKRNPLSEIEPNSLA--F 379 Query: 1153 PLSGLKVHNNNLSQAPASD--------SRISKGILDKNI-ISINCKEKEQQNSITEEEKI 1001 P S +++ + P + + S+ LDK + ++ N K + + + K+ Sbjct: 380 PHSTANNNSSRVQNRPKKEFETEAIQRTNSSRTALDKGMTVTYNTKVQPEGDI-----KV 434 Query: 1000 
LQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDI 821 +N V +V G + LKPH +T D+NPEAL NP SY +LLLEDI Sbjct: 435 QSLITDNAVVKTMVPPGLDNLKPHKLTRSRSSRRSQDLDLNPEALLNPPQSYASLLLEDI 494 Query: 820 QNFHQKNNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLY 641 QNFHQK+ TP +LPACV+KACSILEAVA+L A + RR+P Q ++ Sbjct: 495 QNFHQKS-TPPVSLPACVTKACSILEAVAELNSNTNLNFGGA---EDRRSPPTFQCSRND 550 Query: 640 DNSSFGGQLAGKKGLETKEPFLQSEVV-GGDDLMEPSFHKYVTVRRG-TLSGEDMEEQES 467 N GK+ + ++P ++S +V DD++E S HKYVTV RG ++ G DME+QES Sbjct: 551 YNVPLTANDYGKREPDAEDPVVESMLVFNDDDVLESSLHKYVTVNRGGSVGGVDMEDQES 610 Query: 466 SGSNSIA-GGGPQHWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPG 293 SGSNS G G Q W SSSSWEP+S +S DCWTS + E +SP+ + SE G Sbjct: 611 SGSNSFTVGNGQQQWGISSSSWEPSSVESRDCWTSRLNYSREEGQKSPLGLEGSVSSETG 670 Query: 292 YEVGEARRRMSVKKRDSDQQ 233 +V AR++++ R+ D Q Sbjct: 671 CDVDGARKKLNSNGRECDHQ 690 >ref|XP_004301811.1| PREDICTED: uncharacterized protein LOC101307599 [Fragaria vesca subsp. vesca] Length = 683 Score = 243 bits (621), Expect = 2e-61 Identities = 190/465 (40%), Positives = 235/465 (50%), Gaps = 13/465 (2%) Frame = -2 Query: 1513 RPGKM--VSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXX 1340 RPGKM VSVPATV MDK++N E + ++KRI VKRNAG D Sbjct: 275 RPGKMKMVSVPATV----MDKNSNGESA----TTGSIKRISVKRNAG-DAVNVTVGSRTA 325 Query: 1339 XXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSE 1160 +N RS+SRKAE SPYRRNPL E+D N Sbjct: 326 ASPRSQSPARGGANAKASNDSLQPSLS------RSSSRKAEQSPYRRNPLSELDPN---- 375 Query: 1159 QIPLSGLKVHNNNLSQAPASDSRISKGILD--KNIISINCKEKEQQNSITEEEKILQQAM 986 L+ + H NN ++++ S +L+ K + I C + Q I AM Sbjct: 376 --SLAYPQAHINN------TNNKSSCNVLNQLKPNVEITCNKIITQG-INYRSSTASSAM 426 Query: 985 NNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQ 806 +N V SG + LK +T DINP+ LSNP PSYT LLLEDIQNFHQ Sbjct: 427 DNKVVEPAGASGVDCLKHQTLTRSRSSRRSRDLDINPQTLSNPPPSYTRLLLEDIQNFHQ 486 Query: 805 K-NNTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNK--LYDN 635 + +N +LP CV+KACSILEAVADL F D R++P DQ NK Y N Sbjct: 
487 QSSNAAVVSLPQCVTKACSILEAVADLNSTTN------FSAD-RKSPSIDQINKSSCYYN 539 Query: 634 SSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSN 455 S +K + PF++SEV+ GDDL+ PSFHKYVTVRRG G DME+QESSGSN Sbjct: 540 CSLDANPVPRKDI----PFVESEVLVGDDLVAPSFHKYVTVRRG---GTDMEDQESSGSN 592 Query: 454 SIAGGGPQ-HWASSSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLALSEPGYEVGE 278 S G Q W SSSWEPNSADS DCWT SRS+ RED ++ +++ E Sbjct: 593 SFVSGSQQPQWGLSSSWEPNSADSTDCWT--SRSSTREDDQN-------------FDMDE 637 Query: 277 -ARRRMSVKKRDSDQQQNXXXXXXXXXXGVQS----LPTAAAAAS 158 ARRR+S +K D Q+ +P AAAAS Sbjct: 638 AARRRLSRRKTDGQNTQSSCGIGRGKLAAASKGLPIMPVVAAAAS 682 >gb|ESW13982.1| hypothetical protein PHAVU_008G243000g [Phaseolus vulgaris] Length = 652 Score = 239 bits (610), Expect = 3e-60 Identities = 172/403 (42%), Positives = 213/403 (52%), Gaps = 10/403 (2%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVP TVSSLAMDKSNN G T KRI VKRN G Sbjct: 247 RPGKMVSVPPTVSSLAMDKSNNCGGESGT------KRITVKRNVGD------VGSRGAAS 294 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNS-RKAEHSPYRRNPLGEIDTNIVSEQ 1157 N SR+NS RKAE SPYRRNPL E+D N +Q Sbjct: 295 PRTQSPARVNGNVASARVLSENQQHQQPSLSRNNSSRKAEQSPYRRNPLSEVDNNSKVQQ 354 Query: 1156 IPLSGLKVHNNNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAMNNN 977 N ++A A + L+K + ++NCK KE ++ + + Sbjct: 355 ---------NKPKTEAEAMQKPNGRVALEKGV-TVNCKTKEHHEDVSLDSAV-------- 396 Query: 976 VAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSN--PAPSYTALLLEDIQNFHQK 803 V V SG + LKP G+T DINPE++ N P SY +LLLEDIQNFHQK Sbjct: 397 VKTTVASSGVDNLKPQGLTRSRSSRRSRDLDINPESVVNVNPTHSYASLLLEDIQNFHQK 456 Query: 802 NNT--PAFT-LPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKLYDNS 632 N P+ T LPAC++KACSI+EAV DL AF +D R++P Q Sbjct: 457 NTPQQPSSTSLPACLTKACSIIEAVGDLSYTTSSNFSGAFSED-RKSPSTQQ-------- 507 Query: 631 SFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRGTLSGEDMEEQESSGSNS 452 SF GKK +K+PF++SEV GDD+MEPS HKYVTV+RG+ + DM++QESSGSNS Sbjct: 508 SFRNGYYGKKVQGSKDPFVESEVDVGDDVMEPSLHKYVTVKRGS-AVVDMDDQESSGSNS 566 Query: 451 
--IAGGGPQHWA--SSSSWEPNSADSIDCWTSSSRSNCREDSR 335 ++ G HW S SSWEPNSADS D WT SR + RE+ + Sbjct: 567 FTVSSSGQHHWGAISCSSWEPNSADSTDSWT--SRLSSREEGQ 607 >gb|EXC20585.1| hypothetical protein L484_027140 [Morus notabilis] Length = 676 Score = 239 bits (609), Expect = 4e-60 Identities = 189/416 (45%), Positives = 211/416 (50%), Gaps = 25/416 (6%) Frame = -2 Query: 1513 RPGKMVSVPATVSS-LAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXX 1337 RPGKMVSVPATVSS L MDKSNN + S +KRI VKRN G Sbjct: 252 RPGKMVSVPATVSSSLVMDKSNNMDSAANANS---IKRISVKRNVGEAGSRGAASPRSQS 308 Query: 1336 XXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIVSEQ 1157 SN+ SR++SRKAE SPYRRNPL EID N +S Sbjct: 309 PARGGNGNAKSSNE----------PQAQPSLSRNSSRKAEQSPYRRNPLSEIDPNSLSYP 358 Query: 1156 IPLSGLKVHNNNLSQA------------PASDSRISKGILDKNIISINCKEKEQQNSITE 1013 P HNNN + P D I L N + + N Sbjct: 359 NP------HNNNGNNGRAQSKSKRETCVPEEDENILVKELPTQAQKPNAETNYRSNGRVS 412 Query: 1012 EEKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEAL--SNPAPSYTA 839 E Q V VV SG + +T DINPE L NP PSYT Sbjct: 413 AENKNSQPKQAMVETTVVISGADNKPQQTLTRSRSSRRSRDLDINPETLLNPNPTPSYTR 472 Query: 838 LLLEDIQNFHQKNN---TPAFTLPACVSKACSILEAVADL-XXXXXXXXXSAFPDDKRRT 671 LLLEDIQNFHQKNN T +LP CVSKACSILEAVADL SAF + Sbjct: 473 LLLEDIQNFHQKNNNATTAVVSLPPCVSKACSILEAVADLNSATGSNLSCSAFSE----- 527 Query: 670 PIADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEV-VGGDDLMEPSFHKYVTVRRGTLS 494 DQFNK N+++ L KEPF++SEV VG DDL EPSFHKYVTVRRG S Sbjct: 528 ---DQFNK-GTNNAYSSLLG-----PAKEPFVESEVIVGSDDLTEPSFHKYVTVRRGGGS 578 Query: 493 G---EDMEEQESSGSNSIAGGGP-QHWA-SSSSWEPNSADSIDCWTSSSRSNCRED 341 G D E+QESSGSNSIAGG Q+W SSSSWEPNSADS DC S+SRSN RE+ Sbjct: 579 GGLVVDAEDQESSGSNSIAGGSQIQNWVLSSSSWEPNSADSTDC--STSRSNNREE 632 >ref|XP_003518355.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max] Length = 678 Score = 231 bits (590), Expect = 6e-58 Identities = 177/447 (39%), Positives = 229/447 (51%), Gaps = 18/447 (4%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAEGGPETISGAAVKRIQVKRNAGSDXXXXXXXXXXXXX 1334 RPGKMVSVPATVSSL MDKSN+ G T 
K VKRN G Sbjct: 249 RPGKMVSVPATVSSLVMDKSNSCGGDSGT-----KKITTVKRNVGD---AGSKGAASPRA 300 Query: 1333 XXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNS-RKAEHSPYRRNPLGEIDTNIV--S 1163 D SR+NS RK E SPYRRNP E+D N + Sbjct: 301 QSPARVNGNVGRDKMLNENLQQQHQQQPSLSRNNSSRKVEQSPYRRNPQSEVDHNSSRKA 360 Query: 1162 EQIPLSGLKVHNNNLS-QAPASDSRISKGILDKNIISINCKEKEQQNSITEEEKILQQAM 986 EQ P S KV N +A A + L+K + S+NCK KEQ EE + A+ Sbjct: 361 EQSPYSNSKVQQNKPKIEAEAIQKPNGRVALEKGV-SVNCKTKEQHEE--EESSVPISAV 417 Query: 985 NNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPAPSYTALLLEDIQNFHQ 806 V V SG + LKP G+T + + +N SY +LLLEDIQNFHQ Sbjct: 418 ---VKTTAVSSGVDNLKPQGLTRSRSSRR------SRDLDTNATNSYASLLLEDIQNFHQ 468 Query: 805 KNNTP------AFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTPIADQFNKL 644 KN + +LPAC++K CSILEAVADL F +DKR +P Q N + Sbjct: 469 KNTQQQQQQPSSVSLPACLNKVCSILEAVADLNSTTSSN----FTEDKR-SPSTQQSN-I 522 Query: 643 YDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRR-GTLSGEDMEEQES 467 ++ +G ++AG K+PF++SEV DD+MEPS HKYVTV+R G + EDME+QES Sbjct: 523 RNDEYYGKKVAGSN----KDPFVESEVAVSDDVMEPSLHKYVTVKRGGGVVVEDMEDQES 578 Query: 466 SGSNSI---AGGGPQHWAS----SSSWEPNSADSIDCWTSSSRSNCREDSRSPVPFQRLA 308 SGSNS + G HW + SSSWEPNSADS DCWTSS S+ E+++ + Sbjct: 579 SGSNSFTVSSSSGQHHWGNNISCSSSWEPNSADSTDCWTSSRLSSREEEAQKTPLGLGCS 638 Query: 307 LSEPGYEVGEARRRMSVKKRDSDQQQN 227 LS E + ++ ++ K+R+ D + + Sbjct: 639 LSS---EAKKKKKGLNSKRRECDHEHS 662 >ref|XP_004155763.1| PREDICTED: uncharacterized protein LOC101224225 [Cucumis sativus] Length = 750 Score = 223 bits (567), Expect = 3e-55 Identities = 171/453 (37%), Positives = 225/453 (49%), Gaps = 25/453 (5%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAE----GGPETISGAAVKRIQVKRNAGSDXXXXXXXXX 1346 RP KMVSVPATVS DK+N+A GG ++ + VKRI VKRN G Sbjct: 297 RPAKMVSVPATVSHAETDKNNSAANVGCGGNDSATVTGVKRISVKRNVGEATAMTGSRVA 356 Query: 1345 XXXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIV 1166 N SRS+SRKAE SPYRRNPLGEIDTN Sbjct: 357 SSPRSQSPAR-------NNGNVKASDENQQQPSLSRSSSRKAEQSPYRRNPLGEIDTNSQ 409 Query: 1165 
SEQIPLSGLKVHN---------NNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITE 1013 + K N ++Q P +D + ++ + +N + + T Sbjct: 410 QHNRIQNRSKKETEEVIAKDSINGVNQRPKADPKSVNKVI---VSQVNGSKPSSTATATR 466 Query: 1012 E--EKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPA--PSY 845 I +N V VV E KP G+ DINPE L N + PSY Sbjct: 467 GVVNIITSTTPLSNTEVLVV----EHQKPQGLARSRSARHSRELDINPETLLNQSQTPSY 522 Query: 844 TALLLEDIQNFHQKN-NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTP 668 T +LL+DIQNFHQK+ NT +LPACV+KACSI+EAVADL SAF +++ P Sbjct: 523 TKMLLQDIQNFHQKSTNTNPVSLPACVTKACSIVEAVADLNSTTSSNFSSAFSENRSNPP 582 Query: 667 IADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRG----T 500 Y + + G L G E ++PF++SEV DD++EPSFHKYVTVRRG Sbjct: 583 TYQSSRNEY-SVPYSGSLKGTA--ELRDPFVESEVAMDDDILEPSFHKYVTVRRGGPVVA 639 Query: 499 LSGEDMEEQESSGSNSIAGGGPQ--HWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSP 329 G D ++QESSGSNS G Q W S++SWEPN+ADS D + +SR +E+ Sbjct: 640 AGGGDTDDQESSGSNSYVGSVQQQHQWGISTASWEPNTADSND--SRTSRQITKEEGHPH 697 Query: 328 VPFQRLALSEPGYEVGEARRRMSVKKRDSDQQQ 230 + S+PG + + RRR + ++RDSD Q+ Sbjct: 698 LQ------SKPGLDRDDNRRRTAERRRDSDAQR 724 >ref|XP_004140353.1| PREDICTED: uncharacterized protein LOC101206761 [Cucumis sativus] Length = 742 Score = 223 bits (567), Expect = 3e-55 Identities = 171/453 (37%), Positives = 225/453 (49%), Gaps = 25/453 (5%) Frame = -2 Query: 1513 RPGKMVSVPATVSSLAMDKSNNAE----GGPETISGAAVKRIQVKRNAGSDXXXXXXXXX 1346 RP KMVSVPATVS DK+N+A GG ++ + VKRI VKRN G Sbjct: 289 RPAKMVSVPATVSHAETDKNNSAANVGCGGNDSATVTGVKRISVKRNVGEATAMTGSRVA 348 Query: 1345 XXXXXXXXXXXXXXSNDNXXXXXXXXXXXXXXXXSRSNSRKAEHSPYRRNPLGEIDTNIV 1166 N SRS+SRKAE SPYRRNPLGEIDTN Sbjct: 349 SSPRSQSPAR-------NNGNVKASDENQQQPSLSRSSSRKAEQSPYRRNPLGEIDTNSQ 401 Query: 1165 SEQIPLSGLKVHN---------NNLSQAPASDSRISKGILDKNIISINCKEKEQQNSITE 1013 + K N ++Q P +D + ++ + +N + + T Sbjct: 402 QHNRIQNRSKKETEEVIAKDSINGVNQRPKADPKSVNKVI---VSQVNGSKPSSTATATR 458 Query: 1012 E--EKILQQAMNNNVAVDVVGSGPEFLKPHGVTXXXXXXXXXXXDINPEALSNPA--PSY 845 I +N V VV E KP G+ DINPE L N + PSY Sbjct: 459 
GVVNIITSTTPLSNTEVLVV----EHQKPQGLARSRSARHSRELDINPETLLNQSQTPSY 514 Query: 844 TALLLEDIQNFHQKN-NTPAFTLPACVSKACSILEAVADLXXXXXXXXXSAFPDDKRRTP 668 T +LL+DIQNFHQK+ NT +LPACV+KACSI+EAVADL SAF +++ P Sbjct: 515 TKMLLQDIQNFHQKSTNTNPVSLPACVTKACSIVEAVADLNSTTSSNFSSAFSENRSNPP 574 Query: 667 IADQFNKLYDNSSFGGQLAGKKGLETKEPFLQSEVVGGDDLMEPSFHKYVTVRRG----T 500 Y + + G L G E ++PF++SEV DD++EPSFHKYVTVRRG Sbjct: 575 TYQSSRNEY-SVPYSGSLKGTA--ELRDPFVESEVAMDDDILEPSFHKYVTVRRGGPVVA 631 Query: 499 LSGEDMEEQESSGSNSIAGGGPQ--HWA-SSSSWEPNSADSIDCWTSSSRSNCREDSRSP 329 G D ++QESSGSNS G Q W S++SWEPN+ADS D + +SR +E+ Sbjct: 632 AGGGDTDDQESSGSNSYVGSVQQQHQWGISTASWEPNTADSND--SRTSRQITKEEGHPH 689 Query: 328 VPFQRLALSEPGYEVGEARRRMSVKKRDSDQQQ 230 + S+PG + + RRR + ++RDSD Q+ Sbjct: 690 LQ------SKPGLDRDDNRRRTAERRRDSDAQR 716