BLASTX nr result
ID: Catharanthus23_contig00007516
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00007516 (1647 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EXC15604.1| hypothetical protein L484_006868 [Morus notabilis] 247 9e-63 ref|XP_004144556.1| PREDICTED: uncharacterized protein LOC101216... 236 3e-59 gb|EMJ16906.1| hypothetical protein PRUPE_ppa008901mg [Prunus pe... 235 5e-59 ref|XP_006469415.1| PREDICTED: RRP15-like protein-like isoform X... 233 1e-58 ref|XP_002264683.1| PREDICTED: uncharacterized protein LOC100249... 233 1e-58 ref|XP_006469413.1| PREDICTED: RRP15-like protein-like isoform X... 233 2e-58 ref|XP_004238678.1| PREDICTED: uncharacterized protein LOC101255... 230 1e-57 ref|XP_006447845.1| hypothetical protein CICLE_v10015959mg [Citr... 230 2e-57 ref|XP_006355975.1| PREDICTED: RRP15-like protein-like [Solanum ... 229 3e-57 ref|XP_002320535.1| hypothetical protein POPTR_0014s16860g [Popu... 228 4e-57 emb|CBI28475.3| unnamed protein product [Vitis vinifera] 226 2e-56 ref|XP_006603696.1| PREDICTED: uncharacterized protein LOC100780... 224 8e-56 ref|XP_003521708.1| PREDICTED: RRP15-like protein-like isoform X... 224 8e-56 ref|NP_001242109.1| uncharacterized protein LOC100780091 [Glycin... 224 8e-56 gb|ESW19049.1| hypothetical protein PHAVU_006G092400g [Phaseolus... 223 2e-55 gb|EOX93569.1| Uncharacterized protein isoform 2 [Theobroma cacao] 222 4e-55 gb|EOX93568.1| Uncharacterized protein isoform 1 [Theobroma cacao] 219 2e-54 ref|XP_002524600.1| conserved hypothetical protein [Ricinus comm... 216 2e-53 ref|XP_006395203.1| hypothetical protein EUTSA_v10004636mg [Eutr... 215 4e-53 gb|EOX93570.1| Uncharacterized protein isoform 3, partial [Theob... 215 5e-53 >gb|EXC15604.1| hypothetical protein L484_006868 [Morus notabilis] Length = 317 Score = 247 bits (631), Expect = 9e-63 Identities = 141/312 (45%), Positives = 171/312 (54%), Gaps = 3/312 (0%) Frame = +2 Query: 233 ELDKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGV---RKMDRKMKKMLQKKARD 403 E G KKRK + K + +K K KK E G +K DR +KK+ +K+ARD Sbjct: 9 ESTTGPKKRK--FGKRKNPKMMKTKKKKKKVSHFPEGVGELKPKKSDRDIKKLFRKRARD 66 Query: 404 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIDEGNQXXXXXXXXXXXXXQPGTMRFSDG 583 Y + + QPG +F++G Sbjct: 67 YNSDEEESEDKREENDGDGDSSEAAEELENQ-EVNKEDNEFSDVDEAGEIQPGITKFTEG 125 Query: 584 CKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKG 763 C++F+MAF LGPVLSAHKKLV EKLAEEEAERKVKG+ KKEK L+GEKG Sbjct: 126 CRAFRMAFKSIIKKAVNDDPLGPVLSAHKKLVAEKLAEEEAERKVKGQTKKEKHLVGEKG 185 Query: 764 HVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXXFF 943 HVKPA YLDSHEK LIG+AT+GVVKLFNAVNKAQH QKGLNP FF Sbjct: 186 HVKPATYLDSHEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSSFKDKKAIRKRRKEAFF 245 Query: 944 SQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDF 1123 ++L KT S K S+ D +GP WAPLRDNYMLT SKLKDWD+MP+ T D+ Sbjct: 246 TELGKTPSAAVNATAKLQTSSGQADDEGPAWAPLRDNYMLTSSKLKDWDKMPDKTVTDEI 305 Query: 1124 GMPQDTHTSSDE 1159 G + +S D+ Sbjct: 306 GNVSEDSSSDDD 317 >ref|XP_004144556.1| PREDICTED: uncharacterized protein LOC101216086 [Cucumis sativus] gi|449528421|ref|XP_004171203.1| PREDICTED: uncharacterized LOC101216086 [Cucumis sativus] Length = 320 Score = 236 bits (601), Expect = 3e-59 Identities = 142/325 (43%), Positives = 174/325 (53%), Gaps = 13/325 (4%) Frame = +2 Query: 227 AGELDKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDY 406 A EL +G K+RK ++N R + K K+D+KMKK+ QK+AR+Y Sbjct: 7 AVELVRGAKRRKKMGSRNNKRPRMMGGSGNKV-----------KIDKKMKKLFQKRAREY 55 Query: 407 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIDEG-----------NQXXXXXXXXXXXXX 553 DE N Sbjct: 56 NSDDDDDDGEKAPRVKKESKILVRSHEEEVGDEEFSEGEEERKDVNADVELSEDDENGEI 115 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F++GC++F+ AF LGP+LSA+KKLV EKLAEEEAERKVKGEA+ Sbjct: 116 QPGITKFTEGCRAFRAAFMSILKKNISDETLGPILSANKKLVAEKLAEEEAERKVKGEAR 175 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 K K+L+GEKGHVKPA YLDSHEK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 176 KAKQLVGEKGHVKPATYLDSHEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRTKDAKA 235 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K + +N K S D +GP WAPLRDNYMLT SKLKDWD+ Sbjct: 236 INKRRKEAFFSELGKPTLSATNSNAKLNTSGGAADTEGPAWAPLRDNYMLTNSKLKDWDK 295 Query: 1094 MPET--TDGDDFGMPQDTHTSSDED 1162 MP+ T +D G + +SSDED Sbjct: 296 MPDNMMTAAEDNGRVLE-DSSSDED 319 >gb|EMJ16906.1| hypothetical protein PRUPE_ppa008901mg [Prunus persica] Length = 315 Score = 235 bits (599), Expect = 5e-59 Identities = 139/308 (45%), Positives = 173/308 (56%), Gaps = 6/308 (1%) Frame = +2 Query: 257 RKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDYXXXXXXXXXX 436 RK K A K+ K L+ ++ +K+D+KM+K+ +K+ARDY Sbjct: 15 RKRKIGKKKGSKAKKKAKMLQSAESI-RAMKPKKIDKKMQKLYRKRARDYNSEDESEQDN 73 Query: 437 XXXXXXXXXXXXXXXXXXXXIDEGNQXXXXXXXXXXXXXQ------PGTMRFSDGCKSFQ 598 ++G + PG MRF++G +F+ Sbjct: 74 ATPLGNNEDELIGGSSSGEEAEKGGDHSERNVDNGFSDDEEHGEILPGIMRFTEGSNAFR 133 Query: 599 MAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPA 778 +AF LGPVLS KKLV EKLAEEE ERKVKGEAKKEK+L+ EKGHVKPA Sbjct: 134 LAFRSIIKKTVPEDVLGPVLSGQKKLVAEKLAEEENERKVKGEAKKEKQLVIEKGHVKPA 193 Query: 779 DYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXXFFSQLSK 958 +YLDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGLNP FFS+L K Sbjct: 194 NYLDSHEKFLIGVATKGVVKLFNAVNKAQNAQKGLNPSKFKDAKVIKKRRKEAFFSELGK 253 Query: 959 TSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFGMPQD 1138 TSS+ A A A DG+GP WAPL+DNYMLT SKLKDWD+MP+T GDD G + Sbjct: 254 TSSRGA----SASAKVGPVDGEGPAWAPLQDNYMLTNSKLKDWDKMPDTVVGDDIG--RV 307 Query: 1139 THTSSDED 1162 + SSD+D Sbjct: 308 SEDSSDDD 315 >ref|XP_006469415.1| PREDICTED: RRP15-like protein-like isoform X3 [Citrus sinensis] Length = 320 Score = 233 bits (595), Expect = 1e-58 Identities = 144/319 (45%), Positives = 175/319 (54%), Gaps = 11/319 (3%) Frame = +2 Query: 239 DKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXX 415 + G +KRK+S K K K LK V+ G R K++ KM+K+ +K+AR Y Sbjct: 11 ESGPRKRKSSKKKGG-----KGKKKLK----VMPGSGERVKINNKMRKLFRKRARAYNSD 61 Query: 416 XXXXXXXXXXXXXXXXXXXXXXXXXXX---------IDEGNQXXXXXXXXXXXXXQPGTM 568 +D + QPG Sbjct: 62 DDEDESAPEFRGDSSLSVKNQEVEGRGSSDTEREDGMDLDVENEEFSDDEENGEIQPGIA 121 Query: 569 RFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRL 748 F++G ++F+MAF LGPVLSAHKKLV EKLAEEEAERKVKGEAKKE+ L Sbjct: 122 NFAEGSRAFKMAFKSILRKSVADDALGPVLSAHKKLVGEKLAEEEAERKVKGEAKKERHL 181 Query: 749 LGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXX 928 EKGHVKPA+YLDS EK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 182 AAEKGHVKPANYLDSCEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRSKDEKLLKKRR 241 Query: 929 XXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETT 1108 FFS+L KTS TA + K S+ DG+GP WAPLRDNYMLT SKLKDWD+MP++T Sbjct: 242 KETFFSELGKTSVSTADASAKGPNSSGTADGEGPAWAPLRDNYMLTSSKLKDWDKMPDST 301 Query: 1109 -DGDDFGMPQDTHTSSDED 1162 D+ G + SSD+D Sbjct: 302 VAADETGRLSEEDGSSDDD 320 >ref|XP_002264683.1| PREDICTED: uncharacterized protein LOC100249313 isoform 1 [Vitis vinifera] gi|359485781|ref|XP_003633334.1| PREDICTED: uncharacterized protein LOC100249313 isoform 2 [Vitis vinifera] Length = 320 Score = 233 bits (595), Expect = 1e-58 Identities = 140/323 (43%), Positives = 179/323 (55%), Gaps = 18/323 (5%) Frame = +2 Query: 248 TKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXXX 424 ++K +T + R K+ KK + G R K+DR+MKK+ +K+ARDY Sbjct: 5 SQKLETVNLSKKRRLGRKKGNKTKKKLKGFDGSGERIKIDRRMKKLFRKRARDYNSDDDE 64 Query: 425 XXXXXXXXXXXXXXXXXXXXXXXX----------------IDEGNQXXXXXXXXXXXXXQ 556 IDE N+ Q Sbjct: 65 DEGDDGGYAVATKGENAVPIIKEVALDGKDSSEEEEDHQDIDEENEISEDEEGEI----Q 120 Query: 557 PGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKK 736 PG + ++GC++F++AF LGPVLS HKKLV EKLAEEEA++KVKG+ KK Sbjct: 121 PGITKLTEGCRAFRLAFKKIIKKNVSDISLGPVLSGHKKLVAEKLAEEEADQKVKGDVKK 180 Query: 737 EKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXX 916 EK LLGEKGHVKPA++LDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGLN Sbjct: 181 EKHLLGEKGHVKPANFLDSHEKFLIGVATKGVVKLFNAVNKAQNAQKGLNASRFKDEKAI 240 Query: 917 XXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRM 1096 FFS+L KT+ +AGT KA S+ DG+GP WAPLRD+YMLT SKLK+WD+M Sbjct: 241 IKRRKEAFFSELGKTAWSSAGTPAKAHRSSGPVDGEGPAWAPLRDSYMLTSSKLKNWDKM 300 Query: 1097 PETTDGDDF-GMPQDTHTSSDED 1162 P++ D+ MP D SSD+D Sbjct: 301 PDSITEDEIERMPLD---SSDDD 320 >ref|XP_006469413.1| PREDICTED: RRP15-like protein-like isoform X1 [Citrus sinensis] gi|568830250|ref|XP_006469414.1| PREDICTED: RRP15-like protein-like isoform X2 [Citrus sinensis] Length = 321 Score = 233 bits (593), Expect = 2e-58 Identities = 144/320 (45%), Positives = 174/320 (54%), Gaps = 12/320 (3%) Frame = +2 Query: 239 DKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXX 415 + G +KRK+S K K K LK V+ G R K++ KM+K+ +K+AR Y Sbjct: 11 ESGPRKRKSSKKKGG-----KGKKKLK----VMPGSGERVKINNKMRKLFRKRARAYNSD 61 Query: 416 XXXXXXXXXXXXXXXXXXXXXXXXXXX---------IDEGNQXXXXXXXXXXXXXQPGTM 568 +D + QPG Sbjct: 62 DDEDESAPEFRGDSSLSVKNQEVEGRGSSDTEREDGMDLDVENEEFSDDEENGEIQPGIA 121 Query: 569 RFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRL 748 F++G ++F+MAF LGPVLSAHKKLV EKLAEEEAERKVKGEAKKE+ L Sbjct: 122 NFAEGSRAFKMAFKSILRKSVADDALGPVLSAHKKLVGEKLAEEEAERKVKGEAKKERHL 181 Query: 749 LGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXX 928 EKGHVKPA+YLDS EK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 182 AAEKGHVKPANYLDSCEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRSKDEKLLKKRR 241 Query: 929 XXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPE-- 1102 FFS+L KTS TA + K S+ DG+GP WAPLRDNYMLT SKLKDWD+MP+ Sbjct: 242 KETFFSELGKTSVSTADASAKGPNSSGTADGEGPAWAPLRDNYMLTSSKLKDWDKMPQDS 301 Query: 1103 TTDGDDFGMPQDTHTSSDED 1162 T D+ G + SSD+D Sbjct: 302 TVAADETGRLSEEDGSSDDD 321 >ref|XP_004238678.1| PREDICTED: uncharacterized protein LOC101255093 [Solanum lycopersicum] Length = 345 Score = 230 bits (587), Expect = 1e-57 Identities = 122/202 (60%), Positives = 140/202 (69%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F DGC +F++AF LGPVLSAHKKLV EKLAEE+ ERKVKGEAK Sbjct: 147 QPGITKFIDGCNAFRLAFKKILKKSASDDILGPVLSAHKKLVAEKLAEEDVERKVKGEAK 206 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK L+ EKGH KPA++LD++EKSLI +AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 207 KEKHLIREKGHEKPANFLDTYEKSLIAVATKGVVKLFNAVNKAQHAQKGLNPSRAKDEKV 266 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K SQT T KA AS S ED +GP WAPLRD YMLT KLKDWD+ Sbjct: 267 IKKRRREVFFSELGKAPSQT--TVSKAGASNSLED-EGPAWAPLRDTYMLTNPKLKDWDK 323 Query: 1094 MPETTDGDDFGMPQDTHTSSDE 1159 P+TT DD MP D+ +S DE Sbjct: 324 NPDTTVDDDVRMPADSDSSDDE 345 >ref|XP_006447845.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] gi|567911065|ref|XP_006447846.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] gi|557550456|gb|ESR61085.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] gi|557550457|gb|ESR61086.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] Length = 320 Score = 230 bits (586), Expect = 2e-57 Identities = 143/319 (44%), Positives = 173/319 (54%), Gaps = 11/319 (3%) Frame = +2 Query: 239 DKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXX 415 + G +KRK+S K K K LK V+ G R K++ KM+K+ +K+AR Y Sbjct: 11 ESGPRKRKSSKKKGG-----KGKKKLK----VMPGSGERVKINNKMRKLFRKRARAYNSD 61 Query: 416 XXXXXXXXXXXXXXXXXXXXXXXXXXX---------IDEGNQXXXXXXXXXXXXXQPGTM 568 +D + QPG Sbjct: 62 DDEAESAPEFRGDSSLSVKNQEVEGRGSSDTEREDGMDLDVENEEFSDDEENGEIQPGIA 121 Query: 569 RFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRL 748 F++G ++F+MAF LGPVLSAHKKL EKLAEEEAERKVKGEAKKE+ L Sbjct: 122 NFAEGSRAFKMAFKSILRKSVADDALGPVLSAHKKLFGEKLAEEEAERKVKGEAKKERHL 181 Query: 749 LGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXX 928 EKGHVKPA+YLDS EK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 182 AAEKGHVKPANYLDSCEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRSKDEKLLKKRR 241 Query: 929 XXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETT 1108 FFS+L KTS TA K S+ DG+GP WAPLRDNYMLT SKLKDWD+MP++T Sbjct: 242 KETFFSELGKTSVSTADACAKGPNSSGTADGEGPAWAPLRDNYMLTSSKLKDWDKMPDST 301 Query: 1109 -DGDDFGMPQDTHTSSDED 1162 D+ G + SSD+D Sbjct: 302 VAADETGRLSEEDGSSDDD 320 >ref|XP_006355975.1| PREDICTED: RRP15-like protein-like [Solanum tuberosum] Length = 347 Score = 229 bits (584), Expect = 3e-57 Identities = 121/202 (59%), Positives = 140/202 (69%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F DGC +F++AF LGPVLSAHK LV EKLAEE+ ERKVKG+AK Sbjct: 149 QPGITKFIDGCNAFRLAFKKILKKSASDDILGPVLSAHKNLVAEKLAEEDVERKVKGDAK 208 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK L+ EKGHVKPA++LD++EKSLI +AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 209 KEKHLIREKGHVKPANFLDAYEKSLIAVATKGVVKLFNAVNKAQHAQKGLNPSRAKDEKA 268 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K SQT T KA AS S ED +GP WAPLRDNYMLT KLKDWD+ Sbjct: 269 IKKRRREVFFSELGKAPSQT--TASKAGASNSLED-EGPSWAPLRDNYMLTNPKLKDWDK 325 Query: 1094 MPETTDGDDFGMPQDTHTSSDE 1159 +TT DD MP D+ +S DE Sbjct: 326 NADTTVEDDVRMPADSDSSDDE 347 >ref|XP_002320535.1| hypothetical protein POPTR_0014s16860g [Populus trichocarpa] gi|222861308|gb|EEE98850.1| hypothetical protein POPTR_0014s16860g [Populus trichocarpa] Length = 307 Score = 228 bits (582), Expect = 4e-57 Identities = 138/312 (44%), Positives = 173/312 (55%), Gaps = 6/312 (1%) Frame = +2 Query: 242 KGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDYXXXXX 421 K KKRK+ + K N K+ K N V +K+D ++KK+L+KKARDY Sbjct: 12 KPFKKRKSGHKKGG--NFKKKQKQFLGNKEKV-----KKIDPRLKKLLRKKARDYNSDDD 64 Query: 422 XXXXXXXXXXXXXXXXXXXXXXXXXIDEGNQXXXXXXXXXXXXXQ------PGTMRFSDG 583 D+G + + PG +FS+G Sbjct: 65 NEDETAHASEDDNADSMDDDVSS---DDGKEKKNLGIEIEGSENEDDDEIPPGITKFSEG 121 Query: 584 CKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKG 763 C++F++AF LGPVLS HK LV EKLAEE AER+VKG+AKKEK L+GEKG Sbjct: 122 CRAFRIAFKSISKKAISDDSLGPVLSGHKTLVAEKLAEEVAERRVKGDAKKEKHLVGEKG 181 Query: 764 HVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXXFF 943 HVKPA+YLD+HEK LI +AT+GVVKLFNAVNKAQ+ QKGLNP FF Sbjct: 182 HVKPANYLDAHEKFLISVATKGVVKLFNAVNKAQNAQKGLNPSRSKDAKVIKKRRKERFF 241 Query: 944 SQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDF 1123 S+L KT A T+ K AS+ G+GP WAPLRDNYMLT SKLKDWD+MP+ DD Sbjct: 242 SELGKT--PVADTSTKVHASS----GEGPSWAPLRDNYMLTNSKLKDWDKMPDKVVHDDI 295 Query: 1124 GMPQDTHTSSDE 1159 G + +S D+ Sbjct: 296 GRMSEDSSSDDD 307 >emb|CBI28475.3| unnamed protein product [Vitis vinifera] Length = 274 Score = 226 bits (577), Expect = 2e-56 Identities = 117/204 (57%), Positives = 144/204 (70%), Gaps = 1/204 (0%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG + ++GC++F++AF LGPVLS HKKLV EKLAEEEA++KVKG+ K Sbjct: 74 QPGITKLTEGCRAFRLAFKKIIKKNVSDISLGPVLSGHKKLVAEKLAEEEADQKVKGDVK 133 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK LLGEKGHVKPA++LDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGLN Sbjct: 134 KEKHLLGEKGHVKPANFLDSHEKFLIGVATKGVVKLFNAVNKAQNAQKGLNASRFKDEKA 193 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L KT+ +AGT KA S+ DG+GP WAPLRD+YMLT SKLK+WD+ Sbjct: 194 IIKRRKEAFFSELGKTAWSSAGTPAKAHRSSGPVDGEGPAWAPLRDSYMLTSSKLKNWDK 253 Query: 1094 MPETTDGDDF-GMPQDTHTSSDED 1162 MP++ D+ MP D SSD+D Sbjct: 254 MPDSITEDEIERMPLD---SSDDD 274 >ref|XP_006603696.1| PREDICTED: uncharacterized protein LOC100780091 isoform X1 [Glycine max] gi|571552806|ref|XP_006603697.1| PREDICTED: uncharacterized protein LOC100780091 isoform X2 [Glycine max] gi|571552810|ref|XP_006603698.1| PREDICTED: uncharacterized protein LOC100780091 isoform X3 [Glycine max] gi|571552814|ref|XP_006603699.1| PREDICTED: uncharacterized protein LOC100780091 isoform X4 [Glycine max] Length = 334 Score = 224 bits (571), Expect = 8e-56 Identities = 114/203 (56%), Positives = 136/203 (66%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F++GC++F+MAF LGP+LS +KLV++KLAEEEAE KVKGEAK Sbjct: 133 QPGITKFTEGCRAFKMAFKNIMQKSVPDDMLGPILSGQRKLVVDKLAEEEAESKVKGEAK 192 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK++L EKGHVKPA YLDSHEK LI +AT+GVVKLFNAVNKAQ QKGLNP Sbjct: 193 KEKKMLAEKGHVKPATYLDSHEKFLISVATKGVVKLFNAVNKAQTAQKGLNPSRTKDAKE 252 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K S GT KA S + + P WAPLRDNYMLT S+LKDWD+ Sbjct: 253 IRKRTKQAFFSELGKPSLPAIGTTTKAKESTDRVEDEQPAWAPLRDNYMLTSSRLKDWDK 312 Query: 1094 MPETTDGDDFGMPQDTHTSSDED 1162 MP+ DD G + +SSDED Sbjct: 313 MPDKNVSDDMGKTSE-DSSSDED 334 >ref|XP_003521708.1| PREDICTED: RRP15-like protein-like isoform X1 [Glycine max] gi|571446997|ref|XP_006577248.1| PREDICTED: RRP15-like protein-like isoform X2 [Glycine max] Length = 335 Score = 224 bits (571), Expect = 8e-56 Identities = 114/203 (56%), Positives = 137/203 (67%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F++GC++F+MAF LGP+LS +KLV++KLAEEEAERKVKGEAK Sbjct: 134 QPGITKFTEGCRAFKMAFRNLMKKSVPDDMLGPILSGQRKLVVDKLAEEEAERKVKGEAK 193 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK++L EKGHVKPA YLDSHEK LI +AT+GVVKLFNAVNKAQ Q+GL+P Sbjct: 194 KEKQMLAEKGHVKPATYLDSHEKFLISVATKGVVKLFNAVNKAQTAQRGLDPSRTKDAKE 253 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K S GT KA S + + P WAPLRDNYMLT SKLKDWD+ Sbjct: 254 IRKRTKQAFFSELGKPSLPAIGTTTKAKESTDRVEDEQPAWAPLRDNYMLTSSKLKDWDK 313 Query: 1094 MPETTDGDDFGMPQDTHTSSDED 1162 MP+ DD G + +SSDED Sbjct: 314 MPDKNVSDDMGKTSE-DSSSDED 335 >ref|NP_001242109.1| uncharacterized protein LOC100780091 [Glycine max] gi|255641272|gb|ACU20913.1| unknown [Glycine max] Length = 334 Score = 224 bits (571), Expect = 8e-56 Identities = 114/203 (56%), Positives = 136/203 (66%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F++GC++F+MAF LGP+LS +KLV++KLAEEEAE KVKGEAK Sbjct: 133 QPGITKFTEGCRAFKMAFKNIMQKSVPDDMLGPILSGQRKLVVDKLAEEEAESKVKGEAK 192 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK++L EKGHVKPA YLDSHEK LI +AT+GVVKLFNAVNKAQ QKGLNP Sbjct: 193 KEKKMLAEKGHVKPATYLDSHEKFLISVATKGVVKLFNAVNKAQTAQKGLNPSRTKDAKE 252 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K S GT KA S + + P WAPLRDNYMLT S+LKDWD+ Sbjct: 253 IRKRTKQAFFSELGKPSLPAIGTTTKAKESTDRVEDEQPAWAPLRDNYMLTSSRLKDWDK 312 Query: 1094 MPETTDGDDFGMPQDTHTSSDED 1162 MP+ DD G + +SSDED Sbjct: 313 MPDKNVSDDMGKTSE-DSSSDED 334 >gb|ESW19049.1| hypothetical protein PHAVU_006G092400g [Phaseolus vulgaris] Length = 338 Score = 223 bits (567), Expect = 2e-55 Identities = 116/203 (57%), Positives = 134/203 (66%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG +F++GC++F+MAF LGP+LSAHKKLVIEKL EEEAERK+KGEAK Sbjct: 140 QPGITKFTEGCRAFKMAFRNVMKKSIPDDMLGPILSAHKKLVIEKLGEEEAERKIKGEAK 199 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK+ L EKGHVKPA YLDSH+K LI +AT+GVVKLFNAVNKAQ QKGLNP Sbjct: 200 KEKQTLAEKGHVKPATYLDSHDKFLISVATKGVVKLFNAVNKAQTAQKGLNPSRIKEAKE 259 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L K S + G K D P WAPLRDNYMLT SKLKDWD+ Sbjct: 260 IRKRTKKAFFSELGKPSLPSIGATTKVTEGT---DDQQPAWAPLRDNYMLTSSKLKDWDK 316 Query: 1094 MPETTDGDDFGMPQDTHTSSDED 1162 MP+ DD G + +SSDED Sbjct: 317 MPDKNVSDDMGRASE-DSSSDED 338 >gb|EOX93569.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 301 Score = 222 bits (565), Expect = 4e-55 Identities = 131/306 (42%), Positives = 167/306 (54%), Gaps = 1/306 (0%) Frame = +2 Query: 245 GTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXX 421 G +KR K N +NK LK ++E G + ++ +KM+ + +K+ARDY Sbjct: 11 GKRKRNLGKKKGNK----PKNKKLK----MLEGKGKKLRVSKKMRNLFEKRARDYNSDDE 62 Query: 422 XXXXXXXXXXXXXXXXXXXXXXXXXIDEGNQXXXXXXXXXXXXXQPGTMRFSDGCKSFQM 601 +EG + QPG MRF++G ++F++ Sbjct: 63 EEAEEEEEAALDDIRMGGGGDDSSDENEGEEAEDDEI-------QPGIMRFTEGVRAFRL 115 Query: 602 AFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPAD 781 AF LGPVLS HK+LV +KLAEEEAERKVKGEAKKEK L+ EKGHVKPA+ Sbjct: 116 AFKNIIKRSVADDSLGPVLSGHKQLVAKKLAEEEAERKVKGEAKKEKHLVAEKGHVKPAN 175 Query: 782 YLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXXFFSQLSKT 961 YLDS EK LIGIAT+GVVKLFNAVNKAQ QKGL+P FFS+L K Sbjct: 176 YLDSREKFLIGIATKGVVKLFNAVNKAQKAQKGLDPSRSKDAKMIRKRRKEAFFSELGKP 235 Query: 962 SSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFGMPQDT 1141 S ++ K S+ + DGP WAPLRDNYMLT KLK WD+M ++ DD G + Sbjct: 236 SLTAHDSSSKGNKSSDPRNDDGPAWAPLRDNYMLTNPKLKSWDKMADSAVADDVGRMSED 295 Query: 1142 HTSSDE 1159 S D+ Sbjct: 296 SGSDDD 301 >gb|EOX93568.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 400 Score = 219 bits (559), Expect = 2e-54 Identities = 131/301 (43%), Positives = 167/301 (55%), Gaps = 2/301 (0%) Frame = +2 Query: 245 GTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXX 421 G +KR K N +NK LK ++E G + ++ +KM+ + +K+ARDY Sbjct: 11 GKRKRNLGKKKGNK----PKNKKLK----MLEGKGKKLRVSKKMRNLFEKRARDYNSDDE 62 Query: 422 XXXXXXXXXXXXXXXXXXXXXXXXXIDEGNQXXXXXXXXXXXXXQPGTMRFSDGCKSFQM 601 +EG + QPG MRF++G ++F++ Sbjct: 63 EEAEEEEEAALDDIRMGGGGDDSSDENEGEEAEDDEI-------QPGIMRFTEGVRAFRL 115 Query: 602 AFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPAD 781 AF LGPVLS HK+LV +KLAEEEAERKVKGEAKKEK L+ EKGHVKPA+ Sbjct: 116 AFKNIIKRSVADDSLGPVLSGHKQLVAKKLAEEEAERKVKGEAKKEKHLVAEKGHVKPAN 175 Query: 782 YLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXXFFSQLSKT 961 YLDS EK LIGIAT+GVVKLFNAVNKAQ QKGL+P FFS+L K Sbjct: 176 YLDSREKFLIGIATKGVVKLFNAVNKAQKAQKGLDPSRSKDAKMIRKRRKEAFFSELGKP 235 Query: 962 SSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFG-MPQD 1138 S ++ K S+ + DGP WAPLRDNYMLT KLK WD+M ++ DD G M +D Sbjct: 236 SLTAHDSSSKGNKSSDPRNDDGPAWAPLRDNYMLTNPKLKSWDKMADSAVADDVGRMSED 295 Query: 1139 T 1141 + Sbjct: 296 S 296 >ref|XP_002524600.1| conserved hypothetical protein [Ricinus communis] gi|223536153|gb|EEF37808.1| conserved hypothetical protein [Ricinus communis] Length = 307 Score = 216 bits (550), Expect = 2e-53 Identities = 112/201 (55%), Positives = 132/201 (65%) Frame = +2 Query: 554 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAK 733 QPG + +GC++F++AF LGPVLS HKKLV EKLAEE++ERKVKGEAK Sbjct: 117 QPGITKLIEGCRAFKIAFNSIIKKSVSDDSLGPVLSGHKKLVAEKLAEEDSERKVKGEAK 176 Query: 734 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 913 KEK+L+ EKGHVKPA+YLDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGL+P Sbjct: 177 KEKKLMEEKGHVKPANYLDSHEKYLIGLATKGVVKLFNAVNKAQNSQKGLDPSRTKDAKV 236 Query: 914 XXXXXXXXFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 1093 FFS+L KTS A +GP WAPLRDNYMLT SKLKDWD+ Sbjct: 237 INKRRKEAFFSELGKTSVSNAAAK-----------PEGPSWAPLRDNYMLTNSKLKDWDK 285 Query: 1094 MPETTDGDDFGMPQDTHTSSD 1156 MP+T DD G + S D Sbjct: 286 MPDTVVADDIGKMSEDSGSDD 306 >ref|XP_006395203.1| hypothetical protein EUTSA_v10004636mg [Eutrema salsugineum] gi|557091842|gb|ESQ32489.1| hypothetical protein EUTSA_v10004636mg [Eutrema salsugineum] Length = 314 Score = 215 bits (548), Expect = 4e-53 Identities = 130/303 (42%), Positives = 163/303 (53%), Gaps = 8/303 (2%) Frame = +2 Query: 239 DKGTKKRKTSYIKSNSRNAVK-RNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDYXXX 415 ++GT+KR+ ++N K + K LK + F K+ K KK+ QK+ARDY Sbjct: 13 ERGTRKRRVG-----TKNGGKSKKKKLKTRPPSSDRF---KLTMKDKKIFQKRARDYNSD 64 Query: 416 XXXXXXXXXXXXXXXXXXXXXXXXXXXI------DEGNQXXXXXXXXXXXXXQPGTMRFS 577 +EG+ Q G RF+ Sbjct: 65 EDDEDESTKPPEVTIREKIFSDANMGPNYDEIEDEEGSDPEDNSDGEDHGEIQSGITRFA 124 Query: 578 -DGCKSFQMAFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLG 754 DGC +F+ AF LGPVLSAHK L+ +KLAEEEAE+K KG+A+K K L+ Sbjct: 125 TDGCNAFRTAFKAIMKKTKGDDTLGPVLSAHKHLIAQKLAEEEAEKKAKGQARKAKHLVA 184 Query: 755 EKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXX 934 EKGHVKPA YLDSHEK LIG+AT+GVVKLFNAVNKAQH QKGLN Sbjct: 185 EKGHVKPASYLDSHEKILIGVATKGVVKLFNAVNKAQHAQKGLNASRSKDAKVLKKRRKE 244 Query: 935 XFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDG 1114 FFS+L KTS T+ KA S++ + + P WAPLRDNYML KLKDWD+ ET++G Sbjct: 245 AFFSELGKTSK----TDSKAQNSSNSHEDEAPAWAPLRDNYMLANPKLKDWDKKQETSEG 300 Query: 1115 DDF 1123 DDF Sbjct: 301 DDF 303 >gb|EOX93570.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 289 Score = 215 bits (547), Expect = 5e-53 Identities = 129/295 (43%), Positives = 163/295 (55%), Gaps = 1/295 (0%) Frame = +2 Query: 245 GTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXX 421 G +KR K N +NK LK ++E G + ++ +KM+ + +K+ARDY Sbjct: 11 GKRKRNLGKKKGNK----PKNKKLK----MLEGKGKKLRVSKKMRNLFEKRARDYNSDDE 62 Query: 422 XXXXXXXXXXXXXXXXXXXXXXXXXIDEGNQXXXXXXXXXXXXXQPGTMRFSDGCKSFQM 601 +EG + QPG MRF++G ++F++ Sbjct: 63 EEAEEEEEAALDDIRMGGGGDDSSDENEGEEAEDDEI-------QPGIMRFTEGVRAFRL 115 Query: 602 AFXXXXXXXXXXXXLGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPAD 781 AF LGPVLS HK+LV +KLAEEEAERKVKGEAKKEK L+ EKGHVKPA+ Sbjct: 116 AFKNIIKRSVADDSLGPVLSGHKQLVAKKLAEEEAERKVKGEAKKEKHLVAEKGHVKPAN 175 Query: 782 YLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXXFFSQLSKT 961 YLDS EK LIGIAT+GVVKLFNAVNKAQ QKGL+P FFS+L K Sbjct: 176 YLDSREKFLIGIATKGVVKLFNAVNKAQKAQKGLDPSRSKDAKMIRKRRKEAFFSELGKP 235 Query: 962 SSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFG 1126 S ++ K S+ + DGP WAPLRDNYMLT KLK WD+M ++ DD G Sbjct: 236 SLTAHDSSSKGNKSSDPRNDDGPAWAPLRDNYMLTNPKLKSWDKM-DSAVADDVG 289