BLASTX nr result
ID: Catharanthus22_contig00011475
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00011475
         (1663 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|EXC15604.1| hypothetical protein L484_006868 [Morus notabilis]    247   9e-63
ref|XP_004144556.1| PREDICTED: uncharacterized protein LOC101216...  236   3e-59
gb|EMJ16906.1| hypothetical protein PRUPE_ppa008901mg [Prunus pe...  235   5e-59
ref|XP_006469415.1| PREDICTED: RRP15-like protein-like isoform X...  233   1e-58
ref|XP_002264683.1| PREDICTED: uncharacterized protein LOC100249...  233   1e-58
ref|XP_006469413.1| PREDICTED: RRP15-like protein-like isoform X...  233   2e-58
ref|XP_004238678.1| PREDICTED: uncharacterized protein LOC101255...  230   1e-57
ref|XP_006447845.1| hypothetical protein CICLE_v10015959mg [Citr...  230   2e-57
ref|XP_006355975.1| PREDICTED: RRP15-like protein-like [Solanum ...  229   3e-57
ref|XP_002320535.1| hypothetical protein POPTR_0014s16860g [Popu...  228   5e-57
emb|CBI28475.3| unnamed protein product [Vitis vinifera]             226   2e-56
ref|XP_006603696.1| PREDICTED: uncharacterized protein LOC100780...  224   9e-56
ref|XP_003521708.1| PREDICTED: RRP15-like protein-like isoform X...  224   9e-56
ref|NP_001242109.1| uncharacterized protein LOC100780091 [Glycin...  224   9e-56
gb|ESW19049.1| hypothetical protein PHAVU_006G092400g [Phaseolus...  223   2e-55
gb|EOX93569.1| Uncharacterized protein isoform 2 [Theobroma cacao]   222   4e-55
gb|EOX93568.1| Uncharacterized protein isoform 1 [Theobroma cacao]   219   2e-54
ref|XP_002524600.1| conserved hypothetical protein [Ricinus comm...  216   2e-53
ref|XP_006395203.1| hypothetical protein EUTSA_v10004636mg [Eutr...
215 4e-53 gb|EOX93570.1| Uncharacterized protein isoform 3, partial [Theob... 215 5e-53 >gb|EXC15604.1| hypothetical protein L484_006868 [Morus notabilis] Length = 317 Score = 247 bits (631), Expect = 9e-63 Identities = 142/312 (45%), Positives = 173/312 (55%), Gaps = 3/312 (0%) Frame = -1 Query: 1429 ELDKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGV---RKMDRKMKKMLQKKARD 1259 E G KKRK + K + +K K KK E G +K DR +KK+ +K+ARD Sbjct: 9 ESTTGPKKRK--FGKRKNPKMMKTKKKKKKVSHFPEGVGELKPKKSDRDIKKLFRKRARD 66 Query: 1258 YXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNIDEGNQXXXXXXXXXXXXVQPGTMRFSDG 1079 Y + + +QPG +F++G Sbjct: 67 YNSDEEESEDKREENDGDGDSSEAAEELENQ-EVNKEDNEFSDVDEAGEIQPGITKFTEG 125 Query: 1078 CKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKG 899 C++F+MAF LGPVLSAHKKLV EKLAEEEAERKVKG+ KKEK L+GEKG Sbjct: 126 CRAFRMAFKSIIKKAVNDDPLGPVLSAHKKLVAEKLAEEEAERKVKGQTKKEKHLVGEKG 185 Query: 898 HVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXAFF 719 HVKPA YLDSHEK LIG+AT+GVVKLFNAVNKAQH QKGLNP AFF Sbjct: 186 HVKPATYLDSHEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSSFKDKKAIRKRRKEAFF 245 Query: 718 SQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDF 539 ++L KT S K S+ D +GP WAPLRDNYMLT SKLKDWD+MP+ T D+ Sbjct: 246 TELGKTPSAAVNATAKLQTSSGQADDEGPAWAPLRDNYMLTSSKLKDWDKMPDKTVTDEI 305 Query: 538 GMPQDTHTSSDE 503 G + +S D+ Sbjct: 306 GNVSEDSSSDDD 317 >ref|XP_004144556.1| PREDICTED: uncharacterized protein LOC101216086 [Cucumis sativus] gi|449528421|ref|XP_004171203.1| PREDICTED: uncharacterized LOC101216086 [Cucumis sativus] Length = 320 Score = 236 bits (601), Expect = 3e-59 Identities = 143/325 (44%), Positives = 176/325 (54%), Gaps = 13/325 (4%) Frame = -1 Query: 1435 AGELDKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDY 1256 A EL +G K+RK ++N R + K K+D+KMKK+ QK+AR+Y Sbjct: 7 AVELVRGAKRRKKMGSRNNKRPRMMGGSGNKV-----------KIDKKMKKLFQKRAREY 55 Query: 1255 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNIDEG-----------NQXXXXXXXXXXXXV 1109 DE N + Sbjct: 56 NSDDDDDDGEKAPRVKKESKILVRSHEEEVGDEEFSEGEEERKDVNADVELSEDDENGEI 115 Query: 
1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F++GC++F+ AF LGP+LSA+KKLV EKLAEEEAERKVKGEA+ Sbjct: 116 QPGITKFTEGCRAFRAAFMSILKKNISDETLGPILSANKKLVAEKLAEEEAERKVKGEAR 175 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 K K+L+GEKGHVKPA YLDSHEK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 176 KAKQLVGEKGHVKPATYLDSHEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRTKDAKA 235 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L K + +N K S D +GP WAPLRDNYMLT SKLKDWD+ Sbjct: 236 INKRRKEAFFSELGKPTLSATNSNAKLNTSGGAADTEGPAWAPLRDNYMLTNSKLKDWDK 295 Query: 568 MPET--TDGDDFGMPQDTHTSSDED 500 MP+ T +D G + +SSDED Sbjct: 296 MPDNMMTAAEDNGRVLE-DSSSDED 319 >gb|EMJ16906.1| hypothetical protein PRUPE_ppa008901mg [Prunus persica] Length = 315 Score = 235 bits (599), Expect = 5e-59 Identities = 140/308 (45%), Positives = 174/308 (56%), Gaps = 6/308 (1%) Frame = -1 Query: 1405 RKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDYXXXXXXXXXX 1226 RK K A K+ K L+ ++ +K+D+KM+K+ +K+ARDY Sbjct: 15 RKRKIGKKKGSKAKKKAKMLQSAESI-RAMKPKKIDKKMQKLYRKRARDYNSEDESEQDN 73 Query: 1225 XXXXXXXXXXXXXXXXXXXNIDEGNQXXXXXXXXXXXXVQ------PGTMRFSDGCKSFQ 1064 ++G + PG MRF++G +F+ Sbjct: 74 ATPLGNNEDELIGGSSSGEEAEKGGDHSERNVDNGFSDDEEHGEILPGIMRFTEGSNAFR 133 Query: 1063 MAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPA 884 +AF LGPVLS KKLV EKLAEEE ERKVKGEAKKEK+L+ EKGHVKPA Sbjct: 134 LAFRSIIKKTVPEDVLGPVLSGQKKLVAEKLAEEENERKVKGEAKKEKQLVIEKGHVKPA 193 Query: 883 DYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXAFFSQLSK 704 +YLDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGLNP AFFS+L K Sbjct: 194 NYLDSHEKFLIGVATKGVVKLFNAVNKAQNAQKGLNPSKFKDAKVIKKRRKEAFFSELGK 253 Query: 703 TSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFGMPQD 524 TSS+ A A A DG+GP WAPL+DNYMLT SKLKDWD+MP+T GDD G + Sbjct: 254 TSSRGA----SASAKVGPVDGEGPAWAPLQDNYMLTNSKLKDWDKMPDTVVGDDIG--RV 307 Query: 523 THTSSDED 500 + SSD+D Sbjct: 308 SEDSSDDD 315 >ref|XP_006469415.1| PREDICTED: RRP15-like protein-like isoform X3 
[Citrus sinensis] Length = 320 Score = 233 bits (595), Expect = 1e-58 Identities = 145/319 (45%), Positives = 177/319 (55%), Gaps = 11/319 (3%) Frame = -1 Query: 1423 DKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXX 1247 + G +KRK+S K K K LK V+ G R K++ KM+K+ +K+AR Y Sbjct: 11 ESGPRKRKSSKKKGG-----KGKKKLK----VMPGSGERVKINNKMRKLFRKRARAYNSD 61 Query: 1246 XXXXXXXXXXXXXXXXXXXXXXXXXXN---------IDEGNQXXXXXXXXXXXXVQPGTM 1094 +D + +QPG Sbjct: 62 DDEDESAPEFRGDSSLSVKNQEVEGRGSSDTEREDGMDLDVENEEFSDDEENGEIQPGIA 121 Query: 1093 RFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRL 914 F++G ++F+MAF ALGPVLSAHKKLV EKLAEEEAERKVKGEAKKE+ L Sbjct: 122 NFAEGSRAFKMAFKSILRKSVADDALGPVLSAHKKLVGEKLAEEEAERKVKGEAKKERHL 181 Query: 913 LGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXX 734 EKGHVKPA+YLDS EK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 182 AAEKGHVKPANYLDSCEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRSKDEKLLKKRR 241 Query: 733 XXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETT 554 FFS+L KTS TA + K S+ DG+GP WAPLRDNYMLT SKLKDWD+MP++T Sbjct: 242 KETFFSELGKTSVSTADASAKGPNSSGTADGEGPAWAPLRDNYMLTSSKLKDWDKMPDST 301 Query: 553 -DGDDFGMPQDTHTSSDED 500 D+ G + SSD+D Sbjct: 302 VAADETGRLSEEDGSSDDD 320 >ref|XP_002264683.1| PREDICTED: uncharacterized protein LOC100249313 isoform 1 [Vitis vinifera] gi|359485781|ref|XP_003633334.1| PREDICTED: uncharacterized protein LOC100249313 isoform 2 [Vitis vinifera] Length = 320 Score = 233 bits (595), Expect = 1e-58 Identities = 141/323 (43%), Positives = 181/323 (56%), Gaps = 18/323 (5%) Frame = -1 Query: 1414 TKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXXX 1238 ++K +T + R K+ KK + G R K+DR+MKK+ +K+ARDY Sbjct: 5 SQKLETVNLSKKRRLGRKKGNKTKKKLKGFDGSGERIKIDRRMKKLFRKRARDYNSDDDE 64 Query: 1237 XXXXXXXXXXXXXXXXXXXXXXXN----------------IDEGNQXXXXXXXXXXXXVQ 1106 IDE N+ Q Sbjct: 65 DEGDDGGYAVATKGENAVPIIKEVALDGKDSSEEEEDHQDIDEENEISEDEEGEI----Q 120 Query: 1105 PGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKK 926 PG + ++GC++F++AF +LGPVLS 
HKKLV EKLAEEEA++KVKG+ KK Sbjct: 121 PGITKLTEGCRAFRLAFKKIIKKNVSDISLGPVLSGHKKLVAEKLAEEEADQKVKGDVKK 180 Query: 925 EKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXX 746 EK LLGEKGHVKPA++LDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGLN Sbjct: 181 EKHLLGEKGHVKPANFLDSHEKFLIGVATKGVVKLFNAVNKAQNAQKGLNASRFKDEKAI 240 Query: 745 XXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRM 566 AFFS+L KT+ +AGT KA S+ DG+GP WAPLRD+YMLT SKLK+WD+M Sbjct: 241 IKRRKEAFFSELGKTAWSSAGTPAKAHRSSGPVDGEGPAWAPLRDSYMLTSSKLKNWDKM 300 Query: 565 PETTDGDDF-GMPQDTHTSSDED 500 P++ D+ MP D SSD+D Sbjct: 301 PDSITEDEIERMPLD---SSDDD 320 >ref|XP_006469413.1| PREDICTED: RRP15-like protein-like isoform X1 [Citrus sinensis] gi|568830250|ref|XP_006469414.1| PREDICTED: RRP15-like protein-like isoform X2 [Citrus sinensis] Length = 321 Score = 233 bits (593), Expect = 2e-58 Identities = 145/320 (45%), Positives = 176/320 (55%), Gaps = 12/320 (3%) Frame = -1 Query: 1423 DKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXX 1247 + G +KRK+S K K K LK V+ G R K++ KM+K+ +K+AR Y Sbjct: 11 ESGPRKRKSSKKKGG-----KGKKKLK----VMPGSGERVKINNKMRKLFRKRARAYNSD 61 Query: 1246 XXXXXXXXXXXXXXXXXXXXXXXXXXN---------IDEGNQXXXXXXXXXXXXVQPGTM 1094 +D + +QPG Sbjct: 62 DDEDESAPEFRGDSSLSVKNQEVEGRGSSDTEREDGMDLDVENEEFSDDEENGEIQPGIA 121 Query: 1093 RFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRL 914 F++G ++F+MAF ALGPVLSAHKKLV EKLAEEEAERKVKGEAKKE+ L Sbjct: 122 NFAEGSRAFKMAFKSILRKSVADDALGPVLSAHKKLVGEKLAEEEAERKVKGEAKKERHL 181 Query: 913 LGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXX 734 EKGHVKPA+YLDS EK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 182 AAEKGHVKPANYLDSCEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRSKDEKLLKKRR 241 Query: 733 XXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPE-- 560 FFS+L KTS TA + K S+ DG+GP WAPLRDNYMLT SKLKDWD+MP+ Sbjct: 242 KETFFSELGKTSVSTADASAKGPNSSGTADGEGPAWAPLRDNYMLTSSKLKDWDKMPQDS 301 Query: 559 TTDGDDFGMPQDTHTSSDED 500 T D+ G + SSD+D Sbjct: 302 TVAADETGRLSEEDGSSDDD 321 >ref|XP_004238678.1| 
PREDICTED: uncharacterized protein LOC101255093 [Solanum lycopersicum] Length = 345 Score = 230 bits (587), Expect = 1e-57 Identities = 122/202 (60%), Positives = 140/202 (69%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F DGC +F++AF LGPVLSAHKKLV EKLAEE+ ERKVKGEAK Sbjct: 147 QPGITKFIDGCNAFRLAFKKILKKSASDDILGPVLSAHKKLVAEKLAEEDVERKVKGEAK 206 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK L+ EKGH KPA++LD++EKSLI +AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 207 KEKHLIREKGHEKPANFLDTYEKSLIAVATKGVVKLFNAVNKAQHAQKGLNPSRAKDEKV 266 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 FFS+L K SQT T KA AS S ED +GP WAPLRD YMLT KLKDWD+ Sbjct: 267 IKKRRREVFFSELGKAPSQT--TVSKAGASNSLED-EGPAWAPLRDTYMLTNPKLKDWDK 323 Query: 568 MPETTDGDDFGMPQDTHTSSDE 503 P+TT DD MP D+ +S DE Sbjct: 324 NPDTTVDDDVRMPADSDSSDDE 345 >ref|XP_006447845.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] gi|567911065|ref|XP_006447846.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] gi|557550456|gb|ESR61085.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] gi|557550457|gb|ESR61086.1| hypothetical protein CICLE_v10015959mg [Citrus clementina] Length = 320 Score = 230 bits (586), Expect = 2e-57 Identities = 144/319 (45%), Positives = 175/319 (54%), Gaps = 11/319 (3%) Frame = -1 Query: 1423 DKGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXX 1247 + G +KRK+S K K K LK V+ G R K++ KM+K+ +K+AR Y Sbjct: 11 ESGPRKRKSSKKKGG-----KGKKKLK----VMPGSGERVKINNKMRKLFRKRARAYNSD 61 Query: 1246 XXXXXXXXXXXXXXXXXXXXXXXXXXN---------IDEGNQXXXXXXXXXXXXVQPGTM 1094 +D + +QPG Sbjct: 62 DDEAESAPEFRGDSSLSVKNQEVEGRGSSDTEREDGMDLDVENEEFSDDEENGEIQPGIA 121 Query: 1093 RFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRL 914 F++G ++F+MAF ALGPVLSAHKKL EKLAEEEAERKVKGEAKKE+ L Sbjct: 122 NFAEGSRAFKMAFKSILRKSVADDALGPVLSAHKKLFGEKLAEEEAERKVKGEAKKERHL 181 Query: 913 
LGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXX 734 EKGHVKPA+YLDS EK LIG+AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 182 AAEKGHVKPANYLDSCEKFLIGVATKGVVKLFNAVNKAQHAQKGLNPSRSKDEKLLKKRR 241 Query: 733 XXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETT 554 FFS+L KTS TA K S+ DG+GP WAPLRDNYMLT SKLKDWD+MP++T Sbjct: 242 KETFFSELGKTSVSTADACAKGPNSSGTADGEGPAWAPLRDNYMLTSSKLKDWDKMPDST 301 Query: 553 -DGDDFGMPQDTHTSSDED 500 D+ G + SSD+D Sbjct: 302 VAADETGRLSEEDGSSDDD 320 >ref|XP_006355975.1| PREDICTED: RRP15-like protein-like [Solanum tuberosum] Length = 347 Score = 229 bits (584), Expect = 3e-57 Identities = 121/202 (59%), Positives = 140/202 (69%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F DGC +F++AF LGPVLSAHK LV EKLAEE+ ERKVKG+AK Sbjct: 149 QPGITKFIDGCNAFRLAFKKILKKSASDDILGPVLSAHKNLVAEKLAEEDVERKVKGDAK 208 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK L+ EKGHVKPA++LD++EKSLI +AT+GVVKLFNAVNKAQH QKGLNP Sbjct: 209 KEKHLIREKGHVKPANFLDAYEKSLIAVATKGVVKLFNAVNKAQHAQKGLNPSRAKDEKA 268 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 FFS+L K SQT T KA AS S ED +GP WAPLRDNYMLT KLKDWD+ Sbjct: 269 IKKRRREVFFSELGKAPSQT--TASKAGASNSLED-EGPSWAPLRDNYMLTNPKLKDWDK 325 Query: 568 MPETTDGDDFGMPQDTHTSSDE 503 +TT DD MP D+ +S DE Sbjct: 326 NADTTVEDDVRMPADSDSSDDE 347 >ref|XP_002320535.1| hypothetical protein POPTR_0014s16860g [Populus trichocarpa] gi|222861308|gb|EEE98850.1| hypothetical protein POPTR_0014s16860g [Populus trichocarpa] Length = 307 Score = 228 bits (582), Expect = 5e-57 Identities = 138/312 (44%), Positives = 174/312 (55%), Gaps = 6/312 (1%) Frame = -1 Query: 1420 KGTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDYXXXXX 1241 K KKRK+ + K N K+ K N V +K+D ++KK+L+KKARDY Sbjct: 12 KPFKKRKSGHKKGG--NFKKKQKQFLGNKEKV-----KKIDPRLKKLLRKKARDYNSDDD 64 Query: 1240 XXXXXXXXXXXXXXXXXXXXXXXXNIDEGNQXXXXXXXXXXXXVQ------PGTMRFSDG 1079 D+G + + PG +FS+G Sbjct: 65 
NEDETAHASEDDNADSMDDDVSS---DDGKEKKNLGIEIEGSENEDDDEIPPGITKFSEG 121 Query: 1078 CKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKG 899 C++F++AF +LGPVLS HK LV EKLAEE AER+VKG+AKKEK L+GEKG Sbjct: 122 CRAFRIAFKSISKKAISDDSLGPVLSGHKTLVAEKLAEEVAERRVKGDAKKEKHLVGEKG 181 Query: 898 HVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXAFF 719 HVKPA+YLD+HEK LI +AT+GVVKLFNAVNKAQ+ QKGLNP FF Sbjct: 182 HVKPANYLDAHEKFLISVATKGVVKLFNAVNKAQNAQKGLNPSRSKDAKVIKKRRKERFF 241 Query: 718 SQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDF 539 S+L KT A T+ K AS+ G+GP WAPLRDNYMLT SKLKDWD+MP+ DD Sbjct: 242 SELGKT--PVADTSTKVHASS----GEGPSWAPLRDNYMLTNSKLKDWDKMPDKVVHDDI 295 Query: 538 GMPQDTHTSSDE 503 G + +S D+ Sbjct: 296 GRMSEDSSSDDD 307 >emb|CBI28475.3| unnamed protein product [Vitis vinifera] Length = 274 Score = 226 bits (577), Expect = 2e-56 Identities = 118/204 (57%), Positives = 146/204 (71%), Gaps = 1/204 (0%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG + ++GC++F++AF +LGPVLS HKKLV EKLAEEEA++KVKG+ K Sbjct: 74 QPGITKLTEGCRAFRLAFKKIIKKNVSDISLGPVLSGHKKLVAEKLAEEEADQKVKGDVK 133 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK LLGEKGHVKPA++LDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGLN Sbjct: 134 KEKHLLGEKGHVKPANFLDSHEKFLIGVATKGVVKLFNAVNKAQNAQKGLNASRFKDEKA 193 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L KT+ +AGT KA S+ DG+GP WAPLRD+YMLT SKLK+WD+ Sbjct: 194 IIKRRKEAFFSELGKTAWSSAGTPAKAHRSSGPVDGEGPAWAPLRDSYMLTSSKLKNWDK 253 Query: 568 MPETTDGDDF-GMPQDTHTSSDED 500 MP++ D+ MP D SSD+D Sbjct: 254 MPDSITEDEIERMPLD---SSDDD 274 >ref|XP_006603696.1| PREDICTED: uncharacterized protein LOC100780091 isoform X1 [Glycine max] gi|571552806|ref|XP_006603697.1| PREDICTED: uncharacterized protein LOC100780091 isoform X2 [Glycine max] gi|571552810|ref|XP_006603698.1| PREDICTED: uncharacterized protein LOC100780091 isoform X3 [Glycine max] gi|571552814|ref|XP_006603699.1| PREDICTED: 
uncharacterized protein LOC100780091 isoform X4 [Glycine max] Length = 334 Score = 224 bits (571), Expect = 9e-56 Identities = 115/203 (56%), Positives = 137/203 (67%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F++GC++F+MAF LGP+LS +KLV++KLAEEEAE KVKGEAK Sbjct: 133 QPGITKFTEGCRAFKMAFKNIMQKSVPDDMLGPILSGQRKLVVDKLAEEEAESKVKGEAK 192 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK++L EKGHVKPA YLDSHEK LI +AT+GVVKLFNAVNKAQ QKGLNP Sbjct: 193 KEKKMLAEKGHVKPATYLDSHEKFLISVATKGVVKLFNAVNKAQTAQKGLNPSRTKDAKE 252 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L K S GT KA S + + P WAPLRDNYMLT S+LKDWD+ Sbjct: 253 IRKRTKQAFFSELGKPSLPAIGTTTKAKESTDRVEDEQPAWAPLRDNYMLTSSRLKDWDK 312 Query: 568 MPETTDGDDFGMPQDTHTSSDED 500 MP+ DD G + +SSDED Sbjct: 313 MPDKNVSDDMGKTSE-DSSSDED 334 >ref|XP_003521708.1| PREDICTED: RRP15-like protein-like isoform X1 [Glycine max] gi|571446997|ref|XP_006577248.1| PREDICTED: RRP15-like protein-like isoform X2 [Glycine max] Length = 335 Score = 224 bits (571), Expect = 9e-56 Identities = 115/203 (56%), Positives = 138/203 (67%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F++GC++F+MAF LGP+LS +KLV++KLAEEEAERKVKGEAK Sbjct: 134 QPGITKFTEGCRAFKMAFRNLMKKSVPDDMLGPILSGQRKLVVDKLAEEEAERKVKGEAK 193 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK++L EKGHVKPA YLDSHEK LI +AT+GVVKLFNAVNKAQ Q+GL+P Sbjct: 194 KEKQMLAEKGHVKPATYLDSHEKFLISVATKGVVKLFNAVNKAQTAQRGLDPSRTKDAKE 253 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L K S GT KA S + + P WAPLRDNYMLT SKLKDWD+ Sbjct: 254 IRKRTKQAFFSELGKPSLPAIGTTTKAKESTDRVEDEQPAWAPLRDNYMLTSSKLKDWDK 313 Query: 568 MPETTDGDDFGMPQDTHTSSDED 500 MP+ DD G + +SSDED Sbjct: 314 MPDKNVSDDMGKTSE-DSSSDED 335 >ref|NP_001242109.1| uncharacterized protein LOC100780091 [Glycine max] gi|255641272|gb|ACU20913.1| unknown [Glycine max] Length = 
334 Score = 224 bits (571), Expect = 9e-56 Identities = 115/203 (56%), Positives = 137/203 (67%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F++GC++F+MAF LGP+LS +KLV++KLAEEEAE KVKGEAK Sbjct: 133 QPGITKFTEGCRAFKMAFKNIMQKSVPDDMLGPILSGQRKLVVDKLAEEEAESKVKGEAK 192 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK++L EKGHVKPA YLDSHEK LI +AT+GVVKLFNAVNKAQ QKGLNP Sbjct: 193 KEKKMLAEKGHVKPATYLDSHEKFLISVATKGVVKLFNAVNKAQTAQKGLNPSRTKDAKE 252 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L K S GT KA S + + P WAPLRDNYMLT S+LKDWD+ Sbjct: 253 IRKRTKQAFFSELGKPSLPAIGTTTKAKESTDRVEDEQPAWAPLRDNYMLTSSRLKDWDK 312 Query: 568 MPETTDGDDFGMPQDTHTSSDED 500 MP+ DD G + +SSDED Sbjct: 313 MPDKNVSDDMGKTSE-DSSSDED 334 >gb|ESW19049.1| hypothetical protein PHAVU_006G092400g [Phaseolus vulgaris] Length = 338 Score = 223 bits (567), Expect = 2e-55 Identities = 117/203 (57%), Positives = 135/203 (66%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG +F++GC++F+MAF LGP+LSAHKKLVIEKL EEEAERK+KGEAK Sbjct: 140 QPGITKFTEGCRAFKMAFRNVMKKSIPDDMLGPILSAHKKLVIEKLGEEEAERKIKGEAK 199 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK+ L EKGHVKPA YLDSH+K LI +AT+GVVKLFNAVNKAQ QKGLNP Sbjct: 200 KEKQTLAEKGHVKPATYLDSHDKFLISVATKGVVKLFNAVNKAQTAQKGLNPSRIKEAKE 259 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L K S + G K D P WAPLRDNYMLT SKLKDWD+ Sbjct: 260 IRKRTKKAFFSELGKPSLPSIGATTKVTEGT---DDQQPAWAPLRDNYMLTSSKLKDWDK 316 Query: 568 MPETTDGDDFGMPQDTHTSSDED 500 MP+ DD G + +SSDED Sbjct: 317 MPDKNVSDDMGRASE-DSSSDED 338 >gb|EOX93569.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 301 Score = 222 bits (565), Expect = 4e-55 Identities = 132/306 (43%), Positives = 170/306 (55%), Gaps = 1/306 (0%) Frame = -1 Query: 1417 GTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXX 1241 G +KR K N +NK LK ++E 
G + ++ +KM+ + +K+ARDY Sbjct: 11 GKRKRNLGKKKGNK----PKNKKLK----MLEGKGKKLRVSKKMRNLFEKRARDYNSDDE 62 Query: 1240 XXXXXXXXXXXXXXXXXXXXXXXXNIDEGNQXXXXXXXXXXXXVQPGTMRFSDGCKSFQM 1061 + +EG + QPG MRF++G ++F++ Sbjct: 63 EEAEEEEEAALDDIRMGGGGDDSSDENEGEEAEDDEI-------QPGIMRFTEGVRAFRL 115 Query: 1060 AFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPAD 881 AF +LGPVLS HK+LV +KLAEEEAERKVKGEAKKEK L+ EKGHVKPA+ Sbjct: 116 AFKNIIKRSVADDSLGPVLSGHKQLVAKKLAEEEAERKVKGEAKKEKHLVAEKGHVKPAN 175 Query: 880 YLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXAFFSQLSKT 701 YLDS EK LIGIAT+GVVKLFNAVNKAQ QKGL+P AFFS+L K Sbjct: 176 YLDSREKFLIGIATKGVVKLFNAVNKAQKAQKGLDPSRSKDAKMIRKRRKEAFFSELGKP 235 Query: 700 SSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFGMPQDT 521 S ++ K S+ + DGP WAPLRDNYMLT KLK WD+M ++ DD G + Sbjct: 236 SLTAHDSSSKGNKSSDPRNDDGPAWAPLRDNYMLTNPKLKSWDKMADSAVADDVGRMSED 295 Query: 520 HTSSDE 503 S D+ Sbjct: 296 SGSDDD 301 >gb|EOX93568.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 400 Score = 219 bits (559), Expect = 2e-54 Identities = 132/301 (43%), Positives = 170/301 (56%), Gaps = 2/301 (0%) Frame = -1 Query: 1417 GTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXX 1241 G +KR K N +NK LK ++E G + ++ +KM+ + +K+ARDY Sbjct: 11 GKRKRNLGKKKGNK----PKNKKLK----MLEGKGKKLRVSKKMRNLFEKRARDYNSDDE 62 Query: 1240 XXXXXXXXXXXXXXXXXXXXXXXXNIDEGNQXXXXXXXXXXXXVQPGTMRFSDGCKSFQM 1061 + +EG + QPG MRF++G ++F++ Sbjct: 63 EEAEEEEEAALDDIRMGGGGDDSSDENEGEEAEDDEI-------QPGIMRFTEGVRAFRL 115 Query: 1060 AFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPAD 881 AF +LGPVLS HK+LV +KLAEEEAERKVKGEAKKEK L+ EKGHVKPA+ Sbjct: 116 AFKNIIKRSVADDSLGPVLSGHKQLVAKKLAEEEAERKVKGEAKKEKHLVAEKGHVKPAN 175 Query: 880 YLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXAFFSQLSKT 701 YLDS EK LIGIAT+GVVKLFNAVNKAQ QKGL+P AFFS+L K Sbjct: 176 YLDSREKFLIGIATKGVVKLFNAVNKAQKAQKGLDPSRSKDAKMIRKRRKEAFFSELGKP 235 Query: 700 SSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFG-MPQD 524 S ++ K S+ + DGP 
WAPLRDNYMLT KLK WD+M ++ DD G M +D Sbjct: 236 SLTAHDSSSKGNKSSDPRNDDGPAWAPLRDNYMLTNPKLKSWDKMADSAVADDVGRMSED 295 Query: 523 T 521 + Sbjct: 296 S 296 >ref|XP_002524600.1| conserved hypothetical protein [Ricinus communis] gi|223536153|gb|EEF37808.1| conserved hypothetical protein [Ricinus communis] Length = 307 Score = 216 bits (550), Expect = 2e-53 Identities = 113/201 (56%), Positives = 134/201 (66%) Frame = -1 Query: 1108 QPGTMRFSDGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAK 929 QPG + +GC++F++AF +LGPVLS HKKLV EKLAEE++ERKVKGEAK Sbjct: 117 QPGITKLIEGCRAFKIAFNSIIKKSVSDDSLGPVLSGHKKLVAEKLAEEDSERKVKGEAK 176 Query: 928 KEKRLLGEKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXX 749 KEK+L+ EKGHVKPA+YLDSHEK LIG+AT+GVVKLFNAVNKAQ+ QKGL+P Sbjct: 177 KEKKLMEEKGHVKPANYLDSHEKYLIGLATKGVVKLFNAVNKAQNSQKGLDPSRTKDAKV 236 Query: 748 XXXXXXXAFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDR 569 AFFS+L KTS A +GP WAPLRDNYMLT SKLKDWD+ Sbjct: 237 INKRRKEAFFSELGKTSVSNAAAK-----------PEGPSWAPLRDNYMLTNSKLKDWDK 285 Query: 568 MPETTDGDDFGMPQDTHTSSD 506 MP+T DD G + S D Sbjct: 286 MPDTVVADDIGKMSEDSGSDD 306 >ref|XP_006395203.1| hypothetical protein EUTSA_v10004636mg [Eutrema salsugineum] gi|557091842|gb|ESQ32489.1| hypothetical protein EUTSA_v10004636mg [Eutrema salsugineum] Length = 314 Score = 215 bits (548), Expect = 4e-53 Identities = 131/303 (43%), Positives = 165/303 (54%), Gaps = 8/303 (2%) Frame = -1 Query: 1423 DKGTKKRKTSYIKSNSRNAVK-RNKNLKKNHTVVENFGVRKMDRKMKKMLQKKARDYXXX 1247 ++GT+KR+ ++N K + K LK + F K+ K KK+ QK+ARDY Sbjct: 13 ERGTRKRRVG-----TKNGGKSKKKKLKTRPPSSDRF---KLTMKDKKIFQKRARDYNSD 64 Query: 1246 XXXXXXXXXXXXXXXXXXXXXXXXXXNI------DEGNQXXXXXXXXXXXXVQPGTMRFS 1085 +EG+ +Q G RF+ Sbjct: 65 EDDEDESTKPPEVTIREKIFSDANMGPNYDEIEDEEGSDPEDNSDGEDHGEIQSGITRFA 124 Query: 1084 -DGCKSFQMAFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLG 908 DGC +F+ AF LGPVLSAHK L+ +KLAEEEAE+K KG+A+K K L+ Sbjct: 125 TDGCNAFRTAFKAIMKKTKGDDTLGPVLSAHKHLIAQKLAEEEAEKKAKGQARKAKHLVA 184 Query: 907 
EKGHVKPADYLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXX 728 EKGHVKPA YLDSHEK LIG+AT+GVVKLFNAVNKAQH QKGLN Sbjct: 185 EKGHVKPASYLDSHEKILIGVATKGVVKLFNAVNKAQHAQKGLNASRSKDAKVLKKRRKE 244 Query: 727 AFFSQLSKTSSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDG 548 AFFS+L KTS T+ KA S++ + + P WAPLRDNYML KLKDWD+ ET++G Sbjct: 245 AFFSELGKTSK----TDSKAQNSSNSHEDEAPAWAPLRDNYMLANPKLKDWDKKQETSEG 300 Query: 547 DDF 539 DDF Sbjct: 301 DDF 303 >gb|EOX93570.1| Uncharacterized protein isoform 3, partial [Theobroma cacao] Length = 289 Score = 215 bits (547), Expect = 5e-53 Identities = 130/295 (44%), Positives = 166/295 (56%), Gaps = 1/295 (0%) Frame = -1 Query: 1417 GTKKRKTSYIKSNSRNAVKRNKNLKKNHTVVENFGVR-KMDRKMKKMLQKKARDYXXXXX 1241 G +KR K N +NK LK ++E G + ++ +KM+ + +K+ARDY Sbjct: 11 GKRKRNLGKKKGNK----PKNKKLK----MLEGKGKKLRVSKKMRNLFEKRARDYNSDDE 62 Query: 1240 XXXXXXXXXXXXXXXXXXXXXXXXNIDEGNQXXXXXXXXXXXXVQPGTMRFSDGCKSFQM 1061 + +EG + QPG MRF++G ++F++ Sbjct: 63 EEAEEEEEAALDDIRMGGGGDDSSDENEGEEAEDDEI-------QPGIMRFTEGVRAFRL 115 Query: 1060 AFXXXXXXXXXXXALGPVLSAHKKLVIEKLAEEEAERKVKGEAKKEKRLLGEKGHVKPAD 881 AF +LGPVLS HK+LV +KLAEEEAERKVKGEAKKEK L+ EKGHVKPA+ Sbjct: 116 AFKNIIKRSVADDSLGPVLSGHKQLVAKKLAEEEAERKVKGEAKKEKHLVAEKGHVKPAN 175 Query: 880 YLDSHEKSLIGIATRGVVKLFNAVNKAQHVQKGLNPXXXXXXXXXXXXXXXAFFSQLSKT 701 YLDS EK LIGIAT+GVVKLFNAVNKAQ QKGL+P AFFS+L K Sbjct: 176 YLDSREKFLIGIATKGVVKLFNAVNKAQKAQKGLDPSRSKDAKMIRKRRKEAFFSELGKP 235 Query: 700 SSQTAGTNGKAVASASHEDGDGPEWAPLRDNYMLTKSKLKDWDRMPETTDGDDFG 536 S ++ K S+ + DGP WAPLRDNYMLT KLK WD+M ++ DD G Sbjct: 236 SLTAHDSSSKGNKSSDPRNDDGPAWAPLRDNYMLTNPKLKSWDKM-DSAVADDVG 289
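The hit summary rows and the per-alignment "Identities" lines in a report like this one follow a regular pattern, so they can be extracted with a short script. The Python sketch below is illustrative only. The names Hit, parse_hit_row and identity_fraction are hypothetical and are not part of BLAST, and the regular expressions assume the legacy plain-text layout with each summary row on its own line, as in BLAST's native output.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    accession: str
    description: str
    bits: float
    evalue: float


# Assumed row shape: "<db>|<accession>| <description> <bits> <E-value>"
HIT_ROW = re.compile(
    r"^(?P<db>[a-z]+)\|(?P<acc>[^|\s]+)\|\s+(?P<desc>.+?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+"
    r"(?P<evalue>\d*\.?\d*e[-+]?\d+|\d+\.?\d*)\s*$"
)

# Assumed statistics shape: "Identities = 142/312 (45%), ..."
IDENTITIES = re.compile(r"Identities = (\d+)/(\d+)")


def parse_hit_row(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None if the line is not one."""
    m = HIT_ROW.match(line.strip())
    if m is None:
        return None
    evalue = m.group("evalue")
    if evalue.startswith("e"):
        # Legacy BLAST can print values such as "e-100" without a mantissa.
        evalue = "1" + evalue
    return Hit(m.group("acc"), m.group("desc"),
               float(m.group("bits")), float(evalue))


def identity_fraction(line: str) -> Optional[float]:
    """Return identical residues / alignment length from a statistics line."""
    m = IDENTITIES.search(line)
    if m is None:
        return None
    return int(m.group(1)) / int(m.group(2))


if __name__ == "__main__":
    row = ("gb|EXC15604.1| hypothetical protein L484_006868 "
           "[Morus notabilis] 247 9e-63")
    stats = ("Identities = 142/312 (45%), Positives = 173/312 (55%), "
             "Gaps = 3/312 (0%)")
    print(parse_hit_row(row))
    print(identity_fraction(stats))
```

Regular expressions are used here only to keep the sketch dependency-free. For real analyses, an established parser or BLAST's tabular or XML output is likely to be more robust than matching the plain-text report.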