BLASTX nr result
ID: Atropa21_contig00028957
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Atropa21_contig00028957 (773 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006348725.1| PREDICTED: uncharacterized protein LOC102579... 400 e-109 ref|XP_004239084.1| PREDICTED: uncharacterized protein LOC101252... 389 e-106 ref|XP_002512130.1| conserved hypothetical protein [Ricinus comm... 251 2e-64 ref|XP_002311326.1| hypothetical protein POPTR_0008s09190g [Popu... 245 1e-62 ref|XP_002316104.1| hypothetical protein POPTR_0010s16990g [Popu... 241 2e-61 gb|EXB65061.1| hypothetical protein L484_004237 [Morus notabilis] 239 6e-61 ref|XP_002266340.1| PREDICTED: uncharacterized protein LOC100265... 238 2e-60 gb|EOY01796.1| Uncharacterized protein isoform 2, partial [Theob... 235 1e-59 gb|EOY01795.1| Uncharacterized protein isoform 1 [Theobroma cacao] 235 1e-59 ref|XP_004297266.1| PREDICTED: uncharacterized protein LOC101294... 234 2e-59 gb|EMJ24536.1| hypothetical protein PRUPE_ppa007391mg [Prunus pe... 233 7e-59 ref|XP_006391025.1| hypothetical protein EUTSA_v10018736mg [Eutr... 232 1e-58 ref|XP_006438776.1| hypothetical protein CICLE_v10031839mg [Citr... 231 2e-58 ref|NP_564963.1| uncharacterized protein [Arabidopsis thaliana] ... 224 3e-56 gb|AAM61045.1| unknown [Arabidopsis thaliana] 224 3e-56 ref|XP_002888717.1| hypothetical protein ARALYDRAFT_476065 [Arab... 221 2e-55 emb|CAN60864.1| hypothetical protein VITISV_030819 [Vitis vinifera] 221 2e-55 gb|ESW29686.1| hypothetical protein PHAVU_002G090200g [Phaseolus... 216 5e-54 ref|XP_006302411.1| hypothetical protein CARUB_v10020483mg [Caps... 215 1e-53 gb|EPS66903.1| hypothetical protein M569_07873, partial [Genlise... 214 3e-53 >ref|XP_006348725.1| PREDICTED: uncharacterized protein LOC102579102 [Solanum tuberosum] Length = 373 Score = 400 bits (1028), Expect = e-109 Identities = 208/246 (84%), Positives = 217/246 (88%), Gaps = 4/246 (1%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAP----Q 213 MGRWR AN VARI+ KS+SI PT Q P KI NFS VP IQFLNFRSFSAAP Q Sbjct: 1 MGRWRAANFVARIV-AKSRSIQPTPQNPFQKIRNFSSVSVPTIQFLNFRSFSAAPATYPQ 59 Query: 214 YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSRS 393 YVDDFEY P++IDNT SLE DD+ IGKIPVKAYFLCTSIDLKRMQAE RDVLPSSSRS Sbjct: 60 YVDDFEYKPYKIDNTQSLEPEDDDGIGKIPVKAYFLCTSIDLKRMQAEIPRDVLPSSSRS 119 Query: 394 PNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYAS 573 PNHIALRFFNLLPTNT LRFG +TN CS MVVFQYGSVVLFNVEDDEAEYYLQIVRR+AS Sbjct: 120 PNHIALRFFNLLPTNTLLRFGGNTNGCSYMVVFQYGSVVLFNVEDDEAEYYLQIVRRFAS 179 Query: 574 GLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQI 753 GLLREMKKDDYA+KEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQ+ Sbjct: 180 GLLREMKKDDYAVKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQV 239 Query: 754 DGMVEE 771 DGMVEE Sbjct: 240 DGMVEE 245 >ref|XP_004239084.1| PREDICTED: uncharacterized protein LOC101252584 [Solanum lycopersicum] Length = 374 Score = 389 bits (998), Expect = e-106 Identities = 205/247 (82%), Positives = 216/247 (87%), Gaps = 5/247 (2%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPK-IQFLNFRSFSAAP---- 210 MGRWR AN VARI+ KS+SI T Q P KI FS VPK IQFLNFRSFSA+P Sbjct: 1 MGRWRAANFVARIV-AKSRSIQDTPQNPFLKIRKFSSVSVPKPIQFLNFRSFSASPATYP 59 Query: 211 QYVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSR 390 QYVDDFEY P++IDNT SLES DD+ IGKIPVKAYFLCTSIDLKRMQAE RDVLPSSSR Sbjct: 60 QYVDDFEYKPYKIDNTQSLESEDDDGIGKIPVKAYFLCTSIDLKRMQAEIPRDVLPSSSR 119 Query: 391 SPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYA 570 SPNHIALRFFNL+PTNT LRFG +TN CS MVVFQYGSVVLFNVED EAEYYLQIVRR+A Sbjct: 120 SPNHIALRFFNLIPTNTLLRFGGNTNGCSYMVVFQYGSVVLFNVEDHEAEYYLQIVRRFA 179 Query: 571 SGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQ 750 SGLLREMKKDDYA+KEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQ Sbjct: 180 SGLLREMKKDDYAVKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQ 239 Query: 751 IDGMVEE 771 +DGMVEE Sbjct: 240 VDGMVEE 246 >ref|XP_002512130.1| conserved hypothetical protein [Ricinus communis] gi|223549310|gb|EEF50799.1| conserved hypothetical protein [Ricinus communis] Length = 339 Score = 251 bits (641), Expect = 2e-64 Identities = 141/246 (57%), Positives = 167/246 (67%), Gaps = 4/246 (1%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ---- 213 MGRW A S+ I + S + F + + FRSFSA P Sbjct: 1 MGRWGAAASLLFNHIARKPSSSTLPNISHRSVRQFYPSN-QGVYLFGFRSFSALPSRVSV 59 Query: 214 YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSRS 393 Y ++ EY H + + E +DE+IGKIPVKAYFLCTSIDLK MQ+EN +V+P +SRS Sbjct: 60 YSNEIEYGSHDFASIYDFEPKEDEEIGKIPVKAYFLCTSIDLKSMQSENLINVVPPTSRS 119 Query: 394 PNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYAS 573 N+I LR+ TALR + C MVVFQYGS VLFN+ED E E +L+IVRR+AS Sbjct: 120 TNYIVLRYCGFPSEITALRVKDYIK-CQYMVVFQYGSAVLFNIEDHEVESFLEIVRRHAS 178 Query: 574 GLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQI 753 GLL EM+KDDYAIKEKPLLVEDMQGGAD+IVLK LDTDSIRI+ SVLGQSIALDYFVSQ+ Sbjct: 179 GLLPEMRKDDYAIKEKPLLVEDMQGGADYIVLKTLDTDSIRIMGSVLGQSIALDYFVSQV 238 Query: 754 DGMVEE 771 DGMVEE Sbjct: 239 DGMVEE 244 >ref|XP_002311326.1| hypothetical protein POPTR_0008s09190g [Populus trichocarpa] gi|222851146|gb|EEE88693.1| hypothetical protein POPTR_0008s09190g [Populus trichocarpa] Length = 366 Score = 245 bits (625), Expect = 1e-62 Identities = 140/246 (56%), Positives = 169/246 (68%), Gaps = 6/246 (2%) Frame = +1 Query: 52 RWR-TANSVARIIITKSKS-IPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ---- 213 RWR TA+ + I TK+ + P PL++ S ++ ++ FR FSA P Sbjct: 4 RWRATASLLLDHITTKASDFLSPNLPKPLNR----SHPLIHTVRGFKFRPFSAIPSRVSV 59 Query: 214 YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSRS 393 Y ++ E H + + L +DE+ GKIPVKAYFLCTSI+LK MQAEN +V+P +SRS Sbjct: 60 YSNEIESGSHDLALNYPLGPKEDEETGKIPVKAYFLCTSINLKSMQAENLSNVVPPTSRS 119 Query: 394 PNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYAS 573 N+ LRFFN +AL G + C MVVFQYGS VLFN+ED E E YL+IVRR+ S Sbjct: 120 TNYTVLRFFNFSSDISALGIGGYVS-CRYMVVFQYGSAVLFNIEDHEVERYLEIVRRHTS 178 Query: 574 GLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQI 753 GLL EM+KDDYAI EKPLL EDMQGG D+IVLK LDTDSIRII SVLGQSIALDYFVSQ+ Sbjct: 179 GLLSEMRKDDYAIIEKPLLAEDMQGGLDYIVLKTLDTDSIRIIGSVLGQSIALDYFVSQV 238 Query: 754 DGMVEE 771 DGMVEE Sbjct: 239 DGMVEE 244 >ref|XP_002316104.1| hypothetical protein POPTR_0010s16990g [Populus trichocarpa] gi|222865144|gb|EEF02275.1| hypothetical protein POPTR_0010s16990g [Populus trichocarpa] Length = 366 Score = 241 bits (615), Expect = 2e-61 Identities = 141/253 (55%), Positives = 171/253 (67%), Gaps = 13/253 (5%) Frame = +1 Query: 52 RWR-TANSVARIIITKS-KSIPPT------HQYPLHK-ICNFSLTIVPKIQFLNFRSFSA 204 RWR TA+ + I TK+ K + P H +PL + +C F FRSFSA Sbjct: 4 RWRATASLLLNHITTKAFKFLSPNLPRPIYHSHPLTQTVCGFK-----------FRSFSA 52 Query: 205 APQ----YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDV 372 P Y ++ E H + + L +DE+ GKIPVKAYFLCTSI+LK MQAEN V Sbjct: 53 IPSRVSVYSNEIESGSHDLAINYDLGPKEDEESGKIPVKAYFLCTSINLKSMQAENLSYV 112 Query: 373 LPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQ 552 +P +SRS N++ L+FF+ +AL E + C MVVFQYGS VLFN+ED + E YL+ Sbjct: 113 VPPTSRSTNYVVLKFFDFSSDISALGIREYIS-CRYMVVFQYGSAVLFNIEDPDVERYLE 171 Query: 553 IVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIAL 732 +VRR+ SGLL EM+KDDYAIKEKPLL EDMQGG D+IVLK LDTDSIRII SVLGQSIAL Sbjct: 172 MVRRHTSGLLSEMRKDDYAIKEKPLLDEDMQGGLDYIVLKTLDTDSIRIIGSVLGQSIAL 231 Query: 733 DYFVSQIDGMVEE 771 DYFVSQ+DGMVEE Sbjct: 232 DYFVSQVDGMVEE 244 >gb|EXB65061.1| hypothetical protein L484_004237 [Morus notabilis] Length = 371 Score = 239 bits (611), Expect = 6e-61 Identities = 136/251 (54%), Positives = 168/251 (66%), Gaps = 9/251 (3%) Frame = +1 Query: 46 MGRWRTANSVARIIIT-KSKSI----PPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAP 210 MG+WR S+ IT SKS+ PP P I PK +F +FR+FSA P Sbjct: 1 MGKWRGGVSLLFHHITFSSKSLLAPNPPFFFSPSSSI--------PKSRFFSFRAFSALP 52 Query: 211 QYVD----DFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLP 378 + +F D + +DED GKIPVKAYFL TSI+LK +QAEN +V+P Sbjct: 53 SRISIDTGEFGSGSLYFDRNYGFGPKEDEDSGKIPVKAYFLSTSINLKSIQAENLSNVVP 112 Query: 379 SSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIV 558 ++SRS N+IALR++++ T + N +VVFQYGS VLFN+EDDE E YL +V Sbjct: 113 ATSRSSNYIALRYYDIPSQTTGFGLKRNLNCFRYVVVFQYGSAVLFNIEDDEVESYLDMV 172 Query: 559 RRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDY 738 RR+ASGLL EM+KDDYA+KEKPL+ EDM GGAD+IVLK LDTD +RII SVLGQSIALDY Sbjct: 173 RRHASGLLPEMRKDDYAVKEKPLMREDMLGGADYIVLKTLDTDGVRIIGSVLGQSIALDY 232 Query: 739 FVSQIDGMVEE 771 FVSQIDGM+EE Sbjct: 233 FVSQIDGMLEE 243 >ref|XP_002266340.1| PREDICTED: uncharacterized protein LOC100265119 [Vitis vinifera] gi|297738194|emb|CBI27395.3| unnamed protein product [Vitis vinifera] Length = 374 Score = 238 bits (607), Expect = 2e-60 Identities = 127/246 (51%), Positives = 161/246 (65%), Gaps = 4/246 (1%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ---- 213 MGRWR + + ++ + SK P P + P +F F FSA P Sbjct: 1 MGRWRCSILLGTLLTSTSKYPTPYFCRPFTPLRYLPFPSNPISRFFRFFPFSALPSPASV 60 Query: 214 YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSRS 393 Y +DF H + + + +D+ KIPVKA+FLCTSIDL+ MQAE+ +++P SSRS Sbjct: 61 YANDFVSGSHDFPHDYIFQPREDDGSEKIPVKAFFLCTSIDLRSMQAEHWSNIVPPSSRS 120 Query: 394 PNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYAS 573 N+I LR+++ T + ++ C MVVFQYGS VLFN+ D+E E YL+IVRRYAS Sbjct: 121 ANYIVLRYYDFPSEITGIGGEDNVGCCHYMVVFQYGSAVLFNIVDNEVEAYLKIVRRYAS 180 Query: 574 GLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQI 753 GLL EM+KDDYA+K+ P+L EDMQGG D+IVLKNLD D IRII VLGQSIALDYFVSQI Sbjct: 181 GLLPEMRKDDYAVKQNPVLAEDMQGGTDYIVLKNLDIDGIRIIGRVLGQSIALDYFVSQI 240 Query: 754 DGMVEE 771 DGMVEE Sbjct: 241 DGMVEE 246 >gb|EOY01796.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 297 Score = 235 bits (599), Expect = 1e-59 Identities = 135/261 (51%), Positives = 167/261 (63%), Gaps = 19/261 (7%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIV--PK-----------IQFLN 186 MGRWR A A ++ P +P HK L + PK ++FL Sbjct: 1 MGRWRAA---APLLFNHLAKTP----FPSHKFLTRFLPLKGRPKPLCLPPSLNLPLRFLF 53 Query: 187 FRSFSAAPQ----YVDDFEYNPHRI--DNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRM 348 R FSA P Y D E+ N + ++E+ GKIP+KAYFLCTSIDLK M Sbjct: 54 TRPFSAIPSQVSVYTSDSEHGSPDFFHQNYGFVSQEEEEETGKIPIKAYFLCTSIDLKSM 113 Query: 349 QAENSRDVLPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVED 528 QAEN +++P SSRS N+IALR+ + P TA + + C +VVFQYGS VLFN+ED Sbjct: 114 QAENLSNIVPPSSRSSNYIALRYCDFPPDITAFGMKDKVSSCRYIVVFQYGSAVLFNIED 173 Query: 529 DEAEYYLQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISS 708 E E YL+IVRR+ SGLL EM++DDYA+KE+P L +DMQGG D++VLK LDTDSIRII S Sbjct: 174 HEVESYLEIVRRHGSGLLPEMRRDDYAVKEQPQLAKDMQGGPDYVVLKTLDTDSIRIIGS 233 Query: 709 VLGQSIALDYFVSQIDGMVEE 771 VLGQSIALDYFVSQ+DGMVEE Sbjct: 234 VLGQSIALDYFVSQVDGMVEE 254 >gb|EOY01795.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 382 Score = 235 bits (599), Expect = 1e-59 Identities = 135/261 (51%), Positives = 167/261 (63%), Gaps = 19/261 (7%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIV--PK-----------IQFLN 186 MGRWR A A ++ P +P HK L + PK ++FL Sbjct: 1 MGRWRAA---APLLFNHLAKTP----FPSHKFLTRFLPLKGRPKPLCLPPSLNLPLRFLF 53 Query: 187 FRSFSAAPQ----YVDDFEYNPHRI--DNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRM 348 R FSA P Y D E+ N + ++E+ GKIP+KAYFLCTSIDLK M Sbjct: 54 TRPFSAIPSQVSVYTSDSEHGSPDFFHQNYGFVSQEEEEETGKIPIKAYFLCTSIDLKSM 113 Query: 349 QAENSRDVLPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVED 528 QAEN +++P SSRS N+IALR+ + P TA + + C +VVFQYGS VLFN+ED Sbjct: 114 QAENLSNIVPPSSRSSNYIALRYCDFPPDITAFGMKDKVSSCRYIVVFQYGSAVLFNIED 173 Query: 529 DEAEYYLQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISS 708 E E YL+IVRR+ SGLL EM++DDYA+KE+P L +DMQGG D++VLK LDTDSIRII S Sbjct: 174 HEVESYLEIVRRHGSGLLPEMRRDDYAVKEQPQLAKDMQGGPDYVVLKTLDTDSIRIIGS 233 Query: 709 VLGQSIALDYFVSQIDGMVEE 771 VLGQSIALDYFVSQ+DGMVEE Sbjct: 234 VLGQSIALDYFVSQVDGMVEE 254 >ref|XP_004297266.1| PREDICTED: uncharacterized protein LOC101294649 [Fragaria vesca subsp. vesca] Length = 368 Score = 234 bits (598), Expect = 2e-59 Identities = 139/253 (54%), Positives = 170/253 (67%), Gaps = 11/253 (4%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNF---RSFSAAPQY 216 MGRWR A+ ++ K+ +I T FS +I P QF F R FSA P Sbjct: 1 MGRWRAAS-----LLLKNITIITTPPKSSSSFTPFSSSI-PSYQFNFFIKSRPFSAIPSR 54 Query: 217 VD----DFE----YNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDV 372 V DFE Y PH + N+++D+GKIP+KAYFLCTSI+LK MQAE +V Sbjct: 55 VSIDPTDFESEPAYCPHE-------QLNENDDVGKIPIKAYFLCTSINLKSMQAEYLSNV 107 Query: 373 LPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQ 552 +P SSRS N+IALRF + N+ E+ + MVVFQYGS VLFNVE+ E + YL Sbjct: 108 IPPSSRSTNYIALRFCDFPSENSRFGVWENPSPWRYMVVFQYGSTVLFNVEEHEVQAYLN 167 Query: 553 IVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIAL 732 +V+ +ASG+LREM KDDYA+KEKP LV+DMQGG D+IVL+ LDTDSIRIISSVLGQSIAL Sbjct: 168 LVKGHASGMLREMVKDDYAVKEKPQLVDDMQGGPDYIVLRTLDTDSIRIISSVLGQSIAL 227 Query: 733 DYFVSQIDGMVEE 771 DYFVSQ+DGMVEE Sbjct: 228 DYFVSQVDGMVEE 240 >gb|EMJ24536.1| hypothetical protein PRUPE_ppa007391mg [Prunus persica] Length = 369 Score = 233 bits (593), Expect = 7e-59 Identities = 134/244 (54%), Positives = 160/244 (65%), Gaps = 2/244 (0%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQYV-- 219 MGRWR A+ + + T + + P H S F+N FSA P V Sbjct: 1 MGRWRAASLLLNHVTTTASKSSVAAKPPFHLSRPSSPIFC--FHFVNSIPFSAIPSRVSI 58 Query: 220 DDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSRSPN 399 D +Y+ HS ++ +DE+ K+PVKAYFLCTSI+LK MQAEN +V+P SSRS N Sbjct: 59 DTNDYDSEPPYYAHS-QTQEDEETKKVPVKAYFLCTSINLKSMQAENLSNVIPPSSRSTN 117 Query: 400 HIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYASGL 579 +IALRF + N L + C MVVFQYGS VLFNVED E YL IV R+ASGL Sbjct: 118 YIALRFCDFPSQNAELGVWGKPSYCRYMVVFQYGSAVLFNVEDHEVGAYLDIVIRHASGL 177 Query: 580 LREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQIDG 759 L EM+KDDYA++EKP L EDMQGG D+IVLK LDTD+IRII SVLGQSIALDYFVSQ+DG Sbjct: 178 LPEMRKDDYAVREKPKLEEDMQGGPDYIVLKTLDTDAIRIIGSVLGQSIALDYFVSQVDG 237 Query: 760 MVEE 771 MVEE Sbjct: 238 MVEE 241 >ref|XP_006391025.1| hypothetical protein EUTSA_v10018736mg [Eutrema salsugineum] gi|557087459|gb|ESQ28311.1| hypothetical protein EUTSA_v10018736mg [Eutrema salsugineum] Length = 371 Score = 232 bits (591), Expect = 1e-58 Identities = 132/248 (53%), Positives = 165/248 (66%), Gaps = 6/248 (2%) Frame = +1 Query: 46 MGRWRTANSVA--RIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ-- 213 MGRWR ++ I+ SK P ++P + + L+FR FSA P Sbjct: 1 MGRWRAVAALLLRNQIVNSSKPFPCVWRHPALGSHRSQAS-----RLLDFRHFSAFPSPI 55 Query: 214 --YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSS 387 Y +D + + + + ++ED GKIP+KAYFL TSIDLK MQA+N +V+P +S Sbjct: 56 SIYNNDSDSGSGDVYQNYEFGTKEEEDRGKIPIKAYFLSTSIDLKGMQADNLCNVVPPTS 115 Query: 388 RSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRY 567 RS N IAL+F + AL ES + C MVVFQYGS VLFNV+DD+ E YL IVRR+ Sbjct: 116 RSTNSIALKFSDSSSRIPALDERESVSSCRFMVVFQYGSAVLFNVDDDDVESYLDIVRRH 175 Query: 568 ASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVS 747 ASGLL EM+KDDYA+KEKPLL E+M+GG D+IVLK LDTDSIRII SVLGQSIALDYFVS Sbjct: 176 ASGLLTEMRKDDYAVKEKPLLTEEMRGGPDYIVLKTLDTDSIRIIGSVLGQSIALDYFVS 235 Query: 748 QIDGMVEE 771 Q+D +VEE Sbjct: 236 QVDKLVEE 243 >ref|XP_006438776.1| hypothetical protein CICLE_v10031839mg [Citrus clementina] gi|568859092|ref|XP_006483076.1| PREDICTED: uncharacterized protein LOC102620927 isoform X1 [Citrus sinensis] gi|557540972|gb|ESR52016.1| hypothetical protein CICLE_v10031839mg [Citrus clementina] Length = 379 Score = 231 bits (589), Expect = 2e-58 Identities = 134/255 (52%), Positives = 166/255 (65%), Gaps = 13/255 (5%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSL------------TIVPKIQFLNF 189 MGRWR S+ TH H+ CNF+L + + F N Sbjct: 1 MGRWRLP------------SLLLTHMARTHENCNFNLFSHLEHLQSLKSPLFDRCFFFNL 48 Query: 190 RSFSAAPQYVDDFEYNPHRID-NTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSR 366 R FSA P + E + +D + + +++DE+IGKIPVKAYFL TSIDLK MQAEN Sbjct: 49 RHFSAIPHRLCT-ELDSGSVDFHPNYGLADEDEEIGKIPVKAYFLSTSIDLKSMQAENLT 107 Query: 367 DVLPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYY 546 V+P SSRS +IALR+ + +AL + + C MVVF YGS VLFN+ED E E Y Sbjct: 108 HVVPPSSRSTKYIALRYSDFPSEISALGVHGNVSHCRYMVVFHYGSAVLFNIEDHEVENY 167 Query: 547 LQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSI 726 L I+RR+ASG+L EM+KDDYAIKEKPLL EDMQGG D+IVLKNLDTDS+R+I SVLGQS+ Sbjct: 168 LHIIRRHASGMLPEMRKDDYAIKEKPLLAEDMQGGPDYIVLKNLDTDSVRVIGSVLGQSM 227 Query: 727 ALDYFVSQIDGMVEE 771 ALDYFVSQ+D +VEE Sbjct: 228 ALDYFVSQVDCLVEE 242 >ref|NP_564963.1| uncharacterized protein [Arabidopsis thaliana] gi|12325085|gb|AAG52494.1|AC018364_12 hypothetical protein; 13477-15179 [Arabidopsis thaliana] gi|12597794|gb|AAG60106.1|AC073178_17 hypothetical protein [Arabidopsis thaliana] gi|332196796|gb|AEE34917.1| uncharacterized protein AT1G69380 [Arabidopsis thaliana] Length = 373 Score = 224 bits (570), Expect = 3e-56 Identities = 130/257 (50%), Positives = 170/257 (66%), Gaps = 15/257 (5%) Frame = +1 Query: 46 MGRWRTA----------NSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRS 195 MG+WR NS R+ ++ S P ++P TI +FLNFR Sbjct: 1 MGKWRAVAALLLRNQLLNSSKRLNLSSS---PCVSKHP---------TIGLASRFLNFRH 48 Query: 196 FSAAPQ----YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENS 363 FSA P Y +D + + + + +E +GKIP+KAYFL TSIDLK MQAEN Sbjct: 49 FSAFPSPISIYNNDSDSGSNDAYQNYEFGTEAEEALGKIPIKAYFLSTSIDLKAMQAENL 108 Query: 364 RDVLPSSSRSPNHIALRFFNLLPTNT-ALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAE 540 +V+P +SRS N+IAL+F + P+ +L ES + C MVVFQYGS +LFNV+D++ + Sbjct: 109 CNVVPPTSRSTNYIALKFSDFTPSGIYSLDERESVSNCKFMVVFQYGSAILFNVDDNDVD 168 Query: 541 YYLQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQ 720 YL IVRR+ASGLL EM+KDDYA+KEKPLL+E+M+GG D+IVLK LDT+SIRII SVLGQ Sbjct: 169 RYLDIVRRHASGLLTEMRKDDYAVKEKPLLIEEMKGGPDYIVLKTLDTNSIRIIGSVLGQ 228 Query: 721 SIALDYFVSQIDGMVEE 771 SIALDY VSQ++ +VEE Sbjct: 229 SIALDYSVSQVNKLVEE 245 >gb|AAM61045.1| unknown [Arabidopsis thaliana] Length = 373 Score = 224 bits (570), Expect = 3e-56 Identities = 130/257 (50%), Positives = 170/257 (66%), Gaps = 15/257 (5%) Frame = +1 Query: 46 MGRWRTA----------NSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRS 195 MG+WR NS R+ ++ S P ++P TI +FLNFR Sbjct: 1 MGKWRAVAALLLRNQLLNSSKRLNLSSS---PCVSKHP---------TIGLASRFLNFRH 48 Query: 196 FSAAPQ----YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENS 363 FSA P Y +D + + + + +E +GKIP+KAYFL TSIDLK MQAEN Sbjct: 49 FSAFPSPISIYNNDSDSGSNDAYQNYEFGTEAEEALGKIPIKAYFLSTSIDLKAMQAENL 108 Query: 364 RDVLPSSSRSPNHIALRFFNLLPTNT-ALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAE 540 +V+P +SRS N+IAL+F + P+ +L ES + C MVVFQYGS +LFNV+D++ + Sbjct: 109 CNVVPPTSRSTNYIALKFSDFTPSGIYSLDERESVSNCKFMVVFQYGSAILFNVDDNDVD 168 Query: 541 YYLQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQ 720 YL IVRR+ASGLL EM+KDDYA+KEKPLL+E+M+GG D+IVLK LDT+SIRII SVLGQ Sbjct: 169 RYLDIVRRHASGLLTEMRKDDYAVKEKPLLIEEMKGGPDYIVLKTLDTNSIRIIGSVLGQ 228 Query: 721 SIALDYFVSQIDGMVEE 771 SIALDY VSQ++ +VEE Sbjct: 229 SIALDYSVSQVNKLVEE 245 >ref|XP_002888717.1| hypothetical protein ARALYDRAFT_476065 [Arabidopsis lyrata subsp. lyrata] gi|297334558|gb|EFH64976.1| hypothetical protein ARALYDRAFT_476065 [Arabidopsis lyrata subsp. lyrata] Length = 373 Score = 221 bits (563), Expect = 2e-55 Identities = 124/249 (49%), Positives = 167/249 (67%), Gaps = 7/249 (2%) Frame = +1 Query: 46 MGRWRTANSVA--RIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ-- 213 MG+WR ++ ++ SK + + Y + I +FL+FR FSA P Sbjct: 1 MGKWRAVAALLLRNQLLNSSKRLNLSSPY----VSKHHPAIGLASRFLDFRHFSAFPSPI 56 Query: 214 --YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSS 387 Y +D + + ++++E++GKIP+KAYFL T IDLK MQAEN +V+P +S Sbjct: 57 SIYNNDSDSGSTDAYQNYEFGTHEEEELGKIPIKAYFLSTGIDLKAMQAENLCNVVPPTS 116 Query: 388 RSPNHIALRFFNLLPTNT-ALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRR 564 RS N IAL+F + P+ + ES + C MVVFQYGS +LFNV+D++ + YL IVRR Sbjct: 117 RSTNSIALKFSDFTPSGIHTMDERESVSNCRFMVVFQYGSAILFNVDDNDVDRYLDIVRR 176 Query: 565 YASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFV 744 +ASGLL EM+KDDYA+KEKPLL E+M+GG D+IVLK LDT+SIRII SVLGQSIALDYFV Sbjct: 177 HASGLLTEMRKDDYAVKEKPLLTEEMKGGHDYIVLKTLDTNSIRIIGSVLGQSIALDYFV 236 Query: 745 SQIDGMVEE 771 SQ++ +VEE Sbjct: 237 SQVNKLVEE 245 >emb|CAN60864.1| hypothetical protein VITISV_030819 [Vitis vinifera] Length = 318 Score = 221 bits (563), Expect = 2e-55 Identities = 126/274 (45%), Positives = 160/274 (58%), Gaps = 32/274 (11%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ---- 213 MGRWR + + ++ + K P P + P +F F FSA P Sbjct: 1 MGRWRRSILLGTLLTSTCKYPTPYFCRPFTPLRYLPFPSNPISRFFRFFPFSALPSPASV 60 Query: 214 YVDDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCT---------------------- 327 Y +DF H + + + +D+ KIPVKA+FLCT Sbjct: 61 YANDFVSGSHDFPHDYIFQPREDDGSEKIPVKAFFLCTRSGIAIPILYSNIPNVTSGLRF 120 Query: 328 ------SIDLKRMQAENSRDVLPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVV 489 SIDL+ MQAE+ +++P SSRS N+I LR+++ T + ++ C MVV Sbjct: 121 QIHGFGSIDLRSMQAEHWSNIVPPSSRSANYIVLRYYDFPSEITGIGGEDNVGCCHYMVV 180 Query: 490 FQYGSVVLFNVEDDEAEYYLQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVL 669 FQYGS VLFN+ D+E E YL+IVRRYASGLL EM+KDDYA+K+ P+L EDMQGG D+IVL Sbjct: 181 FQYGSAVLFNIVDNEVEAYLKIVRRYASGLLPEMRKDDYAVKQNPVLAEDMQGGTDYIVL 240 Query: 670 KNLDTDSIRIISSVLGQSIALDYFVSQIDGMVEE 771 KNLD D IRII VLGQSIALDYFVSQIDGMVEE Sbjct: 241 KNLDIDGIRIIGRVLGQSIALDYFVSQIDGMVEE 274 >gb|ESW29686.1| hypothetical protein PHAVU_002G090200g [Phaseolus vulgaris] Length = 349 Score = 216 bits (551), Expect = 5e-54 Identities = 127/246 (51%), Positives = 158/246 (64%), Gaps = 5/246 (2%) Frame = +1 Query: 49 GRWRTANSVA-RIIITKSKSIPPTHQYPLHKICNFSLTIVPKIQFLNFRSFSAAPQYVDD 225 GRW+T + R+ + S S +P F S +AAP + D Sbjct: 3 GRWKTFSLFYNRLTTSSSSSYSSKSHFP---------------SFNRSLSLAAAPSEIPD 47 Query: 226 ---FEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDVLPSSSRSP 396 FE+ +D T KIPVKAYFL TSI+LK +QA+N R+++P SSRS Sbjct: 48 PDPFEFGAPHVDPTV-----------KIPVKAYFLSTSINLKGIQADNHRNIVPPSSRSS 96 Query: 397 -NHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQIVRRYAS 573 N++ALRF + + F + C MVV+QYGS VLFN+ED E E YL++V+R+AS Sbjct: 97 SNYVALRFCDFNLDSNGPGFHVKASNCQYMVVYQYGSAVLFNIEDHEVESYLELVKRHAS 156 Query: 574 GLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIALDYFVSQI 753 GLLREM+KDDYAIKEKPLLVEDMQGG D+IVLK+LDTD IRII SVLGQSIALDYFVSQ+ Sbjct: 157 GLLREMRKDDYAIKEKPLLVEDMQGGPDYIVLKSLDTDGIRIIGSVLGQSIALDYFVSQV 216 Query: 754 DGMVEE 771 DG+VEE Sbjct: 217 DGLVEE 222 >ref|XP_006302411.1| hypothetical protein CARUB_v10020483mg [Capsella rubella] gi|482571121|gb|EOA35309.1| hypothetical protein CARUB_v10020483mg [Capsella rubella] Length = 376 Score = 215 bits (548), Expect = 1e-53 Identities = 124/253 (49%), Positives = 166/253 (65%), Gaps = 11/253 (4%) Frame = +1 Query: 46 MGRWRTANSVA--RIIITKSKSI-----PPTHQYPLHKICNFSLTIVPKIQFLNFRSFSA 204 MGRWR ++ I+ SK + P ++P SL +VP +FL+FR S+ Sbjct: 1 MGRWRAVAALLLRNQILKSSKRLLDPCSPCVSKHP-------SLGLVPS-RFLDFRYLSS 52 Query: 205 APQYV----DDFEYNPHRIDNTHSLESNDDEDIGKIPVKAYFLCTSIDLKRMQAENSRDV 372 P + +D + + + +DE+ GKIP+KAYFL TSIDLK MQAEN +V Sbjct: 53 LPSPISTHSNDSDSGSTYAYQKYEFGTEEDEEQGKIPIKAYFLSTSIDLKAMQAENLCNV 112 Query: 373 LPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSCMVVFQYGSVVLFNVEDDEAEYYLQ 552 +P +SRS N IAL+F + L ES + C MVVFQYGS +LFN++D++ E YL Sbjct: 113 VPPTSRSTNFIALKFSDFASGIQTLDERESVSNCQFMVVFQYGSAILFNIDDNDVERYLD 172 Query: 553 IVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQSIAL 732 IVRR+ASGLL+ M+KDDYA+KE+PLL E+ +GG D+IVL+ LDTDSIRII +VLGQSIAL Sbjct: 173 IVRRHASGLLKTMQKDDYAVKEEPLLTEEFKGGPDYIVLRTLDTDSIRIIGTVLGQSIAL 232 Query: 733 DYFVSQIDGMVEE 771 DYFVSQ+ ++EE Sbjct: 233 DYFVSQVHKLLEE 245 >gb|EPS66903.1| hypothetical protein M569_07873, partial [Genlisea aurea] Length = 381 Score = 214 bits (544), Expect = 3e-53 Identities = 129/257 (50%), Positives = 163/257 (63%), Gaps = 15/257 (5%) Frame = +1 Query: 46 MGRWRTANSVARIIITKSKSIPPTH--QYPLHKICNFSLTIVPKIQFLNFRSFSAAPQ-- 213 MGRWRT + I+ SKSI + + K S+ + P+I N R FSA P Sbjct: 1 MGRWRTP-VILDHILRASKSIGNLNSSRNACPKSLFASIVLQPRIFLSNLRWFSALPSPS 59 Query: 214 --YVDDFEYNPHRIDNTHSLESNDDEDIGK------IPVKAYFLCTSIDLKRMQAENSRD 369 YV+D E+ ++D+ +DDE + IPVKA+F+CTS+DLK +QAENS Sbjct: 60 PCYVEDNEF---KVDSLSKSRISDDEAVKSDSETVTIPVKAFFVCTSVDLKSLQAENSSY 116 Query: 370 VLPSSSRSPNHIALRFFNLLPTNTALRFGESTNVCSC---MVVFQYGSVVLFNVEDDEAE 540 V+P + RS N IALRF L F SC M+VFQYGS VLFN++D E + Sbjct: 117 VIPQAGRSSNSIALRFRRPLFGQVIHSFQVVAGGVSCHRYMLVFQYGSAVLFNIQDQEVD 176 Query: 541 YYLQIVRRYASGLLREMKKDDYAIKEKPLLVEDMQGGADHIVLKNLDTDSIRIISSVLGQ 720 +YLQ V+R+ SGLL +++KDDYAIKE LL EDM+GG DHI+LK LD DSIRII VLGQ Sbjct: 177 HYLQTVKRHGSGLLSDIRKDDYAIKEMALLEEDMKGGPDHIILKTLDMDSIRIIGRVLGQ 236 Query: 721 SIALDYFVSQIDGMVEE 771 SIALD+FVSQ+DGMVEE Sbjct: 237 SIALDHFVSQVDGMVEE 253