BLASTX nr result
ID: Catharanthus22_contig00033213
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00033213 (1120 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EOX93854.1| Uncharacterized protein TCM_002832 [Theobroma cacao]   159   1e-36
ref|XP_006443555.1| hypothetical protein CICLE_v10024373mg [Citr...   145   2e-32
ref|XP_002301802.1| hypothetical protein POPTR_0002s24780g [Popu...   141   5e-31
ref|XP_002532735.1| hypothetical protein RCOM_1749890 [Ricinus c...   138   4e-30
ref|XP_002265382.2| PREDICTED: uncharacterized protein LOC100259...   131   4e-28
ref|XP_006340010.1| PREDICTED: protein MNN4-like [Solanum tubero...   127   8e-27
gb|EMJ01297.1| hypothetical protein PRUPE_ppa019374mg [Prunus pe...   126   2e-26
ref|XP_004292357.1| PREDICTED: uncharacterized protein LOC101302...   123   2e-25
gb|EXB99101.1| hypothetical protein L484_007008 [Morus notabilis]     122   2e-25
ref|XP_004144391.1| PREDICTED: uncharacterized protein LOC101214...   120   1e-24
ref|XP_004174080.1| PREDICTED: uncharacterized LOC101214978 [Cuc...   117   6e-24
ref|XP_004515037.1| PREDICTED: uncharacterized protein LOC101507...   114   7e-23
ref|XP_006586221.1| PREDICTED: uncharacterized protein LOC102663...   110   8e-22
ref|XP_003622856.1| hypothetical protein MTR_7g055560 [Medicago ...   110   8e-22
gb|ESW12665.1| hypothetical protein PHAVU_008G132000g [Phaseolus...   105   3e-20
ref|XP_006297814.1| hypothetical protein CARUB_v10013848mg [Caps...   103   1e-19
ref|NP_189149.2| uncharacterized protein [Arabidopsis thaliana] ...   102   3e-19
ref|XP_002883574.1| hypothetical protein ARALYDRAFT_480018 [Arab...
97 1e-17 ref|XP_002449446.1| hypothetical protein SORBIDRAFT_05g012520 [S... 72 3e-10 ref|XP_004979245.1| PREDICTED: DNA ligase 1-like [Setaria italica] 69 3e-09 >gb|EOX93854.1| Uncharacterized protein TCM_002832 [Theobroma cacao] Length = 503 Score = 159 bits (403), Expect = 1e-36 Identities = 130/396 (32%), Positives = 192/396 (48%), Gaps = 51/396 (12%) Frame = -3 Query: 1115 LLLLAFLTTISPPHHAAPNST------TKLSFLIAAYNALLDKLCSNFXXXXXXXXXXXX 954 LLLLAF T ++P S +K+SFL+ Y L++ L S Sbjct: 65 LLLLAFFT-LTPSFVNQAGSCYLELPESKVSFLLTTYQTLVETLRSK--TDDESEGFACL 121 Query: 953 XXXEVFRIVFDTAISHQI-----EVLEVGEIQIGAE------------------------ 861 E ++IVF+T+ + +I +VLE+ + G + Sbjct: 122 EELEAYKIVFETSTTLEIRENPDQVLELESKEDGLQAVEAPVAKGSSRESKSLGVPETLT 181 Query: 860 ------------RKNSNWVAEDIKMKEEKKLEQLDEFDNTNGVELKKIEAAMDTNAHKAV 717 R +N V +K+ E+ L++ + +N + + +K ++ ++K Sbjct: 182 SIILDEKSAEIARPETNQVMAVVKIFEDF-LQEKEGVENLSSKKREKEAKSLSVESNKGE 240 Query: 716 EKSQENSSVLRNGSEAESGNNRKSPNYTVEIGKGGQNHSERILKGP-MNSSHRTHEEEEN 540 E+ +E + +R+GS+A GN P V GG++ ++ ++ + ++ T + +N Sbjct: 241 EQKEE--AFMRSGSKAILGNKISDPK--VRADNGGEHAAKAMVNSKRVIANWSTENDGDN 296 Query: 539 SSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLW 360 SS KV + +LGN+GSMRKEK+WKRTLACKLFEER GMDLLW Sbjct: 297 SSSKVTDNNKTMGSSLGNFGSMRKEKEWKRTLACKLFEER-------HNVDGGEGMDLLW 349 Query: 359 ETYESSESKPKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFS 180 ETYE+ +K + +D +S+GQLCCLQALKFS Sbjct: 350 ETYETDSNKVQLKSSSKKGKKGGNEYYD----------DEDDYEEDSDGQLCCLQALKFS 399 Query: 179 AGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKK 81 AGKMNLGMGR ++ISKA+KGIGWLH VSSRH KK Sbjct: 400 AGKMNLGMGRPNLVKISKALKGIGWLHHVSSRHGKK 435 >ref|XP_006443555.1| hypothetical protein CICLE_v10024373mg [Citrus clementina] gi|568851101|ref|XP_006479232.1| PREDICTED: uncharacterized protein LOC102628840 [Citrus sinensis] gi|557545817|gb|ESR56795.1| hypothetical protein CICLE_v10024373mg [Citrus clementina] Length = 431 Score = 145 bits (367), Expect = 2e-32 Identities = 129/389 (33%), Positives = 
181/389 (46%), Gaps = 40/389 (10%) Frame = -3 Query: 1118 ALLLLAFLTTISPPHHAAPNSTTKLSFLIAAYNALLDKLCSNFXXXXXXXXXXXXXXXEV 939 +LLLLA LTT NS K+SFL++AY ++KL SN E Sbjct: 63 SLLLLALLTTFVRDTELCENS--KVSFLLSAYRNAVEKLRSNSDDSTTDEQSLNLEDLEA 120 Query: 938 FRIVFDTAISHQIEVLEVGEIQIGAERKNSNW------------------VAEDIKMKEE 813 ++IVFDT+ ++EVGEI +G + + + E I + E Sbjct: 121 YKIVFDTS-----SIIEVGEISVGVSEETNGLSSSNSEAAPVDKHLCRESLVEIITLAEI 175 Query: 812 KKLE--------QLDEFDNTNG--VELKKIEAAMDTNAHKAVEKSQENSSVLRNGSEAES 663 K E QL + + G +E + + +D + K ++ + SSV G + Sbjct: 176 MKAESDRQQSSSQLIAEEKSLGGFLEEEDHDVFVDVSCEKGEKEEVKPSSV---GLHNNN 232 Query: 662 GNNRKSPNYTVEI-----GKGGQN----HSERILKGPMNSSHRTHEEEENSSLKVESSRA 510 NN K+ V++ K +N +S+R+L H +E + S Sbjct: 233 NNNDKAEETKVDLFMSSGSKALENKVRLNSQRVLLLG-GGDHLWSDENDGGEFTHSPSFG 291 Query: 509 LNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYESSESKP 330 + LG++GSMRKEK+W+RTLACKLFEER GMD+LWETYE+ Sbjct: 292 SS---LGSFGSMRKEKEWRRTLACKLFEER----HNNVDQGSCEGMDMLWETYEADHEST 344 Query: 329 KGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR 150 K K K+ +GQLCCLQALKFSAGKMNLGMGR Sbjct: 345 KQQQR-QQLLAKSKTKKGKKWRSKYDDDEEEDEEEIDDGQLCCLQALKFSAGKMNLGMGR 403 Query: 149 ---LRISKAIKGIGWLHQVSSRHSKKVHN 72 ++ISKA KGIGWLH V ++H KK+++ Sbjct: 404 PNLVKISKAFKGIGWLHNV-TKHGKKIYH 431 >ref|XP_002301802.1| hypothetical protein POPTR_0002s24780g [Populus trichocarpa] gi|222843528|gb|EEE81075.1| hypothetical protein POPTR_0002s24780g [Populus trichocarpa] Length = 448 Score = 141 bits (355), Expect = 5e-31 Identities = 94/250 (37%), Positives = 137/250 (54%), Gaps = 5/250 (2%) Frame = -3 Query: 806 LEQLDEFDNT-NGVELKKIEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYTV 630 L Q +EF++ E K+ ++ N++KA ++ +E S ++ E + S Sbjct: 219 LHQKEEFEDIWFQKEEKEALKPLNVNSNKAEDRKEEQSMIISGSKEI---GQKISEAKVS 275 Query: 629 EIGKGGQNHSERILKGPMNSSHRTHEEEENSSLKV-ESSRALNYINLGNYGSMRKEKDWK 453 + G G +S ++ + ++ + + KV ++S+ L + NLG++GSMRKEK+W+ Sbjct: 276 DDGGGEHYYSPKLSSQELEANPWSPGNGGGYNSKVKDNSQTLGHSNLGSFGSMRKEKEWR 335 Query: 452 
RTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYESSESKPKGHHGIXXXXXXXXXKFDV 273 RTLACKLFEER GMD+LWETYE+ +K + +D Sbjct: 336 RTLACKLFEER-------HNVDGGEGMDMLWETYETDSTKVQAKGRAKKGKKGSIEYYD- 387 Query: 272 KYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQV 102 +S+GQLCCLQALKFSAGKMNLGMGR ++ISKA+KGIGWLH V Sbjct: 388 --------DEEDLEEEKSDGQLCCLQALKFSAGKMNLGMGRPNLVKISKALKGIGWLHHV 439 Query: 101 SSRHSKKVHN 72 S+HSKK H+ Sbjct: 440 -SKHSKKGHH 448 >ref|XP_002532735.1| hypothetical protein RCOM_1749890 [Ricinus communis] gi|223527512|gb|EEF29637.1| hypothetical protein RCOM_1749890 [Ricinus communis] Length = 424 Score = 138 bits (348), Expect = 4e-30 Identities = 124/377 (32%), Positives = 177/377 (46%), Gaps = 38/377 (10%) Frame = -3 Query: 1115 LLLLAFLTTISP----PHHAAPNSTTKLSFLIAAYNALLDKLCSNFXXXXXXXXXXXXXX 948 LLLL FLT +SP + + S +K+SFL+ Y ++++L S Sbjct: 64 LLLLVFLT-VSPNLVHDNLSTELSESKVSFLLGTYQTVVERLRSKVEEHGNPELNQFEEL 122 Query: 947 XEVFRIVFDTAI----SHQIEVLE--------------VGEIQIGAERKNSNWV----AE 834 V++IVFDT+ + I+VLE V + N N V +E Sbjct: 123 E-VYKIVFDTSDFDIGENPIQVLESDAKENCLTSDATQVKNNSSSEDSGNENLVVITRSE 181 Query: 833 DIKMKEEKK-----LEQLDEFDNTNGVELKKIEAAMDTNAHKAVEKSQENSSVLRNGSEA 669 ++ E K L Q +EF+ + K + +N +K VE Q+ +R+GS+A Sbjct: 182 SSQLIAEAKPLGVFLHQKEEFEELASKKEAKDVKPLSSNFNK-VESEQKEEPYMRSGSKA 240 Query: 668 ESGNNRKSPNYTVEIGKGGQ----NHSERILKGPMNSSHRTHEEEENSSLKVESSRALNY 501 R + + GG+ +S+++ P +S E + ++ A Sbjct: 241 MGYKLRDAK---ISADDGGECLSRMNSQKLDSNPWSSPDNGGEYNSKAMNNSQTMGA--- 294 Query: 500 INLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYESSESKPKGH 321 NLG++GSMRKEK+W+RTLACKLFEER GMD+LWETYE+ K +G Sbjct: 295 -NLGSFGSMRKEKEWRRTLACKLFEER-------HNADGGEGMDMLWETYETDSIKVQGK 346 Query: 320 HGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR--- 150 + SNGQLCCLQALKFSAGKM+LGMGR Sbjct: 347 SKSKKGKKGNIERH------HDDDVDDEDEDELSNGQLCCLQALKFSAGKMSLGMGRPNL 400 Query: 149 LRISKAIKGIGWLHQVS 99 ++ISKA+KGIGWLH V+ Sbjct: 401 VKISKALKGIGWLHHVT 417 >ref|XP_002265382.2| PREDICTED: uncharacterized protein LOC100259312 
[Vitis vinifera] Length = 398 Score = 131 bits (330), Expect = 4e-28 Identities = 119/362 (32%), Positives = 172/362 (47%), Gaps = 16/362 (4%) Frame = -3 Query: 1112 LLLAFLTTISPPHHAAPNST-TKLSFLIAAYNALLDKLCSNFXXXXXXXXXXXXXXXEVF 936 LL+ L T+SP +P S+ +KL FL+ ++LDKL E + Sbjct: 63 LLVLALLTVSPTLLLSPESSDSKLGFLLEKCGSVLDKLRP--IVDGQCEDLRCFEELEAY 120 Query: 935 RIVFDTAISHQIEVLEVGEIQIGAERKNSNWVAED-IKMKEEKKLEQLDEFDNTNGVELK 759 +IVF+ A + ++ E +++ +E K+ E + +K E N E K Sbjct: 121 KIVFEAA-TFEVRDEERQPLELESEEKHCLPAFEGAVVVKTE------------NVAEEK 167 Query: 758 KIEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYT-VEIGKGGQNHSERILKG 582 + E ++ + + ++ V G+E++ + ++ T V G G + + Sbjct: 168 RGEGLLEVGEDGNISEKVKDKKVKAVGAESDKVDGQEERLTTGVSEGVGSKIGEIALRVT 227 Query: 581 PMNSSHRTHEEEENSSL---KVESSRALNYI-------NLGNYGSMRKEKDWKRTLACKL 432 N T + ++S + V+SS Y NLG++GSMRKEK+WKRTLACKL Sbjct: 228 ADNGGDYTSKGADDSQMVAASVKSSEGDYYYSPKRDMENLGSFGSMRKEKEWKRTLACKL 287 Query: 431 FEERXXXXXXXXXXXXXXGMDLLWETYESSESKPKGHHGIXXXXXXXXXKFDVKYFVXXX 252 FEER GMDLLWETYE+ SK +V Y+ Sbjct: 288 FEER-------NNADGGEGMDLLWETYETDSSKV--IKAKNDRKKSKKKGEEVGYY--SE 336 Query: 251 XXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKK 81 + QLCCLQALKFSAGKMNLGMGR ++ +KA+KGIGWLHQV SRH +K Sbjct: 337 EEDEGEEEEGMDRQLCCLQALKFSAGKMNLGMGRPNLVKFTKALKGIGWLHQV-SRHGRK 395 Query: 80 VH 75 H Sbjct: 396 AH 397 >ref|XP_006340010.1| PREDICTED: protein MNN4-like [Solanum tuberosum] Length = 374 Score = 127 bits (319), Expect = 8e-27 Identities = 125/375 (33%), Positives = 164/375 (43%), Gaps = 22/375 (5%) Frame = -3 Query: 1118 ALLLLAFLT-TISPPHH-AAPNSTTKLSFLIAAYNALLDKLCSNFXXXXXXXXXXXXXXX 945 +LLLLA + TISP ++P+S + L++ NALL+ Sbjct: 64 SLLLLALVNNTISPAFFISSPDSDNVSTILLSFKNALLEA-------DAEIEEFDRFEDF 116 Query: 944 EVFRIVFDT---AISHQI--EVLEVGEIQIGAERKNSNWVAEDIKMKEEKKLEQLDEFDN 780 EV++IVF H E E + + K+S + ++ + ++DEF++ Sbjct: 117 EVYKIVFQENPIEFFHYTSPEESEKSLLDSSVQEKDSAIATATVDLENSGVVVEMDEFES 176 Query: 779 TN---GVELKKIEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYTVEIGKGGQ 609 N VE KKIE M 
T K VEK + ++ NGS+ E+ K + Sbjct: 177 KNCADNVERKKIEE-MGTKVEKVVEKQE---MMMGNGSK--------------EVDKVKK 218 Query: 608 NHSERILKGPMNSSHRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLF 429 HS L NLG+YGSMRKEK+W RTLACKL+ Sbjct: 219 AHSWSNLDQ----------------------------NLGSYGSMRKEKEWTRTLACKLY 250 Query: 428 EERXXXXXXXXXXXXXXGMDLLWETYESSESKPK---------GHHGIXXXXXXXXXKFD 276 EER MDLLWETYE K K G K Sbjct: 251 EERHNSSSDEG-------MDLLWETYELDSGKSKLKRDNTTKKKKKGESTSKSKSKSKSY 303 Query: 275 VKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQ 105 KY + QLCCLQALKFSAGK+NLGMG+ ++ISKAIKG GWLH Sbjct: 304 KKY--EEDKGEEEEEEDMNEQQLCCLQALKFSAGKINLGMGKPNLVKISKAIKGFGWLHH 361 Query: 104 VSSRHSKKVHNGDRF 60 V+ ++ KVH GDRF Sbjct: 362 VTKKN--KVHCGDRF 374 >gb|EMJ01297.1| hypothetical protein PRUPE_ppa019374mg [Prunus persica] Length = 424 Score = 126 bits (316), Expect = 2e-26 Identities = 120/382 (31%), Positives = 172/382 (45%), Gaps = 36/382 (9%) Frame = -3 Query: 1118 ALLLLAFLTTISPP---HHAAPN--STTKLSFLIAAYNALLDKLCSNFXXXXXXXXXXXX 954 ALL+LA+LT +SPP + A + S+ K+ L+ Y +L++L + Sbjct: 63 ALLVLAYLT-VSPPLVQDNVANSELSSIKVGCLVTTYQTVLERLQKSKADDSDGDDHEHE 121 Query: 953 XXXE-----VFRIVFDTAISHQIEVLEVGEIQIGAERKNSNWVAEDIKMKEEKKLEQLDE 789 V++IVFDT+ S +I V EI + V+ LE + Sbjct: 122 EFRSFEELEVYKIVFDTS-SFEISENPVEEICSQVSEAPVDDVSSHEGNATSAPLEAASD 180 Query: 788 FDNTNGVEL---KKIEAA----MDTNAHKAV--EKSQENSSVLRNGSEAESGNNRKSPNY 636 + N E+ ++E + N K EK + +S L N + + R Sbjct: 181 ILDENPAEVIAWPRVETLAAFFQEENWSKDFKEEKEVKPASTLSNKVDEDGKEKRSMRRA 240 Query: 635 TVEIGK-------GGQNHSERILKGPMNSSHRTHEEEENSSLKV-ESSRALNYINLGNYG 480 + ++ + M++S R ++KV E S+ L NLG++G Sbjct: 241 SKDLSSKTSFCEVSADYDEAQFTSKSMSNSQRLGANFGEDNIKVMEDSQMLMGPNLGSFG 300 Query: 479 SMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYESSES------KPKGHH 318 SMRKEK+W+RTLACKLFEER MD+LWETY+ +ES K K Sbjct: 301 SMRKEKEWRRTLACKLFEERHHNVEGGGEG-----MDMLWETYDETESIKATKGKSKSKK 355 Query: 317 GIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---L 147 G + + F +GQLCCLQALKFSAGKMNLGMGR + Sbjct: 356 
GKNGKVEEEDDGEEEEDF---------------DGQLCCLQALKFSAGKMNLGMGRPNLV 400 Query: 146 RISKAIKGIGWLHQVSSRHSKK 81 + SKA+KG GWLH V ++H KK Sbjct: 401 KFSKALKGFGWLHHV-TKHGKK 421 >ref|XP_004292357.1| PREDICTED: uncharacterized protein LOC101302725 [Fragaria vesca subsp. vesca] Length = 570 Score = 123 bits (308), Expect = 2e-25 Identities = 93/238 (39%), Positives = 122/238 (51%), Gaps = 13/238 (5%) Frame = -3 Query: 755 IEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNN-----RKSPNYTVEIGKGGQNHSERI 591 +EAA KAVE+ +E + + E G R+S + + GG + Sbjct: 342 LEAASVILIQKAVEEEKEVKPLSAYFDKVEDGEEKRLTRRESKDRDLGANDGGFRSKSMV 401 Query: 590 LKGPMNSSHRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXX 411 +K S+ E+ +E S+ + NLG++GSMRKEK+W+RTLACKLFEER Sbjct: 402 IKSQFLGSNLGSPGEK----AMEDSQIMGP-NLGSFGSMRKEKEWRRTLACKLFEER--- 453 Query: 410 XXXXXXXXXXXGMDLLWETYESSESKPKGHHGIXXXXXXXXXKFDVKYF----VXXXXXX 243 GMD+LWETY+ +ES K GI K + V Sbjct: 454 --HHNVDGGGEGMDMLWETYDETES-GKALQGIKSKSKKQGKKINGNKIDHNEVDGDDGE 510 Query: 242 XXXXXXESNGQLCCLQALKFSAGKMNLG-MGR---LRISKAIKGIGWLHQVSSRHSKK 81 NGQLCCLQALKFSAGKMNLG MGR ++I+KA+KG GWLH V ++HSKK Sbjct: 511 EEEDEELDNGQLCCLQALKFSAGKMNLGHMGRPNLVKITKALKGFGWLHHV-TKHSKK 567 >gb|EXB99101.1| hypothetical protein L484_007008 [Morus notabilis] Length = 442 Score = 122 bits (307), Expect = 2e-25 Identities = 99/261 (37%), Positives = 134/261 (51%), Gaps = 13/261 (4%) Frame = -3 Query: 824 MKEEKKLEQLD-EFDNTNGVELKKIEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRK 648 ++EE++LE + + ++ E+K + H+ +K QE + R+GS+ G+ K Sbjct: 217 LQEERELENMSCKKEDKEDTEVKPWIVESEKVDHQDQDKKQE-VLLTRSGSKV-IGSRIK 274 Query: 647 SPNYTVEIGKGGQNHSERILKGPMNSSHRTHEEEENSSLKVESSRALNYINLGNYGSMRK 468 S + + S+ P H H+ SS+ +S + +LG++GSMRK Sbjct: 275 SLS---------RASSQEYFASP--DRHFDHQYSWKSSMDQDSQTFDS--SLGSFGSMRK 321 Query: 467 EKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYESSESKP---------KGHHG 315 EK+W+RTLACKLFEER MDLLWETYE+SESK KG G Sbjct: 322 EKEWRRTLACKLFEERHNVDGGEG-------MDLLWETYETSESKKVQSSRSNSKKGKKG 374 Query: 314 
IXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LR 144 V+Y E+ GQLCCLQALKFSAGKMNLGMGR ++ Sbjct: 375 ------------SVEY---SDMDDDDDYEDEAEGQLCCLQALKFSAGKMNLGMGRPNLVK 419 Query: 143 ISKAIKGIGWLHQVSSRHSKK 81 ISKA+KGIGW+ V RH KK Sbjct: 420 ISKALKGIGWITNV-GRHGKK 439 >ref|XP_004144391.1| PREDICTED: uncharacterized protein LOC101214978 [Cucumis sativus] Length = 357 Score = 120 bits (300), Expect = 1e-24 Identities = 83/229 (36%), Positives = 118/229 (51%), Gaps = 5/229 (2%) Frame = -3 Query: 743 MDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYTVEIGKGGQNHSERILKGPMNSS- 567 ++ + +++E + + +L + +EA++ ++++ +IG + + K SS Sbjct: 147 LEVDFQESMENFPQETQILPDETEAKTEESKEA-----QIGNRENEMMKDLRKLTEESSI 201 Query: 566 -HRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXX 390 RT +S S N LG+YGSMRKEK+W+RTLACKLFEER Sbjct: 202 SSRTESSPWSSPGSFSSREYNNNYTLGSYGSMRKEKEWRRTLACKLFEER-------HNS 254 Query: 389 XXXXGMDLLWETYESSESKPKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQ 210 GMD LWETYE+SESK + K GQ Sbjct: 255 EGTEGMDSLWETYENSESK-----NLQKKEKMNGKSTKGKKIQKKTDDDDEEEEDGEQGQ 309 Query: 209 LCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKKVHN 72 LCCLQALKFSAGKMNLGMG+ L+++KA+KG GWL++ SR K +H+ Sbjct: 310 LCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSR-KKLIHS 357 >ref|XP_004174080.1| PREDICTED: uncharacterized LOC101214978 [Cucumis sativus] Length = 270 Score = 117 bits (294), Expect = 6e-24 Identities = 82/229 (35%), Positives = 117/229 (51%), Gaps = 5/229 (2%) Frame = -3 Query: 743 MDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYTVEIGKGGQNHSERILKGPMNSS- 567 ++ + +++E + + +L + +EA++ ++++ +IG + + K SS Sbjct: 60 LEVDFQESMENFPQETQILPDETEAKTEESKEA-----QIGNRENEMMKDLRKLTEESSI 114 Query: 566 -HRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXX 390 RT +S S N LG+YGSMRKEK+W+RTLACKLFEER Sbjct: 115 SSRTESSPWSSPGSFSSREYNNNYTLGSYGSMRKEKEWRRTLACKLFEER-------HNS 167 Query: 389 XXXXGMDLLWETYESSESKPKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQ 210 GMD LWETYE+SE K + K GQ Sbjct: 168 EGTEGMDSLWETYENSELK-----NLQKKEKMNGKLTKGKKIQKKTDDDDEEEEDGEQGQ 222 Query: 
209 LCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKKVHN 72 LCCLQALKFSAGKMNLGMG+ L+++KA+KG GWL++ SR K +H+ Sbjct: 223 LCCLQALKFSAGKMNLGMGKPNLLKMTKALKGFGWLNRNGSR-KKLIHS 270 >ref|XP_004515037.1| PREDICTED: uncharacterized protein LOC101507381 [Cicer arietinum] Length = 436 Score = 114 bits (285), Expect = 7e-23 Identities = 99/293 (33%), Positives = 141/293 (48%), Gaps = 13/293 (4%) Frame = -3 Query: 914 ISHQIEVLEVGEIQIGAERKNSNWVAEDIKMKEEKKLEQL-DEFDNTNGV-------ELK 759 + ++ V EI+ E+ + + V + + K+L L E+ V E+K Sbjct: 170 VEEELNVKLDDEIENQVEKVDDHEVESIKPVSDVKRLVSLFQEYAELENVSCEKEEKEVK 229 Query: 758 KIEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYTVEIGKGGQNHS--ERILK 585 K +++ +K VE+S++ S+ R+GS+ +S NR V G + H+ ++ Sbjct: 230 KTILLLNSKFNK-VEESEKQWSI-RSGSKVKS--NRDMFGNKVR-GNSDEEHAFVAKVKV 284 Query: 584 GPMNSSHRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXX 405 + S R ++K S L NLG++GSMR EK+W+RTLACKLFEER Sbjct: 285 KKLESPQR--------NIKENDSGELCSTNLGSFGSMRVEKEWRRTLACKLFEERHNNSD 336 Query: 404 XXXXXXXXXGMDLLWETYESSESKPKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXX 225 MD+LWETY+ ES G+ K +V+ Sbjct: 337 GSEG------MDMLWETYDEKESNKVV--GMKKSNTKRGKKSEVE-----CSEDEDEDED 383 Query: 224 ESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKKVH 75 E +LCCLQALKFS GKMNLGMGR L+ SKA+KGIGWLH V K H Sbjct: 384 EIGAKLCCLQALKFSTGKMNLGMGRPNLLKFSKALKGIGWLHHVGKNGKKNNH 436 >ref|XP_006586221.1| PREDICTED: uncharacterized protein LOC102663802 [Glycine max] Length = 510 Score = 110 bits (276), Expect = 8e-22 Identities = 86/222 (38%), Positives = 105/222 (47%), Gaps = 7/222 (3%) Frame = -3 Query: 719 VEKSQENSSVLRNGSEAESGN--NRKSPNYTVEIGKGGQNHSERILKGPMNSSHRTHEEE 546 VE+S+E LR+GS+ GN N+ S N E S R+ E Sbjct: 311 VEESKEKWP-LRSGSKVVMGNRDNKVSTNSDGEFAFAA---SGRVKSLSQRLEANIGSPE 366 Query: 545 ENSSLKVESSRALNYIN--LGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGM 372 N V S + + N LG++GSMR EK+W+RTLACKLFEER M Sbjct: 367 SNW---VYSGKGMGNNNQALGSFGSMRVEKEWRRTLACKLFEERHNADGSEG-------M 416 Query: 371 DLLWETYESSESKPKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQA 192 
D+LWETYE+ +K K V + G+LCCLQA Sbjct: 417 DMLWETYETESNKILKKSNTKRGKK--------KGEVENSEDDEEEEEEDMEGKLCCLQA 468 Query: 191 LKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKKVH 75 LKFS GKMNLGMGR L+ SKA+KGIGWLH V K H Sbjct: 469 LKFSTGKMNLGMGRPNLLKFSKALKGIGWLHNVGKNGRKSNH 510 >ref|XP_003622856.1| hypothetical protein MTR_7g055560 [Medicago truncatula] gi|355497871|gb|AES79074.1| hypothetical protein MTR_7g055560 [Medicago truncatula] Length = 429 Score = 110 bits (276), Expect = 8e-22 Identities = 110/382 (28%), Positives = 159/382 (41%), Gaps = 35/382 (9%) Frame = -3 Query: 1115 LLLLAFLT-TISPPHHAAPNSTTKLSFLIAA------YNALLDKLCSNFXXXXXXXXXXX 957 LLL+AFLT T + HH + +T S + + + ++L + F Sbjct: 65 LLLVAFLTFTPNLVHHKGSSKSTSTSSVESYESKWCFFLSILQTFLAWFEADDKDEEIGL 124 Query: 956 XXXXEVFRIVFDTAISHQIEVLEVGEIQIGAERKNSNWVAED----IKMKEEKKLEQLDE 789 E + ++F +I E V + E + + E+ +M EEKK+ LDE Sbjct: 125 LNELEAYLVMFQASIFEVHEPKSVEDFVEEFEEADEEFSVEEKVVSCQMDEEKKVN-LDE 183 Query: 788 FDNTNGVELK---KIEAAMDTNAH----------KAVEKSQENSSVLRNGSEAESGNNRK 648 + VE+ K E +D + + V +E V++ + + + Sbjct: 184 ENKVEKVEIVESIKEEKVLDVKSLVTLFQEYAELENVSCEKEEKEVVKPILDTKFNKVEE 243 Query: 647 SPNYTVEIGKGGQNHSER-ILKGPMNSSHRTHEEEENSSL-------KVESSRALNYINL 492 S IG G + R + + +T +E+ S K + NL Sbjct: 244 SKETLWSIGNGSKVKGNRDMYANKVKVKSQTLDEDFGSPKSNWEYGGKGIGNNEEVCSNL 303 Query: 491 GNYGSMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYESSESKPKGHHGI 312 G++GSMR EK+W+RTLACKLFEER MD+LWETYE +K Sbjct: 304 GSFGSMRVEKEWRRTLACKLFEERHNNGDGSEG------MDMLWETYEKESNKVVKKSNT 357 Query: 311 XXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LRI 141 +F E +LCCLQALKFS GKMNLGMGR ++ Sbjct: 358 KKGKKLSEVEFS----------EDELEEEEVGAKLCCLQALKFSTGKMNLGMGRPNLVKF 407 Query: 140 SKAIKGIGWLHQVSSRHSKKVH 75 SKA+KGIGWLH V K H Sbjct: 408 SKALKGIGWLHHVGKNGKKNNH 429 >gb|ESW12665.1| hypothetical protein PHAVU_008G132000g [Phaseolus vulgaris] Length = 477 Score = 105 bits (263), Expect = 3e-20 Identities = 95/296 (32%), Positives = 132/296 (44%), Gaps = 16/296 (5%) Frame = -3 Query: 914 
ISHQIEVLEVGEIQIGAERKNSNWVAEDIKMKEEKKLEQLDEFDNTNGVELKKIEAAMDT 735 + +Q E+L+ ++ + V + + E K LE L F G+E E + Sbjct: 213 VENQKEILDENPVE------KVDKVEATMPIVEVKCLESL--FQAKEGLEDLSCEHKEEK 264 Query: 734 NAHKAVEKSQENSSVL--RNGSEAESGN----NRKSPNYTVEIGKGGQNHSERILKGPMN 573 K +EN L R+GS+ S N+ SP E G + + + + Sbjct: 265 PLIAEYNKVEENKEKLPLRSGSKVMSNRDIYTNKVSPVSDGEFGFAAPGLVKSLSQRLES 324 Query: 572 SSHRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXXX 393 + S + SS+AL N G++GSMR EK+W+RTLACKLFEER Sbjct: 325 NVGSPESNWVYSGKGIGSSQALGS-NHGSFGSMRVEKEWRRTLACKLFEER-------HN 376 Query: 392 XXXXXGMDLLWETYESSESK-------PKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXX 234 GMD+LWETYE+ +K KG G + + + Sbjct: 377 ADGSEGMDMLWETYETESNKVLQKSNTKKGKKGEIEKSEDEEEEEEEE------------ 424 Query: 233 XXXESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKKVH 75 + G+LCCLQALKFS GKMNLGMGR L+ SKA+KG GW + V K H Sbjct: 425 ---DMEGKLCCLQALKFSTGKMNLGMGRPNLLKFSKALKGFGWFNHVGKYGRKSNH 477 >ref|XP_006297814.1| hypothetical protein CARUB_v10013848mg [Capsella rubella] gi|482566523|gb|EOA30712.1| hypothetical protein CARUB_v10013848mg [Capsella rubella] Length = 406 Score = 103 bits (258), Expect = 1e-19 Identities = 98/293 (33%), Positives = 133/293 (45%), Gaps = 12/293 (4%) Frame = -3 Query: 923 DTAISHQIEVLEVGEIQIGAERKNSNWVAEDI----KMKEEKKLEQLDEFDNTNGVELKK 756 D SH+ +V E + AE K + ED+ K E KK EQ +E ++ K Sbjct: 152 DKFCSHESKVSEALTDEEPAEIKPLKF--EDLIDLEKEVETKKCEQEEEEEHKVKT---K 206 Query: 755 IEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRKSPNYTVEIGKGGQNHSERILKGPM 576 EA +D E+S+ L S ES + K ++ G+ +++ K Sbjct: 207 SEAVLDKGEEPTKEESKVQKVDLVGDSNDESNDLPKLSDFL------GEGKRDKVTK--- 257 Query: 575 NSSHRTHEEEENSSLKVESSRALNYINLGNYGSMRKEKDWKRTLACKLFEERXXXXXXXX 396 EEE+N SL+ ++GSMRKEK+W+RTLACKLFEER Sbjct: 258 KKEEEEDEEEDNVSLQ-------------SFGSMRKEKEWRRTLACKLFEER-------H 297 Query: 395 XXXXXXGMDLLWETYES-----SESKPKGHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXX 231 GMD LWETYE+ E K K D K + Sbjct: 298 NADVGQGMDQLWETYETQTEKKEEDKKKKLKKKTKSMMMKTKSIDHKEVI----VEEEDD 353 Query: 230 
XXESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIKGIGWLHQVSSRHSKK 81 + QLCCLQALKFS GKM+LG+ R L++SKA KGIG + +++HSKK Sbjct: 354 DVVDHQQLCCLQALKFSTGKMHLGIARPNLLKLSKAFKGIGRFYN-ANKHSKK 405 >ref|NP_189149.2| uncharacterized protein [Arabidopsis thaliana] gi|9294169|dbj|BAB02071.1| unnamed protein product [Arabidopsis thaliana] gi|332643461|gb|AEE76982.1| uncharacterized protein AT3G25130 [Arabidopsis thaliana] Length = 406 Score = 102 bits (254), Expect = 3e-19 Identities = 85/257 (33%), Positives = 122/257 (47%), Gaps = 5/257 (1%) Frame = -3 Query: 836 EDIKMKEEKKLEQLDEFDNTNGVELK-KIEAAMDTNAHKAVEKSQENSSVLRNGSEAESG 660 ED+ + E+++ + E + ++K K + +D E+S+ L S ES Sbjct: 183 EDVIVLEKEEETKKCEKEEVEEQKVKHKSDVVLDNREEPTKEESKAQKVDLVGDSNNESY 242 Query: 659 NNRKSPNYTVEIGKGGQNHSERILKGPMNSSHRTHEEEENSSLKVESSRALNYINLGNYG 480 + K N+ E G+G +N + +EEE+N SL+ ++G Sbjct: 243 DLPKLSNFLGE-GEGKRNVVTK------------NEEEDNVSLQ-------------SFG 276 Query: 479 SMRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYES-SESKPKGHHGIXXX 303 SMRKEK+W+RTLACKLFEER GMD LWETYE+ +E K + Sbjct: 277 SMRKEKEWRRTLACKLFEER-------HNADVGQGMDQLWETYETQTEKKQQTEEEKKKL 329 Query: 302 XXXXXXKFDVKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LRISKA 132 K + QLCCLQALKFS GKM+LG+ R L++SKA Sbjct: 330 KKKTKSMMKTKSIEKEVIVEEEDDDGIDHQQLCCLQALKFSTGKMHLGIARPNLLKLSKA 389 Query: 131 IKGIGWLHQVSSRHSKK 81 KGIG + +++HSKK Sbjct: 390 FKGIGRFYN-ANKHSKK 405 >ref|XP_002883574.1| hypothetical protein ARALYDRAFT_480018 [Arabidopsis lyrata subsp. lyrata] gi|297329414|gb|EFH59833.1| hypothetical protein ARALYDRAFT_480018 [Arabidopsis lyrata subsp. 
lyrata] Length = 406 Score = 97.1 bits (240), Expect = 1e-17 Identities = 83/255 (32%), Positives = 117/255 (45%), Gaps = 6/255 (2%) Frame = -3 Query: 827 KMKEEKKLEQLDEFDNTNGVELKKIEAAMDTNAHKAVEKSQENSSVLRNGSEAESGNNRK 648 K +E KK E+ +E E +K++ D E ++E S + + N Sbjct: 191 KEEETKKCEKEEE-------EEQKVKPESDVVLDNEEEPTKEESKAQKVDLVGDFNNESY 243 Query: 647 S-PNYTVEIGKGGQNHSERILKGPMNSSHRTHEEEENSSLKVESSRALNYINLGNYGSMR 471 P + +G+G +N + + EEE+N SL+ ++GSMR Sbjct: 244 DLPKLSKFLGEGKRNEATK------------KEEEDNVSLQ-------------SFGSMR 278 Query: 470 KEKDWKRTLACKLFEERXXXXXXXXXXXXXXGMDLLWETYES-SESKPKGHHGIXXXXXX 294 KEK+W+RTLACKLFEER GMD LWETYE+ +E K + Sbjct: 279 KEKEWRRTLACKLFEER-------HNADVGQGMDQLWETYETQTEKKHQTEEEKKKLKKK 331 Query: 293 XXXKFDVKYF-VXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGMGR---LRISKAIK 126 K + QLCCLQALKFS GKM+LG+ R L++SKA K Sbjct: 332 TKSMLKTKSIEKEVIVEEEDHDDGIDHQQLCCLQALKFSTGKMHLGIARPNLLKLSKAFK 391 Query: 125 GIGWLHQVSSRHSKK 81 GIG + +++HSKK Sbjct: 392 GIGRFYN-ANKHSKK 405 >ref|XP_002449446.1| hypothetical protein SORBIDRAFT_05g012520 [Sorghum bicolor] gi|241935289|gb|EES08434.1| hypothetical protein SORBIDRAFT_05g012520 [Sorghum bicolor] Length = 739 Score = 72.4 bits (176), Expect = 3e-10 Identities = 51/130 (39%), Positives = 59/130 (45%), Gaps = 6/130 (4%) Frame = -3 Query: 497 NLGNYGS-MRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXG--MDLLWETYESSESKPK 327 NL + GS RK+K+WKRTLACKL+EER MD+LWE YE K Sbjct: 591 NLLSEGSPSRKDKEWKRTLACKLYEERMQLRLCRDRAVVEGSDNMDMLWEAYEVGSGGNK 650 Query: 326 GHHGIXXXXXXXXXKFDVKYFVXXXXXXXXXXXXESNG---QLCCLQALKFSAGKMNLGM 156 G G V V + G QLCCLQALKFS KMN G Sbjct: 651 GRGGKRSGSKVKGSTSKVDDAVEEGEEEEEDADDDEEGSVRQLCCLQALKFSTRKMNFGG 710 Query: 155 GRLRISKAIK 126 G+ +SK K Sbjct: 711 GKPSLSKIAK 720 >ref|XP_004979245.1| PREDICTED: DNA ligase 1-like [Setaria italica] Length = 724 Score = 68.9 bits (167), Expect = 3e-09 Identities = 54/149 (36%), Positives = 68/149 (45%), Gaps = 6/149 (4%) Frame = -3 Query: 497 NLGNYGS-MRKEKDWKRTLACKLFEERXXXXXXXXXXXXXXG--MDLLWETYE--SSESK 333 NL + GS 
RK+K+WKRTLACKL+EER MD+LWE YE Sbjct: 576 NLVSEGSPSRKDKEWKRTLACKLYEERMQLRLCRDRAVVEGSDNMDMLWEAYEVGGGGGG 635 Query: 332 PKGHHGIXXXXXXXXXKFD-VKYFVXXXXXXXXXXXXESNGQLCCLQALKFSAGKMNLGM 156 KG G D V+ V E QLCCLQALK S KMN G Sbjct: 636 GKGRGGKRSGSKAKSVANDKVEELVDEGEEEEEEDDDEEVRQLCCLQALKLSTRKMNFGG 695 Query: 155 GRLRISKAIKGIGWLHQVSSRHSKKVHNG 69 G+ +SK K + + +S S++ +G Sbjct: 696 GKPSLSKITKVLRRMTALSRMGSRRKQSG 724