BLASTX nr result
ID: Forsythia21_contig00002853
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00002853
         (1556 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_011073916.1| PREDICTED: uncharacterized protein LOC105158...   465   e-128
ref|XP_012839112.1| PREDICTED: uncharacterized protein LOC105959...   442   e-121
gb|EPS65535.1| hypothetical protein M569_09246, partial [Genlise...   346   3e-92
ref|XP_009775615.1| PREDICTED: uncharacterized protein LOC104225...   335   8e-89
ref|XP_009623433.1| PREDICTED: uncharacterized protein LOC104114...   334   1e-88
emb|CDP10030.1| unnamed protein product [Coffea canephora]            328   5e-87
ref|XP_006364664.1| PREDICTED: uncharacterized protein LOC102596...   317   1e-83
ref|XP_003632065.1| PREDICTED: uncharacterized protein LOC100854...   311   1e-81
ref|XP_004247979.1| PREDICTED: uncharacterized protein LOC101254...   307   2e-80
ref|XP_007026507.1| Uncharacterized protein isoform 1 [Theobroma...   302   4e-79
ref|XP_012071565.1| PREDICTED: uncharacterized protein LOC105633...   273   2e-70
ref|XP_010525850.1| PREDICTED: uncharacterized protein LOC104803...   273   3e-70
ref|XP_010525843.1| PREDICTED: uncharacterized protein LOC104803...   273   3e-70
ref|XP_010241508.1| PREDICTED: uncharacterized protein LOC104586...   271   8e-70
ref|XP_002309823.1| hypothetical protein POPTR_0007s02350g [Popu...   270   2e-69
ref|XP_010241509.1| PREDICTED: uncharacterized protein LOC104586...   270   2e-69
ref|XP_004296731.1| PREDICTED: uncharacterized protein LOC101297...   270   3e-69
gb|KHG01403.1| Phosphoribosylformylglycinamidine synthase [Gossy...   266   4e-68
ref|XP_010667195.1| PREDICTED: uncharacterized protein LOC104884...
265 6e-68 ref|XP_012460306.1| PREDICTED: uncharacterized protein LOC105780... 265 7e-68 >ref|XP_011073916.1| PREDICTED: uncharacterized protein LOC105158762 [Sesamum indicum] gi|747055359|ref|XP_011073917.1| PREDICTED: uncharacterized protein LOC105158762 [Sesamum indicum] Length = 433 Score = 465 bits (1196), Expect = e-128 Identities = 256/423 (60%), Positives = 309/423 (73%), Gaps = 7/423 (1%) Frame = -2 Query: 1312 LMYLYYPERCRFRPRIINSAV-RRHYRRRLLKYSP----TPNSTPTIFKPSDDTLQITLR 1148 LMYL + RCRFR I+ SAV RRH+RRRLLKYS TP PTIF+ SDDTLQITL+ Sbjct: 12 LMYLSFSGRCRFRHTIVTSAVGRRHHRRRLLKYSSDSTLTPTQQPTIFRLSDDTLQITLK 71 Query: 1147 PS-NSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLC 971 P NSL+QL E KL++ ++ +AFDDL+++V+VD GGVVISCRRSTVEF+ L Sbjct: 72 PPLNSLQQL----EGKLHQFLNYGREAFDDLRTVVTVDGNNGGVVISCRRSTVEFLIALL 127 Query: 970 MSSLVIIFIFQALFKRRRSDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDY 791 +SSLV++ F+ LFK R + EVLVYKRDRSLGGREV+VGKRE NW T+ K+TPLS ++ Sbjct: 128 VSSLVVVTAFRGLFKLRENRGEVLVYKRDRSLGGREVVVGKRETNWSTSHKSTPLSGDNA 187 Query: 790 TD-EKKAKITRLRNRRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMS 614 +KK K L RR EELPQWWPQ++N GL+ + N+EEYQ MAN+LIRA+MD KMS Sbjct: 188 NYYQKKRKRKPLGRRRVEELPQWWPQVVNWGLH--DTGNKEEYQTMANQLIRAIMDRKMS 245 Query: 613 GKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQING 434 G+DIS NDI+QLRHICKT+GVR I TANARDSLYR +IN VL YCE ++N STS+QING Sbjct: 246 GEDISTNDIVQLRHICKTYGVRTFITTANARDSLYRVSINFVLDYCESMSNVSTSIQING 305 Query: 433 EDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVC 254 ED R+FIAGLADNIGLE+ RILQAWALEVQ+KHSEAL EL KVC Sbjct: 306 EDVREFIAGLADNIGLESAYAARMVSAAVAARTRSRILQAWALEVQNKHSEALVELFKVC 365 Query: 253 LIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFTICGEETHRSLVEALGLVGIK 74 +IHR+F MVARGL+ LS+EQRE IL S +CG++ +SLVEALGL G + Sbjct: 366 VIHRIFPPAENSPEMEMVARGLDKSLSVEQREYILNSFIDVCGKDIDQSLVEALGLGGAR 425 Query: 73 DSQ 65 Q Sbjct: 426 YEQ 428 >ref|XP_012839112.1| PREDICTED: uncharacterized protein LOC105959538 [Erythranthe guttatus] gi|604331881|gb|EYU36739.1| 
hypothetical protein MIMGU_mgv1a007033mg [Erythranthe guttata] Length = 422 Score = 442 bits (1138), Expect = e-121 Identities = 247/418 (59%), Positives = 303/418 (72%), Gaps = 10/418 (2%) Frame = -2 Query: 1309 MYLYYPERCRFRPRIINSAV-RRHYRRRLLKYSPTPNSTP----TIFKPSDDTLQITLR- 1148 M L RC FR I+ SA+ RRH+RRRLLKYSPTP +TP TIFK SDD LQITLR Sbjct: 6 MNLNCSHRCHFRHAIVTSAIPRRHHRRRLLKYSPTPANTPIFAPTIFKLSDDGLQITLRR 65 Query: 1147 PSNSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCM 968 PS SL+ + E KLN+LI +AFDDL+++V+VD GG VISCRRS+VEF+A L Sbjct: 66 PSTSLQ--VQQLETKLNQLIGRGREAFDDLRTVVAVDETNGGFVISCRRSSVEFLAALFF 123 Query: 967 SSLVIIFIFQALFKR-RRSDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSSND- 794 SSLV++ F+ LFK+ ++ EVLVYKRDRSLGG+EV+VGK+E N PT RK TPLSSND Sbjct: 124 SSLVVVIAFRGLFKQISKNSGEVLVYKRDRSLGGKEVVVGKKETNLPTRRKPTPLSSNDA 183 Query: 793 -YTDEKKAKITRLRNR-RKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNK 620 Y EKK T++ + RKEELPQWWPQ +N G E+ N+EEYQRMAN+LI A++D K Sbjct: 184 DYYYEKKINRTKILGKSRKEELPQWWPQAVNLGSP--EIENKEEYQRMANQLIGAIVDRK 241 Query: 619 MSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQI 440 M+G+DIS NDI+QLRH+CKT+GV+ SI TAN RDSLYR ++N VL+YCE I+N STS+QI Sbjct: 242 MAGEDISANDIVQLRHLCKTYGVKTSISTANTRDSLYRVSVNFVLNYCETISNISTSIQI 301 Query: 439 NGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSK 260 NGED +FIAGLADNIGLE+ +ILQAWALEVQ+KHSEAL EL K Sbjct: 302 NGEDVPEFIAGLADNIGLESTHAARIVSAAVAARTRSKILQAWALEVQNKHSEALAELFK 361 Query: 259 VCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFTICGEETHRSLVEALGL 86 VC+IH++F MVARGL+ LS+EQRE IL S + G+E +S+VEALGL Sbjct: 362 VCIIHQIFPPEENSPEMEMVARGLDKSLSVEQREQILNSFIAVSGKEIGQSVVEALGL 419 >gb|EPS65535.1| hypothetical protein M569_09246, partial [Genlisea aurea] Length = 400 Score = 346 bits (887), Expect = 3e-92 Identities = 201/403 (49%), Positives = 259/403 (64%), Gaps = 14/403 (3%) Frame = -2 Query: 1249 RRHYRRRLLKYSPTPN--------STP--TIFKPSDDTLQITLR-PSNSLKQLLDLSEIK 1103 RRH+RRRLLKYSP N STP TI K SD+ LQITL PSNSL+++ E K Sbjct: 5 
RRHHRRRLLKYSPNRNPETSPLIRSTPPITILKLSDNGLQITLSSPSNSLEKV----ESK 60 Query: 1102 LNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQALFKR 923 LN++I+C +AF DL++LV+ D G V ISCRRSTVEF L +S +++ I + +FK Sbjct: 61 LNQIIECGREAFFDLRTLVTFDEDYGRVSISCRRSTVEFFIGLFISGFLVVLIIRNVFKL 120 Query: 922 RRSDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSS---NDYTDEKKAKITRLRN 752 R++ + LVY+RDRSLGGREVLVG NW + + PL S +DY +K+ I + Sbjct: 121 RKNGRQALVYRRDRSLGGREVLVGTGHSNWSSKLTSNPLDSVSISDYHQKKRGIIQGMS- 179 Query: 751 RRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISENDIIQLRH 572 RKE+LPQWWPQ +S E N E YQR+AN+L++ ++D ++SG+DIS +DI+QLR+ Sbjct: 180 -RKEKLPQWWPQFHDSS---GEAPNTEGYQRIANQLVQGIVDRRVSGEDISMDDIVQLRY 235 Query: 571 ICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFIAGLADNI 392 +CK V SI TAN RDSLYR ++N L+YCE + S+QI E AR+F+AGLADNI Sbjct: 236 LCKAHRVNVSISTANTRDSLYRVSVNFTLNYCEGTLKEFASIQIGDEGAREFVAGLADNI 295 Query: 391 GLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFXXXXXXXX 212 G+ +ILQAWALEVQ+KHSEAL ELSKVC IHRVF Sbjct: 296 GINAIQASRMVSGAVAARTHSKILQAWALEVQNKHSEALEELSKVCTIHRVFPPERNSAE 355 Query: 211 XXMVARGLENHLSLEQRELILESLFTICGEETHRSLVEALGLV 83 MV RGL L+ EQRE IL+ ++ E+T SL EALGLV Sbjct: 356 MEMVFRGLAKSLTPEQREHILDLFISLGAEDTSESLAEALGLV 398 >ref|XP_009775615.1| PREDICTED: uncharacterized protein LOC104225498 [Nicotiana sylvestris] Length = 463 Score = 335 bits (858), Expect = 8e-89 Identities = 199/431 (46%), Positives = 264/431 (61%), Gaps = 28/431 (6%) Frame = -2 Query: 1291 ERCR----FRPRIINSAV------RRHYRRRLLK-YSPTPNSTPTIFKPSDDTLQITLR- 1148 ERCR R +I+ V RRH RRRLLK + P T PSD L L Sbjct: 21 ERCRSFHHHRHYVISRRVSPSPPRRRHLRRRLLKKFYPNLTEDTTSPPPSDQNLHFILTI 80 Query: 1147 ---PSNSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAV 977 P+ SL + DL + KL+ +D S A DL++L+ +DS G V+ SCRRSTV+F+ Sbjct: 81 DDLPTKSLYSVKDLLDSKLSEFVDSSRAAIKDLQTLIRIDSNNGRVLFSCRRSTVQFLGT 140 Query: 976 LCMSSLVIIFIFQALFK---------RRRSDSEVLVYKRDRSLGGREVLVGKREENWPTT 824 L ++S V+IF +A+FK R+++ LVYKRDRSLGG+EVLV K E + Sbjct: 141 
LVITSFVVIFTLRAIFKLLVLGLRMNNERNNNVELVYKRDRSLGGKEVLVAKNETVY--R 198 Query: 823 RKTTPLSSNDYTDEKKAKITRLRNRRK----EELPQWWPQLLNSGLNLNEMINREEYQRM 656 K L S D + ++I R RRK E+LP+WWP +S + N+EEYQ+M Sbjct: 199 NKPNVLDSEDSNWDWGSRIRFSRRRRKKSSVEKLPKWWPVSTSSSDQVGAE-NQEEYQKM 257 Query: 655 ANRLIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYC 476 ANRLIRA++DN+M+GKDI E+DIIQLR I + GV+ S +T NARD+LYR AI+ VLSYC Sbjct: 258 ANRLIRAILDNRMTGKDILEDDIIQLRCIGRVSGVKVSFDTENARDTLYRVAIDFVLSYC 317 Query: 475 EIIANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQ 296 E AN+S V I GE+A+ FIAGLADN+GL+N R LQAWALE+Q Sbjct: 318 ESTANQSAFVLIGGEEAQNFIAGLADNVGLDNTRAARMVSAAVAARTRSRFLQAWALEIQ 377 Query: 295 DKHSEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFTICGEET 116 KHSEA EL K+C+IH++F MVARGLE HL ++QRE ++ +L +CG+++ Sbjct: 378 GKHSEAAVELFKICVIHQIFPPEEFSPEMEMVARGLEKHLKVDQREFLMNTLLRVCGDQS 437 Query: 115 HRSLVEALGLV 83 RS+ EALGL+ Sbjct: 438 RRSVAEALGLM 448 >ref|XP_009623433.1| PREDICTED: uncharacterized protein LOC104114644 [Nicotiana tomentosiformis] Length = 463 Score = 334 bits (857), Expect = 1e-88 Identities = 200/433 (46%), Positives = 265/433 (61%), Gaps = 30/433 (6%) Frame = -2 Query: 1291 ERCR----FRPRIINSAV------RRHYRRRLLK-YSPTPNSTPTIFKPSDDTLQITLR- 1148 ERCR R +I+ V RRH RRRLLK + P T PSD L L Sbjct: 19 ERCRSFHHHRHYVISRRVSPSPPRRRHLRRRLLKKFYPNLTEDTTSPPPSDQNLHFILTV 78 Query: 1147 ---PSNSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAV 977 P+ SL L DL + KL+ +D S A DL++L+ +DS G V+ SCRRSTV+F+ Sbjct: 79 DDLPTKSLYSLKDLLDSKLSEFVDSSRAAIKDLQTLIRIDSNNGRVLFSCRRSTVQFLGT 138 Query: 976 LCMSSLVIIFIFQALFK---------RRRSDSEVLVYKRDRSLGGREVLVGKREENWPTT 824 L ++S V+IF +A+FK R+++ LVYKRDRSLGG+EVLV K E + Sbjct: 139 LVITSFVVIFTLRAIFKLLVLGLRMNSNRNNNVELVYKRDRSLGGKEVLVAKNETVY--R 196 Query: 823 RKTTPLSSNDYTD--EKKAKITRLRNRRK----EELPQWWPQLLNSGLNLNEMINREEYQ 662 +K L S D + ++I R RRK E+LP+WWP +S + N+EEYQ Sbjct: 197 KKPNVLDSEDRNSNWDWGSRIRFSRRRRKKSSVEKLPKWWPVSTSSSDQVGAE-NQEEYQ 255 Query: 661 
RMANRLIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLS 482 RMANRLIRA++DN+M+GKDI E+DIIQLR I + GV+ S +T NARD+LYR AI+ VL+ Sbjct: 256 RMANRLIRAILDNRMTGKDILEDDIIQLRCIGRVSGVKVSFDTENARDTLYRVAIDFVLN 315 Query: 481 YCEIIANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALE 302 YCE AN+S V I GE+A+ FIAGLADN+GLEN + LQAWALE Sbjct: 316 YCESTANQSAFVLIGGEEAQNFIAGLADNVGLENTRAARMVSAAVAARTRSKFLQAWALE 375 Query: 301 VQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFTICGE 122 +Q KHSEA EL K+C+IH++F MVARGLE HL ++QRE ++ +L +CG+ Sbjct: 376 IQGKHSEAAMELFKICVIHQIFPPEEFSPEMEMVARGLEKHLKVDQREFLMNTLLRVCGD 435 Query: 121 ETHRSLVEALGLV 83 ++ RS+ EALGL+ Sbjct: 436 QSRRSVAEALGLM 448 >emb|CDP10030.1| unnamed protein product [Coffea canephora] Length = 460 Score = 328 bits (842), Expect = 5e-87 Identities = 201/450 (44%), Positives = 270/450 (60%), Gaps = 37/450 (8%) Frame = -2 Query: 1291 ERCRFR------PRIINSAV---RRHYRRRLLKYSPTPN--STPTIFKPSDDTLQITL-- 1151 +RCR P+II+ +V RRH+RRRLLK+ P + S PT+ + LQI L Sbjct: 16 KRCRINVYKFTPPKIISRSVSSRRRHHRRRLLKHHPDADHRSPPTV----NQNLQIVLTV 71 Query: 1150 ------RPSNSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVE 989 +P + +L+D S+ KL+R I + DAF++L++LV+VD T VV+SCRRSTV Sbjct: 72 DRLSNSKPVTYISELVDASQSKLSRFIYAADDAFENLRTLVTVDGATKRVVVSCRRSTVH 131 Query: 988 FMAVLCMSSLVIIFIFQALFKRRRSDSEV-------LVYKRDRSLGGREVLVGKREENWP 830 F+ + +SSLVIIF+F+ L K +S+ ++Y+RDRSLGGREV V K + N+ Sbjct: 132 FLGFVLLSSLVIIFVFRVLIKLLIGNSDSFSENNGGVIYRRDRSLGGREVAVAKVDTNFR 191 Query: 829 TTRKTTPLSSNDY--------TDEKKAKITRLRNRRKEELPQWWPQLLNSGLNLNEMINR 674 S N+ + K+ R + R E+LPQWWP L E N+ Sbjct: 192 KNENKKKGSENNILMLMLESENEIKRPFWERRKKRSAEKLPQWWPVSSQGPGLLVE--NK 249 Query: 673 EEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAIN 494 EEYQ MANRLI+A+MD ++ G+DIS +DI+QLR IC+ GVR IE NARDS+YRA+++ Sbjct: 250 EEYQMMANRLIQAIMDKRIRGEDISMDDIVQLRRICRISGVRVLIEVENARDSIYRASVD 309 Query: 493 LVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQA 314 VL CE I N+S + I+GED FIAGLA+NIGLEN R LQA Sbjct: 310 
FVLQCCERIENQSAFINIDGEDVHHFIAGLAENIGLENSRASRMVSAAVAARTRSRFLQA 369 Query: 313 WALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFT 134 WAL++Q HSEA+ EL K+CLIH++F MVARGLE L+++QREL+L L Sbjct: 370 WALKIQGNHSEAVAELLKICLIHKIFPPEESSAEMEMVARGLEKQLNVDQRELLLNMLIR 429 Query: 133 ICGEETHRSLVEALGLVGIKDS---QDKRV 53 CGE T RS+ EALGL+ S Q+KRV Sbjct: 430 TCGEGTRRSMTEALGLIQPPQSDVEQEKRV 459 >ref|XP_006364664.1| PREDICTED: uncharacterized protein LOC102596187 [Solanum tuberosum] Length = 455 Score = 317 bits (813), Expect = 1e-83 Identities = 189/431 (43%), Positives = 265/431 (61%), Gaps = 24/431 (5%) Frame = -2 Query: 1294 PERCRF---RPRIINSAV--RRHYRRRLLKYSPTPNSTPTIFKPSDDTLQITLR----PS 1142 P+RCR R I+ ++ RRH RRRL K+S TP PSD L L P+ Sbjct: 20 PKRCRHYHVSSRRISPSLPRRRHLRRRLKKFST--EDTP----PSDQNLHFVLTVDNLPT 73 Query: 1141 NSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSS 962 S + DL +KL + A +DL++L+ VD+ G + SC RSTV+F+A L +SS Sbjct: 74 KSFYSIKDLLHLKLGEFLHSGRAAIEDLRTLIRVDTDAGRLSFSCTRSTVKFLATLVVSS 133 Query: 961 LVIIFIFQALFKRRR-------SDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLS 803 ++IF +A+ R +++ LVYKRDRSLGGREVLV K E +K L Sbjct: 134 FLLIFTLRAIVNLVRGIRLNSGNNNVELVYKRDRSLGGREVLVAKNETPTLDRKKPNVLD 193 Query: 802 SNDYTD----EKKAKITRLRNRRK----EELPQWWPQLLNSGLNLNEMINREEYQRMANR 647 S++ ++ + I+ R R+K E+LP+WWP + SG + N+EEYQRMANR Sbjct: 194 SDEGNSNWDWDRDSPISFSRRRKKKSSVEQLPKWWP-VSTSGSDQVGAENQEEYQRMANR 252 Query: 646 LIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEII 467 LIRA++DN+M+GKDI +DIIQLR I + V+ S +T NARD+L+R A++ +L+YCE Sbjct: 253 LIRAILDNRMTGKDILADDIIQLRRIGRISNVKVSFDTENARDTLFRVAVDFILNYCEST 312 Query: 466 ANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKH 287 A++ST + I+GE+A+ F+AGLADN+GLE+ R LQAWALE+Q KH Sbjct: 313 ASQSTFLLIDGEEAQNFVAGLADNVGLESTRAARMVSAAVAARTRSRFLQAWALEMQGKH 372 Query: 286 SEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFTICGEETHRS 107 SEA+ EL K+C+IH++F MVARGLE HL ++QRE ++ SL +CG+ET RS Sbjct: 373 SEAVVELFKICVIHQIFPPEEFSPEMEMVARGLEKHLKVDQREFLMNSLLHVCGDETRRS 432 
Query: 106 LVEALGLVGIK 74 + EALGL+ +K Sbjct: 433 VAEALGLMYLK 443 >ref|XP_003632065.1| PREDICTED: uncharacterized protein LOC100854590 isoform X1 [Vitis vinifera] Length = 436 Score = 311 bits (796), Expect = 1e-81 Identities = 186/412 (45%), Positives = 249/412 (60%), Gaps = 13/412 (3%) Frame = -2 Query: 1282 RFRPRIINSAVRRHYRRRLLKYSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLLDLSEIK 1103 R P I +S RR+ ++ Y P N+ P+ D L + + L +L D ++I Sbjct: 20 RLYPPISSSIRRRNALKKPHHYHPHHNNKPS----PDPKLHMVV----DLHRLSDRAQIL 71 Query: 1102 LNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQALFK- 926 LNRL+ DA DDL++LV+VD T VVI+CR ST+ F+ + SLV++F F+ L + Sbjct: 72 LNRLVSSGADAIDDLRTLVAVDRATQSVVIACRPSTLRFVGGFVVWSLVVVFGFRVLVRL 131 Query: 925 ----RRR---SDSEVLVYKRDRSLGGREVLVGKREEN-WPTTRKT----TPLSSNDYTDE 782 RR +V +RDRSLGG+EV+VG+ EE+ W + +PLS Sbjct: 132 GLRLRREFGFGSGRGVVVRRDRSLGGKEVVVGRAEESEWRMRNHSRVLGSPLSVVPGIGV 191 Query: 781 KKAKITRLRNRRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDI 602 + R+R ++ LP+WWP L L E+ +++EYQR ANRLIR +M N+MSGKDI Sbjct: 192 NGGDWSPGRSRTEKRLPKWWPVTLPPPL---EVFDKQEYQREANRLIREIMANRMSGKDI 248 Query: 601 SENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDAR 422 E+D+IQLR IC+T G RASI+TANARDS YR ++ V++ C + +STSV+I+GEDAR Sbjct: 249 LEDDMIQLRRICRTSGARASIDTANARDSFYRTSVEFVINICSRASGQSTSVEIDGEDAR 308 Query: 421 QFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHR 242 QFIAGLADN+GLEN LQAWALE+Q +HSEA+ ELSK+CLIH+ Sbjct: 309 QFIAGLADNLGLENTRAARIVSASVAARTRSCFLQAWALEMQGRHSEAVVELSKICLIHQ 368 Query: 241 VFXXXXXXXXXXMVARGLENHLSLEQRELILESLFTICGEETHRSLVEALGL 86 +F MVARGLE L EQRE ++ L CGEE HRS EALGL Sbjct: 369 IFPPEESSPEMEMVARGLEKQLKYEQREFLMNMLLAGCGEECHRSAAEALGL 420 >ref|XP_004247979.1| PREDICTED: uncharacterized protein LOC101254735 [Solanum lycopersicum] Length = 458 Score = 307 bits (786), Expect = 2e-80 Identities = 181/415 (43%), Positives = 253/415 (60%), Gaps = 21/415 (5%) Frame = -2 Query: 1249 RRHYRRR----LLKYSPTPNSTPTIFKPSDDTLQITLR----PSNSLKQLLDLSEIKLNR 1094 RRH RRR L K+SP TP PSD L L 
P+ S + DL +KL Sbjct: 41 RRHLRRRRFPFLKKFSP--EDTP----PSDQNLHFVLTVDNLPTKSFYSIKDLIHLKLRE 94 Query: 1093 LIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQALFKRRR- 917 + A +DL++L+ +D+ G V SC RSTV+F+A L +S+ ++IF +A+ R Sbjct: 95 FLHSGRAAIEDLQTLIRIDTDAGRVSFSCTRSTVKFLATLLVSTFLLIFTLRAILNLVRR 154 Query: 916 ------SDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDYTD--EKKAKITR 761 +++ LVYKRDRSLGGREVLV K E +K L ++ + I+ Sbjct: 155 IPLNTGNNNVELVYKRDRSLGGREVLVAKNETPTLDRKKPNVLDRDEGNSNWDLDTPISF 214 Query: 760 LRNRRK----EELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISEN 593 R R+K E+LP+WWP + SG + N+EEYQRMA+RLIRA++DN+M+GKDI + Sbjct: 215 SRRRKKKSSVEQLPKWWP-VSTSGSDQVGTENQEEYQRMADRLIRAILDNRMTGKDILAD 273 Query: 592 DIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFI 413 DIIQLR I + V+ S +T NARD+L+R A++ +L+YCE A++S V I+GE+A+ F+ Sbjct: 274 DIIQLRRIGRISNVKVSFDTENARDTLFRVAVDFILNYCESTASQSAFVLIDGEEAQNFV 333 Query: 412 AGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFX 233 AGLADN+GLE+ R LQAWALE+Q KHSEA+ EL K+C+IH++F Sbjct: 334 AGLADNVGLESTRAARMVSAAVAARTRSRFLQAWALEIQGKHSEAVVELFKICVIHQIFP 393 Query: 232 XXXXXXXXXMVARGLENHLSLEQRELILESLFTICGEETHRSLVEALGLVGIKDS 68 MVARGLE HL ++QRE ++ SL +CG+ET RS+ EALGL+ +K + Sbjct: 394 PEEFSPEMEMVARGLEKHLKVDQRESLMNSLLQVCGDETRRSVAEALGLMYMKSN 448 >ref|XP_007026507.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508715112|gb|EOY07009.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 444 Score = 302 bits (774), Expect = 4e-79 Identities = 188/437 (43%), Positives = 258/437 (59%), Gaps = 17/437 (3%) Frame = -2 Query: 1342 IPTFKPTHSILMYLYYPERCR-FRPRIINSAVRRHYRRRLLKYSPTPNSTPTI-----FK 1181 +P P+ S ++L+ + + + P++ S RR R RL + N ++ F+ Sbjct: 11 LPLRSPSPSPPLFLFGSTQLKTWSPQLSFSTPRRSRRSRLPRNPNYDNHNLSLRRSIEFQ 70 Query: 1180 PSDDTLQITLRPSNSLKQLLDLSEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRR 1001 S D + L Q+ LS KLNRLI S DAF DL++LV +D T + +SCR+ Sbjct: 71 NSPDNPNVKL--VLDFDQISSLSSSKLNRLISFSTDAFQDLRNLVQIDPDTRTLQLSCRK 128 Query: 1000 
STVEFMAVLCMSSLVIIFIFQALFK-------RRRSDSEVLVYKRDRSLGGREVLVG-KR 845 ST++F+A VI+F F L K R R +V+V +RDRSLGGREV+VG KR Sbjct: 129 STLQFLAAFLTCGFVIVFAFTVLVKLGLGLKARFRPKHKVIV-RRDRSLGGREVIVGTKR 187 Query: 844 EENWPTTRKT--TPLS-SNDYTDEKKAKITRLRNRRKEELPQWWPQLLNSGLNLNEMINR 674 + P + + PLS S K RL+ + ++LP+WWP++ +S + N Sbjct: 188 DGGDPPSFRALDNPLSLSTARPLSTKTNYPRLQVQLGDKLPKWWPEM-DSVPKEGSVFNS 246 Query: 673 EEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAIN 494 E YQ ANRLIRA++D+++ GKDI+E DIIQLR IC+T GVR SI+T N RDS YR ++ Sbjct: 247 EYYQTQANRLIRAIIDSRLGGKDITEEDIIQLRQICRTSGVRVSIDTTNTRDSFYRVSVE 306 Query: 493 LVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQA 314 LVL+ C + ++ST VQI+GEDARQF+AGLA+NIGL+N LQA Sbjct: 307 LVLNVCCRVPSQSTHVQIDGEDARQFLAGLAENIGLDNTRAARMVSAGVAARTRFIFLQA 366 Query: 313 WALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLFT 134 WA E+Q KHSEA+ ELSK+CL+HR+F MVARGLE L +EQREL++ L Sbjct: 367 WAFEMQGKHSEAMLELSKICLVHRIFPPEESSPEMEMVARGLEKLLKVEQRELLMGMLVG 426 Query: 133 ICGEETHRSLVEALGLV 83 +C E+ RS EALGLV Sbjct: 427 VCSGESRRSAAEALGLV 443 >ref|XP_012071565.1| PREDICTED: uncharacterized protein LOC105633555 [Jatropha curcas] gi|643731431|gb|KDP38719.1| hypothetical protein JCGZ_04072 [Jatropha curcas] Length = 451 Score = 273 bits (699), Expect = 2e-70 Identities = 165/391 (42%), Positives = 228/391 (58%), Gaps = 26/391 (6%) Frame = -2 Query: 1156 TLRPSNSLKQLLDLSEI------KLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRST 995 T SLK +LD+ +I KL+R + + DA+ DLK+L++VD +V SCRRST Sbjct: 60 TTSDQRSLKLVLDVDQISYLTSSKLHRFLSLTEDAYYDLKTLITVDQNNR-IVFSCRRST 118 Query: 994 VEFMAVLCMSSLVIIFIFQALFK-------RRRSDSEVLVYKRDRSLGGREVLVGKR-EE 839 ++F + + V + + L R R+ ++ +V +RDRSLGGREV+VG R E Sbjct: 119 IQFTGAVLLCGFVAVSAIRLLINLGLGIRSRFRASNQNVVVRRDRSLGGREVVVGTRVNE 178 Query: 838 NWPTTRKT-----TPLSSNDY---TDEKKAKITRLRNRRKEELPQWWPQLLNSGLNLNEM 683 R++ TPLS + ++ K R RR+E+LP+WWP + + +L + Sbjct: 179 RQEVKRQSSGALDTPLSPPSWAFGSELGKDDWRSYRVRREEKLPKWWPVSVATDQDL--V 236 Query: 682 
INREEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRA 503 +N+EEYQR ANRLIRA+ D + SG+D++ DIIQLR IC+T GV S +T N RD++YRA Sbjct: 237 VNKEEYQREANRLIRAITDYRTSGRDVTAYDIIQLRRICRTSGVHVSFDTTNTRDAVYRA 296 Query: 502 AINLVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRI 323 ++N VL C + QI+GEDA+ FI GLA NIGLEN Sbjct: 297 SVNYVLDLCSSDPSYYALNQIDGEDAQHFIVGLAKNIGLENIRAARMVSAAVAARTRSCF 356 Query: 322 LQAWALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILES 143 LQAWALEVQ KHSEA ELSK+CL+ + F MVARGL HL LEQRE ++ Sbjct: 357 LQAWALEVQGKHSEAALELSKICLVLQTFPPEESSPEMEMVARGLAKHLKLEQRERLMNM 416 Query: 142 LFTICGEETHRSLVEALGLV----GIKDSQD 62 ++C EE+HRS +ALGL+ G+ D Q+ Sbjct: 417 FISVCSEESHRSAADALGLMLSPRGVGDQQE 447 >ref|XP_010525850.1| PREDICTED: uncharacterized protein LOC104803577 isoform X2 [Tarenaya hassleriana] Length = 427 Score = 273 bits (698), Expect = 3e-70 Identities = 159/411 (38%), Positives = 239/411 (58%), Gaps = 7/411 (1%) Frame = -2 Query: 1294 PERCRFRPRIINSAVRRHYRRRLLKYSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLLDL 1115 P+R R RPR H L T +S D +L + L + ++ L Sbjct: 31 PKRRRSRPRPRRRRASDHGGDGSLLSLSTSSS-------EDQSLSLVL----DVHRISTL 79 Query: 1114 SEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQA 935 + + + L+D DAF DL+SLVS+D +V+SCR+ST++F+ L ++ V +F +A Sbjct: 80 ASSRFHWLLDSGRDAFSDLQSLVSLDDNRR-LVVSCRKSTMQFIGGLVVTGFVFVFAVRA 138 Query: 934 L------FKRRRSDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDYTDEKKA 773 L F+ LV +RDRSLGGREV+V P+ + + S+ + + Sbjct: 139 LVNLGSLFRSSFESKPKLVVRRDRSLGGREVVVAVETSRAPSRDTRSSMPSSGHVSRRNT 198 Query: 772 KITRLRNRRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISEN 593 + R +++LP+WWP L S + +++E+YQR ANRL+RAM+D+++SGKDI E+ Sbjct: 199 SPSSFSLRAQQKLPKWWPTSLTSQ---SWDVDKEDYQREANRLVRAMVDDRISGKDIMED 255 Query: 592 DIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFI 413 DII+LR +C+ G++ SIE AN RDS YR +++ VL+ C +++ST+V+I+ EDAR FI Sbjct: 256 DIIRLRRLCRIAGIQVSIEPANTRDSFYRTSVDFVLNVCSRASSESTAVEIDSEDARDFI 315 Query: 412 
AGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFX 233 AGL++N+ LE LQAWALE+Q KHSE++ ELSK+C++HR+F Sbjct: 316 AGLSENVELEKTDAARMVSATVAARTRSWFLQAWALEIQGKHSESVAELSKICVVHRIFP 375 Query: 232 XXXXXXXXXMVARGLENHLSLEQRELILESLFTI-CGEETHRSLVEALGLV 83 MVARGLE + LE+R+ +L+ I C E++HRS EALGLV Sbjct: 376 PDESSAEMEMVARGLEKLMKLEERQTLLKKFIGICCSEDSHRSAAEALGLV 426 >ref|XP_010525843.1| PREDICTED: uncharacterized protein LOC104803577 isoform X1 [Tarenaya hassleriana] Length = 428 Score = 273 bits (698), Expect = 3e-70 Identities = 159/411 (38%), Positives = 239/411 (58%), Gaps = 7/411 (1%) Frame = -2 Query: 1294 PERCRFRPRIINSAVRRHYRRRLLKYSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLLDL 1115 P+R R RPR H L T +S D +L + L + ++ L Sbjct: 31 PKRRRSRPRPRRRRASDHGGDGSLLSLSTSSS-------EDQSLSLVL----DVHRISTL 79 Query: 1114 SEIKLNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQA 935 + + + L+D DAF DL+SLVS+D +V+SCR+ST++F+ L ++ V +F +A Sbjct: 80 ASSRFHWLLDSGRDAFSDLQSLVSLDDNRR-LVVSCRKSTMQFIGGLVVTGFVFVFAVRA 138 Query: 934 L------FKRRRSDSEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDYTDEKKA 773 L F+ LV +RDRSLGGREV+V P+ + + S+ + + Sbjct: 139 LVNLGSLFRSSFESKPKLVVRRDRSLGGREVVVAVETSRAPSRDTRSSMPSSGHVSRRNT 198 Query: 772 KITRLRNRRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISEN 593 + R +++LP+WWP L S + +++E+YQR ANRL+RAM+D+++SGKDI E+ Sbjct: 199 SPSSFSLRAQQKLPKWWPTSLTSQ---SWDVDKEDYQREANRLVRAMVDDRISGKDIMED 255 Query: 592 DIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFI 413 DII+LR +C+ G++ SIE AN RDS YR +++ VL+ C +++ST+V+I+ EDAR FI Sbjct: 256 DIIRLRRLCRIAGIQVSIEPANTRDSFYRTSVDFVLNVCSRASSESTAVEIDSEDARDFI 315 Query: 412 AGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFX 233 AGL++N+ LE LQAWALE+Q KHSE++ ELSK+C++HR+F Sbjct: 316 AGLSENVELEKTDAARMVSATVAARTRSWFLQAWALEIQGKHSESVAELSKICVVHRIFP 375 Query: 232 XXXXXXXXXMVARGLENHLSLEQRELILESLFTI-CGEETHRSLVEALGLV 83 MVARGLE + LE+R+ +L+ I C E++HRS EALGLV Sbjct: 376 PDESSAEMEMVARGLEKLMKLEERQTLLKKFIGICCSEDSHRSAAEALGLV 426 >ref|XP_010241508.1| PREDICTED: uncharacterized 
protein LOC104586088 isoform X1 [Nelumbo nucifera] Length = 444 Score = 271 bits (694), Expect = 8e-70 Identities = 169/399 (42%), Positives = 233/399 (58%), Gaps = 14/399 (3%) Frame = -2 Query: 1237 RRRLLKYSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLLDLSEIKLNRLIDCSLDAFDDL 1058 RRR K P S I + D +I + S SL++++ SE++L+R + +A DL Sbjct: 42 RRRKPKTKTKPASNEKI-EMVIDIEEIANQASTSLRRIIRSSEVRLHRFVSSGKEAIRDL 100 Query: 1057 KSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQALF-------KRRRSDSEVL 899 ++LV +DS +VISCRRS++ F+A + S VI+F + L R L Sbjct: 101 QALVMIDSDRR-IVISCRRSSLLFLANFVLWSCVIVFSVRVLVDLGFRFGSRLGFGYGSL 159 Query: 898 VYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDYTDEKK--AKITRLRNR-----RKE 740 +++RDRSLGGREV+VG R +K +S N + + K+ ++ + R++ Sbjct: 160 IWRRDRSLGGREVVVGGRFRGSEERKKNLSVSVNPLSPARVMVTKVEEMQPQKRVTVREK 219 Query: 739 ELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKT 560 +LP WWP L S M+N+EE QR ANR+IRA+MDNKMSG+D E D++ LR ICKT Sbjct: 220 KLPSWWPVSLPSP---TLMVNKEELQREANRIIRAIMDNKMSGRDFMEEDVMHLRQICKT 276 Query: 559 FGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLEN 380 G R S+ETANAR S YR ++ LVL+ C I + VQ+ GEDARQF+AGLADNIGLE+ Sbjct: 277 SGARVSMETANARSSFYRTSVELVLNTC-ISSMSYKPVQMGGEDARQFVAGLADNIGLED 335 Query: 379 XXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMV 200 R LQAWA E+Q HSEA+ ELS +CLIH++F MV Sbjct: 336 IDAVRIVSATVAARTRSRFLQAWAFEMQGSHSEAMVELSGICLIHQIFPPEESSPEMDMV 395 Query: 199 ARGLENHLSLEQRELILESLFTICGEETHRSLVEALGLV 83 ARGL+ L +QR+ +L L +CG ++ RS EALGLV Sbjct: 396 ARGLKKQLREDQRKFLLNLLVGVCGAKSRRSAAEALGLV 434 >ref|XP_002309823.1| hypothetical protein POPTR_0007s02350g [Populus trichocarpa] gi|222852726|gb|EEE90273.1| hypothetical protein POPTR_0007s02350g [Populus trichocarpa] Length = 447 Score = 270 bits (691), Expect = 2e-69 Identities = 171/447 (38%), Positives = 254/447 (56%), Gaps = 24/447 (5%) Frame = -2 Query: 1351 LSVIPTFKPTHSILMYLYYPERCRFRPRIINSAVRRHYRRRLLKYSPTPNSTPTIFKPSD 1172 ++ +P P H + P + P + + +R R+ + PN ++ D Sbjct: 4 
INTLPYSSPPH----FFPKPSSSLYTPPQNSFSTKRRRSRKSKTLTNNPNKPSSL----D 55 Query: 1171 DTLQITLRP-SNSLKQLLDLSEI------KLNRLIDCSLDAFDDLKSLVSVDSGTGGVVI 1013 ITL S +LK +L++++I + ++ + +A DDLK+LVS+D VV+ Sbjct: 56 SDYYITLNNNSQNLKLVLNITQISKLPSSRFHQFLSLGQEAVDDLKTLVSLDENNR-VVL 114 Query: 1012 SCRRSTVEFMAVLCMSSLVIIFIFQALFK-----RRR---SDSEVLVYKRDRSLGGREVL 857 SC++ST++F + +S ++I + LFK +R+ + V +RDRSLGG+EV+ Sbjct: 115 SCQKSTLQFAGTVLLSGFLLISSIRVLFKLGLGFKRKFGAGKNPNFVVRRDRSLGGKEVI 174 Query: 856 VG----KREENWPTTRKTTPLSSNDYTDE---KKAKITRLRNRRKEELPQWWPQLLNSGL 698 V +REE+ R P+ + D ++ TR R +++LP+WWP +SG Sbjct: 175 VAVDDQQREESKRPKRLANPVEISGLVDGLGFERGDWTRYRVGSQQKLPKWWP---DSGS 231 Query: 697 NLNEMI--NREEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKTFGVRASIETANA 524 ++ ++EEYQR ANRLIRA+ D + GKD+ E+DIIQLR IC+T GVRAS T N Sbjct: 232 FSGRVVGPDQEEYQREANRLIRAITDYRTRGKDVMEHDIIQLRRICRTSGVRASFSTTNT 291 Query: 523 RDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLENXXXXXXXXXXXX 344 RD+ YRA+I++VL+ C + STSV+I GED R FIAGLA+NIGLE+ Sbjct: 292 RDAFYRASIDVVLNVCSSAPSYSTSVEIAGEDPRHFIAGLAENIGLESIRAARMVSAAVA 351 Query: 343 XXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQ 164 LQAWALEVQ KHSEA++ELSK+CL+ + F MVARGL +L +EQ Sbjct: 352 ARTRSCFLQAWALEVQGKHSEAVYELSKICLVLQTFPPEESSPEMEMVARGLARNLKVEQ 411 Query: 163 RELILESLFTICGEETHRSLVEALGLV 83 REL++ +C EE+ RS +ALGL+ Sbjct: 412 RELLMNMFMGVCSEESQRSAADALGLM 438 >ref|XP_010241509.1| PREDICTED: uncharacterized protein LOC104586088 isoform X2 [Nelumbo nucifera] Length = 436 Score = 270 bits (690), Expect = 2e-69 Identities = 168/398 (42%), Positives = 232/398 (58%), Gaps = 14/398 (3%) Frame = -2 Query: 1237 RRRLLKYSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLLDLSEIKLNRLIDCSLDAFDDL 1058 RRR K P S I + D +I + S SL++++ SE++L+R + +A DL Sbjct: 42 RRRKPKTKTKPASNEKI-EMVIDIEEIANQASTSLRRIIRSSEVRLHRFVSSGKEAIRDL 100 Query: 1057 KSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQALF-------KRRRSDSEVL 899 ++LV +DS +VISCRRS++ F+A + S VI+F + L R L Sbjct: 101 QALVMIDSDRR-IVISCRRSSLLFLANFVLWSCVIVFSVRVLVDLGFRFGSRLGFGYGSL 159 Query: 898 
VYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDYTDEKK--AKITRLRNR-----RKE 740 +++RDRSLGGREV+VG R +K +S N + + K+ ++ + R++ Sbjct: 160 IWRRDRSLGGREVVVGGRFRGSEERKKNLSVSVNPLSPARVMVTKVEEMQPQKRVTVREK 219 Query: 739 ELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKT 560 +LP WWP L S M+N+EE QR ANR+IRA+MDNKMSG+D E D++ LR ICKT Sbjct: 220 KLPSWWPVSLPSP---TLMVNKEELQREANRIIRAIMDNKMSGRDFMEEDVMHLRQICKT 276 Query: 559 FGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLEN 380 G R S+ETANAR S YR ++ LVL+ C I + VQ+ GEDARQF+AGLADNIGLE+ Sbjct: 277 SGARVSMETANARSSFYRTSVELVLNTC-ISSMSYKPVQMGGEDARQFVAGLADNIGLED 335 Query: 379 XXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMV 200 R LQAWA E+Q HSEA+ ELS +CLIH++F MV Sbjct: 336 IDAVRIVSATVAARTRSRFLQAWAFEMQGSHSEAMVELSGICLIHQIFPPEESSPEMDMV 395 Query: 199 ARGLENHLSLEQRELILESLFTICGEETHRSLVEALGL 86 ARGL+ L +QR+ +L L +CG ++ RS EALGL Sbjct: 396 ARGLKKQLREDQRKFLLNLLVGVCGAKSRRSAAEALGL 433 >ref|XP_004296731.1| PREDICTED: uncharacterized protein LOC101297340 [Fragaria vesca subsp. 
vesca] Length = 430 Score = 270 bits (689), Expect = 3e-69 Identities = 166/408 (40%), Positives = 231/408 (56%), Gaps = 13/408 (3%) Frame = -2 Query: 1249 RRHYRRRLLKYSPTPNSTPTIFKPSD-DTLQITLRPSNSLKQLLDLSEIKLNRLIDCSLD 1073 RR+ RR + P S P + SD + LQ T L L S L + + D Sbjct: 37 RRNRRRNPNTPTTVPTSKPAFYTSSDPENLQATF----DLNTLYYSSHSYLRYFLSSASD 92 Query: 1072 AFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQALFKRRRSD------ 911 A +DL++LVSVD+ +V+SCR ST+ F+ +++ ++ F+ L R Sbjct: 93 AVEDLQTLVSVDADRR-IVVSCRPSTLRFVGNFAVATCAVVLGFRVLVGLVRLGFGSGSG 151 Query: 910 --SEVLVYKRDRSLGGREVLVGKREENWPTTRKTTPLSSNDYTDEKKAKITRLRNRRKEE 737 E +V +RDRSLGG+EV+V + E P + + T ++++ + R R E+ Sbjct: 152 YGREKVVTRRDRSLGGKEVVVARVER--PRAEEVS------VTKKRESVFKKNRVRFGEK 203 Query: 736 LPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISENDIIQLRHICKTF 557 LPQWWP + + ++ EE+QR ANRL+RA+ DN+MSGKDI E+DII LR IC+ + Sbjct: 204 LPQWWPTTTSQPIL---GVDNEEHQREANRLVRAITDNRMSGKDIMEDDIIHLRQICRVY 260 Query: 556 GVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFIAGLADNIGLENX 377 GVR S +T N RDSLYR +++ VL+ C + S V+I GEDARQFIAGLA+NIGLEN Sbjct: 261 GVRVSFDTTNTRDSLYRVSVDFVLNVCARAPSHSNGVEIEGEDARQFIAGLAENIGLENV 320 Query: 376 XXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFXXXXXXXXXXMVA 197 LQAWAL +Q KH+EA+ ELSK+CL+ R+F MVA Sbjct: 321 RAGRIVSAAVAARTRSCFLQAWALVMQGKHAEAVVELSKICLVWRIFPPEESSPEMEMVA 380 Query: 196 RGLENHLSLEQRELILESLFTICGEETHRSLVEALGLV----GIKDSQ 65 RGLE HL ++QRE ++ L IC EE+ + EALGLV G+ D Q Sbjct: 381 RGLEKHLKMDQREFLMSMLVGICSEESQKRAAEALGLVSSFKGVGDEQ 428 >gb|KHG01403.1| Phosphoribosylformylglycinamidine synthase [Gossypium arboreum] Length = 447 Score = 266 bits (679), Expect = 4e-68 Identities = 174/422 (41%), Positives = 236/422 (55%), Gaps = 18/422 (4%) Frame = -2 Query: 1294 PERCRFRPRIINSAVRRHYRRRLLK--YSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLL 1121 P+ RF P++ S RR R R + ++ + NS K + D P+ LK LL Sbjct: 29 PKPYRF-PQLSFSTPRRPRRSRSSRSPWNHSHNSHSLSLKRTIDFESSADNPN--LKLLL 85 Query: 1120 DLSEIK----LNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVI 953 I +R + S DAF 
DL V +D+ T SCR+ST++F+A + ++ Sbjct: 86 HFDPISPLSSFDRFVSLSSDAFQDLLHSVHIDTQTRTFRFSCRKSTLQFLAGFLVCGFLV 145 Query: 952 IFIFQALF------KRRRSDSEVLVYKRDRSLGGREVLVGK-REENWPTTRKTT---PLS 803 F F+ F K R S + ++ +RDRSLGG+EV+VG R+ + P T + PLS Sbjct: 146 AFAFRVCFNLGLAFKARFSPKQKVIVRRDRSLGGKEVIVGTTRDHHNPRTNSSALDNPLS 205 Query: 802 SNDYTDEKKAKITRLRNRRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDN 623 + K R + ELP+WWPQ L N + + E YQ ANRLI+A++DN Sbjct: 206 LSATPPNLANKTHYPRLHVRHELPKWWPQRLPQR-NTASVFDSEYYQTKANRLIKAIIDN 264 Query: 622 KMSGKDISENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQ 443 ++ GKD +E DIIQLR IC+ GVR SI+T N RDSLYRAA+ LVL+ C + ST+VQ Sbjct: 265 RLGGKDFAEEDIIQLRQICRASGVRVSIDTTNTRDSLYRAAVELVLNVCCRASISSTNVQ 324 Query: 442 INGEDARQFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELS 263 I+GEDAR+F+AGLA+NIGL++ LQAWA E+Q KH+EA+ ELS Sbjct: 325 IDGEDAREFLAGLAENIGLDSIRASRMVSAGVAARTRFCFLQAWAFEMQGKHTEAVSELS 384 Query: 262 KVCLIHRVFXXXXXXXXXXMVARGLENHLSLEQRELILESLF--TICGEETHRSLVEALG 89 K+CLIH +F MVARGLE L +EQREL++ + C EE S EALG Sbjct: 385 KICLIHGIFPPGKSSPEMEMVARGLEKILKVEQRELLMAMVVGNCNCSEEIRTSAAEALG 444 Query: 88 LV 83 LV Sbjct: 445 LV 446 >ref|XP_010667195.1| PREDICTED: uncharacterized protein LOC104884272 isoform X1 [Beta vulgaris subsp. vulgaris] gi|870842028|gb|KMS95546.1| hypothetical protein BVRB_007300 [Beta vulgaris subsp. 
vulgaris] Length = 424 Score = 265 bits (678), Expect = 6e-68 Identities = 162/409 (39%), Positives = 239/409 (58%), Gaps = 12/409 (2%) Frame = -2 Query: 1273 PRIINSAVRRHYRRRLLKYSPTPNSTPTIFKPSD--DTLQITLRPSN-SLKQLLDLSEIK 1103 P ++S R RRR+ + +P + P+ SD + LQ L + L+L E K Sbjct: 27 PLSLSSFSPRRRRRRITRKNPFKRADPSHSSSSDQHNRLQFVLDVDQLKTRTPLNLWESK 86 Query: 1102 LNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQAL--- 932 N+ + ++A++DL++L+ V+ G+ +V+SC ST+ F+ + S+V + + L Sbjct: 87 FNQFVSSGIEAYNDLRNLIIVEPGSNRIVVSCSESTIRFVGGFVIWSIVSVVFVRVLVGL 146 Query: 931 ---FKRRRSDSEVLVYKR-DRSLGGREVLVGKREENWPTTRKTTPLSSNDYTDEKKAKIT 764 F+RR +V V KR DRSLGGREV+V +R K S+ T+E+ ++ Sbjct: 147 GLGFRRRVGVMKVEVVKRRDRSLGGREVVVERR------VVKGGERKSDIETNERDLEVM 200 Query: 763 RLRNRRKEE--LPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDISEND 590 + RKE+ LP WWP G ++NREE++R A+ L++A+MD+K+ GKD+SE D Sbjct: 201 PKSSMRKEQRKLPSWWPVF---GPRPALVLNREEFKRQADELVQAIMDDKLRGKDVSEED 257 Query: 589 IIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDARQFIA 410 I++L IC+ GV+ S T NARDS YR A++ V++ C +S SVQI+GEDAR F+A Sbjct: 258 ILELHRICRMSGVQLSFGTENARDSFYRLAVHNVINTC--CRARSPSVQIDGEDARLFVA 315 Query: 409 GLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHRVFXX 230 GLA ++GL + LQAWALE+Q KHSEA+ EL K+CLIH++F Sbjct: 316 GLAYDVGLSDPRAVTIVSAAVAAQTRQWFLQAWALEMQAKHSEAMEELKKICLIHQIFPP 375 Query: 229 XXXXXXXXMVARGLENHLSLEQRELILESLFTICGEETHRSLVEALGLV 83 MVARGL+ HL LE RE +L ++ ++CGEE+ RS EALGLV Sbjct: 376 EPSSPEMEMVARGLQKHLKLEHREFLLTNIVSVCGEESPRSAAEALGLV 424 >ref|XP_012460306.1| PREDICTED: uncharacterized protein LOC105780484 [Gossypium raimondii] gi|763810186|gb|KJB77088.1| hypothetical protein B456_012G119900 [Gossypium raimondii] Length = 447 Score = 265 bits (677), Expect = 7e-68 Identities = 171/415 (41%), Positives = 232/415 (55%), Gaps = 18/415 (4%) Frame = -2 Query: 1273 PRIINSAVRRHYRRRLLK--YSPTPNSTPTIFKPSDDTLQITLRPSNSLKQLLDLSEIK- 1103 P++ S RR R R + ++ + NS K + D P+ LK LL I Sbjct: 35 
PQLSFSTPRRPRRSRSSRSPWNHSHNSHSLSLKRTIDFESSADNPN--LKLLLHFDPISP 92 Query: 1102 ---LNRLIDCSLDAFDDLKSLVSVDSGTGGVVISCRRSTVEFMAVLCMSSLVIIFIFQAL 932 +R + S DAF DL V +D+ T SCR+ST++F+A + ++ F F+ Sbjct: 93 LSSFDRFVSFSSDAFQDLLHSVHIDTQTRTFRFSCRKSTLQFLAGFLVCGFLVAFAFRVC 152 Query: 931 F------KRRRSDSEVLVYKRDRSLGGREVLVGK-REENWPTTRKTT---PLSSNDYTDE 782 F K R S + ++ +RDRSLGG+EV+VG R+ + P T + PLS + Sbjct: 153 FRLGLAFKARFSPKQKVIVRRDRSLGGKEVIVGTTRDHHHPRTNSSALDNPLSLSATPPN 212 Query: 781 KKAKITRLRNRRKEELPQWWPQLLNSGLNLNEMINREEYQRMANRLIRAMMDNKMSGKDI 602 K R + ELP+WWPQ L N + + E YQ ANRLI+A++DN++ GKD Sbjct: 213 LANKTHYPRLHVRHELPKWWPQQLPQR-NTASVFDSEYYQTKANRLIKAIIDNRLGGKDF 271 Query: 601 SENDIIQLRHICKTFGVRASIETANARDSLYRAAINLVLSYCEIIANKSTSVQINGEDAR 422 SE +IIQLR IC+ GV SI+T N RDSLYRAA+ LVL+ C ST+VQI+GEDAR Sbjct: 272 SEENIIQLRQICRASGVCVSIDTTNTRDSLYRAAVELVLNVCCRAPINSTNVQIDGEDAR 331 Query: 421 QFIAGLADNIGLENXXXXXXXXXXXXXXXXXRILQAWALEVQDKHSEALFELSKVCLIHR 242 +F+AGLA+NIGL+N LQAWA E+Q KH+EA+ ELSK+CLIH Sbjct: 332 EFLAGLAENIGLDNIRASRMVSAGVAARTRFCFLQAWAFEMQSKHTEAVSELSKICLIHG 391 Query: 241 VFXXXXXXXXXXMVARGLENHLSLEQRELILESL--FTICGEETHRSLVEALGLV 83 +F MVARGLE L +EQREL++ ++ + C EE S EALGLV Sbjct: 392 IFPPGKSSPEMEMVARGLEKILKVEQRELLMATVVGYCNCSEEIRTSAAEALGLV 446