BLASTX nr result
ID: Forsythia21_contig00014792
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00014792 (2839 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011095124.1| PREDICTED: uncharacterized protein LOC105174...   822   0.0
ref|XP_009763963.1| PREDICTED: uncharacterized protein LOC104215...   743   0.0
ref|XP_012832261.1| PREDICTED: uncharacterized protein LOC105953...   725   0.0
gb|EYU42005.1| hypothetical protein MIMGU_mgv1a002009mg [Erythra...   697   0.0
emb|CBI18050.3| unnamed protein product [Vitis vinifera]              692   0.0
ref|XP_009763964.1| PREDICTED: uncharacterized protein LOC104215...   634   e-178
ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arab...   620   e-174
ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [...   619   e-174
ref|XP_009763966.1| PREDICTED: uncharacterized protein LOC104215...   591   e-166
gb|KDO49671.1| hypothetical protein CISIN_1g002779mg [Citrus sin...   576   e-161
ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citr...   571   e-159
gb|KDO49672.1| hypothetical protein CISIN_1g002779mg [Citrus sin...   568   e-159
gb|KDO49669.1| hypothetical protein CISIN_1g002779mg [Citrus sin...   568   e-159
ref|XP_007033558.1| NT domain of poly(A) polymerase and terminal...   566   e-158
ref|XP_007142048.1| hypothetical protein PHAVU_008G248100g [Phas...   564   e-157
gb|KJB27692.1| hypothetical protein B456_005G005000 [Gossypium r...   560   e-156
ref|XP_012481362.1| PREDICTED: uncharacterized protein LOC105796...   560   e-156
ref|XP_012089694.1| PREDICTED: uncharacterized protein LOC105648...
560 e-156 gb|KHG19864.1| Poly (A) RNA polymerase cid14 [Gossypium arboreum] 559 e-156 gb|KJB27695.1| hypothetical protein B456_005G005100 [Gossypium r... 558 e-155 >ref|XP_011095124.1| PREDICTED: uncharacterized protein LOC105174651 [Sesamum indicum] Length = 873 Score = 822 bits (2123), Expect = 0.0 Identities = 467/874 (53%), Positives = 544/874 (62%), Gaps = 150/874 (17%) Frame = -2 Query: 2457 GQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDY 2278 G AAA R ++S A P P EI + W +++AA II KVQPTT+SEERRR+V+DY Sbjct: 7 GGAAAELRPVASHSTPFAEPNPLEIRGQNWATVDRAAREIIRKVQPTTVSEERRREVVDY 66 Query: 2277 VQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRS 2098 +QRLIRN LG EVFPYGSVPLKTYLPDGDIDLTAFG N ED L DDMKSVLEEEE NR+ Sbjct: 67 IQRLIRNCLGIEVFPYGSVPLKTYLPDGDIDLTAFGVTNDEDALADDMKSVLEEEEKNRA 126 Query: 2097 AEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSI 1918 AEF+VK+VQLI AEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLE+VDRLIGK+HLFKRSI Sbjct: 127 AEFIVKDVQLIRAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEQVDRLIGKDHLFKRSI 186 Query: 1917 ILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDW 1738 ILIKAWCYYESRILGAHHGLISTYALETLVLYIF LFHSTLDGPLAVLYKFLDYFSKFDW Sbjct: 187 ILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSTLDGPLAVLYKFLDYFSKFDW 246 Query: 1737 ETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQ 1558 ETYCISLNGPVR+SSLP IVAE PED DLLL+ DFL SC+ MFSVPSR D+ SRGFQ Sbjct: 247 ETYCISLNGPVRLSSLPAIVAEMPEDSDRDLLLSSDFLSSCIGMFSVPSRGGDKNSRGFQ 306 Query: 1557 QKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSS 1378 +KHLNIVDPLK+INNLGRS+SKGNFYRIRSAFSYGA+KL RIL QPE+ IA EL FFS+ Sbjct: 307 RKHLNIVDPLKEINNLGRSVSKGNFYRIRSAFSYGARKLARILTQPEDSIATELHKFFSN 366 Query: 1377 TMQRHGCGQKPVVQD--------------PYPQSTYN--------------GFIPVSSSL 1282 TM RHG G++P VQD P P++ + F P S Sbjct: 367 TMARHGGGKRPDVQDFDPSVICNRPISAMPVPEAGLSKTDNLNEYIDEHAGDFQPSSGKF 426 Query: 1281 GTDSYK-----------------------------LENSDNAAGHRFIGDANDLATPSIA 1189 D K + NA G RF GD+NDLAT S+ Sbjct: 427 
SQDLLKGTERKSDVANGEPYSSLVLKHPTLLLDRDQPSEPNALGSRFHGDSNDLATSSLG 486 Query: 1188 GLNISSDSP--------------------------KFRNMRNENPNSDQWGKCEKNV--- 1096 L IS+ S K ++MR+ P+SD+ C K+ Sbjct: 487 ELKISAGSSTRQTPVMKESVTAIAKPYHAPHLYFSKSKSMRDREPDSDKQDNCGKSTSSL 546 Query: 1095 --------------------------------SSEVLPE--------DSDHTNWDQXXXX 1036 S +V P D +H + D Sbjct: 547 VSSGSDEGRDDAVGSMDENQFVDKDEAVASSKSKDVFPAPKSLSFSGDQNHMDSDHGSTR 606 Query: 1035 XXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQ-NKTPWNG 859 Y+SY +CLQYGRWCYEY + SL M +P YQ + +PW+G Sbjct: 607 TSERPEALNSSDLTGD-YDSYLHCLQYGRWCYEYALTIHSLHMPHLPTAPYQGSNSPWDG 665 Query: 858 VIYPSQLRHNGFSHGHRNGFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQS 688 ++ S + NGFSH H NGF P +Y M+ LVPG+ FG EEMPKPRGTGTYFP M+QS Sbjct: 666 LLPLSHFKQNGFSHRHHNGFHPSPAMYTMQPLLVPGVPFGWEEMPKPRGTGTYFPNMSQS 725 Query: 687 PQGYRPSAVKGKNQAPLSSPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQTVNLGSSGI 508 PQGYR S++K +NQAP SPR NGR++IF E N+LDR+SHE SQPQV V+++V + SSG Sbjct: 726 PQGYRSSSMKARNQAPSRSPRTNGRSMIFREPNMLDRSSHELSQPQVRVEKSVMVSSSGN 785 Query: 507 HQSFSPRGNGHVPV--------------------GTFSPEQNRQQRXXXXXXXXXXXXXP 388 H SFSPRGNG+ GT ++NR+QR Sbjct: 786 HPSFSPRGNGYPNANGLSIQHEGVVEFELVGHASGTSESDKNRKQR----SVSGSPKTFS 841 Query: 387 GMQRPKTALSRDLDRVSFKSSYHLKDQDDFPPLS 286 G Q+ + ALSR+ DR+S HLKD+DDFPPLS Sbjct: 842 GTQKSRPALSREQDRISLN---HLKDEDDFPPLS 872 >ref|XP_009763963.1| PREDICTED: uncharacterized protein LOC104215769 isoform X1 [Nicotiana sylvestris] Length = 841 Score = 743 bits (1918), Expect = 0.0 Identities = 426/821 (51%), Positives = 503/821 (61%), Gaps = 114/821 (13%) Frame = -2 Query: 2406 ANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAEVFPYG 2227 +NP S+I PERW EKA II VQPT +SE+RRR VIDYVQRLI LG EVFPYG Sbjct: 37 SNPSVSDIGPERWAKAEKATQNIIRVVQPTAVSEDRRRAVIDYVQRLIGGCLGCEVFPYG 96 Query: 2226 SVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLICAEVKL 2047 SVPLKTYLPDGDIDLTAFGG N ED L +DM SVLE E+ N++AEFVVK+VQ+I AEVKL Sbjct: 97 SVPLKTYLPDGDIDLTAFGGTNFEDALANDMVSVLEAEDQNKAAEFVVKDVQMIRAEVKL 156 Query: 
2046 VKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESRILGAH 1867 VKCIVQ+IVVD+SFNQIGGLCTLCFLE+VDRLIGK+HLFKRSIILIK WCYYESRILGAH Sbjct: 157 VKCIVQNIVVDISFNQIGGLCTLCFLEQVDRLIGKDHLFKRSIILIKTWCYYESRILGAH 216 Query: 1866 HGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVRISSLP 1687 HGLISTYALETLVLYIF FHSTLDGPLAVLYKFLDYFSKFDWE C+SL GPVRISSLP Sbjct: 217 HGLISTYALETLVLYIFHFFHSTLDGPLAVLYKFLDYFSKFDWENCCVSLTGPVRISSLP 276 Query: 1686 VIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKDINNLG 1507 V E PE GDLLL++DF+R C++MFSVPS+ D SR F +KHLNI+DPLK+ NNLG Sbjct: 277 ESVVEMPETDGGDLLLSNDFVRYCLDMFSVPSKGGDSNSRTFLRKHLNIIDPLKENNNLG 336 Query: 1506 RSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPVVQDPY 1327 RS+S+GNF+RIRSAFSYGA+KLG IL+Q E+ IA EL FF +TM RHG G++P VQD Sbjct: 337 RSVSQGNFFRIRSAFSYGARKLGSILIQSEDKIAEELYKFFPNTMDRHGSGERPDVQD-- 394 Query: 1326 PQSTYNGFIPVSSSLGTDSYKLENSDNAA------------------------------- 1240 NGF P S + + ++ + N+A Sbjct: 395 ---MINGFCPASPAPDFEPSRINSDLNSASDSGIFRLNPDESCCREDGHHKSITDSHEKG 451 Query: 1239 ---GHRFIGDANDLATPSIAGLNISSDSPKFRNMRNENPNSDQWG--------------- 1114 G+R GDA DLA+ GL+IS+ P+ + ++ S Sbjct: 452 SPLGYRLSGDAADLASSMENGLSISTHIPQLTDSSSKKCQSTTKAMPYHAPHLYFTNSLV 511 Query: 1113 -----KCEKNVSS-EVLPED-------------------------------SDHTNWDQX 1045 K EK VSS LP S+ NWD Sbjct: 512 CNGEMKNEKRVSSGSSLPTSDEGRDFTVDGLKQTVLDVKEAVSSTPKAYGCSEDLNWD-- 569 Query: 1044 XXXXXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPW 865 DY++YFN LQYGRWCYEY S +LP+ P PP + K W Sbjct: 570 LASTNGAGIPSKALSDLSGDYDNYFNYLQYGRWCYEYAS---NLPVPPAPPSPFHIKYSW 626 Query: 864 NGVIYPSQLRHNGFSHGHRNGFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMN 694 PS ++ NGFSHG NG IP Y + LV G+ + EEMPKPRGTGTYFP +N Sbjct: 627 EAAQQPSYMKRNGFSHGSTNGVIPSQAFYTINPMLVHGMPYALEEMPKPRGTGTYFPNLN 686 Query: 693 QSPQGYRPSAVKGKNQAPLSSPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQTVNLGSS 514 + PQGYRPS VKG++QA L SPR NGR F E + L+R+ HE+ QP+ DQ S Sbjct: 687 RPPQGYRPSMVKGRHQAGLRSPRTNGR-ATFTEMHTLERSFHEQPQPESSADQ------S 739 Query: 513 
GIHQSFSPRGNGH----------------------VPVGTFSPEQNRQQR--XXXXXXXX 406 +H FSPRG GH VP+GT E+ RQ++ Sbjct: 740 DVHPLFSPRGRGHRSSMTALVVQSEGVVEFGSVGLVPLGTSISERTRQEKPVSPPTRQTS 799 Query: 405 XXXXXPGMQRPKTALSRDLDRVSFK-SSYHLKDQDDFPPLS 286 PGMQR + S+DLDR++ K SSYHLKD+DDFPPLS Sbjct: 800 PVSPIPGMQRSNSVFSKDLDRLALKSSSYHLKDEDDFPPLS 840 >ref|XP_012832261.1| PREDICTED: uncharacterized protein LOC105953174 [Erythranthe guttatus] Length = 746 Score = 725 bits (1872), Expect = 0.0 Identities = 413/749 (55%), Positives = 496/749 (66%), Gaps = 41/749 (5%) Frame = -2 Query: 2406 ANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAEVFPYG 2227 A P P I E W ++A II KVQPT +SEE+R+ VI Y+QRLIRN LGAEV PYG Sbjct: 11 AEPNPFGIGTENWAAADRATLEIIRKVQPTPVSEEKRKAVIYYIQRLIRNFLGAEVIPYG 70 Query: 2226 SVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLICAEVKL 2047 SVPLKTYLPDGDIDLTAFGGAN ED L DDMKSVLEEEE N AEFVVK+VQLI AEVKL Sbjct: 71 SVPLKTYLPDGDIDLTAFGGANFEDTLADDMKSVLEEEERNMGAEFVVKDVQLIRAEVKL 130 Query: 2046 VKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESRILGAH 1867 VKCI+QDIVVDVSFNQIGGLCTLCFLE+VDR+IG++HLFKRSIILIKAWCYYESRILGAH Sbjct: 131 VKCIIQDIVVDVSFNQIGGLCTLCFLEQVDRVIGRDHLFKRSIILIKAWCYYESRILGAH 190 Query: 1866 HGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVRISSLP 1687 HGLISTYALETLVLYIF FHSTLDGPLAVLYKFLDYFSKFDW+TYCISLNGP+R+SSLP Sbjct: 191 HGLISTYALETLVLYIFHHFHSTLDGPLAVLYKFLDYFSKFDWDTYCISLNGPIRLSSLP 250 Query: 1686 VIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKDINNLG 1507 I+AE PED GDLLL+ DFL SCV MFSVP R D+ SRGFQ KHLNIVDPLK+ NNLG Sbjct: 251 AIIAEMPEDSDGDLLLSSDFLSSCVGMFSVPCRGNDKNSRGFQTKHLNIVDPLKESNNLG 310 Query: 1506 RSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPVVQ--D 1333 RSISKGNFYRIRSAFSYGA+KL RIL+Q ++ I+ EL FFS+T+ RHG G + + D Sbjct: 311 RSISKGNFYRIRSAFSYGARKLARILVQSDDSISVELHKFFSNTIARHGDGLRHDIHDFD 370 Query: 1332 PYPQSTYNGFIPVSSSLGTDS--YKLENSDNAAG-HRFIGDANDLATPSIAGLNISSDSP 1162 P YN IPV ++ +S +K+EN + G + + DA +A L I SD+P Sbjct: 371 
LDPAIIYNSAIPVPTAPVPESWLHKVENFELLNGAEKKVPDAPSREPLDLAALKI-SDTP 429 Query: 1161 KFR------------NMRNENPNSDQ------------WGKCEKNVSSEVLPEDSDHTNW 1054 + +RN N NSD+ G +K+ +++V + Sbjct: 430 AAKPFFAPHLYFAESRLRNRNANSDKIDSSSFVLSESDEGFVDKDKNTDVFWATNQTNPG 489 Query: 1053 DQXXXXXXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPP--LQYQ 880 DYE+Y LQYGRWCYE+ G+ SLPM P P + +Q Sbjct: 490 QGSSANRRETTESPSSLSDLTGDYETYLKFLQYGRWCYEHGLGIHSLPMPPPLPTTVPFQ 549 Query: 879 NKTPWNGVIYPSQLRHNGFSHGHRNGFIPVYPMRH-----ALVPGIAFGREEMPKPRGTG 715 +G+ S +HNGFSH NGF+P+ P + L+PG+ FG ++ K RGTG Sbjct: 550 GNIFLDGIAPLSHYKHNGFSHRLHNGFLPIPPALYPVPPPVLMPGVTFGWDDASKARGTG 609 Query: 714 TYFPKMNQSPQGYRPSAVKGKNQ-APLSSPRDNGRNVIFMETNLLDRNSHERSQP-QVLV 541 TYFP MNQ P GYR S++ G+NQ A +P +GR++IFME N+LDR+++E SQ QV + Sbjct: 610 TYFPNMNQPPLGYRSSSMNGRNQVAATRAPHMDGRSMIFMEGNMLDRSNNEVSQQNQVPI 669 Query: 540 DQTVNLGSSGIHQSFSP-RGNGHVPVGTFSPEQNRQQ--RXXXXXXXXXXXXXPGMQRPK 370 + V SFSP GNG PE + + GMQRP+ Sbjct: 670 ENDV--------MSFSPHNGNGLY----VQPEADIDECGLENSGPASSSPRTFSGMQRPQ 717 Query: 369 TALSRDLDRVSFKSSYHLKDQDDFPPLSV 283 SR+ DR+S +SSY LKD++DFPPL V Sbjct: 718 PPFSREQDRISLRSSYILKDEEDFPPLPV 746 >gb|EYU42005.1| hypothetical protein MIMGU_mgv1a002009mg [Erythranthe guttata] Length = 726 Score = 697 bits (1799), Expect = 0.0 Identities = 400/729 (54%), Positives = 479/729 (65%), Gaps = 41/729 (5%) Frame = -2 Query: 2406 ANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAEVFPYG 2227 A P P I E W ++A II KVQPT +SEE+R+ VI Y+QRLIRN LGAEV PYG Sbjct: 11 AEPNPFGIGTENWAAADRATLEIIRKVQPTPVSEEKRKAVIYYIQRLIRNFLGAEVIPYG 70 Query: 2226 SVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLICAEVKL 2047 SVPLKTYLPDGDIDLTAFGGAN ED L DDMKSVLEEEE N AEFVVK+VQLI AEVKL Sbjct: 71 SVPLKTYLPDGDIDLTAFGGANFEDTLADDMKSVLEEEERNMGAEFVVKDVQLIRAEVKL 130 Query: 2046 VKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESRILGAH 1867 VKCI+QDIVVDVSFNQIGGLCTLCFLE+VDR+IG++HLFKRSIILIKAWCYYESRILGAH Sbjct: 131 VKCIIQDIVVDVSFNQIGGLCTLCFLEQVDRVIGRDHLFKRSIILIKAWCYYESRILGAH 190 Query: 1866 
HGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVRISSLP 1687 HGLISTYALETLVLYIF FHSTLDGPLAVLYKFLDYFSKFDW+TYCISLNGP+R+SSLP Sbjct: 191 HGLISTYALETLVLYIFHHFHSTLDGPLAVLYKFLDYFSKFDWDTYCISLNGPIRLSSLP 250 Query: 1686 VIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKDINNLG 1507 I+AE PED GDLLL+ DFL SCV MFSVP R D+ SRGFQ KHLNIVDPLK+ NNLG Sbjct: 251 AIIAEMPEDSDGDLLLSSDFLSSCVGMFSVPCRGNDKNSRGFQTKHLNIVDPLKESNNLG 310 Query: 1506 RSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPVVQ--D 1333 RSISKGNFYRIRSAFSYGA+KL RIL+Q ++ I+ EL FFS+T+ RHG G + + D Sbjct: 311 RSISKGNFYRIRSAFSYGARKLARILVQSDDSISVELHKFFSNTIARHGDGLRHDIHDFD 370 Query: 1332 PYPQSTYNGFIPVSSSLGTDS--YKLENSDNAAG-HRFIGDANDLATPSIAGLNISSDSP 1162 P YN IPV ++ +S +K+EN + G + + DA +A L I SD+P Sbjct: 371 LDPAIIYNSAIPVPTAPVPESWLHKVENFELLNGAEKKVPDAPSREPLDLAALKI-SDTP 429 Query: 1161 KFR------------NMRNENPNSDQ------------WGKCEKNVSSEVLPEDSDHTNW 1054 + +RN N NSD+ G +K+ +++V + Sbjct: 430 AAKPFFAPHLYFAESRLRNRNANSDKIDSSSFVLSESDEGFVDKDKNTDVFWATNQTNPG 489 Query: 1053 DQXXXXXXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPP--LQYQ 880 DYE+Y LQYGRWCYE+ G+ SLPM P P + +Q Sbjct: 490 QGSSANRRETTESPSSLSDLTGDYETYLKFLQYGRWCYEHGLGIHSLPMPPPLPTTVPFQ 549 Query: 879 NKTPWNGVIYPSQLRHNGFSHGHRNGFIPVYPMRH-----ALVPGIAFGREEMPKPRGTG 715 +G+ S +HNGFSH NGF+P+ P + L+PG+ FG ++ K RGTG Sbjct: 550 GNIFLDGIAPLSHYKHNGFSHRLHNGFLPIPPALYPVPPPVLMPGVTFGWDDASKARGTG 609 Query: 714 TYFPKMNQSPQGYRPSAVKGKNQ-APLSSPRDNGRNVIFMETNLLDRNSHERSQP-QVLV 541 TYFP MNQ P GYR S++ G+NQ A +P +GR++IFME N+LDR+++E SQ QV + Sbjct: 610 TYFPNMNQPPLGYRSSSMNGRNQVAATRAPHMDGRSMIFMEGNMLDRSNNEVSQQNQVPI 669 Query: 540 DQTVNLGSSGIHQSFSP-RGNGHVPVGTFSPEQNRQQ--RXXXXXXXXXXXXXPGMQRPK 370 + V SFSP GNG PE + + GMQRP+ Sbjct: 670 ENDV--------MSFSPHNGNGLY----VQPEADIDECGLENSGPASSSPRTFSGMQRPQ 717 Query: 369 TALSRDLDR 343 SR+ DR Sbjct: 718 PPFSREQDR 726 >emb|CBI18050.3| unnamed protein product [Vitis vinifera] Length = 824 Score = 692 bits (1785), Expect = 0.0 Identities = 410/807 (50%), Positives = 498/807 (61%), Gaps 
= 94/807 (11%) Frame = -2 Query: 2421 PLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAE 2242 PL S ++P P I +W E IIC+VQPT +SEERR++V+DYVQ LIR R+G E Sbjct: 22 PLPSLSHPNPPAIGAAQWARAENTVQEIICEVQPTEVSEERRKEVVDYVQGLIRVRVGCE 81 Query: 2241 VFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLIC 2062 VFP+GSVPLKTYLPDGDIDLTAFGG VED L ++ SVLE E+ NR+AEFVVK+VQLI Sbjct: 82 VFPFGSVPLKTYLPDGDIDLTAFGGPAVEDTLAYEVYSVLEAEDQNRAAEFVVKDVQLIH 141 Query: 2061 AEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESR 1882 AEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE++DRLIGK+HLFKRSIILIKAWCYYESR Sbjct: 142 AEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQIDRLIGKDHLFKRSIILIKAWCYYESR 201 Query: 1881 ILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVR 1702 ILGAHHGLISTYALETLVLYIF LFHS L+GPLAVLYKFLDYFSKFDW+ YC+SLNGPVR Sbjct: 202 ILGAHHGLISTYALETLVLYIFLLFHSLLNGPLAVLYKFLDYFSKFDWDNYCVSLNGPVR 261 Query: 1701 ISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKD 1522 ISSLP ++AETPE+ D LL +D LR C++ FSVPSR ++ SR F QKH NIVDPLK+ Sbjct: 262 ISSLPEMIAETPENVGADPLLNNDILRDCLDRFSVPSRGLETNSRTFVQKHFNIVDPLKE 321 Query: 1521 INNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPV 1342 NNLGRS+SKGNFYRIRSAF+YGA+KLGRILLQPE+ I+ EL FF++T++RHG GQ+P Sbjct: 322 NNNLGRSVSKGNFYRIRSAFTYGARKLGRILLQPEDKISEELCKFFTNTLERHGRGQRPD 381 Query: 1341 VQDPYP----QSTYNGFIPVSSSLGTDSYKLENSDNAAGHRFIGDANDLATPSIAGLNI- 1177 V D P +S +G V +S+ +++ N+ +G R GDA DLA+P I G I Sbjct: 382 V-DLIPLDAERSMCDGVNLVPTSMLSEADNSSNAPAVSGFRISGDAKDLASPRIRGPKIS 440 Query: 1176 ---SSDSP------------------------KFRNMRNENPNSDQW-----GKCEKN-- 1099 S SP +N + N N D+ G E+ Sbjct: 441 NDTSKSSPPSGEESVSVLSKKAHFAPHLYFSRSAQNGKERNENLDKKLAGNSGLSEEESS 500 Query: 1098 ----------------------VSSEVLPEDSD--------HT-NWDQXXXXXXXXXXXX 1012 VS++V P S HT NWD+ Sbjct: 501 FVVHHGLNGNQSVNNHELLNSFVSNDVPPGLSPTACSSEYLHTGNWDRPSSGNSGNPEAP 560 Query: 1011 XXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRH 832 DY+S+FN LQYG WCY+Y G P+L M P Q+Q+ W+ + + +R Sbjct: 561 
NSLADLSGDYDSHFNSLQYGWWCYDYIFGAPALSMPVALPSQFQSNNSWDAIQQSAHIRR 620 Query: 831 NGFSHGHRNGFI---PVYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAV 661 N F NG I P YP+ ++ G FG EEMPKPRGTGTYFP N S P Sbjct: 621 NIFPQITANGIIPRPPFYPLNPPMISGTGFGVEEMPKPRGTGTYFP--NTSHHLCNPLTS 678 Query: 660 KGKNQAPLSSPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQ-TVNLGSSGIHQSFSPRG 484 +G+NQAP+ SPR +GR V ETN L+R+S E S Q V Q GS H S SP G Sbjct: 679 RGRNQAPVRSPRHSGRAVTPHETNFLERSSRELSHAQFPVHQGNGKSGSLDSHPSGSPVG 738 Query: 483 ------NGHV----PVGTFS--------PEQNRQQR--XXXXXXXXXXXXXPGMQRPKTA 364 NG + V F PE R+ G QRPK+ Sbjct: 739 RTYSNANGSLLPSEKVVEFGDQASESPLPENIREPNHGSFLPQNSSLSLSPGGAQRPKSM 798 Query: 363 LSRDLDRVSFKSSYHLKDQDDFPPLSV 283 LS + DRV+ + +YHLKD+DDFPPLSV Sbjct: 799 LSMNDDRVAVQ-AYHLKDEDDFPPLSV 824 >ref|XP_009763964.1| PREDICTED: uncharacterized protein LOC104215769 isoform X2 [Nicotiana sylvestris] Length = 707 Score = 634 bits (1635), Expect = e-178 Identities = 355/659 (53%), Positives = 414/659 (62%), Gaps = 89/659 (13%) Frame = -2 Query: 2406 ANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAEVFPYG 2227 +NP S+I PERW EKA II VQPT +SE+RRR VIDYVQRLI LG EVFPYG Sbjct: 37 SNPSVSDIGPERWAKAEKATQNIIRVVQPTAVSEDRRRAVIDYVQRLIGGCLGCEVFPYG 96 Query: 2226 SVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLICAEVKL 2047 SVPLKTYLPDGDIDLTAFGG N ED L +DM SVLE E+ N++AEFVVK+VQ+I AEVKL Sbjct: 97 SVPLKTYLPDGDIDLTAFGGTNFEDALANDMVSVLEAEDQNKAAEFVVKDVQMIRAEVKL 156 Query: 2046 VKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESRILGAH 1867 VKCIVQ+IVVD+SFNQIGGLCTLCFLE+VDRLIGK+HLFKRSIILIK WCYYESRILGAH Sbjct: 157 VKCIVQNIVVDISFNQIGGLCTLCFLEQVDRLIGKDHLFKRSIILIKTWCYYESRILGAH 216 Query: 1866 HGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVRISSLP 1687 HGLISTYALETLVLYIF FHSTLDGPLAVLYKFLDYFSKFDWE C+SL GPVRISSLP Sbjct: 217 HGLISTYALETLVLYIFHFFHSTLDGPLAVLYKFLDYFSKFDWENCCVSLTGPVRISSLP 276 Query: 1686 VIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKDINNLG 1507 V E PE GDLLL++DF+R C++MFSVPS+ D SR F +KHLNI+DPLK+ NNLG Sbjct: 277 
ESVVEMPETDGGDLLLSNDFVRYCLDMFSVPSKGGDSNSRTFLRKHLNIIDPLKENNNLG 336 Query: 1506 RSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPVVQDPY 1327 RS+S+GNF+RIRSAFSYGA+KLG IL+Q E+ IA EL FF +TM RHG G++P VQD Sbjct: 337 RSVSQGNFFRIRSAFSYGARKLGSILIQSEDKIAEELYKFFPNTMDRHGSGERPDVQD-- 394 Query: 1326 PQSTYNGFIPVSSSLGTDSYKLENSDNAA------------------------------- 1240 NGF P S + + ++ + N+A Sbjct: 395 ---MINGFCPASPAPDFEPSRINSDLNSASDSGIFRLNPDESCCREDGHHKSITDSHEKG 451 Query: 1239 ---GHRFIGDANDLATPSIAGLNISSDSPKFRNMRNENPNSDQWG--------------- 1114 G+R GDA DLA+ GL+IS+ P+ + ++ S Sbjct: 452 SPLGYRLSGDAADLASSMENGLSISTHIPQLTDSSSKKCQSTTKAMPYHAPHLYFTNSLV 511 Query: 1113 -----KCEKNVSS-EVLPED-------------------------------SDHTNWDQX 1045 K EK VSS LP S+ NWD Sbjct: 512 CNGEMKNEKRVSSGSSLPTSDEGRDFTVDGLKQTVLDVKEAVSSTPKAYGCSEDLNWD-- 569 Query: 1044 XXXXXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPW 865 DY++YFN LQYGRWCYEY S +LP+ P PP + K W Sbjct: 570 LASTNGAGIPSKALSDLSGDYDNYFNYLQYGRWCYEYAS---NLPVPPAPPSPFHIKYSW 626 Query: 864 NGVIYPSQLRHNGFSHGHRNGFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKM 697 PS ++ NGFSHG NG IP Y + LV G+ + EEMPKPRGTGTYFP + Sbjct: 627 EAAQQPSYMKRNGFSHGSTNGVIPSQAFYTINPMLVHGMPYALEEMPKPRGTGTYFPNL 685 >ref|XP_002876095.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp. lyrata] gi|297321933|gb|EFH52354.1| hypothetical protein ARALYDRAFT_485514 [Arabidopsis lyrata subsp. 
lyrata] Length = 829 Score = 620 bits (1598), Expect = e-174 Identities = 387/846 (45%), Positives = 480/846 (56%), Gaps = 111/846 (13%) Frame = -2 Query: 2493 SSMDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTT 2314 + +DDL E + S +LS PL+ P P +PE W +E+A II +V PT Sbjct: 3 ADLDDLEEESSS--------SLSPPLIPP--PRSPSNQPEFWMRVEEATREIIEQVHPTL 52 Query: 2313 ISEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDM 2134 +SE+RRRDVI YVQ+LIR LG EV +GSVPLKTYLPDGDIDLTAFGG E++L + Sbjct: 53 VSEDRRRDVILYVQKLIRITLGCEVHSFGSVPLKTYLPDGDIDLTAFGGLYHEEELAAKV 112 Query: 2133 KSVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDR 1954 SVLE EE+N S+ FVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQIGG+CTLCFLEK+D Sbjct: 113 FSVLEREEHNVSSHFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQIGGICTLCFLEKIDH 172 Query: 1953 LIGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVL 1774 LIGK+HLFKRSIILIKAWCYYESRILGA HGLISTYALETLVLYIF LFHS+L+GPLAVL Sbjct: 173 LIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPLAVL 232 Query: 1773 YKFLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVP 1594 YKFLDYFSKFDW+ YCISLNGPV +SSLP IV ETPE+G D LLT +FL+ C+EM+SVP Sbjct: 233 YKFLDYFSKFDWDNYCISLNGPVCLSSLPEIVVETPENGGEDFLLTSEFLKECMEMYSVP 292 Query: 1593 SRFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPEN 1414 SR + RGFQ KHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG+I LQ + Sbjct: 293 SRGFETNQRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQIFLQSDE 352 Query: 1413 GIANELQIFFSSTMQRHGCGQKPVVQDPYP---QSTYNGFIPVSSSL--GTDSYKLENSD 1249 I +EL+ FFS+ + RHG GQ+P V D P + YN P S+ G Y+ E+S Sbjct: 353 AIKSELRKFFSNMLLRHGSGQRPDVLDAVPFVRYNRYNALSPASNHFQEGQVVYESESSS 412 Query: 1248 NA-----------------------AGH----------------RFIGDANDLATPSIAG 1186 ++ GH RF GDA DLAT I Sbjct: 413 SSGATGNGRHDQEGSLDAGVSISSTTGHELSGSPGETAPSVSEERFSGDAKDLATLRIQK 472 Query: 1185 LNISSDSPK-------------------FRNMRN-ENPNSDQWGKCEKNV---------- 1096 L IS D+ K F MRN E N + GK ++N Sbjct: 473 LEISDDAMKSPCLSDKESVSPLNGKHHSFHQMRNGEVLNGNGVGKQQENSCLADSRRVKD 532 Query: 1095 
--SSEVLPEDSDHTN--------WDQ----XXXXXXXXXXXXXXXXXXXXDYESYFNCLQ 958 S+E E H + W Q DYES N L+ Sbjct: 533 IHSNENENEHVGHEDLPFTGAVPWPQEDMHLHYSGHCVSGTPNMLSDLSGDYESQLNSLR 592 Query: 957 YGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRNGFIPVYPMR 778 +GRW ++Y P P++P Q N W + + R N + + NG +P Sbjct: 593 FGRWWFDYVQNGPMSPLSPPGLPQLPNNNSWEVIRHALPFRRNAPTPVNANGVVPRQVFF 652 Query: 777 HA---LVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLSSPRDNGRNV 607 H ++PG F EE+PKPRGTGTYFP N RP + +G++ SPR+NGR++ Sbjct: 653 HVNPQMIPGPGFAIEELPKPRGTGTYFPNANHYRD--RPFSPRGRSSHQARSPRNNGRSM 710 Query: 606 I--FMETNLLDRNSHERS-QPQVLVDQTVNLGSSGIHQSFSPRGNGH------------- 475 + E N DRN+ ER + + ++ + H+SF P NG Sbjct: 711 VQAHSEMNFPDRNTRERQLHYPNQTNGSCDMSHTDSHESF-PDTNGSTNHPYEKAPDFRP 769 Query: 474 ---VPVGTFSPEQNRQQRXXXXXXXXXXXXXPGMQRPKT-ALSRDLDRVSFKSSYHLKDQ 307 +PV SP + + R RPK+ S DRV+ SYHL D Sbjct: 770 TEPLPVEVLSPPEGSKPRDSIEGHHNRP------HRPKSIPSSTQEDRVTPTQSYHLTDD 823 Query: 306 DDFPPL 289 +FPPL Sbjct: 824 HEFPPL 829 >ref|NP_850678.2| PAP/OAS1 substrate-binding domain superfamily [Arabidopsis thaliana] gi|332645293|gb|AEE78814.1| PAP/OAS1 substrate-binding domain superfamily [Arabidopsis thaliana] Length = 829 Score = 619 bits (1596), Expect = e-174 Identities = 383/845 (45%), Positives = 481/845 (56%), Gaps = 110/845 (13%) Frame = -2 Query: 2493 SSMDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTT 2314 + +DDL E + S +LS PL+ P P +PE W +E+A II +V PT Sbjct: 3 ADLDDLEEESSS--------SLSPPLLPP--PRSPLNQPELWMRVEEATREIIEQVHPTL 52 Query: 2313 ISEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDM 2134 +SE+RRRDVI YVQ+LIR LG EV +GSVPLKTYLPDGDIDLTAFGG E++L + Sbjct: 53 VSEDRRRDVILYVQKLIRMTLGCEVHSFGSVPLKTYLPDGDIDLTAFGGLYHEEELAAKV 112 Query: 2133 KSVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDR 1954 +VLE EE+N S++FVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQIGG+CTLCFLEK+D Sbjct: 113 FAVLEREEHNLSSQFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQIGGICTLCFLEKIDH 172 Query: 1953 LIGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVL 1774 
LIGK+HLFKRSIILIKAWCYYESRILGA HGLISTYALETLVLYIF LFHS+L+GPLAVL Sbjct: 173 LIGKDHLFKRSIILIKAWCYYESRILGAFHGLISTYALETLVLYIFHLFHSSLNGPLAVL 232 Query: 1773 YKFLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVP 1594 YKFLDYFSKFDW++YCISLNGPV +SSLP IV ETPE+G DLLLT +FL+ C+EM+SVP Sbjct: 233 YKFLDYFSKFDWDSYCISLNGPVCLSSLPDIVVETPENGGEDLLLTSEFLKECLEMYSVP 292 Query: 1593 SRFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPEN 1414 SR + RGFQ KHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG++ LQ + Sbjct: 293 SRGFETNPRGFQSKHLNIVDPLKETNNLGRSVSKGNFYRIRSAFTYGARKLGQLFLQSDE 352 Query: 1413 GIANELQIFFSSTMQRHGCGQKPVVQDPYP---QSTYNGFIPVSS--------------- 1288 I++EL+ FFS+ + RHG GQ+P V D P + YN +P S+ Sbjct: 353 AISSELRKFFSNMLLRHGSGQRPDVHDAIPFLRYNRYNAILPASNHFQEGQVVNESESSS 412 Query: 1287 ---SLGTDSYKLENSDNA-----------------------AGHRFIGDANDLATPSIAG 1186 + G + E+S +A + RF GDA DLAT I Sbjct: 413 SSGATGNGRHDQEDSLDAGVSIPSTTGPDLSGSPGETVPSVSEERFSGDAKDLATLRIQK 472 Query: 1185 LNIS-------------SDSP------KFRNMRN-ENPNSDQWGKCEKN---------VS 1093 L IS SDSP F MRN E N + GK ++N Sbjct: 473 LEISDDAMKSPCLSDKESDSPLNGKHHSFNQMRNGEVLNGNGVGKQQENSWHTGSRRVKD 532 Query: 1092 SEVLPEDSDHTNWD---------------QXXXXXXXXXXXXXXXXXXXXDYESYFNCLQ 958 + +++H ++ DYES N L+ Sbjct: 533 IHINENENEHVGYEDLPFASAVPWPQEDMHLHYSGHCVSGTPNMLSDLSGDYESQLNSLR 592 Query: 957 YGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRNGFIPVYPMR 778 +GRW ++Y P P++P Q N W + + R N + + NG +P Sbjct: 593 FGRWWFDYVQNGPMSPLSPPGLPQLPNNNSWEVMRHALPFRRNAPTPVNANGVVPRQVFF 652 Query: 777 HA---LVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLSSPRDNGRNV 607 H ++PG FG EE+PKPRGTGTYFP N RP + +G+N SPR+NGR++ Sbjct: 653 HVNPQMIPGPGFGIEELPKPRGTGTYFPNANHYRD--RPFSPRGRNSHQARSPRNNGRSM 710 Query: 606 --IFMETNLLDRNSHER--------------SQPQVLVDQTVNLGSSGIHQSFSP--RGN 481 E N DRN+ ER S L GS+ +P R Sbjct: 711 SQAHSEMNFPDRNTRERQLHYPNQTNGSCDMSHTDSLDSFPDTNGSTNHPYEKAPDFRPT 770 Query: 480 GHVPVGTFSPEQNRQQRXXXXXXXXXXXXXPGMQRPK-TALSRDLDRVSFKSSYHLKDQD 304 +PV SP ++ + R RPK S +RV+ SYHL D D Sbjct: 771 
EPLPVEVLSPPEDSKPRDSIEGHHNRP------HRPKPRPSSTQEERVTPTQSYHLTDDD 824 Query: 303 DFPPL 289 +FPPL Sbjct: 825 EFPPL 829 >ref|XP_009763966.1| PREDICTED: uncharacterized protein LOC104215769 isoform X3 [Nicotiana sylvestris] Length = 649 Score = 591 bits (1524), Expect = e-166 Identities = 332/616 (53%), Positives = 387/616 (62%), Gaps = 86/616 (13%) Frame = -2 Query: 2406 ANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAEVFPYG 2227 +NP S+I PERW EKA II VQPT +SE+RRR VIDYVQRLI LG EVFPYG Sbjct: 37 SNPSVSDIGPERWAKAEKATQNIIRVVQPTAVSEDRRRAVIDYVQRLIGGCLGCEVFPYG 96 Query: 2226 SVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLICAEVKL 2047 SVPLKTYLPDGDIDLTAFGG N ED L +DM SVLE E+ N++AEFVVK+VQ+I AEVKL Sbjct: 97 SVPLKTYLPDGDIDLTAFGGTNFEDALANDMVSVLEAEDQNKAAEFVVKDVQMIRAEVKL 156 Query: 2046 VKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESRILGAH 1867 VKCIVQ+IVVD+SFNQIGGLCTLCFLE+VDRLIGK+HLFKRSIILIK WCYYESRILGAH Sbjct: 157 VKCIVQNIVVDISFNQIGGLCTLCFLEQVDRLIGKDHLFKRSIILIKTWCYYESRILGAH 216 Query: 1866 HGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVRISSLP 1687 HGLISTYALETLVLYIF FHSTLDGPLAVLYKFLDYFSKFDWE C+SL GPVRISSLP Sbjct: 217 HGLISTYALETLVLYIFHFFHSTLDGPLAVLYKFLDYFSKFDWENCCVSLTGPVRISSLP 276 Query: 1686 VIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKDINNLG 1507 V E PE GDLLL++DF+R C++MFSVPS+ D SR F +KHLNI+DPLK+ NNLG Sbjct: 277 ESVVEMPETDGGDLLLSNDFVRYCLDMFSVPSKGGDSNSRTFLRKHLNIIDPLKENNNLG 336 Query: 1506 RSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPVVQDPY 1327 RS+S+GNF+RIRSAFSYGA+KLG IL+Q E+ IA EL FF +TM RHG G++P VQD Sbjct: 337 RSVSQGNFFRIRSAFSYGARKLGSILIQSEDKIAEELYKFFPNTMDRHGSGERPDVQD-- 394 Query: 1326 PQSTYNGFIPVSSSLGTDSYKLENSDNAA------------------------------- 1240 NGF P S + + ++ + N+A Sbjct: 395 ---MINGFCPASPAPDFEPSRINSDLNSASDSGIFRLNPDESCCREDGHHKSITDSHEKG 451 Query: 1239 ---GHRFIGDANDLATPSIAGLNISSDSPKFRNMRNENPNSDQWG--------------- 1114 G+R GDA DLA+ GL+IS+ P+ + ++ S Sbjct: 452 SPLGYRLSGDAADLASSMENGLSISTHIPQLTDSSSKKCQSTTKAMPYHAPHLYFTNSLV 511 Query: 1113 
-----KCEKNVSS-EVLPED-------------------------------SDHTNWDQX 1045 K EK VSS LP S+ NWD Sbjct: 512 CNGEMKNEKRVSSGSSLPTSDEGRDFTVDGLKQTVLDVKEAVSSTPKAYGCSEDLNWD-- 569 Query: 1044 XXXXXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPW 865 DY++YFN LQYGRWCYEY S +LP+ P PP + K W Sbjct: 570 LASTNGAGIPSKALSDLSGDYDNYFNYLQYGRWCYEYAS---NLPVPPAPPSPFHIKYSW 626 Query: 864 NGVIYPSQLRHNGFSH 817 PS ++ NGFSH Sbjct: 627 EAAQQPSYMKRNGFSH 642 >gb|KDO49671.1| hypothetical protein CISIN_1g002779mg [Citrus sinensis] Length = 710 Score = 576 bits (1484), Expect = e-161 Identities = 316/533 (59%), Positives = 371/533 (69%), Gaps = 63/533 (11%) Frame = -2 Query: 2487 MDDLLE-NADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTI 2311 M DL + + + A GER SS P+N + I E W+ E+A II +VQPT + Sbjct: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQ--TAIGAEYWQRAEEATQGIIAQVQPTVV 58 Query: 2310 SEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMK 2131 SEERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG NVE+ L +D+ Sbjct: 59 SEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVC 118 Query: 2130 SVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRL 1951 SVLE E+ N++AEFVVK+ QLI AEVKLVKC+VQ+IVVD+SFNQ+GGL TLCFLE+VDRL Sbjct: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178 Query: 1950 IGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLY 1771 IGK+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIF LFHS+L+GPLAVLY Sbjct: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLY 238 Query: 1770 KFLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPS 1591 KFLDYFSKFDW++YCISLNGPVRISSLP +V ETPE+ GDLLL+ +FL+ CVE FSVPS Sbjct: 239 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 298 Query: 1590 RFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENG 1411 R D SR F KHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG IL QPE Sbjct: 299 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 358 Query: 1410 IANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGT-----DSYKLENSDN 1246 + +EL+ FFS+T+ RHG 
GQ+P VQDP P S YNGF S+ GT D E+ N Sbjct: 359 LTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPN 418 Query: 1245 AAG----------------------------HRFIGDANDLATPSIAGLNISSDSPKFRN 1150 ++G R GDA DLAT L IS+++ K + Sbjct: 419 SSGITENCRIDDEAETINEPHNSGNGTAVSETRLSGDAKDLATSKNLNLVISNETSKCSS 478 Query: 1149 MRNE-----------------------NPNSDQW------GKCEKNVSSEVLP 1078 + E N NS +W G EKNV+S +LP Sbjct: 479 LSGEESKARHAPHLYFSSSTMGNGEIRNGNS-EWKQQLNSGSAEKNVTSGILP 530 Score = 72.0 bits (175), Expect = 3e-09 Identities = 39/97 (40%), Positives = 51/97 (52%), Gaps = 3/97 (3%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 YES+ L + W YE+ PM+P Q+Q+K W+ + R N N Sbjct: 608 YESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSAN 667 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFP 703 G +P YPM ++PG +FG EEMPK RGTGTYFP Sbjct: 668 GAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFP 704 >ref|XP_006429558.1| hypothetical protein CICLE_v10011044mg [Citrus clementina] gi|568855155|ref|XP_006481174.1| PREDICTED: uncharacterized protein LOC102622468 [Citrus sinensis] gi|557531615|gb|ESR42798.1| hypothetical protein CICLE_v10011044mg [Citrus clementina] Length = 882 Score = 571 bits (1472), Expect = e-159 Identities = 292/423 (69%), Positives = 339/423 (80%), Gaps = 6/423 (1%) Frame = -2 Query: 2487 MDDLLE-NADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTI 2311 M DL + + + A GER SS P+N + I E W+ E+A II +VQPT + Sbjct: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQ--TAIGAEYWQRAEEATQAIIAQVQPTVV 58 Query: 2310 SEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMK 2131 SEERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG NVE+ L +D+ Sbjct: 59 SEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVC 118 Query: 2130 SVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRL 1951 SVLE E+ N++AEFVVK+ QLI AEVKLVKC+VQ+IVVD+SFNQ+GGL TLCFLE+VDRL Sbjct: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178 Query: 1950 
IGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLY 1771 IGK+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIF LFHS+L+GPLAVLY Sbjct: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLY 238 Query: 1770 KFLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPS 1591 KFLDYFSKFDW++YCISLNGPVRISSLP +V ETPE+ GDLLL+ +FL+ CVE FSVPS Sbjct: 239 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 298 Query: 1590 RFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENG 1411 R D SR F KHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG IL QPE Sbjct: 299 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 358 Query: 1410 IANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGT-----DSYKLENSDN 1246 + +EL+ FFS+T+ RHG GQ+P VQDP P S YNGF S+ LGT D E+ N Sbjct: 359 LTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFLGTELCREDQTIYESEPN 418 Query: 1245 AAG 1237 ++G Sbjct: 419 SSG 421 Score = 129 bits (324), Expect = 1e-26 Identities = 95/263 (36%), Positives = 124/263 (47%), Gaps = 30/263 (11%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 YES+ L + RW YE+ PM+P Q+Q+K W+ + R N + N Sbjct: 627 YESHLISLNHVRWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMNAN 686 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G +P YPM ++PG +FG EEMPK RGTGTYFP N RP ++G+NQAP+ Sbjct: 687 GAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPNTNHYRD--RPLNLRGRNQAPVR 744 Query: 633 SPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQT-VNLG------SSGIHQSFSPRGN-- 481 SPR NGR + ETN+L+ +S E S + V Q V G SS + P N Sbjct: 745 SPRSNGRVMTPPETNILEGSSREPSPAHIHVHQVGVKAGLSEPCHSSSPEKKTQPNANGL 804 Query: 480 -------------GHVPVGTFSPEQNRQQRXXXXXXXXXXXXXPGMQRPKTALSR----- 355 GH+ G S + NRQ G+ P+T SR Sbjct: 805 VHPVDRVVEFGSVGHLYYGPPSLDSNRQPN---TCSTIGQDSSVGLSSPRTPRSRPGLGT 861 Query: 354 DLDRVSFKSSYHLKDQDDFPPLS 286 D DR + YHLKD +DFPPLS Sbjct: 862 DQDRTDVQ--YHLKD-EDFPPLS 881 >gb|KDO49672.1| hypothetical protein CISIN_1g002779mg [Citrus sinensis] Length = 729 Score = 568 bits (1465), Expect = e-159 Identities = 
316/552 (57%), Positives = 371/552 (67%), Gaps = 82/552 (14%) Frame = -2 Query: 2487 MDDLLE-NADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTI 2311 M DL + + + A GER SS P+N + I E W+ E+A II +VQPT + Sbjct: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQ--TAIGAEYWQRAEEATQGIIAQVQPTVV 58 Query: 2310 SEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMK 2131 SEERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG NVE+ L +D+ Sbjct: 59 SEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVC 118 Query: 2130 SVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRL 1951 SVLE E+ N++AEFVVK+ QLI AEVKLVKC+VQ+IVVD+SFNQ+GGL TLCFLE+VDRL Sbjct: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178 Query: 1950 IGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLY 1771 IGK+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIF LFHS+L+GPLAVLY Sbjct: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLY 238 Query: 1770 KFLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPS 1591 KFLDYFSKFDW++YCISLNGPVRISSLP +V ETPE+ GDLLL+ +FL+ CVE FSVPS Sbjct: 239 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 298 Query: 1590 RFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENG 1411 R D SR F KHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG IL QPE Sbjct: 299 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 358 Query: 1410 IANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGT-----DSYKLENSDN 1246 + +EL+ FFS+T+ RHG GQ+P VQDP P S YNGF S+ GT D E+ N Sbjct: 359 LTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPN 418 Query: 1245 AAG-----------------------------------------------HRFIGDANDL 1207 ++G R GDA DL Sbjct: 419 SSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSGDAKDL 478 Query: 1206 ATPSIAGLNISSDSPKFRNMRNE-----------------------NPNSDQW------G 1114 AT L IS+++ K ++ E N NS +W G Sbjct: 479 ATSKNLNLVISNETSKCSSLSGEESKARHAPHLYFSSSTMGNGEIRNGNS-EWKQQLNSG 537 Query: 1113 KCEKNVSSEVLP 1078 EKNV+S +LP Sbjct: 538 SAEKNVTSGILP 549 Score = 72.0 bits (175), Expect = 
3e-09 Identities = 39/97 (40%), Positives = 51/97 (52%), Gaps = 3/97 (3%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 YES+ L + W YE+ PM+P Q+Q+K W+ + R N N Sbjct: 627 YESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSAN 686 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFP 703 G +P YPM ++PG +FG EEMPK RGTGTYFP Sbjct: 687 GAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFP 723 >gb|KDO49669.1| hypothetical protein CISIN_1g002779mg [Citrus sinensis] Length = 882 Score = 568 bits (1465), Expect = e-159 Identities = 316/552 (57%), Positives = 371/552 (67%), Gaps = 82/552 (14%) Frame = -2 Query: 2487 MDDLLE-NADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTI 2311 M DL + + + A GER SS P+N + I E W+ E+A II +VQPT + Sbjct: 1 MGDLRDWSPEPNGAVFGERPSSSSSSVPSNQ--TAIGAEYWQRAEEATQGIIAQVQPTVV 58 Query: 2310 SEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMK 2131 SEERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG NVE+ L +D+ Sbjct: 59 SEERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNVEEALANDVC 118 Query: 2130 SVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRL 1951 SVLE E+ N++AEFVVK+ QLI AEVKLVKC+VQ+IVVD+SFNQ+GGL TLCFLE+VDRL Sbjct: 119 SVLEREDQNKAAEFVVKDAQLIRAEVKLVKCLVQNIVVDISFNQLGGLSTLCFLEQVDRL 178 Query: 1950 IGKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLY 1771 IGK+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIF LFHS+L+GPLAVLY Sbjct: 179 IGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLNGPLAVLY 238 Query: 1770 KFLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPS 1591 KFLDYFSKFDW++YCISLNGPVRISSLP +V ETPE+ GDLLL+ +FL+ CVE FSVPS Sbjct: 239 KFLDYFSKFDWDSYCISLNGPVRISSLPEVVVETPENSGGDLLLSSEFLKECVEQFSVPS 298 Query: 1590 RFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENG 1411 R D SR F KHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG IL QPE Sbjct: 299 RGFDTNSRSFPPKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGHILSQPEES 358 Query: 1410 IANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGT-----DSYKLENSDN 1246 + +EL+ FFS+T+ RHG GQ+P VQDP P S 
YNGF S+ GT D E+ N Sbjct: 359 LTDELRKFFSNTLDRHGSGQRPDVQDPVPLSRYNGFGVSSTFSGTELCREDQTIYESEPN 418 Query: 1245 AAG-----------------------------------------------HRFIGDANDL 1207 ++G R GDA DL Sbjct: 419 SSGITENCRIDDEAELCGGVGKIKVSGMESSYCRTINEPHNSGNGTAVSETRLSGDAKDL 478 Query: 1206 ATPSIAGLNISSDSPKFRNMRNE-----------------------NPNSDQW------G 1114 AT L IS+++ K ++ E N NS +W G Sbjct: 479 ATSKNLNLVISNETSKCSSLSGEESKARHAPHLYFSSSTMGNGEIRNGNS-EWKQQLNSG 537 Query: 1113 KCEKNVSSEVLP 1078 EKNV+S +LP Sbjct: 538 SAEKNVTSGILP 549 Score = 127 bits (319), Expect = 5e-26 Identities = 95/263 (36%), Positives = 123/263 (46%), Gaps = 30/263 (11%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 YES+ L + W YE+ PM+P Q+Q+K W+ + R N N Sbjct: 627 YESHQISLNHVWWWYEHALNSSYSPMSPQLLSQFQSKNSWDLMQRSLPFRRNIIPQMSAN 686 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G +P YPM ++PG +FG EEMPK RGTGTYFP N RP ++G+NQAP+ Sbjct: 687 GAVPRPLFYPMTPPMLPGASFGMEEMPKHRGTGTYFPNTNHYRD--RPLNLRGRNQAPVR 744 Query: 633 SPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQT-VNLG------SSGIHQSFSPRGN-- 481 SPR NGR + ETN+L+ +SHE S + V Q V G SS + P N Sbjct: 745 SPRSNGRVMTPPETNILEGSSHEPSPAHIHVHQVGVKAGLSEPCHSSSPEKKTQPNANGL 804 Query: 480 -------------GHVPVGTFSPEQNRQQRXXXXXXXXXXXXXPGMQRPKTALSR----- 355 GH+ G S + NRQ G+ P+T SR Sbjct: 805 VHPVDRVVEFGSVGHLYYGPPSLDSNRQPN---TCSTIGQDSSVGLSSPRTPRSRPGLGT 861 Query: 354 DLDRVSFKSSYHLKDQDDFPPLS 286 D DR + YHLKD +DFPPLS Sbjct: 862 DQDRTDVQ--YHLKD-EDFPPLS 881 >ref|XP_007033558.1| NT domain of poly(A) polymerase and terminal uridylyl transferase-containing protein, putative [Theobroma cacao] gi|508712587|gb|EOY04484.1| NT domain of poly(A) polymerase and terminal uridylyl transferase-containing protein, putative [Theobroma cacao] Length = 890 Score = 567 bits (1460), Expect = e-158 Identities = 308/509 (60%), Positives = 364/509 (71%), Gaps = 53/509 (10%) Frame = -2 Query: 2487 MDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTIS 2308 M DL + + A E 
SS S +N + I E W+ E+A II +VQPT +S Sbjct: 4 MGDLRDWSPEPNGVASEERSSSSSSSSSNQ--AGIAAEYWKKAEEATQGIIAQVQPTVVS 61 Query: 2307 EERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKS 2128 EERR+ VIDYVQRLI N LG VFP+GSVPLKTYLPDGDIDLTAFGG N E+ L +D+ S Sbjct: 62 EERRKAVIDYVQRLIGNYLGCGVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDVCS 121 Query: 2127 VLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLI 1948 VLE E++NR+AEFVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLEKVDR I Sbjct: 122 VLEREDHNRAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEKVDRRI 181 Query: 1947 GKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYK 1768 GK+HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIF LFHS+LDGPLAVLYK Sbjct: 182 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHSSLDGPLAVLYK 241 Query: 1767 FLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSR 1588 FLDYFSKFDW+ YCISLNGP+ ISSLP +V ETPE+G GDLLL++DFL+ CVEMFSVPSR Sbjct: 242 FLDYFSKFDWDNYCISLNGPIHISSLPEVVVETPENGGGDLLLSNDFLKECVEMFSVPSR 301 Query: 1587 FVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGI 1408 + SR F QKHLNIVDPL++ NNLGRS+SKGNFYRIRSAF+YGA+KLG+IL Q E + Sbjct: 302 GFETNSRTFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLGKILSQAEESM 361 Query: 1407 ANELQIFFSSTMQRHGCGQKPVVQDPYPQ-STYNGFIPVSSSLGTDS-------YKLENS 1252 A+EL+ FFS+T+ RHG GQ+P VQD P S ++GF SS GT+S Y+ E+S Sbjct: 362 ADELRKFFSNTLDRHGSGQRPDVQDCIPSLSRFSGFGATSSVSGTESCQEDQTFYETESS 421 Query: 1251 D----------------------NAAGH-----------------------RFIGDANDL 1207 + N +G R GDA DL Sbjct: 422 NSITMTRNHRSDNEGSLHKVDNGNVSGRETNFSRILNEPQASANGMGVSEIRLSGDAKDL 481 Query: 1206 ATPSIAGLNISSDSPKFRNMRNENPNSDQ 1120 AT I GL IS+D+ K + +PNS++ Sbjct: 482 ATSRIQGLVISNDAHK-----SYDPNSEE 505 Score = 147 bits (370), Expect = 6e-32 Identities = 91/259 (35%), Positives = 126/259 (48%), Gaps = 25/259 (9%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 ++S+ L YGRWC++Y P+ P+ Q Q+ W+ V Q R N S + N Sbjct: 634 HDSHLRSLSYGRWCFDYAFNASVSPITPLVS-QLQSNNSWDVVRQSVQFRRNAISPMNAN 692 Query: 804 
GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G +P YPM ++P FG EEMPKPRGTGTYFP N + R +G++Q + Sbjct: 693 GVVPRQVYYPMNPPMLPAAGFGMEEMPKPRGTGTYFPNHNTNHYRDRSLTARGRSQVQVR 752 Query: 633 SPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQTVNLGSS-----GIHQSFSPRGNGHV- 472 SPR+N R + ETN +R+S E +Q Q GSS G + P NG V Sbjct: 753 SPRNNSRAITSPETNSPERSSRELAQVQSPHQGGGKSGSSDLRHFGSEKVLYPNANGSVH 812 Query: 471 --------------PVGTFSPEQNRQQR--XXXXXXXXXXXXXPGMQRPKTALSRDLDRV 340 P+G SPE N Q GMQR K+ + + DR+ Sbjct: 813 HPERVVEFGSIGPLPLGPASPESNMQHNPGSPHALNLSASQPPSGMQRSKSTVGVEQDRI 872 Query: 339 SFKSSYHLKDQDDFPPLSV 283 + + SYHLK+++DFPPLS+ Sbjct: 873 AIR-SYHLKNEEDFPPLSI 890 >ref|XP_007142048.1| hypothetical protein PHAVU_008G248100g [Phaseolus vulgaris] gi|561015181|gb|ESW14042.1| hypothetical protein PHAVU_008G248100g [Phaseolus vulgaris] Length = 803 Score = 564 bits (1454), Expect = e-157 Identities = 353/785 (44%), Positives = 448/785 (57%), Gaps = 72/785 (9%) Frame = -2 Query: 2424 SPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGA 2245 SP + +NP PS + + W E+ I+ +QPT ++ RRR+V+DYVQRLIR Sbjct: 23 SPPLPISNPDPSSVVADAWAAAEQTTGEILRSIQPTLAADRRRREVVDYVQRLIRYGARC 82 Query: 2244 EVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLI 2065 EVFPYGSVPLKTYLPDGDIDLTA N+ED LV D+++VL EENN +AE+ VK+V+ I Sbjct: 83 EVFPYGSVPLKTYLPDGDIDLTALSCQNIEDGLVSDVRAVLHGEENNEAAEYEVKDVRFI 142 Query: 2064 CAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYES 1885 AEVKLVKCIVQDIVVD+SFNQ+GGL TLCFLEKVDRL+ K+HLFKRSIILIKAWCYYES Sbjct: 143 DAEVKLVKCIVQDIVVDISFNQLGGLSTLCFLEKVDRLVAKDHLFKRSIILIKAWCYYES 202 Query: 1884 RILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPV 1705 R+LGAHHGLISTYALETLVLYIF FH +LDGPLAVLY+FLDYFSKFDW+ YC+SL GPV Sbjct: 203 RVLGAHHGLISTYALETLVLYIFHQFHVSLDGPLAVLYRFLDYFSKFDWDNYCVSLKGPV 262 Query: 1704 RISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLK 1525 SSLP IVAE PE+G G+ LLT++F+RSCVE FSVPSR D R F QKHLNI+DPLK Sbjct: 263 SKSSLPNIVAEGPENG-GNTLLTEEFIRSCVESFSVPSRGPDLNLRVFPQKHLNIIDPLK 
321 Query: 1524 DINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKP 1345 + NNLGRS++KGNF+RIRSAF YGA+KLG IL+ P++ IA+EL FF++T++RHG Q Sbjct: 322 ENNNLGRSVNKGNFFRIRSAFKYGARKLGWILMLPDDRIADELIRFFANTLERHGSTQLN 381 Query: 1344 V-----------VQDPYPQSTYN----GFIPVSSSLGTDSYKLENSDNA-AGHRFIGDAN 1213 V +D P + +N I +SSL + + NA A + D+ Sbjct: 382 VDKSVLSLSTASKKDDKPGNQHNYESREEIQDASSLAGEFFDCSGDGNAVASFKLSEDSR 441 Query: 1212 DLATPSIAGL----------------NISSDSPKFRNMRNEN--PNSDQWGKCEKNVSS- 1090 D AT + + NIS+ P + +E NS + EKN++S Sbjct: 442 DFATSGVLDIASANDLSYCSNGQIENNISNSEPALNTVIDEGMVSNSPRSHTDEKNMASY 501 Query: 1089 --------EVLPEDSDHTNWDQXXXXXXXXXXXXXXXXXXXXDYESYFNCLQYGRWCYEY 934 +L + H+ D+ DY S+ LQYG+ C Y Sbjct: 502 GSAVSTYANILENNFFHS--DRYTTNVSGGTEASMSLLDLTGDYHSHIGNLQYGQMCNGY 559 Query: 933 TSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRNGFI--PVYPMRHALVPG 760 T P +P P P ++ N+ PW V Q+ H+ S + N I VY + H +P Sbjct: 560 TVS-PVVPSPPRSP-KFPNRNPWETVRQCVQINHSIRSQANSNCVIGQQVYVINHPTLPM 617 Query: 759 IAFGREEMPKPRGTGTYFPKMNQSP-QGYRPSAVKGKNQAPLS------SPRDNGRNVIF 601 AF EE K RGTG YFP M+ P + RP +G+ QAP S R+NG + Sbjct: 618 TAFASEEKRKIRGTGAYFPNMSSRPFRDNRPIPGRGRGQAPGSHGHLQRHTRNNGLALAP 677 Query: 600 METNLLDRNSHERS-QPQVLVDQTVNLGSSGIHQSFSPRGNGHV----------PVGTFS 454 ETNL + E S + + T S S G+ + G+ Sbjct: 678 QETNLSAEGTFEYSLEGYSTIGSTKTRSSETYFPQPSTWGSHYANGFLHSSEKQESGSVI 737 Query: 453 PEQNRQQRXXXXXXXXXXXXXPGMQRPKTAL-----SRDLDRVSFK----SSYHLKDQDD 301 P+ R P T + S L V K +Y LK++DD Sbjct: 738 PQPRVAPRADMGNYPDSGISTSRGTVPNTGVVTEEKSNSLSAVDSKRIDVQAYRLKNEDD 797 Query: 300 FPPLS 286 FPPLS Sbjct: 798 FPPLS 802 >gb|KJB27692.1| hypothetical protein B456_005G005000 [Gossypium raimondii] Length = 737 Score = 560 bits (1444), Expect = e-156 Identities = 304/496 (61%), Positives = 347/496 (69%), Gaps = 53/496 (10%) Frame = -2 Query: 2487 MDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTIS 2308 M DL + + + SS S +N + I E W E+A II +VQPT +S Sbjct: 4 MGDLRDWSPEPNGVSSRDRYSSSSSSSSNQ--AGISAEYWRKAEEATQGIIARVQPTVVS 61 Query: 2307 
EERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKS 2128 EERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG N E+ L +D S Sbjct: 62 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDACS 121 Query: 2127 VLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLI 1948 VLE E+ N +AEFVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE+VDRLI Sbjct: 122 VLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLI 181 Query: 1947 GKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYK 1768 G++HLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIF LFHS+LDGPLAVLYK Sbjct: 182 GQDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSSLDGPLAVLYK 241 Query: 1767 FLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSR 1588 FLDYFSKFDWE YCISLNGP+ ISSLP IV ETPE+G GDLLL++DFLR CVE FSVPSR Sbjct: 242 FLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRECVETFSVPSR 301 Query: 1587 FVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGI 1408 D SR F QKHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG+IL Q E + Sbjct: 302 GFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGQILSQSEETL 361 Query: 1407 ANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGTDS-------YKLE--N 1255 +EL FFS+T+ RHG GQ+P VQDP P S + G S GT+S Y+ E N Sbjct: 362 GDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQEDQNFYESESSN 421 Query: 1254 SDNAAGH--------------------------------------------RFIGDANDL 1207 S G+ R GDA DL Sbjct: 422 SSTVTGNYRSSDNEGSLYKVYNGNMSERETDVGITFKEPQGSANASSISQIRLTGDAKDL 481 Query: 1206 ATPSIAGLNISSDSPK 1159 AT I GL IS+D+ K Sbjct: 482 ATSRIQGLVISNDAHK 497 Score = 79.7 bits (195), Expect = 1e-11 Identities = 40/100 (40%), Positives = 54/100 (54%), Gaps = 3/100 (3%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 Y++ L YG+W ++Y PM+ Q+Q+K W+ V Q R N S + N Sbjct: 633 YDANIRSLSYGQWWFDYAFSAAVPPMSSPLVSQFQSKNSWDVVRKSGQFRRNAISPMNTN 692 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMN 694 G +P YP+ ++ G FG EEMPKPRGTGTYFP N Sbjct: 693 GGVPRQAYYPINPPVLHGSGFGIEEMPKPRGTGTYFPNPN 732 >ref|XP_012481362.1| 
PREDICTED: uncharacterized protein LOC105796291 isoform X1 [Gossypium raimondii] gi|763760437|gb|KJB27691.1| hypothetical protein B456_005G005000 [Gossypium raimondii] Length = 884 Score = 560 bits (1444), Expect = e-156 Identities = 304/496 (61%), Positives = 347/496 (69%), Gaps = 53/496 (10%) Frame = -2 Query: 2487 MDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTIS 2308 M DL + + + SS S +N + I E W E+A II +VQPT +S Sbjct: 4 MGDLRDWSPEPNGVSSRDRYSSSSSSSSNQ--AGISAEYWRKAEEATQGIIARVQPTVVS 61 Query: 2307 EERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKS 2128 EERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG N E+ L +D S Sbjct: 62 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDACS 121 Query: 2127 VLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLI 1948 VLE E+ N +AEFVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE+VDRLI Sbjct: 122 VLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLI 181 Query: 1947 GKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYK 1768 G++HLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIF LFHS+LDGPLAVLYK Sbjct: 182 GQDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSSLDGPLAVLYK 241 Query: 1767 FLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSR 1588 FLDYFSKFDWE YCISLNGP+ ISSLP IV ETPE+G GDLLL++DFLR CVE FSVPSR Sbjct: 242 FLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRECVETFSVPSR 301 Query: 1587 FVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGI 1408 D SR F QKHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG+IL Q E + Sbjct: 302 GFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGQILSQSEETL 361 Query: 1407 ANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGTDS-------YKLE--N 1255 +EL FFS+T+ RHG GQ+P VQDP P S + G S GT+S Y+ E N Sbjct: 362 GDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQEDQNFYESESSN 421 Query: 1254 SDNAAGH--------------------------------------------RFIGDANDL 1207 S G+ R GDA DL Sbjct: 422 SSTVTGNYRSSDNEGSLYKVYNGNMSERETDVGITFKEPQGSANASSISQIRLTGDAKDL 481 Query: 1206 ATPSIAGLNISSDSPK 1159 AT I GL IS+D+ K Sbjct: 482 
ATSRIQGLVISNDAHK 497 Score = 131 bits (330), Expect = 3e-27 Identities = 88/258 (34%), Positives = 123/258 (47%), Gaps = 24/258 (9%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 Y++ L YG+W ++Y PM+ Q+Q+K W+ V Q R N S + N Sbjct: 633 YDANIRSLSYGQWWFDYAFSAAVPPMSSPLVSQFQSKNSWDVVRKSGQFRRNAISPMNTN 692 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G +P YP+ ++ G FG EEMPKPRGTGTYFP N + R +G+N A Sbjct: 693 GGVPRQAYYPINPPVLHGSGFGIEEMPKPRGTGTYFPNPNTNYYKDRSLTARGRNPASAR 752 Query: 633 SPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQTVNLGS-----SGIHQSFSPRGNG--H 475 SPR+NGR + E N +RN+ E +Q + GS SG ++ SP NG H Sbjct: 753 SPRNNGRAITSPEPNSPERNNREVAQMHSVNQGVGKSGSSELRHSGSEKALSPNSNGSMH 812 Query: 474 VP--------------VGTFSPEQNRQQRXXXXXXXXXXXXXPGMQRPKTALSRDLDRVS 337 P V TF+ + GM+R K+A S D DR+ Sbjct: 813 QPDRLVEFGSMRALPLVPTFT-----ETGKPHNPGSPNAQNSTGMERLKSAASMDQDRI- 866 Query: 336 FKSSYHLKDQDDFPPLSV 283 S+HLK+++DFPPLS+ Sbjct: 867 LVQSFHLKNEEDFPPLSI 884 >ref|XP_012089694.1| PREDICTED: uncharacterized protein LOC105648043 [Jatropha curcas] gi|643706966|gb|KDP22776.1| hypothetical protein JCGZ_00363 [Jatropha curcas] Length = 900 Score = 560 bits (1444), Expect = e-156 Identities = 292/461 (63%), Positives = 338/461 (73%), Gaps = 51/461 (11%) Frame = -2 Query: 2385 IEPERWEILEKAAHMIICKVQPTTISEERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTY 2206 I E W+ E II +VQPT +SEERR+ VIDYVQRLIR +G EVFP+GSVPLKTY Sbjct: 34 ISAEYWQKAEDLTQGIIAQVQPTVVSEERRKAVIDYVQRLIRKSIGCEVFPFGSVPLKTY 93 Query: 2205 LPDGDIDLTAFGGANVEDKLVDDMKSVLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQD 2026 LPDGDIDLTAFGG NVE+ L +D+ SVLE E+ NR+AEF+VK+VQLI AEVKLVKC+VQ+ Sbjct: 94 LPDGDIDLTAFGGMNVEEVLANDVCSVLEREDKNRTAEFIVKDVQLIRAEVKLVKCLVQN 153 Query: 2025 IVVDVSFNQIGGLCTLCFLEKVDRLIGKNHLFKRSIILIKAWCYYESRILGAHHGLISTY 1846 IVVD+SFNQ+GGLCTLCFLEKVDRLIGK+HLFKRSIILIKAWCYYESRILGAHHGLISTY Sbjct: 154 IVVDISFNQLGGLCTLCFLEKVDRLIGKDHLFKRSIILIKAWCYYESRILGAHHGLISTY 213 Query: 1845 
ALETLVLYIFRLFHSTLDGPLAVLYKFLDYFSKFDWETYCISLNGPVRISSLPVIVAETP 1666 ALETLVLYIF LFHS+L+GPLAVLYKFLDYFSKFDW+TYCISLNGPVRISSLP ++ ETP Sbjct: 214 ALETLVLYIFHLFHSSLNGPLAVLYKFLDYFSKFDWDTYCISLNGPVRISSLPEVLVETP 273 Query: 1665 EDGSGDLLLTDDFLRSCVEMFSVPSRFVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGN 1486 E+G+ DLLLT+DFL+ CV+ FSVP+R + SR F KHLNIVDPLK+ NNLGRS+SKGN Sbjct: 274 ENGTCDLLLTNDFLKECVDTFSVPARGYETNSRAFSPKHLNIVDPLKENNNLGRSVSKGN 333 Query: 1485 FYRIRSAFSYGAKKLGRILLQPENGIANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNG 1306 FYRIRSAFSYGA+KLG IL QPE IA EL FFS+T+ RHG GQ+P VQDP P + +G Sbjct: 334 FYRIRSAFSYGARKLGLILSQPEEIIAAELSKFFSNTLDRHGSGQRPDVQDPAPSESQHG 393 Query: 1305 FIPVSSSLGTDS------------------------------------------------ 1270 F S G ++ Sbjct: 394 FAAAISFSGAETNQEDQTICESESSDSSSILGESRLDQEQPLHGDNVKISGRKIYFSRTV 453 Query: 1269 YKLENSDNAAG---HRFIGDANDLATPSIAGLNISSDSPKF 1156 +L+N N A R GDA DLAT + GL+I+ D+ KF Sbjct: 454 NELQNCANEAAVSEFRLFGDAKDLATFKMQGLSIAKDALKF 494 Score = 145 bits (365), Expect = 2e-31 Identities = 103/262 (39%), Positives = 131/262 (50%), Gaps = 29/262 (11%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 +ES+ N L GRW YEY + P Q+QNK W+ + Q R N FS + N Sbjct: 627 FESHLNSLHLGRWWYEYAFNASVASICPQLFPQFQNKNSWDVIRRSVQFRRNAFSQMNVN 686 Query: 804 GFI--PVYP-MRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G + PV+P M L+PG +FG+EEMPKPRGTGTYFP N R +G+NQAP+ Sbjct: 687 GVVSRPVFPPMNPPLMPGASFGKEEMPKPRGTGTYFPNTNHYRD--RNMTGRGRNQAPM- 743 Query: 633 SPRDNGRNVIFMETNLLDRNSHER--SQPQVLVDQT-VNLGSSGIHQSFSPRGN------ 481 SPR NGR V E +L +RN +R SQ Q + Q LG S +H + SP Sbjct: 744 SPRSNGRTVTSQEKHLPERNGRDRELSQAQYHMHQDGGKLGPSDLHHTGSPETKHYTNVN 803 Query: 480 ---------------GHVPVGTFSPEQNRQQR--XXXXXXXXXXXXXPGMQRPKTALSRD 352 GH+P+G S E Q PGMQ PK + + Sbjct: 804 GSMHHSERVVEFGSIGHLPMGPSSIEGGWQPNPGSAPAHNYRVSQAIPGMQGPKPVSAIN 863 Query: 351 LDRVSFKSSYHLKDQDDFPPLS 286 DR++ + SYHLKD DDFPPLS Sbjct: 864 QDRIAVQ-SYHLKD-DDFPPLS 883 >gb|KHG19864.1| Poly (A) RNA polymerase cid14 [Gossypium arboreum] 
Length = 881 Score = 559 bits (1441), Expect = e-156 Identities = 304/496 (61%), Positives = 347/496 (69%), Gaps = 53/496 (10%) Frame = -2 Query: 2487 MDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTIS 2308 M DL + + + SS S +N + E W E+A II +VQPT +S Sbjct: 4 MGDLRDWSPEPNGVSSRDRYSSSSSSSSNQTGTSAE--YWRKAEEATQGIIARVQPTVVS 61 Query: 2307 EERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKS 2128 EERR+ VIDYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG N E+ L +D S Sbjct: 62 EERRKAVIDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLNFEEALANDACS 121 Query: 2127 VLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLI 1948 VLE E+ N +AEFVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE+VDRLI Sbjct: 122 VLEREDRNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLI 181 Query: 1947 GKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYK 1768 GK+HLFKRSIILIKAWCYYESRILGAHHGLISTY LETLVLYIF LFHS+LDGPLAVLYK Sbjct: 182 GKDHLFKRSIILIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSSLDGPLAVLYK 241 Query: 1767 FLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSR 1588 FLDYFSKFDWE YCISLNGP+ ISSLP IV ETPE+G GDLLL++DFLR CVE FSVPSR Sbjct: 242 FLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRECVETFSVPSR 301 Query: 1587 FVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGI 1408 D SR F QKHLNIVDPLK+ NNLGRS+SKGNFYRIRSAF+YGA+KLG+IL Q E + Sbjct: 302 GFDANSRIFPQKHLNIVDPLKENNNLGRSVSKGNFYRIRSAFTYGARKLGQILSQSEETL 361 Query: 1407 ANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGTDS-------------- 1270 +EL+ FFS+T+ RHG GQ+P VQDP P S + G S GT+S Sbjct: 362 GDELRKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQEDQNFYESESSN 421 Query: 1269 -----------------YKLEN-------------------SDNAAG---HRFIGDANDL 1207 YK+ N S NA+ R GDA DL Sbjct: 422 SSTVTGNYRSSDNEGSLYKVNNGNMSERETDVGITFKEPQGSANASSISEIRLTGDAKDL 481 Query: 1206 ATPSIAGLNISSDSPK 1159 AT GL IS+D+ K Sbjct: 482 ATSRFQGLVISNDAHK 497 Score = 119 bits (299), Expect = 1e-23 Identities = 66/177 (37%), Positives = 93/177 (52%), Gaps = 8/177 (4%) Frame = -2 Query: 984 
YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 Y++ + L YG+WCY+Y P++ Q+Q+K W+ V Q R N S + N Sbjct: 634 YDANIHGLSYGQWCYDYAFSASIPPISSPLVSQFQSKNSWDAVHKSVQFRQNAISPMNAN 693 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G +P YP+ ++ G FG EEMPKPRGTGTYFP N + R +G+N A Sbjct: 694 GGVPRQAYYPINPPVLHGSGFGMEEMPKPRGTGTYFPNPNTNYYKDRSLTARGRNPALAR 753 Query: 633 SPRDNGRNVIFMETNLLDRNSHERSQPQVLVDQTVNLGSSGIHQS-----FSPRGNG 478 SPR+NGR + F E N +R++ + +Q Q + GSSG+ S SP NG Sbjct: 754 SPRNNGRAITFPEPNSPERSNRDLAQMQSINQGVGKSGSSGLRHSGSEKALSPNANG 810 >gb|KJB27695.1| hypothetical protein B456_005G005100 [Gossypium raimondii] Length = 881 Score = 558 bits (1437), Expect = e-155 Identities = 302/496 (60%), Positives = 348/496 (70%), Gaps = 53/496 (10%) Frame = -2 Query: 2487 MDDLLENADSGQAAAGERTLSSPLMSPANPCPSEIEPERWEILEKAAHMIICKVQPTTIS 2308 M DL + + + + SS S +N + I E W E+A II +VQPT +S Sbjct: 4 MGDLRDWSPEPNGVSSRDSYSSSPSSSSNQ--TGISAEYWRKAEEATQGIIARVQPTVVS 61 Query: 2307 EERRRDVIDYVQRLIRNRLGAEVFPYGSVPLKTYLPDGDIDLTAFGGANVEDKLVDDMKS 2128 EERR+ V DYVQRLIRN LG EVFP+GSVPLKTYLPDGDIDLTAFGG E+ L +D+ S Sbjct: 62 EERRKAVTDYVQRLIRNYLGCEVFPFGSVPLKTYLPDGDIDLTAFGGLIFEEALANDVCS 121 Query: 2127 VLEEEENNRSAEFVVKEVQLICAEVKLVKCIVQDIVVDVSFNQIGGLCTLCFLEKVDRLI 1948 VLE E++N +AEFVVK+VQLI AEVKLVKC+VQ+IVVD+SFNQ+GGLCTLCFLE+VDRLI Sbjct: 122 VLEREDHNTAAEFVVKDVQLIRAEVKLVKCLVQNIVVDISFNQLGGLCTLCFLEQVDRLI 181 Query: 1947 GKNHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFRLFHSTLDGPLAVLYK 1768 GKNHLFKRSI+LIKAWCYYESRILGAHHGLISTY LETLVLYIF LFHS LDGPLAVLYK Sbjct: 182 GKNHLFKRSILLIKAWCYYESRILGAHHGLISTYGLETLVLYIFHLFHSFLDGPLAVLYK 241 Query: 1767 FLDYFSKFDWETYCISLNGPVRISSLPVIVAETPEDGSGDLLLTDDFLRSCVEMFSVPSR 1588 FLDYFSKFDWE YCISLNGP+ ISSLP IV ETPE+G GDLLL++DFLR CVE FSVPSR Sbjct: 242 FLDYFSKFDWENYCISLNGPIPISSLPDIVVETPENGGGDLLLSNDFLRECVEKFSVPSR 301 Query: 1587 FVDQKSRGFQQKHLNIVDPLKDINNLGRSISKGNFYRIRSAFSYGAKKLGRILLQPENGI 1408 + SR F QKHLNIVDPL++ NNLGRS+SKGNFYRIRSAF+YGA+KLG+IL Q E + Sbjct: 302 
GFEANSRIFPQKHLNIVDPLRENNNLGRSVSKGNFYRIRSAFTYGARKLGQILSQSEETL 361 Query: 1407 ANELQIFFSSTMQRHGCGQKPVVQDPYPQSTYNGFIPVSSSLGTDS-------YKLE--N 1255 +EL FFS+T+ RHG GQ+P VQDP P S + G S GT+S Y+LE N Sbjct: 362 GDELHKFFSNTLDRHGNGQRPDVQDPAPLSRFRGLGATPSVSGTESCQEDQNFYELESSN 421 Query: 1254 SDNAAGH--------------------------------------------RFIGDANDL 1207 S G+ R GDA DL Sbjct: 422 SSTVTGNYRSSDNEGSLYKVYNGNMCERETDVGITFKEPQGSANASSISQIRLTGDAKDL 481 Query: 1206 ATPSIAGLNISSDSPK 1159 AT I GL IS+D+ K Sbjct: 482 ATSRIQGLVISNDAHK 497 Score = 117 bits (292), Expect = 7e-23 Identities = 65/177 (36%), Positives = 94/177 (53%), Gaps = 8/177 (4%) Frame = -2 Query: 984 YESYFNCLQYGRWCYEYTSGVPSLPMAPMPPLQYQNKTPWNGVIYPSQLRHNGFSHGHRN 805 Y++ + L YG+WCY+Y P++P Q+Q+K W+ V Q R N S + N Sbjct: 634 YDANIHSLSYGQWCYDYAFSASVPPISPPLVSQFQSKNSWDAVHKSVQFRRNTISPMNAN 693 Query: 804 GFIP---VYPMRHALVPGIAFGREEMPKPRGTGTYFPKMNQSPQGYRPSAVKGKNQAPLS 634 G +P YP+ ++ G FG EEMPKPRGTGTYFP N + R +G+N A Sbjct: 694 GGVPRQAYYPINPPVLHGSGFGMEEMPKPRGTGTYFPNPNTNYYKDRSLTARGRNPALAR 753 Query: 633 SPRDNGRNVIFMETNLLDRNSHERSQPQ-----VLVDQTVNLGSSGIHQSFSPRGNG 478 SPR+NGR + E N +R++ + +Q Q V ++ L SG ++ SP NG Sbjct: 754 SPRNNGRAITSPEPNSPERSNRDLAQMQSINQVVGKSRSSELRHSGSEKALSPNANG 810