BLASTX nr result

ID: Forsythia23_contig00033074 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00033074
         (759 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]   309   1e-81
ref|XP_007221295.1| hypothetical protein PRUPE_ppa024499mg, part...   302   1e-79
ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, part...   302   1e-79
ref|XP_007221749.1| hypothetical protein PRUPE_ppb022800mg, part...   301   3e-79
ref|XP_007045326.1| DNA/RNA polymerases superfamily protein [The...   294   4e-77
gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]     293   6e-77
ref|NP_001063540.1| Os09g0491900 [Oryza sativa Japonica Group] g...   293   6e-77
gb|AAT85159.1| unknown protein [Oryza sativa Japonica Group] gi|...   293   6e-77
ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobrom...   292   1e-76
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...   292   2e-76
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...   291   2e-76
ref|XP_010278719.1| PREDICTED: uncharacterized protein LOC104612...   291   3e-76
ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobrom...   291   3e-76
ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prun...   291   4e-76
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...   290   5e-76
ref|XP_012704376.1| PREDICTED: uncharacterized protein LOC105915...   290   8e-76
ref|XP_011010189.1| PREDICTED: uncharacterized protein LOC105115...   290   8e-76
ref|XP_010530494.1| PREDICTED: uncharacterized protein LOC104807...   289   1e-75
ref|XP_009145096.1| PREDICTED: uncharacterized protein LOC103868...   288   2e-75
gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]              288   2e-75

>emb|CAN77045.1| hypothetical protein VITISV_035256 [Vitis vinifera]
          Length = 665

 Score =  309 bits (791), Expect = 1e-81
 Identities = 142/242 (58%), Positives = 182/242 (75%)
 Frame = +3

Query: 27   GPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPR 206
            G Y +F + DGY FK + LC+   SLRE  I E  +RG A HFGRDKT+++  ++FYWP 
Sbjct: 325  GAYPNFXLHDGYLFKGTXLCLXDXSLREQVIWELHSRGXAXHFGRDKTIAMTEDHFYWPS 384

Query: 207  MDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIM 386
            + RDV K++  CR C  +KG  +NTGLY PLPVP  PW+++++DFV+GLP+T R  DSI 
Sbjct: 385  LKRDVTKNVSKCRTCQPSKGRKKNTGLYMPLPVPHEPWQELSIDFVLGLPKTFRRHDSIF 444

Query: 387  VAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WK 566
            V VDR+SKM HF+PC KTLDA HVA L+FKEIV+LHG+PKTI SD+D KFMSYFWR+ WK
Sbjct: 445  VMVDRFSKMVHFIPCSKTLDAVHVAKLFFKEIVRLHGLPKTIVSDQDAKFMSYFWRSLWK 504

Query: 567  KLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQT 746
             L TKL+FSS+ H +T+GQTE VNRSLG+L+R LVG ++  WD  L  AEFAYN S +++
Sbjct: 505  MLNTKLKFSSAFHPQTEGQTEVVNRSLGDLLRCLVGEHVSNWDQILPMAEFAYNSSVNRS 564

Query: 747  NG 752
             G
Sbjct: 565  TG 566


>ref|XP_007221295.1| hypothetical protein PRUPE_ppa024499mg, partial [Prunus persica]
            gi|462417929|gb|EMJ22494.1| hypothetical protein
            PRUPE_ppa024499mg, partial [Prunus persica]
          Length = 1364

 Score =  302 bits (774), Expect = 1e-79
 Identities = 140/238 (58%), Positives = 178/238 (74%)
 Frame = +3

Query: 42   FLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDV 221
            FL+ DGY F+ + LCI   SLR+  + E    GLAGHFG+DKT++LV + FYWP + RDV
Sbjct: 976  FLLRDGYLFRGTQLCIPRTSLRDFLVWELHAGGLAGHFGKDKTITLVADRFYWPSLKRDV 1035

Query: 222  KKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDR 401
               +  C  C LAK   QNTGLYTPLP+P  PWKD+++DFV+GLP+T R  DSI+V VDR
Sbjct: 1036 AHILAQCCTCQLAKARKQNTGLYTPLPIPHTPWKDLSMDFVLGLPKTARGHDSILVVVDR 1095

Query: 402  YSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTK 581
            +SKMAHF+PC K  DA++VA L+FKE+++LHG+P +I SDRD KF+SYFW+T WK  GT 
Sbjct: 1096 FSKMAHFLPCSKAADASYVAKLFFKEVIRLHGLPVSIVSDRDVKFVSYFWKTLWKLFGTS 1155

Query: 582  LQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQTNGK 755
            L+FSS+ H +TDGQTE VNRSLG+L+R LVG     WDL L  AEFAYN S+++T GK
Sbjct: 1156 LKFSSAFHPQTDGQTEVVNRSLGDLLRCLVGDKQGNWDLILPVAEFAYNNSANRTTGK 1213


>ref|XP_007207981.1| hypothetical protein PRUPE_ppa015715mg, partial [Prunus persica]
            gi|462403623|gb|EMJ09180.1| hypothetical protein
            PRUPE_ppa015715mg, partial [Prunus persica]
          Length = 1445

 Score =  302 bits (774), Expect = 1e-79
 Identities = 145/253 (57%), Positives = 182/253 (71%), Gaps = 3/253 (1%)
 Frame = +3

Query: 6    IWEECSKG---PYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLS 176
            I+ E S G    Y  F+  DG+ F+ + LCI   SLRE  + E    GLAGHFG+DKT++
Sbjct: 950  IFHEVSNGNRREYVDFITRDGFLFRGTQLCIPRTSLREFLVWELHGGGLAGHFGKDKTIA 1009

Query: 177  LV*ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLP 356
            LV + FYWP + RDV   I  CR C LAK   +NTGLYTPLP+P  PWKD+++DFV+GLP
Sbjct: 1010 LVEDRFYWPSLKRDVAHLISQCRTCQLAKARKRNTGLYTPLPIPHTPWKDLSMDFVLGLP 1069

Query: 357  RT*RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKF 536
            +T R  DSI V VDR+SKMAHF+PC K  DA++VA L+FKE+V+LHG+P +I SDRD KF
Sbjct: 1070 KTSRGYDSIFVIVDRFSKMAHFLPCAKNTDASYVAKLFFKEVVRLHGLPVSIVSDRDVKF 1129

Query: 537  MSYFWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAE 716
            +SYFW+T WK  GT L+FSS+ H +TDGQTE VNRSLG+L+R LVG     WDL L  AE
Sbjct: 1130 VSYFWKTLWKLFGTTLKFSSAFHPQTDGQTEVVNRSLGDLLRCLVGDKPGNWDLLLPVAE 1189

Query: 717  FAYNRSSSQTNGK 755
            FAYN S +++ GK
Sbjct: 1190 FAYNNSVNRSTGK 1202


>ref|XP_007221749.1| hypothetical protein PRUPE_ppb022800mg, partial [Prunus persica]
           gi|462418685|gb|EMJ22948.1| hypothetical protein
           PRUPE_ppb022800mg, partial [Prunus persica]
          Length = 722

 Score =  301 bits (771), Expect = 3e-79
 Identities = 140/238 (58%), Positives = 177/238 (74%)
 Frame = +3

Query: 42  FLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDV 221
           FL+ DGY F+ + LCI   SLR+  + E    GLAGHFG+DKT++LV + FYWP + RDV
Sbjct: 249 FLLRDGYLFRGTQLCIPRTSLRDFLVWELHAGGLAGHFGKDKTITLVADRFYWPSLKRDV 308

Query: 222 KKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDR 401
              +  CR C LAK   QNTGLYTPLP+P  PWKD+++DFV+GLP+T R  DSI+V VDR
Sbjct: 309 AHILAQCRTCQLAKARKQNTGLYTPLPIPHTPWKDLSMDFVLGLPKTARGHDSILVVVDR 368

Query: 402 YSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTK 581
           +SKMAHF+PC K  DA++VA L+FKE++ LHG+P +I SDRD KF+SYFW+T WK  GT 
Sbjct: 369 FSKMAHFLPCSKAADASYVAKLFFKEVIHLHGLPVSIVSDRDVKFVSYFWKTLWKLFGTS 428

Query: 582 LQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQTNGK 755
           L+FSS+ H +TDGQTE VNRSL +L+R LVG     WDL L  AEFAYN S+++T GK
Sbjct: 429 LKFSSAFHPQTDGQTEVVNRSLRDLLRCLVGDKQGNWDLILPVAEFAYNNSANRTTGK 486


>ref|XP_007045326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508709261|gb|EOY01158.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 786

 Score =  294 bits (752), Expect = 4e-77
 Identities = 133/231 (57%), Positives = 176/231 (76%)
 Frame = +3

Query: 54   DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDVKKHI 233
            + Y FK + LCI   SLRE  I E    GL GHFGRDKTL++V + +YWP+M RDV++ +
Sbjct: 452  EDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLV 511

Query: 234  QSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDRYSKM 413
            + C  C   KG AQNTGLY PLP P APW  +++DFV+GLP+T +  DSI V VDR+SKM
Sbjct: 512  KRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFVLGLPKTAKGFDSIFVVVDRFSKM 571

Query: 414  AHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTKLQFS 593
            AHF+PC +T +ATH+A+L+F+EIV+LHGIP +I SDRD KFM +FWRT W+K GT+L++S
Sbjct: 572  AHFIPCFRTSNATHIAELFFREIVRLHGIPTSIVSDRDVKFMGHFWRTLWRKFGTELKYS 631

Query: 594  SSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQT 746
            S+ H +TDGQTE VNRSLGN++R L+ +N + WDL + QAEFAYN S +++
Sbjct: 632  STCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRS 682


>gb|ABE60891.1| putative polyprotein [Oryza sativa Japonica Group]
          Length = 1713

 Score =  293 bits (751), Expect = 6e-77
 Identities = 133/251 (52%), Positives = 181/251 (72%), Gaps = 1/251 (0%)
 Frame = +3

Query: 9    WEECSKGP-YHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
            + +C+ G  +  + I DG+ F+ + LC+ HCS+R   + E    GL GHFG  KT  ++ 
Sbjct: 1163 YAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFGWRKTYDMLA 1222

Query: 186  ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
            ++FYWP+M RDV++ +Q C  CH AK      GLYTPLPVP APW+D+++DFV+GLPRT 
Sbjct: 1223 DHFYWPKMRRDVQRLVQRCVTCHKAKSKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTK 1282

Query: 366  RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
            R +DSI V VDR+SKMAHF+PC K+ DA+H+A L+F EIV+LHG+PKTI SDRD KF+SY
Sbjct: 1283 RGRDSIFVVVDRFSKMAHFIPCHKSDDASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSY 1342

Query: 546  FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
            FW+T W KLGT+L FS++ H +TDGQTE VNR+L  L+R+L+  N+++W+  L   EFAY
Sbjct: 1343 FWKTLWAKLGTRLLFSTTCHPQTDGQTEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAY 1402

Query: 726  NRSSSQTNGKC 758
            NR+   T   C
Sbjct: 1403 NRAVHSTTNMC 1413


>ref|NP_001063540.1| Os09g0491900 [Oryza sativa Japonica Group]
           gi|113631773|dbj|BAF25454.1| Os09g0491900 [Oryza sativa
           Japonica Group]
          Length = 681

 Score =  293 bits (751), Expect = 6e-77
 Identities = 133/251 (52%), Positives = 181/251 (72%), Gaps = 1/251 (0%)
 Frame = +3

Query: 9   WEECSKGP-YHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
           + +C+ G  +  + I DG+ F+ + LC+ HCS+R   + E    GL GHFG  KT  ++ 
Sbjct: 131 YAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFGWRKTYDMLA 190

Query: 186 ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
           ++FYWP+M RDV++ +Q C  CH AK      GLYTPLPVP APW+D+++DFV+GLPRT 
Sbjct: 191 DHFYWPKMRRDVQRLVQRCVTCHKAKSKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTK 250

Query: 366 RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
           R +DSI V VDR+SKMAHF+PC K+ DA+H+A L+F EIV+LHG+PKTI SDRD KF+SY
Sbjct: 251 RGRDSIFVVVDRFSKMAHFIPCHKSDDASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSY 310

Query: 546 FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
           FW+T W KLGT+L FS++ H +TDGQTE VNR+L  L+R+L+  N+++W+  L   EFAY
Sbjct: 311 FWKTLWAKLGTRLLFSTTCHPQTDGQTEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAY 370

Query: 726 NRSSSQTNGKC 758
           NR+   T   C
Sbjct: 371 NRAVHSTTNMC 381


>gb|AAT85159.1| unknown protein [Oryza sativa Japonica Group]
           gi|52353557|gb|AAU44123.1| putative polyprotein [Oryza
           sativa Japonica Group]
          Length = 681

 Score =  293 bits (751), Expect = 6e-77
 Identities = 133/251 (52%), Positives = 181/251 (72%), Gaps = 1/251 (0%)
 Frame = +3

Query: 9   WEECSKGP-YHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
           + +C+ G  +  + I DG+ F+ + LC+ HCS+R   + E    GL GHFG  KT  ++ 
Sbjct: 131 YAKCTAGKGWEKYHIHDGFLFRANKLCVPHCSVRLLLLQETHAGGLMGHFGWRKTYDMLA 190

Query: 186 ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
           ++FYWP+M RDV++ +Q C  CH AK      GLYTPLPVP APW+D+++DFV+GLPRT 
Sbjct: 191 DHFYWPKMRRDVQRLVQRCVTCHKAKSKLNPHGLYTPLPVPSAPWEDISMDFVLGLPRTK 250

Query: 366 RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
           R +DSI V VDR+SKMAHF+PC K+ DA+H+A L+F EIV+LHG+PKTI SDRD KF+SY
Sbjct: 251 RGRDSIFVVVDRFSKMAHFIPCHKSDDASHIASLFFSEIVRLHGMPKTIVSDRDTKFLSY 310

Query: 546 FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
           FW+T W KLGT+L FS++ H +TDGQTE VNR+L  L+R+L+  N+++W+  L   EFAY
Sbjct: 311 FWKTLWAKLGTRLLFSTTCHPQTDGQTEVVNRTLSMLLRALIKKNLKEWEECLPHVEFAY 370

Query: 726 NRSSSQTNGKC 758
           NR+   T   C
Sbjct: 371 NRAVHSTTNMC 381


>ref|XP_007019612.1| Uncharacterized protein TCM_035725 [Theobroma cacao]
           gi|508724940|gb|EOY16837.1| Uncharacterized protein
           TCM_035725 [Theobroma cacao]
          Length = 499

 Score =  292 bits (748), Expect = 1e-76
 Identities = 133/231 (57%), Positives = 175/231 (75%)
 Frame = +3

Query: 54  DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDVKKHI 233
           + Y FK + LCI   SLRE  I E    GL GHFGRDKTL++V + +YWP+M RDV++ +
Sbjct: 47  EDYLFKGNQLCIPKGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRRDVERLV 106

Query: 234 QSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDRYSKM 413
           + C  C   KG AQNTGLY PLP P APW  +++DFV+ LP+T +  DSI V VDR+SKM
Sbjct: 107 KRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFVLELPKTAKGFDSIFVVVDRFSKM 166

Query: 414 AHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTKLQFS 593
           AHF+PC +T DATH+A+L+F+EIV+LHGIP +I SDRD KFM +FWRT W+K GT+L++S
Sbjct: 167 AHFIPCFRTSDATHIAELFFREIVRLHGIPTSIVSDRDVKFMGHFWRTLWRKFGTELKYS 226

Query: 594 SSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQT 746
           S+ H +TDGQTE VNRSLGN++R L+ +N + WDL + QAEFAYN S +++
Sbjct: 227 STCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRS 277


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
            gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
            superfamily protein [Theobroma cacao]
          Length = 1452

 Score =  292 bits (747), Expect = 2e-76
 Identities = 134/231 (58%), Positives = 173/231 (74%)
 Frame = +3

Query: 54   DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDVKKHI 233
            + Y FK + LCI   SLRE  I E    GL GHFGRDKTL +V + +YWP+M RDV++ +
Sbjct: 1000 EDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLVMVADRYYWPKMRRDVERLV 1059

Query: 234  QSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDRYSKM 413
            + C  C   KG AQNTGLY PLP P APW  +++DFV+GLP+T +  DSI V VDR+SKM
Sbjct: 1060 KRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFVLGLPKTTKGFDSIFVVVDRFSKM 1119

Query: 414  AHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTKLQFS 593
            AHF+PC +T DATH+A+L+F+EIV LHGIP +I SDR  KFM YFWRT W+K GT+L++S
Sbjct: 1120 AHFIPCFRTSDATHIAELFFREIVILHGIPTSIVSDRHVKFMGYFWRTLWRKFGTELKYS 1179

Query: 594  SSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQT 746
            S+ H +TDGQTE VNRSLGN++R L+ +N + WDL + QAEFAYN S +++
Sbjct: 1180 STCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRS 1230


>gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score =  291 bits (746), Expect = 2e-76
 Identities = 138/250 (55%), Positives = 177/250 (70%)
 Frame = +3

Query: 6    IWEECSKGPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
            I+ EC  GP+  F + D + FK + LC+ +CSLRE F+ EA   GL GHFG  KTL ++ 
Sbjct: 1113 IFAECKLGPFEKFNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILS 1172

Query: 186  ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
            E+FYWP M +DV+K    C  C  AK      GLYTPLPV  +PW D+++DF++GLPRT 
Sbjct: 1173 EHFYWPSMRKDVEKVCSYCLECKQAKSRTLPHGLYTPLPVSNSPWIDISMDFILGLPRTK 1232

Query: 366  RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
              KDSI V VDR+SKMA F+PC KT DA+HVADL+ KE+VKLHGIP+TI SDRD KF+S+
Sbjct: 1233 YGKDSIFVVVDRFSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSH 1292

Query: 546  FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
            FWR  W KLGTKL FS+S H +TDGQTE VNR+LGN++R+++   +  W+  L   EFAY
Sbjct: 1293 FWRILWGKLGTKLLFSTSCHPQTDGQTEVVNRTLGNMLRAILKGKLTSWEDYLPIVEFAY 1352

Query: 726  NRSSSQTNGK 755
            NR+   + GK
Sbjct: 1353 NRTFHSSTGK 1362


>ref|XP_010278719.1| PREDICTED: uncharacterized protein LOC104612828 [Nelumbo nucifera]
          Length = 925

 Score =  291 bits (745), Expect = 3e-76
 Identities = 136/250 (54%), Positives = 175/250 (70%), Gaps = 1/250 (0%)
 Frame = +3

Query: 9    WEEC-SKGPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
            WE+C ++ P   F I DG+  K   LCI   SLRE  I +    GLAGH GRDKT+  V 
Sbjct: 591  WEKCMNRQPVGDFYIHDGFLMKGEQLCIPCTSLREKIIKDLHGGGLAGHLGRDKTIEAVK 650

Query: 186  ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
              +YWP++ RDV   +  C +C  AKG AQNTGLY PLP+P A W+D+ +DFV+GLP+T 
Sbjct: 651  GRYYWPKLRRDVTTIVSRCYICQTAKGQAQNTGLYMPLPIPTAIWEDLPMDFVLGLPKTP 710

Query: 366  RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
            RN DS+ + VDR+SKMAHF+PC KT DAT  A L+FKEIV+LHG+PKTITSDRD +F+S+
Sbjct: 711  RNMDSVFIVVDRFSKMAHFLPCKKTADATATAKLFFKEIVRLHGVPKTITSDRDTRFLSH 770

Query: 546  FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
            FW T W+   + L FSS+ H +TDG TE VNR+LGNL+RS+     +QWD  +AQAEFAY
Sbjct: 771  FWMTLWRLFDSSLNFSSTAHPQTDGLTEVVNRTLGNLIRSISRERPKQWDFAIAQAEFAY 830

Query: 726  NRSSSQTNGK 755
            N +   + G+
Sbjct: 831  NNAVHSSTGR 840


>ref|XP_007019474.1| Uncharacterized protein TCM_035549 [Theobroma cacao]
            gi|508724802|gb|EOY16699.1| Uncharacterized protein
            TCM_035549 [Theobroma cacao]
          Length = 1392

 Score =  291 bits (745), Expect = 3e-76
 Identities = 132/231 (57%), Positives = 175/231 (75%)
 Frame = +3

Query: 54   DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDVKKHI 233
            + Y FK + LCI   SLRE  I E    GL GHFGRDKTL++V + +YWP+M +DV++ +
Sbjct: 940  EDYLFKGNQLCIPEGSLREQIIRELHGNGLGGHFGRDKTLAMVADRYYWPKMRQDVERLV 999

Query: 234  QSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDRYSKM 413
            + C  C   KG AQNTGLY PLP P APW  +++DFV+GLP+T +  DSI V VDR+SKM
Sbjct: 1000 KRCPTCLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFVLGLPKTAKRFDSIFVVVDRFSKM 1059

Query: 414  AHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTKLQFS 593
            AHF+PC +T DATH+A+L+F+EIV+LH IP +I SDRD KFM +FWRT W+K GT+L++S
Sbjct: 1060 AHFIPCFRTSDATHIAELFFREIVRLHRIPTSIVSDRDVKFMGHFWRTLWRKFGTELKYS 1119

Query: 594  SSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQT 746
            S+ H +TDGQTE VNRSLGN++R L+ +N + WDL + QAEFAYN S +++
Sbjct: 1120 STCHPQTDGQTEVVNRSLGNMLRCLIQNNPKTWDLVIPQAEFAYNNSVNRS 1170


>ref|XP_007206823.1| hypothetical protein PRUPE_ppa025991mg [Prunus persica]
            gi|462402465|gb|EMJ08022.1| hypothetical protein
            PRUPE_ppa025991mg [Prunus persica]
          Length = 1274

 Score =  291 bits (744), Expect = 4e-76
 Identities = 140/253 (55%), Positives = 180/253 (71%), Gaps = 3/253 (1%)
 Frame = +3

Query: 6    IWEECSKG---PYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLS 176
            I+ E S G    Y  F+  DG+ F+R+ LCI   SL E  + E    GLAGHFG+DKT++
Sbjct: 779  IFHEVSNGNRREYVDFITRDGFLFRRTQLCIPRTSLLEFLVWELHGGGLAGHFGKDKTIA 838

Query: 177  LV*ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLP 356
            LV ++FYWP + RDV   I  CR C LAK   +NTG+YTPLP+P APWKD+++DFV+GLP
Sbjct: 839  LVEDHFYWPSLKRDVAHLISQCRTCQLAKARKRNTGVYTPLPIPHAPWKDLSMDFVLGLP 898

Query: 357  RT*RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKF 536
            +T R  DSI V VD +SKMAHF+PC K  DA+++A L+FKE+V+LHG+  +I SDRD KF
Sbjct: 899  KTSRGYDSIFVIVDCFSKMAHFLPCAKNTDASYMAKLFFKEVVRLHGLLVSIVSDRDFKF 958

Query: 537  MSYFWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAE 716
            +SYFW+T WK  GT L+FSS+ H +TDGQTE VNRSLG+L+  LVG     WDL L  AE
Sbjct: 959  VSYFWKTLWKLFGTTLKFSSAFHPQTDGQTEVVNRSLGDLLHCLVGDKPGNWDLLLPVAE 1018

Query: 717  FAYNRSSSQTNGK 755
            F YN S +++ GK
Sbjct: 1019 FTYNNSVNRSTGK 1031


>gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1588

 Score =  290 bits (743), Expect = 5e-76
 Identities = 138/250 (55%), Positives = 176/250 (70%)
 Frame = +3

Query: 6    IWEECSKGPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
            I+ EC  GP+  F + D + FK + LC+ +CSLRE F+ EA   GL GHFG  KTL ++ 
Sbjct: 1113 IFAECKLGPFEKFNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILS 1172

Query: 186  ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
            E+FYWP M +DV+K    C  C  AK      GLYTPLPV   PW D+++DF++GLPRT 
Sbjct: 1173 EHFYWPSMRKDVEKVCSYCLECKQAKSRTLPHGLYTPLPVSNFPWIDISMDFILGLPRTK 1232

Query: 366  RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
              KDSI V VDR+SKMA F+PC KT DA+HVADL+ KE+VKLHGIP+TI SDRD KF+S+
Sbjct: 1233 YGKDSIFVVVDRFSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSH 1292

Query: 546  FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
            FWR  W KLGTKL FS+S H +TDGQTE VNR+LGN++R+++   +  W+  L   EFAY
Sbjct: 1293 FWRILWGKLGTKLLFSTSCHPQTDGQTEVVNRTLGNMLRAILKGKLTSWEDYLPIVEFAY 1352

Query: 726  NRSSSQTNGK 755
            NR+   + GK
Sbjct: 1353 NRTFHSSTGK 1362


>ref|XP_012704376.1| PREDICTED: uncharacterized protein LOC105915107 [Setaria italica]
          Length = 1399

 Score =  290 bits (741), Expect = 8e-76
 Identities = 128/239 (53%), Positives = 174/239 (72%)
 Frame = +3

Query: 42   FLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDV 221
            F + DG+ F+ + LCI  CS+R   + EA   GLAGHFG  KTL ++ ++F+WP M RDV
Sbjct: 1038 FYLHDGFLFRTNKLCIPACSIRHVLLQEAHAGGLAGHFGMKKTLDMLADHFFWPHMRRDV 1097

Query: 222  KKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDR 401
            ++H++ C  C  AK      GLY PLP+P  PW+D+++DF++GLPR+ R  DSI V VDR
Sbjct: 1098 QRHVERCITCLKAKSRLNPHGLYIPLPIPNVPWEDISMDFILGLPRSQRGSDSIFVVVDR 1157

Query: 402  YSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTK 581
            +SKMAHF+PC KT DA+H+ADL+F+EIV+LHG+PKTI SDRD KF+SYFW+T W KLGTK
Sbjct: 1158 FSKMAHFIPCHKTDDASHIADLFFREIVRLHGVPKTIVSDRDAKFLSYFWKTLWGKLGTK 1217

Query: 582  LQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRSSSQTNGKC 758
            L FS++ H +TDGQTE VNR+L  ++R+++  N++ W+  L   EFAYNR+   T   C
Sbjct: 1218 LLFSTTCHPQTDGQTEVVNRTLSTMLRAVLKKNLKMWEDCLPHVEFAYNRAVHSTTNFC 1276


>ref|XP_011010189.1| PREDICTED: uncharacterized protein LOC105115097, partial [Populus
            euphratica]
          Length = 1282

 Score =  290 bits (741), Expect = 8e-76
 Identities = 135/227 (59%), Positives = 169/227 (74%)
 Frame = +3

Query: 54   DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*ENFYWPRMDRDVKKHI 233
            +GY FK   +CI   SLRE  + EA   GL+GHFG  KT  L+ E+F+WP M RDV K I
Sbjct: 988  EGYLFKMGRMCIPSGSLRELLVREAHGGGLSGHFGEKKTYELLKEHFFWPSMLRDVHKVI 1047

Query: 234  QSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*RNKDSIMVAVDRYSKM 413
            + C +C  AKG     GLY PLP+P  PW DV++DFV+GL RT R KDSIMV VDR+SKM
Sbjct: 1048 ERCAICKKAKGKENAYGLYMPLPIPEQPWMDVSMDFVLGLSRTQRGKDSIMVVVDRFSKM 1107

Query: 414  AHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYFWRT*WKKLGTKLQFS 593
            +HF+PC KT DA HVADL+F+EIV+LHGIPK+I SDRD KF+SYFW+T W+KLGTKL FS
Sbjct: 1108 SHFIPCNKTDDAVHVADLFFQEIVRLHGIPKSIVSDRDTKFLSYFWKTLWRKLGTKLLFS 1167

Query: 594  SSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYNRS 734
            ++ H +TDGQTE VNR+L +L+R+++  N++ WD  L   EFAYNRS
Sbjct: 1168 TACHPQTDGQTEVVNRTLSSLLRAVIHKNLKSWDTCLPIVEFAYNRS 1214


>ref|XP_010530494.1| PREDICTED: uncharacterized protein LOC104807077 [Tarenaya
            hassleriana]
          Length = 1689

 Score =  289 bits (739), Expect = 1e-75
 Identities = 129/244 (52%), Positives = 178/244 (72%)
 Frame = +3

Query: 6    IWEECSKGPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
            I++EC+KG +  F + D Y F+   LCI  CSLR+  + EA    L GHFG +KTL +V 
Sbjct: 1194 IYKECTKGAHRLFYMEDDYLFRERRLCIPKCSLRDLILQEAHGGALMGHFGVEKTLVMVK 1253

Query: 186  ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
            E+F+W  + RDV++ +  C +CH AK      GLY PLP+P  PW D+++DFV+GLP+  
Sbjct: 1254 EHFFWSHLKRDVERFVARCIICHQAKSKTHPHGLYLPLPIPFCPWTDLSMDFVLGLPKI- 1312

Query: 366  RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
            +NKDSI V VDR+SKMAHF+PC K  DA+H+A L+FKE+V+LHG+P++I SDRD KF+SY
Sbjct: 1313 QNKDSIFVVVDRFSKMAHFIPCAKANDASHIAGLFFKEVVRLHGLPRSIVSDRDSKFLSY 1372

Query: 546  FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
            FW+T W+KLGTKL FS++ H +TDGQTE VNR+L  L+R+ +G+N++ W   L   EFAY
Sbjct: 1373 FWKTLWRKLGTKLVFSTTCHPQTDGQTEVVNRTLAALLRATIGNNLKNWLECLPHVEFAY 1432

Query: 726  NRSS 737
            NR++
Sbjct: 1433 NRAT 1436


>ref|XP_009145096.1| PREDICTED: uncharacterized protein LOC103868780 [Brassica rapa]
          Length = 2690

 Score =  288 bits (738), Expect = 2e-75
 Identities = 131/242 (54%), Positives = 172/242 (71%)
 Frame = +3

Query: 9    WEECSKGPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV*E 188
            +  C K    H+   DG+ F  + LC+ +CSLR+ F+ E+    L GHFG  KTL  + +
Sbjct: 2232 YNSCEKFAVGHYFRHDGFLFYDNRLCVPNCSLRDLFVRESHGGSLMGHFGIAKTLKTLQD 2291

Query: 189  NFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT*R 368
            +F+WPRM RDV+K  + C  C  AK   Q+ GLYTPLP+P  PW D+++DF+VGLPRT  
Sbjct: 2292 HFFWPRMKRDVEKLCERCATCKQAKSKVQSHGLYTPLPIPYHPWNDISMDFIVGLPRTRT 2351

Query: 369  NKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSYF 548
             KDSI V VDR+SKMAHF+ C KT DA HVA+L+FKEIV+LHG+P+TI SDRD KF+SYF
Sbjct: 2352 GKDSIFVVVDRFSKMAHFIACHKTDDALHVANLFFKEIVRLHGMPRTIVSDRDTKFLSYF 2411

Query: 549  WRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAYN 728
            W+T W KLGTKL FS++ H +TDGQTE VNR+LG L+R+ +  N++ W+  L   EFAYN
Sbjct: 2412 WKTLWSKLGTKLLFSTTCHPQTDGQTEVVNRTLGTLLRAFIKKNLKSWEDYLPHCEFAYN 2471

Query: 729  RS 734
             +
Sbjct: 2472 HA 2473


>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score =  288 bits (738), Expect = 2e-75
 Identities = 133/243 (54%), Positives = 174/243 (71%)
 Frame = +3

Query: 6    IWEECSKGPYHHFLI*DGYFFKRSHLCISHCSLREAFINEAQNRGLAGHFGRDKTLSLV* 185
            I+  C K  +  +   DG+ F  + LCI + SLRE FI EA   GL GHFG  KT+ ++ 
Sbjct: 1351 IYSSCEKFAFGKYYRHDGFLFYDNRLCIPNSSLRELFIREAHGGGLMGHFGVSKTIKVMQ 1410

Query: 186  ENFYWPRMDRDVKKHIQSCRVCHLAKGHAQNTGLYTPLPVPMAPWKDVNLDFVVGLPRT* 365
            ++F+WP M RDV++  + C  C  AK  +Q  GLYTPLP+P  PW D+++DFVVGLPRT 
Sbjct: 1411 DHFHWPHMKRDVERICERCPTCKQAKAKSQPHGLYTPLPIPSHPWNDISMDFVVGLPRTR 1470

Query: 366  RNKDSIMVAVDRYSKMAHFVPCLKTLDATHVADLYFKEIVKLHGIPKTITSDRDPKFMSY 545
              KDSI V VDR+SKMAHF+PC KT DA H+A+L+F+E+V+LHG+PKTI SDRD KF+SY
Sbjct: 1471 TGKDSIFVVVDRFSKMAHFIPCHKTDDAIHIANLFFREVVRLHGMPKTIVSDRDTKFLSY 1530

Query: 546  FWRT*WKKLGTKLQFSSSHHYRTDGQTETVNRSLGNLMRSLVGSNIRQWDLTLAQAEFAY 725
            FW+T W KLGTKL FS++ H +TDGQTE VNR+L  L+R+L+  N++ W+  L   EFAY
Sbjct: 1531 FWKTLWSKLGTKLLFSTTCHPQTDGQTEVVNRTLSTLLRALIKKNLKTWEDCLPHVEFAY 1590

Query: 726  NRS 734
            N S
Sbjct: 1591 NHS 1593


Top