BLASTX nr result

ID: Sinomenium21_contig00017112 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00017112
         (972 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom...   213   1e-52
ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac...   212   2e-52
ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom...   199   2e-48
ref|XP_007048014.1| Gag-pol polyprotein-like protein [Theobroma ...   182   1e-43
ref|XP_006493977.1| PREDICTED: serine/threonine-protein kinase P...   140   6e-31
emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera]   137   5e-30
ref|XP_006603400.1| PREDICTED: uncharacterized protein LOC102659...   124   6e-26
ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The...   123   1e-25
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                 123   1e-25
ref|XP_006607055.1| PREDICTED: uncharacterized protein LOC100778...   122   2e-25
ref|XP_006607002.1| PREDICTED: uncharacterized protein LOC100788...   122   2e-25
ref|XP_006596896.1| PREDICTED: uncharacterized protein LOC102664...   122   2e-25
gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]              119   1e-24
ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The...   118   3e-24
gb|AAD17351.1| contains similarity to retrovirus-related polypro...   118   4e-24
emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera]   118   4e-24
gb|ADP20181.1| mutant gag-pol polyprotein [Pisum sativum]             117   7e-24
ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [The...   116   2e-23
ref|XP_006575889.1| PREDICTED: uncharacterized protein LOC102669...   115   3e-23
ref|XP_006598549.1| PREDICTED: uncharacterized protein LOC100803...   114   5e-23

>ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao]
           gi|508718388|gb|EOY10285.1| Uncharacterized protein
           TCM_025656 [Theobroma cacao]
          Length = 505

 Score =  213 bits (541), Expect = 1e-52
 Identities = 113/273 (41%), Positives = 156/273 (57%)
 Frame = +2

Query: 23  DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202
           +IF+K HN     ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN EI++VVQLQ Y
Sbjct: 17  EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTVARYLGGLNVEIADVVQLQPY 76

Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382
           W+LNDV +LALKVEKQ+    S  S    +E  +                        + 
Sbjct: 77  WNLNDVIRLALKVEKQRSRKRSMSSSR-QQESISNDESQSSVTIPPPKVNSSKTASSNDK 135

Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562
            T              ++CFK QGFGHIA +CPNR+I++LV                   
Sbjct: 136 ETTFTRASNVN-----KKCFKCQGFGHIAFDCPNRRIISLVEEEDYANWEKLEPVYDEYD 190

Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742
                    D   +LI++R+L+     ++E W+R N+F+T+CTS GK+C VIIDSG+ EN
Sbjct: 191 DEEIEEVSADHGEALIVRRNLNTAMMTKDESWLRHNIFYTRCTSQGKVCNVIIDSGSCEN 250

Query: 743 MVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           +++  M++KL LQT  HPHPYKL WL+K NE+K
Sbjct: 251 VIANYMVEKLKLQTEVHPHPYKLQWLRKGNEVK 283


>ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao]
           gi|508704828|gb|EOX96724.1| Gag-pol polyprotein,
           putative [Theobroma cacao]
          Length = 794

 Score =  212 bits (540), Expect = 2e-52
 Identities = 113/275 (41%), Positives = 157/275 (57%), Gaps = 2/275 (0%)
 Frame = +2

Query: 23  DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202
           +IF+K HN     ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN  I++VVQLQ Y
Sbjct: 168 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTVARYLGGLNVGIADVVQLQPY 227

Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382
           W+LNDV +LALKVEKQQ    S  S    ++                      + +  + 
Sbjct: 228 WNLNDVIRLALKVEKQQLRKSSMSSS--RQKDSTSNRGRQSSATIPPPKVNSSKTINHKE 285

Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562
            T              ++CFK QGFGHIAS+CPNR+I++L+                   
Sbjct: 286 TTSTRAPNVN------KKCFKCQGFGHIASDCPNRRIISLIEEEVMEEPSLEEVDDELEI 339

Query: 563 XXXXXV--TYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTF 736
                +     D   +L+++R+L+     E+E W+R N+FHT+CTS GK+C VIIDSG+ 
Sbjct: 340 FNNEEIEEVSADHGEALVVRRNLNTAMLTEDESWLRHNIFHTRCTSQGKVCNVIIDSGSC 399

Query: 737 ENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           EN+++  M+ KL LQT  HPHPYKL WL+K NE+K
Sbjct: 400 ENVIANYMVKKLKLQTEVHPHPYKLQWLRKGNEVK 434


>ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao]
           gi|508726763|gb|EOY18660.1| Uncharacterized protein
           TCM_043155 [Theobroma cacao]
          Length = 625

 Score =  199 bits (505), Expect = 2e-48
 Identities = 109/273 (39%), Positives = 151/273 (55%)
 Frame = +2

Query: 23  DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202
           +IF+K HN     ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN EI++VVQLQ Y
Sbjct: 137 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTLARYLGGLNVEIADVVQLQPY 196

Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382
           W+LNDV +L LKVEKQQ    S  S    +E  +                        + 
Sbjct: 197 WNLNDVIRLTLKVEKQQSRKRSMSSSR-QQESISNDESQSSVTIPPPKVNSSKTASSNDK 255

Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562
            T              ++CFK Q FGHIAS+CP+R+I++LV                   
Sbjct: 256 ETTFTRASNVN-----KKCFKCQRFGHIASDCPSRRIISLVEEEDYVNWEKLEPVYDEYD 310

Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742
                    D   + I++R+L+     ++E  +R N+F+T+CTS G +C VIIDSG+ EN
Sbjct: 311 DEEIEEVSADHGEAFIVRRNLNTALMTKDESCLRHNIFYTRCTSQGNVCNVIIDSGSCEN 370

Query: 743 MVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           +V+  M++KL L T  HPHPYKL WL+K NE+K
Sbjct: 371 VVANYMVEKLKLPTEVHPHPYKLQWLRKGNEVK 403


>ref|XP_007048014.1| Gag-pol polyprotein-like protein [Theobroma cacao]
           gi|508700275|gb|EOX92171.1| Gag-pol polyprotein-like
           protein [Theobroma cacao]
          Length = 399

 Score =  182 bits (463), Expect = 1e-43
 Identities = 102/263 (38%), Positives = 141/263 (53%)
 Frame = +2

Query: 23  DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202
           +IF+K HN     ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN EI+++VQLQ Y
Sbjct: 168 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVQEPEEQTVARYLGGLNVEIADIVQLQPY 227

Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382
           W+LNDV +LALK            SK  S                           K+  
Sbjct: 228 WNLNDVIRLALKSSVTIPPPKVNSSKTASSND------------------------KKTT 263

Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562
            T              ++CFK QGFGHIAS+C NR+I++LV                   
Sbjct: 264 FTRASNVN--------KKCFKCQGFGHIASDCSNRRIISLVEEEDYANWEKLKPVYDEYD 315

Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742
                    D   +LI++R+L+     ++E W R N+F+T+CTS GK+C VIIDSG++EN
Sbjct: 316 DEEIEEVSADHGEALIVRRNLNTAMMTKDESWFRHNIFYTRCTSQGKVCNVIIDSGSYEN 375

Query: 743 MVSTCMMDKLGLQTVQHPHPYKL 811
           +++  M++KL L T  HPHPYKL
Sbjct: 376 VIANYMVEKLKLPTEVHPHPYKL 398


>ref|XP_006493977.1| PREDICTED: serine/threonine-protein kinase PBS1-like [Citrus
            sinensis]
          Length = 611

 Score =  140 bits (354), Expect = 6e-31
 Identities = 81/201 (40%), Positives = 107/201 (53%)
 Frame = +2

Query: 59   ELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALK 238
            +L + + TAEF+ LM+K D+ EPEEQTIA YLGGL  EI N+VQL+ YW+  DVCKL++K
Sbjct: 415  DLFIEESTAEFEQLMMKCDIVEPEEQTIAHYLGGLRIEIGNIVQLRPYWTFQDVCKLSIK 474

Query: 239  VEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEGATXXXXXXXXXX 418
            VE+QQKE  +  S+  +R G                      P K E             
Sbjct: 475  VERQQKEARNNSSQSYTRPGSFSRSHPISVKRNSAIKSSPEVPQKDE-VGGNLKQPASTS 533

Query: 419  XXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQS 598
                RRCFK QG+GHIAS+CPNR+IV LV                        VTY D+ 
Sbjct: 534  NTNSRRCFKCQGYGHIASDCPNRRIVTLV---EEESDGSDEADTKNPGDEEKKVTYADEG 590

Query: 599  TSLIIQRSLSVVRAEEEEDWV 661
             SLI++++LS    E++EDW+
Sbjct: 591  ESLILRKTLSSNHVEDQEDWL 611


>emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera]
          Length = 1173

 Score =  137 bits (346), Expect = 5e-30
 Identities = 82/264 (31%), Positives = 131/264 (49%), Gaps = 4/264 (1%)
 Frame = +2

Query: 65  SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244
           +V DY  E +  M++ ++ E  E T+AR+L GLN +I+NVV+LQ Y  L ++  +A+KVE
Sbjct: 143 NVDDYHKEMEIAMIRANVEEDRETTMARFLNGLNRDIANVVELQHYVELENMVHMAIKVE 202

Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQ--EGATXXXXXXXXXX 418
           +Q K    +G+      G +                   +P K+  E             
Sbjct: 203 RQLKR---KGTLSFQNPGSSASWRPNGRKDEGVVFKSKTKPPKRRDEAPNVNKGKNESQT 259

Query: 419 XXXXRRCFKYQGFGHIASECPNRK--IVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGD 592
                +CF Y G GHIAS+CPN++  I  +                         V Y  
Sbjct: 260 RNHDIKCFHYLGVGHIASQCPNKRTMIAHVDGEVETESEEDDDQMPSLEDSCDDNVEYPV 319

Query: 593 QSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKL 772
           +  SL+ +R+LS    +++ +  R+N+FHT+C  + K+C +IID G+  N+ ST +++KL
Sbjct: 320 EGESLVARRALSAQVKKDDMEQQRENIFHTRCHINNKVCSMIIDGGSCANVASTTLVEKL 379

Query: 773 GLQTVQHPHPYKLSWLQKDNEIKD 844
            L T++HP PYKL WL    E+K+
Sbjct: 380 NLPTLKHPRPYKLQWLNDCGEVKE 403


>ref|XP_006603400.1| PREDICTED: uncharacterized protein LOC102659640 [Glycine max]
          Length = 594

 Score =  124 bits (311), Expect = 6e-26
 Identities = 81/271 (29%), Positives = 123/271 (45%), Gaps = 11/271 (4%)
 Frame = +2

Query: 62  LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241
           L+V +Y  E +  +++ ++ E  E T+AR+L GLN EI +VV+LQ Y  L+D+   AL+V
Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVVLDDLLHRALRV 242

Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388
           E+Q K       K  +R                       RP               G+ 
Sbjct: 243 EQQIKR------KSATRRNSPNTYNQNWANRSKKEGGNSFRPAATSPYGKSATPSVGGSK 296

Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568
                          +CFK  G GHIASECP R+ + +                      
Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEY 356

Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748
                 GD    +++ R L   + +  +D  R+N+FHT+C  +GK+C +I+D G+  N+ 
Sbjct: 357 EEEAMQGD----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVA 412

Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           S+ ++ KL L+T  HP PYKL WL +D EIK
Sbjct: 413 SSTLVTKLNLETKPHPRPYKLQWLSEDEEIK 443


>ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508702148|gb|EOX94044.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 546

 Score =  123 bits (308), Expect = 1e-25
 Identities = 77/292 (26%), Positives = 128/292 (43%), Gaps = 18/292 (6%)
 Frame = +2

Query: 17  TTDIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQ 196
           T +++ K H    N ++V +YT+EF+NL ++  L E  EQ  +RYL GLN+ I + + + 
Sbjct: 103 TMELYEKFHCLKQNNMTVEEYTSEFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVV 162

Query: 197 LYWSLNDVCKLALKVEKQQKEFCSRGSKYGSR------------------EGFAXXXXXX 322
             +++ D  + AL  EK+   + +R   YG+                   +G A      
Sbjct: 163 RLYNIEDARQYALSAEKRVLRYGARKPLYGTHWQNNSEARRGYPTSQQNYQGAATINKTN 222

Query: 323 XXXXXXXXXXXXXRPLKQEGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVAL 502
                          +   G                 RCF     GHI+  CP R++   
Sbjct: 223 KGATNVEKNDKGKSIMPYGGQNSSGSSTNKGGSNSHIRCFTCGEKGHISFACPQRRV--- 279

Query: 503 VXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHT 682
                                      Y  Q  SL+++R ++    EE EDW R+++F T
Sbjct: 280 --NLAELGEELEPVYDEYEEEVEEIDVYPAQGESLVVRRVMTTTVNEEAEDWKRRSIFRT 337

Query: 683 KCTSHGKICVVIIDSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838
           +    GK+C ++ID G+ EN++S   ++KL L T +HP+PYK+ WL+K +E+
Sbjct: 338 RVVCEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEV 389


>gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]
          Length = 1518

 Score =  123 bits (308), Expect = 1e-25
 Identities = 84/290 (28%), Positives = 128/290 (44%), Gaps = 15/290 (5%)
 Frame = +2

Query: 17   TTDIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQ 196
            T D+F+K+ N    E +V  Y  EF+ L L+ ++ E  EQ IAR+L GL+  I+  V++Q
Sbjct: 178  TQDLFIKLSNLKQKEKTVEAYLREFEQLTLQCEINEKSEQRIARFLEGLDKNIAAEVRMQ 237

Query: 197  LYWSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQ 376
              WS +DV  L+L+VEK            G  +  A                   +   Q
Sbjct: 238  PLWSYDDVVNLSLRVEKM-----------GKTKPVATRPKPVFRPYSSVKINDPPKTTPQ 286

Query: 377  ----EGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVA-----------LVXX 511
                +G                 +CF+ QGFGH   +CP+ + +            LV  
Sbjct: 287  STVDKGKAPMNPKINPPLSRDKIKCFQCQGFGHFRKDCPSARTLTAIEVAEWEREGLVEY 346

Query: 512  XXXXXXXXXXXXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCT 691
                                  V + D   SL + R +   +A  E D  R  +F ++CT
Sbjct: 347  EEDEALVLEEVESEKETSPDQIVAHPDTGHSLFLWRVMHSQQAPLEADQ-RSMIFRSRCT 405

Query: 692  SHGKICVVIIDSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
              G++C +II+ G+  N+ ST M+ KLGL T +HP+PYKL WL KD+ ++
Sbjct: 406  VQGRVCNLIINGGSCTNVASTTMVSKLGLPTQEHPNPYKLRWLSKDSGVR 455


>ref|XP_006607055.1| PREDICTED: uncharacterized protein LOC100778333, partial [Glycine
           max]
          Length = 560

 Score =  122 bits (307), Expect = 2e-25
 Identities = 78/264 (29%), Positives = 124/264 (46%), Gaps = 4/264 (1%)
 Frame = +2

Query: 62  LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241
           L+V +Y  E +  +++ ++ E  E T+AR+L GLN  I +VV+LQ Y  L+D+   AL+V
Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPAIRDVVELQEYVVLDDLLHRALRV 242

Query: 242 EKQ--QKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE--GATXXXXXXX 409
           E+Q  +K    R S     + +A                   +       G+        
Sbjct: 243 EQQIKRKSATRRNSPNTYNQNWANRSKEGGNSFRPAATSPHGKSATPSVGGSKHNTSTSS 302

Query: 410 XXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYG 589
                   +CFK  G GHIASECP R+ + +                            G
Sbjct: 303 SNTGTRNIKCFKCLGRGHIASECPTRRTMIMKVDGEITSESEISEEEVEEEEYEEEAMQG 362

Query: 590 DQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDK 769
           D    +++ R L   + +  +D  R+N+FHT+C  +GK+C +I+D G+  N+ S+ ++ K
Sbjct: 363 D----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVASSTLVTK 418

Query: 770 LGLQTVQHPHPYKLSWLQKDNEIK 841
           L L+T  HP PYKL WL +D E+K
Sbjct: 419 LNLETKPHPTPYKLQWLSEDEEVK 442


>ref|XP_006607002.1| PREDICTED: uncharacterized protein LOC100788838 [Glycine max]
          Length = 519

 Score =  122 bits (307), Expect = 2e-25
 Identities = 79/271 (29%), Positives = 123/271 (45%), Gaps = 11/271 (4%)
 Frame = +2

Query: 62  LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241
           L+V +Y  E +  +++ ++ E  E T+AR+L GLN EI +VV+LQ Y  L+D+   AL+V
Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVVLDDLLHRALRV 242

Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388
           E+Q K       +  +R                       RP               G+ 
Sbjct: 243 EQQIKR------RSATRRNSPNTYNQNWANRSKKEGGNSFRPAATSPYGKSATPSVGGSK 296

Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568
                          +CFK  G GHIASECP R+ + +                      
Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEY 356

Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748
                 GD    +++ R L   + +  +D  R+N+FHT+C  +GK+C +I+D G+  N+ 
Sbjct: 357 GEEAMQGD----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVA 412

Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           S+ ++ KL L+T  HP PYKL WL +D E+K
Sbjct: 413 SSTLVTKLNLETKPHPRPYKLQWLSEDEEVK 443


>ref|XP_006596896.1| PREDICTED: uncharacterized protein LOC102664455 [Glycine max]
          Length = 1176

 Score =  122 bits (307), Expect = 2e-25
 Identities = 79/271 (29%), Positives = 123/271 (45%), Gaps = 11/271 (4%)
 Frame = +2

Query: 62  LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241
           L+V +Y  E +  +++ ++ E  E T+AR+L GLN EI +VV+LQ Y  L+D+   AL+V
Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVVLDDLLHRALRV 242

Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388
           E+Q K       +  +R                       RP               G+ 
Sbjct: 243 EQQIKR------RSATRRNSPNTYNQNWANRSKKEGGNSFRPAATSPYGKSATPSVGGSK 296

Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568
                          +CFK  G GHIASECP R+ + +                      
Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEY 356

Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748
                 GD    +++ R L   + +  +D  R+N+FHT+C  +GK+C +I+D G+  N+ 
Sbjct: 357 EEEAMQGD----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVA 412

Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           S+ ++ KL L+T  HP PYKL WL +D E+K
Sbjct: 413 SSTLVTKLNLETKPHPRPYKLQWLSEDEEVK 443


>gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana]
          Length = 1887

 Score =  119 bits (299), Expect = 1e-24
 Identities = 82/288 (28%), Positives = 132/288 (45%), Gaps = 5/288 (1%)
 Frame = +2

Query: 65   SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244
            SV +Y  E + LML+ D+ E  E  ++R++GGLN +I + +++Q Y  L ++   A+  E
Sbjct: 541  SVEEYYKEMETLMLRADIQEDNEAIMSRFMGGLNRDIIDRLEVQHYVELEELLHKAIMFE 600

Query: 245  KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEGATXXXXXXXXXXXX 424
            KQ K   S+ S    +  +                        Q+G              
Sbjct: 601  KQLKRRSSKPSFGSGKPSYHKDERSGFQKDYKPFIKPKVEDQDQKGKGKAVMTRTRDI-- 658

Query: 425  XXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQSTS 604
               + FK QG GH ASEC N++I+ +                              +   
Sbjct: 659  ---KGFKCQGHGHYASECSNKRIMIIKDTGEIESEDEQLEESSSTEDYEAP----SKGEL 711

Query: 605  LIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKLGLQT 784
            L+  ++LSV+   +E++  R+N+FH+ C  + K+C +IID G+  N+ S  M++KLGL+ 
Sbjct: 712  LVTMKALSVIAKTDEQEQ-RENLFHSSCMVNDKVCSLIIDGGSCTNVASETMVEKLGLKV 770

Query: 785  VQHPHPYKLSWLQKDNEIKDLVVGYMGVPKYAG-----VSMTRHPQYI 913
            ++HP PYKL WL +D E+   V   + VP   G     V MT H  Y+
Sbjct: 771  MKHPRPYKLQWLNEDGEMS--VDRQVKVPLSIGKKTILVPMTPHEVYL 816


>ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508703673|gb|EOX95569.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 1452

 Score =  118 bits (296), Expect = 3e-24
 Identities = 74/279 (26%), Positives = 121/279 (43%), Gaps = 18/279 (6%)
 Frame = +2

Query: 56  NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLAL 235
           N ++V +YT+EF+NL ++  L E  EQ  +RYL GLN+ I + + +   +++ D  + AL
Sbjct: 107 NNMTVEEYTSEFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVVRLYNIEDARQYAL 166

Query: 236 KVEKQQKEFCSRGSKYGSR------------------EGFAXXXXXXXXXXXXXXXXXXX 361
             EK+   + +R   YG+                   +G A                   
Sbjct: 167 SAEKRVLRYGARKPLYGTHWQNNSEARRGYPTSQQNYQGAATINKTNRGATNVEKNDKGK 226

Query: 362 RPLKQEGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXX 541
             +   G                 RCF     GH +  CP RK+                
Sbjct: 227 SIMPYGGQNSSGSSTNKRGSNSHIRCFTCGEKGHTSFACPQRKV-----NLAELGEELEP 281

Query: 542 XXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVII 721
                         Y  Q  SL+++R ++    EE EDW R+++F T+    GK+C ++I
Sbjct: 282 VYDEYKEEVEEIDVYPAQGESLVVRRIMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVI 341

Query: 722 DSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838
           D G+ EN++S   ++KL L T +HP+PYK+ WL+K +E+
Sbjct: 342 DGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEV 380


>gb|AAD17351.1| contains similarity to retrovirus-related polyproteins and to CCHC
           zinc finger protein (Pfam: PF00098, Score=16.3, E=0.051,
           E= 1) [Arabidopsis thaliana] gi|7267432|emb|CAB77944.1|
           putative polyprotein [Arabidopsis thaliana]
          Length = 1138

 Score =  118 bits (295), Expect = 4e-24
 Identities = 76/272 (27%), Positives = 123/272 (45%)
 Frame = +2

Query: 23  DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202
           ++  ++ N      +V +Y  E + LML+ D+ E  E T++R++GGLN +I +  ++  Y
Sbjct: 148 ELHQRLRNLVQGNRTVEEYFKEMETLMLRADVQEECEATMSRFMGGLNRDILDRFEVIHY 207

Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382
            +L ++   A+  EKQ K   ++ S   S+  +                      +  +G
Sbjct: 208 ENLEELFHKAVMFEKQIKRRSAKPSYNSSKPSYQREEKSGFQKEYKPFVKPKVEEISSKG 267

Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562
                            +CFK  G GH ASEC N++I+ +                    
Sbjct: 268 KEKEVTRTRDL------KCFKCHGLGHYASECSNKRIMII--------RDSGEVESEDEK 313

Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742
                V    +   L+  R LSV+   EE+   R+N+FHT+C   GK+C +IID G+  N
Sbjct: 314 PEESDVEEAPKGELLVTMRVLSVLNKAEEQAQ-RENLFHTRCLIKGKVCSLIIDGGSCTN 372

Query: 743 MVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838
           + S  M+ KLGL+   HP PYKL WL +  E+
Sbjct: 373 VASETMVQKLGLEEFPHPKPYKLQWLNESGEM 404


>emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera]
          Length = 1115

 Score =  118 bits (295), Expect = 4e-24
 Identities = 83/264 (31%), Positives = 125/264 (47%), Gaps = 4/264 (1%)
 Frame = +2

Query: 65  SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244
           SV  Y  E +  M+  ++ E  E T+AR+L GLN +I+NVV+LQ Y  L D+  + +KVE
Sbjct: 143 SVDXYHKEMEIAMIXANVEEDREATMARFLNGLNRDIANVVELQHYVELXDMVHMXIKVE 202

Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRP--LKQEGATXXXXXXXXXX 418
           +Q K   +R  +  S                        RP   K EG            
Sbjct: 203 RQLKRKGTRSFQNXSSSA-------------------SWRPNGRKDEGVVFTSKXEPPKR 243

Query: 419 XXXXRRCFKYQGFGHIASECPNRK--IVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGD 592
                   K    GHIAS+CPN++  I  +                         V Y  
Sbjct: 244 RDEAPNVNK----GHIASQCPNKRTMIARVDGEVETXSEEDDDQMSXLEDACDDNVEYPX 299

Query: 593 QSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKL 772
           +  SL+ +R+LS    E++ +  R+N FHT+C  + K+C +IID G+  N+ ST +++KL
Sbjct: 300 EGESLVARRALSAQVKEDDMEQQRENXFHTRCHINNKVCSMIIDGGSCTNVASTTLVEKL 359

Query: 773 GLQTVQHPHPYKLSWLQKDNEIKD 844
            L T+++P PYKL WL    ++K+
Sbjct: 360 NLPTLKYPRPYKLXWLNDCGKVKE 383


>gb|ADP20181.1| mutant gag-pol polyprotein [Pisum sativum]
          Length = 572

 Score =  117 bits (293), Expect = 7e-24
 Identities = 76/258 (29%), Positives = 125/258 (48%)
 Frame = +2

Query: 65  SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244
           SV +Y  E + L ++ ++ E +E T+AR+L GLN++IS++V+L  Y  ++++   A+KVE
Sbjct: 176 SVEEYFKEMEVLKIRANVEEDDEATMARFLHGLNHDISDIVELHHYVEMDELVHQAIKVE 235

Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEGATXXXXXXXXXXXX 424
           +Q K    R S+                             ++ +G T            
Sbjct: 236 QQLK----RKSQARRNSTTFNSQSWKDKTKKEGASSSKEATVENKGKTITSSSSSVSTNK 291

Query: 425 XXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQSTS 604
             + CFK QG GHIAS+CP ++ + +                         +  GD    
Sbjct: 292 SVK-CFKCQGQGHIASQCPTKRTMLM----EENEGIVEEEDGDYDEEFEEEIPSGD---- 342

Query: 605 LIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKLGLQT 784
           L++ R +   + +EE+   R+N+FHT+C   GK+C +IID G+  N+ ST ++ KL L+T
Sbjct: 343 LLMVRRMLGSQIKEEDTGQRENLFHTRCFVQGKVCSLIIDGGSCTNVASTRLVSKLKLET 402

Query: 785 VQHPHPYKLSWLQKDNEI 838
             HP PYKL WL +  E+
Sbjct: 403 KPHPKPYKLQWLNESVEM 420


>ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
           gi|508716479|gb|EOY08376.1| DNA/RNA polymerases
           superfamily protein [Theobroma cacao]
          Length = 558

 Score =  116 bits (290), Expect = 2e-23
 Identities = 74/289 (25%), Positives = 125/289 (43%), Gaps = 18/289 (6%)
 Frame = +2

Query: 26  IFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYW 205
           I ++ H    N ++V +YT++F+NL ++  L E  EQ  +RYL GLN+ I + + +   +
Sbjct: 102 IRIEFHCLKQNNMTVEEYTSDFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVVRLY 161

Query: 206 SLNDVCKLALKVEKQQKEFCSRGSKYGSR------------------EGFAXXXXXXXXX 331
           ++ D  + AL  EK+   + +R   YG+                   +G A         
Sbjct: 162 NIEDARQYALSTEKRVLRYGARKPLYGTHWQNNSKARRGYPTSQQNYQGAATINKTNRGA 221

Query: 332 XXXXXXXXXXRPLKQEGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXX 511
                       +   G                 RCF     GH +  CP R++      
Sbjct: 222 TNVEKNDKGKGIMPYGGQNNSGSSTNKGGSNSHIRCFTCGEKGHTSFACPQRRV-----N 276

Query: 512 XXXXXXXXXXXXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCT 691
                                   Y  Q  SL+++R ++    EE EDW R+++F T+  
Sbjct: 277 LAELGEELEPVYDEYEEEVEEIDVYPAQGESLVVRRVMTTTVNEEAEDWKRRSIFRTRVV 336

Query: 692 SHGKICVVIIDSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838
             GK+C ++ID G+ EN++S   ++KL L T +HP+PYK+ WL+K +E+
Sbjct: 337 CEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEV 385


>ref|XP_006575889.1| PREDICTED: uncharacterized protein LOC102669193 [Glycine max]
          Length = 488

 Score =  115 bits (288), Expect = 3e-23
 Identities = 75/263 (28%), Positives = 124/263 (47%), Gaps = 5/263 (1%)
 Frame = +2

Query: 62  LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241
           L++ +Y  E +  +++ ++ E  E T+AR+L GLN EI +VV+LQ Y +L+D+   AL+V
Sbjct: 184 LTMEEYYKEMEMALVRANIEEESENTMARFLNGLNPEIRDVVELQKYVALDDLLHRALRV 243

Query: 242 EKQ--QKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE---GATXXXXXX 406
           E+Q  +K    R S     + +A                            G+       
Sbjct: 244 EQQIKRKSATKRNSPNTYNQNWANRSKKEGGNSFHPAATSPQGKSAASSVGGSKHNTSTS 303

Query: 407 XXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTY 586
                    +CFK  G GHI+SECP R+ + +                         +  
Sbjct: 304 SSNTGTRNIKCFKCLGRGHISSECPTRRTMIMKADGEITSESEISEEEVEEEYEEEAM-- 361

Query: 587 GDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMD 766
             Q   L+++R L   + +  +D  ++N+FHT+C  +GK+C +I+D G+  N+ S+ ++ 
Sbjct: 362 --QGDMLMVRRLLGN-QMQPLDDNHKENIFHTRCAINGKLCSLIVDGGSCTNVASSILVT 418

Query: 767 KLGLQTVQHPHPYKLSWLQKDNE 835
           KL L+T  HP PYKL WL +D E
Sbjct: 419 KLNLETKPHPRPYKLQWLSEDEE 441


>ref|XP_006598549.1| PREDICTED: uncharacterized protein LOC100803523 [Glycine max]
          Length = 459

 Score =  114 bits (286), Expect = 5e-23
 Identities = 76/271 (28%), Positives = 119/271 (43%), Gaps = 11/271 (4%)
 Frame = +2

Query: 62  LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241
           L V +Y  E +  +++ ++ E  E T+AR+L GLN EI +VV+LQ Y +L+D+   AL+V
Sbjct: 183 LIVEEYYKEMETALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVALDDLLHRALRV 242

Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388
           E++ K       K  +R                       RP               G+ 
Sbjct: 243 EQKIKR------KSATRRNSPNTYNQNWANRSKKKGGNSFRPAATSPHGKSAASSVGGSK 296

Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568
                          +CFK  G GHIA EC  R+ + +                      
Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIACECSTRRTMIMKADGEITSESEISEEEVEEEEY 356

Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748
                 GD    +++ R L   +    +D  R+N+FHT+C  +GK+C +I+D G+  N+ 
Sbjct: 357 EEEAMQGD----MLMVRRLLGNQMHPLDDNQRENIFHTRCIINGKLCSLIVDGGSCTNVA 412

Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841
           S+ ++  L L+T  HP PYKL WL +D E+K
Sbjct: 413 SSRLVSNLNLETKPHPRPYKLQWLSEDEEVK 443

