BLASTX nr result
ID: Sinomenium21_contig00017112
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00017112 (972 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom... 213 1e-52 ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac... 212 2e-52 ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom... 199 2e-48 ref|XP_007048014.1| Gag-pol polyprotein-like protein [Theobroma ... 182 1e-43 ref|XP_006493977.1| PREDICTED: serine/threonine-protein kinase P... 140 6e-31 emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera] 137 5e-30 ref|XP_006603400.1| PREDICTED: uncharacterized protein LOC102659... 124 6e-26 ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The... 123 1e-25 gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] 123 1e-25 ref|XP_006607055.1| PREDICTED: uncharacterized protein LOC100778... 122 2e-25 ref|XP_006607002.1| PREDICTED: uncharacterized protein LOC100788... 122 2e-25 ref|XP_006596896.1| PREDICTED: uncharacterized protein LOC102664... 122 2e-25 gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana] 119 1e-24 ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The... 118 3e-24 gb|AAD17351.1| contains similarity to retrovirus-related polypro... 118 4e-24 emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera] 118 4e-24 gb|ADP20181.1| mutant gag-pol polyprotein [Pisum sativum] 117 7e-24 ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [The... 116 2e-23 ref|XP_006575889.1| PREDICTED: uncharacterized protein LOC102669... 115 3e-23 ref|XP_006598549.1| PREDICTED: uncharacterized protein LOC100803... 114 5e-23 >ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao] gi|508718388|gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao] Length = 505 Score = 213 bits (541), Expect = 1e-52 Identities = 113/273 (41%), Positives = 156/273 (57%) Frame = +2 Query: 23 DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202 +IF+K HN ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN EI++VVQLQ Y Sbjct: 17 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTVARYLGGLNVEIADVVQLQPY 76 Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382 W+LNDV +LALKVEKQ+ S S +E + + Sbjct: 77 WNLNDVIRLALKVEKQRSRKRSMSSSR-QQESISNDESQSSVTIPPPKVNSSKTASSNDK 135 Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562 T ++CFK QGFGHIA +CPNR+I++LV Sbjct: 136 ETTFTRASNVN-----KKCFKCQGFGHIAFDCPNRRIISLVEEEDYANWEKLEPVYDEYD 190 Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742 D +LI++R+L+ ++E W+R N+F+T+CTS GK+C VIIDSG+ EN Sbjct: 191 DEEIEEVSADHGEALIVRRNLNTAMMTKDESWLRHNIFYTRCTSQGKVCNVIIDSGSCEN 250 Query: 743 MVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 +++ M++KL LQT HPHPYKL WL+K NE+K Sbjct: 251 VIANYMVEKLKLQTEVHPHPYKLQWLRKGNEVK 283 >ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao] gi|508704828|gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao] Length = 794 Score = 212 bits (540), Expect = 2e-52 Identities = 113/275 (41%), Positives = 157/275 (57%), Gaps = 2/275 (0%) Frame = +2 Query: 23 DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202 +IF+K HN ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN I++VVQLQ Y Sbjct: 168 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTVARYLGGLNVGIADVVQLQPY 227 Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382 W+LNDV +LALKVEKQQ S S ++ + + + Sbjct: 228 WNLNDVIRLALKVEKQQLRKSSMSSS--RQKDSTSNRGRQSSATIPPPKVNSSKTINHKE 285 Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562 T ++CFK QGFGHIAS+CPNR+I++L+ Sbjct: 286 TTSTRAPNVN------KKCFKCQGFGHIASDCPNRRIISLIEEEVMEEPSLEEVDDELEI 339 Query: 563 XXXXXV--TYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTF 736 + D +L+++R+L+ E+E W+R N+FHT+CTS GK+C VIIDSG+ Sbjct: 340 FNNEEIEEVSADHGEALVVRRNLNTAMLTEDESWLRHNIFHTRCTSQGKVCNVIIDSGSC 399 Query: 737 ENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 EN+++ M+ KL LQT HPHPYKL WL+K NE+K Sbjct: 400 ENVIANYMVKKLKLQTEVHPHPYKLQWLRKGNEVK 434 >ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao] gi|508726763|gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao] Length = 625 Score = 199 bits (505), Expect = 2e-48 Identities = 109/273 (39%), Positives = 151/273 (55%) Frame = +2 Query: 23 DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202 +IF+K HN ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN EI++VVQLQ Y Sbjct: 137 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVHEPEEQTLARYLGGLNVEIADVVQLQPY 196 Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382 W+LNDV +L LKVEKQQ S S +E + + Sbjct: 197 WNLNDVIRLTLKVEKQQSRKRSMSSSR-QQESISNDESQSSVTIPPPKVNSSKTASSNDK 255 Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562 T ++CFK Q FGHIAS+CP+R+I++LV Sbjct: 256 ETTFTRASNVN-----KKCFKCQRFGHIASDCPSRRIISLVEEEDYVNWEKLEPVYDEYD 310 Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742 D + I++R+L+ ++E +R N+F+T+CTS G +C VIIDSG+ EN Sbjct: 311 DEEIEEVSADHGEAFIVRRNLNTALMTKDESCLRHNIFYTRCTSQGNVCNVIIDSGSCEN 370 Query: 743 MVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 +V+ M++KL L T HPHPYKL WL+K NE+K Sbjct: 371 VVANYMVEKLKLPTEVHPHPYKLQWLRKGNEVK 403 >ref|XP_007048014.1| Gag-pol polyprotein-like protein [Theobroma cacao] gi|508700275|gb|EOX92171.1| Gag-pol polyprotein-like protein [Theobroma cacao] Length = 399 Score = 182 bits (463), Expect = 1e-43 Identities = 102/263 (38%), Positives = 141/263 (53%) Frame = +2 Query: 23 DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202 +IF+K HN ++V +YT EF+ L +K D+ EPEEQT+ARYLGGLN EI+++VQLQ Y Sbjct: 168 EIFIKFHNLRQKTMTVEEYTMEFEQLHMKCDVQEPEEQTVARYLGGLNVEIADIVQLQPY 227 Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382 W+LNDV +LALK SK S K+ Sbjct: 228 WNLNDVIRLALKSSVTIPPPKVNSSKTASSND------------------------KKTT 263 Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562 T ++CFK QGFGHIAS+C NR+I++LV Sbjct: 264 FTRASNVN--------KKCFKCQGFGHIASDCSNRRIISLVEEEDYANWEKLKPVYDEYD 315 Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742 D +LI++R+L+ ++E W R N+F+T+CTS GK+C VIIDSG++EN Sbjct: 316 DEEIEEVSADHGEALIVRRNLNTAMMTKDESWFRHNIFYTRCTSQGKVCNVIIDSGSYEN 375 Query: 743 MVSTCMMDKLGLQTVQHPHPYKL 811 +++ M++KL L T HPHPYKL Sbjct: 376 VIANYMVEKLKLPTEVHPHPYKL 398 >ref|XP_006493977.1| PREDICTED: serine/threonine-protein kinase PBS1-like [Citrus sinensis] Length = 611 Score = 140 bits (354), Expect = 6e-31 Identities = 81/201 (40%), Positives = 107/201 (53%) Frame = +2 Query: 59 ELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALK 238 +L + + TAEF+ LM+K D+ EPEEQTIA YLGGL EI N+VQL+ YW+ DVCKL++K Sbjct: 415 DLFIEESTAEFEQLMMKCDIVEPEEQTIAHYLGGLRIEIGNIVQLRPYWTFQDVCKLSIK 474 Query: 239 VEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEGATXXXXXXXXXX 418 VE+QQKE + S+ +R G P K E Sbjct: 475 VERQQKEARNNSSQSYTRPGSFSRSHPISVKRNSAIKSSPEVPQKDE-VGGNLKQPASTS 533 Query: 419 XXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQS 598 RRCFK QG+GHIAS+CPNR+IV LV VTY D+ Sbjct: 534 NTNSRRCFKCQGYGHIASDCPNRRIVTLV---EEESDGSDEADTKNPGDEEKKVTYADEG 590 Query: 599 TSLIIQRSLSVVRAEEEEDWV 661 SLI++++LS E++EDW+ Sbjct: 591 ESLILRKTLSSNHVEDQEDWL 611 >emb|CAN77900.1| hypothetical protein VITISV_037350 [Vitis vinifera] Length = 1173 Score = 137 bits (346), Expect = 5e-30 Identities = 82/264 (31%), Positives = 131/264 (49%), Gaps = 4/264 (1%) Frame = +2 Query: 65 SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244 +V DY E + M++ ++ E E T+AR+L GLN +I+NVV+LQ Y L ++ +A+KVE Sbjct: 143 NVDDYHKEMEIAMIRANVEEDRETTMARFLNGLNRDIANVVELQHYVELENMVHMAIKVE 202 Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQ--EGATXXXXXXXXXX 418 +Q K +G+ G + +P K+ E Sbjct: 203 RQLKR---KGTLSFQNPGSSASWRPNGRKDEGVVFKSKTKPPKRRDEAPNVNKGKNESQT 259 Query: 419 XXXXRRCFKYQGFGHIASECPNRK--IVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGD 592 +CF Y G GHIAS+CPN++ I + V Y Sbjct: 260 RNHDIKCFHYLGVGHIASQCPNKRTMIAHVDGEVETESEEDDDQMPSLEDSCDDNVEYPV 319 Query: 593 QSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKL 772 + SL+ +R+LS +++ + R+N+FHT+C + K+C +IID G+ N+ ST +++KL Sbjct: 320 EGESLVARRALSAQVKKDDMEQQRENIFHTRCHINNKVCSMIIDGGSCANVASTTLVEKL 379 Query: 773 GLQTVQHPHPYKLSWLQKDNEIKD 844 L T++HP PYKL WL E+K+ Sbjct: 380 NLPTLKHPRPYKLQWLNDCGEVKE 403 >ref|XP_006603400.1| PREDICTED: uncharacterized protein LOC102659640 [Glycine max] Length = 594 Score = 124 bits (311), Expect = 6e-26 Identities = 81/271 (29%), Positives = 123/271 (45%), Gaps = 11/271 (4%) Frame = +2 Query: 62 LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241 L+V +Y E + +++ ++ E E T+AR+L GLN EI +VV+LQ Y L+D+ AL+V Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVVLDDLLHRALRV 242 Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388 E+Q K K +R RP G+ Sbjct: 243 EQQIKR------KSATRRNSPNTYNQNWANRSKKEGGNSFRPAATSPYGKSATPSVGGSK 296 Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568 +CFK G GHIASECP R+ + + Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEY 356 Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748 GD +++ R L + + +D R+N+FHT+C +GK+C +I+D G+ N+ Sbjct: 357 EEEAMQGD----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVA 412 Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 S+ ++ KL L+T HP PYKL WL +D EIK Sbjct: 413 SSTLVTKLNLETKPHPRPYKLQWLSEDEEIK 443 >ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702148|gb|EOX94044.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 546 Score = 123 bits (308), Expect = 1e-25 Identities = 77/292 (26%), Positives = 128/292 (43%), Gaps = 18/292 (6%) Frame = +2 Query: 17 TTDIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQ 196 T +++ K H N ++V +YT+EF+NL ++ L E EQ +RYL GLN+ I + + + Sbjct: 103 TMELYEKFHCLKQNNMTVEEYTSEFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVV 162 Query: 197 LYWSLNDVCKLALKVEKQQKEFCSRGSKYGSR------------------EGFAXXXXXX 322 +++ D + AL EK+ + +R YG+ +G A Sbjct: 163 RLYNIEDARQYALSAEKRVLRYGARKPLYGTHWQNNSEARRGYPTSQQNYQGAATINKTN 222 Query: 323 XXXXXXXXXXXXXRPLKQEGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVAL 502 + G RCF GHI+ CP R++ Sbjct: 223 KGATNVEKNDKGKSIMPYGGQNSSGSSTNKGGSNSHIRCFTCGEKGHISFACPQRRV--- 279 Query: 503 VXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHT 682 Y Q SL+++R ++ EE EDW R+++F T Sbjct: 280 --NLAELGEELEPVYDEYEEEVEEIDVYPAQGESLVVRRVMTTTVNEEAEDWKRRSIFRT 337 Query: 683 KCTSHGKICVVIIDSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838 + GK+C ++ID G+ EN++S ++KL L T +HP+PYK+ WL+K +E+ Sbjct: 338 RVVCEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEV 389 >gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] Length = 1518 Score = 123 bits (308), Expect = 1e-25 Identities = 84/290 (28%), Positives = 128/290 (44%), Gaps = 15/290 (5%) Frame = +2 Query: 17 TTDIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQ 196 T D+F+K+ N E +V Y EF+ L L+ ++ E EQ IAR+L GL+ I+ V++Q Sbjct: 178 TQDLFIKLSNLKQKEKTVEAYLREFEQLTLQCEINEKSEQRIARFLEGLDKNIAAEVRMQ 237 Query: 197 LYWSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQ 376 WS +DV L+L+VEK G + A + Q Sbjct: 238 PLWSYDDVVNLSLRVEKM-----------GKTKPVATRPKPVFRPYSSVKINDPPKTTPQ 286 Query: 377 ----EGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVA-----------LVXX 511 +G +CF+ QGFGH +CP+ + + LV Sbjct: 287 STVDKGKAPMNPKINPPLSRDKIKCFQCQGFGHFRKDCPSARTLTAIEVAEWEREGLVEY 346 Query: 512 XXXXXXXXXXXXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCT 691 V + D SL + R + +A E D R +F ++CT Sbjct: 347 EEDEALVLEEVESEKETSPDQIVAHPDTGHSLFLWRVMHSQQAPLEADQ-RSMIFRSRCT 405 Query: 692 SHGKICVVIIDSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 G++C +II+ G+ N+ ST M+ KLGL T +HP+PYKL WL KD+ ++ Sbjct: 406 VQGRVCNLIINGGSCTNVASTTMVSKLGLPTQEHPNPYKLRWLSKDSGVR 455 >ref|XP_006607055.1| PREDICTED: uncharacterized protein LOC100778333, partial [Glycine max] Length = 560 Score = 122 bits (307), Expect = 2e-25 Identities = 78/264 (29%), Positives = 124/264 (46%), Gaps = 4/264 (1%) Frame = +2 Query: 62 LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241 L+V +Y E + +++ ++ E E T+AR+L GLN I +VV+LQ Y L+D+ AL+V Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPAIRDVVELQEYVVLDDLLHRALRV 242 Query: 242 EKQ--QKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE--GATXXXXXXX 409 E+Q +K R S + +A + G+ Sbjct: 243 EQQIKRKSATRRNSPNTYNQNWANRSKEGGNSFRPAATSPHGKSATPSVGGSKHNTSTSS 302 Query: 410 XXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYG 589 +CFK G GHIASECP R+ + + G Sbjct: 303 SNTGTRNIKCFKCLGRGHIASECPTRRTMIMKVDGEITSESEISEEEVEEEEYEEEAMQG 362 Query: 590 DQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDK 769 D +++ R L + + +D R+N+FHT+C +GK+C +I+D G+ N+ S+ ++ K Sbjct: 363 D----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVASSTLVTK 418 Query: 770 LGLQTVQHPHPYKLSWLQKDNEIK 841 L L+T HP PYKL WL +D E+K Sbjct: 419 LNLETKPHPTPYKLQWLSEDEEVK 442 >ref|XP_006607002.1| PREDICTED: uncharacterized protein LOC100788838 [Glycine max] Length = 519 Score = 122 bits (307), Expect = 2e-25 Identities = 79/271 (29%), Positives = 123/271 (45%), Gaps = 11/271 (4%) Frame = +2 Query: 62 LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241 L+V +Y E + +++ ++ E E T+AR+L GLN EI +VV+LQ Y L+D+ AL+V Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVVLDDLLHRALRV 242 Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388 E+Q K + +R RP G+ Sbjct: 243 EQQIKR------RSATRRNSPNTYNQNWANRSKKEGGNSFRPAATSPYGKSATPSVGGSK 296 Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568 +CFK G GHIASECP R+ + + Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEY 356 Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748 GD +++ R L + + +D R+N+FHT+C +GK+C +I+D G+ N+ Sbjct: 357 GEEAMQGD----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVA 412 Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 S+ ++ KL L+T HP PYKL WL +D E+K Sbjct: 413 SSTLVTKLNLETKPHPRPYKLQWLSEDEEVK 443 >ref|XP_006596896.1| PREDICTED: uncharacterized protein LOC102664455 [Glycine max] Length = 1176 Score = 122 bits (307), Expect = 2e-25 Identities = 79/271 (29%), Positives = 123/271 (45%), Gaps = 11/271 (4%) Frame = +2 Query: 62 LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241 L+V +Y E + +++ ++ E E T+AR+L GLN EI +VV+LQ Y L+D+ AL+V Sbjct: 183 LTVEEYYKEMEMALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVVLDDLLHRALRV 242 Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388 E+Q K + +R RP G+ Sbjct: 243 EQQIKR------RSATRRNSPNTYNQNWANRSKKEGGNSFRPAATSPYGKSATPSVGGSK 296 Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568 +CFK G GHIASECP R+ + + Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIASECPTRRTMIMKADGEITSESEISEEEVEEEEY 356 Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748 GD +++ R L + + +D R+N+FHT+C +GK+C +I+D G+ N+ Sbjct: 357 EEEAMQGD----MLMVRRLLGNQMQPLDDNQRENIFHTRCVINGKLCSLIVDGGSCTNVA 412 Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 S+ ++ KL L+T HP PYKL WL +D E+K Sbjct: 413 SSTLVTKLNLETKPHPRPYKLQWLSEDEEVK 443 >gb|AAF79348.1|AC007887_7 F15O4.13 [Arabidopsis thaliana] Length = 1887 Score = 119 bits (299), Expect = 1e-24 Identities = 82/288 (28%), Positives = 132/288 (45%), Gaps = 5/288 (1%) Frame = +2 Query: 65 SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244 SV +Y E + LML+ D+ E E ++R++GGLN +I + +++Q Y L ++ A+ E Sbjct: 541 SVEEYYKEMETLMLRADIQEDNEAIMSRFMGGLNRDIIDRLEVQHYVELEELLHKAIMFE 600 Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEGATXXXXXXXXXXXX 424 KQ K S+ S + + Q+G Sbjct: 601 KQLKRRSSKPSFGSGKPSYHKDERSGFQKDYKPFIKPKVEDQDQKGKGKAVMTRTRDI-- 658 Query: 425 XXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQSTS 604 + FK QG GH ASEC N++I+ + + Sbjct: 659 ---KGFKCQGHGHYASECSNKRIMIIKDTGEIESEDEQLEESSSTEDYEAP----SKGEL 711 Query: 605 LIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKLGLQT 784 L+ ++LSV+ +E++ R+N+FH+ C + K+C +IID G+ N+ S M++KLGL+ Sbjct: 712 LVTMKALSVIAKTDEQEQ-RENLFHSSCMVNDKVCSLIIDGGSCTNVASETMVEKLGLKV 770 Query: 785 VQHPHPYKLSWLQKDNEIKDLVVGYMGVPKYAG-----VSMTRHPQYI 913 ++HP PYKL WL +D E+ V + VP G V MT H Y+ Sbjct: 771 MKHPRPYKLQWLNEDGEMS--VDRQVKVPLSIGKKTILVPMTPHEVYL 816 >ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508703673|gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 118 bits (296), Expect = 3e-24 Identities = 74/279 (26%), Positives = 121/279 (43%), Gaps = 18/279 (6%) Frame = +2 Query: 56 NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLAL 235 N ++V +YT+EF+NL ++ L E EQ +RYL GLN+ I + + + +++ D + AL Sbjct: 107 NNMTVEEYTSEFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVVRLYNIEDARQYAL 166 Query: 236 KVEKQQKEFCSRGSKYGSR------------------EGFAXXXXXXXXXXXXXXXXXXX 361 EK+ + +R YG+ +G A Sbjct: 167 SAEKRVLRYGARKPLYGTHWQNNSEARRGYPTSQQNYQGAATINKTNRGATNVEKNDKGK 226 Query: 362 RPLKQEGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXX 541 + G RCF GH + CP RK+ Sbjct: 227 SIMPYGGQNSSGSSTNKRGSNSHIRCFTCGEKGHTSFACPQRKV-----NLAELGEELEP 281 Query: 542 XXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVII 721 Y Q SL+++R ++ EE EDW R+++F T+ GK+C ++I Sbjct: 282 VYDEYKEEVEEIDVYPAQGESLVVRRIMTTTVNEEAEDWKRRSIFRTRVVCEGKVCDLVI 341 Query: 722 DSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838 D G+ EN++S ++KL L T +HP+PYK+ WL+K +E+ Sbjct: 342 DGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEV 380 >gb|AAD17351.1| contains similarity to retrovirus-related polyproteins and to CCHC zinc finger protein (Pfam: PF00098, Score=16.3, E=0.051, E= 1) [Arabidopsis thaliana] gi|7267432|emb|CAB77944.1| putative polyprotein [Arabidopsis thaliana] Length = 1138 Score = 118 bits (295), Expect = 4e-24 Identities = 76/272 (27%), Positives = 123/272 (45%) Frame = +2 Query: 23 DIFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLY 202 ++ ++ N +V +Y E + LML+ D+ E E T++R++GGLN +I + ++ Y Sbjct: 148 ELHQRLRNLVQGNRTVEEYFKEMETLMLRADVQEECEATMSRFMGGLNRDILDRFEVIHY 207 Query: 203 WSLNDVCKLALKVEKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEG 382 +L ++ A+ EKQ K ++ S S+ + + +G Sbjct: 208 ENLEELFHKAVMFEKQIKRRSAKPSYNSSKPSYQREEKSGFQKEYKPFVKPKVEEISSKG 267 Query: 383 ATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXX 562 +CFK G GH ASEC N++I+ + Sbjct: 268 KEKEVTRTRDL------KCFKCHGLGHYASECSNKRIMII--------RDSGEVESEDEK 313 Query: 563 XXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFEN 742 V + L+ R LSV+ EE+ R+N+FHT+C GK+C +IID G+ N Sbjct: 314 PEESDVEEAPKGELLVTMRVLSVLNKAEEQAQ-RENLFHTRCLIKGKVCSLIIDGGSCTN 372 Query: 743 MVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838 + S M+ KLGL+ HP PYKL WL + E+ Sbjct: 373 VASETMVQKLGLEEFPHPKPYKLQWLNESGEM 404 >emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera] Length = 1115 Score = 118 bits (295), Expect = 4e-24 Identities = 83/264 (31%), Positives = 125/264 (47%), Gaps = 4/264 (1%) Frame = +2 Query: 65 SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244 SV Y E + M+ ++ E E T+AR+L GLN +I+NVV+LQ Y L D+ + +KVE Sbjct: 143 SVDXYHKEMEIAMIXANVEEDREATMARFLNGLNRDIANVVELQHYVELXDMVHMXIKVE 202 Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRP--LKQEGATXXXXXXXXXX 418 +Q K +R + S RP K EG Sbjct: 203 RQLKRKGTRSFQNXSSSA-------------------SWRPNGRKDEGVVFTSKXEPPKR 243 Query: 419 XXXXRRCFKYQGFGHIASECPNRK--IVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGD 592 K GHIAS+CPN++ I + V Y Sbjct: 244 RDEAPNVNK----GHIASQCPNKRTMIARVDGEVETXSEEDDDQMSXLEDACDDNVEYPX 299 Query: 593 QSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKL 772 + SL+ +R+LS E++ + R+N FHT+C + K+C +IID G+ N+ ST +++KL Sbjct: 300 EGESLVARRALSAQVKEDDMEQQRENXFHTRCHINNKVCSMIIDGGSCTNVASTTLVEKL 359 Query: 773 GLQTVQHPHPYKLSWLQKDNEIKD 844 L T+++P PYKL WL ++K+ Sbjct: 360 NLPTLKYPRPYKLXWLNDCGKVKE 383 >gb|ADP20181.1| mutant gag-pol polyprotein [Pisum sativum] Length = 572 Score = 117 bits (293), Expect = 7e-24 Identities = 76/258 (29%), Positives = 125/258 (48%) Frame = +2 Query: 65 SVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKVE 244 SV +Y E + L ++ ++ E +E T+AR+L GLN++IS++V+L Y ++++ A+KVE Sbjct: 176 SVEEYFKEMEVLKIRANVEEDDEATMARFLHGLNHDISDIVELHHYVEMDELVHQAIKVE 235 Query: 245 KQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQEGATXXXXXXXXXXXX 424 +Q K R S+ ++ +G T Sbjct: 236 QQLK----RKSQARRNSTTFNSQSWKDKTKKEGASSSKEATVENKGKTITSSSSSVSTNK 291 Query: 425 XXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTYGDQSTS 604 + CFK QG GHIAS+CP ++ + + + GD Sbjct: 292 SVK-CFKCQGQGHIASQCPTKRTMLM----EENEGIVEEEDGDYDEEFEEEIPSGD---- 342 Query: 605 LIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMDKLGLQT 784 L++ R + + +EE+ R+N+FHT+C GK+C +IID G+ N+ ST ++ KL L+T Sbjct: 343 LLMVRRMLGSQIKEEDTGQRENLFHTRCFVQGKVCSLIIDGGSCTNVASTRLVSKLKLET 402 Query: 785 VQHPHPYKLSWLQKDNEI 838 HP PYKL WL + E+ Sbjct: 403 KPHPKPYKLQWLNESVEM 420 >ref|XP_007027874.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508716479|gb|EOY08376.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 558 Score = 116 bits (290), Expect = 2e-23 Identities = 74/289 (25%), Positives = 125/289 (43%), Gaps = 18/289 (6%) Frame = +2 Query: 26 IFLKIHNF**NELSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYW 205 I ++ H N ++V +YT++F+NL ++ L E EQ +RYL GLN+ I + + + + Sbjct: 102 IRIEFHCLKQNNMTVEEYTSDFNNLSIRVGLAESNEQITSRYLAGLNHSIRDEMGVVRLY 161 Query: 206 SLNDVCKLALKVEKQQKEFCSRGSKYGSR------------------EGFAXXXXXXXXX 331 ++ D + AL EK+ + +R YG+ +G A Sbjct: 162 NIEDARQYALSTEKRVLRYGARKPLYGTHWQNNSKARRGYPTSQQNYQGAATINKTNRGA 221 Query: 332 XXXXXXXXXXRPLKQEGATXXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXX 511 + G RCF GH + CP R++ Sbjct: 222 TNVEKNDKGKGIMPYGGQNNSGSSTNKGGSNSHIRCFTCGEKGHTSFACPQRRV-----N 276 Query: 512 XXXXXXXXXXXXXXXXXXXXXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCT 691 Y Q SL+++R ++ EE EDW R+++F T+ Sbjct: 277 LAELGEELEPVYDEYEEEVEEIDVYPAQGESLVVRRVMTTTVNEEAEDWKRRSIFRTRVV 336 Query: 692 SHGKICVVIIDSGTFENMVSTCMMDKLGLQTVQHPHPYKLSWLQKDNEI 838 GK+C ++ID G+ EN++S ++KL L T +HP+PYK+ WL+K +E+ Sbjct: 337 CEGKVCDLVIDGGSMENIISKEAVNKLKLPTNKHPYPYKIGWLKKGHEV 385 >ref|XP_006575889.1| PREDICTED: uncharacterized protein LOC102669193 [Glycine max] Length = 488 Score = 115 bits (288), Expect = 3e-23 Identities = 75/263 (28%), Positives = 124/263 (47%), Gaps = 5/263 (1%) Frame = +2 Query: 62 LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241 L++ +Y E + +++ ++ E E T+AR+L GLN EI +VV+LQ Y +L+D+ AL+V Sbjct: 184 LTMEEYYKEMEMALVRANIEEESENTMARFLNGLNPEIRDVVELQKYVALDDLLHRALRV 243 Query: 242 EKQ--QKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE---GATXXXXXX 406 E+Q +K R S + +A G+ Sbjct: 244 EQQIKRKSATKRNSPNTYNQNWANRSKKEGGNSFHPAATSPQGKSAASSVGGSKHNTSTS 303 Query: 407 XXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXXXXXVTY 586 +CFK G GHI+SECP R+ + + + Sbjct: 304 SSNTGTRNIKCFKCLGRGHISSECPTRRTMIMKADGEITSESEISEEEVEEEYEEEAM-- 361 Query: 587 GDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMVSTCMMD 766 Q L+++R L + + +D ++N+FHT+C +GK+C +I+D G+ N+ S+ ++ Sbjct: 362 --QGDMLMVRRLLGN-QMQPLDDNHKENIFHTRCAINGKLCSLIVDGGSCTNVASSILVT 418 Query: 767 KLGLQTVQHPHPYKLSWLQKDNE 835 KL L+T HP PYKL WL +D E Sbjct: 419 KLNLETKPHPRPYKLQWLSEDEE 441 >ref|XP_006598549.1| PREDICTED: uncharacterized protein LOC100803523 [Glycine max] Length = 459 Score = 114 bits (286), Expect = 5e-23 Identities = 76/271 (28%), Positives = 119/271 (43%), Gaps = 11/271 (4%) Frame = +2 Query: 62 LSVADYTAEFDNLMLKGDLTEPEEQTIARYLGGLNYEISNVVQLQLYWSLNDVCKLALKV 241 L V +Y E + +++ ++ E E T+AR+L GLN EI +VV+LQ Y +L+D+ AL+V Sbjct: 183 LIVEEYYKEMETALVRANIEEDSEDTMARFLNGLNPEIRDVVELQEYVALDDLLHRALRV 242 Query: 242 EKQQKEFCSRGSKYGSREGFAXXXXXXXXXXXXXXXXXXXRPLKQE-----------GAT 388 E++ K K +R RP G+ Sbjct: 243 EQKIKR------KSATRRNSPNTYNQNWANRSKKKGGNSFRPAATSPHGKSAASSVGGSK 296 Query: 389 XXXXXXXXXXXXXXRRCFKYQGFGHIASECPNRKIVALVXXXXXXXXXXXXXXXXXXXXX 568 +CFK G GHIA EC R+ + + Sbjct: 297 HNTSTSSSNTGTRNIKCFKCLGRGHIACECSTRRTMIMKADGEITSESEISEEEVEEEEY 356 Query: 569 XXXVTYGDQSTSLIIQRSLSVVRAEEEEDWVRKNVFHTKCTSHGKICVVIIDSGTFENMV 748 GD +++ R L + +D R+N+FHT+C +GK+C +I+D G+ N+ Sbjct: 357 EEEAMQGD----MLMVRRLLGNQMHPLDDNQRENIFHTRCIINGKLCSLIVDGGSCTNVA 412 Query: 749 STCMMDKLGLQTVQHPHPYKLSWLQKDNEIK 841 S+ ++ L L+T HP PYKL WL +D E+K Sbjct: 413 SSRLVSNLNLETKPHPRPYKLQWLSEDEEVK 443