BLASTX nr result
ID: Cocculus23_contig00060466
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus23_contig00060466 (355 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobrom... 130 1e-28 ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cac... 127 1e-27 ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobrom... 124 2e-26 ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobrom... 123 2e-26 gb|ABD32741.1| hypothetical protein MtrDRAFT_AC150777g16v1 [Medi... 119 4e-25 ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial ... 119 6e-25 ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutr... 117 2e-24 gb|ADP20179.1| gag-pol polyprotein [Silene latifolia] 115 6e-24 gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ... 112 7e-23 gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ... 112 7e-23 ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, part... 110 2e-22 gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] 110 3e-22 ref|XP_006575997.1| PREDICTED: uncharacterized protein LOC100809... 108 6e-22 ref|XP_007050047.1| Uncharacterized protein TCM_003699 [Theobrom... 108 6e-22 ref|XP_004150126.1| PREDICTED: uncharacterized protein LOC101221... 106 4e-21 ref|XP_006494982.1| PREDICTED: uncharacterized protein LOC102624... 105 5e-21 ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [The... 105 7e-21 emb|CAN73690.1| hypothetical protein VITISV_034834 [Vitis vinifera] 104 1e-20 ref|XP_006605097.1| PREDICTED: uncharacterized protein LOC100803... 104 1e-20 ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [The... 103 3e-20 >ref|XP_007029783.1| Uncharacterized protein TCM_025656 [Theobroma cacao] gi|508718388|gb|EOY10285.1| Uncharacterized protein TCM_025656 [Theobroma cacao] Length = 505 Score = 130 bits (328), Expect = 1e-28 Identities = 56/110 (50%), Positives = 73/110 (66%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 +TE HP+PY L W +KG EV V KRC V FSIG Y+D+VWCD++P++ACH+LL RPWQY Sbjct: 263 QTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDIIPMDACHLLLGRPWQY 322 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHGNGNTQLLSMPQFSK 331 DR H+G +NTYSFI + KI L P + L+++P SK Sbjct: 323 DRRAHHDGYKNTYSFIKDGAKIMLTPLKPENRPKRQEEDKALITVPSLSK 372 >ref|XP_007052567.1| Gag-pol polyprotein, putative [Theobroma cacao] gi|508704828|gb|EOX96724.1| Gag-pol polyprotein, putative [Theobroma cacao] Length = 794 Score = 127 bits (320), Expect = 1e-27 Identities = 56/118 (47%), Positives = 74/118 (62%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 +TE HP+PY L W +KG EV V KRC V FSIG Y+D+VWCDV+P++ACH+LL RPWQY Sbjct: 414 QTEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGNKYEDEVWCDVIPMDACHLLLGRPWQY 473 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHGNGNTQLLSMPQFSKVMNDKGVV 355 DR H+G +NTYSFI + KI L P + L++M +K ++ Sbjct: 474 DRRAHHDGYKNTYSFIKDGAKIMLTPLKPEDCPKKQEKDKALITMSGLNKAFRKSSLL 531 >ref|XP_007009850.1| Uncharacterized protein TCM_043155 [Theobroma cacao] gi|508726763|gb|EOY18660.1| Uncharacterized protein TCM_043155 [Theobroma cacao] Length = 625 Score = 124 bits (310), Expect = 2e-26 Identities = 54/109 (49%), Positives = 71/109 (65%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 TE HP+PY L W +KG EV V KRC + F I Y+D+VWCDV+P++ACH+LL RPWQYD Sbjct: 384 TEVHPHPYKLQWLRKGNEVKVTKRCCIQFFIRNKYEDEVWCDVIPMDACHLLLGRPWQYD 443 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHGNGNTQLLSMPQFSK 331 R ++G +NTYSFI + VKI L P + L+++P SK Sbjct: 444 RRAHYDGYKNTYSFIKDGVKIMLTPLKPEDRPKRQEEDKALITVPSLSK 492 >ref|XP_007028192.1| Uncharacterized protein TCM_023754 [Theobroma cacao] gi|508716797|gb|EOY08694.1| Uncharacterized protein TCM_023754 [Theobroma cacao] Length = 440 Score = 123 bits (309), Expect = 2e-26 Identities = 55/109 (50%), Positives = 71/109 (65%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 TE HP+PY L W +KG EV V KRC V FSIG Y+D+VWCDV+P++ACH+LL RPWQYD Sbjct: 199 TEVHPHPYKLQWLRKGNEVKVTKRCCVQFSIGSKYEDEVWCDVIPMDACHLLLGRPWQYD 258 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHGNGNTQLLSMPQFSK 331 R ++G +N SFI + VKI L P + L+++P SK Sbjct: 259 RRAHYDGYKNISSFIKDGVKIMLTPLKPEDRPKRQEEDKALITVPTLSK 307 >gb|ABD32741.1| hypothetical protein MtrDRAFT_AC150777g16v1 [Medicago truncatula] Length = 187 Score = 119 bits (298), Expect = 4e-25 Identities = 51/85 (60%), Positives = 64/85 (75%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 T+ HP+ Y L W KKG EV V+K CLV+FSIG+ YKD VWCDV+ ++ACH+LL RPWQYD Sbjct: 9 TKDHPHRYKLQWLKKGNEVRVSKCCLVSFSIGQKYKDNVWCDVISMDACHMLLGRPWQYD 68 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVP 259 R ++G NTY+F+ VKI LVP Sbjct: 69 RHALYDGHANTYTFVKYGVKIKLVP 93 >ref|XP_007033335.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao] gi|508712364|gb|EOY04261.1| Uncharacterized protein TCM_019516, partial [Theobroma cacao] Length = 215 Score = 119 bits (297), Expect = 6e-25 Identities = 53/118 (44%), Positives = 71/118 (60%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 +TE P+PY L W +KG EV V K C V FSIG Y+D+VWCDV+P++AC +LL RPWQY Sbjct: 90 QTEVLPHPYKLQWLRKGNEVKVTKHCCVQFSIGNKYEDEVWCDVIPMDACQLLLGRPWQY 149 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHGNGNTQLLSMPQFSKVMNDKGVV 355 DR H+G +NTYSFI + KI L P + L++M +K ++ Sbjct: 150 DRRAHHDGYKNTYSFIKDGAKIMLTPLKSEDYPKKQEKDKALITMSGLNKAFRKSSLL 207 >ref|XP_006392773.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum] gi|557089351|gb|ESQ30059.1| hypothetical protein EUTSA_v10012229mg [Eutrema salsugineum] Length = 382 Score = 117 bits (292), Expect = 2e-24 Identities = 54/131 (41%), Positives = 76/131 (58%), Gaps = 15/131 (11%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 K E HP PY L+W +G +V + R LV+FSIG YKD ++CD+ P++ H++L RPWQ+ Sbjct: 248 KREDHPAPYALAWITEGTDVKITHRALVSFSIGAFYKDTIYCDIAPMDVSHLILGRPWQF 307 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVPSNELG---------------LKSHGNGNTQLLSM 316 DRD HNG++NTYSF+ KI L+P+ E +K G+ +T L S Sbjct: 308 DRDTCHNGKKNTYSFVFENRKIVLLPNPEPASLPLATNKVDIQLPVMKDLGSKHTLLCSR 367 Query: 317 PQFSKVMNDKG 349 QF + D G Sbjct: 368 VQFETELRDSG 378 >gb|ADP20179.1| gag-pol polyprotein [Silene latifolia] Length = 1475 Score = 115 bits (288), Expect = 6e-24 Identities = 48/85 (56%), Positives = 61/85 (71%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 T+ HP PY L W KG EV V+K+CLV FSIGK Y D+ CDV+P++ACH+LL RPW++D Sbjct: 425 TQDHPSPYKLRWLNKGAEVRVDKQCLVTFSIGKNYSDEALCDVLPMDACHLLLGRPWEFD 484 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVP 259 RD H+GR NTY+F K+ L P Sbjct: 485 RDSVHHGRDNTYTFKFRSRKVILTP 509 >gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1588 Score = 112 bits (279), Expect = 7e-23 Identities = 45/79 (56%), Positives = 61/79 (77%) Frame = +2 Query: 23 PYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYDRDVAHN 202 PY L W GEV VNK+C+++F++G+ Y+D++ CDVVP++ACH+LL RPWQYDRD H+ Sbjct: 465 PYRLQWLNDCGEVQVNKQCMISFNVGR-YEDEILCDVVPMQACHVLLGRPWQYDRDTTHH 523 Query: 203 GRRNTYSFIHNKVKITLVP 259 GR+N YS +HN K TL P Sbjct: 524 GRKNRYSLLHNGKKYTLAP 542 >gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1588 Score = 112 bits (279), Expect = 7e-23 Identities = 45/79 (56%), Positives = 61/79 (77%) Frame = +2 Query: 23 PYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYDRDVAHN 202 PY L W GEV VNK+C+++F++G+ Y+D++ CDVVP++ACH+LL RPWQYDRD H+ Sbjct: 465 PYRLQWLNDCGEVQVNKQCMISFNVGR-YEDEILCDVVPMQACHVLLGRPWQYDRDTTHH 523 Query: 203 GRRNTYSFIHNKVKITLVP 259 GR+N YS +HN K TL P Sbjct: 524 GRKNRYSLLHNGKKYTLAP 542 >ref|XP_006402169.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema salsugineum] gi|557103259|gb|ESQ43622.1| hypothetical protein EUTSA_v10015409mg, partial [Eutrema salsugineum] Length = 367 Score = 110 bits (275), Expect = 2e-22 Identities = 48/87 (55%), Positives = 61/87 (70%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 K E HP PY L+W K+G E+ + RCLV+FSIG YKD+++CDV ++ H+LL PWQY Sbjct: 257 KREDHPAPYKLTWLKQGVEIRIEHRCLVSFSIGSHYKDKIYCDVALMDVSHLLLGTPWQY 316 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVPS 262 DR V H+GRRN+YSFI KI L S Sbjct: 317 DRSVMHDGRRNSYSFIFENRKIVLFSS 343 >gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] Length = 1518 Score = 110 bits (274), Expect = 3e-22 Identities = 47/86 (54%), Positives = 61/86 (70%), Gaps = 1/86 (1%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVV-PLEACHILLERPWQY 181 T+ HP PY L W K V V+K+C+++FSIGK YKD+V CDVV P++ACH+LL RPW+Y Sbjct: 436 TQEHPNPYKLRWLSKDSGVRVDKQCIISFSIGKMYKDEVLCDVVVPMDACHLLLGRPWEY 495 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVP 259 DR+ H G+ N Y F H K+TL P Sbjct: 496 DRNTTHQGKDNVYIFKHQGKKVTLTP 521 >ref|XP_006575997.1| PREDICTED: uncharacterized protein LOC100809438 [Glycine max] Length = 527 Score = 108 bits (271), Expect = 6e-22 Identities = 48/86 (55%), Positives = 62/86 (72%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 KT PHP PY + W + GE+ V+++ L+ FSIGK Y D++ DVVP+EA H+LL RPWQY Sbjct: 394 KTSPHPRPYKIQWLSENGELVVDRQVLIYFSIGK-YVDEIMFDVVPMEASHLLLGRPWQY 452 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVP 259 DRDV HNG N +SF+H K+TL P Sbjct: 453 DRDVVHNGVTNKFSFVHKGQKVTLKP 478 >ref|XP_007050047.1| Uncharacterized protein TCM_003699 [Theobroma cacao] gi|508702308|gb|EOX94204.1| Uncharacterized protein TCM_003699 [Theobroma cacao] Length = 258 Score = 108 bits (271), Expect = 6e-22 Identities = 46/85 (54%), Positives = 59/85 (69%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 T+ H +PY L W +KG EV V K C V F IG Y+D++WCDV+P++ACH+ L RP QYD Sbjct: 119 TKVHLHPYKLQWLRKGNEVKVMKHCCVQFYIGNKYQDEIWCDVIPMDACHLFLGRPCQYD 178 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVP 259 H+G +NTYSFI + VKI L P Sbjct: 179 CQAHHDGYKNTYSFIKDGVKIMLTP 203 >ref|XP_004150126.1| PREDICTED: uncharacterized protein LOC101221019 [Cucumis sativus] Length = 390 Score = 106 bits (264), Expect = 4e-21 Identities = 45/86 (52%), Positives = 56/86 (65%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 K E HP PY + W KKGGE +V++ C V SIG YKDQ+ CDV+ ++ CH+LL RPWQY Sbjct: 79 KAEAHPTPYKIGWVKKGGEATVSEICTVPLSIGNAYKDQIVCDVIEMDVCHLLLGRPWQY 138 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVP 259 D H GR NTY F K+ L+P Sbjct: 139 DTQSLHKGRENTYEFQWMGRKVVLLP 164 >ref|XP_006494982.1| PREDICTED: uncharacterized protein LOC102624489 [Citrus sinensis] Length = 1083 Score = 105 bits (263), Expect = 5e-21 Identities = 54/86 (62%), Positives = 60/86 (69%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 KT H Y L W GEV VNK+ LV+FSIG+ YKD V CDVVP+ A HILL RPWQY Sbjct: 268 KTLKHSRLYKLQWLNDYGEVKVNKQVLVSFSIGR-YKDDVLCDVVPMHAGHILLGRPWQY 326 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLVP 259 DR V H+G N YSF+ NK KITLVP Sbjct: 327 DRRVTHDGYLNRYSFVINKRKITLVP 352 >ref|XP_007049887.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702148|gb|EOX94044.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 546 Score = 105 bits (262), Expect = 7e-21 Identities = 51/119 (42%), Positives = 68/119 (57%), Gaps = 3/119 (2%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 T HPYPY + W KKG EV V +CLV F++G D+ CDVVP++ HIL+ RPW YD Sbjct: 371 TNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGNNLDDEALCDVVPMDVGHILVGRPWLYD 430 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHGNGNTQL---LSMPQFSKVMNDKGV 352 D+ H + NTYSF N + TL P E KS N +++ LS F ++ G+ Sbjct: 431 HDMVHKTKPNTYSFYKNNKRYTLYPLREETKKSANNKISKITGYLSAENFEAEGSEMGI 489 >emb|CAN73690.1| hypothetical protein VITISV_034834 [Vitis vinifera] Length = 818 Score = 104 bits (260), Expect = 1e-20 Identities = 51/85 (60%), Positives = 59/85 (69%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 T HP PY L W GE VNK LV+FSIG+ YKD+V CD+VP+ A HILL RPWQ+D Sbjct: 452 TLKHPRPYKLQWLNDFGEDKVNKEVLVSFSIGR-YKDEVLCDIVPMHAGHILLGRPWQFD 510 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVP 259 R V H+G +N YSFI N ITLVP Sbjct: 511 RKVNHDGFKNRYSFIKNNKTITLVP 535 >ref|XP_006605097.1| PREDICTED: uncharacterized protein LOC100803456 [Glycine max] Length = 732 Score = 104 bits (259), Expect = 1e-20 Identities = 46/85 (54%), Positives = 61/85 (71%) Frame = +2 Query: 2 KTEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQY 181 +T+PHP PY L W + GE+ V+K+ + FSIGK YKD+ CDVVP+EA H+LL RPWQ+ Sbjct: 283 ETKPHPRPYKLQWLSEEGELRVDKQVEIQFSIGK-YKDKTLCDVVPMEASHVLLGRPWQF 341 Query: 182 DRDVAHNGRRNTYSFIHNKVKITLV 256 DR H+G +N Y F H+ K+TLV Sbjct: 342 DRKAHHDGHKNKYIFYHDNRKVTLV 366 >ref|XP_007051412.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508703673|gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 103 bits (257), Expect = 3e-20 Identities = 51/120 (42%), Positives = 68/120 (56%), Gaps = 3/120 (2%) Frame = +2 Query: 5 TEPHPYPYTLSWFKKGGEVSVNKRCLVAFSIGKTYKDQVWCDVVPLEACHILLERPWQYD 184 T HPYPY + W KKG EV V +CLV F++G D+ CDVVP++ HIL+ RPW YD Sbjct: 362 TNKHPYPYKIGWLKKGHEVPVTTQCLVKFTMGDNSDDEALCDVVPMDVGHILVGRPWLYD 421 Query: 185 RDVAHNGRRNTYSFIHNKVKITLVPSNELGLKSHG---NGNTQLLSMPQFSKVMNDKGVV 355 D+ H + NTYSF N + TL P E KS + T+ LS F ++ G++ Sbjct: 422 HDMVHKTKPNTYSFYKNNKRYTLYPLREETKKSANHKISKITRYLSAENFEAEGSEMGIM 481