BLASTX nr result
ID: Catharanthus22_contig00009730
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00009730 (666 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|AAM74412.1|AC120497_12 Putative retroelement [Oryza sativa Ja...  205   4e-70
emb|CAN84067.1| hypothetical protein VITISV_041979 [Vitis vinifera]  198   7e-70
emb|CAN80130.1| hypothetical protein VITISV_001989 [Vitis vinifera]  180   1e-65
emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera]  176   2e-61
ref|XP_006575965.1| PREDICTED: uncharacterized protein LOC102669...  206   9e-61
gb|ADP20179.1| gag-pol polyprotein [Silene latifolia]                175   4e-58
gb|EOX94045.1| DNA/RNA polymerases superfamily protein, partial ...  177   5e-58
gb|EOY01158.1| DNA/RNA polymerases superfamily protein [Theobrom...  175   2e-57
ref|XP_006596896.1| PREDICTED: uncharacterized protein LOC102664...  227   2e-57
gb|AAU90169.1| putative polyprotein [Oryza sativa Japonica Group]    187   1e-56
gb|ADP20178.1| gag-pol polyprotein [Silene latifolia]                167   2e-56
gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum ...  222   9e-56
gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao]  169   1e-55
gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobrom...  169   2e-55
gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum ...  221   2e-55
gb|AAQ56407.1| putative gag-pol polyprotein [Oryza sativa Japoni...  220   3e-55
gb|AAP43919.1| integrase [Gossypium hirsutum]                        220   3e-55
gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobrom...  169   7e-55
gb|ABF97027.1| retrotransposon protein, putative, Ty3-gypsy subc...
219 8e-55 gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japoni... 219 8e-55 >gb|AAM74412.1|AC120497_12 Putative retroelement [Oryza sativa Japonica Group] Length = 540 Score = 205 bits (521), Expect(2) = 4e-70 Identities = 92/133 (69%), Positives = 105/133 (78%) Frame = -1 Query: 399 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 220 KT ++L H FWP +++D+ERFV C C+KAKSR PHGLY PL VP PW DISMDF+ Sbjct: 136 KTEDVLAMHFFWPRMRKDIERFVARCTTCQKAKSRLNPHGLYMPLPVPSIPWADISMDFV 195 Query: 219 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 40 LGLPRT +G DSIFVVVDRFSKMAHFIPC KSDDA H+A +FF IVR HG+P +IVSD Sbjct: 196 LGLPRTKRGRDSIFVVVDRFSKMAHFIPCHKSDDAVHIADMFFHKIVRLHGMPSTIVSDS 255 Query: 39 DTKFLSHFWRVLW 1 D KFLSHFWR LW Sbjct: 256 DAKFLSHFWRTLW 268 Score = 86.7 bits (213), Expect(2) = 4e-70 Identities = 45/90 (50%), Positives = 56/90 (62%), Gaps = 1/90 (1%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLA-PFGKYFLHEGFFYCE 488 ADALSRRYT LS L +I G E +KE Y+ D F E+ C + K+ ++ GF Y Sbjct: 46 ADALSRRYTFLSQLDCRIFGLESIKEQYALDPDFNEVMINCKEGRTWNKFVINGGFVYRA 105 Query: 487 GRLCISSCSTRILLVKEAHCGGLMGHFGVK 398 RLCI S R+LL++EAH GGL GHFG K Sbjct: 106 NRLCIPVGSVRLLLIQEAHGGGLTGHFGAK 135 >emb|CAN84067.1| hypothetical protein VITISV_041979 [Vitis vinifera] Length = 364 Score = 198 bits (503), Expect(2) = 7e-70 Identities = 89/135 (65%), Positives = 107/135 (79%) Frame = -1 Query: 405 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 226 V KT ++LHEH FWP +KRDVER CI C++ KSR LPHGLYT L VP PWV+I MD Sbjct: 181 VRKTLDVLHEHFFWPKMKRDVERACARCITCRRTKSRVLPHGLYTLLPVPSAPWVNIYMD 240 Query: 225 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 46 F+LGLPR+ G DSIFVVVDRFSKM HFI C K++DA+H+A+LFF IV +G+P+SIVS Sbjct: 241 FVLGLPRSRNGRDSIFVVVDRFSKMTHFISCHKTNDATHIANLFFREIVWLYGVPKSIVS 300 Query: 45 DRDTKFLSHFWRVLW 1 DRD KFL +FW+VLW Sbjct: 301 DRDVKFLRYFWKVLW 315 Score = 92.8 bits (229), Expect(2) = 7e-70 Identities = 43/89 (48%), Positives = 62/89 (69%) Frame = 
-3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEG 485 ADALSRRY L+STL K+LGFE +KE+Y++D+ F ++ C FGK++ +G+ + + Sbjct: 94 ADALSRRYALVSTLNAKLLGFEYVKELYANDNDFASVYGACEKTAFGKFYRLDGYLFRKN 153 Query: 484 RLCISSCSTRILLVKEAHCGGLMGHFGVK 398 LC+ + S LLV+EAH G MGHFGV+ Sbjct: 154 ILCVPNSSMCELLVREAHGGDXMGHFGVR 182 >emb|CAN80130.1| hypothetical protein VITISV_001989 [Vitis vinifera] Length = 340 Score = 180 bits (456), Expect(2) = 1e-65 Identities = 84/135 (62%), Positives = 99/135 (73%) Frame = -1 Query: 405 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 226 V KT ++LHEH FWP +KRDVER CI C++AKSR LPHGLYTPL VP PWVDISMD Sbjct: 183 VRKTLDVLHEHFFWPKMKRDVERACARCITCRQAKSRVLPHGLYTPLPVPSAPWVDISMD 242 Query: 225 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 46 F+LGLPR+ MAHFI K+DDA+H+A+LFF IVR +G+PRSIVS Sbjct: 243 FVLGLPRS--------------RNMAHFISXHKTDDATHIANLFFREIVRLYGVPRSIVS 288 Query: 45 DRDTKFLSHFWRVLW 1 DRD KFLS+FW+VLW Sbjct: 289 DRDVKFLSYFWKVLW 303 Score = 96.7 bits (239), Expect(2) = 1e-65 Identities = 46/89 (51%), Positives = 61/89 (68%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEG 485 ADALSRRY L+STL K+LGFE +KE+Y +D F ++ C FGK++ H G+ + E Sbjct: 96 ADALSRRYALVSTLNAKLLGFEYVKELYVNDDDFASVYGACEKTTFGKFYRHLGYLFREN 155 Query: 484 RLCISSCSTRILLVKEAHCGGLMGHFGVK 398 RL + + LLV+EAH GGLMGHFGV+ Sbjct: 156 RLRVPNSFMNDLLVREAHGGGLMGHFGVR 184 >emb|CAN68499.1| hypothetical protein VITISV_041099 [Vitis vinifera] Length = 1115 Score = 176 bits (445), Expect(2) = 2e-61 Identities = 80/104 (76%), Positives = 91/104 (87%) Frame = -1 Query: 312 KKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPC 133 ++AKSR LPHGLYTPL VP PWVDISMDF+LGLPR+ G DSIFVVVDRFSKMAHFI C Sbjct: 870 RQAKSRVLPHGLYTPLPVPSAPWVDISMDFVLGLPRSRNGRDSIFVVVDRFSKMAHFISC 929 Query: 132 RKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 K+DDA+H+A+LFF IVR HG+PRSIVSDRD KFLS+FW+VLW Sbjct: 930 HKTDDATHIANLFFRKIVRLHGVPRSIVSDRDVKFLSYFWKVLW 973 Score = 86.7 bits (213), 
Expect(2) = 2e-61 Identities = 41/82 (50%), Positives = 56/82 (68%) Frame = -3 Query: 661 DALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEGR 482 DALSRRY L+STL K+LGFE +KE+Y++D F ++ C FGK++ +G+ + E R Sbjct: 771 DALSRRYALVSTLNAKLLGFEYVKELYANDDDFSSVYGACEKITFGKFYRLDGYLFRENR 830 Query: 481 LCISSCSTRILLVKEAHCGGLM 416 LC+ + S LLV EAH GGLM Sbjct: 831 LCVPNSSMLELLVHEAHGGGLM 852 >ref|XP_006575965.1| PREDICTED: uncharacterized protein LOC102669237, partial [Glycine max] Length = 1520 Score = 206 bits (523), Expect(2) = 9e-61 Identities = 88/135 (65%), Positives = 107/135 (79%) Frame = -1 Query: 405 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 226 + KT +L E +WPH+K+DV + C+ C +AKSR +PHGLY PL +P PWVDISMD Sbjct: 1258 IDKTLVLLKEKFYWPHMKKDVHKHCTRCVACLQAKSRVMPHGLYIPLPIPSTPWVDISMD 1317 Query: 225 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 46 F+LGLPRT +G+DSIFVVVDRFSKMAHFIPC K DDA H++ LFF +VR HG+PR+IVS Sbjct: 1318 FVLGLPRTQRGVDSIFVVVDRFSKMAHFIPCHKVDDAFHISKLFFKEVVRLHGLPRTIVS 1377 Query: 45 DRDTKFLSHFWRVLW 1 DRD KFLSHFW+ LW Sbjct: 1378 DRDAKFLSHFWKTLW 1392 Score = 54.7 bits (130), Expect(2) = 9e-61 Identities = 24/37 (64%), Positives = 29/37 (78%) Frame = -3 Query: 511 HEGFFYCEGRLCISSCSTRILLVKEAHCGGLMGHFGV 401 HEG+ + EG+LCI S R LLVKE+H GGLMGHFG+ Sbjct: 1222 HEGYLFKEGKLCIPQGSIRKLLVKESHEGGLMGHFGI 1258 >gb|ADP20179.1| gag-pol polyprotein [Silene latifolia] Length = 1475 Score = 175 bits (444), Expect(2) = 4e-58 Identities = 80/135 (59%), Positives = 102/135 (75%) Frame = -1 Query: 405 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 226 + KTY+IL E +WP + DV+ + C C+++KS G YTPL VP +PW DISMD Sbjct: 1098 IQKTYDILQEQFYWPKMLGDVQDVIKRCAPCQQSKSY-FQTGPYTPLPVPNQPWEDISMD 1156 Query: 225 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 46 FI+ LPRT +G DSI VVVDRFSKMAHFI C+K++DA+ VA L+F +V+ HGIP+SIVS Sbjct: 1157 FIVALPRTQRGKDSIMVVVDRFSKMAHFIACKKTEDATSVAELYFKEVVKLHGIPKSIVS 1216 Query: 45 DRDTKFLSHFWRVLW 1 DRD+KF+SHFWR LW Sbjct: 1217 
DRDSKFMSHFWRTLW 1231 Score = 76.3 bits (186), Expect(2) = 4e-58 Identities = 37/92 (40%), Positives = 56/92 (60%), Gaps = 3/92 (3%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFK---EIFEKCLLAPFGKYFLHEGFFY 494 ADALSRR+ +LS ++ ++LGFE +KE+Y D FK E+ + + KY + GF + Sbjct: 1008 ADALSRRFIMLSFMEQRVLGFEYMKELYVEDPDFKGEWELLQSGQIKLKSKYLVQNGFLF 1067 Query: 493 CEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 398 +LC+ R LL++E H GL GHFG++ Sbjct: 1068 FGNKLCVPRGPYRNLLIREVHSNGLAGHFGIQ 1099 >gb|EOX94045.1| DNA/RNA polymerases superfamily protein, partial [Theobroma cacao] Length = 624 Score = 177 bits (449), Expect(2) = 5e-58 Identities = 80/133 (60%), Positives = 96/133 (72%) Frame = -1 Query: 399 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 220 KT ++ + +WP ++RDVER V C C K GLY PL P PW+ +SMDF+ Sbjct: 489 KTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFV 548 Query: 219 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 40 LGLP+T+KG DSIFVVVDRFSKMAHFIPC ++ DA+H+A LFF IVR HGIP SIVSDR Sbjct: 549 LGLPKTAKGFDSIFVVVDRFSKMAHFIPCFRTSDATHIAELFFREIVRLHGIPTSIVSDR 608 Query: 39 DTKFLSHFWRVLW 1 D KF+ HFWR LW Sbjct: 609 DVKFMGHFWRTLW 621 Score = 73.9 bits (180), Expect(2) = 5e-58 Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEI---FEKCLLAPFGKYFLHEGFFY 494 ADALSRR +LS + T++ GFE LK YSSD YF +I + L A Y LHE + + Sbjct: 397 ADALSRRCKMLSVMSTQVTGFEELKNQYSSDSYFSKIIADLQGSLQAENLPYRLHEDYLF 456 Query: 493 CEGRLCISSCSTRILLVKEAHCGGLMGHFG 404 +LCI S R +++E H GL GHFG Sbjct: 457 KGNQLCIPEGSLREQIIRELHGNGLGGHFG 486 >gb|EOY01158.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 786 Score = 175 bits (444), Expect(2) = 2e-57 Identities = 79/133 (59%), Positives = 96/133 (72%) Frame = -1 Query: 399 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 220 KT ++ + +WP ++RDVER V C C K GLY PL P PW+ +SMDF+ Sbjct: 489 KTLAMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFV 548 Query: 219 
LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 40 LGLP+T+KG DSIFVVVDRFSKMAHFIPC ++ +A+H+A LFF IVR HGIP SIVSDR Sbjct: 549 LGLPKTAKGFDSIFVVVDRFSKMAHFIPCFRTSNATHIAELFFREIVRLHGIPTSIVSDR 608 Query: 39 DTKFLSHFWRVLW 1 D KF+ HFWR LW Sbjct: 609 DVKFMGHFWRTLW 621 Score = 73.9 bits (180), Expect(2) = 2e-57 Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEI---FEKCLLAPFGKYFLHEGFFY 494 ADALSRR +LS + T++ GFE LK YSSD YF +I + L A Y LHE + + Sbjct: 397 ADALSRRCKMLSVMSTQVTGFEELKNQYSSDSYFSKIIADLQGSLQAENLPYRLHEDYLF 456 Query: 493 CEGRLCISSCSTRILLVKEAHCGGLMGHFG 404 +LCI S R +++E H GL GHFG Sbjct: 457 KGNQLCIPEGSLREQIIRELHGNGLGGHFG 486 >ref|XP_006596896.1| PREDICTED: uncharacterized protein LOC102664455 [Glycine max] Length = 1176 Score = 227 bits (579), Expect = 2e-57 Identities = 108/170 (63%), Positives = 125/170 (73%), Gaps = 2/170 (1%) Frame = -1 Query: 504 DFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFV 331 D F+ K + C + E +S+ H G+ V KT EIL EH FWPH++RDV +F Sbjct: 773 DGFLFKANKLCVPKCSIRELLVSESHEGGLMGHFGVQKTLEILQEHFFWPHMRRDVHKFC 832 Query: 330 GSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKM 151 G CI CK+AKS+ PHGLYTPL VP+ PW DISMDF+LGLP+T G DS+FVVVDRFSKM Sbjct: 833 GHCIVCKQAKSKVKPHGLYTPLPVPEYPWTDISMDFVLGLPKTKNGKDSVFVVVDRFSKM 892 Query: 150 AHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 AHFIPC+K DDA HVA LFF IVR HG+PRSIVSDRD KFLSHFWR LW Sbjct: 893 AHFIPCKKVDDACHVADLFFKEIVRLHGLPRSIVSDRDAKFLSHFWRTLW 942 >gb|AAU90169.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1154 Score = 187 bits (475), Expect(2) = 1e-56 Identities = 85/119 (71%), Positives = 97/119 (81%) Frame = -1 Query: 357 LKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIF 178 L+ DVER+V C+ KAKS+ PHGLYTPL VP PW DISMDF+LGLPRT +G DSIF Sbjct: 829 LRHDVERYVQRCVTSHKAKSKLNPHGLYTPLPVPNAPWEDISMDFVLGLPRTRRGRDSIF 888 Query: 177 VVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 V 
VDRFSKMAHFIPC KSDDASHVA LFF +VR HG+PR+IVSDRD KF+S+FW+ LW Sbjct: 889 VAVDRFSKMAHFIPCNKSDDASHVADLFFREVVRLHGVPRTIVSDRDVKFMSYFWKTLW 947 Score = 59.7 bits (143), Expect(2) = 1e-56 Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 1/71 (1%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAP-FGKYFLHEGFFYCE 488 ADALSR+ LL+ L K+ E LKE+YS D F + + KCL + KY +H+GF + Sbjct: 760 ADALSRKSVLLTQLDVKVSSLESLKELYSKDSEFSDPYSKCLDGKGWEKYHVHDGFLFRA 819 Query: 487 GRLCISSCSTR 455 +LC+ S R Sbjct: 820 DKLCVPESSLR 830 >gb|ADP20178.1| gag-pol polyprotein [Silene latifolia] Length = 1518 Score = 167 bits (422), Expect(2) = 2e-56 Identities = 76/135 (56%), Positives = 99/135 (73%) Frame = -1 Query: 405 VSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMD 226 V KT EIL + +WP + DV+ + C +C+ +KS P G YTPL VP +PW D+SMD Sbjct: 1110 VQKTLEILQDQFYWPRMMGDVQIILRRCSKCQLSKSSFQP-GPYTPLPVPSKPWEDLSMD 1168 Query: 225 FILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVS 46 FI+ LPRT +G DS+ VVVDRFSKMAHF+ C+K++DA VA LF IVR HG+P++IVS Sbjct: 1169 FIVALPRTQRGKDSVMVVVDRFSKMAHFVACKKTEDAVSVAELFLKEIVRLHGVPKTIVS 1228 Query: 45 DRDTKFLSHFWRVLW 1 DRDTKF+ +FW+ LW Sbjct: 1229 DRDTKFMGYFWKTLW 1243 Score = 79.0 bits (193), Expect(2) = 2e-56 Identities = 41/92 (44%), Positives = 57/92 (61%), Gaps = 3/92 (3%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIF---EKCLLAPFGKYFLHEGFFY 494 ADALSRR++LLS + ++LGFE +KE+Y D F E + + KY L EGF + Sbjct: 1020 ADALSRRHSLLSVMSNRVLGFEFMKELYKEDPDFSEEWITQTEGHKNQGSKYLLQEGFLF 1079 Query: 493 CEGRLCISSCSTRILLVKEAHCGGLMGHFGVK 398 +LC+ S R LL++E H GG+ GHFGV+ Sbjct: 1080 QGNKLCVPRGSYRDLLIREVHSGGMGGHFGVQ 1111 >gb|AAW28578.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1588 Score = 222 bits (565), Expect = 9e-56 Identities = 106/174 (60%), Positives = 130/174 (74%), Gaps = 2/174 (1%) Frame = -1 Query: 516 FCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDV 343 F ++D F+ K + C + L E ++ + H G+ V KT EIL EH +WP +++DV Sbjct: 1125 
FNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDV 1184 Query: 342 ERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDR 163 E+ C+ECK+AKSRTLPHGLYTPL V PW+DISMDFILGLPRT G DSIFVVVDR Sbjct: 1185 EKVCSYCLECKQAKSRTLPHGLYTPLPVSNSPWIDISMDFILGLPRTKYGKDSIFVVVDR 1244 Query: 162 FSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 FSKMA FIPC+K++DASHVA LF +V+ HGIPR+IVSDRD KFLSHFWR+LW Sbjct: 1245 FSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSHFWRILW 1298 Score = 114 bits (286), Expect = 2e-23 Identities = 57/112 (50%), Positives = 77/112 (68%), Gaps = 1/112 (0%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEG 485 ADALSRRY L+STL +K+LGF+ +K +Y++D F EIF +C L PF K+ L + F + E Sbjct: 1077 ADALSRRYVLISTLTSKLLGFDQIKFLYANDSDFGEIFAECKLGPFEKFNLQDEFLFKEN 1136 Query: 484 RLCISSCSTRILLVKEAHCGGLMGHFGV-KNL*NSS*TCFLATFKKRCRKIC 332 +LC+ +CS R L V+EAHCGGLMGHFGV K L S + + +K K+C Sbjct: 1137 KLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDVEKVC 1188 >gb|EOY16699.1| Uncharacterized protein TCM_035549 [Theobroma cacao] Length = 1392 Score = 169 bits (429), Expect(2) = 1e-55 Identities = 77/133 (57%), Positives = 94/133 (70%) Frame = -1 Query: 399 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 220 KT ++ + +WP +++DVER V C C K GLY PL P PW+ +SMDF+ Sbjct: 977 KTLAMVADRYYWPKMRQDVERLVKRCPTCLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFV 1036 Query: 219 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 40 LGLP+T+K DSIFVVVDRFSKMAHFIPC ++ DA+H+A LFF IVR H IP SIVSDR Sbjct: 1037 LGLPKTAKRFDSIFVVVDRFSKMAHFIPCFRTSDATHIAELFFREIVRLHRIPTSIVSDR 1096 Query: 39 DTKFLSHFWRVLW 1 D KF+ HFWR LW Sbjct: 1097 DVKFMGHFWRTLW 1109 Score = 73.9 bits (180), Expect(2) = 1e-55 Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEI---FEKCLLAPFGKYFLHEGFFY 494 ADALSRR +LS + T++ GFE LK YSSD YF +I + L A Y LHE + + Sbjct: 885 ADALSRRCKMLSVMSTQVTGFEELKNQYSSDSYFSKIIADLQGSLQAENLPYRLHEDYLF 944 Query: 493 
CEGRLCISSCSTRILLVKEAHCGGLMGHFG 404 +LCI S R +++E H GL GHFG Sbjct: 945 KGNQLCIPEGSLREQIIRELHGNGLGGHFG 974 >gb|EOX95569.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1452 Score = 169 bits (427), Expect(2) = 2e-55 Identities = 77/133 (57%), Positives = 94/133 (70%) Frame = -1 Query: 399 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 220 KT ++ + +WP ++RDVER V C C K GLY PL P PW+ +SMDF+ Sbjct: 1037 KTLVMVADRYYWPKMRRDVERLVKRCPACLFGKGSAQNTGLYVPLPEPDAPWIHLSMDFV 1096 Query: 219 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 40 LGLP+T+KG DSIFVVVDRFSKMAHFIPC ++ DA+H+A LFF IV HGIP SIVSDR Sbjct: 1097 LGLPKTTKGFDSIFVVVDRFSKMAHFIPCFRTSDATHIAELFFREIVILHGIPTSIVSDR 1156 Query: 39 DTKFLSHFWRVLW 1 KF+ +FWR LW Sbjct: 1157 HVKFMGYFWRTLW 1169 Score = 73.9 bits (180), Expect(2) = 2e-55 Identities = 41/90 (45%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEI---FEKCLLAPFGKYFLHEGFFY 494 ADALSRR +LS + T++ GFE LK YSSD YF +I + L A Y LHE + + Sbjct: 945 ADALSRRCKMLSVMSTQVTGFEELKNQYSSDSYFSKIIADLQGSLQAENLPYRLHEDYLF 1004 Query: 493 CEGRLCISSCSTRILLVKEAHCGGLMGHFG 404 +LCI S R +++E H GL GHFG Sbjct: 1005 KGNQLCIPEGSLREQIIRELHGNGLGGHFG 1034 >gb|AAW28577.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1588 Score = 221 bits (562), Expect = 2e-55 Identities = 106/174 (60%), Positives = 130/174 (74%), Gaps = 2/174 (1%) Frame = -1 Query: 516 FCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDV 343 F ++D F+ K + C + L E ++ + H G+ V KT EIL EH +WP +++DV Sbjct: 1125 FNLQDEFLFKENKLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDV 1184 Query: 342 ERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDR 163 E+ C+ECK+AKSRTLPHGLYTPL V PW+DISMDFILGLPRT G DSIFVVVDR Sbjct: 1185 EKVCSYCLECKQAKSRTLPHGLYTPLPVSNFPWIDISMDFILGLPRTKYGKDSIFVVVDR 1244 Query: 162 FSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 FSKMA FIPC+K++DASHVA LF +V+ HGIPR+IVSDRD KFLSHFWR+LW Sbjct: 1245 
FSKMARFIPCKKTNDASHVADLFVKEVVKLHGIPRTIVSDRDAKFLSHFWRILW 1298 Score = 114 bits (286), Expect = 2e-23 Identities = 57/112 (50%), Positives = 77/112 (68%), Gaps = 1/112 (0%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEG 485 ADALSRRY L+STL +K+LGF+ +K +Y++D F EIF +C L PF K+ L + F + E Sbjct: 1077 ADALSRRYVLISTLTSKLLGFDQIKFLYANDSDFGEIFAECKLGPFEKFNLQDEFLFKEN 1136 Query: 484 RLCISSCSTRILLVKEAHCGGLMGHFGV-KNL*NSS*TCFLATFKKRCRKIC 332 +LC+ +CS R L V+EAHCGGLMGHFGV K L S + + +K K+C Sbjct: 1137 KLCVPNCSLRELFVREAHCGGLMGHFGVPKTLEILSEHFYWPSMRKDVEKVC 1188 >gb|AAQ56407.1| putative gag-pol polyprotein [Oryza sativa Japonica Group] Length = 1619 Score = 220 bits (561), Expect = 3e-55 Identities = 106/157 (67%), Positives = 121/157 (77%) Frame = -1 Query: 471 HLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRT 292 H++LL+ + H G+ V KT +IL +H+FWP ++RDVERFV C C+KAKSR Sbjct: 1117 HMLLLQ----EAHGGGLMGHFGVKKTEDILADHLFWPKMRRDVERFVARCTTCQKAKSRL 1172 Query: 291 LPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDAS 112 PHGLY PL VP PW DISMDF+LGLPRT KG DSIFVVVDRFSKMAHFIPC KSDDA+ Sbjct: 1173 NPHGLYMPLPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVVDRFSKMAHFIPCHKSDDAT 1232 Query: 111 HVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 HVA LFF IVR HG+P +IVSDRDTKFLSHFWR LW Sbjct: 1233 HVADLFFREIVRLHGVPNTIVSDRDTKFLSHFWRTLW 1269 Score = 86.7 bits (213), Expect = 6e-15 Identities = 44/89 (49%), Positives = 57/89 (64%), Gaps = 1/89 (1%) Frame = -3 Query: 661 DALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLA-PFGKYFLHEGFFYCEG 485 DALSRRY +LS L KI G E +KE Y+ D FK++ C + K+ L GF + Sbjct: 1048 DALSRRYAMLSQLDFKIFGLETIKEQYAHDDDFKDVLLNCKEGRTWNKFVLTNGFVFRAN 1107 Query: 484 RLCISSCSTRILLVKEAHCGGLMGHFGVK 398 +LCI + S +LL++EAH GGLMGHFGVK Sbjct: 1108 KLCIPASSVHMLLLQEAHGGGLMGHFGVK 1136 >gb|AAP43919.1| integrase [Gossypium hirsutum] Length = 334 Score = 220 bits (561), Expect = 3e-55 Identities = 104/182 (57%), Positives = 130/182 (71%), Gaps = 2/182 (1%) Frame = -1 Query: 540 
C*HLLENIFCMKDFFIVKVD--CAYHLVLLEFYLSKKHIVGV*WDILVSKTYEILHEHVF 367 C H F + D + +++ C + E + + H G+ V+KT +IL EH Sbjct: 134 CGHTAFEKFYLVDGLLFRLNRLCIPKCSMRELLIHEAHSGGLMGHFGVAKTLDILQEHFH 193 Query: 366 WPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFILGLPRTSKGID 187 WPH+K+DVE+ CI CK+AKS+ + HGLYTPL +P PWVD+SMDFILGLPRT KG D Sbjct: 194 WPHMKKDVEKVCSKCITCKQAKSKVMLHGLYTPLPIPTSPWVDLSMDFILGLPRTKKGRD 253 Query: 186 SIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDRDTKFLSHFWRV 7 SIFVVVDRFSKM+HFIPC K+DDA+HVA LFF +VR HGIP++IVSDRD KFLSHFW+V Sbjct: 254 SIFVVVDRFSKMSHFIPCHKTDDATHVADLFFKEVVRLHGIPKTIVSDRDVKFLSHFWKV 313 Query: 6 LW 1 LW Sbjct: 314 LW 315 Score = 104 bits (259), Expect = 3e-20 Identities = 56/114 (49%), Positives = 69/114 (60%), Gaps = 1/114 (0%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLAPFGKYFLHEGFFYCEG 485 ADALSRRYTL++TL K+LGFE +KE+Y D F I++ C F K++L +G + Sbjct: 94 ADALSRRYTLITTLNAKVLGFEHIKELYDDDTDFSHIYKNCGHTAFEKFYLVDGLLFRLN 153 Query: 484 RLCISSCSTRILLVKEAHCGGLMGHFGV-KNL*NSS*TCFLATFKKRCRKICRK 326 RLCI CS R LL+ EAH GGLMGHFGV K L KK K+C K Sbjct: 154 RLCIPKCSMRELLIHEAHSGGLMGHFGVAKTLDILQEHFHWPHMKKDVEKVCSK 207 >gb|EOX92840.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 647 Score = 169 bits (428), Expect(2) = 7e-55 Identities = 77/133 (57%), Positives = 93/133 (69%) Frame = -1 Query: 399 KTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTPLEVPKEPWVDISMDFI 220 KT ++ + +WP + RDVER V C C K GLY PL P PW+ +SMDF+ Sbjct: 305 KTLAMVADRYYWPKMHRDVERLVKRCSTCLFGKGSAQNTGLYVPLLEPDAPWIHLSMDFV 364 Query: 219 LGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFTWIVRFHGIPRSIVSDR 40 LGLP+ +KG DSIFVVV +FSKMAHFIPC K+ DA+H+A LFF +VR HGIP SIVSDR Sbjct: 365 LGLPKIAKGFDSIFVVVYQFSKMAHFIPCFKTSDATHIAELFFCEVVRLHGIPTSIVSDR 424 Query: 39 DTKFLSHFWRVLW 1 D KF+ HFWR LW Sbjct: 425 DVKFMGHFWRTLW 437 Score = 71.6 bits (174), Expect(2) = 7e-55 Identities = 41/90 (45%), Positives = 52/90 (57%), Gaps = 3/90 (3%) Frame = -3 Query: 664 
ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEI---FEKCLLAPFGKYFLHEGFFY 494 ADALSRR +LS + T++ GFE LK YSSD YF +I + L A Y LHE + + Sbjct: 213 ADALSRRCKMLSVMSTQVTGFEELKNQYSSDSYFSKIIADLQGSLQAGNLPYRLHEDYLF 272 Query: 493 CEGRLCISSCSTRILLVKEAHCGGLMGHFG 404 +LCI S R ++ E H GL GHFG Sbjct: 273 KGNQLCILEGSLREQIIGELHGNGLGGHFG 302 >gb|ABF97027.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 889 Score = 219 bits (557), Expect = 8e-55 Identities = 104/149 (69%), Positives = 115/149 (77%) Frame = -1 Query: 447 LSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTP 268 L + H G+ V KT +IL +H FWP ++RDVERFV C C+KAKSR PHGLY P Sbjct: 538 LQEAHGGGLMGHFGVKKTEDILADHFFWPKMRRDVERFVARCTTCQKAKSRLNPHGLYMP 597 Query: 267 LEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFT 88 L VP PW DISMDF+LGLPRT KG DSIFVVVDRFSKMAHFIPC KSDDA+HVA LFF Sbjct: 598 LPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVVDRFSKMAHFIPCHKSDDATHVADLFFR 657 Query: 87 WIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 IVR HG+P +IVSDRDTKFLSHFWR LW Sbjct: 658 EIVRLHGVPNTIVSDRDTKFLSHFWRTLW 686 Score = 91.7 bits (226), Expect = 2e-16 Identities = 46/90 (51%), Positives = 60/90 (66%), Gaps = 1/90 (1%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLA-PFGKYFLHEGFFYCE 488 ADALSRRY +LS L KI G E +KE Y+ D FK++ C+ + K+ L GF + Sbjct: 464 ADALSRRYAMLSQLDFKIFGLETIKEQYAHDDDFKDVLLNCMEGRTWNKFVLTNGFVFRA 523 Query: 487 GRLCISSCSTRILLVKEAHCGGLMGHFGVK 398 +LCI + S R+LL++EAH GGLMGHFGVK Sbjct: 524 NKLCIPASSVRMLLLQEAHGGGLMGHFGVK 553 >gb|AAQ56388.1| putative gag-pol polyprotein [Oryza sativa Japonica Group] gi|91795218|gb|ABE60890.1| putative polyprotein [Oryza sativa Japonica Group] Length = 1616 Score = 219 bits (557), Expect = 8e-55 Identities = 104/149 (69%), Positives = 115/149 (77%) Frame = -1 Query: 447 LSKKHIVGV*WDILVSKTYEILHEHVFWPHLKRDVERFVGSCIECKKAKSRTLPHGLYTP 268 L + H G+ V KT +IL +H FWP ++RDVERFV C C+KAKSR PHGLY P Sbjct: 1225 LQEAHGGGLMGHFGVKKTEDILADHFFWPKMRRDVERFVARCTTCQKAKSRLNPHGLYMP 1284 Query: 267 
LEVPKEPWVDISMDFILGLPRTSKGIDSIFVVVDRFSKMAHFIPCRKSDDASHVASLFFT 88 L VP PW DISMDF+LGLPRT KG DSIFVVVDRFSKMAHFIPC KSDDA+HVA LFF Sbjct: 1285 LPVPSVPWEDISMDFVLGLPRTKKGRDSIFVVVDRFSKMAHFIPCHKSDDATHVADLFFR 1344 Query: 87 WIVRFHGIPRSIVSDRDTKFLSHFWRVLW 1 IVR HG+P +IVSDRDTKFLSHFWR LW Sbjct: 1345 EIVRLHGVPNTIVSDRDTKFLSHFWRTLW 1373 Score = 89.4 bits (220), Expect = 9e-16 Identities = 46/90 (51%), Positives = 58/90 (64%), Gaps = 1/90 (1%) Frame = -3 Query: 664 ADALSRRYTLLSTLQTKILGFEMLKEMYSSDHYFKEIFEKCLLA-PFGKYFLHEGFFYCE 488 ADALSRRY +LS L KI G E +KE Y+ D FK + C + K+ L GF + Sbjct: 1151 ADALSRRYAMLSQLDFKIFGLETIKEQYAHDDDFKNVLLNCKEGRTWNKFVLTNGFVFRA 1210 Query: 487 GRLCISSCSTRILLVKEAHCGGLMGHFGVK 398 +LCI + S R+LL++EAH GGLMGHFGVK Sbjct: 1211 NKLCIPASSVRMLLLQEAHGGGLMGHFGVK 1240
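
Note: each alignment in the report above carries a statistics fragment of the form "Score = ... bits (...), Expect = ... Identities = a/b (p%), Positives = ... [Gaps = ...] Frame = n". The following is a minimal sketch, not part of the BLAST distribution, of how those fragments could be pulled out of the flattened text with a regular expression. The names HSP_RE and parse_hsps are hypothetical, and the pattern is only assumed to cover the layout seen in this report (legacy BLASTX 2.2.25 text output).

```python
import re

# Pattern for one HSP statistics fragment as printed in this report, e.g.
# "Score = 205 bits (521), Expect(2) = 4e-70 Identities = 92/133 (69%),
#  Positives = 105/133 (78%) Frame = -1". The Gaps field is optional.
# Whitespace is matched with \s so that stray line breaks are tolerated.
HSP_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s+"
    r"Expect(?:\(\d+\))?\s*=\s*(?P<evalue>[\d.eE+-]+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s+\(\d+%\),\s+"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s+\(\d+%\)"
    r"(?:,\s+Gaps\s*=\s*(?P<gaps>\d+)/\d+\s+\(\d+%\))?\s+"
    r"Frame\s*=\s*(?P<frame>[+-]\d)"
)


def parse_hsps(text):
    """Return one dict per HSP statistics fragment found in `text`."""
    hsps = []
    for m in HSP_RE.finditer(text):
        ident, length = int(m["ident"]), int(m["length"])
        evalue = m["evalue"]
        # Assumption: a bare exponent such as "e-105" means 1e-105.
        if evalue.lower().startswith("e"):
            evalue = "1" + evalue
        hsps.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": float(evalue),
            "identities": ident,
            "positives": int(m["pos"]),
            "align_len": length,
            # Recomputed from the counts instead of trusting the rounded "(p%)".
            "pct_identity": round(100.0 * ident / length, 1),
            "gaps": int(m["gaps"]) if m["gaps"] else 0,
            "frame": int(m["frame"]),
        })
    return hsps


if __name__ == "__main__":
    sample = (
        "Score = 205 bits (521), Expect(2) = 4e-70 Identities = 92/133 (69%), "
        "Positives = 105/133 (78%) Frame = -1 "
        "Score = 86.7 bits (213), Expect(2) = 4e-70 Identities = 45/90 (50%), "
        "Positives = 56/90 (62%), Gaps = 1/90 (1%) Frame = -3"
    )
    for hsp in parse_hsps(sample):
        print(hsp["bits"], hsp["evalue"], hsp["pct_identity"], hsp["frame"])
```

Recomputing the identity percentage from the a/b counts avoids relying on the integer-rounded value printed in parentheses. The sketch does not associate an HSP with its subject identifier; for that, the text would first have to be split on the ">" that starts each subject block.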