BLASTX nr result
ID: Cephaelis21_contig00019478
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00019478 (2623 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                       Score    E
Sequences producing significant alignments:                            (bits)  Value

gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-...     143    5e-62
gb|AFP55557.1| non-ltr retroelement reverse transcriptase [Rosa ...     147    1e-58
gb|EEE69154.1| hypothetical protein OsJ_28290 [Oryza sativa Japo...     142    6e-57
emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulga...     151    1e-56
gb|ABA98491.1| retrotransposon protein, putative, unclassified [...     145    1e-54

>gb|AAG51783.1|AC079679_3 reverse transcriptase, putative; 16838-20266 [Arabidopsis thaliana]
Length = 1142

Score = 143 bits (360), Expect(3) = 5e-62
Identities = 61/144 (42%), Positives = 95/144 (65%)
Frame = -2

Query: 1026 LGMPSMSGRSKVQIFQGVWEKVMGRIRGWSEKFLSTAGKEILWKDVEQAIPIYTMSCFLL 847
LG+P G SK ++F V +++ RI GWS KFLS GKE++ K V +P Y MSCF L
Sbjct: 535 LGLPESLGGSKTKVFSFVRDRLQSRINGWSAKFLSKGGKEVMIKSVAATLPRYVMSCFRL 594

Query: 846 PESFQKEIQATMAQYRWQHKSDNQGVYWVAWRKLCTPKKESGMGFRNLRSINLSMLAKQG 667
P++ ++ + +A++ W D++G++W+AW KLC+ K + G+GFRN+ N ++LAKQ
Sbjct: 595 PKAITSKLTSAVAKFWWSSNGDSRGMHWMAWDKLCSSKSDGGLGFRNVDDFNSALLAKQL 654

Query: 666 WRILINPGSLMANIVKARYFSQNH 595
WR++ P SL A + K RYF +++
Sbjct: 655 WRLITAPDSLFAKVFKGRYFRKSN 678

Score = 108 bits (269), Expect(3) = 5e-62
Identities = 59/187 (31%), Positives = 91/187 (48%)
Frame = -1

Query: 574 YNPSMTWCDIFSSIQVLRYSSR*RVGSRADIPIWNSPWLPCPFTFQVNCPVSVIIADARV 395
Y+PS W + S+ ++ RVGS A I +WN PW+P F S++ +V
Sbjct: 686 YSPSYGWRSMISARSLVYKGLIKRVGSGASISVWNDPWIPAQFPRPAKYGGSIVDPSLKV 745

Query: 394 AELIREVDNQWNHSLLDAVFWSEEAEIIKGIPLRFHQRPDSWMWHFTHSREISVRSAYYL 215
LI N WN LL +F E+ +I +P+ D+ WHFT + +V+S Y+
Sbjct: 746 KSLIDSRSNFWNIDLLKELFDPEDVPLISALPIGNPNMEDTLGWHFTKAGNYTVKSGYHT 805

Query: 214 AVERGVDYSSNRQASQPSTSNNHPYVEWYTIWQLDVPPKVKHFLWRFAMSALPTQDSLRR 35
A +D + P + Y IW++ PPK++HFLW+ +P ++LR+
Sbjct: 806 A---RLDLNEGTTLIGPDLTTLKAY-----IWKVQCPPKLRHFLWQILSGCVPVSENLRK 857

Query: 34 RGILPPK 14
RGIL K
Sbjct: 858 RGILCDK 864

Score = 37.0 bits (84), Expect(3) = 5e-62
Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 3/71 (4%)
Frame = -3

Query: 1286 LSRIQLHRSSPKVNHLLFADDTLLLGKATLEKAIFCGRFLHFMSMLWS---SWLNLNKSE 1116
++ I++ SP V+HLLFADD+L KA E+ CG L + S +N +KS
Sbjct: 448 ITGIKVATPSPAVSHLLFADDSLFFCKANKEQ---CGIILEILKQYESVSGQQINFSKSS 504

Query: 1115 VFFSPNTSSRI 1083
+ F I
Sbjct: 505 IQFGHKVEDSI 515

>gb|AFP55557.1| non-ltr retroelement reverse transcriptase [Rosa rugosa]
Length = 1747

Score = 147 bits (372), Expect(3) = 1e-58
Identities = 70/160 (43%), Positives = 102/160 (63%)
Frame = -2

Query: 1071 ATFLQIPVTQTHHR*LGMPSMSGRSKVQIFQGVWEKVMGRIRGWSEKFLSTAGKEILWKD 892
+ L +PV H R LG+P++SG+ K ++FQ + ++V R+ GW K LS AGKE+L K
Sbjct: 1024 SAILDMPVVPCHERYLGLPTVSGKDKKKLFQSLPDRVWNRVHGWEGKLLSKAGKEVLIKT 1083

Query: 891 VEQAIPIYTMSCFLLPESFQKEIQATMAQYRWQHKSDNQGVYWVAWRKLCTPKKESGMGF 712
V QAIP YTMS F LP I +A++ W K +G++W W LC KK+ G+GF
Sbjct: 1084 VAQAIPNYTMSVFQLPAGTSDAINKCVARF-WWGKEGGKGIHWRRWSDLCFSKKDGGLGF 1142

Query: 711 RNLRSINLSMLAKQGWRILINPGSLMANIVKARYFSQNHF 592
R+L N ++L KQGWR+++ P SL+A ++KA+YF + F
Sbjct: 1143 RDLSLFNQALLGKQGWRLMMYPDSLVARMLKAKYFPWDDF 1182

Score = 91.7 bits (226), Expect(3) = 1e-58
Identities = 54/187 (28%), Positives = 92/187 (49%)
Frame = -1

Query: 601 EPFFYAHLGYNPSMTWCDIFSSIQVLRYSSR*RVGSRADIPIWNSPWLPCPFTFQVNCPV 422
+ F A LG +PS W ++LR R R+G ++ ++ PW+P +F+
Sbjct: 1180 DDFMEAELGSSPSYLWRSFLWGRELLRKGVRWRIGDGKEVRVFIDPWVPGLPSFRPILRQ 1239

Query: 421 SVIIADARVAELIREVDNQWNHSLLDAVFWSEEAEIIKGIPLRFHQRPDSWMWHFTHSRE 242
GAPLF-LRVSDLLHN-NGGWNMEALNYWFTDDECEAISSITVGATRRPDVYMWNYCKNGR 1297

Query: 241 ISVRSAYYLAVERGVDYSSNRQASQPSTSNNHPYVEWYTIWQLDVPPKVKHFLWRFAMSA 62
+V+S Y+LA E + + N + P W +W+L +PPK+ HFLWR +M
Sbjct: 1298 YTVKSGYWLACEENREEAINIVLA--------PRNFWKHLWKLKLPPKINHFLWRCSMGF 1349

Query: 61 LPTQDSL 41
+P + L
Sbjct: 1350 IPCMEVL 1356

Score = 37.4 bits (85), Expect(3) = 1e-58
Identities = 19/65 (29%), Positives = 32/65 (49%)
Frame = -3

Query: 1277 IQLHRSSPKVNHLLFADDTLLLGKATLEKAIFCGRFLHFMSMLWSSWLNLNKSEVFFSPN 1098
+ + R +P V+HL +ADD+LL AT+ + +N +KS + FSP
Sbjct: 955 VAIARGAPSVSHLFYADDSLLFCDATVTDCMALKNIFSTYEAASGQKINKDKSAICFSPK 1014

Query: 1097 TSSRI 1083
+ + I
Sbjct: 1015 SPAAI 1019

Score = 79.3 bits (194), Expect = 5e-12
Identities = 48/163 (29%), Positives = 71/163 (43%), Gaps = 5/163 (3%)
Frame = -3

Query: 2501 MYWFWH*VLVEILLMLQYYFLAYIFRGH-----ALTGFYGEPRIRFLVTMWNYFKTLKNK 2337
M FW+ +V + +YF+ + TGFYG P W+ ++L+
Sbjct: 351 MCLFWNNKVVVDYISSSFYFINAMVTWEDKKKCRFTGFYGHPETSQRHLSWDLLRSLRRV 410

Query: 2336 REGPWACVGDFIEVLFRSEATGKHPRQSWQMRAFGQALNHCQLRYMGYCGYPFTWSSSHF 2157
PW C GDF E+L +E TG R Q+ F A+ C L + G+ +TW +
Sbjct: 411 CSEPWLCCGDFNEILDFNEKTGAVQRSQRQIDGFRHAVEDCGLYEFAFTGFQYTWDNRRK 470

Query: 2156 DISRSKARLDRCCANSRWSNLFPDTIVTHLDVVSSDHMPILLD 2028
+ K RLDR N + HL +SSDH P+L +
Sbjct: 471 GDANVKERLDRGFGNLALIQQWGGISCHHLVSMSSDHCPLLFE 513

>gb|EEE69154.1| hypothetical protein OsJ_28290 [Oryza sativa Japonica Group]
Length = 615

Score = 142 bits (359), Expect(2) = 6e-57
Identities = 72/195 (36%), Positives = 116/195 (59%)
Frame = -2

Query: 1191 SYFLW*ILTLYEHALVQLVEFEQIRGFFQS*YFIQDHLVIATFLQIPVTQTHHR*LGMPS 1012
SY + +L+LYE L Q + ++ F + + L I + + LG+P
Sbjct: 152 SYRIKNVLSLYEDCLGQTINKDKSTIMFSKNSTTVEKENVMAGLGIQSEARNEKYLGLPI 211

Query: 1011 MSGRSKVQIFQGVWEKVMGRIRGWSEKFLSTAGKEILWKDVEQAIPIYTMSCFLLPESFQ 832
GRS+ Q F + ++V R++GW E+ LS AGKEIL K V Q+IP Y MSCF L ++
Sbjct: 212 YMGRSRSQTFSYLKDRVWKRLQGWKERLLSKAGKEILIKSVVQSIPTYAMSCFDLTKTLC 271

Query: 831 KEIQATMAQYRWQHKSDNQGVYWVAWRKLCTPKKESGMGFRNLRSINLSMLAKQGWRILI 652
E+ + + ++ W + + V+WV+W LC K++ G+G+R+L NL+MLA+QGWR+++
Sbjct: 272 NELGSLVCRFWWAQQENENKVHWVSWELLCRRKEQGGIGYRDLHLFNLAMLARQGWRLIM 331

Query: 651 NPGSLMANIVKARYF 607
P SL A +++A+YF
Sbjct: 332 EPMSLCAQVLRAKYF 346

Score = 107 bits (267), Expect(2) = 6e-57
Identities = 58/180 (32%), Positives = 93/180 (51%)
Frame = -1

Query: 565 SMTWCDIFSSIQVLRYSSR*RVGSRADIPIWNSPWLPCPFTFQVNCPVSVIIADARVAEL 386
S +W I IQ L+ RVG +I IW+ PWLP T + P + + +V +L
Sbjct: 361 SYSWRSIVRGIQALKKGLIWRVGDGTNIDIWHDPWLPSGITRRPITPRGRTVVN-KVTDL 419

Query: 385 IREVDNQWNHSLLDAVFWSEEAEIIKGIPLRFHQRPDSWMWHFTHSREISVRSAYYLAVE 206
I +W+ L++ +FW E+ + I IP+R D WHF + SV+SAY++ +
Sbjct: 420 IDPTIGKWDKELIEGLFWEEDVKQILTIPIRAGVE-DGLAWHFDNRGIFSVKSAYHVLED 478

Query: 205 RGVDYSSNRQASQPSTSNNHPYVEWYTIWQLDVPPKVKHFLWRFAMSALPTQDSLRRRGI 26
+ + + S N + W IW+L PKVKHF+W A ++LP + S+++RG+
Sbjct: 479 ERRRHKPKQDGASSSGQTNMEKLCWQQIWKLPYLPKVKHFIWHLAHNSLPFRMSIQKRGM 538

>emb|CCA66036.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
Length = 1369

Score = 151 bits (381), Expect(2) = 1e-56
Identities = 75/194 (38%), Positives = 111/194 (57%)
Frame = -2

Query: 1173 ILTLYEHALVQLVEFEQIRGFFQS*YFIQDHLVIATFLQIPVTQTHHR*LGMPSMSGRSK 994
IL+ YE A Q + E+ + + L + H + LG+P+ G SK
Sbjct: 711 ILSTYEAASGQKLNMEKSEMSYSRNLEPDKINTLQMKLAFKTVEGHEKYLGLPTFIGSSK 770

Query: 993 VQIFQGVWEKVMGRIRGWSEKFLSTAGKEILWKDVEQAIPIYTMSCFLLPESFQKEIQAT 814
++FQ + ++V +++GW K+LS AG+E+L K V QAIP Y M CF++P+S I+
Sbjct: 771 KRVFQAIQDRVWKKLKGWKGKYLSQAGREVLIKAVAQAIPTYAMQCFVIPKSIIDGIEKM 830

Query: 813 MAQYRWQHKSDNQGVYWVAWRKLCTPKKESGMGFRNLRSINLSMLAKQGWRILINPGSLM 634
+ W K + + V WVAW KL PKKE G+G RN N ++LAKQ WRIL P SLM
Sbjct: 831 CRNFFWGQKEEERRVAWVAWEKLFLPKKEGGLGIRNFDVFNRALLAKQAWRILTKPDSLM 890

Query: 633 ANIVKARYFSQNHF 592
A ++K +YF +++F
Sbjct: 891 ARVIKGKYFPRSNF 904

Score = 97.8 bits (242), Expect(2) = 1e-56
Identities = 59/192 (30%), Positives = 93/192 (48%), Gaps = 2/192 (1%)
Frame = -1

Query: 595 FFYAHLGYNPSMTWCDIFSSIQVLRYSSR*RVGSRADIPIWNSPWLPCPFTFQVNCPVSV 416
F A + N S T I S+ V++ +G D IW PW+P + + V
Sbjct: 904 FLEARVSPNMSFTCKSILSARAVIQKGMCRVIGDGRDTTIWGDPWVPSLERYSIAATEGV 963

Query: 415 IIADA--RVAELIREVDNQWNHSLLDAVFWSEEAEIIKGIPLRFHQRPDSWMWHFTHSRE 242
D +V ELI +++WN LL+ +F E+ I+ IP+ ++PD WMW + + +
Sbjct: 964 SEDDGPQKVCELIS--NDRWNVELLNTLFQPWESTAIQRIPVALQKKPDQWMWMMSKNGQ 1021

Query: 241 ISVRSAYYLAVERGVDYSSNRQASQPSTSNNHPYVEWYTIWQLDVPPKVKHFLWRFAMSA 62
+VRSAYY + + + PSTS W IW+ +PPKVK F W+ +
Sbjct: 1022 FTVRSAYYHEL-------LEDRKTGPSTSRGPNLKLWQKIWKAKIPPKVKLFSWKAIHNG 1074

Query: 61 LPTQDSLRRRGI 26
L ++R+RG+
Sbjct: 1075 LAVYTNMRKRGM 1086

Score = 79.3 bits (194), Expect(4) = 4e-30
Identities = 43/126 (34%), Positives = 57/126 (45%)
Frame = -3

Query: 2411 TGFYGEPRIRFLVTMWNYFKTLKNKREGPWACVGDFIEVLFRSEATGKHPRQSWQMRAFG 2232
TG YG P L PW C GDF +L SE G S + F
Sbjct: 108 TGIYGYPEEEHKDKTGALLSALARASRRPWLCGGDFNLMLVASEKKGGDGFNSREADIFR 167

Query: 2231 QALNHCQLRYMGYCGYPFTWSSSHFDISRSKARLDRCCANSRWSNLFPDTIVTHLDVVSS 2052
A+ C +G+ GY FTW+++ + + RLDR AN W FP + V+HL S
Sbjct: 168 NAMEECHFMDLGFVGYEFTWTNNRGGDANIQERLDRFVANDLWKIKFPGSFVSHLPKRKS 227

Query: 2051 DHMPIL 2034
DH+PI+
Sbjct: 228 DHVPIV 233

Score = 55.1 bits (131), Expect(4) = 4e-30
Identities = 30/91 (32%), Positives = 43/91 (47%)
Frame = -2

Query: 1578 GRWCXXXXXXXXXXXQFYTKLFSTSCPQEQTMNLVLERVPPRVTLETNDVLLRPFAEHKI 1399
G W ++ LF + E M+ +L V P++T E L PF ++
Sbjct: 385 GEWFEDEDDVTECFAHYFENLFQSGNNCE--MDPILNIVKPQITDELGTQLDAPFRREEV 442

Query: 1398 FTALSHMSPLKSPGPDGFNTGFYQSYWLIVG 1306
AL+ M P K+PGPDG N FYQ +W +G
Sbjct: 443 SAALAQMHPNKAPGPDGMNALFYQHFWDTIG 473

Score = 38.5 bits (88), Expect(4) = 4e-30
Identities = 29/119 (24%), Positives = 53/119 (44%), Gaps = 2/119 (1%)
Frame = -1

Query: 1984 FWFEAKWLQHADCQQLIRDSWTSDDTVGPSSFPTNTCVSNLFW*KRV--GCGKEFHSIYC 1811
F FEA WL+ + ++++++W G + T L W K+ KE
Sbjct: 252 FRFEAMWLREGESDEVVKETWMRGTDAGINL--ARTANKLLSWSKQKFGHVAKEIRMCQH 309

Query: 1810 FVEVTCPGIAS**SFGEKIIRVFSLKQQLDMLLADEDLRRKQWAKVDWYRYGDRNRSFF 1634
++V S + I+ + +L ++D L E++ Q ++ DW + GD+N FF
Sbjct: 310 QMKVLMESEPS----EDNIMHMRALDARMDELEKREEVYWHQRSRQDWIKSGDKNTKFF 364

Score = 27.7 bits (60), Expect(4) = 4e-30
Identities = 10/26 (38%), Positives = 17/26 (65%)
Frame = -2

Query: 2526 GVALLWKIDVLVLALGFSRNFIDVTV 2449
G+A+LW+ ++ V + S N ID+ V
Sbjct: 72 GLAMLWRSEIKVQVMSMSSNHIDIVV 97

>gb|ABA98491.1| retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]
Length = 1621

Score = 145 bits (366), Expect(2) = 1e-54
Identities = 72/189 (38%), Positives = 112/189 (59%)
Frame = -2

Query: 1173 ILTLYEHALVQLVEFEQIRGFFQS*YFIQDHLVIATFLQIPVTQTHHR*LGMPSMSGRSK 994
IL +YE Q++ ++ F + + L + T+ R LG+P GRS+
Sbjct: 958 ILQIYEECSGQVINKDKSAVMFSPNTSSLEKRAVMAALNMQRETTNERYLGLPVFVGRSR 1017

Query: 993 VQIFQGVWEKVMGRIRGWSEKFLSTAGKEILWKDVEQAIPIYTMSCFLLPESFQKEIQAT 814
+IF + E++ RI+GW EK LS AGKEIL K V QAIP + M CF L + +I
Sbjct: 1018 TKIFSYLKERIWQRIQGWKEKLLSRAGKEILIKAVAQAIPTFAMGCFELTKDLCDQISKM 1077

Query: 813 MAQYRWQHKSDNQGVYWVAWRKLCTPKKESGMGFRNLRSINLSMLAKQGWRILINPGSLM 634
+A+Y W ++ + ++W++W KL PK G+GFR++ NL+MLAKQGWR++ +P SL
Sbjct: 1078 IAKYWWSNQEKDNKMHWLSWNKLTLPKNMGGLGFRDIYIFNLAMLAKQGWRLIQDPDSLC 1137

Query: 633 ANIVKARYF 607
+ +++A+YF
Sbjct: 1138 SRVLRAKYF 1146

Score = 97.1 bits (240), Expect(2) = 1e-54
Identities = 61/184 (33%), Positives = 89/184 (48%), Gaps = 2/184 (1%)
Frame = -1

Query: 571 NPSMTWCDIFSSIQVLRYSSR*RVGSRADIPIWNSPWLPCPFTFQVNCPVSVIIADARVA 392
N S TW I ++VL+ RVG + I IW PW+P ++ + P + +V
Sbjct: 1159 NVSYTWRSIQKGLRVLQNGMIWRVGDGSKINIWADPWIPRGWSRKPMTPRGANLV-TKVE 1217

Query: 391 ELIREVDNQWNHSLLDAVFWSEEAEIIKGIPLRFHQRPDSWMWHFTHSREISVRSAYYLA 212
ELI W+ LL FW E+ IK IP+ D WHF +V+SAY
Sbjct: 1218 ELIDPYTGTWDEDLLSQTFWEEDVAAIKSIPVHVEME-DVLAWHFDARGCFTVKSAY--K 1274

Query: 211 VERGVDYSSNRQASQPSTSNNHPYVE--WYTIWQLDVPPKVKHFLWRFAMSALPTQDSLR 38
V+R ++ ++R P SN + W +W+L VP K+KHFLWR + L + +L
Sbjct: 1275 VQREMERRASRNGC-PGVSNWESGDDDFWKKLWKLGVPGKIKHFLWRMCHNTLALRANLH 1333

Query: 37 RRGI 26
RG+
Sbjct: 1334 HRGM 1337

Score = 78.2 bits (191), Expect(3) = 7e-26
Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 1/131 (0%)
Frame = -3

Query: 2402 YGEPRIRFLVTMWNYFKTLKNKREGPWACVGDFIEVLFRSEATGKHPRQSWQMRAFGQAL 2223
YG+ W + L + PW GDF E+LF E G + M F AL
Sbjct: 352 YGDAHSETKHRTWTTMRGLIDNPTTPWLMAGDFNEILFSHEKQGGRMKAQSAMDEFRHAL 411

Query: 2222 NHCQLRYMGYCGYPFTW-SSSHFDISRSKARLDRCCANSRWSNLFPDTIVTHLDVVSSDH 2046
C L +G+ G FTW + SH + RLDR AN W +FP V + D SDH
Sbjct: 412 TDCGLDDLGFEGDAFTWRNHSHSQEGYIRERLDRAVANPEWRAMFPAARVINGDPRHSDH 471

Query: 2045 MPILLDLFNKN 2013
P++++L KN
Sbjct: 472 RPVIIELEGKN 482

Score = 48.1 bits (113), Expect(3) = 7e-26
Identities = 27/91 (29%), Positives = 45/91 (49%)
Frame = -2

Query: 1578 GRWCXXXXXXXXXXXQFYTKLFSTSCPQEQTMNLVLERVPPRVTLETNDVLLRPFAEHKI 1399
G W +F+ +LF+++ Q +L+ V +V+ N+ L F ++
Sbjct: 632 GSWVEREEDKRAMIIEFFKQLFTSNGGQNSQK--LLDVVDRKVSGAMNESLRAEFTREEV 689

Query: 1398 FTALSHMSPLKSPGPDGFNTGFYQSYWLIVG 1306
AL + LK+PGPDG GFY++ W +VG
Sbjct: 690 KEALDAIGDLKAPGPDGMPAGFYKACWDVVG 720

Score = 40.0 bits (92), Expect(3) = 7e-26
Identities = 35/136 (25%), Positives = 60/136 (44%), Gaps = 11/136 (8%)
Frame = -1

Query: 1984 FWFEAKWLQHADCQQLIRDSWTSDDTVGPSSFPTNTCVSNLF-----W*KRV--GCGKEF 1826
F FEA WL+ +++++++W D + G P + ++ + W V K
Sbjct: 494 FRFEAAWLEEEKFKEVVKEAW--DVSAGLQGLPVHASLAGVAAGLSSWSSNVLGDLEKRV 551

Query: 1825 HSIYCFVEVTCPGIAS**SFGEKIIRVFSLKQQLDMLLADEDLRRKQWAKVDWYRYGDRN 1646
+ +E S ++++R L+ +L+ L D+ KQ A +W GDRN
Sbjct: 552 KKVKKELETCRRQPIS----RDQVVREEVLRYRLEKLEQQVDIYWKQRAHTNWLNKGDRN 607

Query: 1645 RSFF----MPRRRLNK 1610
SFF RRR N+
Sbjct: 608 TSFFHASCSERRRRNR 623