BLASTX nr result
ID: Cocculus22_contig00031713
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cocculus22_contig00031713 (348 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004148188.1| PREDICTED: uncharacterized protein LOC101204... 71 2e-10 emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678... 71 2e-10 dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like ... 70 3e-10 ref|XP_002877469.1| predicted protein [Arabidopsis lyrata subsp.... 70 3e-10 gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal... 66 4e-09 ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arab... 66 6e-09 gb|AAC63678.1| putative non-LTR retroelement reverse transcripta... 65 7e-09 dbj|BAB01431.1| non-LTR retroelement reverse transcriptase-like ... 65 1e-08 ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein A... 65 1e-08 gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] 65 1e-08 dbj|BAE98403.1| putative non-LTR reverse transcriptase [Arabidop... 65 1e-08 gb|AAD12028.1| putative non-LTR retroelement reverse transcripta... 65 1e-08 ref|XP_002459639.1| hypothetical protein SORBIDRAFT_02g007880 [S... 64 2e-08 ref|XP_002448781.1| hypothetical protein SORBIDRAFT_06g033056 [S... 64 2e-08 ref|XP_006300939.1| hypothetical protein CARUB_v10021318mg, part... 64 3e-08 ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein A... 64 3e-08 gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CA... 63 4e-08 ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcript... 63 4e-08 gb|ABK28243.1| unknown [Arabidopsis thaliana] 63 4e-08 gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thali... 63 4e-08 >ref|XP_004148188.1| PREDICTED: uncharacterized protein LOC101204314 [Cucumis sativus] Length = 282 Score = 70.9 bits (172), Expect = 2e-10 Identities = 38/110 (34%), Positives = 52/110 (47%) Frame = -1 Query: 333 EATKIGEKKKDKIVWRLESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*K 154 + ++ +D+ VW S FS SAW IR + L+W NIP+HS Sbjct: 86 QGVRLSPSVEDRWVWVPGSHDSFSITSAWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWL 145 Query: 153 LALNRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWKGI 4 +R T RL+R + + CL C + ES DHLFF C F +IW I Sbjct: 146 AIRDRLGTRGRLSRWDRSIPLSCLLCGGNYESRDHLFFSCHFGWEIWSRI 195 >emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1| putative protein [Arabidopsis thaliana] Length = 473 Score = 70.9 bits (172), Expect = 2e-10 Identities = 35/102 (34%), Positives = 57/102 (55%), Gaps = 3/102 (2%) Frame = -1 Query: 306 KDKIVWRLESS---GEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRP 136 +D+ +W+ + + FS K WN+IR + K +WF + IP+H+ + NR Sbjct: 271 EDRALWKGKENRFRSIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRL 330 Query: 135 ATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWK 10 +T R+ N+ VD+ C+ C + ES DHLFF C F+ +IW+ Sbjct: 331 STGDRMTLWNMGVDATCILCNKALESRDHLFFSCPFATEIWE 372 >dbj|BAB08692.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] gi|93007380|gb|ABE97193.1| hypothetical protein At5g13655 [Arabidopsis thaliana] Length = 385 Score = 70.1 bits (170), Expect = 3e-10 Identities = 35/118 (29%), Positives = 61/118 (51%), Gaps = 10/118 (8%) Frame = -1 Query: 327 TKIGEKKKDKIV-------WRLESSG---EFSFKSAWNYIRMKNQPFK*CKLIWFPRNIP 178 T+I ++K+ +IV W+ + G F K W+ IR + + IWF P Sbjct: 168 TEIQKQKQSRIVTERDVALWKGKEDGFHPTFLSKETWSQIRNTQPEMQGYRGIWFSNATP 227 Query: 177 RHSITV*KLALNRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWKGI 4 ++++ + NR AT ++ N D+ C+FC N E+ +HLFF+C ++ K+W G+ Sbjct: 228 KYALLTWLMVRNRIATGEKMGLWNQNTDTSCIFCKNPNETREHLFFQCVYTRKVWNGL 285 >ref|XP_002877469.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297323307|gb|EFH53728.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 328 Score = 70.1 bits (170), Expect = 3e-10 Identities = 37/100 (37%), Positives = 51/100 (51%), Gaps = 2/100 (2%) Frame = -1 Query: 303 DKIVWRLESSG--EFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRPAT 130 D +VW ++ FS K+ WN +R+ L+W IPRH IT LNR T Sbjct: 166 DTLVWAVDGIPYKHFSSKAVWNAVRISKPVNYWAPLVWHKAAIPRHVITSWLFILNRNPT 225 Query: 129 AARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWK 10 RL+ V+ CL C + ES +HLFF C FS ++W+ Sbjct: 226 LDRLSSWGYDVELDCLLCGLAHESRNHLFFNCVFSVEVWR 265 >gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana] Length = 629 Score = 66.2 bits (160), Expect = 4e-09 Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 3/105 (2%) Frame = -1 Query: 309 KKDKIVWRLESS---GEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNR 139 ++D +WR + FS K WN +R K+ K +WF + P++ NR Sbjct: 427 REDATLWRGKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNR 486 Query: 138 PATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWKGI 4 +T R+ N D +C FC S E+ DHLFF C ++ IW I Sbjct: 487 LSTGYRMQLWNNGSDVKCTFCSTSIETRDHLFFSCSYASAIWTAI 531 >ref|XP_002865536.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata] gi|297311371|gb|EFH41795.1| hypothetical protein ARALYDRAFT_917542 [Arabidopsis lyrata subsp. lyrata] Length = 227 Score = 65.9 bits (159), Expect = 6e-09 Identities = 33/103 (32%), Positives = 51/103 (49%), Gaps = 3/103 (2%) Frame = -1 Query: 312 KKKDKIVWRL---ESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALN 142 K DK +WR + F+ + WN +R +WFP+ +PR+S V + Sbjct: 45 KGPDKPLWRYTLDDYDSSFTSRHTWNLLRKAKHKVLWHNSVWFPQRVPRYSFIVWLAVKD 104 Query: 141 RPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 + +T R+ V+ C+FC +ES DHLFF C F++ IW Sbjct: 105 QLSTGTRMRAWG--VEQPCVFCRERDESRDHLFFACPFTYSIW 145 >gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1216 Score = 65.5 bits (158), Expect = 7e-09 Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 3/104 (2%) Frame = -1 Query: 306 KDKIVWRLESS---GEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRP 136 +D I+WR + FS K WN+IR + K +WF P+ S NR Sbjct: 744 EDAILWRGKEDVFKARFSTKDTWNHIRTSSNQRAWHKGVWFAHATPKFSFCAWLAIRNRL 803 Query: 135 ATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWKGI 4 +T R+ N + C+FC + E+ DHLFF+C +S +IW I Sbjct: 804 STGDRMMTWNNGTPTTCVFCSSPMETRDHLFFQCCYSSEIWTSI 847 >dbj|BAB01431.1| non-LTR retroelement reverse transcriptase-like protein [Arabidopsis thaliana] Length = 637 Score = 65.1 bits (157), Expect = 1e-08 Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 3/103 (2%) Frame = -1 Query: 312 KKKDKIVWRLESS---GEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALN 142 ++ D+ +W+ + G FS W IR + + + +WFP + P++S N Sbjct: 243 QESDRSLWKQKEDSFKGSFSSPKTWQQIRTISNECEWYRGVWFPSSTPKYSFVTWLAFHN 302 Query: 141 RPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 R AT RL + N + C+FC E+ DHLFF C +S +IW Sbjct: 303 RLATGDRLYKWNSEARATCVFCDEELETRDHLFFSCPYSSQIW 345 >ref|XP_004309110.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 872 Score = 65.1 bits (157), Expect = 1e-08 Identities = 32/103 (31%), Positives = 53/103 (51%) Frame = -1 Query: 321 IGEKKKDKIVWRLESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALN 142 I +DK++W+ S+GE + K A+ +++ + K +W +PR S+ K+ Sbjct: 491 INPSMEDKLIWQASSTGELTAKQAFLFLQQASPVVPWGKPLWSKFILPRMSLHAWKVMRG 550 Query: 141 RPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 + L R + + SRC FC NS ES DH+F C F+ +W Sbjct: 551 TVISYHLLQRRGVALVSRCEFCGNSTESLDHIFLHCSFAASVW 593 >gb|AAD32866.1|AC005489_4 F14N23.4 [Arabidopsis thaliana] Length = 1161 Score = 65.1 bits (157), Expect = 1e-08 Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 3/115 (2%) Frame = -1 Query: 348 RRLLTEATKIGEKKKDKIVWRL---ESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIP 178 R L + A+ I D +W++ S FS W+Y++ + K +WF ++P Sbjct: 938 RVLPSAASLIDCPHDDTYLWKIGHHAPSNRFSTADTWSYLQPSSTSVLWHKAVWFKDHVP 997 Query: 177 RHSITV*KLALNRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 + + +A NR T RL R + C+ C + +ES +HLFF+C FS +IW Sbjct: 998 KQAFICWVVAHNRLHTRDRLRRWGFSIPPTCVLCNDLDESREHLFFRCQFSSEIW 1052 >dbj|BAE98403.1| putative non-LTR reverse transcriptase [Arabidopsis thaliana] Length = 278 Score = 65.1 bits (157), Expect = 1e-08 Identities = 33/103 (32%), Positives = 52/103 (50%), Gaps = 3/103 (2%) Frame = -1 Query: 312 KKKDKIVWRLESS---GEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALN 142 ++ D+ +W+ + G FS W IR + + + +WFP + P++S N Sbjct: 75 QESDRSLWKQKEDSFKGSFSSPKTWQQIRTISNECEWYRGVWFPSSTPKYSFVTWLAFHN 134 Query: 141 RPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 R AT RL + N + C+FC E+ DHLFF C +S +IW Sbjct: 135 RLATGDRLYKWNSEARATCVFCDEELETRDHLFFSCPYSSQIW 177 >gb|AAD12028.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana] Length = 1447 Score = 64.7 bits (156), Expect = 1e-08 Identities = 34/104 (32%), Positives = 53/104 (50%), Gaps = 3/104 (2%) Frame = -1 Query: 315 EKKKDKIVWRLESS---GEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLAL 145 ++K D +VWR + +FS + WN IR K +WF + P+ S V AL Sbjct: 1243 KEKADDVVWRGRNDIYKPQFSTRDTWNNIRTTATKVTWYKGVWFYQATPKFSFCVWLAAL 1302 Query: 144 NRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 +R +T R+ + C+FC + +S DHLFF C +S ++W Sbjct: 1303 DRLSTGDRMANWKGVSSGSCVFCNHPTKSRDHLFFNCPYSSEVW 1346 >ref|XP_002459639.1| hypothetical protein SORBIDRAFT_02g007880 [Sorghum bicolor] gi|241923016|gb|EER96160.1| hypothetical protein SORBIDRAFT_02g007880 [Sorghum bicolor] Length = 475 Score = 64.3 bits (155), Expect = 2e-08 Identities = 40/104 (38%), Positives = 54/104 (51%) Frame = -1 Query: 315 EKKKDKIVWRLESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRP 136 E ++D IVW LESSGE++ KSA+ N LIW P+ + L NR Sbjct: 280 ETEEDSIVWTLESSGEYTAKSAYAVQFAGNIVSNHPALIWRVWATPKCKYFIWLLLQNRL 339 Query: 135 ATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWKGI 4 TAARL + C C + E++ HLFF+C FS ++W GI Sbjct: 340 WTAARLQLRRWTNNYFCALCERNLETAHHLFFECPFSLEVWHGI 383 >ref|XP_002448781.1| hypothetical protein SORBIDRAFT_06g033056 [Sorghum bicolor] gi|241939964|gb|EES13109.1| hypothetical protein SORBIDRAFT_06g033056 [Sorghum bicolor] Length = 206 Score = 63.9 bits (154), Expect = 2e-08 Identities = 40/104 (38%), Positives = 54/104 (51%) Frame = -1 Query: 315 EKKKDKIVWRLESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRP 136 E ++D IVW LESSGE++ KSA+ N LIW P+ + L NR Sbjct: 11 ETEEDSIVWTLESSGEYTAKSAYAAQFAGNIVSNHPALIWRVWATPKCKYFIWLLIQNRL 70 Query: 135 ATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWKGI 4 TAARL + C C + E++ HLFF+C FS ++W GI Sbjct: 71 WTAARLQLRGWTNNYFCALCERNLETAHHLFFECPFSLEVWHGI 114 >ref|XP_006300939.1| hypothetical protein CARUB_v10021318mg, partial [Capsella rubella] gi|482569649|gb|EOA33837.1| hypothetical protein CARUB_v10021318mg, partial [Capsella rubella] Length = 290 Score = 63.5 bits (153), Expect = 3e-08 Identities = 35/102 (34%), Positives = 47/102 (46%), Gaps = 3/102 (2%) Frame = -1 Query: 309 KKDKIVWRL---ESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNR 139 + D +WR E G FS W K +WF IP+H+ + NR Sbjct: 78 ESDFFLWRNSPNEPPGVFSTSKTWISTHPAGPLVPWFKSVWFKERIPKHAFISWVVIRNR 137 Query: 138 PATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 T RL + V S CL C +S ES HLFF+C +SH++W Sbjct: 138 LTTRDRLRGWGMNVPSECLLCTSSAESRLHLFFECAYSHEVW 179 >ref|XP_004309472.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria vesca subsp. vesca] Length = 364 Score = 63.5 bits (153), Expect = 3e-08 Identities = 35/97 (36%), Positives = 49/97 (50%) Frame = -1 Query: 303 DKIVWRLESSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRPATAA 124 DK++W SSGE S K A+ ++R + KLIW IPR S+ K+ R + Sbjct: 3 DKLIWVPLSSGELSAKEAFQFLRPRLPSLDWGKLIWSKFIIPRISLHSWKVLRGRVLSED 62 Query: 123 RLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 L R I + SRC+ C ES H+F C F+ +W Sbjct: 63 LLQRRGIALASRCVLCGRDGESLPHIFLTCSFAASLW 99 >gb|AAC19278.1| T14P8.10 [Arabidopsis thaliana] gi|7269009|emb|CAB80742.1| AT4g02490 [Arabidopsis thaliana] Length = 657 Score = 63.2 bits (152), Expect = 4e-08 Identities = 32/100 (32%), Positives = 49/100 (49%), Gaps = 3/100 (3%) Frame = -1 Query: 303 DKIVWRLES---SGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNIPRHSITV*KLALNRPA 133 D +W++ S +FS W ++ + K +WF +P+H+ A NR Sbjct: 549 DSYLWKVGDRVPSSKFSTADTWRALQPFSVSVSWHKAVWFTNQVPKHAFISWVTAWNRLH 608 Query: 132 TAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIW 13 T RL +IV + C+ C +E+ DHLFF C FS +IW Sbjct: 609 TRDRLRSWGLIVPAECVLCNLVDETRDHLFFACRFSSRIW 648 >ref|NP_567266.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] gi|5732057|gb|AAD48956.1|AF149414_5 contains similarity to a family of Arabidopsis thaliana predicted proteins, which have similarity to reverse transcriptases; see T14P8.10 (GB:AF069298) [Arabidopsis thaliana] gi|7267223|emb|CAB80830.1| AT4g04650 [Arabidopsis thaliana] gi|332657009|gb|AEE82409.1| RNA-directed DNA polymerase (reverse transcriptase)-related family protein [Arabidopsis thaliana] Length = 332 Score = 63.2 bits (152), Expect = 4e-08 Identities = 33/117 (28%), Positives = 58/117 (49%), Gaps = 4/117 (3%) Frame = -1 Query: 348 RRLLTEATKIGE-KKKDKIVWRLE---SSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNI 181 + LL EA + + + D +W+ + S FS W+ + ++ K +WF ++ Sbjct: 80 KNLLPEAQGLLDCQHDDSFLWKTDLHAPSNRFSAPRTWSALHPQSHTVPWHKAVWFKNHV 139 Query: 180 PRHSITV*KLALNRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWK 10 P+H+ +A NR T RL + + + CL C ++S HLFF+C FS +W+ Sbjct: 140 PKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFFECQFSGVVWR 196 >gb|ABK28243.1| unknown [Arabidopsis thaliana] Length = 297 Score = 63.2 bits (152), Expect = 4e-08 Identities = 33/117 (28%), Positives = 58/117 (49%), Gaps = 4/117 (3%) Frame = -1 Query: 348 RRLLTEATKIGE-KKKDKIVWRLE---SSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNI 181 + LL EA + + + D +W+ + S FS W+ + ++ K +WF ++ Sbjct: 80 KNLLPEAQGLLDCQHDDSFLWKTDLHAPSNRFSAPRTWSALHPQSHTVPWHKAVWFKNHV 139 Query: 180 PRHSITV*KLALNRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWK 10 P+H+ +A NR T RL + + + CL C ++S HLFF+C FS +W+ Sbjct: 140 PKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFFECQFSGVVWR 196 >gb|ABE65512.1| hypothetical protein At4g04650 [Arabidopsis thaliana] Length = 296 Score = 63.2 bits (152), Expect = 4e-08 Identities = 33/117 (28%), Positives = 58/117 (49%), Gaps = 4/117 (3%) Frame = -1 Query: 348 RRLLTEATKIGE-KKKDKIVWRLE---SSGEFSFKSAWNYIRMKNQPFK*CKLIWFPRNI 181 + LL EA + + + D +W+ + S FS W+ + ++ K +WF ++ Sbjct: 80 KNLLPEAQGLLDCQHDDSFLWKTDLHAPSNRFSAPRTWSALHPQSHTVPWHKAVWFKNHV 139 Query: 180 PRHSITV*KLALNRPATAARLNRMNIIVDSRCLFCWNSEESSDHLFFKCFFSHKIWK 10 P+H+ +A NR T RL + + + CL C ++S HLFF+C FS +W+ Sbjct: 140 PKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHLFFECQFSGVVWR 196