BLASTX nr result
ID: Catharanthus22_contig00010871
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00010871
         (1464 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...    90   2e-27
ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutr...    98   9e-26
ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624...    77   1e-24
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t...    80   4e-23
ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661...   113   2e-22
gb|AAD11595.1| putative reverse transcriptase [Arabidopsis thali...    79   3e-18
ref|XP_006419099.1| hypothetical protein EUTSA_v10003107mg [Eutr...    59   3e-17
ref|XP_006415896.1| hypothetical protein EUTSA_v10009346mg, part...    75   1e-15
ref|XP_006471813.1| PREDICTED: uncharacterized protein LOC102631...    86   2e-15
gb|EOY05030.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) ...    74   2e-15
gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis tha...    90   3e-15
emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera]    71   1e-14
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop...    64   2e-14
ref|XP_004515089.1| PREDICTED: uncharacterized protein LOC101500...    58   2e-14
ref|XP_006589931.1| PREDICTED: uncharacterized protein LOC102669...    87   2e-14
ref|XP_006589879.1| PREDICTED: uncharacterized protein LOC102665...    87   2e-14
gb|EPS60009.1| hypothetical protein M569_14795, partial [Genlise...    86   4e-14
ref|XP_006471815.1| PREDICTED: uncharacterized protein LOC102606...
85 7e-14 emb|CAN74230.1| hypothetical protein VITISV_000585 [Vitis vinifera] 85 7e-14 ref|XP_006393736.1| hypothetical protein EUTSA_v10012212mg, part... 75 7e-14 >gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 1496 Score = 90.1 bits (222), Expect(2) = 2e-27 Identities = 53/139 (38%), Positives = 73/139 (52%) Frame = +1 Query: 631 AYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPENDPX 810 AY I++SD G + + + LK NNYAE + +Q GFIDGSIPKP DP Sbjct: 21 AYLINASDNPGALISSVV----LKENNYAEWSEELQNFLRAKQKLGFIDGSIPKPAADPE 76 Query: 811 XXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALA 990 IN+++ WI +I+ T+RS++ + +LW +L RF V NG RK L +A Sbjct: 77 LSLWIAINSMIVGWIRTSIDPTIRSTVGFVSEASQLWENLRRRFSVGNGVRKTLLKDEIA 136 Query: 991 NCK*GGDFVNVYHS*LKKL 1047 C G V Y+ L KL Sbjct: 137 ACTQDGQPVLAYYGRLIKL 155 Score = 60.8 bits (146), Expect(2) = 2e-27 Identities = 42/143 (29%), Positives = 74/143 (51%), Gaps = 5/143 (3%) Frame = +2 Query: 1043 SYHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFG 1222 +Y+ RL KLW+EL NY E + + K++E+++V+ ++GL D F Sbjct: 147 AYYGRLIKLWEELQNYKS--GRECKCEAASDIEKEREDDRVHK------FLLGL-DSRFS 197 Query: 1223 TVRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSGRERKYDH----GFR*RWN-KPV 1387 ++RSSIT EPLP L Q+ +R ++ + + N+ R + GF + + P Sbjct: 198 SIRSSITDIEPLPDLYQVYSRVVR------EEQNLNASRTKDVVKTEAIGFSVQSSTTPR 251 Query: 1388 VQTRTKSVCTHCQKQGHDIIPVF 1456 + ++ CTHC ++GH++ F Sbjct: 252 FRDKSTLFCTHCNRKGHEVTQCF 274 >ref|XP_006397294.1| hypothetical protein EUTSA_v10029485mg [Eutrema salsugineum] gi|557098311|gb|ESQ38747.1| hypothetical protein EUTSA_v10029485mg [Eutrema salsugineum] Length = 196 Score = 98.2 bits (243), Expect(2) = 9e-26 Identities = 53/135 (39%), Positives = 80/135 (59%) Frame = +1 Query: 637 YISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPENDPXXX 816 YI SDR G + T +Q L+G NY + K ++ A GFI+G++PKP Sbjct: 2 YIHPSDRPGDLITTMQ----LRGENYEDWAKHVRNALRTKRKLGFIEGTLPKPTAPKELE 57 Query: 817 
XXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALANC 996 +N+++ +WI+NTIE L+++I+ + +ELW DL+ +FLV NGP+ EL A +ANC Sbjct: 58 QWEVVNSMLVAWIMNTIESNLKTTISMVDEAKELWDDLKLQFLVGNGPQISELRADIANC 117 Query: 997 K*GGDFVNVYHS*LK 1041 + GD + VY LK Sbjct: 118 RQNGDSIMVYFEKLK 132 Score = 47.4 bits (111), Expect(2) = 9e-26 Identities = 30/71 (42%), Positives = 41/71 (57%) Frame = +2 Query: 1064 KLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFGTVRSSIT 1243 K+WDEL Y + + + E+ A L +D EEE+ + GL+ E FGTVRS+I Sbjct: 132 KMWDELAVYKPIRTCSCG-ELRAQLEEDLEEERTNT------FLTGLDAERFGTVRSTIR 184 Query: 1244 HEEPLPKLKQI 1276 EPLPKL Q+ Sbjct: 185 SLEPLPKLTQV 195 >ref|XP_006480040.1| PREDICTED: uncharacterized protein LOC102624694 isoform X1 [Citrus sinensis] gi|568852764|ref|XP_006480041.1| PREDICTED: uncharacterized protein LOC102624694 isoform X2 [Citrus sinensis] gi|568852766|ref|XP_006480042.1| PREDICTED: uncharacterized protein LOC102624694 isoform X3 [Citrus sinensis] Length = 320 Score = 77.4 bits (189), Expect(2) = 1e-24 Identities = 37/84 (44%), Positives = 54/84 (64%) Frame = +1 Query: 790 KPENDPXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKY 969 +P +P +N+++ SWILNTIE TLRS+I + E ++LW D++ERF V NGPR + Sbjct: 3 EPAKEPELDDWWTVNSMIVSWILNTIEPTLRSTITHMEVAKKLWDDIKERFSVGNGPRVH 62 Query: 970 ELNAALANCK*GGDFVNVYHS*LK 1041 +L + LA CK G + Y+ LK Sbjct: 63 QLKSELAECKQRGMTILSYYGKLK 86 Score = 64.3 bits (155), Expect(2) = 1e-24 Identities = 47/140 (33%), Positives = 72/140 (51%), Gaps = 7/140 (5%) Frame = +2 Query: 1043 SYHSRLKKLWDELDNYTRMP---SSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDE 1213 SY+ +LK +W+EL NY + P E+ A L K EEE+++ ++GL+D Sbjct: 80 SYYGKLKLIWEELANYEQYPICSCGGCTCELEAKLNKKCEEERLHQ------FLMGLDDT 133 Query: 1214 IFGTVRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSGRE-RKYDHGFR*RWN-KPV 1387 I+G+VRS+I +PLP L + + +Q T G+E R F + K Sbjct: 134 IYGSVRSNILSTDPLPPLNRAYSLVVQEERVQT----ITRGKEGRGEPVAFAVQGGVKGQ 189 Query: 1388 VQTRTKS--VCTHCQKQGHD 1441 ++ R KS +C HC+K GHD Sbjct: 190 IEIREKSSVICKHCRKTGHD 209 
>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1098 Score = 79.7 bits (195), Expect(2) = 4e-23 Identities = 45/138 (32%), Positives = 73/138 (52%) Frame = +1 Query: 634 YYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPENDPXX 813 Y I++SD G + + + LK +NY+E + + + GF+DG+IPKP +P Sbjct: 14 YGITASDNPGALISSVI----LKEDNYSEWAEELMNSLQAKQKLGFLDGTIPKPTTEPAL 69 Query: 814 XXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALAN 993 N+++ WI +I+ T+RS++ + ++LW L++RF NG RK L + Sbjct: 70 SSWKAANSMIIGWIRTSIDPTIRSTVAFVSDAKDLWDSLKQRFSNGNGVRKQLLKDEILA 129 Query: 994 CK*GGDFVNVYHS*LKKL 1047 CK G V VY+ L KL Sbjct: 130 CKQDGQSVLVYYGRLTKL 147 Score = 57.0 bits (136), Expect(2) = 4e-23 Identities = 45/153 (29%), Positives = 71/153 (46%), Gaps = 16/153 (10%) Frame = +2 Query: 1046 YHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFGT 1225 Y+ RL KLW+EL NY S E + K++E++KV+ ++ L DE F Sbjct: 140 YYGRLTKLWEELQNYKT--SRTCTCEAAPDIAKEREDDKVHQ------FLLNL-DERFRP 190 Query: 1226 VRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSGR----ERKYDHGFR*R------- 1372 +RS+IT ++PLP L Q+ +R + + + N+ R + GF + Sbjct: 191 IRSTITVQDPLPALNQVYSRVIH------EEQNLNASRIKDDIKTEAVGFTVQATPLPPT 244 Query: 1373 -----WNKPVVQTRTKSVCTHCQKQGHDIIPVF 1456 + P + R+ CTH +QGHDI F Sbjct: 245 PQVAAVSAPRFRDRSSLTCTHYHRQGHDITECF 277 >ref|XP_006586558.1| PREDICTED: uncharacterized protein LOC102661920 [Glycine max] Length = 516 Score = 113 bits (282), Expect = 2e-22 Identities = 75/204 (36%), Positives = 108/204 (52%), Gaps = 2/204 (0%) Frame = +1 Query: 589 EEKIHVKQRIVKNDAYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXG 768 E H+K++I D Y SSD G I T +Q LKG NY E +A++ + Sbjct: 19 ESGSHLKKQISPYDLY---SSDNPGNIITQVQ----LKGENYDEWARAVRGSLRARRKFR 71 Query: 769 FIDGSIPKPEND-PXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFL 945 F+DGSI KP++ P +N+++ SWI NTIE LRS+I Y E +ELW D+++RF Sbjct: 72 FVDGSIKKPDDAAPEIDDWWTVNSMIVSWIFNTIEPKLRSTITYRENAQELWDDIKQRFS 131 Query: 946 VSNGPRKYELNAALANCK*GGDFVNVYHS*LKKLSFAIEKVMG*A*QLYTDAIICYKFR- 1122 
+SNGPR +L + LANCK GD + Y LKKL + C + Sbjct: 132 ISNGPRIQQLKSELANCKQNGDSIVTYFGRLKKLWDELNDFD------QIPMCTCNGCKC 185 Query: 1123 NIGHSDQRQRGGKSIYQFFMGLND 1194 I + ++R + ++QF MGL+D Sbjct: 186 GISAALNKKREEEKLHQFLMGLDD 209 Score = 68.9 bits (167), Expect = 5e-09 Identities = 48/148 (32%), Positives = 72/148 (48%), Gaps = 10/148 (6%) Frame = +2 Query: 1043 SYHSRLKKLWDELDNYTRMPSSAIN---FEILAILTKDKEEEKVYINSSWD*MIIGLNDE 1213 +Y RLKKLWDEL+++ ++P N I A L K +EEEK++ ++GL+D Sbjct: 157 TYFGRLKKLWDELNDFDQIPMCTCNGCKCGISAALNKKREEEKLHQ------FLMGLDDT 210 Query: 1214 IFGTVRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSGRERKYD-------HGFR*R 1372 F TVRS++ +PLP L + +Q G+E + D G Sbjct: 211 QFRTVRSNVLSLDPLPNLNRAYQMVVQEERVGV----MTRGKEERGDPIAFAVKSGRTSS 266 Query: 1373 WNKPVVQTRTKSVCTHCQKQGHDIIPVF 1456 W K T ++ C+HC++ GHDI F Sbjct: 267 WEKK-PNTGSEKPCSHCKRDGHDIDSCF 293 >gb|AAD11595.1| putative reverse transcriptase [Arabidopsis thaliana] gi|4263040|gb|AAD15309.1| putative reverse transcriptase [Arabidopsis thaliana] gi|7270676|emb|CAB77838.1| putative reverse transcriptase [Arabidopsis thaliana] Length = 374 Score = 78.6 bits (192), Expect(2) = 3e-18 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 1/134 (0%) Frame = +1 Query: 634 YYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPEND-PX 810 Y++ SSD GL+ + L GN+Y I AM + GF+DGSIPKP++D P Sbjct: 70 YHLVSSDHPGLVLA----PELLDGNSYGTWIIAMTTSIEAKNKLGFVDGSIPKPDDDDPY 125 Query: 811 XXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALA 990 N++V SW+LN++ + +SI Y +W DL RF S+ PR Y+L + Sbjct: 126 CKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAAAIWKDLYTRFHKSSLPRLYKLRQQIH 185 Query: 991 NCK*GGDFVNVYHS 1032 + + G ++ YH+ Sbjct: 186 SLRQGNLDLSSYHT 199 Score = 41.6 bits (96), Expect(2) = 3e-18 Identities = 34/139 (24%), Positives = 59/139 (42%), Gaps = 4/139 (2%) Frame = +2 Query: 1034 D*KSYHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDE 1213 D SYH+R + LW+EL + +P + + I ++E +V ++GLND Sbjct: 193 DLSSYHTRKQTLWEELTSLQAIPRTVEDLLI------ERETNRVID------FLMGLND- 239 Query: 1214 
IFGTVRSSITHEEPLPKLKQIM----ARHLQGGTTSTHDSDFNSGRERKYDHGFR*RWNK 1381 + VRS I ++ LP L ++ +Q + S + + N Sbjct: 240 CYDAVRSQILMKKTLPSLSEVFNMIDQDEIQRSARISTTPGMTSSVFAVSNQSSQSVLNG 299 Query: 1382 PVVQTRTKSVCTHCQKQGH 1438 Q + + VCT+C + GH Sbjct: 300 DTYQKKERPVCTYCSRPGH 318 >ref|XP_006419099.1| hypothetical protein EUTSA_v10003107mg [Eutrema salsugineum] gi|557097027|gb|ESQ37535.1| hypothetical protein EUTSA_v10003107mg [Eutrema salsugineum] Length = 189 Score = 59.3 bits (142), Expect(2) = 3e-17 Identities = 43/137 (31%), Positives = 61/137 (44%) Frame = +1 Query: 637 YISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPENDPXXX 816 Y+ SDR G + T +Q LKG NY + K ++ A GFIDG++ KP Sbjct: 2 YLHPSDRPGDLITTVQ----LKGENYEDWAKHVRNALRTKRKLGFIDGTLMKPTTAKELE 57 Query: 817 XXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALANC 996 +N++ G+ + ++LT F N PR EL A +ANC Sbjct: 58 QWEVVNSIEGAMGRSELKLT---------------------FSAGNVPRISELRADIANC 96 Query: 997 K*GGDFVNVYHS*LKKL 1047 + GD V VY LKK+ Sbjct: 97 RQNGDSVMVYFGKLKKM 113 Score = 57.4 bits (137), Expect(2) = 3e-17 Identities = 38/97 (39%), Positives = 53/97 (54%) Frame = +2 Query: 1046 YHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFGT 1225 Y +LKK+WDEL Y + + + E+ A L +D+EEE+ + GL+ E FGT Sbjct: 106 YFGKLKKMWDELAIYKPIRTCSCG-ELKAQLEEDQEEERTNT------FLTGLDAERFGT 158 Query: 1226 VRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSG 1336 VRS+I EPLPKL Q+ R + H S F +G Sbjct: 159 VRSTIQSIEPLPKLSQVYQR------LAKHRSKFYTG 189 >ref|XP_006415896.1| hypothetical protein EUTSA_v10009346mg, partial [Eutrema salsugineum] gi|557093667|gb|ESQ34249.1| hypothetical protein EUTSA_v10009346mg, partial [Eutrema salsugineum] Length = 272 Score = 75.5 bits (184), Expect(2) = 1e-15 Identities = 45/139 (32%), Positives = 71/139 (51%), Gaps = 1/139 (0%) Frame = +1 Query: 634 YYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKP-ENDPX 810 YY+ SD +G + T I L G+NY K M + GF+DG++ +P +N Sbjct: 28 YYLHLSDNTGQVLTPIL----LNGSNYERWAKLMLNSLRTKRKIGFVDGTLKRPSDNSDE 83 Query: 811 
XXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALA 990 +N+++ WI + IE L SI+ + + +W L+ RF VS+ R ++L+ +A Sbjct: 84 AEKWDMVNSMIIGWIYSGIESKLCPSISLVDSAKAMWNSLQRRFSVSDDTRLHQLHGDIA 143 Query: 991 NCK*GGDFVNVYHS*LKKL 1047 CK GD V VY +K L Sbjct: 144 ACKQNGDSVEVYFGRIKVL 162 Score = 36.2 bits (82), Expect(2) = 1e-15 Identities = 25/101 (24%), Positives = 49/101 (48%), Gaps = 1/101 (0%) Frame = +2 Query: 1046 YHSRLKKLWDELDNYTR-MPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFG 1222 Y R+K LWD+L + + +F+ +++ +K +EK+ ++ ++GL+ FG Sbjct: 155 YFGRIKVLWDDLADLNKGFQCCCKSFDCSSMVAYEKNQEKMRVHQ----FLMGLDTSRFG 210 Query: 1223 TVRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSGRER 1345 T RS++ + L + ++ +Q H S S ER Sbjct: 211 TARSNLLSRQLDLNLDSVYSQIIQ---EERHLSVMRSNEER 248 >ref|XP_006471813.1| PREDICTED: uncharacterized protein LOC102631218 isoform X1 [Citrus sinensis] gi|568835517|ref|XP_006471814.1| PREDICTED: uncharacterized protein LOC102631218 isoform X2 [Citrus sinensis] Length = 1057 Score = 85.5 bits (210), Expect(2) = 2e-15 Identities = 49/142 (34%), Positives = 77/142 (54%) Frame = +1 Query: 622 KNDAYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPEN 801 +ND + + SD ++ L GNNY ++AM +A GF+DG+I KP+N Sbjct: 742 ENDPFLVHPSDSPTIVLV----SPLLTGNNYGTWVRAMTMALRARNKLGFVDGTITKPDN 797 Query: 802 DPXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNA 981 D N LV SW+LN+I L S+ Y++ ELW+ L+ERF N + Y+L Sbjct: 798 DDGGKWQSC-NDLVRSWVLNSISSKLACSVLYAQSARELWLHLQERF-QQNASKIYKLKQ 855 Query: 982 ALANCK*GGDFVNVYHS*LKKL 1047 A+++ + G V++Y+ +KKL Sbjct: 856 AISSLRQGDVAVHLYYRIMKKL 877 Score = 25.4 bits (54), Expect(2) = 2e-15 Identities = 34/154 (22%), Positives = 54/154 (35%), Gaps = 23/154 (14%) Frame = +2 Query: 1046 YHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFGT 1225 Y+ +KKLW +L++ + + K K ++ + GL+D + Sbjct: 870 YYRIMKKLWRKLNSLQHLEP--------CVSGKAKVVNELQQQDFGMEFLQGLHDR-YAA 920 Query: 1226 VRSSITHEEPLPKLKQIMA----------RHLQGGTTST------------HDS-DFNSG 1336 +RS I +P PK +I+A H GG + H S D G Sbjct: 921 
IRSRILLMDPFPKAHKILALIKKEETQQDLHALGGPSKAAALAIPNRQPLLHSSLDNRMG 980 Query: 1337 RERKYDHGFR*RWNKPVVQTRTKSVCTHCQKQGH 1438 + D N + + C HC K GH Sbjct: 981 NDISADSSVS-NLNGISGNDQRRQTCEHCGKLGH 1013 >gb|EOY05030.1| Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao] Length = 1141 Score = 74.3 bits (181), Expect(2) = 2e-15 Identities = 44/137 (32%), Positives = 68/137 (49%), Gaps = 1/137 (0%) Frame = +1 Query: 634 YYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPE-NDPX 810 Y++ SSD GLIF + N G NY ++ A GF+DG+I KP+ N Sbjct: 31 YFLHSSDHPGLIF--VTHPLNENGENYFTWRRSFLNALRSKNKAGFVDGTIVKPDVNSQD 88 Query: 811 XXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALA 990 N +V W++N + ++SS +++ E+W DL+ERF PR YEL A+A Sbjct: 89 YDSWVQCNAIVLFWLINALAKEIQSSAAHADTTHEVWADLQERFTQRMAPRMYELRRAIA 148 Query: 991 NCK*GGDFVNVYHS*LK 1041 + ++ Y+ LK Sbjct: 149 LLQQEKSSISSYYGKLK 165 Score = 36.2 bits (82), Expect(2) = 2e-15 Identities = 26/77 (33%), Positives = 38/77 (49%), Gaps = 2/77 (2%) Frame = +2 Query: 1043 SYHSRLKKLWDELDNYTRMPSSAINFEILAILTKD--KEEEKVYINSSWD*MIIGLNDEI 1216 SY+ +LK +W EL +P A + +E+EKV+ ++GL D+ Sbjct: 159 SYYGKLKTVWGELQASNPIPVCTCGCTCGAAKKMEDMQEQEKVFD------FLMGL-DDT 211 Query: 1217 FGTVRSSITHEEPLPKL 1267 F TVRS I +PLP L Sbjct: 212 FSTVRSQILSVDPLPSL 228 >gb|AAG50751.1|AC079733_19 polyprotein, putative [Arabidopsis thaliana] Length = 1468 Score = 89.7 bits (221), Expect = 3e-15 Identities = 59/191 (30%), Positives = 93/191 (48%), Gaps = 4/191 (2%) Frame = +1 Query: 634 YYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKP-ENDPX 810 Y ++++D SG + + LK NNY E + A GF+DG+IP+P + P Sbjct: 21 YDLTAADNSGAVIS----HPILKTNNYEEWACGFKTALRSRKKFGFLDGTIPQPLDGSPD 76 Query: 811 XXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALA 990 IN L+ SW+ TI+ L ++I++ + +LW + +RF VSNGP+ ++ A LA Sbjct: 77 LEDWLTINALLVSWMKMTIDSELLTNISHRDVARDLWEQIRKRFSVSNGPKNQKMKADLA 136 Query: 991 NCK*GGDFVNVYHS*LKKLSFAIEKVMG*A*QLYTDAIICYKFR---NIGHSDQRQRGGK 1161 CK G V Y+ L K+ I Y IC R N+G ++ R Sbjct: 137 
TCKQEGMTVEGYYGKLNKIWDNINS--------YRPLRICKCGRCICNLGTDQEKYREDD 188 Query: 1162 SIYQFFMGLND 1194 ++Q+ GLN+ Sbjct: 189 MVHQYLYGLNE 199 >emb|CAN74847.1| hypothetical protein VITISV_028741 [Vitis vinifera] Length = 1262 Score = 71.2 bits (173), Expect(2) = 1e-14 Identities = 36/125 (28%), Positives = 65/125 (52%) Frame = +1 Query: 625 NDAYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPEND 804 N +++ + DR G T L G+NY + +QLA F++G+I P+ Sbjct: 18 NSPFFLGTGDRPGDFIT----PTRLHGDNYNDWASDIQLALEARRKFEFLEGTITGPQPP 73 Query: 805 PXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAA 984 +N ++GSWI NTI+ ++S+++ + LW L++R+ + NGPR +L + Sbjct: 74 YTQSDWNTVNAMLGSWITNTIDPEVKSTLSKFRDAKRLWEHLKQRYAMVNGPRIQQLKTS 133 Query: 985 LANCK 999 +A C+ Sbjct: 134 IAKCE 138 Score = 36.6 bits (83), Expect(2) = 1e-14 Identities = 32/151 (21%), Positives = 57/151 (37%), Gaps = 18/151 (11%) Frame = +2 Query: 1043 SYHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDEIFG 1222 +Y+ +L LW+EL + S A E+ ++ ++GLN +++ Sbjct: 147 TYYGKLNVLWEELFKNEPLISCTCCSSCTAASLHQARREQGKLHD----FLMGLNTDLYA 202 Query: 1223 TVRSSITHEEPLPKLKQIMARHLQG-------GTTSTHDSDFNSGRERKYDHGFR*RWNK 1381 +R++I ++PLP L + +Q T ++ R R + + Sbjct: 203 QLRTNILSQDPLPSLDRAYQLVIQDERVRLAKAVTEDKPAEVLGFXVRTGAGRGRGKTER 262 Query: 1382 PVVQTRTKS-----------VCTHCQKQGHD 1441 PV K+ C HC K GHD Sbjct: 263 PVCSHXKKTGHETSTCWSXVACPHCHKHGHD 293 >gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein (gb|U12626) [Arabidopsis thaliana] Length = 1315 Score = 63.5 bits (153), Expect(2) = 2e-14 Identities = 33/90 (36%), Positives = 52/90 (57%), Gaps = 1/90 (1%) Frame = +1 Query: 766 GFIDGSIPKPEND-PXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERF 942 GF+DGSIPKP++D P N++V SW+LN++ + +SI Y +W DL RF Sbjct: 12 GFVDGSIPKPDDDDPYCKIWRRCNSMVKSWLLNSVSKEIYTSILYFPTAAAIWKDLYTRF 71 Query: 943 LVSNGPRKYELNAALANCK*GGDFVNVYHS 1032 S+ PR Y+L + + + G ++ YH+ Sbjct: 72 HKSSLPRLYKLRQQIHSLRQGNLDLSSYHT 101 Score = 43.5 bits (101), Expect(2) = 2e-14 Identities = 35/139 
(25%), Positives = 60/139 (43%), Gaps = 4/139 (2%) Frame = +2 Query: 1034 D*KSYHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEKVYINSSWD*MIIGLNDE 1213 D SYH+R + LW+EL + +P + + I ++E +V ++GLND Sbjct: 95 DLSSYHTRTQTLWEELTSLQAVPRTVEDLLI------ERETNRVID------FLMGLND- 141 Query: 1214 IFGTVRSSITHEEPLPKLKQIMARHLQGGTTSTHDSDFNSGRERKY----DHGFR*RWNK 1381 + TVRS I ++ LP L ++ Q T + G + + N Sbjct: 142 CYDTVRSQILMKKTLPSLSEVFNMIDQDETQRSARISTTPGMTSSVFPVSNQSSQSALNG 201 Query: 1382 PVVQTRTKSVCTHCQKQGH 1438 Q + + VC++C + GH Sbjct: 202 DTYQKKERPVCSYCSRPGH 220 >ref|XP_004515089.1| PREDICTED: uncharacterized protein LOC101500638 [Cicer arietinum] Length = 379 Score = 57.8 bits (138), Expect(2) = 2e-14 Identities = 43/180 (23%), Positives = 80/180 (44%), Gaps = 13/180 (7%) Frame = +1 Query: 547 ISKDKEESMASIDMEEKIHVKQRIVK------------NDAYYISSSDRSGLIFT*IQFK 690 + D + S +S +++ + +Q I K D +++ SD GL Sbjct: 1 MDSDHDTSSSSSSSDDRSNNQQHIKKFPNFNRSYQNDMMDPFFMHPSDNPGLALV----S 56 Query: 691 KNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPEN-DPXXXXXXXINTLVGSWILNTI 867 L N+ +AM ++ GF+ G+I +P++ D NT+V SWI N++ Sbjct: 57 PPLNNTNFHSWSRAMLVSLRSKNKSGFVLGTISRPKDTDRLSMAWDRCNTMVMSWIRNSL 116 Query: 868 ELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAALANCK*GGDFVNVYHS*LKKL 1047 E + SI + + E+W +L +R+ + R +L + + G + +Y + LKKL Sbjct: 117 ESDIAQSIMWMDSAAEIWHELNDRYHQGDIFRISDLQEEIYGLRQGDSSITIYFTNLKKL 176 Score = 49.3 bits (116), Expect(2) = 2e-14 Identities = 41/149 (27%), Positives = 61/149 (40%), Gaps = 18/149 (12%) Frame = +2 Query: 1046 YHSRLKKLWDELDNYTRMPSSA----INFEILAILTKDKEEEKVYINSSWD*MIIGLNDE 1213 Y + LKKLW EL+N+ +PS + + +L + + +E + V + GLN++ Sbjct: 169 YFTNLKKLWQELENFFPLPSCSCTPTCSCNLLPKIREYRENDYVIH------FLKGLNEQ 222 Query: 1214 IFGTVRSSITHEEPLPKLKQIMARHLQG--------------GTTSTHDSDFNSGRERKY 1351 + VRS I EPLP + ++ + LQ S H F G Sbjct: 223 -YSPVRSQIMLMEPLPTISKVFSMLLQQERQFFSHTEELKTVAVVSNHSRGFGRGSSLGS 281 Query: 1352 DHGFR*RWNKPVVQTRTKSVCTHCQKQGH 1438 G R R +CTHC K GH Sbjct: 282 GRGSGSR-------GRGYKICTHCNKSGH 303 >ref|XP_006589931.1| PREDICTED: 
uncharacterized protein LOC102669127 [Glycine max] Length = 656 Score = 86.7 bits (213), Expect = 2e-14 Identities = 49/142 (34%), Positives = 76/142 (53%), Gaps = 2/142 (1%) Frame = +1 Query: 628 DAYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPENDP 807 D ++I +D GL+ K L G NY ++M LA GF++GSIP P+++ Sbjct: 24 DPFHIHHTDNPGLVLV----SKPLDGLNYLTWRRSMILALDGRNKLGFVNGSIPIPDSND 79 Query: 808 XXXXXXXI--NTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNA 981 N++V SWILN++ + +S+ YS +W DLE+ F + NGPR ++L Sbjct: 80 TAKLHTWKRNNSIVASWILNSLIKEISASVIYSTSASNIWNDLEKHFNIKNGPRIFQLRK 139 Query: 982 ALANCK*GGDFVNVYHS*LKKL 1047 AL NC G + +N+Y + K L Sbjct: 140 ALLNCVQGTNSINIYFTRFKGL 161 >ref|XP_006589879.1| PREDICTED: uncharacterized protein LOC102665528 [Glycine max] Length = 298 Score = 86.7 bits (213), Expect = 2e-14 Identities = 52/140 (37%), Positives = 74/140 (52%), Gaps = 2/140 (1%) Frame = +1 Query: 634 YYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPE-NDPX 810 +YI +D L+ K L G NY +M LA GF+DGSIP P+ +D Sbjct: 23 FYIHHTDNPALVLV----SKPLDGLNYLTWWCSMILALDGQNKLGFVDGSIPIPDFSDTA 78 Query: 811 XXXXXXIN-TLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAAL 987 N ++V SWILN++ + +S+ YS +W DLE+RF + NGPR ++L AL Sbjct: 79 KLHMWKRNDSIVASWILNSLTKEISASVIYSTSASNIWNDLEKRFNIKNGPRIFQLRKAL 138 Query: 988 ANCK*GGDFVNVYHS*LKKL 1047 NC G D +N+Y + K L Sbjct: 139 LNCVQGTDSINIYFTRFKGL 158 >gb|EPS60009.1| hypothetical protein M569_14795, partial [Genlisea aurea] Length = 156 Score = 85.9 bits (211), Expect = 4e-14 Identities = 55/161 (34%), Positives = 82/161 (50%), Gaps = 1/161 (0%) Frame = +1 Query: 709 NYAELIKAMQLAXXXXXXXGFIDGSIPKPENDPXXXXXXXINTLVGSWILNTIELTLRSS 888 NY E KAM+ GF+DG+I + + +N+++ +WI+NT+E LR++ Sbjct: 2 NYDEWAKAMRAGLRAKKKYGFVDGTITERPPEISVDLWEQVNSMLVAWIINTVEPGLRTT 61 Query: 889 INYSEYVEELWVDLEERFLVSNGPRKYELNAALANCK*GGDFVNVYHS*LKKLSFAIEKV 1068 + ++ V LW DL+ERF VS+GPR +L LA C+ GGD V Y +KK + Sbjct: 62 VTITDLVFPLWNDLQERFCVSHGPRLTQLKIDLARCQQGGDSVVQYFGRMKKYWDEYTTL 121 Query: 1069 
MG*A*QLYTDAIICYKFR-NIGHSDQRQRGGKSIYQFFMGL 1188 G + C R N+ R+R I+QF MGL Sbjct: 122 DG------LPSCNCGGCRCNLNLQLNRKRESDKIHQFLMGL 156 >ref|XP_006471815.1| PREDICTED: uncharacterized protein LOC102606840 isoform X1 [Citrus sinensis] gi|568835521|ref|XP_006471816.1| PREDICTED: uncharacterized protein LOC102606840 isoform X2 [Citrus sinensis] gi|568835523|ref|XP_006471817.1| PREDICTED: uncharacterized protein LOC102606840 isoform X3 [Citrus sinensis] Length = 469 Score = 85.1 bits (209), Expect = 7e-14 Identities = 47/142 (33%), Positives = 76/142 (53%) Frame = +1 Query: 622 KNDAYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPEN 801 +ND + + SD ++ L GN Y ++ M +A GF+DG+I KP++ Sbjct: 243 ENDPFLVHPSDSPTIVLV----SPPLTGNKYGTWVRTMIMALQVRNKLGFVDGTITKPDD 298 Query: 802 DPXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNA 981 D N LV SW+LN+I L S+ Y++ ELW+DL+ERF +N + YEL Sbjct: 299 DDGGKWQRC-NDLVRSWVLNSISSELACSVLYAQSARELWLDLQERFQQTNASKIYELRQ 357 Query: 982 ALANCK*GGDFVNVYHS*LKKL 1047 A+++ + G V+ Y+ +K+L Sbjct: 358 AISSLRQGDVSVHHYYRRMKRL 379 >emb|CAN74230.1| hypothetical protein VITISV_000585 [Vitis vinifera] Length = 334 Score = 85.1 bits (209), Expect = 7e-14 Identities = 55/192 (28%), Positives = 91/192 (47%), Gaps = 3/192 (1%) Frame = +1 Query: 628 DAYYISSSDRSGLIFT*IQFKKNLKGNNYAELIKAMQLAXXXXXXXGFIDGSIPKPEN-D 804 D + + SD G++ K L+G+NY+ +AM+++ GF+ GSI P + D Sbjct: 19 DPFSLHHSDHPGMVLV----SKVLEGDNYSTWSRAMRISLSAKDKIGFVTGSIKPPSSTD 74 Query: 805 PXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFLVSNGPRKYELNAA 984 N +V SW+LN+I + SS+ Y+E E+W DL ERF N R Y++ Sbjct: 75 DSFPSWQRCNDMVISWLLNSIHPDIASSVIYAETASEIWADLRERFSQGNDSRIYQIKRD 134 Query: 985 LANCK*GGDFVNVYHS*LKKLSFAIEKVMG*A*QLYTDAIICY--KFRNIGHSDQRQRGG 1158 + + G ++VY++ LK + Y + + C + D+++R Sbjct: 135 IVEHRQGQQSISVYYTKLKAFXDELSS--------YHEVLSCSCGGLEKLKERDEKER-- 184 Query: 1159 KSIYQFFMGLND 1194 + QF MGLND Sbjct: 185 --VMQFLMGLND 194 >ref|XP_006393736.1| hypothetical protein EUTSA_v10012212mg, partial [Eutrema salsugineum] 
gi|557090314|gb|ESQ31022.1| hypothetical protein EUTSA_v10012212mg, partial [Eutrema salsugineum] Length = 159 Score = 75.1 bits (183), Expect(2) = 7e-14 Identities = 39/94 (41%), Positives = 57/94 (60%) Frame = +1 Query: 766 GFIDGSIPKPENDPXXXXXXXINTLVGSWILNTIELTLRSSINYSEYVEELWVDLEERFL 945 GFIDG++ KP +N+++ +WI+NTI+ TL S++ + +ELW DL+ F Sbjct: 14 GFIDGTLTKPTAAKELEQWEVVNSMLVAWIMNTIKPTLWISVSMVDEAKELWHDLKLHFS 73 Query: 946 VSNGPRKYELNAALANCK*GGDFVNVYHS*LKKL 1047 N PR EL+A +ANC+ GD V VY LKK+ Sbjct: 74 AGNRPRISELSADIANCRQHGDSVMVYFGKLKKM 107 Score = 30.4 bits (67), Expect(2) = 7e-14 Identities = 15/39 (38%), Positives = 24/39 (61%) Frame = +2 Query: 1046 YHSRLKKLWDELDNYTRMPSSAINFEILAILTKDKEEEK 1162 Y +LKK+WDEL Y + + + E+ L +D+EEE+ Sbjct: 100 YFGKLKKMWDELAIYKPIRTCSCG-ELKTQLEEDREEER 137