BLASTX nr result
ID: Cocculus23_contig00013921
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00013921
         (1102 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002272609.1| PREDICTED: uncharacterized protein LOC100258...  124  5e-26
ref|XP_006841433.1| hypothetical protein AMTR_s00003p00049560 [A...  118  5e-24
ref|XP_004295083.1| PREDICTED: uncharacterized protein LOC101308...  111  5e-22
emb|CAN76673.1| hypothetical protein VITISV_011790 [Vitis vinifera]  111  5e-22
ref|XP_007035794.1| Uncharacterized protein isoform 1 [Theobroma...  106  2e-20
gb|EXB71059.1| hypothetical protein L484_004194 [Morus notabilis]  103  1e-19
ref|XP_006597704.1| PREDICTED: uncharacterized protein LOC100816...  103  1e-19
ref|XP_006597703.1| PREDICTED: uncharacterized protein LOC100816...  103  1e-19
ref|XP_006597702.1| PREDICTED: uncharacterized protein LOC100816...  103  1e-19
ref|XP_006586892.1| PREDICTED: uncharacterized protein LOC100816...  101  5e-19
ref|XP_006586891.1| PREDICTED: uncharacterized protein LOC100816...  101  5e-19
ref|XP_007147362.1| hypothetical protein PHAVU_006G117800g [Phas...  97  2e-17
ref|XP_006349838.1| PREDICTED: uncharacterized protein LOC102584...  89  3e-15
ref|XP_006378540.1| hypothetical protein POPTR_0010s15520g [Popu...  89  3e-15
ref|XP_004253153.1| PREDICTED: uncharacterized protein LOC101259...  86  4e-14
ref|XP_003609773.1| Pre-mRNA polyadenylation factor fip1 [Medica...  83  2e-13
ref|XP_002519223.1| conserved hypothetical protein [Ricinus comm...  81  9e-13
tpg|DAA45032.1| TPA: hypothetical protein ZEAMMB73_268123 [Zea m...  75  4e-11
ref|XP_006651314.1| PREDICTED: uncharacterized protein LOC102703...
75 6e-11 ref|XP_003561693.1| PREDICTED: uncharacterized protein LOC100823... 73 2e-10 >ref|XP_002272609.1| PREDICTED: uncharacterized protein LOC100258583 [Vitis vinifera] gi|296083247|emb|CBI22883.3| unnamed protein product [Vitis vinifera] Length = 1300 Score = 124 bits (312), Expect = 5e-26 Identities = 105/287 (36%), Positives = 152/287 (52%), Gaps = 8/287 (2%) Frame = +3 Query: 120 SLSSKRKS----VHVKHGTSHVGLPFDDEQLERDRKRLTREESKYSQVSNRSSFNFGMIE 287 SLS +R S +H K+G++HVG+ +++ ++ R + RE + +RSS G Sbjct: 1039 SLSYERTSGHTRIHTKYGSAHVGMLVHNKKSQQQRYKRIRE-GRSDDFIDRSSNVLGQ-G 1096 Query: 288 TNEPANLRCRDSIEQHLIDWKGKSSRRLSKAGNA-RCSRNEKINHSVDEERITFRHSDES 464 +E A LR R S++ LI +GKSS R S+A +A R E ++ +DE++ + + Sbjct: 1097 NHEQAVLRSRASVD--LIVGEGKSSGRRSEARSAVHHDRFENMDWKIDEDQGILKDVNG- 1153 Query: 465 YLWNDVPSKGESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETV-LEEGQ 641 P +G+ I + D S+ NN +DK V + DE + +EEGQ Sbjct: 1154 ------PQRGKII-------------QPDLKSESNWNNEKCLDKFLVTEHDEALDIEEGQ 1194 Query: 642 LAIEPERETCSHPETNLFSVKTSQAGVAKEEKANSDNASENKI-GGVDSNRILETLAKME 818 + E E S ET S + + K N++ A+ NK+ D+ RIL+TLAKME Sbjct: 1195 IIPEEMNEDDS-VETKDASESITPSRNVKRRLGNANAANGNKVVAECDNQRILQTLAKME 1253 Query: 819 RRRERFKEPIALKKEPDKNLTIQAD-TVETTEAKQQRPARKRRWGGS 956 +R+ERFK+PI LKKEPDK Q D VE E QQRP RKRRW GS Sbjct: 1254 KRQERFKKPITLKKEPDKIPKPQVDPIVEMAETMQQRPLRKRRWNGS 1300 >ref|XP_006841433.1| hypothetical protein AMTR_s00003p00049560 [Amborella trichopoda] gi|548843454|gb|ERN03108.1| hypothetical protein AMTR_s00003p00049560 [Amborella trichopoda] Length = 1203 Score = 118 bits (295), Expect = 5e-24 Identities = 87/285 (30%), Positives = 148/285 (51%), Gaps = 18/285 (6%) Frame = +3 Query: 156 HGTSHVGLPFDDEQLERDRKRLTREESKYSQVSNRSSFNFGMIETNEPANLR---CRDSI 326 + +SHV +D++ ++ + LT + S+ S++ NR S + + ++ C++S+ Sbjct: 931 YDSSHVRKFVEDQRFDKVKNGLTGK-SRVSELCNRISSISNVYDIDKKHGQTATCCKESV 989 Query: 327 EQHLIDWKGKSSRRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDV-PSKGESI 503 H+I W+GK RR + A H ++E F SD+ ++ P + Sbjct: 990 
NFHMIGWEGKQPRRSTGA-----------RHIPEDEMADFPDSDQLQRGGEIGPRVVQDN 1038 Query: 504 SLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRV--NKQDETVLEEGQLAIE-PERETCS 674 +I SK+ERV ++ SD ++ +DK + NK+D + ++ +E P++ + Sbjct: 1039 HRQNINSKIERVSHRNKESSSDHSDDKWLDKFPITQNKEDGSGQQKKDAKVEEPKKIEVT 1098 Query: 675 HPETNLFSVKTSQAGVAKE-------EKANSDNASENK--IGGVDSNRILETLAKMERRR 827 S +T+ + + KE EKA+ A++N + +++ RILET+AKME+R+ Sbjct: 1099 KTVKKKVSKRTTPSSIIKERFSGSMNEKAHQKGANDNNKMVTKINNERILETMAKMEKRK 1158 Query: 828 ERFKEPIALKKEPDK--NLTIQADTVETTEAKQQRPARKRRWGGS 956 ERFKEPI KEP+K N + VE TE K QRP RKRRW G+ Sbjct: 1159 ERFKEPIVSNKEPEKISNAPSVSIQVEETEVKGQRPQRKRRWCGN 1203 >ref|XP_004295083.1| PREDICTED: uncharacterized protein LOC101308556 [Fragaria vesca subsp. vesca] Length = 408 Score = 111 bits (278), Expect = 5e-22 Identities = 93/279 (33%), Positives = 140/279 (50%), Gaps = 5/279 (1%) Frame = +3 Query: 132 KRKSVHVKHGTSHVGLPFDDEQLERDRKRLTREESKYSQVSNRSSFNFGMIETNEPANLR 311 + + H K+G G+ +D+ Q E+ R ++ R+E + V NRS M ++R Sbjct: 137 RHEKFHAKYGPLSDGMRYDNMQPEQRRLKMPRKEIGANFV-NRS---VKMYRGKHEQSVR 192 Query: 312 CRDSIEQHLIDWKGKSSRRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDVPSK 491 CR+S++ + + K + RCS+ + H+ E + E ++ + + Sbjct: 193 CRNSMDLAVRERKILT----------RCSKARNLMHNGRPENMGAEIGGE-WMTSGISQA 241 Query: 492 GESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETV-LEEGQLAIEPERET 668 ES + R K +N +QNN D V Q+ + +EEGQ+ + + T Sbjct: 242 CES--------EKARAVKITQNIIWNQNNKKGHDIFPVTAQNADLDIEEGQIVTQEQNTT 293 Query: 669 CSHP-ETNLFSVKTSQAGVAKEEKANSDNASENK--IGGVDSNRILETLAKMERRRERFK 839 HP + S T A + +S NAS+ + G D RIL+T+AKME+R ERFK Sbjct: 294 --HPLQRKHASDYTEPADSLIKGVFDSRNASKGNKVVEGYDKQRILQTMAKMEQRGERFK 351 Query: 840 EPIALKKEPDKNLTIQAD-TVETTEAKQQRPARKRRWGG 953 EPI LKKEPDK L + D TVET + KQ RPARKR+WGG Sbjct: 352 EPITLKKEPDKQLMPEVDPTVETADEKQHRPARKRQWGG 390 >emb|CAN76673.1| hypothetical protein VITISV_011790 [Vitis vinifera] Length = 1338 Score = 111 bits (278), Expect = 5e-22 Identities = 101/307 (32%), Positives = 153/307 (49%), Gaps = 28/307 (9%) Frame = +3 Query: 120 
SLSSKRKS----VHVKHGTSHVGLPFDDEQLERDRKRLTREESKYSQVSNRSSFNFGMIE 287 SLS +R S +H K+G++HVG+ +++ ++ R + RE + +RSS G Sbjct: 1039 SLSYERTSGHTRIHTKYGSAHVGMLVHNKKSQQQRYKRIRE-GRSDDFIDRSSNVLGQ-G 1096 Query: 288 TNEPANLRCRDSIEQHLIDWKGKSSRRLSKAGNARCSRNEKINHSVDEERITFRHSDESY 467 +E LR R S++ LI +GK AG+ + ++ ++H ++ + D Sbjct: 1097 NHEQXVLRSRASVD--LIVGEGKCVASAFMAGS-KAEYSQNVSHKIESFALA-PTKDLLS 1152 Query: 468 LWNDVPSKGESIS-LHHIRS-----KVE---------------RVGKCDRNHFSDQNNGM 584 N + E+ S +HH R K++ ++ + D S+ NN Sbjct: 1153 FENSSGRRSEARSAVHHDRFENMDWKIDEDQGILKDVNGPQRGKIIQPDLKSESNWNNEK 1212 Query: 585 SIDKSRVNKQDETV-LEEGQLAIEPERETCSHPETNLFSVKTSQAGVAKEEKANSDNASE 761 +DK V + DE + +EEGQ+ I E ET S + + K N++ A+ Sbjct: 1213 CLDKFLVTEHDEALDIEEGQI-IPEEMNXDDSVETKDASESITPSRNVKRRLGNANAANG 1271 Query: 762 NKI-GGVDSNRILETLAKMERRRERFKEPIALKKEPDKNLTIQAD-TVETTEAKQQRPAR 935 NK+ D+ RIL+TLAKME+R+ERFK+PI LKKEPDK Q D VE E QQRP R Sbjct: 1272 NKVVAECDNQRILQTLAKMEKRQERFKKPITLKKEPDKIPKPQVDPIVEMAETMQQRPLR 1331 Query: 936 KRRWGGS 956 KRRW GS Sbjct: 1332 KRRWNGS 1338 >ref|XP_007035794.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508714823|gb|EOY06720.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1247 Score = 106 bits (265), Expect = 2e-20 Identities = 98/303 (32%), Positives = 145/303 (47%), Gaps = 6/303 (1%) Frame = +3 Query: 66 WPRDKLRSWSR---GRVDAKESLSSKRKSVHVKHGTSHVGLPFDDEQLERDRKRLTREES 236 W +DKL R V +SK +H +HG+ + +D LE + E S Sbjct: 974 WTKDKLLGNDRLLAQWVSFSCQKTSKHDLIHARHGSLRDEMLINDLMLEHHGYEMITEGS 1033 Query: 237 KYSQVSNRSSFNFGMIETNEPANLRCRDSIEQHLIDWKGKSSRRLSKAGNARCS-RNEKI 413 N + I + L+ RDS++ LI +GKSS R G+ C+ R EKI Sbjct: 1034 ------NANCHEGNSIIRQKQKVLKDRDSVD--LIVGEGKSSVRHLDGGSLICNGRLEKI 1085 Query: 414 NHSVDEERITFRHSDESYLWNDVPSKGESISLHHIRSKVERVGKCDRNHFSDQNNGMSID 593 E+ + R ++S N V + IS +E+ + D+ ++ N + I+ Sbjct: 1086 GLEFPMEQKSLRDVNDSCGGNRVKT---DISNTDGSRTIEK--QLDKFSVAECNQDLDIE 1140 Query: 594 KSRVNKQDETVLEEGQLAIEPERETCSHPETNLFSVKTSQAGVAKEEKANSDNASENK-I 770 + +T+ EE + +E E + ET + Q K + D++ 
N+ + Sbjct: 1141 EG------QTICEEQSINLEKENVS----ETMV------QRSKVKMRTLHVDSSDGNRAV 1184 Query: 771 GGVDSNRILETLAKMERRRERFKEPIALKKEPDKNLTIQAD-TVETTEAKQQRPARKRRW 947 G D+ RI+ETLAKME+RRERFK+PI +K EPDK Q D V+T E K QRPARKRRW Sbjct: 1185 GEYDNKRIVETLAKMEKRRERFKDPITIKMEPDKTSEPQVDLVVDTNEIKHQRPARKRRW 1244 Query: 948 GGS 956 G S Sbjct: 1245 GVS 1247 >gb|EXB71059.1| hypothetical protein L484_004194 [Morus notabilis] Length = 1179 Score = 103 bits (258), Expect = 1e-19 Identities = 97/272 (35%), Positives = 130/272 (47%), Gaps = 8/272 (2%) Frame = +3 Query: 156 HGTSHVGLPFDDEQLERDRKRLTREESKYSQVSNRSSFNFGMIETNEPANLRCRDSIEQH 335 HG+ H + DD Q ++ R+ ++ S YS+ RS F NE A LRCRDS+ + Sbjct: 938 HGSLHDAMHIDDMQADKHGYRMIKDGS-YSRGIYRSQKMFRA--KNEQAFLRCRDSL--N 992 Query: 336 LIDWKGKSSRRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLHH 515 L GK SRR N C HS + +Y+ DV ES Sbjct: 993 LFVGGGKLSRRRPTDRNLSC-------HS---------RLEGTYI-EDV---NESSQYEA 1032 Query: 516 IRSKVERVG-KCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQLAIEP-ERETCSHPETN 689 ++S + +VG F DQ + ++ +EEGQ+ E R+ P + Sbjct: 1033 VQSNLPKVGLNLSNEDFHDQF-------PLAARNEDFDIEEGQIVTEEFYRDPLERPHDS 1085 Query: 690 LFSVKTSQAGVAKEEKANSDNASE-NKIGG-VDSNRILETLAKMERRRERFKEPIALKKE 863 + + +T K+ D AS +K GG D ILETLAKMERRRERFKEPIALK+E Sbjct: 1086 VSAARTESV---KKRMLEYDLASHGSKTGGQCDDQWILETLAKMERRRERFKEPIALKRE 1142 Query: 864 PDK----NLTIQADTVETTEAKQQRPARKRRW 947 DK ++ VET E KQ RPARKR+W Sbjct: 1143 QDKCAKPDIVPAPTIVETAETKQHRPARKRQW 1174 >ref|XP_006597704.1| PREDICTED: uncharacterized protein LOC100816009 isoform X3 [Glycine max] Length = 1101 Score = 103 bits (257), Expect = 1e-19 Identities = 95/279 (34%), Positives = 143/279 (51%), Gaps = 9/279 (3%) Frame = +3 Query: 147 HVKHGTS-HVGLPFDDEQLERDRKRLT-REESKYSQVSNRSSFNFGMIETNEPANLRCRD 320 H H TS + DD L++ + + R+ KY + S++ A LRCR Sbjct: 854 HETHATSLFTKVQSDDLPLQQHQLSMPKRDNEKYFKGSSKIMCR----SKGGQAVLRCRK 909 Query: 321 SIEQHLIDWKGKSSRRLSKAGNARCS-RNEKINHSVDEER----ITFRHSDESYLWNDVP 485 S++ LI +GKS R S+ C+ R E +N + ++R + F S+++ D P Sbjct: 910 
SVD--LIHGEGKSQVRSSRVS---CNGRLENVNQGIAKKRKRASVGFDESNKNTFKFDSP 964 Query: 486 SKGESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQL-AIEPER 662 K ES +++SK K +N DQ S D +EEGQ+ A EP Sbjct: 965 -KYES----NLKSK-----KWVQN-LQDQAQKESSD-----------IEEGQIVAEEPYM 1002 Query: 663 ETCSHPETNLFSVKTSQAGVAKEEKANSDNASENKIGGVDSNRILETLAKMERRRERFKE 842 E S + V K+ + ++N+S+ IGG DS RIL++LAKME+RRERFK+ Sbjct: 1003 EKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKMEKRRERFKQ 1062 Query: 843 PIALKKEPDKNLTIQADT-VETTEAKQQRPARKRRWGGS 956 P+ +KKE +++L + D+ V+T E KQ RP RKRRW G+ Sbjct: 1063 PMTMKKEAEESLKLNNDSIVDTGEMKQHRPTRKRRWVGN 1101 >ref|XP_006597703.1| PREDICTED: uncharacterized protein LOC100816009 isoform X2 [Glycine max] Length = 1101 Score = 103 bits (257), Expect = 1e-19 Identities = 95/279 (34%), Positives = 143/279 (51%), Gaps = 9/279 (3%) Frame = +3 Query: 147 HVKHGTS-HVGLPFDDEQLERDRKRLT-REESKYSQVSNRSSFNFGMIETNEPANLRCRD 320 H H TS + DD L++ + + R+ KY + S++ A LRCR Sbjct: 854 HETHATSLFTKVQSDDLPLQQHQLSMPKRDNEKYFKGSSKIMCR----SKGGQAVLRCRK 909 Query: 321 SIEQHLIDWKGKSSRRLSKAGNARCS-RNEKINHSVDEER----ITFRHSDESYLWNDVP 485 S++ LI +GKS R S+ C+ R E +N + ++R + F S+++ D P Sbjct: 910 SVD--LIHGEGKSQVRSSRVS---CNGRLENVNQGIAKKRKRASVGFDESNKNTFKFDSP 964 Query: 486 SKGESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQL-AIEPER 662 K ES +++SK K +N DQ S D +EEGQ+ A EP Sbjct: 965 -KYES----NLKSK-----KWVQN-LQDQAQKESSD-----------IEEGQIVAEEPYM 1002 Query: 663 ETCSHPETNLFSVKTSQAGVAKEEKANSDNASENKIGGVDSNRILETLAKMERRRERFKE 842 E S + V K+ + ++N+S+ IGG DS RIL++LAKME+RRERFK+ Sbjct: 1003 EKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKMEKRRERFKQ 1062 Query: 843 PIALKKEPDKNLTIQADT-VETTEAKQQRPARKRRWGGS 956 P+ +KKE +++L + D+ V+T E KQ RP RKRRW G+ Sbjct: 1063 PMTMKKEAEESLKLNNDSIVDTGEMKQHRPTRKRRWVGN 1101 >ref|XP_006597702.1| PREDICTED: uncharacterized protein LOC100816009 isoform X1 [Glycine max] Length = 1104 Score = 103 bits (257), Expect = 1e-19 Identities = 95/279 (34%), Positives = 143/279 (51%), Gaps = 9/279 (3%) Frame = +3 Query: 
147 HVKHGTS-HVGLPFDDEQLERDRKRLT-REESKYSQVSNRSSFNFGMIETNEPANLRCRD 320 H H TS + DD L++ + + R+ KY + S++ A LRCR Sbjct: 857 HETHATSLFTKVQSDDLPLQQHQLSMPKRDNEKYFKGSSKIMCR----SKGGQAVLRCRK 912 Query: 321 SIEQHLIDWKGKSSRRLSKAGNARCS-RNEKINHSVDEER----ITFRHSDESYLWNDVP 485 S++ LI +GKS R S+ C+ R E +N + ++R + F S+++ D P Sbjct: 913 SVD--LIHGEGKSQVRSSRVS---CNGRLENVNQGIAKKRKRASVGFDESNKNTFKFDSP 967 Query: 486 SKGESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQL-AIEPER 662 K ES +++SK K +N DQ S D +EEGQ+ A EP Sbjct: 968 -KYES----NLKSK-----KWVQN-LQDQAQKESSD-----------IEEGQIVAEEPYM 1005 Query: 663 ETCSHPETNLFSVKTSQAGVAKEEKANSDNASENKIGGVDSNRILETLAKMERRRERFKE 842 E S + V K+ + ++N+S+ IGG DS RIL++LAKME+RRERFK+ Sbjct: 1006 EKVSVSRRDASEGPAVTDSVNKKRMSQNENSSDQYIGGYDSQRILDSLAKMEKRRERFKQ 1065 Query: 843 PIALKKEPDKNLTIQADT-VETTEAKQQRPARKRRWGGS 956 P+ +KKE +++L + D+ V+T E KQ RP RKRRW G+ Sbjct: 1066 PMTMKKEAEESLKLNNDSIVDTGEMKQHRPTRKRRWVGN 1104 >ref|XP_006586892.1| PREDICTED: uncharacterized protein LOC100816396 isoform X2 [Glycine max] Length = 1094 Score = 101 bits (252), Expect = 5e-19 Identities = 91/277 (32%), Positives = 134/277 (48%), Gaps = 7/277 (2%) Frame = +3 Query: 147 HVKHGTS-HVGLPFDDEQLERDRKRLT-REESKYSQVSNRSSFNFGMIETNEPANLRCRD 320 H H TS + DD L+R + + R+ KY + S++ A LRCR Sbjct: 854 HETHATSLFAKVQSDDLPLQRHQLSMPIRDSEKYFKGSSKIMCR----SKGGQALLRCRK 909 Query: 321 SIEQHLIDWKGKSSRRLSKA-GNARCSR-NEKINHSVDEERITFRHSDESYLWNDVPSKG 494 S++ LI +GKS R S+ N R N++I + F S+++ D P Sbjct: 910 SVD--LIHGEGKSQVRSSRVLCNGRLENANQRIAKKRRRAAVGFDESNKNASKFDTPK-- 965 Query: 495 ESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETV-LEEGQ-LAIEPERET 668 H S+Q + + + Q E+ +EEGQ +A EP E Sbjct: 966 ---------------------HKSNQESKKWVQDLQDQAQKESSEIEEGQFVAEEPYMEE 1004 Query: 669 CSHPETNLFSVKTSQAGVAKEEKANSDNASENKIGGVDSNRILETLAKMERRRERFKEPI 848 S GV K+ + ++N+SE IGG DS RIL++LAKME+RRERFK+P+ Sbjct: 1005 ASEGPA-------VTDGVNKKRMSQNENSSEQCIGGYDSQRILDSLAKMEKRRERFKQPM 1057 Query: 849 ALKKEPDKNLTIQADT-VETTEAKQQRPARKRRWGGS 956 +KKE +++L + D+ V+ E KQ 
RPARKRRW G+ Sbjct: 1058 TMKKEAEESLKLNDDSIVDKGEMKQHRPARKRRWVGN 1094 >ref|XP_006586891.1| PREDICTED: uncharacterized protein LOC100816396 isoform X1 [Glycine max] Length = 1097 Score = 101 bits (252), Expect = 5e-19 Identities = 91/277 (32%), Positives = 134/277 (48%), Gaps = 7/277 (2%) Frame = +3 Query: 147 HVKHGTS-HVGLPFDDEQLERDRKRLT-REESKYSQVSNRSSFNFGMIETNEPANLRCRD 320 H H TS + DD L+R + + R+ KY + S++ A LRCR Sbjct: 857 HETHATSLFAKVQSDDLPLQRHQLSMPIRDSEKYFKGSSKIMCR----SKGGQALLRCRK 912 Query: 321 SIEQHLIDWKGKSSRRLSKA-GNARCSR-NEKINHSVDEERITFRHSDESYLWNDVPSKG 494 S++ LI +GKS R S+ N R N++I + F S+++ D P Sbjct: 913 SVD--LIHGEGKSQVRSSRVLCNGRLENANQRIAKKRRRAAVGFDESNKNASKFDTPK-- 968 Query: 495 ESISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETV-LEEGQ-LAIEPERET 668 H S+Q + + + Q E+ +EEGQ +A EP E Sbjct: 969 ---------------------HKSNQESKKWVQDLQDQAQKESSEIEEGQFVAEEPYMEE 1007 Query: 669 CSHPETNLFSVKTSQAGVAKEEKANSDNASENKIGGVDSNRILETLAKMERRRERFKEPI 848 S GV K+ + ++N+SE IGG DS RIL++LAKME+RRERFK+P+ Sbjct: 1008 ASEGPA-------VTDGVNKKRMSQNENSSEQCIGGYDSQRILDSLAKMEKRRERFKQPM 1060 Query: 849 ALKKEPDKNLTIQADT-VETTEAKQQRPARKRRWGGS 956 +KKE +++L + D+ V+ E KQ RPARKRRW G+ Sbjct: 1061 TMKKEAEESLKLNDDSIVDKGEMKQHRPARKRRWVGN 1097 >ref|XP_007147362.1| hypothetical protein PHAVU_006G117800g [Phaseolus vulgaris] gi|561020585|gb|ESW19356.1| hypothetical protein PHAVU_006G117800g [Phaseolus vulgaris] Length = 1101 Score = 96.7 bits (239), Expect = 2e-17 Identities = 83/250 (33%), Positives = 133/250 (53%), Gaps = 9/250 (3%) Frame = +3 Query: 225 REESKYSQVSNRSSFNFGMIETNEPANLRCRDSIEQHLIDWKGKSSRRLSKAGNARCSRN 404 +E KY + S++ + A LRCR S++ LID +GKS R S+ + R Sbjct: 880 QEAEKYFKASSKIMYR----SKGGQAVLRCRKSVD--LIDREGKSQVRSSRVLSN--GRL 931 Query: 405 EKINHSVDEERITFRHS---DESYLWNDVPSKGESISLHHIRSKVERVGKCDRNHFSDQN 575 E +N + ++R R S DES N SK ++ SK E C + + Q+ Sbjct: 932 ENVNQGIAKKRR--RDSVGFDES---NKRASKFDA-------SKYEGNLGCKKWIKNLQD 979 Query: 576 NGMSIDKSRVNKQDETVLEEGQLAIEPERETCSHPETNLFSVKTSQAGVA----KEEKAN 743 G +++ + +EEGQ+ + + + E 
++ S+ V K+ + Sbjct: 980 QG---------QKENSDIEEGQIVTQKWKSSIE--EASVARRDASKGPVVTDSVKKRMSP 1028 Query: 744 SDNASENKIGGVDSNRILETLAKMERRRERFKEPIALKKEPDKNLTIQADT--VETTEAK 917 ++ +S+ IGG DS RIL++LAKME+RRERFK+PI +KKE +++L + +D+ V+T+E K Sbjct: 1029 NEGSSDQCIGGYDSQRILDSLAKMEKRRERFKQPITMKKEAEESLKLNSDSSIVDTSEMK 1088 Query: 918 QQRPARKRRW 947 Q RP RKRRW Sbjct: 1089 QHRPVRKRRW 1098 >ref|XP_006349838.1| PREDICTED: uncharacterized protein LOC102584286 [Solanum tuberosum] Length = 1130 Score = 89.4 bits (220), Expect = 3e-15 Identities = 95/322 (29%), Positives = 148/322 (45%), Gaps = 14/322 (4%) Frame = +3 Query: 33 RRYFGQSMTFDWPRDKLRS-WSRGRVDAKESLSSKRKSVHVK--------HGTSHVGLPF 185 RR QS W D+ S + + DA+ + S R+S + HG + V Sbjct: 836 RRGGQQSEGMQWVEDENSSRYQQNIFDAERTSYSFRRSSSDRRFNSFDNNHGPNPVEKLL 895 Query: 186 DDEQLERDRKRLTREESKYSQVSNRSS-FNFGMIETNEPANLRCRDSIEQHLIDWKGKSS 362 DD +E+++ +L RE + SQ S F+ P R RDS++ LI G+SS Sbjct: 896 DDRHVEQEKYKLIREGNNASQFGQGSKVFHKDNHWRRFP---RGRDSVDTGLIVENGESS 952 Query: 363 RRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLHHIRSKVERVG 542 R SKAG + ++ +H + + + D + P +++ ++ + + Sbjct: 953 GRCSKAGGV--TSFDRYSHLDSDSYVELKPIDGT----SKPHFRKTLRTRNVTTDPKEND 1006 Query: 543 KCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQLAIEPERETCSHPETNLFSVKTSQAGV 722 K + FSD N S+D ++ ++EE I +R TCS Sbjct: 1007 KGRLDIFSDANQEESLDI-----EEGQIIEEMNEKIIKKRITCS---------------- 1045 Query: 723 AKEEKANSDN-ASENKIGGVDSN-RILETLAKMERRRERFKEPIALKKEPDKNLT--IQA 890 K + + N A + + G D+N RILE +AKME+R ERFK+PIALK + KN++ + Sbjct: 1046 GKSQISEMKNFAYDKNVEGQDNNPRILEIMAKMEKRGERFKQPIALKSD-TKNVSKPLVD 1104 Query: 891 DTVETTEAKQQRPARKRRWGGS 956 +TE Q RPARKRRW S Sbjct: 1105 SFALSTEPMQPRPARKRRWAAS 1126 >ref|XP_006378540.1| hypothetical protein POPTR_0010s15520g [Populus trichocarpa] gi|550329875|gb|ERP56337.1| hypothetical protein POPTR_0010s15520g [Populus trichocarpa] Length = 194 Score = 89.4 bits (220), Expect = 3e-15 Identities = 65/192 (33%), Positives = 94/192 (48%), Gaps = 3/192 (1%) Frame = +3 Query: 390 
RCSRNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLHHIRSKVERVGKCDRNHFSD 569 RCS + ++ F+ D + + SK + S I++ V G D+ + Sbjct: 16 RCSNGRSLM-----QKSMFKRMDLKFAKEPMCSKDFNESQTGIQTDVLETGGDDKEKW-- 68 Query: 570 QNNGMSIDKSRVNKQDETV-LEEGQLAIEPERETCSHPETNLFSVKTSQAGVAKEEKANS 746 I KS+V + +E + +E+GQ+ E + F Sbjct: 69 ------IGKSQVTEHNEKLNIEDGQIMAEESSMESKLAKKCAFKSVVPTCNAKNRNFLCE 122 Query: 747 DNASENKI-GGVDSNRILETLAKMERRRERFKEPIALKKEPDKNLTIQADT-VETTEAKQ 920 + +S NK G VDS RIL+T+AKME+RRERFK+PIA KKE DK Q + ++T A Q Sbjct: 123 NASSRNKNDGAVDSKRILDTIAKMEKRRERFKDPIAQKKELDKTSEPQVEVIIDTVPANQ 182 Query: 921 QRPARKRRWGGS 956 RPARKRRWGG+ Sbjct: 183 DRPARKRRWGGT 194 >ref|XP_004253153.1| PREDICTED: uncharacterized protein LOC101259137 [Solanum lycopersicum] Length = 1130 Score = 85.5 bits (210), Expect = 4e-14 Identities = 102/326 (31%), Positives = 136/326 (41%), Gaps = 18/326 (5%) Frame = +3 Query: 33 RRYFGQSMTFDWPRDKLRSWSRGRV-DAKESLSSKR--------KSVHVKHGTSHVGLPF 185 RR QS W D+ S + V DA+ + S R KS HG + V Sbjct: 837 RRGGRQSEGMQWVEDENNSGYQENVFDAERTSYSFRRTSSDKRFKSFDNNHGPNPVEKLL 896 Query: 186 DDEQLERDRKRLTREESKYSQVSNRSS-FNFGMIETNEPANLRCRDSIEQHLIDWKGKSS 362 DD +E+++ +L RE + +Q S F+ P R RDS++ LI G+SS Sbjct: 897 DDRHVEQEKYKLIREGNNANQFGQGSKVFHKDNHWRRFP---RGRDSVDTDLIVENGESS 953 Query: 363 RRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLHHIRSKVERVG 542 R SKAG S D + H D P G S H R + Sbjct: 954 GRCSKAGGVT---------SFDR----YGHLDSDCYLKLKPVDGTSKL--HFRETLRT-- 996 Query: 543 KCDRNHFSDQNNGMSIDKSRV------NKQDETVLEEGQLAIEPERETCSHPETNLFSVK 704 RN +D DK R+ N+++ +EEGQ+ E N VK Sbjct: 997 ---RNVTTDPKEN---DKERLAIFSDANQEESLDIEEGQII----------EEMNEKIVK 1040 Query: 705 TSQAGVAKEEKANSDNASENK-IGGVDSNRILETLAKMERRRERFKEPIALKKEPDKNLT 881 K E N + K + G S +ILE +AKME+R ERFK+PIALK + T Sbjct: 1041 KRITYSGKSEIGEMKNFATGKNVEGQGSPKILEIIAKMEKRGERFKQPIALKSDTKNIST 1100 Query: 882 IQADT-VETTEAKQQRPARKRRWGGS 956 D+ +TE Q RPARKRRW S Sbjct: 1101 PLVDSFAVSTEPMQPRPARKRRWAAS 1126 >ref|XP_003609773.1| Pre-mRNA polyadenylation factor fip1 [Medicago truncatula] 
gi|355510828|gb|AES91970.1| Pre-mRNA polyadenylation factor fip1 [Medicago truncatula] Length = 1110 Score = 82.8 bits (203), Expect = 2e-13 Identities = 78/284 (27%), Positives = 132/284 (46%), Gaps = 15/284 (5%) Frame = +3 Query: 147 HVKHGTSHVGLPFDDEQLERDRKRLTREESKYSQVSNRSSFNFGMIETNEPANLRCRDSI 326 H +H + H + +D +L++ + +R + + + S + + P LR + S Sbjct: 858 HARHRSLHARVQRNDIKLQQHQLNFSR---RGGDIFIKRSSKVMSRDHSHPTVLRYKKS- 913 Query: 327 EQHLIDWKGKSSRRLSKAGNARCSRNE---KINHSVDEERITFRHSDESYLWNDVPSKGE 497 LI+ +GKS++ +R RN+ ++ + E+R D+S + + Sbjct: 914 -GALINREGKSAK------GSRLMRNDTLQNVDRGIAEKRKALVGFDDS--------RKK 958 Query: 498 SISLHHIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQLAIEPERETCSH 677 +I L +S+ DQN + + S +++ +EEG++ E S Sbjct: 959 AIKLDVSKSQCV-----------DQNKKLLQNLSDKGQKEGLDVEEGEIVTEEPSVEVSV 1007 Query: 678 PETNLFSVKTSQAGVAKEEKANSDNASENKIGGVDSNRILETLAKMERRRERFKEPIALK 857 ++ T V K+ N +N SE +I +DS +IL+TLAKME+RRERFK+PI + Sbjct: 1008 SRRDVSEGATLAENVKKKISQNGNN-SEPQIDNLDSQKILDTLAKMEKRRERFKQPIGMN 1066 Query: 858 KEPDK------NLTIQA------DTVETTEAKQQRPARKRRWGG 953 KE K N +++ V+ E KQQRP RKRRW G Sbjct: 1067 KEAVKQPISLNNEVVKSLKLNTNSAVDIGEMKQQRPVRKRRWNG 1110 >ref|XP_002519223.1| conserved hypothetical protein [Ricinus communis] gi|223541538|gb|EEF43087.1| conserved hypothetical protein [Ricinus communis] Length = 1155 Score = 80.9 bits (198), Expect = 9e-13 Identities = 50/124 (40%), Positives = 73/124 (58%), Gaps = 1/124 (0%) Frame = +3 Query: 588 IDKSRVNKQDETV-LEEGQLAIEPERETCSHPETNLFSVKTSQAGVAKEEKANSDNASEN 764 +DK V+KQD + +EEGQ+ PE T + + +T + + +S N + Sbjct: 1037 LDKFPVSKQDGYLDIEEGQIV--PEEPTIGNRLEEKQAPETVSLMRSMKNAFHSGNMTNK 1094 Query: 765 KIGGVDSNRILETLAKMERRRERFKEPIALKKEPDKNLTIQADTVETTEAKQQRPARKRR 944 + D +ILE+LAKME+RRERFK+PIA K+EPDK + + ++KQ+RPARKRR Sbjct: 1095 RY---DDQQILESLAKMEKRRERFKDPIAFKREPDKPMKPIDLIADAIKSKQERPARKRR 1151 Query: 945 WGGS 956 W S Sbjct: 1152 WADS 1155 >tpg|DAA45032.1| TPA: hypothetical protein ZEAMMB73_268123 [Zea mays] Length = 598 Score = 75.5 bits (184), Expect = 
4e-11 Identities = 68/216 (31%), Positives = 98/216 (45%), Gaps = 8/216 (3%) Frame = +3 Query: 333 HLIDWKGKSSRRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLH 512 HL D K K R ++ N+K +H VD++ T RH Sbjct: 399 HLNDRKIKFEREGNELRRV-IEDNQKGSHPVDKDLHTSRHK------------------- 438 Query: 513 HIRSKVERVGKCDRNHFSDQNNGMSIDKSRVNKQDETVLEEGQLAIEPERETCSHPETNL 692 H+ K+ + R H +Q+ S D++ +NK E +EEG+L E + Sbjct: 439 HVHQKLWKQNLSHR-HSGNQDLEKSADQNCLNKDVE--IEEGELIEEDHNNIIYKSKLKQ 495 Query: 693 FSV------KTSQAGVAKEEKANSDNASENKIGGVDSNR--ILETLAKMERRRERFKEPI 848 +V +TS A + A S +A+ N +S+ ILE + KM++RRERFKE I Sbjct: 496 ENVVLKSVIETSSAEQLQVNNATSKDATCNNRATRESDEKHILEVMEKMQKRRERFKEAI 555 Query: 849 ALKKEPDKNLTIQADTVETTEAKQQRPARKRRWGGS 956 A KKE + A T + QRPARKRRWGG+ Sbjct: 556 APKKEVGDKKDLSALACSTDFIQNQRPARKRRWGGN 591 >ref|XP_006651314.1| PREDICTED: uncharacterized protein LOC102703384 [Oryza brachyantha] Length = 1066 Score = 74.7 bits (182), Expect = 6e-11 Identities = 60/196 (30%), Positives = 86/196 (43%), Gaps = 10/196 (5%) Frame = +3 Query: 399 RNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLHHIRSKVERVGKCDRNHFSDQNN 578 R K N E R E L D + H + + R + RN +++ Sbjct: 871 RKRKFNRQGIEIRREVESDSEGCLPADSDLHSSKLKSVHQKVRKPRSYRISRNQILEKSI 930 Query: 579 GMSIDKSRVNKQDETVLEEGQLAIEPERETCSHPETNLFS-------VKTSQAGVAKEEK 737 +N++ E + EEG+L + +T S + N S ++ S AG Sbjct: 931 QQKQQHVSINQECEEI-EEGELIEQDHHDTASRSKFNQRSKVVLRSVIEASSAGQGGMVN 989 Query: 738 ANSDNA--SENKIGGVDSNRILETLAKMERRRERFKEPIALKKEPDKN-LTIQADTVETT 908 A S +A S D ILE + KM++RRERFKEPIA +KE D++ + A T Sbjct: 990 ATSKDADCSNGATRECDDKHILEVMKKMQKRRERFKEPIAPQKEEDEHGKELLAATYSVD 1049 Query: 909 EAKQQRPARKRRWGGS 956 + K RPARKR WG S Sbjct: 1050 DMKNPRPARKRLWGCS 1065 >ref|XP_003561693.1| PREDICTED: uncharacterized protein LOC100823950 [Brachypodium distachyon] Length = 1045 Score = 73.2 bits (178), Expect = 2e-10 Identities = 71/270 (26%), Positives = 116/270 (42%), Gaps = 14/270 (5%) Frame = +3 Query: 189 DEQLERDRKRLTRE-ESKYSQVSNRSSFNFG-MIETNEPANLRCRDSIEQHLIDWKGKSS 362 D+ + DRK E S ++ 
++F M +N +N+ S E + K S Sbjct: 789 DQSVICDRKLYAMEVHSSTKEIGRADIYSFSDMRNSNTISNIHDERSHELVVFQPKDADS 848 Query: 363 RRLSKAGNARCSRNEKINHSVDEERITFRHSDESYLWNDVPSKGESISLHHIRSKVERVG 542 L+ R K +E R ++E L P++ + LH + K V Sbjct: 849 IHLN-------DRKRKFKRHGNEVRREVGRANEECL----PAEKD---LHSSKHKDVHVK 894 Query: 543 KCDRNHFSDQNNGMSIDKSRVNKQ----DETVLEEGQLAIEPERET-----CSHPETNLF 695 N + ++K+R K +E +EEG+L E +++ +HP Sbjct: 895 MQKLNGSYHDSVYQDLEKTRYQKSQNGNEEDEIEEGELIEEDHQDSFPKSKLNHPRKATL 954 Query: 696 S--VKTSQAGVAKEEKANSDNASENKIGG-VDSNRILETLAKMERRRERFKEPIALKKEP 866 ++ S AG + A S + + ++ D+ ILE + KM++RRERFKEP+ + + Sbjct: 955 KSVIEASSAGQLEMINAMSKDVCDKEVSWECDNKHILEVMEKMQKRRERFKEPVVTQNDE 1014 Query: 867 DKNLTIQADTVETTEAKQQRPARKRRWGGS 956 D + A + K RPARKRRWGGS Sbjct: 1015 DGKNELLAVACSANDIKNLRPARKRRWGGS 1044