BLASTX nr result
ID: Catharanthus22_contig00005213
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00005213 (2233 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004248266.1| PREDICTED: PAP-associated domain-containing ... 628 e-177 ref|XP_006362956.1| PREDICTED: PAP-associated domain-containing ... 625 e-176 ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing ... 595 e-167 ref|XP_002329093.1| predicted protein [Populus trichocarpa] gi|5... 577 e-162 ref|XP_006451882.1| hypothetical protein CICLE_v10008024mg [Citr... 575 e-161 gb|EOY12983.1| Nucleotidyltransferase family protein isoform 1 [... 575 e-161 gb|EOY12984.1| Nucleotidyltransferase family protein isoform 2 [... 570 e-160 ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing ... 570 e-159 ref|XP_002524282.1| nucleic acid binding protein, putative [Rici... 568 e-159 ref|XP_006464744.1| PREDICTED: LOW QUALITY PROTEIN: PAP-associat... 562 e-157 dbj|BAE71308.1| hypothetical protein [Trifolium pratense] 561 e-157 gb|ESW21437.1| hypothetical protein PHAVU_005G070800g [Phaseolus... 545 e-152 gb|EMJ12736.1| hypothetical protein PRUPE_ppa003914mg [Prunus pe... 543 e-152 gb|EXB51373.1| PAP-associated domain-containing protein 5 [Morus... 542 e-151 gb|EOY12986.1| Nucleotidyltransferase family protein isoform 4 [... 540 e-150 ref|XP_004487752.1| PREDICTED: PAP-associated domain-containing ... 539 e-150 ref|NP_568798.1| nucleotidyltransferase family protein [Arabidop... 509 e-141 ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 [Arab... 509 e-141 gb|EOY12985.1| Nucleotidyltransferase family protein isoform 3 [... 508 e-141 dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana] 504 e-140 >ref|XP_004248266.1| PREDICTED: PAP-associated domain-containing protein 5-like [Solanum lycopersicum] Length = 521 Score = 628 bits (1619), Expect = e-177 Identities = 337/519 (64%), Positives = 379/519 (73%), Gaps = 6/519 (1%) Frame = -1 Query: 2131 ESILYETLSPLSTADGXXXXXXXXXXXXXDL---EPYVVLRNEISLSAVQSSLDGTAAPD 1961 E ILYETL PLS A EPYVV RN+ISLS +Q TAAPD Sbjct: 4 EGILYETLRPLSAAGTTTTATDDIPPSLSSSDEHEPYVVFRNQISLSNLQCPSPETAAPD 63 Query: 1960 YFSLDLDAD--DIXXXXXXXXXXXXXXXT-KEPARTLEGNWFRANSRFKSPMLQLHKEIL 1790 YFSLDLD D D+ KE R LEGNWFRAN RFKSPMLQLH+EI+ Sbjct: 64 YFSLDLDGDASDLNNGSVSTPVPAATPLRDKEVERGLEGNWFRANCRFKSPMLQLHQEII 123 Query: 1789 DFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGS 1610 DFC+FLSPT EEQA R EA+E V +VIK+IWPNC+ EVFGSF+TGLYLPTSD+D+VILGS Sbjct: 124 DFCEFLSPTLEEQASRNEAVECVFNVIKYIWPNCKPEVFGSFKTGLYLPTSDVDLVILGS 183 Query: 1609 DIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAA 1430 +I++PQIGLQALSR LSQK V KKIQVI+KARVPIIKFVEKKSGI+FDISFDV+NGP AA Sbjct: 184 EIRSPQIGLQALSRALSQKGVAKKIQVISKARVPIIKFVEKKSGISFDISFDVENGPKAA 243 Query: 1429 EFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASL 1250 +FIKDA+S WP LRPLCLILK+FLQQRELNEVYTGGIGSYALL MLIAMLQ+++ +AS+ Sbjct: 244 DFIKDAMSSWPPLRPLCLILKVFLQQRELNEVYTGGIGSYALLVMLIAMLQNHRNGQASV 303 Query: 1249 EHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPE 1070 E NLGILLVNFFDIYGRKLNT+DVGVSCNGE FFLK KGFS GK LISIEDPQ PE Sbjct: 304 EENLGILLVNFFDIYGRKLNTSDVGVSCNGEATFFLKSCKGFSIKGKQSLISIEDPQTPE 363 Query: 1069 NDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGE 890 NDIGKSSFNYFQVRSAF+MAF LTN K I LGP RSILGTIIRPD L+ERKGGS+GE Sbjct: 364 NDIGKSSFNYFQVRSAFSMAFTTLTNAKAIFALGPNRSILGTIIRPDEVLVERKGGSNGE 423 Query: 889 GTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXX 710 T NLLPGAGE + Q+ D QE+YCNW+L+D N+E LPR N I E+ G Sbjct: 424 VTFTNLLPGAGEGLQQYG-DQQEIYCNWQLND-NEEALPRGNGIAENGGAESSGKKRKSS 481 Query: 709 XXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSR 593 KEN + E++ SR H+R Sbjct: 482 KDKQPAKKVKENGHSSHIRDEENSSRKEKSSKKHWKHNR 520 >ref|XP_006362956.1| PREDICTED: PAP-associated domain-containing protein 5-like [Solanum tuberosum] Length = 521 Score = 625 bits (1613), Expect = e-176 Identities = 335/519 (64%), Positives = 375/519 (72%), Gaps = 6/519 (1%) Frame = -1 Query: 2131 ESILYETLSPLSTADGXXXXXXXXXXXXXDL---EPYVVLRNEISLSAVQSSLDGTAAPD 1961 + ILYETL PLS A EPYVV RN+ISLS +Q TAAPD Sbjct: 4 DGILYETLRPLSAAGTTTTATDDFPPSLSSSDEHEPYVVFRNQISLSTIQCPSPETAAPD 63 Query: 1960 YFSLDLDADDIXXXXXXXXXXXXXXXT---KEPARTLEGNWFRANSRFKSPMLQLHKEIL 1790 YFSLDLD D KE R LEGNWFRAN RFKSPMLQLH+EI+ Sbjct: 64 YFSLDLDGDAADLNTSSVSTPVPAATPLPDKEVERGLEGNWFRANCRFKSPMLQLHQEII 123 Query: 1789 DFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGS 1610 DFC+FLSPT EEQA R EAIE V +VIK+IWPNC+ EVFGSF+TGLYLPTSD+D+VILGS Sbjct: 124 DFCEFLSPTLEEQASRNEAIECVFNVIKYIWPNCKPEVFGSFKTGLYLPTSDVDLVILGS 183 Query: 1609 DIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAA 1430 +I++PQIGLQALSR LSQK V KKIQVI+KARVPIIKFVEKKSGI+FDISFDV+NGP AA Sbjct: 184 EIRSPQIGLQALSRALSQKGVAKKIQVISKARVPIIKFVEKKSGISFDISFDVENGPKAA 243 Query: 1429 EFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASL 1250 EFIKDA+S WP LRPLCLILK+FLQQRELNEVYTGGIGSYALL MLIAMLQ+++ +AS Sbjct: 244 EFIKDAMSSWPPLRPLCLILKVFLQQRELNEVYTGGIGSYALLVMLIAMLQNHRNGQASA 303 Query: 1249 EHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPE 1070 E NLGILLVNFFDIYGRKLNT+DVGVSCNGEG FFLK KGFS GK LISIEDPQ PE Sbjct: 304 EENLGILLVNFFDIYGRKLNTSDVGVSCNGEGTFFLKSRKGFSIKGKQSLISIEDPQTPE 363 Query: 1069 NDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGE 890 NDIGKSSFNYFQVRSAF+MAF LTN K I LG +SILGTIIRPD L+ERKGGS+GE Sbjct: 364 NDIGKSSFNYFQVRSAFSMAFTTLTNAKAIFALGSNKSILGTIIRPDEVLVERKGGSNGE 423 Query: 889 GTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXX 710 T NLLPGAGE + Q+ D QE+YCNW+L+D+ +E LPR N I ED Sbjct: 424 VTFNNLLPGAGEGLQQYG-DQQEIYCNWQLNDD-EEALPRGNGIAEDGDAQSSGKKRKSS 481 Query: 709 XXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSR 593 KEN + E++ SR H+R Sbjct: 482 KDKQPAKKVKENGHSSSVRDEENSSRKEKSSKKHWKHNR 520 >ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing protein 5-like [Vitis vinifera] gi|302143015|emb|CBI20310.3| unnamed protein product [Vitis vinifera] Length = 497 Score = 595 bits (1535), Expect = e-167 Identities = 310/469 (66%), Positives = 356/469 (75%) Frame = -1 Query: 2143 METAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAP 1964 META S YETLSPLS +PY V RN+ISLS++ TAAP Sbjct: 1 META-SYFYETLSPLSPPPSDRSPPPSDES-----QPYYVYRNQISLSSLSYPSPETAAP 54 Query: 1963 DYFSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHKEILDF 1784 DYFSLD AD ++E A +E WFR NSR +SPML+LHKEILDF Sbjct: 55 DYFSLDARAD--VEEPSPARFRTPPPASEEEAPAVESGWFRGNSRLRSPMLKLHKEILDF 112 Query: 1783 CDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGSDI 1604 DFLSPTP+EQ+ R AIESV +VI++IWPNC+ EVFGSF+TGLYLPTSDID+VILGSDI Sbjct: 113 SDFLSPTPKEQSARNAAIESVFNVIRYIWPNCKVEVFGSFKTGLYLPTSDIDVVILGSDI 172 Query: 1603 QNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAAEF 1424 + PQIGL ALSR LSQK + KKIQVIAKARVPIIKF+EK+S +AFDISFDV+NGP AAE+ Sbjct: 173 KTPQIGLYALSRALSQKGIAKKIQVIAKARVPIIKFIEKRSSVAFDISFDVENGPKAAEY 232 Query: 1423 IKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASLEH 1244 I+DA+SKWP LRPLCLILK+FLQQRELNEVY+GGIGSYALLAMLIAMLQ+ Q AS+EH Sbjct: 233 IQDAISKWPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAMLQNLQEWNASVEH 292 Query: 1243 NLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPEND 1064 NLG+LLVNFFD YGRKLNT D+GV+CNG G FFLK KGF G+ +LISIEDPQ P ND Sbjct: 293 NLGVLLVNFFDFYGRKLNTVDIGVTCNGPGTFFLKSTKGFVNKGQKFLISIEDPQLPGND 352 Query: 1063 IGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGEGT 884 IGK+SFNYFQ+RSAF+MAF+ LTN +TILGL P RSILGTIIRPD LLERKGGS+G T Sbjct: 353 IGKNSFNYFQIRSAFSMAFSTLTNARTILGLDPNRSILGTIIRPDPILLERKGGSNGTMT 412 Query: 883 IKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVN 737 +LLPGAGE L QEL CNW+++D +EPLPRSN I D N Sbjct: 413 FDHLLPGAGEP-LSPQTGGQELLCNWQVEDAEEEPLPRSNPIAGDGSAN 460 >ref|XP_002329093.1| predicted protein [Populus trichocarpa] gi|566154024|ref|XP_006370267.1| hypothetical protein POPTR_0001s41140g [Populus trichocarpa] gi|550349446|gb|ERP66836.1| hypothetical protein POPTR_0001s41140g [Populus trichocarpa] Length = 543 Score = 577 bits (1487), Expect = e-162 Identities = 312/535 (58%), Positives = 371/535 (69%), Gaps = 11/535 (2%) Frame = -1 Query: 2122 LYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDG-TAAPDYFSLD 1946 LYETL+ L+ L+PY V RNEISLSA S+ +AAPD+FSLD Sbjct: 12 LYETLT-LTPLSPSPTATPIRSPLSDPLQPYSVFRNEISLSAFNSAAAAESAAPDFFSLD 70 Query: 1945 LDADDIXXXXXXXXXXXXXXXTKE--------PARTLEGNWFRANSRFKSPMLQLHKEIL 1790 + + D ++ P E WFR +S+F+SPMLQLHKEI+ Sbjct: 71 VGSGDEEELELKTPVNGEAKGKRKAEVETENLPEPMTESVWFRGDSKFRSPMLQLHKEIV 130 Query: 1789 DFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGS 1610 DFCDFLSPT EEQA R EA+ V VIK+IWPNC+ EVFGSFRTGLYLPTSDID+VILGS Sbjct: 131 DFCDFLSPTQEEQASRAEAVRCVFDVIKYIWPNCKVEVFGSFRTGLYLPTSDIDVVILGS 190 Query: 1609 DIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAA 1430 +++PQIGL ALSR LSQK V KKIQVIA+ARVPI+KFVEK+SG++FDISFDV GPIAA Sbjct: 191 GLKSPQIGLNALSRALSQKGVAKKIQVIARARVPIVKFVEKRSGVSFDISFDVNGGPIAA 250 Query: 1429 EFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASL 1250 EFIK+A+SKWP LRPLCLILK+FLQQRELNEVY+GGI SYALLAML+AMLQ+++ +ASL Sbjct: 251 EFIKNAISKWPELRPLCLILKVFLQQRELNEVYSGGISSYALLAMLMAMLQNHRECQASL 310 Query: 1249 EHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPE 1070 E NLG+LL++FFD YGRKLNT +VGVSC G G FF K+ KGF G+ +LI+IEDPQAPE Sbjct: 311 ERNLGLLLIHFFDFYGRKLNTTNVGVSCKGTGTFFSKRTKGFMNNGRPFLIAIEDPQAPE 370 Query: 1069 NDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGE 890 NDIGK+SFNYFQ+RSAFAMAF LTNPKTIL LGP RSILGTIIRPD LLERKGG +GE Sbjct: 371 NDIGKNSFNYFQIRSAFAMAFTTLTNPKTILSLGPNRSILGTIIRPDPVLLERKGGKNGE 430 Query: 889 GTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXX 710 T +LLPGAGE LQ + QE+ CNW+LDDE +E LPR D + Sbjct: 431 VTFSSLLPGAGEP-LQSNYGQQEILCNWQLDDE-EEALPRGGGDAGDGSAHSSGKKRKAS 488 Query: 709 XXXXXXXXXKENEDDRIGK--HEKSGSRTRSGKLSKQLHSRSHQDGGISSGYNGN 551 + D IGK H++SGS+ KQ ++ G S G+ Sbjct: 489 SKEKSRKKKSKENGD-IGKVRHDESGSKKEKSTKKKQRWRKNDSSKGFGSHAAGS 542 >ref|XP_006451882.1| hypothetical protein CICLE_v10008024mg [Citrus clementina] gi|557555108|gb|ESR65122.1| hypothetical protein CICLE_v10008024mg [Citrus clementina] Length = 516 Score = 575 bits (1483), Expect = e-161 Identities = 299/522 (57%), Positives = 363/522 (69%) Frame = -1 Query: 2143 METAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAP 1964 ME + +ILYE LSPL + +L+PY V RNEISL+ + + + + A Sbjct: 1 MEESHNILYEALSPLRGSPASDDPTLRQSPPPDELDPYTVFRNEISLTDLHCAAEESPAQ 60 Query: 1963 DYFSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHKEILDF 1784 D+FSLD++ + K +E WF+ NSRFKSPMLQLHKEI+DF Sbjct: 61 DFFSLDVNESGVDDVEEVEPKTPPA---KSAEPRMENRWFKGNSRFKSPMLQLHKEIVDF 117 Query: 1783 CDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGSDI 1604 CDFLSPT EE+ R A+E+V VIK+IWP C+ EVFGSFRTGLYLPTSDID+VI+ S I Sbjct: 118 CDFLSPTSEEREVRNTAVEAVFDVIKYIWPKCKPEVFGSFRTGLYLPTSDIDVVIMESGI 177 Query: 1603 QNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAAEF 1424 NP GLQALSR L Q+ + KKIQVIAKARVPI+KFVEKKSG++FDISFD QNGP AAEF Sbjct: 178 HNPATGLQALSRALLQRGIAKKIQVIAKARVPIVKFVEKKSGVSFDISFDAQNGPKAAEF 237 Query: 1423 IKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASLEH 1244 IKDA++K P LRPLCLILK+FLQQRELNEVY+GGIGSYALL M++A+L+ RAS EH Sbjct: 238 IKDALAKCPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMIMAVLKSLYECRASPEH 297 Query: 1243 NLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPEND 1064 NLGILLVNFFD YGRKLNT DVGVSC G G+FF K KGF+ G+ +LI+IEDPQAP+ND Sbjct: 298 NLGILLVNFFDFYGRKLNTTDVGVSCKGAGSFFKKSSKGFTNKGRPFLIAIEDPQAPDND 357 Query: 1063 IGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGEGT 884 IGK+SFNYFQ++SAFAMAF LTNPKTIL LGP RSILGTIIRPD LLERKGGS+GE T Sbjct: 358 IGKNSFNYFQIKSAFAMAFTTLTNPKTILSLGPNRSILGTIIRPDPVLLERKGGSNGEIT 417 Query: 883 IKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXXXX 704 NLLPGAGE + H D +E+ CNW+ D E +E PR N ++ +G Sbjct: 418 FNNLLPGAGEPLQTHFGDQREIMCNWQSDYE-EESFPRGNGSVQSSGKKRKAFSKEKSTS 476 Query: 703 XXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSHQDG 578 E++ + E + +SGK + ++ H +G Sbjct: 477 KKKTEETGESK----SREEGGSKKEKSGKKKRWRQNQGHANG 514 >gb|EOY12983.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] Length = 540 Score = 575 bits (1482), Expect = e-161 Identities = 317/534 (59%), Positives = 367/534 (68%), Gaps = 10/534 (1%) Frame = -1 Query: 2137 TAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDY 1958 +++ ILYETL+P+S EPY V RNEISL A S +AAPDY Sbjct: 8 SSQPILYETLTPISLPSSPAAQSPPFNEPP--FEPYTVFRNEISLLAENSISLDSAAPDY 65 Query: 1957 FSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGN-----WFRANSRFKSPMLQLHKEI 1793 FSLD++ K P E WFR NSRFKSPMLQLHKEI Sbjct: 66 FSLDVNDPAEPVIVQASVSAWDEPEPKTPGVVDEPRLENEWWFRGNSRFKSPMLQLHKEI 125 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFCDFLSPTPEEQA R A++SV VIK+IWP C+ EVFGSFRTGLYLPTSDID+VILG Sbjct: 126 VDFCDFLSPTPEEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILG 185 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 S I+NPQ GL ALSR LSQK + KK+QVIAKARVPI+KFVEKKS +AFDISFDV NGP A Sbjct: 186 SGIKNPQTGLHALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKA 245 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRAS 1253 A+FIK+AV KWP LRPLCLILK+FLQQR+LNEVY+GGIGSYALLAML+AMLQ +A Sbjct: 246 ADFIKEAVLKWPQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQSLHESQAY 305 Query: 1252 LEHNLGILLVNFFDIYGRKLNTADVGVSCNGE-GNFFLKKLKGFSTPGKHYLISIEDPQA 1076 EHNLGILLV+FFD YGRKLNTADVGVSCNG G FFLK +GFS G+ +LISIEDPQA Sbjct: 306 QEHNLGILLVHFFDFYGRKLNTADVGVSCNGRGGTFFLKSSRGFSNKGRPFLISIEDPQA 365 Query: 1075 PENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSS 896 P+NDIGK+SFN+ Q+RSAF MA + LTNPK IL LGP RSILGTIIRPD LLERKGGSS Sbjct: 366 PDNDIGKNSFNFIQIRSAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGSS 425 Query: 895 GEGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGV-NXXXXXX 719 G T +LLPGAGE + + Q++ CNW+LDDE EPLPR + I D + Sbjct: 426 GGVTFSSLLPGAGEPLQPLYGEQQDILCNWQLDDE--EPLPRGDGIDVDVSAQSSGRKRK 483 Query: 718 XXXXXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSH---QDGGISS 566 KEN D R HE++ + K H+ ++ + GG SS Sbjct: 484 SASKERSKKKKVKENGDARKVWHEETVFKKEKSTRKKGYHNDANGFGRHGGSSS 537 >gb|EOY12984.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] Length = 541 Score = 570 bits (1470), Expect = e-160 Identities = 317/535 (59%), Positives = 367/535 (68%), Gaps = 11/535 (2%) Frame = -1 Query: 2137 TAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDY 1958 +++ ILYETL+P+S EPY V RNEISL A S +AAPDY Sbjct: 8 SSQPILYETLTPISLPSSPAAQSPPFNEPP--FEPYTVFRNEISLLAENSISLDSAAPDY 65 Query: 1957 FSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGN-----WFRANSRFKSPMLQLHKEI 1793 FSLD++ K P E WFR NSRFKSPMLQLHKEI Sbjct: 66 FSLDVNDPAEPVIVQASVSAWDEPEPKTPGVVDEPRLENEWWFRGNSRFKSPMLQLHKEI 125 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFCDFLSPTPEEQA R A++SV VIK+IWP C+ EVFGSFRTGLYLPTSDID+VILG Sbjct: 126 VDFCDFLSPTPEEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILG 185 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 S I+NPQ GL ALSR LSQK + KK+QVIAKARVPI+KFVEKKS +AFDISFDV NGP A Sbjct: 186 SGIKNPQTGLHALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKA 245 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAML-QDYQTRRA 1256 A+FIK+AV KWP LRPLCLILK+FLQQR+LNEVY+GGIGSYALLAML+AML Q +A Sbjct: 246 ADFIKEAVLKWPQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQQSLHESQA 305 Query: 1255 SLEHNLGILLVNFFDIYGRKLNTADVGVSCNGE-GNFFLKKLKGFSTPGKHYLISIEDPQ 1079 EHNLGILLV+FFD YGRKLNTADVGVSCNG G FFLK +GFS G+ +LISIEDPQ Sbjct: 306 YQEHNLGILLVHFFDFYGRKLNTADVGVSCNGRGGTFFLKSSRGFSNKGRPFLISIEDPQ 365 Query: 1078 APENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGS 899 AP+NDIGK+SFN+ Q+RSAF MA + LTNPK IL LGP RSILGTIIRPD LLERKGGS Sbjct: 366 APDNDIGKNSFNFIQIRSAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGS 425 Query: 898 SGEGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGV-NXXXXX 722 SG T +LLPGAGE + + Q++ CNW+LDDE EPLPR + I D + Sbjct: 426 SGGVTFSSLLPGAGEPLQPLYGEQQDILCNWQLDDE--EPLPRGDGIDVDVSAQSSGRKR 483 Query: 721 XXXXXXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSH---QDGGISS 566 KEN D R HE++ + K H+ ++ + GG SS Sbjct: 484 KSASKERSKKKKVKENGDARKVWHEETVFKKEKSTRKKGYHNDANGFGRHGGSSS 538 >ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cucumis sativus] Length = 544 Score = 570 bits (1468), Expect = e-159 Identities = 310/534 (58%), Positives = 359/534 (67%), Gaps = 11/534 (2%) Frame = -1 Query: 2140 ETAESILYETLSPLS-TADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAP 1964 E + LY+TLSPLS +A DLEPY V RNEISLS + TAA Sbjct: 5 EAVQHYLYDTLSPLSFSAITTTTTGDQLSSPDVDLEPYSVFRNEISLSTPDCAPAETAAT 64 Query: 1963 DYFSLDLDAD---------DIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPML 1811 ++F+LD+ AD E LE WFR NS KSPML Sbjct: 65 EFFALDVAADKGEENSGICSSPLPVTSALETEPRTPECEDQSRLESGWFRGNSGLKSPML 124 Query: 1810 QLHKEILDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDI 1631 QLHKEI+DFC+FLSPT EE+ R A+E V SV+KHIWP+C+ EVFGSF+TGLYLPTSDI Sbjct: 125 QLHKEIVDFCEFLSPTEEERVARDSAVERVFSVVKHIWPHCKVEVFGSFQTGLYLPTSDI 184 Query: 1630 DIVILGSDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDV 1451 D+VILGS I PQ+GLQALSR LSQK + KKIQVI KARVPIIKF+EK+SGI+FDISFDV Sbjct: 185 DVVILGSGIPKPQLGLQALSRALSQKGIAKKIQVIGKARVPIIKFIEKQSGISFDISFDV 244 Query: 1450 QNGPIAAEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDY 1271 QNGP AA+FIK AVSKWP LRPLCLILK+FLQQRELNEVY+GG+GSYALL ML+AMLQ Sbjct: 245 QNGPKAADFIKGAVSKWPPLRPLCLILKVFLQQRELNEVYSGGLGSYALLTMLMAMLQSI 304 Query: 1270 QTRRASLEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISI 1091 +SLEHNLG+LLV+FFD YGRKLNT+DVGVSCN G FF K +GF T G+ L+SI Sbjct: 305 NVPPSSLEHNLGVLLVHFFDFYGRKLNTSDVGVSCNAGGIFFSKSYRGFMTKGRPCLLSI 364 Query: 1090 EDPQAPENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLER 911 EDPQAP+NDIGK+SFNYFQ+RSAFAMA++ LTN KT+LGLGP RSILGTIIRPD LL+R Sbjct: 365 EDPQAPDNDIGKNSFNYFQIRSAFAMAYSILTNVKTVLGLGPNRSILGTIIRPDPVLLKR 424 Query: 910 KGGSSGEGTIKNLLPGAGEAVLQ-HSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNX 734 KGG GE T +LLPGAGE V Q D QE+ CNW+ DE EPLPR N E+ G Sbjct: 425 KGGRHGEVTFNSLLPGAGEPVQQPEYGDDQEMLCNWQFGDE--EPLPRGNDTPENVGTPS 482 Query: 733 XXXXXXXXXXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSHQDGGI 572 + R HE +GSR K+L G+ Sbjct: 483 SKKQRKTREKSRKKEKESHSSKRR---HEDNGSRKEQSSKKKRLRQNDSDANGL 533 >ref|XP_002524282.1| nucleic acid binding protein, putative [Ricinus communis] gi|223536473|gb|EEF38121.1| nucleic acid binding protein, putative [Ricinus communis] Length = 526 Score = 568 bits (1465), Expect = e-159 Identities = 309/519 (59%), Positives = 363/519 (69%), Gaps = 4/519 (0%) Frame = -1 Query: 2122 LYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDYFSLDL 1943 LY+TLSPLS P+ V RNEISLS SS + APD+FSLD+ Sbjct: 18 LYQTLSPLSLPTPDQSPRSDDDGDHRHPNPFSVFRNEISLSTANSSAIESVAPDFFSLDV 77 Query: 1942 --DADDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHKEILDFCDFLS 1769 A + LE +WFR NSRF+SPMLQLHKEI+DFCDFLS Sbjct: 78 VEAAAEPKTPSVVAEPRKSKAAQSVSETKLESSWFRGNSRFRSPMLQLHKEIVDFCDFLS 137 Query: 1768 PTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGSDIQNPQI 1589 PTPEE+ R A++ V VIK+IWPNC+ EVFGS++TGLYLPTSDID+VI S I+NPQI Sbjct: 138 PTPEEEDARNTAVKCVFDVIKYIWPNCKVEVFGSYKTGLYLPTSDIDVVIFRSGIKNPQI 197 Query: 1588 GLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAAEFIKDAV 1409 GLQALSR LSQK + KKIQVIAKARVPI+KFVEK+SG++FDISFDV NGP AAEFIKDAV Sbjct: 198 GLQALSRALSQKGIAKKIQVIAKARVPIVKFVEKRSGVSFDISFDVDNGPKAAEFIKDAV 257 Query: 1408 SKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASLEHNLGIL 1229 KWPALRPL LILK+FLQQRELNEVY+GGIGSYALL ML+A+L +AS EHNLG+L Sbjct: 258 RKWPALRPLSLILKVFLQQRELNEVYSGGIGSYALLTMLMAVL------KASSEHNLGVL 311 Query: 1228 LVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPENDIGKSS 1049 LV FFD YGRKLNT DVGVSC G G FF K+ KGF G+ +LI+IEDPQAP+NDIGK+S Sbjct: 312 LVYFFDFYGRKLNTTDVGVSCKGAGTFFSKRKKGFMNKGRPFLIAIEDPQAPDNDIGKNS 371 Query: 1048 FNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGEGTIKNLL 869 FNY Q+RSAF+MAF+ LTNP+TIL LGP RSILGTIIRPD+ LLERK G +GE T +LL Sbjct: 372 FNYSQIRSAFSMAFSTLTNPRTILSLGPNRSILGTIIRPDSILLERKAGCNGEVTFSSLL 431 Query: 868 PGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXXXXXXXXX 689 PGAGE + H D QE+ NW+LDD+ +E LPR I ED+G Sbjct: 432 PGAGELIQSH-YDHQEILGNWQLDDD-EEVLPRGGGIAEDSGAQ----SSGKKRKSSKDK 485 Query: 688 XXKENEDDRIGK--HEKSGSRTRSGKLSKQLHSRSHQDG 578 K E+ IGK HE+SGSR + K + H+R +G Sbjct: 486 STKREENGSIGKVSHEESGSR-KDRKKQRWRHNRDDVNG 523 >ref|XP_006464744.1| PREDICTED: LOW QUALITY PROTEIN: PAP-associated domain-containing protein 5-like [Citrus sinensis] Length = 516 Score = 562 bits (1448), Expect = e-157 Identities = 297/523 (56%), Positives = 359/523 (68%), Gaps = 1/523 (0%) Frame = -1 Query: 2143 METAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAP 1964 ME + +ILYE LSPL + +L+ Y V RNEISL+ + + + + A Sbjct: 1 MEESHNILYEALSPLRGSQASDDPTLRQSPPPDELDHYTVFRNEISLTDLHCAAEESPAQ 60 Query: 1963 DYFSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHKEILDF 1784 D+FSLD++ + K +E WF+ NSRFKSPMLQLHKEI+DF Sbjct: 61 DFFSLDVNESGVDDVEEVEPKTPPA---KSAEPRMENRWFKGNSRFKSPMLQLHKEIVDF 117 Query: 1783 CDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGSDI 1604 CDFLSPT EE+ R A+E+V VIK+IWP C+ EVFGSFRTGLYLPTSDID+VI+ S I Sbjct: 118 CDFLSPTSEEREVRNTAVEAVFDVIKYIWPKCKPEVFGSFRTGLYLPTSDIDVVIMESGI 177 Query: 1603 QNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAAEF 1424 NP GLQALSR L Q+ + KKIQVIAKARVPI+KFVEKKSG++FDISFD QNGP AAEF Sbjct: 178 HNPATGLQALSRALLQRGIAKKIQVIAKARVPIVKFVEKKSGVSFDISFDAQNGPKAAEF 237 Query: 1423 IKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASLEH 1244 IKDA++ P LRPLCLILK+FLQQRELNEVY+GGIGSYALL M++A+L+ RAS EH Sbjct: 238 IKDALANCPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMIMAVLKSLYKCRASPEH 297 Query: 1243 NLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPEND 1064 NLGILLVNFFD YGRKL T DVGVSC G G+FF K KGF+ G+ +LI+IEDPQAP+N Sbjct: 298 NLGILLVNFFDFYGRKLKTTDVGVSCKGAGSFFKKSSKGFTNKGRPFLIAIEDPQAPDNA 357 Query: 1063 IGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGEGT 884 IGK+SFNYFQ++SAFAMAF LTNPKTIL L P RSILGTIIRPD LLERKGGS+GE T Sbjct: 358 IGKNSFNYFQIKSAFAMAFTTLTNPKTILSLXPNRSILGTIIRPDPVLLERKGGSNGEIT 417 Query: 883 IKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXXXX 704 +LLPGAGE + H D +E+ CNW+ D E +E PR N ++ G Sbjct: 418 FNSLLPGAGEPLKTHFGDQREIMCNWQSDYE-EESFPRGNGSVQSCGKRRKAFSKEKSTS 476 Query: 703 XXXXXXXKENEDDRIGKHEKSGS-RTRSGKLSKQLHSRSHQDG 578 E++ HE+ GS + +SGK +R H +G Sbjct: 477 KKKTEEIGESK-----SHEEGGSKKEKSGKKKCWRQNRGHANG 514 >dbj|BAE71308.1| hypothetical protein [Trifolium pratense] Length = 518 Score = 561 bits (1446), Expect = e-157 Identities = 303/524 (57%), Positives = 367/524 (70%), Gaps = 6/524 (1%) Frame = -1 Query: 2143 METAESILYETLSPLS-TADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAA 1967 ++ E+ILY TLSPL TAD E Y V RNEISL Q + A Sbjct: 4 LQIPETILYTTLSPLPLTADDPPDSNNH--------EQYSVFRNEISLDTPQVDSVYSTA 55 Query: 1966 PDYFSLDL----DADDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHK 1799 PD+FSLD+ +A+D +P TLEG WFR N +F+SPMLQLHK Sbjct: 56 PDFFSLDVADEAEAEDPLPEPKTPAEPKTPAIEHKP--TLEGGWFRGNGKFRSPMLQLHK 113 Query: 1798 EILDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVI 1619 EI+DFC+FLSPTPEE+A R AIESV VIKHIWP+CQ E+FGSFRTGLYLPTSDID+VI Sbjct: 114 EIVDFCEFLSPTPEEKAKRDAAIESVFEVIKHIWPHCQVEIFGSFRTGLYLPTSDIDVVI 173 Query: 1618 LGSDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGP 1439 L S + NPQIGL A+SR LSQ+ + KKIQVI KARVPIIKFVEKKSG++FDISFD+ NGP Sbjct: 174 LKSGLPNPQIGLNAISRSLSQRSMAKKIQVIGKARVPIIKFVEKKSGLSFDISFDIDNGP 233 Query: 1438 IAAEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRR 1259 AAE+I++AV+KWP LRPLCLILK+FLQQRELNEVY+GGIGSYALL ML+AML++ + + Sbjct: 234 KAAEYIQEAVAKWPQLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMLMAMLRNVRQSQ 293 Query: 1258 ASLEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQ 1079 + EHNLG+LLV+FFD YGRKLNT+DVGVSC GEG FF K +GF + +L+ I+DPQ Sbjct: 294 PTAEHNLGVLLVHFFDFYGRKLNTSDVGVSCIGEGTFFRKSSRGFYNKTRPFLLGIQDPQ 353 Query: 1078 APENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGS 899 P+NDIGK+SFNYFQVRSAF MAF LTNPK IL LGP RSILGTIIRPD L+ERKGGS Sbjct: 354 TPDNDIGKNSFNYFQVRSAFLMAFTTLTNPKVILSLGPNRSILGTIIRPDPVLMERKGGS 413 Query: 898 SGEGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXX 719 +GE T +LLPGAGE + Q ++ CNW+LD E +EPLPR + G N Sbjct: 414 NGEMTFNSLLPGAGEPI-QQQYGEHDMLCNWQLDFE-EEPLPRGD------GENTGAEPS 465 Query: 718 XXXXXXXXXXXXKENEDDRIG-KHEKSGSRTRSGKLSKQLHSRS 590 KEN+++R K++++ S T +G K R+ Sbjct: 466 RRSSKKKRKSASKENKENRDSRKNKENSSMTENGVHKKHKKKRA 509 >gb|ESW21437.1| hypothetical protein PHAVU_005G070800g [Phaseolus vulgaris] Length = 522 Score = 545 bits (1405), Expect = e-152 Identities = 289/460 (62%), Positives = 339/460 (73%), Gaps = 6/460 (1%) Frame = -1 Query: 2131 ESILYETLSPL--STADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDY 1958 ++ +Y+TL PL S AD EPY V RNEIS+ Q +L + D+ Sbjct: 10 KTFVYDTLCPLALSAADSPFPDHH---------EPYSVYRNEISVDTPQCALPTSTTVDF 60 Query: 1957 FSLDLDADDIXXXXXXXXXXXXXXXTKEPART----LEGNWFRANSRFKSPMLQLHKEIL 1790 FSLD+ A + K P LE WF N +FKSPMLQLHKEI+ Sbjct: 61 FSLDV-ASEAYGHESLPEPLAATPEPKTPTPAPEPKLESVWFGGNCKFKSPMLQLHKEIV 119 Query: 1789 DFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGS 1610 DFC+FLSPT E+A R AIESV VIKHIWP+CQ EVFGSFRTGLYLPTSDID+VIL S Sbjct: 120 DFCEFLSPTAAEKAVRDMAIESVFGVIKHIWPHCQVEVFGSFRTGLYLPTSDIDVVILKS 179 Query: 1609 DIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAA 1430 + NPQIGL A+S+ LSQ+ + K+IQVI KARVPIIKFVEK SG+AFDISFD+ NGP AA Sbjct: 180 GLPNPQIGLNAISKALSQRSMAKRIQVIGKARVPIIKFVEKISGLAFDISFDIDNGPKAA 239 Query: 1429 EFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASL 1250 E+I++AV KWP LRPLCLILK+FLQQRELNEVY+GGIGSYALLAML+AML++ + +AS Sbjct: 240 EYIQEAVLKWPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLMAMLRNLRLSQASA 299 Query: 1249 EHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPE 1070 EHNLG+LLV+FFD YGRKLN++DVGVSCNG G FF+K KGF G+ LISIEDPQAPE Sbjct: 300 EHNLGVLLVHFFDFYGRKLNSSDVGVSCNGTGTFFVKSSKGFLNKGRPSLISIEDPQAPE 359 Query: 1069 NDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGE 890 NDIGK+SFNYFQ+RSAF+MAF LTNPK I+ LGP RSILGTIIRPD LLERKGG +G+ Sbjct: 360 NDIGKNSFNYFQIRSAFSMAFKNLTNPKIIMSLGPNRSILGTIIRPDPVLLERKGGLNGD 419 Query: 889 GTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPR 770 T LLPGAGE LQ Q++ CNW+LD E +EPLPR Sbjct: 420 VTFDKLLPGAGEP-LQQQYGEQDMLCNWQLDYE-EEPLPR 457 >gb|EMJ12736.1| hypothetical protein PRUPE_ppa003914mg [Prunus persica] Length = 540 Score = 543 bits (1400), Expect = e-152 Identities = 288/468 (61%), Positives = 337/468 (72%), Gaps = 12/468 (2%) Frame = -1 Query: 2131 ESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDYFS 1952 + LYETL LS DLE Y V RNE++LS Q + TAAPD+FS Sbjct: 7 QGFLYETLPALSLPT------PNQSPPPDDLESYSVFRNEVTLSTPQCAPVDTAAPDFFS 60 Query: 1951 LDLDADDIXXXXXXXXXXXXXXXTK------------EPARTLEGNWFRANSRFKSPMLQ 1808 LD+ AD+ E LE WFR +S+FKSPMLQ Sbjct: 61 LDVGADEAEPNWASPSRTLAAEPRTPLHQYEPTTPALEVEPKLESGWFRGHSKFKSPMLQ 120 Query: 1807 LHKEILDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDID 1628 LHKEI+DFC+FLSPTPEEQ R A+E V VIK+IWP C+ EVFGSF+TGLYLP SDID Sbjct: 121 LHKEIVDFCEFLSPTPEEQEARTSAVERVSQVIKYIWPRCKVEVFGSFKTGLYLPASDID 180 Query: 1627 IVILGSDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQ 1448 +VI+ S I PQ GLQALSR LSQ + KKIQVI KAR+PIIKFVEK SGIAFDISFD++ Sbjct: 181 VVIMRSGIPTPQQGLQALSRALSQMGLAKKIQVIGKARIPIIKFVEKTSGIAFDISFDIE 240 Query: 1447 NGPIAAEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQ 1268 +GP AA+FI+DAVSKWP LRPLCLILK+FLQQRELNEVY+GG+GSYALL ML+AML ++ Sbjct: 241 SGPKAADFIQDAVSKWPPLRPLCLILKVFLQQRELNEVYSGGLGSYALLTMLMAMLHSHR 300 Query: 1267 TRRASLEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIE 1088 +AS E NLG+LLVNFFD YGRKLNT+DVGVSC G G FF K +KGF T G+ +LI+IE Sbjct: 301 ECQASSEQNLGVLLVNFFDFYGRKLNTSDVGVSCKGAGTFFKKSVKGFITKGRPFLIAIE 360 Query: 1087 DPQAPENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERK 908 DPQAPEND+GK+SFNYFQ+RSAF+MA+ LTNPK IL LGP RSILGTIIRPD L+ERK Sbjct: 361 DPQAPENDVGKNSFNYFQIRSAFSMAYTTLTNPKVILCLGPNRSILGTIIRPDPTLVERK 420 Query: 907 GGSSGEGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSN 764 GG G +LLPGAG+ LQ D QE CNW+LDD+ D+PLPR + Sbjct: 421 GG-PGLVAFDSLLPGAGKP-LQLEHDGQEFMCNWQLDDD-DDPLPRGD 465 >gb|EXB51373.1| PAP-associated domain-containing protein 5 [Morus notabilis] Length = 521 Score = 542 bits (1397), Expect = e-151 Identities = 303/556 (54%), Positives = 356/556 (64%), Gaps = 19/556 (3%) Frame = -1 Query: 2140 ETAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPD 1961 ET+++ LYETLSPL+ + LEP+ V RNEISLS++ S+ T D Sbjct: 3 ETSQNFLYETLSPLALSSANQSPPPDD------LEPFTVFRNEISLSSLPSASPATTTQD 56 Query: 1960 YFSLDLDADDIXXXXXXXXXXXXXXXTKEPART----LEGNWFRANSRFKSPMLQLHKEI 1793 +FSLD+ AD K PAR LE WFR NS+FKSPMLQLHKEI Sbjct: 57 FFSLDVGADGSDSVPASPAPPRQAAEPKTPAREAEPRLESGWFRGNSKFKSPMLQLHKEI 116 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFC+FLSPTPEEQ R AIE V VIK+IWPNC+ EVFGSF+TGLYLP+SDID+VILG Sbjct: 117 VDFCEFLSPTPEEQDARNAAIERVFDVIKYIWPNCKVEVFGSFKTGLYLPSSDIDVVILG 176 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 + I NPQ GLQALSR LSQ+ + KK+QVIAKARVPIIKFVEKKSG+AFDISFDVQNGP+A Sbjct: 177 AGIPNPQQGLQALSRALSQRSLVKKMQVIAKARVPIIKFVEKKSGVAFDISFDVQNGPVA 236 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRAS 1253 AEFIKD VSK P LRPLCLILK+FLQQRELNE Sbjct: 237 AEFIKDVVSKMPPLRPLCLILKVFLQQRELNE------------------------SLRE 272 Query: 1252 LEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAP 1073 E NLG++LVNFFD YGRKLNT+DVGVSCNG G FF K KGF+TPG+ +LISI+DPQA Sbjct: 273 PEGNLGVILVNFFDFYGRKLNTSDVGVSCNGGGTFFSKISKGFATPGRPFLISIQDPQAS 332 Query: 1072 ENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSG 893 ENDIGK+SFNYFQ+RSAF+MAF LTNP+ I+ LGP RSILGTIIRPDA LLERKGGS+ Sbjct: 333 ENDIGKNSFNYFQIRSAFSMAFTTLTNPRIIMDLGPNRSILGTIIRPDAVLLERKGGSNR 392 Query: 892 EGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXX 713 + T +LLPGAGE L QE+ CNW+LDDE EPLPR + D Sbjct: 393 QVTFDSLLPGAGEP-LNTQYGQQEMLCNWQLDDE--EPLPRGGDLAGDPSEYSSGKKRRA 449 Query: 712 XXXXXXXXXXKE---------------NEDDRIGKHEKSGSRTRSGKLSKQLHSRSHQDG 578 + N D +H ++G +R K+ ++ SH + Sbjct: 450 SAKEKSGKKKVKDNGDVGSARHRENGYNGDVGSSRHRENGYGSRKEKIKEKRFRHSHGNA 509 Query: 577 GISSGYNGNGRVSSPW 530 NG GR SPW Sbjct: 510 ------NGYGRSVSPW 519 >gb|EOY12986.1| Nucleotidyltransferase family protein isoform 4 [Theobroma cacao] Length = 525 Score = 540 bits (1391), Expect = e-150 Identities = 306/534 (57%), Positives = 353/534 (66%), Gaps = 10/534 (1%) Frame = -1 Query: 2137 TAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDY 1958 +++ ILYETL+P+S EPY V RNEISL A S +AAPDY Sbjct: 8 SSQPILYETLTPISLPSSPAAQSPPFNEPP--FEPYTVFRNEISLLAENSISLDSAAPDY 65 Query: 1957 FSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGN-----WFRANSRFKSPMLQLHKEI 1793 FSLD++ K P E WFR NSRFKSPMLQLHKEI Sbjct: 66 FSLDVNDPAEPVIVQASVSAWDEPEPKTPGVVDEPRLENEWWFRGNSRFKSPMLQLHKEI 125 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFCDFLSPTPEEQA R A++SV VIK+IWP C+ EVFGSFRTGLYLPTSDID+VILG Sbjct: 126 VDFCDFLSPTPEEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILG 185 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 S I+NPQ GL ALSR LSQK + KK+QVIAKARVPI+KFVEKKS +AFDISFDV NGP A Sbjct: 186 SGIKNPQTGLHALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKA 245 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRAS 1253 A+FIK+AV KWP LRPLCLILK+FLQQR+LNEVY+GGIGSYALLAML+AMLQ +A Sbjct: 246 ADFIKEAVLKWPQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQSLHESQAY 305 Query: 1252 LEHNLGILLVNFFDIYGRKLNTADVGVSCNGE-GNFFLKKLKGFSTPGKHYLISIEDPQA 1076 EHNLGILLV+FFD YGRKLNTADVGVSCNG G FFLK +GFS G+ +LISIEDP Sbjct: 306 QEHNLGILLVHFFDFYGRKLNTADVGVSCNGRGGTFFLKSSRGFSNKGRPFLISIEDP-- 363 Query: 1075 PENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSS 896 Q+RSAF MA + LTNPK IL LGP RSILGTIIRPD LLERKGGSS Sbjct: 364 -------------QIRSAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGSS 410 Query: 895 GEGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGV-NXXXXXX 719 G T +LLPGAGE + + Q++ CNW+LDDE EPLPR + I D + Sbjct: 411 GGVTFSSLLPGAGEPLQPLYGEQQDILCNWQLDDE--EPLPRGDGIDVDVSAQSSGRKRK 468 Query: 718 XXXXXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSH---QDGGISS 566 KEN D R HE++ + K H+ ++ + GG SS Sbjct: 469 SASKERSKKKKVKENGDARKVWHEETVFKKEKSTRKKGYHNDANGFGRHGGSSS 522 >ref|XP_004487752.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cicer arietinum] Length = 513 Score = 539 bits (1388), Expect = e-150 Identities = 296/521 (56%), Positives = 357/521 (68%), Gaps = 5/521 (0%) Frame = -1 Query: 2131 ESILYETLSPLS-TADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDYF 1955 E+I+Y T +PLS TAD + V RN ISL Q + APD+F Sbjct: 12 ETIVYTTTTPLSLTADDFPDSDNH--------DQCSVFRNVISLDTPQCDSVYSTAPDFF 63 Query: 1954 SLDL----DADDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHKEILD 1787 SLD+ +A+D EP TLE WFR N +F+SPMLQLHKEI+D Sbjct: 64 SLDVADEGEAEDPIPEPVTPAEPKTPALAPEP--TLESGWFRGNCKFRSPMLQLHKEIVD 121 Query: 1786 FCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILGSD 1607 FC+FLSPTPEE+A R AIESV +VIKHIWP+CQ EVFGSFRTGLYLPTSDID+VIL S Sbjct: 122 FCEFLSPTPEEKAKRDTAIESVFAVIKHIWPHCQVEVFGSFRTGLYLPTSDIDVVILRSG 181 Query: 1606 IQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIAAE 1427 + NPQIGL A+SR LSQ+ + KKIQVI KARVPIIKFVEK S ++FDISFD++NGP AAE Sbjct: 182 LPNPQIGLNAISRALSQRSMAKKIQVIGKARVPIIKFVEKTSSLSFDISFDIENGPKAAE 241 Query: 1426 FIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRASLE 1247 +I++AV+ P LRPLCLILK+FLQQRELNEVY+GGIGSYALL ML+A+L++ + + S E Sbjct: 242 YIQEAVANCPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMLMAVLRNVRQSQTSAE 301 Query: 1246 HNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAPEN 1067 HNLG+LLV+FFD YGRKLNT+DVGVSCNG G FFLK +GF + L+ I Q P+N Sbjct: 302 HNLGVLLVHFFDFYGRKLNTSDVGVSCNGAGTFFLKSSRGFYNKARPSLLGIWLNQTPDN 361 Query: 1066 DIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSGEG 887 DIGK+SFNYFQVRSAF MAF LTNPK IL LGP RSILGTIIRPD L+ERKGGS+GE Sbjct: 362 DIGKNSFNYFQVRSAFLMAFTTLTNPKVILNLGPNRSILGTIIRPDPVLMERKGGSNGEM 421 Query: 886 TIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXXXXXXX 707 T +LLPGAGE + Q Q++ CNW+LD E +EPLPR ++ + A Sbjct: 422 TFNSLLPGAGEPI-QQQYGEQDMLCNWQLDFE-EEPLPRGDSTRKSAS------------ 467 Query: 706 XXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSHQ 584 KEN D R+ + ++GS T +G K R Q Sbjct: 468 --KENGKPKENGDSRMVNNNENGSVTENGVHKKHKKKRVKQ 506 >ref|NP_568798.1| nucleotidyltransferase family protein [Arabidopsis thaliana] gi|27754278|gb|AAO22592.1| unknown protein [Arabidopsis thaliana] gi|332009022|gb|AED96405.1| nucleotidyltransferase family protein [Arabidopsis thaliana] Length = 530 Score = 509 bits (1310), Expect = e-141 Identities = 276/468 (58%), Positives = 326/468 (69%), Gaps = 9/468 (1%) Frame = -1 Query: 2134 AESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDYF 1955 A + +Y+TL PLS +D Y V R EIS ++ +A D+F Sbjct: 10 APAFVYDTLPPLSFSDSNQSPPPTHEES----HQYSVFRKEISDFPDDTTPVESATVDFF 65 Query: 1954 SLDLDADDIXXXXXXXXXXXXXXXTKEPART------LEGNWFRANSRFKSPMLQLHKEI 1793 SLD++ + K R LE NWF NS K PMLQLHKEI Sbjct: 66 SLDVEGETTENGVEPVTPVVVASKKKSKKRKKDEEPRLESNWFSENSFSKIPMLQLHKEI 125 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFCDFL PT E+A R A+ESV SVIK+IWP+C+ EVFGS++TGLYLPTSDID+VIL Sbjct: 126 VDFCDFLLPTQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDVVILE 185 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 S + NPQ+GL+ALSR LSQ+ + K + VIAKARVPIIKFVEKKS IAFD+SFD++NGP A Sbjct: 186 SGLTNPQLGLRALSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKA 245 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRAS 1253 AEFI+DAVSK P LRPLCLILK+FLQQRELNEVY+GGIGSYALLAMLIA L+ + R++ Sbjct: 246 AEFIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKYLKDGRSA 305 Query: 1252 LEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQAP 1073 EHNLG+LLV FFD YGRKLNTADVG+SC G+FF K KGF + LISIEDPQ P Sbjct: 306 PEHNLGVLLVKFFDFYGRKLNTADVGISCKMGGSFFSKYNKGFLNRARPSLISIEDPQTP 365 Query: 1072 ENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSSG 893 ENDIGKSSFNYFQ+RSAFAMA + LTN K IL LGP RSILGTIIRPD L ERKGG +G Sbjct: 366 ENDIGKSSFNYFQIRSAFAMALSTLTNTKAILSLGPNRSILGTIIRPDRVLSERKGGQNG 425 Query: 892 EGTIKNLLPGAGEAVLQHSEDPQE--LYCNWRLDDENDE-PLPRSNAI 758 + T +LLPGAGE + S L+CNW L++E +E PR N I Sbjct: 426 DVTFNSLLPGAGEPLPLESNGKTNGGLFCNWELEEEEEEGSFPRGNDI 473 >ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp. lyrata] gi|297310108|gb|EFH40532.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp. lyrata] Length = 530 Score = 509 bits (1310), Expect = e-141 Identities = 285/531 (53%), Positives = 343/531 (64%), Gaps = 9/531 (1%) Frame = -1 Query: 2134 AESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDYF 1955 A + +Y+TL PLS +D Y V R EIS V ++ +A D+F Sbjct: 10 APAFVYDTLPPLSFSDSNQSPPTHDES-----HQYSVFRKEISDFTVATTPVESATVDFF 64 Query: 1954 SLDLDA-------DDIXXXXXXXXXXXXXXXTKEPARTLEGNWFRANSRFKSPMLQLHKE 1796 SLD+D + + K+ LE NWF NS K PMLQLHKE Sbjct: 65 SLDVDGGTTENGVEPVTPVVVASSKKKSKKRKKDEEPRLESNWFSENSFSKIPMLQLHKE 124 Query: 1795 ILDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVIL 1616 I+DFCDFL PT E+A R A+ESV SVI +IWP+C+ EVFGS++TGLYLPTSDID+VIL Sbjct: 125 IVDFCDFLLPTQAEKAERDAAVESVSSVITYIWPSCKVEVFGSYKTGLYLPTSDIDVVIL 184 Query: 1615 GSDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPI 1436 S + NPQ+GL+ALSR LSQ+ + K + VIAKARVPIIKFVEKKS IAFD+SFD++NGP Sbjct: 185 ESGLTNPQLGLRALSRALSQRGIAKNLVVIAKARVPIIKFVEKKSNIAFDLSFDMENGPK 244 Query: 1435 AAEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRA 1256 AAEFI+DAVSK P LRPLCLILK+FLQQRELNEVY+GGIGSYALLAMLIA L+ + R+ Sbjct: 245 AAEFIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKYLKDGRS 304 Query: 1255 SLEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDPQA 1076 + EHNLG+LLV FFD YGRKLNTADVGVSC G+FF K KGF + LISIEDPQ Sbjct: 305 APEHNLGVLLVKFFDFYGRKLNTADVGVSCKTGGSFFSKYDKGFLNRARPGLISIEDPQT 364 Query: 1075 PENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSS 896 PENDIGKSSFNYFQ+RSAFAMA + LTN K IL LGP RSILGTIIRPD L ERKGG + Sbjct: 365 PENDIGKSSFNYFQIRSAFAMALSTLTNTKAILSLGPNRSILGTIIRPDRILSERKGGKN 424 Query: 895 GEGTIKNLLPGAGE--AVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGVNXXXXX 722 G+ T +LLPGAGE + +S+ L+CNW L+++ + PR + D Sbjct: 425 GDITFNSLLPGAGEPLPMASNSKTNGGLFCNWELEEDEEGSFPRGSTTNGD--------- 475 Query: 721 XXXXXXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSHQDGGIS 569 D GK K SR + K SK+ ++G S Sbjct: 476 -------------ITPVVDTPGKKSKESSRKKKKKSSKKEVDEEEEEGASS 513 >gb|EOY12985.1| Nucleotidyltransferase family protein isoform 3 [Theobroma cacao] Length = 507 Score = 508 bits (1308), Expect = e-141 Identities = 294/534 (55%), Positives = 338/534 (63%), Gaps = 10/534 (1%) Frame = -1 Query: 2137 TAESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDY 1958 +++ ILYETL+P+S EPY V RNEISL A S +AAPDY Sbjct: 8 SSQPILYETLTPISLPSSPAAQSPPFNEPP--FEPYTVFRNEISLLAENSISLDSAAPDY 65 Query: 1957 FSLDLDADDIXXXXXXXXXXXXXXXTKEPARTLEGN-----WFRANSRFKSPMLQLHKEI 1793 FSLD++ K P E WFR NSRFKSPMLQLHKEI Sbjct: 66 FSLDVNDPAEPVIVQASVSAWDEPEPKTPGVVDEPRLENEWWFRGNSRFKSPMLQLHKEI 125 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFCDFLSPTPEEQA R A++SV VIK+IWP C+ EVFGSFRTGLYLPTSDID+VILG Sbjct: 126 VDFCDFLSPTPEEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILG 185 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 S I+NPQ GL ALSR LSQK + KK+QVIAKARVPI+KFVEKKS +AFDISFDV NGP A Sbjct: 186 SGIKNPQTGLHALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKA 245 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAMLQDYQTRRAS 1253 A+FIK+AV KWP LRPLCLILK+FLQQR+LNEVY+GGIGSYALLAML+AMLQ +A Sbjct: 246 ADFIKEAVLKWPQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQSLHESQAY 305 Query: 1252 LEHNLGILLVNFFDIYGRKLNTADVGVSCNGE-GNFFLKKLKGFSTPGKHYLISIEDPQA 1076 EHNLGILLV+FFD YGRKLNTADVGVSCNG G FFLK +G Sbjct: 306 QEHNLGILLVHFFDFYGRKLNTADVGVSCNGRGGTFFLKSSRG----------------- 348 Query: 1075 PENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGGSS 896 SAF MA + LTNPK IL LGP RSILGTIIRPD LLERKGGSS Sbjct: 349 ----------------SAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGSS 392 Query: 895 GEGTIKNLLPGAGEAVLQHSEDPQELYCNWRLDDENDEPLPRSNAILEDAGV-NXXXXXX 719 G T +LLPGAGE + + Q++ CNW+LDDE EPLPR + I D + Sbjct: 393 GGVTFSSLLPGAGEPLQPLYGEQQDILCNWQLDDE--EPLPRGDGIDVDVSAQSSGRKRK 450 Query: 718 XXXXXXXXXXXXKENEDDRIGKHEKSGSRTRSGKLSKQLHSRSH---QDGGISS 566 KEN D R HE++ + K H+ ++ + GG SS Sbjct: 451 SASKERSKKKKVKENGDARKVWHEETVFKKEKSTRKKGYHNDANGFGRHGGSSS 504 >dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana] Length = 533 Score = 504 bits (1297), Expect = e-140 Identities = 278/471 (59%), Positives = 327/471 (69%), Gaps = 12/471 (2%) Frame = -1 Query: 2134 AESILYETLSPLSTADGXXXXXXXXXXXXXDLEPYVVLRNEISLSAVQSSLDGTAAPDYF 1955 A + +Y+TL PLS +D Y V R EIS ++ +A D+F Sbjct: 10 APAFVYDTLPPLSFSDSNQSPPPTHEES----HQYSVFRKEISDFPDDTTPVESATVDFF 65 Query: 1954 SLDLDADDIXXXXXXXXXXXXXXXTKEPART------LEGNWFRANSRFKSPMLQLHKEI 1793 SLD++ + K R LE NWF NS K PMLQLHKEI Sbjct: 66 SLDVEGETTENGVEPVTPVVVASKKKSKKRKKDEEPRLESNWFSENSFSKIPMLQLHKEI 125 Query: 1792 LDFCDFLSPTPEEQAGRKEAIESVVSVIKHIWPNCQAEVFGSFRTGLYLPTSDIDIVILG 1613 +DFCDFL PT E+A R A+ESV SVIK+IWP+C+ EVFGS++TGLYLPTSDID+VIL Sbjct: 126 VDFCDFLLPTQAEKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDVVILE 185 Query: 1612 SDIQNPQIGLQALSRLLSQKRVGKKIQVIAKARVPIIKFVEKKSGIAFDISFDVQNGPIA 1433 S + NPQ+GL+ALSR LSQ+ + K + VIAKARVPIIKFVEKKS IAFD+SFD++NGP A Sbjct: 186 SGLTNPQLGLRALSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKA 245 Query: 1432 AEFIKDAVSKWPALRPLCLILKIFLQQRELNEVYTGGIGSYALLAMLIAML--QDY-QTR 1262 AEFI+DAVSK P LRPLCLILK+FLQQRELNEVY+GGIGSYALLAMLIA L Q Y + Sbjct: 246 AEFIQDAVSKLPPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKVQVYLKDG 305 Query: 1261 RASLEHNLGILLVNFFDIYGRKLNTADVGVSCNGEGNFFLKKLKGFSTPGKHYLISIEDP 1082 R++ EHNLG+LLV FFD YGRKLNTADVG+SC G+FF K KGF + LISIEDP Sbjct: 306 RSAPEHNLGVLLVKFFDFYGRKLNTADVGISCKMGGSFFSKYNKGFLNRARPSLISIEDP 365 Query: 1081 QAPENDIGKSSFNYFQVRSAFAMAFNALTNPKTILGLGPERSILGTIIRPDAALLERKGG 902 Q PENDIGKSSFNYFQ+RSAFAMA + LTN K IL LGP RSILGTIIRPD L ERKGG Sbjct: 366 QTPENDIGKSSFNYFQIRSAFAMALSTLTNTKAILSLGPNRSILGTIIRPDRVLSERKGG 425 Query: 901 SSGEGTIKNLLPGAGEAVLQHSEDPQE--LYCNWRLDDENDE-PLPRSNAI 758 +G+ T +LLPGAGE + S L+CNW L++E +E PR N I Sbjct: 426 QNGDVTFNSLLPGAGEPLPLESNGKTNGGLFCNWELEEEEEEGSFPRGNDI 476