BLASTX nr result
ID: Mentha26_contig00013945
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00013945
         (1182 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EYU44940.1| hypothetical protein MIMGU_mgv1a004370mg [Mimulus...   507   e-141
ref|XP_006362956.1| PREDICTED: PAP-associated domain-containing ...   493   e-137
ref|XP_004248266.1| PREDICTED: PAP-associated domain-containing ...   477   e-132
ref|XP_006451882.1| hypothetical protein CICLE_v10008024mg [Citr...   461   e-127
ref|XP_006370267.1| hypothetical protein POPTR_0001s41140g [Popu...   456   e-126
ref|XP_007021458.1| Nucleotidyltransferase family protein isofor...   451   e-124
ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing ...   450   e-124
ref|XP_006464744.1| PREDICTED: LOW QUALITY PROTEIN: PAP-associat...   448   e-123
ref|XP_007021459.1| Nucleotidyltransferase family protein isofor...   446   e-123
ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing ...   446   e-122
dbj|BAE71308.1| hypothetical protein [Trifolium pratense]             435   e-119
ref|XP_007211537.1| hypothetical protein PRUPE_ppa003914mg [Prun...   433   e-119
ref|XP_002524282.1| nucleic acid binding protein, putative [Rici...   433   e-119
ref|XP_006280286.1| hypothetical protein CARUB_v10026211mg [Caps...   428   e-117
ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 [Arab...   427   e-117
ref|XP_004487752.1| PREDICTED: PAP-associated domain-containing ...   426   e-116
ref|XP_007149443.1| hypothetical protein PHAVU_005G070800g [Phas...   425   e-116
ref|NP_568798.1| nucleotidyltransferase family protein [Arabidop...
424 e-116 dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana] 420 e-115 ref|XP_007021461.1| Nucleotidyltransferase family protein isofor... 415 e-113 >gb|EYU44940.1| hypothetical protein MIMGU_mgv1a004370mg [Mimulus guttatus] Length = 531 Score = 507 bits (1306), Expect = e-141 Identities = 273/396 (68%), Positives = 300/396 (75%), Gaps = 4/396 (1%) Frame = +1 Query: 4 EQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQA 183 EQESR AAI++VF VI YIWPSA E+FGSF TGLYLPSSDIDVVIL SNVRSPQIGL A Sbjct: 128 EQESRNAAIEAVFGVIKYIWPSAETEVFGSFRTGLYLPSSDIDVVILDSNVRSPQIGLTA 187 Query: 184 LSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXXX 363 LSRALSQ+GIAKK+QVIAKARVPIIKFVEKKSGFAFD+SFDVHNGPKAAEFIKDAV Sbjct: 188 LSRALSQRGIAKKIQVIAKARVPIIKFVEKKSGFAFDVSFDVHNGPKAAEFIKDAVFRWP 247 Query: 364 XXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVNF 543 +FLQQRELNEVYTGGIGSYALLSMLIA+LR ++D AS E NLGVLLVNF Sbjct: 248 PLRPLCLILKIFLQQRELNEVYTGGIGSYALLSMLIALLRAQEDRQASAEHNLGVLLVNF 307 Query: 544 FDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNYY 723 FDMYGCKLNT DVGVSCNG G FF K +KGF+VEGRPSL+AIEDPQAPDNDIGK+SFNYY Sbjct: 308 FDMYGCKLNTSDVGVSCNGGGIFFSKSSKGFAVEGRPSLLAIEDPQAPDNDIGKNSFNYY 367 Query: 724 QARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGAG 903 QARSAFAMA+T LTNAK I LG NRSIL IIRPD+VLLERKGG+NG +T +NLF Sbjct: 368 QARSAFAMAFTILTNAKTIMSLGPNRSILGAIIRPDSVLLERKGGTNGNMTLDNLFPSTA 427 Query: 904 EPLEQLF-SDQQEMYCNWTLN--DVDEEPLPRGNGTPGDNDVTXXXXXXXXXXXXXXXXX 1074 EP++QL DQQE+YCNW LN + +EE LPRGNG GD Sbjct: 428 EPMQQLLDGDQQEIYCNWPLNNEEDEEELLPRGNG--GD---VKSSSGKKRKKAAAASKE 482 Query: 1075 XXXHDTSGVDRHENGSG-KEPSAKKRSSRSRHYQNG 1179 V ENGS KE S+KKR S+ + G Sbjct: 483 NNTTPVKKVKARENGSAVKEGSSKKRRSKKHRHSGG 518 >ref|XP_006362956.1| PREDICTED: PAP-associated domain-containing protein 5-like [Solanum tuberosum] Length = 521 Score = 493 bits (1268), Expect = e-137 Identities = 259/389 (66%), Positives = 297/389 (76%), Gaps = 1/389 (0%) Frame = +1 Query: 1 
EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQ SR AI+ VF+VI YIWP+ E+FGSF+TGLYLP+SD+D+VILGS +RSPQIGLQ Sbjct: 134 EEQASRNEAIECVFNVIKYIWPNCKPEVFGSFKTGLYLPTSDVDLVILGSEIRSPQIGLQ 193 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKG+AKK+QVI+KARVPIIKFVEKKSG +FDISFDV NGPKAAEFIKDA+S Sbjct: 194 ALSRALSQKGVAKKIQVISKARVPIIKFVEKKSGISFDISFDVENGPKAAEFIKDAMSSW 253 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVYTGGIGSYALL MLIAML+ ++ AS E NLG+LLVN Sbjct: 254 PPLRPLCLILKVFLQQRELNEVYTGGIGSYALLVMLIAMLQNHRNGQASAEENLGILLVN 313 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD+YG KLNT DVGVSCNGEGTFF K KGFS++G+ SLI+IEDPQ P+NDIGKSSFNY Sbjct: 314 FFDIYGRKLNTSDVGVSCNGEGTFFLKSRKGFSIKGKQSLISIEDPQTPENDIGKSSFNY 373 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAF+MA+TTLTNAK I LGSN+SIL IIRPD VL+ERKGGSNG++TFNNL GA Sbjct: 374 FQVRSAFSMAFTTLTNAKAIFALGSNKSILGTIIRPDEVLVERKGGSNGEVTFNNLLPGA 433 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDNDVTXXXXXXXXXXXXXXXXXXX 1080 GE L+Q + DQQE+YCNW LND DEE LPRGNG D D Sbjct: 434 GEGLQQ-YGDQQEIYCNWQLND-DEEALPRGNGIAEDGDAQSSGKKRKSSKDKQPAKKVK 491 Query: 1081 XH-DTSGVDRHENGSGKEPSAKKRSSRSR 1164 + +S V EN S KE S+KK +R Sbjct: 492 ENGHSSSVRDEENSSRKEKSSKKHWKHNR 520 >ref|XP_004248266.1| PREDICTED: PAP-associated domain-containing protein 5-like [Solanum lycopersicum] Length = 521 Score = 477 bits (1228), Expect = e-132 Identities = 253/390 (64%), Positives = 295/390 (75%), Gaps = 2/390 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQ SR A++ VF+VI YIWP+ E+FGSF+TGLYLP+SD+D+VILGS +RSPQIGLQ Sbjct: 134 EEQASRNEAVECVFNVIKYIWPNCKPEVFGSFKTGLYLPTSDVDLVILGSEIRSPQIGLQ 193 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKG+AKK+QVI+KARVPIIKFVEKKSG +FDISFDV NGPKAA+FIKDA+S Sbjct: 194 ALSRALSQKGVAKKIQVISKARVPIIKFVEKKSGISFDISFDVENGPKAADFIKDAMSSW 253 
Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVYTGGIGSYALL MLIAML+ ++ AS E NLG+LLVN Sbjct: 254 PPLRPLCLILKVFLQQRELNEVYTGGIGSYALLVMLIAMLQNHRNGQASVEENLGILLVN 313 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD+YG KLNT DVGVSCNGE TFF K KGFS++G+ SLI+IEDPQ P+NDIGKSSFNY Sbjct: 314 FFDIYGRKLNTSDVGVSCNGEATFFLKSCKGFSIKGKQSLISIEDPQTPENDIGKSSFNY 373 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAF+MA+TTLTNAK I LG NRSIL IIRPD VL+ERKGGSNG++TF NL GA Sbjct: 374 FQVRSAFSMAFTTLTNAKAIFALGPNRSILGTIIRPDEVLVERKGGSNGEVTFTNLLPGA 433 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTP--GDNDVTXXXXXXXXXXXXXXXXX 1074 GE L+Q + DQQE+YCNW LND +EE LPRGNG G + + Sbjct: 434 GEGLQQ-YGDQQEIYCNWQLND-NEEALPRGNGIAENGGAESSGKKRKSSKDKQPAKKVK 491 Query: 1075 XXXHDTSGVDRHENGSGKEPSAKKRSSRSR 1164 H +S + EN S KE S+KK +R Sbjct: 492 ENGH-SSHIRDEENSSRKEKSSKKHWKHNR 520 >ref|XP_006451882.1| hypothetical protein CICLE_v10008024mg [Citrus clementina] gi|557555108|gb|ESR65122.1| hypothetical protein CICLE_v10008024mg [Citrus clementina] Length = 516 Score = 461 bits (1185), Expect = e-127 Identities = 243/395 (61%), Positives = 289/395 (73%), Gaps = 2/395 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EE+E R A+++VFDVI YIWP E+FGSF TGLYLP+SDIDVVI+ S + +P GLQ Sbjct: 126 EEREVRNTAVEAVFDVIKYIWPKCKPEVFGSFRTGLYLPTSDIDVVIMESGIHNPATGLQ 185 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRAL Q+GIAKK+QVIAKARVPI+KFVEKKSG +FDISFD NGPKAAEFIKDA++ Sbjct: 186 ALSRALLQRGIAKKIQVIAKARVPIVKFVEKKSGVSFDISFDAQNGPKAAEFIKDALAKC 245 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GGIGSYALL+M++A+L++ + ASPE NLG+LLVN Sbjct: 246 PPLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMIMAVLKSLYECRASPEHNLGILLVN 305 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNT DVGVSC G G+FF+K +KGF+ +GRP LIAIEDPQAPDNDIGK+SFNY Sbjct: 306 
FFDFYGRKLNTTDVGVSCKGAGSFFKKSSKGFTNKGRPFLIAIEDPQAPDNDIGKNSFNY 365 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q +SAFAMA+TTLTN K I LG NRSIL IIRPD VLLERKGGSNG++TFNNL GA Sbjct: 366 FQIKSAFAMAFTTLTNPKTILSLGPNRSILGTIIRPDPVLLERKGGSNGEITFNNLLPGA 425 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDNDVTXXXXXXXXXXXXXXXXXXX 1080 GEPL+ F DQ+E+ CNW +D +EE PRGNG+ V Sbjct: 426 GEPLQTHFGDQREIMCNWQ-SDYEEESFPRGNGS-----VQSSGKKRKAFSKEKSTSKKK 479 Query: 1081 XHDT-SGVDRHENGSGKEPSAKKRSSR-SRHYQNG 1179 +T R E GS KE S KK+ R ++ + NG Sbjct: 480 TEETGESKSREEGGSKKEKSGKKKRWRQNQGHANG 514 >ref|XP_006370267.1| hypothetical protein POPTR_0001s41140g [Populus trichocarpa] gi|550349446|gb|ERP66836.1| hypothetical protein POPTR_0001s41140g [Populus trichocarpa] Length = 543 Score = 456 bits (1174), Expect = e-126 Identities = 241/385 (62%), Positives = 283/385 (73%), Gaps = 2/385 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQ SRA A++ VFDVI YIWP+ E+FGSF TGLYLP+SDIDVVILGS ++SPQIGL Sbjct: 141 EEQASRAEAVRCVFDVIKYIWPNCKVEVFGSFRTGLYLPTSDIDVVILGSGLKSPQIGLN 200 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKG+AKK+QVIA+ARVPI+KFVEK+SG +FDISFDV+ GP AAEFIK+A+S Sbjct: 201 ALSRALSQKGVAKKIQVIARARVPIVKFVEKRSGVSFDISFDVNGGPIAAEFIKNAISKW 260 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GGI SYALL+ML+AML+ ++ AS ERNLG+LL++ Sbjct: 261 PELRPLCLILKVFLQQRELNEVYSGGISSYALLAMLMAMLQNHRECQASLERNLGLLLIH 320 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNT +VGVSC G GTFF K KGF GRP LIAIEDPQAP+NDIGK+SFNY Sbjct: 321 FFDFYGRKLNTTNVGVSCKGTGTFFSKRTKGFMNNGRPFLIAIEDPQAPENDIGKNSFNY 380 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAFAMA+TTLTN K I LG NRSIL IIRPD VLLERKGG NG++TF++L GA Sbjct: 381 FQIRSAFAMAFTTLTNPKTILSLGPNRSILGTIIRPDPVLLERKGGKNGEVTFSSLLPGA 440 Query: 901 
GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDNDV--TXXXXXXXXXXXXXXXXX 1074 GEPL+ + QQE+ CNW L+D +EE LPRG G GD + Sbjct: 441 GEPLQSNYG-QQEILCNWQLDD-EEEALPRGGGDAGDGSAHSSGKKRKASSKEKSRKKKS 498 Query: 1075 XXXHDTSGVDRHENGSGKEPSAKKR 1149 D V E+GS KE S KK+ Sbjct: 499 KENGDIGKVRHDESGSKKEKSTKKK 523 >ref|XP_007021458.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] gi|508721086|gb|EOY12983.1| Nucleotidyltransferase family protein isoform 1 [Theobroma cacao] Length = 540 Score = 451 bits (1160), Expect = e-124 Identities = 232/338 (68%), Positives = 271/338 (80%), Gaps = 1/338 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQ +R AA+ SVFDVI YIWP+ E+FGSF TGLYLP+SDIDVVILGS +++PQ GL Sbjct: 137 EEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILGSGIKNPQTGLH 196 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKGIAKKMQVIAKARVPI+KFVEKKS AFDISFDV NGPKAA+FIK+AV Sbjct: 197 ALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKAADFIKEAVLKW 256 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQR+LNEVY+GGIGSYALL+ML+AML++ ++ A E NLG+LLV+ Sbjct: 257 PQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQSLHESQAYQEHNLGILLVH 316 Query: 541 FFDMYGCKLNTVDVGVSCNGE-GTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFN 717 FFD YG KLNT DVGVSCNG GTFF K ++GFS +GRP LI+IEDPQAPDNDIGK+SFN Sbjct: 317 FFDFYGRKLNTADVGVSCNGRGGTFFLKSSRGFSNKGRPFLISIEDPQAPDNDIGKNSFN 376 Query: 718 YYQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHG 897 + Q RSAF MA +TLTN K I LG NRSIL IIRPD VLLERKGGS+G +TF++L G Sbjct: 377 FIQIRSAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGSSGGVTFSSLLPG 436 Query: 898 AGEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGD 1011 AGEPL+ L+ +QQ++ CNW L+ DEEPLPRG+G D Sbjct: 437 AGEPLQPLYGEQQDILCNWQLD--DEEPLPRGDGIDVD 472 >ref|XP_002282332.2| PREDICTED: PAP-associated domain-containing protein 5-like [Vitis vinifera] gi|302143015|emb|CBI20310.3| unnamed protein product [Vitis vinifera] Length = 497 Score = 450 bits (1158), Expect = 
e-124 Identities = 229/337 (67%), Positives = 271/337 (80%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 +EQ +R AAI+SVF+VI YIWP+ E+FGSF+TGLYLP+SDIDVVILGS++++PQIGL Sbjct: 121 KEQSARNAAIESVFNVIRYIWPNCKVEVFGSFKTGLYLPTSDIDVVILGSDIKTPQIGLY 180 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKGIAKK+QVIAKARVPIIKF+EK+S AFDISFDV NGPKAAE+I+DA+S Sbjct: 181 ALSRALSQKGIAKKIQVIAKARVPIIKFIEKRSSVAFDISFDVENGPKAAEYIQDAISKW 240 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GGIGSYALL+MLIAML+ Q+ +AS E NLGVLLVN Sbjct: 241 PPLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAMLQNLQEWNASVEHNLGVLLVN 300 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNTVD+GV+CNG GTFF K KGF +G+ LI+IEDPQ P NDIGK+SFNY Sbjct: 301 FFDFYGRKLNTVDIGVTCNGPGTFFLKSTKGFVNKGQKFLISIEDPQLPGNDIGKNSFNY 360 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAF+MA++TLTNA+ I L NRSIL IIRPD +LLERKGGSNG +TF++L GA Sbjct: 361 FQIRSAFSMAFSTLTNARTILGLDPNRSILGTIIRPDPILLERKGGSNGTMTFDHLLPGA 420 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGD 1011 GEPL + QE+ CNW + D +EEPLPR N GD Sbjct: 421 GEPLSPQ-TGGQELLCNWQVEDAEEEPLPRSNPIAGD 456 >ref|XP_006464744.1| PREDICTED: LOW QUALITY PROTEIN: PAP-associated domain-containing protein 5-like [Citrus sinensis] Length = 516 Score = 448 bits (1152), Expect = e-123 Identities = 239/395 (60%), Positives = 284/395 (71%), Gaps = 2/395 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EE+E R A+++VFDVI YIWP E+FGSF TGLYLP+SDIDVVI+ S + +P GLQ Sbjct: 126 EEREVRNTAVEAVFDVIKYIWPKCKPEVFGSFRTGLYLPTSDIDVVIMESGIHNPATGLQ 185 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRAL Q+GIAKK+QVIAKARVPI+KFVEKKSG +FDISFD NGPKAAEFIKDA++ Sbjct: 186 ALSRALLQRGIAKKIQVIAKARVPIVKFVEKKSGVSFDISFDAQNGPKAAEFIKDALANC 245 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 
VFLQQRELNEVY+GGIGSYALL+M++A+L++ ASPE NLG+LLVN Sbjct: 246 PPLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMIMAVLKSLYKCRASPEHNLGILLVN 305 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KL T DVGVSC G G+FF+K +KGF+ +GRP LIAIEDPQAPDN IGK+SFNY Sbjct: 306 FFDFYGRKLKTTDVGVSCKGAGSFFKKSSKGFTNKGRPFLIAIEDPQAPDNAIGKNSFNY 365 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q +SAFAMA+TTLTN K I L NRSIL IIRPD VLLERKGGSNG++TFN+L GA Sbjct: 366 FQIKSAFAMAFTTLTNPKTILSLXPNRSILGTIIRPDPVLLERKGGSNGEITFNSLLPGA 425 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDNDVTXXXXXXXXXXXXXXXXXXX 1080 GEPL+ F DQ+E+ CNW +D +EE PRGNG+ V Sbjct: 426 GEPLKTHFGDQREIMCNWQ-SDYEEESFPRGNGS-----VQSCGKRRKAFSKEKSTSKKK 479 Query: 1081 XHDTSGVDRH-ENGSGKEPSAKKRSSR-SRHYQNG 1179 + H E GS KE S KK+ R +R + NG Sbjct: 480 TEEIGESKSHEEGGSKKEKSGKKKCWRQNRGHANG 514 >ref|XP_007021459.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] gi|508721087|gb|EOY12984.1| Nucleotidyltransferase family protein isoform 2 [Theobroma cacao] Length = 541 Score = 446 bits (1148), Expect = e-123 Identities = 232/339 (68%), Positives = 271/339 (79%), Gaps = 2/339 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQ +R AA+ SVFDVI YIWP+ E+FGSF TGLYLP+SDIDVVILGS +++PQ GL Sbjct: 137 EEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILGSGIKNPQTGLH 196 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKGIAKKMQVIAKARVPI+KFVEKKS AFDISFDV NGPKAA+FIK+AV Sbjct: 197 ALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKAADFIKEAVLKW 256 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAML-RTRQDNHASPERNLGVLLV 537 VFLQQR+LNEVY+GGIGSYALL+ML+AML ++ ++ A E NLG+LLV Sbjct: 257 PQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQQSLHESQAYQEHNLGILLV 316 Query: 538 NFFDMYGCKLNTVDVGVSCNGE-GTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSF 714 +FFD YG KLNT DVGVSCNG GTFF K ++GFS +GRP LI+IEDPQAPDNDIGK+SF Sbjct: 317 
HFFDFYGRKLNTADVGVSCNGRGGTFFLKSSRGFSNKGRPFLISIEDPQAPDNDIGKNSF 376 Query: 715 NYYQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFH 894 N+ Q RSAF MA +TLTN K I LG NRSIL IIRPD VLLERKGGS+G +TF++L Sbjct: 377 NFIQIRSAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGSSGGVTFSSLLP 436 Query: 895 GAGEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGD 1011 GAGEPL+ L+ +QQ++ CNW L+ DEEPLPRG+G D Sbjct: 437 GAGEPLQPLYGEQQDILCNWQLD--DEEPLPRGDGIDVD 473 >ref|XP_004141224.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cucumis sativus] Length = 544 Score = 446 bits (1146), Expect = e-122 Identities = 239/388 (61%), Positives = 283/388 (72%), Gaps = 2/388 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EE+ +R +A++ VF V+ +IWP E+FGSF+TGLYLP+SDIDVVILGS + PQ+GLQ Sbjct: 142 EERVARDSAVERVFSVVKHIWPHCKVEVFGSFQTGLYLPTSDIDVVILGSGIPKPQLGLQ 201 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKGIAKK+QVI KARVPIIKF+EK+SG +FDISFDV NGPKAA+FIK AVS Sbjct: 202 ALSRALSQKGIAKKIQVIGKARVPIIKFIEKQSGISFDISFDVQNGPKAADFIKGAVSKW 261 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GG+GSYALL+ML+AML++ +S E NLGVLLV+ Sbjct: 262 PPLRPLCLILKVFLQQRELNEVYSGGLGSYALLTMLMAMLQSINVPPSSLEHNLGVLLVH 321 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNT DVGVSCN G FF K +GF +GRP L++IEDPQAPDNDIGK+SFNY Sbjct: 322 FFDFYGRKLNTSDVGVSCNAGGIFFSKSYRGFMTKGRPCLLSIEDPQAPDNDIGKNSFNY 381 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAFAMAY+ LTN K + LG NRSIL IIRPD VLL+RKGG +G++TFN+L GA Sbjct: 382 FQIRSAFAMAYSILTNVKTVLGLGPNRSILGTIIRPDPVLLKRKGGRHGEVTFNSLLPGA 441 Query: 901 GEPLEQ-LFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDNDVTXXXXXXXXXXXXXXXXXX 1077 GEP++Q + D QEM CNW DEEPLPRGN TP +N T Sbjct: 442 GEPVQQPEYGDDQEMLCNWQFG--DEEPLPRGNDTP-ENVGTPSSKKQRKTREKSRKKEK 498 Query: 1078 XXHDTSGVDRHE-NGSGKEPSAKKRSSR 1158 H S RHE NGS KE S+KK+ R Sbjct: 499 ESH--SSKRRHEDNGSRKEQSSKKKRLR 524 
>dbj|BAE71308.1| hypothetical protein [Trifolium pratense] Length = 518 Score = 435 bits (1119), Expect = e-119 Identities = 221/333 (66%), Positives = 266/333 (79%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EE+ R AAI+SVF+VI +IWP EIFGSF TGLYLP+SDIDVVIL S + +PQIGL Sbjct: 127 EEKAKRDAAIESVFEVIKHIWPHCQVEIFGSFRTGLYLPTSDIDVVILKSGLPNPQIGLN 186 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 A+SR+LSQ+ +AKK+QVI KARVPIIKFVEKKSG +FDISFD+ NGPKAAE+I++AV+ Sbjct: 187 AISRSLSQRSMAKKIQVIGKARVPIIKFVEKKSGLSFDISFDIDNGPKAAEYIQEAVAKW 246 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GGIGSYALL+ML+AMLR + + + E NLGVLLV+ Sbjct: 247 PQLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMLMAMLRNVRQSQPTAEHNLGVLLVH 306 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNT DVGVSC GEGTFF+K ++GF + RP L+ I+DPQ PDNDIGK+SFNY Sbjct: 307 FFDFYGRKLNTSDVGVSCIGEGTFFRKSSRGFYNKTRPFLLGIQDPQTPDNDIGKNSFNY 366 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAF MA+TTLTN KVI LG NRSIL IIRPD VL+ERKGGSNG++TFN+L GA Sbjct: 367 FQVRSAFLMAFTTLTNPKVILSLGPNRSILGTIIRPDPVLMERKGGSNGEMTFNSLLPGA 426 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNG 999 GEP++Q + + +M CNW L D +EEPLPRG+G Sbjct: 427 GEPIQQQYG-EHDMLCNWQL-DFEEEPLPRGDG 457 >ref|XP_007211537.1| hypothetical protein PRUPE_ppa003914mg [Prunus persica] gi|462407402|gb|EMJ12736.1| hypothetical protein PRUPE_ppa003914mg [Prunus persica] Length = 540 Score = 433 bits (1114), Expect = e-119 Identities = 236/406 (58%), Positives = 283/406 (69%), Gaps = 12/406 (2%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQE+R +A++ V VI YIWP E+FGSF+TGLYLP+SDIDVVI+ S + +PQ GLQ Sbjct: 137 EEQEARTSAVERVSQVIKYIWPRCKVEVFGSFKTGLYLPASDIDVVIMRSGIPTPQQGLQ 196 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQ G+AKK+QVI KAR+PIIKFVEK SG AFDISFD+ +GPKAA+FI+DAVS Sbjct: 197 
ALSRALSQMGLAKKIQVIGKARIPIIKFVEKTSGIAFDISFDIESGPKAADFIQDAVSKW 256 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GG+GSYALL+ML+AML + ++ AS E+NLGVLLVN Sbjct: 257 PPLRPLCLILKVFLQQRELNEVYSGGLGSYALLTMLMAMLHSHRECQASSEQNLGVLLVN 316 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNT DVGVSC G GTFF+K KGF +GRP LIAIEDPQAP+ND+GK+SFNY Sbjct: 317 FFDFYGRKLNTSDVGVSCKGAGTFFKKSVKGFITKGRPFLIAIEDPQAPENDVGKNSFNY 376 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAF+MAYTTLTN KVI LG NRSIL IIRPD L+ERKGG G + F++L GA Sbjct: 377 FQIRSAFSMAYTTLTNPKVILCLGPNRSILGTIIRPDPTLVERKGGP-GLVAFDSLLPGA 435 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDNDVTXXXXXXXXXXXXXXXXXXX 1080 G+PL QL D QE CNW L+D D++PLPRG+ + G Sbjct: 436 GKPL-QLEHDGQEFMCNWQLDD-DDDPLPRGDDSAGGGSGRSSGRKRKASFKEKSGKKGK 493 Query: 1081 XHDTSGVDRHENGSGKEPSAK------------KRSSRSRHYQNGA 1182 + G ENGS KE + + K+ R RH Q+ A Sbjct: 494 ENGEVGRRNVENGSKKEKARRDENSSRKGKGKMKKIRRRRHSQDNA 539 >ref|XP_002524282.1| nucleic acid binding protein, putative [Ricinus communis] gi|223536473|gb|EEF38121.1| nucleic acid binding protein, putative [Ricinus communis] Length = 526 Score = 433 bits (1113), Expect = e-119 Identities = 228/338 (67%), Positives = 264/338 (78%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EE+++R A+K VFDVI YIWP+ E+FGS++TGLYLP+SDIDVVI S +++PQIGLQ Sbjct: 141 EEEDARNTAVKCVFDVIKYIWPNCKVEVFGSYKTGLYLPTSDIDVVIFRSGIKNPQIGLQ 200 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKGIAKK+QVIAKARVPI+KFVEK+SG +FDISFDV NGPKAAEFIKDAV Sbjct: 201 ALSRALSQKGIAKKIQVIAKARVPIVKFVEKRSGVSFDISFDVDNGPKAAEFIKDAVRKW 260 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GGIGSYALL+ML+A+L+ AS E NLGVLLV Sbjct: 261 PALRPLSLILKVFLQQRELNEVYSGGIGSYALLTMLMAVLK------ASSEHNLGVLLVY 314 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 
FFD YG KLNT DVGVSC G GTFF K KGF +GRP LIAIEDPQAPDNDIGK+SFNY Sbjct: 315 FFDFYGRKLNTTDVGVSCKGAGTFFSKRKKGFMNKGRPFLIAIEDPQAPDNDIGKNSFNY 374 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 Q RSAF+MA++TLTN + I LG NRSIL IIRPD++LLERK G NG++TF++L GA Sbjct: 375 SQIRSAFSMAFSTLTNPRTILSLGPNRSILGTIIRPDSILLERKAGCNGEVTFSSLLPGA 434 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGDN 1014 GE L Q D QE+ NW L+D DEE LPRG G D+ Sbjct: 435 GE-LIQSHYDHQEILGNWQLDD-DEEVLPRGGGIAEDS 470 >ref|XP_006280286.1| hypothetical protein CARUB_v10026211mg [Capsella rubella] gi|482548990|gb|EOA13184.1| hypothetical protein CARUB_v10026211mg [Capsella rubella] Length = 533 Score = 428 bits (1100), Expect = e-117 Identities = 221/336 (65%), Positives = 254/336 (75%) Frame = +1 Query: 4 EQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQA 183 E+ R AA++SV VI YIWPS EIFGS+ TGLYLP+SDIDVVIL S + +PQ+GL+A Sbjct: 142 EKAERDAAVESVSSVITYIWPSCKVEIFGSYRTGLYLPTSDIDVVILESGLTNPQLGLRA 201 Query: 184 LSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXXX 363 LSRALSQ+GIAK +QVIAKARVPIIKFVEKKS AFD+SFD+ NGPKAAEFI+DAVS Sbjct: 202 LSRALSQRGIAKNIQVIAKARVPIIKFVEKKSNIAFDLSFDMDNGPKAAEFIQDAVSKLP 261 Query: 364 XXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVNF 543 VFLQQRELNEVY+GGIGSYALL+MLIA L+ +D ++PE NLGVLLV F Sbjct: 262 PLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKYLKDGRSAPEHNLGVLLVKF 321 Query: 544 FDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNYY 723 FD YG KLNT DVGVSC G+FF K NKGF RP LI+IEDPQ PDNDIGKSSFNY+ Sbjct: 322 FDFYGRKLNTSDVGVSCKKGGSFFSKSNKGFLNMARPGLISIEDPQTPDNDIGKSSFNYF 381 Query: 724 QARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGAG 903 Q RSAF+MA +TLTN KVI LG NRSIL IIRPD +L ERKGG NG +TF++L GAG Sbjct: 382 QIRSAFSMALSTLTNTKVIPALGPNRSILGTIIRPDRILSERKGGKNGDVTFSSLLPGAG 441 Query: 904 EPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGD 1011 EPL ++CNW L + +E PRGN T GD Sbjct: 442 EPLPSDGKSNGGLFCNWELEEDEEGSFPRGNLTNGD 477 >ref|XP_002864273.1| hypothetical protein ARALYDRAFT_495454 
[Arabidopsis lyrata subsp. lyrata] gi|297310108|gb|EFH40532.1| hypothetical protein ARALYDRAFT_495454 [Arabidopsis lyrata subsp. lyrata] Length = 530 Score = 427 bits (1097), Expect = e-117 Identities = 218/338 (64%), Positives = 256/338 (75%), Gaps = 2/338 (0%) Frame = +1 Query: 4 EQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQA 183 E+ R AA++SV VI YIWPS E+FGS++TGLYLP+SDIDVVIL S + +PQ+GL+A Sbjct: 138 EKAERDAAVESVSSVITYIWPSCKVEVFGSYKTGLYLPTSDIDVVILESGLTNPQLGLRA 197 Query: 184 LSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXXX 363 LSRALSQ+GIAK + VIAKARVPIIKFVEKKS AFD+SFD+ NGPKAAEFI+DAVS Sbjct: 198 LSRALSQRGIAKNLVVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQDAVSKLP 257 Query: 364 XXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVNF 543 VFLQQRELNEVY+GGIGSYALL+MLIA L+ +D ++PE NLGVLLV F Sbjct: 258 PLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKYLKDGRSAPEHNLGVLLVKF 317 Query: 544 FDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNYY 723 FD YG KLNT DVGVSC G+FF KY+KGF RP LI+IEDPQ P+NDIGKSSFNY+ Sbjct: 318 FDFYGRKLNTADVGVSCKTGGSFFSKYDKGFLNRARPGLISIEDPQTPENDIGKSSFNYF 377 Query: 724 QARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGAG 903 Q RSAFAMA +TLTN K I LG NRSIL IIRPD +L ERKGG NG +TFN+L GAG Sbjct: 378 QIRSAFAMALSTLTNTKAILSLGPNRSILGTIIRPDRILSERKGGKNGDITFNSLLPGAG 437 Query: 904 EPLEQLFSDQQE--MYCNWTLNDVDEEPLPRGNGTPGD 1011 EPL + + ++CNW L + +E PRG+ T GD Sbjct: 438 EPLPMASNSKTNGGLFCNWELEEDEEGSFPRGSTTNGD 475 >ref|XP_004487752.1| PREDICTED: PAP-associated domain-containing protein 5-like [Cicer arietinum] Length = 513 Score = 426 bits (1094), Expect = e-116 Identities = 230/398 (57%), Positives = 277/398 (69%), Gaps = 10/398 (2%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EE+ R AI+SVF VI +IWP E+FGSF TGLYLP+SDIDVVIL S + +PQIGL Sbjct: 131 EEKAKRDTAIESVFAVIKHIWPHCQVEVFGSFRTGLYLPTSDIDVVILRSGLPNPQIGLN 190 Query: 181 ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 A+SRALSQ+ +AKK+QVI KARVPIIKFVEK S 
+FDISFD+ NGPKAAE+I++AV+ Sbjct: 191 AISRALSQRSMAKKIQVIGKARVPIIKFVEKTSSLSFDISFDIENGPKAAEYIQEAVANC 250 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQRELNEVY+GGIGSYALL+ML+A+LR + + S E NLGVLLV+ Sbjct: 251 PPLRPLCLILKVFLQQRELNEVYSGGIGSYALLTMLMAVLRNVRQSQTSAEHNLGVLLVH 310 Query: 541 FFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNY 720 FFD YG KLNT DVGVSCNG GTFF K ++GF + RPSL+ I Q PDNDIGK+SFNY Sbjct: 311 FFDFYGRKLNTSDVGVSCNGAGTFFLKSSRGFYNKARPSLLGIWLNQTPDNDIGKNSFNY 370 Query: 721 YQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGA 900 +Q RSAF MA+TTLTN KVI LG NRSIL IIRPD VL+ERKGGSNG++TFN+L GA Sbjct: 371 FQVRSAFLMAFTTLTNPKVILNLGPNRSILGTIIRPDPVLMERKGGSNGEMTFNSLLPGA 430 Query: 901 GEPLEQLFSDQQEMYCNWTLNDVDEEPLPRG----------NGTPGDNDVTXXXXXXXXX 1050 GEP++Q + +Q+M CNW L D +EEPLPRG NG P +N Sbjct: 431 GEPIQQQYG-EQDMLCNWQL-DFEEEPLPRGDSTRKSASKENGKPKENG----------- 477 Query: 1051 XXXXXXXXXXXHDTSGVDRHENGSGKEPSAKKRSSRSR 1164 D+ V+ +ENGS E K+ + R Sbjct: 478 ------------DSRMVNNNENGSVTENGVHKKHKKKR 503 >ref|XP_007149443.1| hypothetical protein PHAVU_005G070800g [Phaseolus vulgaris] gi|561022707|gb|ESW21437.1| hypothetical protein PHAVU_005G070800g [Phaseolus vulgaris] Length = 522 Score = 425 bits (1092), Expect = e-116 Identities = 220/330 (66%), Positives = 260/330 (78%) Frame = +1 Query: 4 EQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQA 183 E+ R AI+SVF VI +IWP E+FGSF TGLYLP+SDIDVVIL S + +PQIGL A Sbjct: 131 EKAVRDMAIESVFGVIKHIWPHCQVEVFGSFRTGLYLPTSDIDVVILKSGLPNPQIGLNA 190 Query: 184 LSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXXX 363 +S+ALSQ+ +AK++QVI KARVPIIKFVEK SG AFDISFD+ NGPKAAE+I++AV Sbjct: 191 ISKALSQRSMAKRIQVIGKARVPIIKFVEKISGLAFDISFDIDNGPKAAEYIQEAVLKWP 250 Query: 364 XXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVNF 543 VFLQQRELNEVY+GGIGSYALL+ML+AMLR + + AS E NLGVLLV+F Sbjct: 251 PLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLMAMLRNLRLSQASAEHNLGVLLVHF 310 Query: 544 
FDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNYY 723 FD YG KLN+ DVGVSCNG GTFF K +KGF +GRPSLI+IEDPQAP+NDIGK+SFNY+ Sbjct: 311 FDFYGRKLNSSDVGVSCNGTGTFFVKSSKGFLNKGRPSLISIEDPQAPENDIGKNSFNYF 370 Query: 724 QARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGAG 903 Q RSAF+MA+ LTN K+I LG NRSIL IIRPD VLLERKGG NG +TF+ L GAG Sbjct: 371 QIRSAFSMAFKNLTNPKIIMSLGPNRSILGTIIRPDPVLLERKGGLNGDVTFDKLLPGAG 430 Query: 904 EPLEQLFSDQQEMYCNWTLNDVDEEPLPRG 993 EPL+Q + +Q+M CNW L D +EEPLPRG Sbjct: 431 EPLQQQYG-EQDMLCNWQL-DYEEEPLPRG 458 >ref|NP_568798.1| nucleotidyltransferase family protein [Arabidopsis thaliana] gi|27754278|gb|AAO22592.1| unknown protein [Arabidopsis thaliana] gi|332009022|gb|AED96405.1| nucleotidyltransferase family protein [Arabidopsis thaliana] Length = 530 Score = 424 bits (1089), Expect = e-116 Identities = 220/334 (65%), Positives = 254/334 (76%), Gaps = 3/334 (0%) Frame = +1 Query: 4 EQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQA 183 E+ R AA++SV VI YIWPS E+FGS++TGLYLP+SDIDVVIL S + +PQ+GL+A Sbjct: 138 EKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDVVILESGLTNPQLGLRA 197 Query: 184 LSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXXX 363 LSRALSQ+GIAK + VIAKARVPIIKFVEKKS AFD+SFD+ NGPKAAEFI+DAVS Sbjct: 198 LSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQDAVSKLP 257 Query: 364 XXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVNF 543 VFLQQRELNEVY+GGIGSYALL+MLIA L+ +D ++PE NLGVLLV F Sbjct: 258 PLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKYLKDGRSAPEHNLGVLLVKF 317 Query: 544 FDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFNYY 723 FD YG KLNT DVG+SC G+FF KYNKGF RPSLI+IEDPQ P+NDIGKSSFNY+ Sbjct: 318 FDFYGRKLNTADVGISCKMGGSFFSKYNKGFLNRARPSLISIEDPQTPENDIGKSSFNYF 377 Query: 724 QARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHGAG 903 Q RSAFAMA +TLTN K I LG NRSIL IIRPD VL ERKGG NG +TFN+L GAG Sbjct: 378 QIRSAFAMALSTLTNTKAILSLGPNRSILGTIIRPDRVLSERKGGQNGDVTFNSLLPGAG 437 Query: 904 E--PLEQLFSDQQEMYCNWTLNDVDEE-PLPRGN 996 E PLE ++CNW 
L + +EE PRGN Sbjct: 438 EPLPLESNGKTNGGLFCNWELEEEEEEGSFPRGN 471 >dbj|BAB09549.1| unnamed protein product [Arabidopsis thaliana] Length = 533 Score = 420 bits (1080), Expect = e-115 Identities = 220/337 (65%), Positives = 255/337 (75%), Gaps = 6/337 (1%) Frame = +1 Query: 4 EQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQA 183 E+ R AA++SV VI YIWPS E+FGS++TGLYLP+SDIDVVIL S + +PQ+GL+A Sbjct: 138 EKAERDAAVESVSSVIKYIWPSCKVEVFGSYKTGLYLPTSDIDVVILESGLTNPQLGLRA 197 Query: 184 LSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXXX 363 LSRALSQ+GIAK + VIAKARVPIIKFVEKKS AFD+SFD+ NGPKAAEFI+DAVS Sbjct: 198 LSRALSQRGIAKNLLVIAKARVPIIKFVEKKSNIAFDLSFDMENGPKAAEFIQDAVSKLP 257 Query: 364 XXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTR---QDNHASPERNLGVLL 534 VFLQQRELNEVY+GGIGSYALL+MLIA L+ + +D ++PE NLGVLL Sbjct: 258 PLRPLCLILKVFLQQRELNEVYSGGIGSYALLAMLIAFLKVQVYLKDGRSAPEHNLGVLL 317 Query: 535 VNFFDMYGCKLNTVDVGVSCNGEGTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSF 714 V FFD YG KLNT DVG+SC G+FF KYNKGF RPSLI+IEDPQ P+NDIGKSSF Sbjct: 318 VKFFDFYGRKLNTADVGISCKMGGSFFSKYNKGFLNRARPSLISIEDPQTPENDIGKSSF 377 Query: 715 NYYQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFH 894 NY+Q RSAFAMA +TLTN K I LG NRSIL IIRPD VL ERKGG NG +TFN+L Sbjct: 378 NYFQIRSAFAMALSTLTNTKAILSLGPNRSILGTIIRPDRVLSERKGGQNGDVTFNSLLP 437 Query: 895 GAGE--PLEQLFSDQQEMYCNWTLNDVDEE-PLPRGN 996 GAGE PLE ++CNW L + +EE PRGN Sbjct: 438 GAGEPLPLESNGKTNGGLFCNWELEEEEEEGSFPRGN 474 >ref|XP_007021461.1| Nucleotidyltransferase family protein isoform 4 [Theobroma cacao] gi|508721089|gb|EOY12986.1| Nucleotidyltransferase family protein isoform 4 [Theobroma cacao] Length = 525 Score = 415 bits (1066), Expect = e-113 Identities = 220/338 (65%), Positives = 257/338 (76%), Gaps = 1/338 (0%) Frame = +1 Query: 1 EEQESRAAAIKSVFDVINYIWPSAVAEIFGSFETGLYLPSSDIDVVILGSNVRSPQIGLQ 180 EEQ +R AA+ SVFDVI YIWP+ E+FGSF TGLYLP+SDIDVVILGS +++PQ GL Sbjct: 137 EEQAARDAAVDSVFDVIKYIWPACRPEVFGSFRTGLYLPTSDIDVVILGSGIKNPQTGLH 196 Query: 181 
ALSRALSQKGIAKKMQVIAKARVPIIKFVEKKSGFAFDISFDVHNGPKAAEFIKDAVSXX 360 ALSRALSQKGIAKKMQVIAKARVPI+KFVEKKS AFDISFDV NGPKAA+FIK+AV Sbjct: 197 ALSRALSQKGIAKKMQVIAKARVPIVKFVEKKSAVAFDISFDVDNGPKAADFIKEAVLKW 256 Query: 361 XXXXXXXXXXXVFLQQRELNEVYTGGIGSYALLSMLIAMLRTRQDNHASPERNLGVLLVN 540 VFLQQR+LNEVY+GGIGSYALL+ML+AML++ ++ A E NLG+LLV+ Sbjct: 257 PQLRPLCLILKVFLQQRDLNEVYSGGIGSYALLAMLMAMLQSLHESQAYQEHNLGILLVH 316 Query: 541 FFDMYGCKLNTVDVGVSCNGE-GTFFQKYNKGFSVEGRPSLIAIEDPQAPDNDIGKSSFN 717 FFD YG KLNT DVGVSCNG GTFF K ++GFS +GRP LI+IEDP Sbjct: 317 FFDFYGRKLNTADVGVSCNGRGGTFFLKSSRGFSNKGRPFLISIEDP------------- 363 Query: 718 YYQARSAFAMAYTTLTNAKVITRLGSNRSILSVIIRPDAVLLERKGGSNGKLTFNNLFHG 897 Q RSAF MA +TLTN K I LG NRSIL IIRPD VLLERKGGS+G +TF++L G Sbjct: 364 --QIRSAFGMALSTLTNPKAILSLGPNRSILGTIIRPDPVLLERKGGSSGGVTFSSLLPG 421 Query: 898 AGEPLEQLFSDQQEMYCNWTLNDVDEEPLPRGNGTPGD 1011 AGEPL+ L+ +QQ++ CNW L+ DEEPLPRG+G D Sbjct: 422 AGEPLQPLYGEQQDILCNWQLD--DEEPLPRGDGIDVD 457