BLASTX nr result
ID: Catharanthus22_contig00016568
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00016568 (1473 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006838900.1| hypothetical protein AMTR_s00002p00269010 [A...    69   5e-09
ref|WP_002830793.1| hypothetical protein [Pediococcus acidilacti...    63   3e-07
gb|EON66515.1| hypothetical protein W97_05760 [Coniosporium apol...    62   8e-07
gb|ELU14571.1| hypothetical protein CAPTEDRAFT_210994 [Capitella...    61   1e-06
ref|XP_001009685.3| HMG box family protein [Tetrahymena thermop...     60   2e-06
gb|EMP38687.1| Peptidyl-prolyl cis-trans isomerase G [Chelonia m...    59   5e-06
ref|WP_002829279.1| hypothetical protein [Pediococcus acidilacti...    59   7e-06
gb|ELU06812.1| hypothetical protein CAPTEDRAFT_2780 [Capitella t...
58 9e-06 >ref|XP_006838900.1| hypothetical protein AMTR_s00002p00269010 [Amborella trichopoda] gi|548841406|gb|ERN01469.1| hypothetical protein AMTR_s00002p00269010 [Amborella trichopoda] Length = 643 Score = 68.9 bits (167), Expect = 5e-09 Identities = 79/331 (23%), Positives = 143/331 (43%), Gaps = 8/331 (2%) Frame = +1 Query: 238 DLIEGTSEENLGKLIGKHVSKRFAELMYEGVVESRDNASEMFKVVYSNGHSEEMRLEELT 417 D IE S++ ++G+ V K+F Y G V S D+ + F+V+Y +G E++ EEL Sbjct: 270 DHIETESKD----MVGRKVRKKFGRKKYLGDVVSYDSGTHWFRVLYEDGDVEDLEREELK 325 Query: 418 KYLVPHEHHVHSYPNE-------LPNKTNETGEGLALIAPAPPTRRHQRTGTMSSGSRRK 576 + L+P + S N+ K+ T + L P+ P G++ S+ K Sbjct: 326 EILLPLDGDPQSISNKRSPGSDATSRKSFRTKKAKILDLPSTP-------GSLKRSSKSK 378 Query: 577 GAVDGVVQQRQASKRVLSHSTRNNEASRGLQHAIKNNEITEIMELQCSTKNDEKLKMTAH 756 G + + + ST ++ S+ + +I N L+ ST + K + H Sbjct: 379 GNKSSIRKNKGRKS-----STPKSKGSKVSKSSIPKN------MLKSSTPKSKVSKSSTH 427 Query: 757 PDSGTGDGTSGAMKHQRSTQKQASKS-ALSRSTQKQASKSALSRSTRNNETSRGLRRATK 933 + G+ T + ST K K S S+ K+ K+ LS ST + E+ G +RA++ Sbjct: 428 QNKGSKPSTPKNKGSKLSTPKSKGKERKASVSSSKREKKTPLSLSTESTES--GSKRASR 485 Query: 934 NNETTEIMELQHSMRNDEKLKMIARPXXXXXXXXSGMMKLQSSTQRQASKRVVSHSNRNN 1113 ++ +L+ S+ +K K +R S Q T+ + S N Sbjct: 486 KRSLDDVKDLE-SIPVQKKPKDSSR--------KSAFSSRQKGTRLPDPCGIESTLNE-- 534 Query: 1114 GTSRGLQRATKNNETTETMELQRSTMNDEKL 1206 S ++ K +E ++ E +R+ +DE++ Sbjct: 535 --SDEVRGRPKKSEVSQGKERKRARSDDEQM 563 >ref|WP_002830793.1| hypothetical protein [Pediococcus acidilactici] gi|357540561|gb|EHJ24574.1| subtilisin-like serine protease [Pediococcus acidilactici MA18/5M] Length = 3481 Score = 63.2 bits (152), Expect = 3e-07 Identities = 59/284 (20%), Positives = 120/284 (42%), Gaps = 5/284 (1%) Frame = +1 Query: 628 SHSTRNNEASRGLQHAI--KNNEITEIMELQCSTKNDEKLKMTAHPDSGTGDGTSGAMKH 801 S S ++++ SR + + KN+ + + S KND+ + + D D S ++ Sbjct: 666 STSNKHDDDSRSISTSTSDKNDNDSRSISTSTSDKNDDDSRSASTSDKNDDDSRSASISD 725 Query: 802 QRSTQKQASKSALSRSTQKQASKSALSRSTRNNETSRGLRRATKNNETTEIMELQHSMRN 981 + S+SA + S+SA S S 
+N++ SR + KN++ + + S +N Sbjct: 726 KNDDD---SRSASTSDKNDDDSRSA-SISDKNDDDSRSASTSDKNDDDSRSASI--SDKN 779 Query: 982 DEKLKMIARPXXXXXXXXSGMMKLQSSTQRQASKRVVSHSNRNNGTSRGLQRATKNNETT 1161 D+ + + S + S + R S S++N+ SR + KN+ + Sbjct: 780 DDDSRSASTSDKNDDDSRSASI----SDKNDDDSRSASTSDKNDNDSRSASTSDKNDNDS 835 Query: 1162 ETMELQRSTMNDEKLKMIARRNSDTDYGTSGVMKLQSSTQRQASKRAVSLSARNNETSRG 1341 ++ S ND + ++ SD + S + +S + R+ S S +NN S+ Sbjct: 836 RSISTSTSDKNDNDSRSVSTSTSDKNDNDSRSVSASTSDKNDDDSRSRSESDKNNSESKS 895 Query: 1342 LRRATKKKAITEIMEPPRSTRND---EKLKMSALQDSDTDDGTS 1464 ++ +E + +++D + + ++ SD DD S Sbjct: 896 ESDKNDSESKSESDKHDSESKSDSDKHESESRSISQSDKDDSES 939 >gb|EON66515.1| hypothetical protein W97_05760 [Coniosporium apollinis CBS 100218] Length = 535 Score = 61.6 bits (148), Expect = 8e-07 Identities = 68/317 (21%), Positives = 135/317 (42%), Gaps = 9/317 (2%) Frame = +1 Query: 541 RTGTMSSGSRRKGAVDGVVQQRQASKRVLSHSTRNNEASRGLQHAIKNNEITEIMELQCS 720 +T T ++ S K ++K+ S +T++ ++S + +++ T+ + S Sbjct: 117 KTKTKTTSSSTKTTSSSTKTTSSSTKKTDSSTTKDQQSSTTKKP--ESSSSTKNQQSSSS 174 Query: 721 TKNDEKLKMTAHPDSGTGDGTSGAMKHQRSTQKQASKSALSRSTQKQASKSALSRSTRNN 900 TK E T P++ + T K Q S+ + S ST+KQ S S+ ST+ Sbjct: 175 TKKQESSSTTKKPETSSSSST----KKQESSSTTKKPESSSSSTKKQESSSS---STKKQ 227 Query: 901 ETSRGLRRATKNNETTEIMELQHSMRNDEKLKMIARPXXXXXXXXSGMMKLQSSTQRQAS 1080 E+S +TKN +++ + Q S +K + + S K +SS+ Sbjct: 228 ESSS----STKNQQSSSSTKKQESSSTTKKPETSSSSSTKRQESSSTTKKPESSSSTTKK 283 Query: 1081 KRVVSHSNRNNGTSRGLQRATKNNETTETMELQRSTMNDEKLKMIARRNSDTDYGTSGVM 1260 + S + + +S +TK E++ + Q+ + + K + + + +S Sbjct: 284 QESSSSTTKKQESSSS---STKKQESSTSSTKQQESSSSTKKQESSSSTKQPESSSSSTK 340 Query: 1261 KLQSS--TQRQAS----KRAVSLSARNNETSRGLRR---ATKKKAITEIMEPPRSTRNDE 1413 K +SS T++Q S K S S + E+S ++ ++ KK + + P S+ + Sbjct: 341 KQESSSSTKKQESSTTQKPESSSSTKKQESSSSTKKQESSSTKKQESSTTKQPESSSSSS 400 Query: 1414 KLKMSALQDSDTDDGTS 1464 K + S S ++ TS Sbjct: 401 KKEDSTSSTSKKEENTS 417 >gb|ELU14571.1| hypothetical protein CAPTEDRAFT_210994 [Capitella teleta] Length = 372 Score = 
61.2 bits (147), Expect = 1e-06 Identities = 77/330 (23%), Positives = 129/330 (39%), Gaps = 34/330 (10%) Frame = +1 Query: 517 APPTR---RHQRTGTMSSGSRRKGAVDGVVQQRQASKRVLSHST------RNNEASRGLQ 669 AP TR R QRT +SGSR + + +++ R S ST R+ SR + Sbjct: 2 APITRSQTREQRTKKSTSGSRSRSRSRSTGRAKKSRSRSRSRSTQRAKKYRSRSRSRSTR 61 Query: 670 HAIKNNEITEIMELQCSTKNDEKLKMTAHPDSGTGDGTSGAMKHQRSTQKQASKSALSRS 849 A K+ + M ST+ +K + + S S + RSTQ+ + + SRS Sbjct: 62 RAKKSRSRSRSM----STRRAKKSRSRSRSRSTGRAKKSRSRSRSRSTQRAKNYRSRSRS 117 Query: 850 TQKQASKSALSRS----------TRNNETSRGLRRATK-----NNETTEIMELQHSMRND 984 +K + SRS +R+ SR RRA K + +T+ ++ S Sbjct: 118 RSTGRAKKSRSRSRSRSTRRAKKSRSRSRSRSTRRAKKYRSRSRSRSTQRAKMSRSKSRS 177 Query: 985 EKLKMIARPXXXXXXXXSGMMKLQSSTQRQAS----------KRVVSHSNRNNGTSRGLQ 1134 + + +G K S R S R S SR Sbjct: 178 RSTRRAKKSRSRSRRRSTGRAKKSRSRSRSRSIGRAKKSRSRSRSKSTGRAKKSRSRSRS 237 Query: 1135 RATKNNETTETMELQRSTMNDEKLKMIARRNSDTDYGTSGVMKLQSSTQRQASKRAVSLS 1314 R+T + + + +R+T +K + +R S T K +S ++ +++RA Sbjct: 238 RSTGRAKKSRSRSRRRNTRRTKKSRSRSRSRS-----TGRAKKSRSRSRSMSTQRAKKYR 292 Query: 1315 ARNNETSRGLRRATKKKAITEIMEPPRSTR 1404 +R+ SR RRA K ++ + M R+ + Sbjct: 293 SRSR--SRSTRRAKKSRSRSRSMSTQRAKK 320 >ref|XP_001009685.3| HMG box family protein [Tetrahymena thermophila] gi|225565324|gb|EAR89440.3| HMG box protein, putative [Tetrahymena thermophila SB210] Length = 670 Score = 60.5 bits (145), Expect = 2e-06 Identities = 79/364 (21%), Positives = 137/364 (37%), Gaps = 12/364 (3%) Frame = +1 Query: 388 SEEMRLEELTKYLVPHEHHVHSYPNELP-NKTNETGEGLALIAPAPPTR------RHQRT 546 +EE EE T + E H PN K+ T G A A + + + Sbjct: 94 NEEQNKEEETDKVKEMEADSHPTPNRKGIKKSKSTTSGSRSNARASQKKASASKSKDAKK 153 Query: 547 GTMSSGSRRKGAVDGVVQQRQASKRVLSHSTRNNEASRGLQHAIKNNEITEIMELQCSTK 726 T SGS+ K ++ S S RN+ S+ NNE + +K Sbjct: 154 ATNRSGSKNK-------SNNKSRGSSSSSSKRNSSNSKSKDQKKANNE-------RSQSK 199 Query: 727 NDEKLKMTAHPDSGTGDGTSGAMKHQRSTQKQASKSALSRSTQKQASKSALSRS----TR 894 N+ + H D D T M+ ++S+ K + K++ S+ KQ +++ S+S +R Sbjct: 200 
NNRNSRKQNHTDEKHTDNTK-VMEAEKSSSK-SRKNSKSKEPSKQKERNSTSKSKGNQSR 257 Query: 895 NNETSRGLRRATKNNETTEIMELQHSMRNDEKLKMIARPXXXXXXXXSGMMKLQSSTQRQ 1074 +N S+ + + K + ME +++ RN K K R + K + Q Q Sbjct: 258 SNSKSKQVGKVVKMPRKSNKMEAENTSRNSSKSKSRGRNNSSKKRQSNSKSK---TRQEQ 314 Query: 1075 ASKRVVSHSNRNNGTS-RGLQRATKNNETTETMELQRSTMNDEKLKMIARRNSDTDYGTS 1251 + K+ S + T RG + K+ ET ++ S + ++++S G+ Sbjct: 315 SQKKRNQMSVEKSATKPRGRSQVKKSMETEHNVQRSSSKSKSKS----SKKHSKAKEGSK 370 Query: 1252 GVMKLQSSTQRQASKRAVSLSARNNETSRGLRRATKKKAITEIMEPPRSTRNDEKLKMSA 1431 QS + + R +S N + R +K K+ R + K Sbjct: 371 SRKNSQSKQRSSSKSRNISQQKSRNNSKAKSRNNSKSKSRNNSKSKSRRDSKQAQSKSKH 430 Query: 1432 LQDS 1443 DS Sbjct: 431 AGDS 434 >gb|EMP38687.1| Peptidyl-prolyl cis-trans isomerase G [Chelonia mydas] Length = 703 Score = 58.9 bits (141), Expect = 5e-06 Identities = 80/363 (22%), Positives = 140/363 (38%), Gaps = 25/363 (6%) Frame = +1 Query: 448 HSYPNELPNKTNETGEGLALIAPAPPTRRHQRTGTMSSGSRRKGAVDGVVQQRQASKRVL 627 H +E PN+ E + T+ H ++ + +RR D + +A KR Sbjct: 356 HRQVSESPNRRGEKEK---------KTKDH-KSSSKDRETRRNSEKDDKHNKSKAKKRAK 405 Query: 628 SHS-TRNNEASRGLQHAIKNNEITEIMELQCSTKNDEKLKMTAHPDSGTGDGTSGAMKHQ 804 S S +++ E S+ + K+N E S + D + H DS G K + Sbjct: 406 SKSRSKSKEKSKSRERDSKHNRHEEKRVRSRSRERDHERGKDKHYDS------RGRAK-E 458 Query: 805 RSTQKQASKSALSRSTQKQASKS------ALSRSTRNNET-----------------SRG 915 RS K+ KSA S+S ++ SKS A SRS +T +G Sbjct: 459 RSRSKERCKSAGSKSNEQDHSKSKDREKHAKSRSKEREQTKGKHSSNNKARERSRSRDKG 518 Query: 916 LRRATKNNETTEIMELQHSMRNDEKLKMIARPXXXXXXXXSGMMKLQSSTQRQASKRVVS 1095 R +++ + +HS D++ K R K + + +R+ S+ Sbjct: 519 KRARSRSKDRDRSRSKEHSKNEDKEAKRKGRSRSRERKGTPEKYKGKENKRRRDSRSHER 578 Query: 1096 HSNRNNGTSRGLQRATKNNETTETMELQRSTMNDEKLKMIARRNSDTDYGTSGVMKLQSS 1275 +++ + L R ++++ E QR + + N D G+ K S Sbjct: 579 EESQSRNKEKYLNRESRSSHKKNDAESQRKKRSKSRESSSPETNKDKK-GSRDQDKSPDS 637 Query: 1276 TQRQASK-RAVSLSARNNETSRGLRRATKKKAITEIMEPPRSTRNDEKLKMSALQDSDTD 1452 +RQ+SK R S+ + + R++ +K I + + R D K K S D ++ Sbjct: 638 
KRRQSSKDREFKKSSTHRSREKEKTRSSLEKEINQKSKSQERDRADRKDKKS---DHESS 694 Query: 1453 DGT 1461 GT Sbjct: 695 PGT 697 >ref|WP_002829279.1| hypothetical protein [Pediococcus acidilactici] gi|270281388|gb|EFA27220.1| KxYKxGKxW signal domain protein [Pediococcus acidilactici 7_4] Length = 3479 Score = 58.5 bits (140), Expect = 7e-06 Identities = 73/402 (18%), Positives = 161/402 (40%), Gaps = 8/402 (1%) Frame = +1 Query: 283 GKHVSKRFAELMYEGVVESRDNASEMFKVVYSNGHSEEMRLEELTKYLVPHEHHVHSYPN 462 G+HV A+L ++ ++ ++A + + +R ++ T ++ + +H N Sbjct: 511 GEHVDIADADLSWDMREDNWNHAKD---------YPITIRFKDTTGEIIETQVTIHVLKN 561 Query: 463 --ELPNK--TNETGEGLALIAPAPPTRRHQRTGTMSSGSRRKGA-VDGVVQQRQASKRVL 627 E+ K T G+G L G+++SG + VD + + Sbjct: 562 QSEISGKDATYTVGQGPHLTVDDLQPSGRNADGSLASGFEADFSHVDWDTAGKYTVEISF 621 Query: 628 SHSTRNNEASRGLQHAIKNNEITEIMELQCSTKNDEKLKMTAHPDSGTGDGTSGAMKHQR 807 + + + S + + +N + S KND+ + + D + TS + KH Sbjct: 622 TDAVTKGKVSTTVTVTVDDNYGSMSTSASMSNKNDDDSRSASISDKNDSESTSTSNKHDD 681 Query: 808 STQKQASKSALSRSTQKQASKSALSRSTRNNETSRGLRRATKNNETTEIMELQHSMRNDE 987 ++ ++S S + + S S ++++ SR + +T + + +E S +ND+ Sbjct: 682 DSR------SISTSISDKNDSESTSTSNKHDDDSRSISTSTSDKQDSE--STSTSNKNDD 733 Query: 988 KLKMIARPXXXXXXXXSGMMKLQSSTQRQASKRVVSHSNRNNGTSRGLQRATKNNETTET 1167 + I+ S +S + R +S S + S + KN++ + + Sbjct: 734 DSRSISTSTSDKNDSESA----STSNKNDDDSRSISTSISDKDDSESTSTSNKNDDDSRS 789 Query: 1168 MELQRSTMNDEKLKMIARRNSDTDYGTSGVMKLQSSTQRQASKRAVSLSARNNETSRGL- 1344 + S ND + ++ SD + S + +S + R+ S S +NN S+ Sbjct: 790 ISTSTSDKNDNDSRSVSTSTSDKNDNDSRSVSASTSDKNDDDSRSRSESDKNNSESKSES 849 Query: 1345 -RRATKKKAITEIMEPPRSTRNDEKLKMS-ALQDSDTDDGTS 1464 + ++ K+ ++ + + +D+ S ++ SD DD S Sbjct: 850 DKNDSESKSESDKHDSESKSESDKHDSESRSISQSDKDDSES 891 >gb|ELU06812.1| hypothetical protein CAPTEDRAFT_2780 [Capitella teleta] Length = 332 Score = 58.2 bits (139), Expect = 9e-06 Identities = 77/302 (25%), Positives = 114/302 (37%), Gaps = 26/302 (8%) Frame = +1 Query: 541 RTGTMSSGSRRKGAVDGVVQQRQASKRVLSHSTRNNEASRGLQHAIKNNEITEIMELQCS 720 R G S R + + R S+ 
S STR+ RG + K+ S Sbjct: 4 RRGKKSRSRSRNRSTQRAKKSRSKSR---SRSTRSKSTRRGKKSRSKSRSR--------S 52 Query: 721 TKNDEKLKMTAHPDSGTGDGTSGAMKHQRSTQKQASKSALSRSTQKQASKSALSRSTRNN 900 TK ++ + + S S + RST++ + SRS Q +K + SRS Sbjct: 53 TKRGQQSRSRSRGRSTQRGKKSRSRSRNRSTRRSTKPRSRSRSRSTQRAKKSRSRSR--- 109 Query: 901 ETSRGLRRATKNNETTEIMELQHSMRNDEKLKMIARPXXXXXXXXSGMMKLQSSTQRQAS 1080 SR RR TK+ + M S R +K K +R S STQR Sbjct: 110 --SRSSRRGTKSRSRSRSM----SPRRGKKFKNRSRSRSTRRAKKSRSRSRNRSTQRAKK 163 Query: 1081 KRVVSHSN--RNNGTSRGLQ-------RATKNNETTETMELQRSTMNDEKLKMIAR---- 1221 R S S R+ T RG + R+TK + + + RST +K + +R Sbjct: 164 SRSKSRSRSTRSKSTRRGKKSRSKSRSRSTKRGQQSRSRSRGRSTQRGKKSRSRSRSRST 223 Query: 1222 ------RNSDTDYGTSGVMKLQS-----STQRQASKRAV--SLSARNNETSRGLRRATKK 1362 R+ T K +S STQR R+ S S R+ T RG + +K Sbjct: 224 KRGQQSRSRSRGRSTRRAKKSRSRSRNRSTQRAKKSRSKSRSRSTRSKSTRRGKKSRSKS 283 Query: 1363 KA 1368 ++ Sbjct: 284 RS 285