BLASTX nr result
ID: Cephaelis21_contig00003489
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00003489 (3423 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002307797.1| predicted protein [Populus trichocarpa] gi|2... 527 e-147 ref|XP_002510687.1| pentatricopeptide repeat-containing protein,... 504 e-140 ref|XP_002272601.1| PREDICTED: pentatricopeptide repeat-containi... 493 e-136 emb|CAN74746.1| hypothetical protein VITISV_012024 [Vitis vinifera] 492 e-136 ref|XP_003528528.1| PREDICTED: pentatricopeptide repeat-containi... 477 e-131 >ref|XP_002307797.1| predicted protein [Populus trichocarpa] gi|222857246|gb|EEE94793.1| predicted protein [Populus trichocarpa] Length = 475 Score = 527 bits (1358), Expect = e-147 Identities = 275/476 (57%), Positives = 343/476 (72%) Frame = +2 Query: 629 SDTRVYHRKPPKNLRFPRRTKLPLDPCLTGVSTVSRPFANDIIRDDDNFDSLTAVDDDDV 808 S TR+ +RK PKN+R+PRR+KLP D GV+ + D ++D DD+ Sbjct: 4 SSTRICYRKIPKNIRYPRRSKLPPD---FGVNLFLKKPQTDSVQDHS----------DDL 50 Query: 809 NGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXXXXXXXX 988 + EEE +VN NGEIVWE +E+EAISSLF+GRIPQKPG Sbjct: 51 TEE----EEEEEIEVN---NGEIVWESEEIEAISSLFRGRIPQKPGKLGRERPLPLPVPY 103 Query: 989 XXXXXGFPTQKRMLRKSVAIARQSVSSQLYKNPTFLGGLAREIKALPPGERSVSLVLNRW 1168 G P K+ + K V+++R S+SSQ+YKNP+FL GLA+EIK L P ++ VS+VL+ Sbjct: 104 KLRPLGLPAPKKHVNKQVSLSRASISSQIYKNPSFLIGLAKEIKRLSP-DQDVSVVLDNC 162 Query: 1169 ARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEVLARSRELRL 1348 +R+L KGSLS+TIRELGH+GLPE AL FCW QKQP L+PDDR+LASTVEVLAR+ +L++ Sbjct: 163 SRYLHKGSLSLTIRELGHLGLPERALQTFCWVQKQPRLFPDDRVLASTVEVLARNHDLKV 222 Query: 1349 PFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSGVYVKLILEL 1528 PF L ++F +L S+ V EA+VKG I+GGSL + KL+S A+D KR++D VY K+ILEL Sbjct: 223 PFNL--EKFTNLASRRVIEAMVKGLIRGGSLKLSWKLISVAKDGKRMLDPSVYAKIILEL 280 Query: 1529 GKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYDWFKKSGGVP 1708 GKNPDK A REDLNL+ QDCTAVMKVCI+LGKFE VE ++WF++SG P Sbjct: 281 GKNPDKHVLAEALLDELAEREDLNLSQQDCTAVMKVCIKLGKFEAVESLFNWFRQSGHEP 340 Query: 1709 SVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALTDLPRAVRYF 1888 SVVMYTT+IHSRYSE KYREALAVVWEME S+CLFD AYRV IKLFVAL DLPRAVRYF Sbjct: 341 SVVMYTTLIHSRYSESKYREALAVVWEMEGSDCLFDLTAYRVVIKLFVALNDLPRAVRYF 400 Query: 1889 SKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRVQLFEL 2056 SKLKEAG SPT+D+Y +I +Y+ SGR+AKCKEVW EAEMAGFK +++ L +L Sbjct: 401 SKLKEAGLSPTYDIYRNLITLYMVSGRLAKCKEVWKEAEMAGFKFSKEMAAGLLQL 456 >ref|XP_002510687.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223551388|gb|EEF52874.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 469 Score = 504 bits (1298), Expect = e-140 Identities = 274/479 (57%), Positives = 336/479 (70%), Gaps = 1/479 (0%) Frame = +2 Query: 629 SDTRVYHRKP-PKNLRFPRRTKLPLDPCLTGVSTVSRPFANDIIRDDDNFDSLTAVDDDD 805 S+T+ Y+RK PKNL+ PRR+KLP D GV+ + + D DS VDD Sbjct: 4 SNTKTYYRKKLPKNLQSPRRSKLPPD---FGVNLFLKKPTTGV--DPFQMDSFL-VDD-- 55 Query: 806 VNGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXXXXXXX 985 G +L +EE +NG+IVWE DE+EAISSLF+GRIPQ+PGN Sbjct: 56 --GDGDLNQEEKE------QNGDIVWESDEIEAISSLFQGRIPQRPGNLNRERPLPLPLP 107 Query: 986 XXXXXXGFPTQKRMLRKSVAIARQSVSSQLYKNPTFLGGLAREIKALPPGERSVSLVLNR 1165 G P+ K+ R V R + +++YKNP+FL LA++IK L P + VS VL+ Sbjct: 108 HKLRPLGPPSPKKHNRNVVVSLRSPICNKVYKNPSFLISLAKQIKCLNPDD-DVSAVLDD 166 Query: 1166 WARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEVLARSRELR 1345 ARFLRKGSLS+TIRELGHMG P+ AL FCWAQKQP LYPDDRILASTVE+LAR+++L+ Sbjct: 167 CARFLRKGSLSLTIRELGHMGFPDRALQTFCWAQKQPQLYPDDRILASTVEILARNQDLK 226 Query: 1346 LPFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSGVYVKLILE 1525 +P +F SL S+ V EA+++GF+KGG L + KLL+ A+ KR++D+ +Y +LILE Sbjct: 227 VPIDWQ--KFTSLASRGVIEAMIRGFLKGGRLKLAWKLLAVAKHDKRMLDASLYARLILE 284 Query: 1526 LGKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYDWFKKSGGV 1705 LGKNPDK REDLNL+ QDCTA+MKVCIRL KFE VE + WFK+SG Sbjct: 285 LGKNPDKYMLVEQLLDELGEREDLNLSHQDCTAIMKVCIRLQKFEFVECLFTWFKQSGHE 344 Query: 1706 PSVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALTDLPRAVRY 1885 PSVVMYTT+IHSRYSEKKYREALA VWEME S+ LFD PAYRV IKLFVAL DLPRAVRY Sbjct: 345 PSVVMYTTLIHSRYSEKKYREALAGVWEMEGSSFLFDLPAYRVVIKLFVALNDLPRAVRY 404 Query: 1886 FSKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRVQLFELGK 2062 FSKLKEAG SPT+D+Y +IKIYL SGR+AKCKE+W EAEMAGFKLD+QI++ L K Sbjct: 405 FSKLKEAGLSPTYDIYRNLIKIYLVSGRLAKCKEIWKEAEMAGFKLDEQIKMDLLHQEK 463 >ref|XP_002272601.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01860-like [Vitis vinifera] Length = 514 Score = 493 bits (1270), Expect = e-136 Identities = 271/487 (55%), Positives = 330/487 (67%), Gaps = 10/487 (2%) Frame = +2 Query: 632 DTRVYHRKPPKNLRFPRRTKLPLDPCLT-----GVSTVSRPFANDII--RDDDNFDSLTA 790 + RV HRKP KNL PRR KLP +P ++ G S + ++ D N D Sbjct: 46 NARVNHRKPTKNLPHPRRAKLPPEPEISTFLKGGNSGTEQSEMGTVLDKEPDANDDGFLV 105 Query: 791 VDDDDVNGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXX 970 D + G+KE GEIVW+ DE+EAISSLF GRIPQKPG Sbjct: 106 ---DGIEGRKE---------------GEIVWDSDEIEAISSLFMGRIPQKPGKLNRERPL 147 Query: 971 XXXXXXXXXXXGFPTQKRMLRKSVAI---ARQSVSSQLYKNPTFLGGLAREIKALPPGER 1141 G PT KR +R + ++ +R S+S Q+YKNP FL +AREI+ LP E Sbjct: 148 PLPLPYKLRPMGLPTTKRHVRAASSMPYASRASLSKQVYKNPDFLISIAREIRKLPL-ED 206 Query: 1142 SVSLVLNRWARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEV 1321 VS VLN+W RFLRKGSLS+TIRELGHMGLPE AL F WAQKQP L+PDDRILASTVEV Sbjct: 207 DVSPVLNKWVRFLRKGSLSLTIRELGHMGLPERALQTFFWAQKQPQLFPDDRILASTVEV 266 Query: 1322 LARSRELRLPFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSG 1501 LAR+ +L++PF L ++F L S+SV EA+ +GFI+ GSL++ KLL A+DSKR++ Sbjct: 267 LARTHKLKVPFSL--EKFTGLASRSVIEALARGFIRRGSLSLAWKLLLVAKDSKRMLGPS 324 Query: 1502 VYVKLILELGKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYD 1681 +Y KLI ELGKNPDK REDL L+ QDCTAVMKVCIRLGKFEIVE ++ Sbjct: 325 IYAKLIFELGKNPDKHSLVQALLDELGEREDLKLSHQDCTAVMKVCIRLGKFEIVESLFN 384 Query: 1682 WFKKSGGVPSVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALT 1861 W+K+S PSVVMYTT+IHSRY+EKKYREALAVVWEMEAS+C+FD PAYRV IKLF+AL Sbjct: 385 WYKQSENSPSVVMYTTLIHSRYTEKKYREALAVVWEMEASDCVFDLPAYRVVIKLFIALN 444 Query: 1862 DLPRAVRYFSKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRV 2041 DL R RYFSKLKEAGFSPT+D+Y ++KIY+ R+AKC+EV E EM+GFKLD+ Sbjct: 445 DLSRTGRYFSKLKEAGFSPTYDIYRDMLKIYMVFRRLAKCREVCKELEMSGFKLDKGTLS 504 Query: 2042 QLFELGK 2062 QL +L K Sbjct: 505 QLLQLEK 511 >emb|CAN74746.1| hypothetical protein VITISV_012024 [Vitis vinifera] Length = 514 Score = 492 bits (1267), Expect = e-136 Identities = 271/485 (55%), Positives = 329/485 (67%), Gaps = 10/485 (2%) Frame = +2 Query: 632 DTRVYHRKPPKNLRFPRRTKLPLDP----CLTGVSTVSRPFANDIIRD---DDNFDSLTA 790 + RV HRKP KNL PRR KLP +P L G ++ + + D D N D Sbjct: 46 NARVNHRKPTKNLPHPRRAKLPPEPEISTFLKGGNSGTEQSEMGTVLDKELDPNDDGFLV 105 Query: 791 VDDDDVNGQKELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXX 970 D + G+KE GEIVW+ DE+EAISSLF GRIPQKPG Sbjct: 106 ---DGIEGRKE---------------GEIVWDSDEIEAISSLFMGRIPQKPGKLNRERPL 147 Query: 971 XXXXXXXXXXXGFPTQKRMLRKSVAI---ARQSVSSQLYKNPTFLGGLAREIKALPPGER 1141 G PT KR +R + ++ +R S+S Q+YKNP FL +AREI+ LP E Sbjct: 148 PLPLPYKIRPMGLPTTKRHVRAASSMPYASRASLSKQVYKNPDFLISIAREIRNLPL-ED 206 Query: 1142 SVSLVLNRWARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEV 1321 VS VLN+W RFLRKGSLS+TIRELGHMGLPE AL F WAQKQP L+PDDRILASTVEV Sbjct: 207 DVSPVLNKWVRFLRKGSLSLTIRELGHMGLPERALQTFFWAQKQPQLFPDDRILASTVEV 266 Query: 1322 LARSRELRLPFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSG 1501 LAR+ +L++PF L ++F L ++SV EA+ +GFI+ GSL++ KLL A+DSKR++ Sbjct: 267 LARTHKLKVPFSL--EKFTGLATRSVIEALARGFIRRGSLSLAWKLLLVAKDSKRMLGPS 324 Query: 1502 VYVKLILELGKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYD 1681 +Y KLI ELGKNPDK REDL L+ QDCTAVMKVCIRLGKFEIVE ++ Sbjct: 325 IYAKLIFELGKNPDKHSLVQALLDELGEREDLKLSHQDCTAVMKVCIRLGKFEIVESLFN 384 Query: 1682 WFKKSGGVPSVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALT 1861 W+K+S PSVVMYTT+IHSRY+EKKYREALAVVWEMEAS+CLFD PAYRV IKLF+AL Sbjct: 385 WYKQSENSPSVVMYTTLIHSRYTEKKYREALAVVWEMEASDCLFDLPAYRVVIKLFIALN 444 Query: 1862 DLPRAVRYFSKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQIRV 2041 DL R RYFSKLKEAGFSPT+D+Y ++KIY+ R+AKC+EV E EM+GFKLD+ Sbjct: 445 DLSRTGRYFSKLKEAGFSPTYDIYRDMLKIYMVFRRLAKCREVCKELEMSGFKLDKGTLS 504 Query: 2042 QLFEL 2056 QL +L Sbjct: 505 QLLQL 509 >ref|XP_003528528.1| PREDICTED: pentatricopeptide repeat-containing protein At2g01860-like [Glycine max] Length = 478 Score = 477 bits (1227), Expect = e-131 Identities = 257/469 (54%), Positives = 323/469 (68%), Gaps = 7/469 (1%) Frame = +2 Query: 650 RKPPKNLRFPRRTKLP----LDPCLTGVSTVSRPFANDIIRDDDNFDSLTAVDDDDVNGQ 817 R+PPK+ R+ R K P ++ L ST S+P DDD++ Sbjct: 34 RRPPKDNRYRPRPKQPPEFGVNLFLKKPSTASKP------------------TDDDMDSN 75 Query: 818 KELFEEEGSSDVNDGENGEIVWEQDELEAISSLFKGRIPQKPGNXXXXXXXXXXXXXXXX 997 +E EEE DG G +VWE DELEAISSLF+GRIPQKPG Sbjct: 76 EENDEEE------DGNIG-VVWESDELEAISSLFQGRIPQKPGKLDRERPLPLPVPFKLR 128 Query: 998 XXGFPTQKRMLR---KSVAIARQSVSSQLYKNPTFLGGLAREIKALPPGERSVSLVLNRW 1168 PT K ++ +V +R S++ ++YK+P+FL GLAR+I L P + VS +L +W Sbjct: 129 PLRLPTPKTQVKLTAPAVVSSRASMAKKVYKSPSFLVGLARQISRLGP-DADVSKILGKW 187 Query: 1169 ARFLRKGSLSMTIRELGHMGLPESALLVFCWAQKQPHLYPDDRILASTVEVLARSRELRL 1348 +FLRKGSLS+TIRELGHMG PE AL F WAQ QPHL+PDD ILASTVEVLAR+ ELR+ Sbjct: 188 VQFLRKGSLSLTIRELGHMGFPERALQTFLWAQNQPHLFPDDWILASTVEVLARNHELRI 247 Query: 1349 PFKLHDDRFMSLVSQSVYEAIVKGFIKGGSLNIVRKLLSAARDSKRVVDSGVYVKLILEL 1528 PF L ++ L S++V EA++KGFIKGG+L K+L AR KR++DS +Y KLILEL Sbjct: 248 PFNL--GQYTGLASRAVLEAMIKGFIKGGNLRFAWKVLIVARRDKRMLDSSIYAKLILEL 305 Query: 1529 GKNPDKXXXXXXXXXXXAGREDLNLNPQDCTAVMKVCIRLGKFEIVEGFYDWFKKSGGVP 1708 GKNPD+ R++LNL+ QDCTA+MKVC+++GKFE+VE + WFK+SG P Sbjct: 306 GKNPDRHRHVLPLLDELGERDELNLSQQDCTAIMKVCVKMGKFEVVESLFSWFKQSGYQP 365 Query: 1709 SVVMYTTMIHSRYSEKKYREALAVVWEMEASNCLFDFPAYRVAIKLFVALTDLPRAVRYF 1888 S+VM+T++IHSRY+EKKYREALAVVWEMEASNCLFD PAYRV IKLFVAL DL RA RYF Sbjct: 366 SIVMFTSVIHSRYTEKKYREALAVVWEMEASNCLFDLPAYRVVIKLFVALNDLSRATRYF 425 Query: 1889 SKLKEAGFSPTFDLYMYIIKIYLASGRIAKCKEVWLEAEMAGFKLDQQI 2035 SKLKEAGFSP+F LY +++IY+ASGRIAKCKE+ EAE+AGFKLD+ + Sbjct: 426 SKLKEAGFSPSFGLYKDMLQIYMASGRIAKCKELCREAEIAGFKLDKYL 474