BLASTX nr result
ID: Cephaelis21_contig00014311
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00014311 (2163 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi... 723 0.0 ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|2... 687 0.0 ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm... 682 0.0 ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi... 669 0.0 ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi... 663 0.0 >ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Vitis vinifera] Length = 511 Score = 723 bits (1867), Expect = 0.0 Identities = 356/481 (74%), Positives = 414/481 (86%) Frame = -1 Query: 1704 LVIPRSSDPIFKVQYFCRGLTVNCSQSPGFVVARKSKFRELRLFKSVELDRFITSDDEDE 1525 L++P+ S F +Y R T+ Q+P FVV ++ K RE RLFKSVELD+F+TSDDEDE Sbjct: 32 LIVPKFSRS-FLGEYCSRATTICNHQNPRFVVPKRDKIREFRLFKSVELDQFLTSDDEDE 90 Query: 1524 MSEGFFEAIEELERMVREPSDVLEEMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLR 1345 MSEGFFEAIEELERM REPSDVLEEMN++LSARELQLVLVYFSQEG+DSWCALEV+EWLR Sbjct: 91 MSEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLR 150 Query: 1344 KENRVDKETMELMVSIMCGWVRKLIEEKSDTGXXXXXXXXXXXXXLKPSFSMIEKVISLY 1165 KENRVDKETMELMVSIMC WV+KLIE + D G LKP FSMIEKVISLY Sbjct: 151 KENRVDKETMELMVSIMCSWVKKLIEGEHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLY 210 Query: 1164 WEARDKDAAVLFVKEVLARGVAYSDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLR 985 WE +K+ AVLFVKEVL R +AYS+D + HKGGPTGYLAWKMM EGNYR AVKLV+HLR Sbjct: 211 WEMEEKEKAVLFVKEVLRREIAYSEDDGDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLR 270 Query: 984 ECGLKPELYSYLIAMTAVVKELNEVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLL 805 E GLKPE+YSYLIAMTAVVKELNE AKALRKLKGF+K+G+IAELDAEN L+EKYQ++LL Sbjct: 271 ESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKSGLIAELDAENVELIEKYQSDLL 330 Query: 804 DDGVRLSNWVLQDGGPSLNGIVHERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDI 625 DGVRLS+WV+Q+G L+G+V+ERLLAMYICAGRG+EAE+QLWEMK VGKEAD++LYDI Sbjct: 331 ADGVRLSSWVIQEGRSPLHGVVYERLLAMYICAGRGLEAERQLWEMKLVGKEADRELYDI 390 Query: 624 VLAICASQKEVDAIGRLLTSLEVASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLG 445 VLAICAS+KE AI RLLT +EV S++ +KKTLSWLLRGY+KG HFDDA+ET+I+MLDLG Sbjct: 391 VLAICASKKEASAISRLLTGMEVTSSIRRKKTLSWLLRGYIKGSHFDDASETIIKMLDLG 450 Query: 444 FCPEFLDRAAVLQGLRRRIQEPGNLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIKT 265 CPE+LDRAAVLQGLR RIQ+ GN+ETYLKLCK LSDANLIGPCL+YL+++K+KLWI+KT Sbjct: 451 LCPEYLDRAAVLQGLRNRIQQTGNVETYLKLCKHLSDANLIGPCLVYLYIKKYKLWILKT 510 Query: 264 V 262 + Sbjct: 511 I 511 >ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|222850384|gb|EEE87931.1| predicted protein [Populus trichocarpa] Length = 500 Score = 687 bits (1774), Expect = 0.0 Identities = 336/467 (71%), Positives = 399/467 (85%), Gaps = 4/467 (0%) Frame = -1 Query: 1656 CRGLTVNCS----QSPGFVVARKSKFRELRLFKSVELDRFITSDDEDEMSEGFFEAIEEL 1489 C T+ C+ + P FVVA+ +K RE RLFKSVELD+++TSDDE+EM EGFFEAIEEL Sbjct: 32 CMVSTIICNYQTPKRPNFVVAKTTKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEEL 91 Query: 1488 ERMVREPSDVLEEMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMEL 1309 ERM REPSD+LEEMN++LSARELQLVLVYFSQEG+DSWCALEV+EWLRKENRVDKETMEL Sbjct: 92 ERMTREPSDILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMEL 151 Query: 1308 MVSIMCGWVRKLIEEKSDTGXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLF 1129 MVSIMC WV+KLIE + D G LKPSFSMIEKVISLYW+ K+ AV F Sbjct: 152 MVSIMCSWVKKLIEGEQDVGDVVDLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSF 211 Query: 1128 VKEVLARGVAYSDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYL 949 VKEVL RG+AYS D E KGGPTGYL WKMMV+GNYR+AVKLV+HLRE GLKPE+Y+YL Sbjct: 212 VKEVLRRGIAYSGDDGEGQKGGPTGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYL 271 Query: 948 IAMTAVVKELNEVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQ 769 IAMTAVVKELNE +KALRKLKG+S++G++ ELDAEN LVEKYQ++LL DGV LS+WV+Q Sbjct: 272 IAMTAVVKELNEFSKALRKLKGYSRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQ 331 Query: 768 DGGPSLNGIVHERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVD 589 +G P+L G+VHERLLAMYICAGRG++AE+QLWEMK VGKEAD DLYDIVLAICASQKE Sbjct: 332 EGSPALYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEAS 391 Query: 588 AIGRLLTSLEVASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVL 409 A+ RLLT +EVAS++ KKK+LSWLLRGY+KGGH+ +A ET+I+MLDLG P++LDR AV+ Sbjct: 392 AVARLLTRIEVASSMRKKKSLSWLLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVM 451 Query: 408 QGLRRRIQEPGNLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIK 268 QGLR+RIQ+ GN+E+YLKLCKRLSD NLIGP L+YL+++K+KLWI+K Sbjct: 452 QGLRKRIQQWGNVESYLKLCKRLSDVNLIGPSLVYLYIKKYKLWIMK 498 >ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis] gi|223539607|gb|EEF41193.1| conserved hypothetical protein [Ricinus communis] Length = 499 Score = 682 bits (1759), Expect = 0.0 Identities = 337/461 (73%), Positives = 395/461 (85%), Gaps = 2/461 (0%) Frame = -1 Query: 1644 TVNCSQSPGFVVARKSKFR--ELRLFKSVELDRFITSDDEDEMSEGFFEAIEELERMVRE 1471 ++ +S FVVA++SK R E R+ KSVELD++I SDDE+EMSEGFFEAIEELERM RE Sbjct: 37 SIKFPKSSNFVVAQQSKSRNREFRVLKSVELDQYIASDDEEEMSEGFFEAIEELERMTRE 96 Query: 1470 PSDVLEEMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMELMVSIMC 1291 PSDVLEEMN+KLSARELQLVLVYFSQEG+DSWCALEV+EWLRKENRVDKETMELMVSIMC Sbjct: 97 PSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMC 156 Query: 1290 GWVRKLIEEKSDTGXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLFVKEVLA 1111 W++KLIE + + G LKPSFSMIEKVISLYWE +K+ +V FVKEVL Sbjct: 157 SWIKKLIEGEHEIGDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLR 216 Query: 1110 RGVAYSDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYLIAMTAV 931 R VAY +D E KGGPTGYLAWKMMV+GNYRDAVKLV+H RE GLKPE+YSYLIAMTAV Sbjct: 217 REVAYFEDDGEGQKGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAV 276 Query: 930 VKELNEVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQDGGPSL 751 VKELNE AKALRKLKGF+K+G+IAELDAENT L+EKYQ++L+ DGV LS+WV+Q+G PSL Sbjct: 277 VKELNEFAKALRKLKGFAKSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSL 336 Query: 750 NGIVHERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVDAIGRLL 571 G+VHERLLAMYICAGRG++AE+QLWEMK VGK AD DLYDIVLAICASQKE A+ RLL Sbjct: 337 YGVVHERLLAMYICAGRGLDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLL 396 Query: 570 TSLEVASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVLQGLRRR 391 T +EV S+L KKKTLSWLLRGYLKGG +D+A E +++MLD+G CP++LDR AVLQGLR+R Sbjct: 397 TRVEVTSSLQKKKTLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKR 456 Query: 390 IQEPGNLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIK 268 IQ+ GN+E+YL LCKRLSD NLIGP L+YL+++K+KLWI+K Sbjct: 457 IQQWGNVESYLNLCKRLSDENLIGPSLVYLYIKKYKLWIMK 497 >ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic-like [Glycine max] Length = 508 Score = 669 bits (1725), Expect = 0.0 Identities = 334/508 (65%), Positives = 411/508 (80%), Gaps = 15/508 (2%) Frame = -1 Query: 1746 MATANGIALFSYLGLVIP------RSSDPI--------FKVQYFCRGLTVNCS-QSPGFV 1612 MA+A+G+A LG V R P+ F ++++ +C ++P FV Sbjct: 1 MASAHGLAPIFKLGFVFSSVSPSQRKRHPLMFPASHCGFSLKFYGGLSARSCKFKNPSFV 60 Query: 1611 VARKSKFRELRLFKSVELDRFITSDDEDEMSEGFFEAIEELERMVREPSDVLEEMNEKLS 1432 A+ R R KSVE+D+++TS+DE MS+GFFEAIEELERM REPSDVLEEMN++LS Sbjct: 61 SAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTREPSDVLEEMNDRLS 118 Query: 1431 ARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMELMVSIMCGWVRKLIEEKSDT 1252 ARELQLVLVYFSQ+G+DSWCALEV++WLRKENRVDKETMELMV+IMCGWV+KLI+++ Sbjct: 119 ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQQQHGV 178 Query: 1251 GXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLFVKEVLARGVAYSDDHKEEH 1072 G L+P FSMIEKVISLYWE +K+ AVLFV+EVL RG+ Y ++ +E H Sbjct: 179 GDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYVEEDEEGH 238 Query: 1071 KGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYLIAMTAVVKELNEVAKALRK 892 KGGPTGYLAWKMM EG+YR+AV+LV+ RE GLKPE+YSYL+AMTAVVKELNE AKALRK Sbjct: 239 KGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKELNEFAKALRK 298 Query: 891 LKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQDGGPSLNGIVHERLLAMYI 712 LKGF++AG++AELD E+ L EKYQ++ L DGVRLSNWV+QDG PSL+GIVHERLLAMYI Sbjct: 299 LKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIVHERLLAMYI 358 Query: 711 CAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVDAIGRLLTSLEVASALSKKK 532 CAG G+EAE+QLWEMK VGKEAD DLYDIVLAICASQKE +A RLLT LEV S+ KKK Sbjct: 359 CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVVSSPQKKK 418 Query: 531 TLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVLQGLRRRIQEPGNLETYLKL 352 +LSWLLRGY+KGGHF++A ET+++ML+LGF PE+LDRAAVLQGLR+RIQ+ GNL+TY++L Sbjct: 419 SLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 478 Query: 351 CKRLSDANLIGPCLLYLHLRKHKLWIIK 268 CK LSDANLIGPCL++L++RK+KLW++K Sbjct: 479 CKSLSDANLIGPCLVHLYIRKYKLWVVK 506 >ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100, chloroplastic-like [Glycine max] Length = 510 Score = 663 bits (1711), Expect = 0.0 Identities = 325/456 (71%), Positives = 389/456 (85%), Gaps = 2/456 (0%) Frame = -1 Query: 1629 QSPGFVVARKSKFRELRLFKSVELDRFITSDDE-DEMSEGFFEAIEELERMVREPSDVLE 1453 ++P FV ++ R R KSVELD+++TSDDE DEMS+GFFEAIEELERM REPSDVLE Sbjct: 55 KNPSFV--KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLE 112 Query: 1452 EMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMELMVSIMCGWVRKL 1273 EMN++LSARELQLVLVYFSQ+G+DSWCALEV++WLRKENRVDKETMELMV+IMCGWV+KL Sbjct: 113 EMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKL 172 Query: 1272 IEEKSDT-GXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLFVKEVLARGVAY 1096 I+E G L+P FSMIEKVISLYWE +K+ AVLFV+EVL RG+ Y Sbjct: 173 IQEHHGVVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPY 232 Query: 1095 SDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYLIAMTAVVKELN 916 ++ +E HKGGPTGYLAWKMM EG+Y AV+LV+H E GLKPE+YSYL+AMTAVVKELN Sbjct: 233 LEEDEEGHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELN 292 Query: 915 EVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQDGGPSLNGIVH 736 E+AKALRKLK F++ G++AELD E+ L EKYQ++LL DGVRLSNW +QDG PSL+GI+H Sbjct: 293 ELAKALRKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIH 352 Query: 735 ERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVDAIGRLLTSLEV 556 ERLLAMYICAG G+EAEKQLWEMK VGKEAD DLYDIVLAICASQKE +A RLLT LEV Sbjct: 353 ERLLAMYICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEV 412 Query: 555 ASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVLQGLRRRIQEPG 376 AS+ KKK+LSWLLRGY+KGGHF++A ET+++MLDLGF PE+LDRAAVLQGLR+RIQ+ G Sbjct: 413 ASSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYG 472 Query: 375 NLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIK 268 NL+TY++LCK LSDANLIGPCL++L++RK+KLW++K Sbjct: 473 NLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVK 508