BLASTX nr result

ID: Cephaelis21_contig00014311 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00014311
         (2163 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   723   0.0  
ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|2...   687   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   682   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   669   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   663   0.0  

>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  723 bits (1867), Expect = 0.0
 Identities = 356/481 (74%), Positives = 414/481 (86%)
 Frame = -1

Query: 1704 LVIPRSSDPIFKVQYFCRGLTVNCSQSPGFVVARKSKFRELRLFKSVELDRFITSDDEDE 1525
            L++P+ S   F  +Y  R  T+   Q+P FVV ++ K RE RLFKSVELD+F+TSDDEDE
Sbjct: 32   LIVPKFSRS-FLGEYCSRATTICNHQNPRFVVPKRDKIREFRLFKSVELDQFLTSDDEDE 90

Query: 1524 MSEGFFEAIEELERMVREPSDVLEEMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLR 1345
            MSEGFFEAIEELERM REPSDVLEEMN++LSARELQLVLVYFSQEG+DSWCALEV+EWLR
Sbjct: 91   MSEGFFEAIEELERMTREPSDVLEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLR 150

Query: 1344 KENRVDKETMELMVSIMCGWVRKLIEEKSDTGXXXXXXXXXXXXXLKPSFSMIEKVISLY 1165
            KENRVDKETMELMVSIMC WV+KLIE + D G             LKP FSMIEKVISLY
Sbjct: 151  KENRVDKETMELMVSIMCSWVKKLIEGEHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLY 210

Query: 1164 WEARDKDAAVLFVKEVLARGVAYSDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLR 985
            WE  +K+ AVLFVKEVL R +AYS+D  + HKGGPTGYLAWKMM EGNYR AVKLV+HLR
Sbjct: 211  WEMEEKEKAVLFVKEVLRREIAYSEDDGDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLR 270

Query: 984  ECGLKPELYSYLIAMTAVVKELNEVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLL 805
            E GLKPE+YSYLIAMTAVVKELNE AKALRKLKGF+K+G+IAELDAEN  L+EKYQ++LL
Sbjct: 271  ESGLKPEVYSYLIAMTAVVKELNEFAKALRKLKGFTKSGLIAELDAENVELIEKYQSDLL 330

Query: 804  DDGVRLSNWVLQDGGPSLNGIVHERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDI 625
             DGVRLS+WV+Q+G   L+G+V+ERLLAMYICAGRG+EAE+QLWEMK VGKEAD++LYDI
Sbjct: 331  ADGVRLSSWVIQEGRSPLHGVVYERLLAMYICAGRGLEAERQLWEMKLVGKEADRELYDI 390

Query: 624  VLAICASQKEVDAIGRLLTSLEVASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLG 445
            VLAICAS+KE  AI RLLT +EV S++ +KKTLSWLLRGY+KG HFDDA+ET+I+MLDLG
Sbjct: 391  VLAICASKKEASAISRLLTGMEVTSSIRRKKTLSWLLRGYIKGSHFDDASETIIKMLDLG 450

Query: 444  FCPEFLDRAAVLQGLRRRIQEPGNLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIKT 265
             CPE+LDRAAVLQGLR RIQ+ GN+ETYLKLCK LSDANLIGPCL+YL+++K+KLWI+KT
Sbjct: 451  LCPEYLDRAAVLQGLRNRIQQTGNVETYLKLCKHLSDANLIGPCLVYLYIKKYKLWILKT 510

Query: 264  V 262
            +
Sbjct: 511  I 511


>ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|222850384|gb|EEE87931.1|
            predicted protein [Populus trichocarpa]
          Length = 500

 Score =  687 bits (1774), Expect = 0.0
 Identities = 336/467 (71%), Positives = 399/467 (85%), Gaps = 4/467 (0%)
 Frame = -1

Query: 1656 CRGLTVNCS----QSPGFVVARKSKFRELRLFKSVELDRFITSDDEDEMSEGFFEAIEEL 1489
            C   T+ C+    + P FVVA+ +K RE RLFKSVELD+++TSDDE+EM EGFFEAIEEL
Sbjct: 32   CMVSTIICNYQTPKRPNFVVAKTTKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEEL 91

Query: 1488 ERMVREPSDVLEEMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMEL 1309
            ERM REPSD+LEEMN++LSARELQLVLVYFSQEG+DSWCALEV+EWLRKENRVDKETMEL
Sbjct: 92   ERMTREPSDILEEMNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMEL 151

Query: 1308 MVSIMCGWVRKLIEEKSDTGXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLF 1129
            MVSIMC WV+KLIE + D G             LKPSFSMIEKVISLYW+   K+ AV F
Sbjct: 152  MVSIMCSWVKKLIEGEQDVGDVVDLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSF 211

Query: 1128 VKEVLARGVAYSDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYL 949
            VKEVL RG+AYS D  E  KGGPTGYL WKMMV+GNYR+AVKLV+HLRE GLKPE+Y+YL
Sbjct: 212  VKEVLRRGIAYSGDDGEGQKGGPTGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYL 271

Query: 948  IAMTAVVKELNEVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQ 769
            IAMTAVVKELNE +KALRKLKG+S++G++ ELDAEN  LVEKYQ++LL DGV LS+WV+Q
Sbjct: 272  IAMTAVVKELNEFSKALRKLKGYSRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQ 331

Query: 768  DGGPSLNGIVHERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVD 589
            +G P+L G+VHERLLAMYICAGRG++AE+QLWEMK VGKEAD DLYDIVLAICASQKE  
Sbjct: 332  EGSPALYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEAS 391

Query: 588  AIGRLLTSLEVASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVL 409
            A+ RLLT +EVAS++ KKK+LSWLLRGY+KGGH+ +A ET+I+MLDLG  P++LDR AV+
Sbjct: 392  AVARLLTRIEVASSMRKKKSLSWLLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVM 451

Query: 408  QGLRRRIQEPGNLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIK 268
            QGLR+RIQ+ GN+E+YLKLCKRLSD NLIGP L+YL+++K+KLWI+K
Sbjct: 452  QGLRKRIQQWGNVESYLKLCKRLSDVNLIGPSLVYLYIKKYKLWIMK 498


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  682 bits (1759), Expect = 0.0
 Identities = 337/461 (73%), Positives = 395/461 (85%), Gaps = 2/461 (0%)
 Frame = -1

Query: 1644 TVNCSQSPGFVVARKSKFR--ELRLFKSVELDRFITSDDEDEMSEGFFEAIEELERMVRE 1471
            ++   +S  FVVA++SK R  E R+ KSVELD++I SDDE+EMSEGFFEAIEELERM RE
Sbjct: 37   SIKFPKSSNFVVAQQSKSRNREFRVLKSVELDQYIASDDEEEMSEGFFEAIEELERMTRE 96

Query: 1470 PSDVLEEMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMELMVSIMC 1291
            PSDVLEEMN+KLSARELQLVLVYFSQEG+DSWCALEV+EWLRKENRVDKETMELMVSIMC
Sbjct: 97   PSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMC 156

Query: 1290 GWVRKLIEEKSDTGXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLFVKEVLA 1111
             W++KLIE + + G             LKPSFSMIEKVISLYWE  +K+ +V FVKEVL 
Sbjct: 157  SWIKKLIEGEHEIGDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGEKEKSVSFVKEVLR 216

Query: 1110 RGVAYSDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYLIAMTAV 931
            R VAY +D  E  KGGPTGYLAWKMMV+GNYRDAVKLV+H RE GLKPE+YSYLIAMTAV
Sbjct: 217  REVAYFEDDGEGQKGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLKPEVYSYLIAMTAV 276

Query: 930  VKELNEVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQDGGPSL 751
            VKELNE AKALRKLKGF+K+G+IAELDAENT L+EKYQ++L+ DGV LS+WV+Q+G PSL
Sbjct: 277  VKELNEFAKALRKLKGFAKSGLIAELDAENTRLIEKYQSDLIADGVCLSSWVIQEGSPSL 336

Query: 750  NGIVHERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVDAIGRLL 571
             G+VHERLLAMYICAGRG++AE+QLWEMK VGK AD DLYDIVLAICASQKE  A+ RLL
Sbjct: 337  YGVVHERLLAMYICAGRGLDAERQLWEMKLVGKHADGDLYDIVLAICASQKEASAVSRLL 396

Query: 570  TSLEVASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVLQGLRRR 391
            T +EV S+L KKKTLSWLLRGYLKGG +D+A E +++MLD+G CP++LDR AVLQGLR+R
Sbjct: 397  TRVEVTSSLQKKKTLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDYLDRVAVLQGLRKR 456

Query: 390  IQEPGNLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIK 268
            IQ+ GN+E+YL LCKRLSD NLIGP L+YL+++K+KLWI+K
Sbjct: 457  IQQWGNVESYLNLCKRLSDENLIGPSLVYLYIKKYKLWIMK 497


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  669 bits (1725), Expect = 0.0
 Identities = 334/508 (65%), Positives = 411/508 (80%), Gaps = 15/508 (2%)
 Frame = -1

Query: 1746 MATANGIALFSYLGLVIP------RSSDPI--------FKVQYFCRGLTVNCS-QSPGFV 1612
            MA+A+G+A    LG V        R   P+        F ++++      +C  ++P FV
Sbjct: 1    MASAHGLAPIFKLGFVFSSVSPSQRKRHPLMFPASHCGFSLKFYGGLSARSCKFKNPSFV 60

Query: 1611 VARKSKFRELRLFKSVELDRFITSDDEDEMSEGFFEAIEELERMVREPSDVLEEMNEKLS 1432
             A+    R  R  KSVE+D+++TS+DE  MS+GFFEAIEELERM REPSDVLEEMN++LS
Sbjct: 61   SAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTREPSDVLEEMNDRLS 118

Query: 1431 ARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMELMVSIMCGWVRKLIEEKSDT 1252
            ARELQLVLVYFSQ+G+DSWCALEV++WLRKENRVDKETMELMV+IMCGWV+KLI+++   
Sbjct: 119  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQQQHGV 178

Query: 1251 GXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLFVKEVLARGVAYSDDHKEEH 1072
            G             L+P FSMIEKVISLYWE  +K+ AVLFV+EVL RG+ Y ++ +E H
Sbjct: 179  GDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYVEEDEEGH 238

Query: 1071 KGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYLIAMTAVVKELNEVAKALRK 892
            KGGPTGYLAWKMM EG+YR+AV+LV+  RE GLKPE+YSYL+AMTAVVKELNE AKALRK
Sbjct: 239  KGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAVVKELNEFAKALRK 298

Query: 891  LKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQDGGPSLNGIVHERLLAMYI 712
            LKGF++AG++AELD E+  L EKYQ++ L DGVRLSNWV+QDG PSL+GIVHERLLAMYI
Sbjct: 299  LKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSLHGIVHERLLAMYI 358

Query: 711  CAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVDAIGRLLTSLEVASALSKKK 532
            CAG G+EAE+QLWEMK VGKEAD DLYDIVLAICASQKE +A  RLLT LEV S+  KKK
Sbjct: 359  CAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVVSSPQKKK 418

Query: 531  TLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVLQGLRRRIQEPGNLETYLKL 352
            +LSWLLRGY+KGGHF++A ET+++ML+LGF PE+LDRAAVLQGLR+RIQ+ GNL+TY++L
Sbjct: 419  SLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVRL 478

Query: 351  CKRLSDANLIGPCLLYLHLRKHKLWIIK 268
            CK LSDANLIGPCL++L++RK+KLW++K
Sbjct: 479  CKSLSDANLIGPCLVHLYIRKYKLWVVK 506


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  663 bits (1711), Expect = 0.0
 Identities = 325/456 (71%), Positives = 389/456 (85%), Gaps = 2/456 (0%)
 Frame = -1

Query: 1629 QSPGFVVARKSKFRELRLFKSVELDRFITSDDE-DEMSEGFFEAIEELERMVREPSDVLE 1453
            ++P FV  ++   R  R  KSVELD+++TSDDE DEMS+GFFEAIEELERM REPSDVLE
Sbjct: 55   KNPSFV--KQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLE 112

Query: 1452 EMNEKLSARELQLVLVYFSQEGKDSWCALEVYEWLRKENRVDKETMELMVSIMCGWVRKL 1273
            EMN++LSARELQLVLVYFSQ+G+DSWCALEV++WLRKENRVDKETMELMV+IMCGWV+KL
Sbjct: 113  EMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKL 172

Query: 1272 IEEKSDT-GXXXXXXXXXXXXXLKPSFSMIEKVISLYWEARDKDAAVLFVKEVLARGVAY 1096
            I+E     G             L+P FSMIEKVISLYWE  +K+ AVLFV+EVL RG+ Y
Sbjct: 173  IQEHHGVVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPY 232

Query: 1095 SDDHKEEHKGGPTGYLAWKMMVEGNYRDAVKLVVHLRECGLKPELYSYLIAMTAVVKELN 916
             ++ +E HKGGPTGYLAWKMM EG+Y  AV+LV+H  E GLKPE+YSYL+AMTAVVKELN
Sbjct: 233  LEEDEEGHKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELN 292

Query: 915  EVAKALRKLKGFSKAGVIAELDAENTLLVEKYQANLLDDGVRLSNWVLQDGGPSLNGIVH 736
            E+AKALRKLK F++ G++AELD E+  L EKYQ++LL DGVRLSNW +QDG PSL+GI+H
Sbjct: 293  ELAKALRKLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIH 352

Query: 735  ERLLAMYICAGRGVEAEKQLWEMKFVGKEADKDLYDIVLAICASQKEVDAIGRLLTSLEV 556
            ERLLAMYICAG G+EAEKQLWEMK VGKEAD DLYDIVLAICASQKE +A  RLLT LEV
Sbjct: 353  ERLLAMYICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEV 412

Query: 555  ASALSKKKTLSWLLRGYLKGGHFDDATETVIRMLDLGFCPEFLDRAAVLQGLRRRIQEPG 376
            AS+  KKK+LSWLLRGY+KGGHF++A ET+++MLDLGF PE+LDRAAVLQGLR+RIQ+ G
Sbjct: 413  ASSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYG 472

Query: 375  NLETYLKLCKRLSDANLIGPCLLYLHLRKHKLWIIK 268
            NL+TY++LCK LSDANLIGPCL++L++RK+KLW++K
Sbjct: 473  NLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVK 508


Top