BLASTX nr result

ID: Anemarrhena21_contig00020531 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00020531
         (1027 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008800497.1| PREDICTED: putative pentatricopeptide repeat...   249   2e-63
ref|XP_010939837.1| PREDICTED: putative pentatricopeptide repeat...   239   2e-60
ref|XP_010089903.1| hypothetical protein L484_008591 [Morus nota...   217   9e-54
ref|XP_011623398.1| PREDICTED: pentatricopeptide repeat-containi...   215   4e-53
emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]   213   2e-52
ref|XP_002281821.2| PREDICTED: putative pentatricopeptide repeat...   212   3e-52
ref|XP_010260521.1| PREDICTED: LOW QUALITY PROTEIN: putative pen...   211   7e-52
ref|XP_009104764.1| PREDICTED: putative pentatricopeptide repeat...   211   7e-52
emb|CDX85827.1| BnaC06g23070D [Brassica napus]                        207   1e-50
ref|XP_007048433.1| Tetratricopeptide repeat-like superfamily pr...   206   2e-50
emb|CDX68186.1| BnaA07g22260D [Brassica napus]                        205   5e-50
ref|NP_177580.1| pentatricopeptide repeat-containing protein [Ar...   203   1e-49
ref|XP_012438783.1| PREDICTED: putative pentatricopeptide repeat...   202   2e-49
ref|XP_010537377.1| PREDICTED: putative pentatricopeptide repeat...   202   2e-49
ref|XP_010537376.1| PREDICTED: putative pentatricopeptide repeat...   202   2e-49
ref|XP_006390431.1| hypothetical protein EUTSA_v10018864mg [Eutr...   202   2e-49
ref|XP_009338951.1| PREDICTED: LOW QUALITY PROTEIN: putative pen...   202   4e-49
ref|XP_010471530.1| PREDICTED: putative pentatricopeptide repeat...   201   5e-49
ref|XP_006302229.1| hypothetical protein CARUB_v10020251mg [Caps...   201   9e-49
ref|XP_010265137.1| PREDICTED: pentatricopeptide repeat-containi...   200   2e-48

>ref|XP_008800497.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 [Phoenix dactylifera]
          Length = 458

 Score =  249 bits (637), Expect = 2e-63
 Identities = 136/241 (56%), Positives = 161/241 (66%), Gaps = 9/241 (3%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGS------GCR--PNSVTFV 872
            AR  FD A QRDVTTWTS+I+G ALHG A +AL LF EMK S      GCR  PN VTFV
Sbjct: 217  ARHLFDSAKQRDVTTWTSMIVGLALHGLANEALMLFAEMKESISSESNGCRVSPNHVTFV 276

Query: 871  GVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPV 692
            GVL ACSHAGL  EG+  WENM R Y L+  LAHYGCMVDL CR+G LE+AY FI+ MPV
Sbjct: 277  GVLMACSHAGLVSEGRFHWENMQRDYNLRPRLAHYGCMVDLFCRAGLLEDAYAFIKGMPV 336

Query: 691  QANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYAEAXXXXXXXXX 512
            Q NAVIWRTLL A  L GNVS+G  AR+RLLELEP+   D +++SN YA A         
Sbjct: 337  QRNAVIWRTLLAASCLHGNVSLGALARRRLLELEPDYAGDDVTLSNVYAAAGLWDEKQDV 396

Query: 511  XXXXXXRA-PGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSKDSWEIKVVDYA 335
                  +  PGCSLIEVGS  HEFVA DRR ++ K +Q++L+ +V  S+    +   D A
Sbjct: 397  RKRMKRQRDPGCSLIEVGSRTHEFVAADRRHLREKGMQEVLQSIVENSRALVHVPDADIA 456

Query: 334  L 332
            +
Sbjct: 457  V 457


>ref|XP_010939837.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 [Elaeis guineensis]
          Length = 455

 Score =  239 bits (610), Expect = 2e-60
 Identities = 131/229 (57%), Positives = 153/229 (66%), Gaps = 9/229 (3%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGS------GC--RPNSVTFV 872
            A   FD   QRDVTTWTS+I+G ALHGRA +AL LF EMK S      GC   PN VTFV
Sbjct: 217  AHHLFDGTKQRDVTTWTSMIVGLALHGRANEALLLFAEMKKSMSGHSNGCCVSPNHVTFV 276

Query: 871  GVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPV 692
            GVL ACSHAGL  E +  WE+M R Y LK  LAHYGCMVDL CR+G LE+AY FI+ MP+
Sbjct: 277  GVLMACSHAGLVNEAQFHWESMQRDYNLKPQLAHYGCMVDLFCRAGLLEDAYAFIKRMPM 336

Query: 691  QANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYAEAXXXXXXXXX 512
            Q NAVIWRTLL ACSL GNVS+G  AR RLLELEP+   D ++MSN YA A         
Sbjct: 337  QCNAVIWRTLLAACSLHGNVSLGALARCRLLELEPDYAGDDVTMSNMYAAAGLWDEKQDV 396

Query: 511  XXXXXXRA-PGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSK 368
                  R  PG SLIEVGS  HEF + DRR ++ K +Q++L+ +V  S+
Sbjct: 397  RKRMKRRRDPGSSLIEVGSRTHEFASSDRRHLREKGMQEVLQSIVENSR 445


>ref|XP_010089903.1| hypothetical protein L484_008591 [Morus notabilis]
            gi|587848284|gb|EXB38563.1| hypothetical protein
            L484_008591 [Morus notabilis]
          Length = 451

 Score =  217 bits (553), Expect = 9e-54
 Identities = 115/220 (52%), Positives = 143/220 (65%), Gaps = 21/220 (9%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK------------GSGCR--- 893
            AR+ FD   ++DVTTWTS+I+GHALHG+A++AL LF +MK              GC    
Sbjct: 228  ARRLFDSLRRKDVTTWTSMIVGHALHGQAEEALNLFAKMKETRESPKKKKKKNDGCNGGA 287

Query: 892  ----PNSVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLE 725
                PN VTF+GVL +CSHAG+ EEGK+Q+ +M   Y LK   +HYGCMVDL CR+G LE
Sbjct: 288  SSIVPNDVTFIGVLMSCSHAGMVEEGKRQFRSMVEDYGLKPKDSHYGCMVDLFCRAGMLE 347

Query: 724  EAYGFIETMPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA 545
            EAY FI  MPV+ANAV+WRTLLGACSL G+V +G + R++LLELEP  V D + +SN YA
Sbjct: 348  EAYDFISKMPVRANAVVWRTLLGACSLNGSVELGSKVRQKLLELEPAHVGDSVVLSNIYA 407

Query: 544  E--AXXXXXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGD 431
                               R+PGCS IEVGS + EFVA D
Sbjct: 408  AEGMWERKMTVRDQMTKQRRSPGCSSIEVGSGISEFVASD 447


>ref|XP_011623398.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like
            [Amborella trichopoda]
          Length = 420

 Score =  215 bits (548), Expect = 4e-53
 Identities = 111/219 (50%), Positives = 143/219 (65%), Gaps = 4/219 (1%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCRPNSVTFVGVLTACSH 848
            AR  FD    RD+T+WTS+I  HALHG A KAL LF EM+G   +PN VTFVG+LTACSH
Sbjct: 191  ARHLFDNLAYRDITSWTSMIAAHALHGEALKALGLFGEMEGENIKPNEVTFVGILTACSH 250

Query: 847  AGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQANAVIWR 668
            AGL E G+Q +E+M + Y +   ++HYGCMVDLLCR+G+L +AYGFI+ MP   NAV+WR
Sbjct: 251  AGLVERGRQLFESMHKEYGIMPKMSHYGCMVDLLCRAGRLTDAYGFIQCMPFPPNAVVWR 310

Query: 667  TLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYAEAXXXXXXXXXXXXXXXRA 488
            TLL ACSL G++ +G  AR  L ELEP  V D + +SN +A                 R 
Sbjct: 311  TLLSACSLHGDMELGAIARDYLAELEPGHVGDDVVLSNMHAAVGQWDEKAAVRKRIKSRG 370

Query: 487  ----PGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGM 383
                PGCSLI+V   V+EFV  D +   ++EI ++L+GM
Sbjct: 371  RRRLPGCSLIQVEGTVNEFVIADNKHPLSEEIYEVLKGM 409



 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 4/134 (2%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCRPNSVTFVGVLTACSH 848
            A   FD    R+   W+++I G+  +G+  KAL+LF+EM+  G  P+ VT    L+AC+ 
Sbjct: 90   AHAVFDEMKHRNFVAWSALITGYVRNGKPNKALKLFREMQEEGLEPDQVTLAIALSACAD 149

Query: 847  AGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLC----RSGQLEEAYGFIETMPVQANA 680
             G  + G  +W      Y LK N+    C+V+ L     +  ++E A    + +  + + 
Sbjct: 150  LGALQTG--EW---IHAYALKNNITPDLCLVNALINMYGKCSKVEIARHLFDNLAYR-DI 203

Query: 679  VIWRTLLGACSLKG 638
              W +++ A +L G
Sbjct: 204  TSWTSMIAAHALHG 217


>emb|CAN66662.1| hypothetical protein VITISV_031722 [Vitis vinifera]
          Length = 1060

 Score =  213 bits (542), Expect = 2e-52
 Identities = 119/247 (48%), Positives = 152/247 (61%), Gaps = 25/247 (10%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCR--------------- 893
            AR+ FD   ++DVTTWTS+I+GHALHG+A++AL+LF EMK +  R               
Sbjct: 803  ARRLFDGTQKKDVTTWTSMIVGHALHGQAEEALQLFTEMKETNKRARKNKRNGEXESSLV 862

Query: 892  -PNSVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAY 716
             PN VTF+GVL ACSHAGL EEGKQ + +M   Y L+  ++H+GCMVDLLCR+G L EAY
Sbjct: 863  LPNDVTFMGVLMACSHAGLVEEGKQHFRSMKEDYSLRPRISHFGCMVDLLCRAGLLTEAY 922

Query: 715  GFIETMPVQANAVIWRTLLGACSLKG--------NVSIGDRARKRLLELEPELVSDRISM 560
             FI  MPV+ NAV+WRTLLGACSL+G        N+ I   AR++LLELEP  V D + M
Sbjct: 923  EFILKMPVRPNAVVWRTLLGACSLQGDSNGNGNSNIKIXSEARRQLLELEPSHVGDNVIM 982

Query: 559  SNAY-AEAXXXXXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGM 383
            SN Y A+                R PGCS IEVG  + EFVA D +     +I +IL+ +
Sbjct: 983  SNLYAAKGMWDKKMLVRNQIKQRRDPGCSSIEVGIDIKEFVAADDQHPCMPQIYEILDHL 1042

Query: 382  VHCSKDS 362
                + S
Sbjct: 1043 TRTMRAS 1049


>ref|XP_002281821.2| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 [Vitis vinifera]
          Length = 482

 Score =  212 bits (540), Expect = 3e-52
 Identities = 119/247 (48%), Positives = 152/247 (61%), Gaps = 25/247 (10%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCR--------------- 893
            AR+ FD   ++DVTTWTS+I+GHALHG+A++AL+LF EMK +  R               
Sbjct: 225  ARRLFDGTQKKDVTTWTSMIVGHALHGQAEEALQLFTEMKETNKRARKNKRNGEHESSLV 284

Query: 892  -PNSVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAY 716
             PN VTF+GVL ACSHAGL EEGKQ + +M   Y L+  ++H+GCMVDLLCR+G L EAY
Sbjct: 285  LPNDVTFMGVLMACSHAGLVEEGKQHFRSMKEDYSLRPRISHFGCMVDLLCRAGLLTEAY 344

Query: 715  GFIETMPVQANAVIWRTLLGACSLKG--------NVSIGDRARKRLLELEPELVSDRISM 560
             FI  MPV+ NAV+WRTLLGACSL+G        N+ I   AR++LLELEP  V D + M
Sbjct: 345  EFILKMPVRPNAVVWRTLLGACSLQGDSNGNGNSNIKIYSEARRQLLELEPSHVGDNVIM 404

Query: 559  SNAY-AEAXXXXXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGM 383
            SN Y A+                R PGCS IEVG  + EFVA D +     +I +IL+ +
Sbjct: 405  SNLYAAKGMWDKKMLVRNQIKQRRDPGCSSIEVGIDIKEFVAADDQHPCMPQIYEILDHL 464

Query: 382  VHCSKDS 362
                + S
Sbjct: 465  TRTMRAS 471


>ref|XP_010260521.1| PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide
            repeat-containing protein At1g74400 [Nelumbo nucifera]
          Length = 302

 Score =  211 bits (537), Expect = 7e-52
 Identities = 115/236 (48%), Positives = 147/236 (62%), Gaps = 18/236 (7%)
 Frame = -1

Query: 1015 FDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCR-----------------PN 887
            FD    +DVTTWTS+I+GHA+H +A++AL LF+EM  S  +                 PN
Sbjct: 51   FDSVRPKDVTTWTSMIVGHAVHEQAEEALRLFEEMNRSKSKRKMRNKNDGEHWSDLILPN 110

Query: 886  SVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFI 707
             VTF+GVL ACSH GL EE  Q  E+M++ Y LK  ++HYGCMVDLLCR+G L++AY FI
Sbjct: 111  EVTFIGVLMACSHKGLVEEEWQHLESMSKKYGLKPRISHYGCMVDLLCRAGLLKDAYDFI 170

Query: 706  ETMPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYAEA-XXX 530
              MP+QANAV+W TLLGACSL G+  +G   R+RL EL+P  V D +++SN YA A    
Sbjct: 171  VNMPIQANAVVWCTLLGACSLHGDTELGLSVRQRLFELDPSHVGDDVALSNTYAAAGLWD 230

Query: 529  XXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSKDS 362
                        R PGCS IE  + VHEFV  DR     +EI ++LEGM+   K S
Sbjct: 231  DKLMVRNQIQHXRIPGCSSIETTTGVHEFVTADRSHHMKREIYEVLEGMIKNLKAS 286


>ref|XP_009104764.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 [Brassica rapa]
          Length = 467

 Score =  211 bits (537), Expect = 7e-52
 Identities = 112/224 (50%), Positives = 155/224 (69%), Gaps = 6/224 (2%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK--GSGCRPNSVTFVGVLTAC 854
            AR+ FD   ++DVTT+TS+I+G+AL+G+AQ++LELFK+MK   S   PN VTF+GVL AC
Sbjct: 229  ARRVFDETVRKDVTTYTSMIVGYALNGQAQESLELFKKMKRQDSSVSPNDVTFIGVLMAC 288

Query: 853  SHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQANAVI 674
            SH GL EEGK+ + +M   Y LK   AH+GCMVDLLCRSG+L++A+ FI  MPV+ NAVI
Sbjct: 289  SHGGLVEEGKRHFRSMVEDYNLKPREAHFGCMVDLLCRSGRLKDAHEFISQMPVKPNAVI 348

Query: 673  WRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAY-AEAXXXXXXXXXXXXXX 497
            WRTLLGACSL+GNV +G+ A++R+ EL+ + V D +++SN Y A+               
Sbjct: 349  WRTLLGACSLQGNVELGEEAQRRIFELDSDHVGDYVALSNIYAAKGMWDEKVRMRDRVRK 408

Query: 496  XRAPGCSLIEVGSAVHEFVAG---DRRQVKTKEIQQILEGMVHC 374
             R PG S IE+G+ + EFV+G   D  ++   EI ++L  +V C
Sbjct: 409  RREPGKSWIEMGNIIAEFVSGGGDDDGKLMVGEISEVLRCLVAC 452


>emb|CDX85827.1| BnaC06g23070D [Brassica napus]
          Length = 466

 Score =  207 bits (527), Expect = 1e-50
 Identities = 110/223 (49%), Positives = 154/223 (69%), Gaps = 5/223 (2%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEM-KGSGCRPNSVTFVGVLTACS 851
            AR+ FD   ++DVTT+TS+I+G+AL+G+AQ++LELFK+M + S   PN VTF+GVL ACS
Sbjct: 229  ARRVFDETMRKDVTTYTSMIVGYALNGQAQESLELFKKMSQDSSVSPNDVTFIGVLMACS 288

Query: 850  HAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQANAVIW 671
            H GL EEGK+ + +M   Y LK   AHYGC+VDLLCRSG+L++A+ FI  MPV+ NAVIW
Sbjct: 289  HGGLVEEGKRHFRSMVEDYNLKPRDAHYGCIVDLLCRSGRLKDAHDFINQMPVKPNAVIW 348

Query: 670  RTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAY-AEAXXXXXXXXXXXXXXX 494
            RTLLGACSL+GNV +G+ A++R+ EL+ + V D +++SN Y A+                
Sbjct: 349  RTLLGACSLQGNVELGEEAQRRIFELDSDHVGDYVALSNIYAAKGMWDEKLKIRDRVRKR 408

Query: 493  RAPGCSLIEVGSAVHEFVAG---DRRQVKTKEIQQILEGMVHC 374
            R PG S IE+G+ + EFV+G      ++   EI ++L  +V C
Sbjct: 409  REPGKSWIEMGNIIAEFVSGGGDGDGKLMVGEISEVLRCLVAC 451


>ref|XP_007048433.1| Tetratricopeptide repeat-like superfamily protein, putative
            [Theobroma cacao] gi|508700694|gb|EOX92590.1|
            Tetratricopeptide repeat-like superfamily protein,
            putative [Theobroma cacao]
          Length = 477

 Score =  206 bits (524), Expect = 2e-50
 Identities = 112/229 (48%), Positives = 145/229 (63%), Gaps = 17/229 (7%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK----------------GSGC 896
            AR+ FD   ++DVTTWTS+I+GHALHG+A +AL+LF +M+                 S  
Sbjct: 227  ARKLFDSLGEKDVTTWTSMIVGHALHGQANEALQLFGKMEEIKQKNGKSRDEGNRGSSII 286

Query: 895  RPNSVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAY 716
             PN VTF+GVL ACSH G+ EEGK+ +++M+  Y LK    H+GCMVD+ CR+G L+EAY
Sbjct: 287  LPNDVTFIGVLMACSHGGMVEEGKKYYQSMSEDYGLKPRDVHFGCMVDIFCRAGLLKEAY 346

Query: 715  GFIETMPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAY-AEA 539
             FI  MP +ANAVIWRTLLGAC+L G + +G++ R RLLELEP  V D ++MSN Y A+ 
Sbjct: 347  EFILEMPGKANAVIWRTLLGACNLHGEIELGEKVRCRLLELEPGHVGDNVAMSNFYAAKG 406

Query: 538  XXXXXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQIL 392
                           RAPGCS IEV S + EFV+ D     T EI + L
Sbjct: 407  MWDKKVTVRDQITQRRAPGCSSIEVASEISEFVSADDDHPLTAEICEAL 455



 Score = 60.1 bits (144), Expect = 3e-06
 Identities = 31/133 (23%), Positives = 70/133 (52%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCRPNSVTFVGVLTACSH 848
            A   FD    + + +WT++I  +  + + QKA+ELF++M+     P+ VT    L+AC++
Sbjct: 125  AHYMFDEIPSKSIVSWTALISAYVANQKPQKAVELFRKMQMLNVEPDQVTVTVALSACAN 184

Query: 847  AGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQANAVIWR 668
             G  E G+     + R  +LK +L+    ++++  + G+++ A    +++  + +   W 
Sbjct: 185  LGALEMGEWIHAYVGRKPELKADLSLNNALINMYAKCGEIKTARKLFDSLG-EKDVTTWT 243

Query: 667  TLLGACSLKGNVS 629
            +++   +L G  +
Sbjct: 244  SMIVGHALHGQAN 256


>emb|CDX68186.1| BnaA07g22260D [Brassica napus]
          Length = 537

 Score =  205 bits (521), Expect = 5e-50
 Identities = 111/230 (48%), Positives = 156/230 (67%), Gaps = 7/230 (3%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK--GSGCRPNSVTFVGVLTAC 854
            AR+ FD   ++DVTT+TS+I+G+AL+G+AQ++LELFK+MK   S   PN VTF+GVL AC
Sbjct: 229  ARRVFDETVRKDVTTYTSMIVGYALNGQAQESLELFKKMKRQDSSVSPNDVTFIGVLMAC 288

Query: 853  SHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQANAVI 674
            SH GL EEGK+ + +M   Y LK   AH+GCMVDLLCRSG+L++A+ FI  MPV+ NAVI
Sbjct: 289  SHGGLVEEGKRHFRSMVEDYNLKPREAHFGCMVDLLCRSGRLKDAHEFISQMPVKPNAVI 348

Query: 673  WRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAY-AEAXXXXXXXXXXXXXX 497
            WRTLLGACSL+GNV +G+  ++R+ EL+ + V D +++SN Y A+               
Sbjct: 349  WRTLLGACSLQGNVELGEEVQRRIFELDRDHVGDYVALSNIYAAKGMWDEKVRMRDRVRK 408

Query: 496  XRAPGCSLIEVGSAVHEFVAG---DRRQVKTKEIQQI-LEGMVHCSKDSW 359
             R PG S IE+G+ + EFV+G   D  ++   EI ++ L  ++  S+  W
Sbjct: 409  RREPGKSWIEMGNIIAEFVSGGGDDDGKLMVGEISELELFSVLEASRAYW 458


>ref|NP_177580.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75169846|sp|Q9CA73.1|PP119_ARATH RecName:
            Full=Putative pentatricopeptide repeat-containing protein
            At1g74400 gi|12324820|gb|AAG52382.1|AC011765_34
            hypothetical protein; 20273-21661 [Arabidopsis thaliana]
            gi|332197466|gb|AEE35587.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 462

 Score =  203 bits (517), Expect = 1e-49
 Identities = 108/228 (47%), Positives = 153/228 (67%), Gaps = 10/228 (4%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK------GSGCRPNSVTFVGV 866
            AR+ FD + ++DVTT+TS+I G+AL+G+AQ++LELFK+MK       +   PN VTF+GV
Sbjct: 223  ARKLFDESMRKDVTTYTSMIFGYALNGQAQESLELFKKMKTIDQSQDTVITPNDVTFIGV 282

Query: 865  LTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQA 686
            L ACSH+GL EEGK+ +++M   Y LK   AH+GCMVDL CRSG L++A+ FI  MP++ 
Sbjct: 283  LMACSHSGLVEEGKRHFKSMIMDYNLKPREAHFGCMVDLFCRSGHLKDAHEFINQMPIKP 342

Query: 685  NAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA-EAXXXXXXXXXX 509
            N VIWRTLLGACSL GNV +G+  ++R+ EL+ + V D +++SN YA +           
Sbjct: 343  NTVIWRTLLGACSLHGNVELGEEVQRRIFELDRDHVGDYVALSNIYASKGMWDEKSKMRD 402

Query: 508  XXXXXRAPGCSLIEVGSAVHEFVAG---DRRQVKTKEIQQILEGMVHC 374
                 R PG S IE+GS ++EFV+G   +  Q+   EI ++L  +V C
Sbjct: 403  RVRKRRMPGKSWIELGSIINEFVSGPDNNDEQLMMGEISEVLRCLVSC 450


>ref|XP_012438783.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 [Gossypium raimondii]
            gi|763783883|gb|KJB50954.1| hypothetical protein
            B456_008G194600 [Gossypium raimondii]
          Length = 478

 Score =  202 bits (515), Expect = 2e-49
 Identities = 113/250 (45%), Positives = 146/250 (58%), Gaps = 18/250 (7%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK----------------GSGC 896
            AR  FD   ++DVTTWTS+I+GHALHG+A +AL LF EM+                 S  
Sbjct: 227  ARNLFDSLGEKDVTTWTSMIVGHALHGQANEALGLFGEMEEIKWKNSKNKEEGNRGSSTI 286

Query: 895  RPNSVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAY 716
             PN VTF+GVL ACSH G+ EEGK+ +  M   Y LK    H+GCMVDL CR+G L+EAY
Sbjct: 287  LPNDVTFIGVLMACSHGGMIEEGKKYYRRMVNYYGLKPREVHFGCMVDLFCRAGLLKEAY 346

Query: 715  GFIETMPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYAE-- 542
             FI  MP QANAV WRTLLGAC++ G + +G++ + +L ELEP  V D ++MSN YA   
Sbjct: 347  NFIIEMPGQANAVTWRTLLGACNINGEIELGEKVKLQLQELEPGYVGDSVAMSNIYAAKG 406

Query: 541  AXXXXXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSKDS 362
                            RAPGCS IEV S + EF++GD       EI + L+ +      +
Sbjct: 407  MWDKKVEVRDQIKQLRRAPGCSSIEVASEISEFISGDDDHPLKTEIYEALKYL------T 460

Query: 361  WEIKVVDYAL 332
              +K  DY+L
Sbjct: 461  ISMKAYDYSL 470


>ref|XP_010537377.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 isoform X2 [Tarenaya hassleriana]
          Length = 437

 Score =  202 bits (515), Expect = 2e-49
 Identities = 109/232 (46%), Positives = 151/232 (65%), Gaps = 12/232 (5%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCR-----------PNSV 881
            AR+ F+   ++DVTTWTS+I+GHAL+G+A++ALELF +MK +  R           PN V
Sbjct: 198  ARKFFNGTKRKDVTTWTSMIIGHALNGQAEEALELFSKMKAADQRMPKICKNSTILPNDV 257

Query: 880  TFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIET 701
            TF+GVL ACSHAGL EEGKQ + +M   Y LK   AH+GCMVD  CR+G L++AY FI  
Sbjct: 258  TFIGVLMACSHAGLVEEGKQHFRSMVEEYNLKPRDAHFGCMVDTFCRAGLLKDAYEFIMN 317

Query: 700  MPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA-EAXXXXX 524
            +P + NAVIWRTLLGACS+ GN+ +G+  +++L+EL+ + V D I++SN YA +      
Sbjct: 318  IPTKPNAVIWRTLLGACSVYGNIELGEEVQRKLVELDHDHVGDCIALSNIYASKGMWEKK 377

Query: 523  XXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSK 368
                      R PG S IE G+ ++EFV+GD    K  EI +IL+ +   +K
Sbjct: 378  TEARDRVTKRRVPGKSWIEFGTIMNEFVSGDDDHPKMGEICEILKCLALSTK 429


>ref|XP_010537376.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 isoform X1 [Tarenaya hassleriana]
          Length = 459

 Score =  202 bits (515), Expect = 2e-49
 Identities = 109/232 (46%), Positives = 151/232 (65%), Gaps = 12/232 (5%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCR-----------PNSV 881
            AR+ F+   ++DVTTWTS+I+GHAL+G+A++ALELF +MK +  R           PN V
Sbjct: 220  ARKFFNGTKRKDVTTWTSMIIGHALNGQAEEALELFSKMKAADQRMPKICKNSTILPNDV 279

Query: 880  TFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIET 701
            TF+GVL ACSHAGL EEGKQ + +M   Y LK   AH+GCMVD  CR+G L++AY FI  
Sbjct: 280  TFIGVLMACSHAGLVEEGKQHFRSMVEEYNLKPRDAHFGCMVDTFCRAGLLKDAYEFIMN 339

Query: 700  MPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA-EAXXXXX 524
            +P + NAVIWRTLLGACS+ GN+ +G+  +++L+EL+ + V D I++SN YA +      
Sbjct: 340  IPTKPNAVIWRTLLGACSVYGNIELGEEVQRKLVELDHDHVGDCIALSNIYASKGMWEKK 399

Query: 523  XXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSK 368
                      R PG S IE G+ ++EFV+GD    K  EI +IL+ +   +K
Sbjct: 400  TEARDRVTKRRVPGKSWIEFGTIMNEFVSGDDDHPKMGEICEILKCLALSTK 451


>ref|XP_006390431.1| hypothetical protein EUTSA_v10018864mg [Eutrema salsugineum]
            gi|557086865|gb|ESQ27717.1| hypothetical protein
            EUTSA_v10018864mg [Eutrema salsugineum]
          Length = 326

 Score =  202 bits (515), Expect = 2e-49
 Identities = 110/226 (48%), Positives = 149/226 (65%), Gaps = 8/226 (3%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMK------GSGCRPNSVTFVGV 866
            AR+ FD   ++DVTT+TS+I+G+AL+G+AQ++LELFK+MK       S   PN VTF+GV
Sbjct: 86   ARKLFDGTLRKDVTTYTSMIVGYALNGQAQESLELFKKMKTIGQSQDSSVTPNDVTFIGV 145

Query: 865  LTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQA 686
            L ACSH GL EEGK+ + +M   Y LK   AH+GCMVDL CRSG+L++A+ FI  MPV+ 
Sbjct: 146  LMACSHGGLVEEGKRHFRSMVEDYNLKPRDAHFGCMVDLFCRSGRLKDAHEFINQMPVKP 205

Query: 685  NAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA-EAXXXXXXXXXX 509
            NAVIWRTLL ACSL GNV + +  + R+ EL+ + V D +++SN YA +           
Sbjct: 206  NAVIWRTLLSACSLYGNVELAEEVQGRIFELDDDHVGDYVALSNIYASKGMWDEKLKMRD 265

Query: 508  XXXXXRAPGCSLIEVGSAVHEFVAG-DRRQVKTKEIQQILEGMVHC 374
                 R PG S IEVGS + EFV+G D  ++   EI ++L  +V C
Sbjct: 266  RVRKRRLPGKSWIEVGSIIAEFVSGDDDEKLMMGEISEVLRSLVAC 311


>ref|XP_009338951.1| PREDICTED: LOW QUALITY PROTEIN: putative pentatricopeptide
            repeat-containing protein At1g74400 [Pyrus x
            bretschneideri]
          Length = 528

 Score =  202 bits (513), Expect = 4e-49
 Identities = 109/237 (45%), Positives = 145/237 (61%), Gaps = 17/237 (7%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCR--------------- 893
            AR+ FD   ++DV TWTS+I+GHALHG+A++AL LF +MK +                  
Sbjct: 278  ARRLFDGIREKDVMTWTSMIVGHALHGQAEEALTLFGQMKEASKNTRKNKRSGDFENGLV 337

Query: 892  -PNSVTFVGVLTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAY 716
             PN VTF+GVL ACSHAG+ EEGK  + +M++ Y LK   AH+GCMVDL CR+G L+EAY
Sbjct: 338  VPNDVTFIGVLMACSHAGMVEEGKWHFRSMSQVYGLKPREAHFGCMVDLFCRAGLLQEAY 397

Query: 715  GFIETMPVQANAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAY-AEA 539
             FI  M   +NAV+WRTLLGACSL GN+ +G + R +LLELEP    D +++SN Y A+ 
Sbjct: 398  DFILKMTGPSNAVMWRTLLGACSLHGNIKLGSQVRVKLLELEPTYAGDDVALSNIYAAKG 457

Query: 538  XXXXXXXXXXXXXXXRAPGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILEGMVHCSK 368
                           R PGCS IEVG ++ EFV+ D       E+ +IL  ++   K
Sbjct: 458  MWDRKMVVRDQMKQRRPPGCSSIEVGRSISEFVSADDDHPLRTEMYEILRQLIASMK 514


>ref|XP_010471530.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At1g74400 [Camelina sativa]
          Length = 465

 Score =  201 bits (512), Expect = 5e-49
 Identities = 108/229 (47%), Positives = 149/229 (65%), Gaps = 11/229 (4%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSG------CRPNSVTFVGV 866
            AR+ FD   ++DVTT+TS+I G+AL+G+AQ++LELFK+MK           PN VTF+GV
Sbjct: 226  ARKLFDETMRKDVTTYTSMIFGYALNGQAQESLELFKKMKTIDQSQDIVITPNDVTFIGV 285

Query: 865  LTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQA 686
            L ACSH GL EEGK+ + +M   Y LK   AH+GC+VDL CRSG L++A+ FI  MP++ 
Sbjct: 286  LMACSHGGLVEEGKRHFRSMIEDYNLKPREAHFGCIVDLFCRSGHLKDAHEFINQMPIKP 345

Query: 685  NAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA-EAXXXXXXXXXX 509
            NAVIWRTLL AC L GNV +G+  ++R+ ELE + V D +++SN YA +           
Sbjct: 346  NAVIWRTLLSACCLHGNVELGEEVQRRIFELEHDHVGDYVALSNIYASKGMWDEKWKMRD 405

Query: 508  XXXXXRAPGCSLIEVGSAVHEFVAG----DRRQVKTKEIQQILEGMVHC 374
                 R PG S IE+GS + EFV+G    D++Q+   EI ++L  +V C
Sbjct: 406  RVRKRRVPGKSWIELGSIITEFVSGHDDNDKKQLMMGEISEVLRCLVAC 454


>ref|XP_006302229.1| hypothetical protein CARUB_v10020251mg [Capsella rubella]
            gi|482570939|gb|EOA35127.1| hypothetical protein
            CARUB_v10020251mg [Capsella rubella]
          Length = 465

 Score =  201 bits (510), Expect = 9e-49
 Identities = 109/228 (47%), Positives = 149/228 (65%), Gaps = 10/228 (4%)
 Frame = -1

Query: 1027 ARQSFDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKG-SGCR-----PNSVTFVGV 866
            AR+ FD   ++DVTT+TS+I G+AL+G+AQ++LELF +MK    C+     PN VTF+GV
Sbjct: 223  ARKLFDETKRKDVTTYTSMIFGYALNGQAQESLELFNKMKTIDQCQDIVITPNDVTFIGV 282

Query: 865  LTACSHAGLEEEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQA 686
            L ACSH GL EEGKQ +++M   Y LK   AH+GCMVDLLCR+G L++A+ FI  MP++ 
Sbjct: 283  LMACSHGGLVEEGKQYFKSMIVDYNLKPRAAHFGCMVDLLCRAGHLKDAHEFINQMPIKP 342

Query: 685  NAVIWRTLLGACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYA-EAXXXXXXXXXX 509
            N VIWRTLL ACSL GNV +G+  ++R+ EL+ + V D +++SN YA +           
Sbjct: 343  NTVIWRTLLSACSLHGNVELGEEVQRRIFELDDDHVGDYVALSNIYASKGMWDEKRKMRD 402

Query: 508  XXXXXRAPGCSLIEVGSAVHEFVAG---DRRQVKTKEIQQILEGMVHC 374
                 R PG S IE+GS + EFV+G   D  Q+   EI + L  +V C
Sbjct: 403  RVRKRRVPGKSWIELGSIITEFVSGHDDDDEQLVVGEISEALRCLVSC 450


>ref|XP_010265137.1| PREDICTED: pentatricopeptide repeat-containing protein At1g08070-like
            [Nelumbo nucifera]
          Length = 708

 Score =  200 bits (508), Expect = 2e-48
 Identities = 101/213 (47%), Positives = 136/213 (63%), Gaps = 4/213 (1%)
 Frame = -1

Query: 1015 FDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCRPNSVTFVGVLTACSHAGLE 836
            F     RDV ++TS+I G A+HG  ++AL+LF EM   G +P+ VTF+G+LTACSH GL 
Sbjct: 394  FKNMQHRDVYSYTSMIAGLAMHGEGERALDLFSEMSRVGMKPDEVTFIGILTACSHVGLV 453

Query: 835  EEGKQQWENMTRCYKLKRNLAHYGCMVDLLCRSGQLEEAYGFIETMPVQANAVIWRTLLG 656
            EEG+Q +E+M+R YKLK  + HYGCMVDLL R+G + EA  FI  MP++ +A +W  LLG
Sbjct: 454  EEGRQYFEDMSRVYKLKPQIEHYGCMVDLLGRAGFISEAEEFIRKMPIEPDAFVWGALLG 513

Query: 655  ACSLKGNVSIGDRARKRLLELEPELVSDRISMSNAYAEA----XXXXXXXXXXXXXXXRA 488
            AC + G V +G+R  K+L+E+EPE     + MSN YA A                   + 
Sbjct: 514  ACRIHGKVELGERIMKKLVEIEPEKDGTFVLMSNIYASANRWRDAVKVRKAMKERKMKKI 573

Query: 487  PGCSLIEVGSAVHEFVAGDRRQVKTKEIQQILE 389
            PGCSLIE+   VHEF  GD+   KTKEI ++L+
Sbjct: 574  PGCSLIELNGMVHEFRKGDKSHPKTKEIYKMLD 606



 Score = 61.6 bits (148), Expect = 9e-07
 Identities = 33/110 (30%), Positives = 61/110 (55%), Gaps = 4/110 (3%)
 Frame = -1

Query: 1015 FDVADQRDVTTWTSIIMGHALHGRAQKALELFKEMKGSGCRPNSVTFVGVLTACSHAGLE 836
            F+   +++V +W S+I+G    G  ++AL +F+ M+  G  P+ VT VGVL +C++ G+ 
Sbjct: 293  FNSMPKKNVVSWNSMILGLTQQGEFKEALLVFRSMQRDGAEPDDVTLVGVLNSCANLGVL 352

Query: 835  EEGKQQWENMTRCYKLKRNLAHYG----CMVDLLCRSGQLEEAYGFIETM 698
            E GK  W      Y  ++ +   G     +VD+  + G +++A+G  + M
Sbjct: 353  ELGK--W---VHAYVDRKGIRADGFIGNALVDMYAKCGSIDQAFGVFKNM 397

