BLASTX nr result

ID: Akebia22_contig00009602 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00009602
         (1539 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007045557.1| Pentatricopeptide repeat 336, putative [Theo...   406   e-110
ref|XP_002272104.1| PREDICTED: pentatricopeptide repeat-containi...   404   e-110
ref|XP_004136798.1| PREDICTED: pentatricopeptide repeat-containi...   377   e-102
ref|XP_006847891.1| hypothetical protein AMTR_s00029p00104100 [A...   335   4e-89
ref|XP_002454838.1| hypothetical protein SORBIDRAFT_04g038280 [S...   240   1e-60
ref|NP_001144243.1| hypothetical protein [Zea mays] gi|195638968...   238   5e-60
ref|XP_006648168.1| PREDICTED: pentatricopeptide repeat-containi...   234   6e-59
ref|XP_006858124.1| hypothetical protein AMTR_s00062p00111890 [A...   231   9e-58
ref|NP_001048609.1| Os02g0829800 [Oryza sativa Japonica Group] g...   230   1e-57
ref|NP_564786.1| pentatricopeptide repeat-containing protein [Ar...   228   6e-57
gb|EAY72213.1| hypothetical protein OsI_00065 [Oryza sativa Indi...   228   6e-57
ref|XP_007026036.1| Pentatricopeptide repeat 336 [Theobroma caca...   225   4e-56
ref|XP_002263756.2| PREDICTED: pentatricopeptide repeat-containi...   225   4e-56
ref|XP_003570721.1| PREDICTED: pentatricopeptide repeat-containi...   225   4e-56
ref|XP_006468012.1| PREDICTED: pentatricopeptide repeat-containi...   222   3e-55
ref|XP_006391954.1| hypothetical protein EUTSA_v10023498mg [Eutr...   222   3e-55
ref|XP_002886503.1| pentatricopeptide repeat-containing protein ...   221   9e-55
ref|XP_006302331.1| hypothetical protein CARUB_v10020389mg [Caps...   220   1e-54
gb|AAM62848.1| putative membrane-associated salt-inducible prote...   220   1e-54
ref|XP_006449054.1| hypothetical protein CICLE_v10015479mg [Citr...   220   2e-54

>ref|XP_007045557.1| Pentatricopeptide repeat 336, putative [Theobroma cacao]
            gi|508709492|gb|EOY01389.1| Pentatricopeptide repeat 336,
            putative [Theobroma cacao]
          Length = 395

 Score =  406 bits (1043), Expect = e-110
 Identities = 211/404 (52%), Positives = 286/404 (70%), Gaps = 8/404 (1%)
 Frame = -1

Query: 1488 MASFLRNPRINVARILSSNFSSSAVEKPLISSFRKVKSSIRSEADPEKLAEIFQKSSDFS 1309
            MAS  +NPR+ + + L S  + +    P   SF+  KS+I SE +PEKLAEIFQ+     
Sbjct: 1    MASIFKNPRLAIPKSLFS--TQTQKPNPPFPSFKAAKSAIISEKNPEKLAEIFQQCLHLP 58

Query: 1308 RFCRDRALFDLSVRKLSRSKRFDLIEQIL----LYQEKSPVLKSEGFWIRIMMLYSKARM 1141
             F R R ++ LS+RKL+R+ R DL++ +L    L+ + +  LKSEGFWIR++MLYS A M
Sbjct: 59   TFLRHRPIYHLSIRKLARANRLDLVDSLLQAQKLHSQNASALKSEGFWIRLIMLYSNAGM 118

Query: 1140 FDQAVRTFDQIEQLGCNR----TEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGV 973
              QA++T + + Q   NR    +EKS CA+LTV L +  +++++ESFK++P K+GV P V
Sbjct: 119  VPQALQTLEDLCQ---NRYSIVSEKSLCAILTVYLNNGMFEQIYESFKTIPEKLGVKPSV 175

Query: 972  VAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFK 793
            V++N++LKAF +E  +ESA   +EKM+    V P+I +YNI+LGGYL+ GDE+ FD   K
Sbjct: 176  VSHNLILKAFVKENKLESALEWVEKMD----VSPNIATYNILLGGYLKNGDENGFDGAMK 231

Query: 792  EILKKELNPNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVA 613
            E+ +K L  NL TYNHRI+RFC++KEC RA KLLDEMVSKG+KPNS S+NT+IDGFC++ 
Sbjct: 232  EVSRKGLEGNLTTYNHRISRFCKSKECARANKLLDEMVSKGVKPNSASYNTIIDGFCRIE 291

Query: 612  DFESAKKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESM 433
            D ESA+KV             S TY TL+R +V+EGEFD ALEM   SI +KWVPPFE+M
Sbjct: 292  DLESARKVLDKMLSDGYVLPCSFTYYTLLRSMVKEGEFDSALEMSMESIKRKWVPPFEAM 351

Query: 432  EGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILPL 301
            EGLV GLV+ S+ EEAK++VEKMKKRL+G A++SW K+E  LPL
Sbjct: 352  EGLVKGLVERSRSEEAKQVVEKMKKRLKGDALESWGKIEAALPL 395


>ref|XP_002272104.1| PREDICTED: pentatricopeptide repeat-containing protein At1g61870,
            mitochondrial [Vitis vinifera]
            gi|297738261|emb|CBI27462.3| unnamed protein product
            [Vitis vinifera]
          Length = 386

 Score =  404 bits (1037), Expect = e-110
 Identities = 218/398 (54%), Positives = 282/398 (70%), Gaps = 2/398 (0%)
 Frame = -1

Query: 1488 MASFLRNPRINVARILS-SNFSSSAVEKPLISSFRKVKSSIRSEADPEKLAEIFQ-KSSD 1315
            MAS  R PR    R+LS + FS+  +  P  ++F   KS++ SE DPEKLA IF  +SS+
Sbjct: 1    MASLCRIPR----RLLSLARFST--LSDPF-TTFLAAKSAVESEPDPEKLAHIFHHQSSN 53

Query: 1314 FSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQEKSPVLKSEGFWIRIMMLYSKARMFD 1135
            F+RF R R L+ LS R+LSRS R DL+E+++ +Q+  P  ++EGFWIR++MLYS + M D
Sbjct: 54   FARFRRHRPLYQLSCRRLSRSGRLDLVERLIDHQKTLPHPRTEGFWIRLIMLYSTSGMVD 113

Query: 1134 QAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIV 955
             A+RTF Q+ Q     TEKS CA+LTV L++   D++H  F ++P +IGVSPG  +Y++V
Sbjct: 114  HALRTFHQMVQDRVQLTEKSLCAILTVYLDNDLIDQLHTVFNTMPSEIGVSPGTKSYSLV 173

Query: 954  LKAFCEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEILKKE 775
            LKAFC++K MESA+ LL KME      PDI SYN++L  Y   GD  +FDEI KEI  K 
Sbjct: 174  LKAFCQQKDMESARKLLHKME-----NPDIGSYNVLLEAYSENGDGVEFDEILKEIKNKG 228

Query: 774  LNPNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAK 595
            L  +  TYNHRI RFC+NKE VRA+KLLDEMV+KG+KPNS S+N +I GFCKV DFESA+
Sbjct: 229  LEHDCTTYNHRILRFCKNKESVRAKKLLDEMVAKGVKPNSASYNMIIHGFCKVGDFESAQ 288

Query: 594  KVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNG 415
            KV             S +Y+TL +H+V+EGEFD AL MCK  I +KWVPPFE+M+GLV G
Sbjct: 289  KVLGRMLADGYVAPCSISYITLFQHMVKEGEFDSALNMCKEIIRRKWVPPFEAMDGLVKG 348

Query: 414  LVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILPL 301
            LV+ISKVE AK++VEKMKKRL+G+A DSW   E  LPL
Sbjct: 349  LVEISKVEAAKEVVEKMKKRLKGNAADSWKTHEAALPL 386


>ref|XP_004136798.1| PREDICTED: pentatricopeptide repeat-containing protein At1g61870,
            mitochondrial-like [Cucumis sativus]
            gi|449494815|ref|XP_004159654.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g61870,
            mitochondrial-like [Cucumis sativus]
          Length = 405

 Score =  377 bits (968), Expect = e-102
 Identities = 196/405 (48%), Positives = 281/405 (69%), Gaps = 8/405 (1%)
 Frame = -1

Query: 1491 LMASFLRNPR--INVARILSSNFSSSAVEKPL----ISSFRKVKSSIRSEADPEKLAEIF 1330
            + A+  R PR    ++R+ S ++S++   +P       S R  KS+I S++DP+KLA+ F
Sbjct: 1    MAAALPRTPRRLFLISRLHSFSYSTTPPLQPTSDSPFPSLRAAKSAILSQSDPDKLAQSF 60

Query: 1329 QKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQEKSPVLKSEGFWIRIMMLYSK 1150
             ++S    FCR R ++  S+RKL+R++RFDLI+ I+    KSP   SEGFWIR++MLYS 
Sbjct: 61   IQASTLPSFCRYRPIYHQSIRKLARAQRFDLIDVIIQSHHKSPSATSEGFWIRLIMLYSS 120

Query: 1149 ARMFDQAVRTFDQ-IEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGV 973
              M +QA+   DQ I    CN +EKS CA+L+V L++   ++VHE F+S+P KIGV+P  
Sbjct: 121  VGMVNQALYILDQAILHKSCNLSEKSLCAILSVFLDNSMPEKVHEMFRSIPEKIGVTPTA 180

Query: 972  VAYNIVLKAFCEEKMMESAQSLLEKMETENG-VKPDINSYNIILGGYLRIGDESKFDEIF 796
            V++N+VLKAF  +  + SA++ ++++  ++  V P+I+S+ I+LG Y   GD   FDEI 
Sbjct: 181  VSHNLVLKAFVRQNDLPSARNWIDELCKDDAKVIPNIDSFTILLGAYWSNGDMIGFDEIE 240

Query: 795  KEILKKELNPNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKV 616
            KEI K+ L  NL TYN+RI+R C+NKEC RA+K+LDEM+SKG+KPNS+S++++I G+C V
Sbjct: 241  KEISKRGLEFNLATYNYRISRLCKNKECARAKKILDEMISKGVKPNSSSYDSIIHGYCDV 300

Query: 615  ADFESAKKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFES 436
             D ESA K+             S  Y  LIR +V+EGEF++ALE C+ +I ++WVPPFE+
Sbjct: 301  GDIESAMKILKGILEDGHVSPTSRIYYRLIRSMVKEGEFEMALETCRETIKRRWVPPFEA 360

Query: 435  MEGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILPL 301
            ME LV GLV +SKVEEAK++VEKMKKRL+G AVDSW K+E  LPL
Sbjct: 361  MEALVRGLVAMSKVEEAKEVVEKMKKRLKGPAVDSWRKIEAALPL 405


>ref|XP_006847891.1| hypothetical protein AMTR_s00029p00104100 [Amborella trichopoda]
            gi|548851196|gb|ERN09472.1| hypothetical protein
            AMTR_s00029p00104100 [Amborella trichopoda]
          Length = 454

 Score =  335 bits (858), Expect = 4e-89
 Identities = 174/365 (47%), Positives = 250/365 (68%)
 Frame = -1

Query: 1395 SFRKVKSSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLY 1216
            + +  +S IRS   PE+  E+F+K+S   RF  DRA F   V+KL+  +RFDLIEQ L  
Sbjct: 92   TLKNARSRIRSAGSPEEAFEVFRKASKSPRFRHDRAAFSAFVQKLAGYERFDLIEQALES 151

Query: 1215 QEKSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQ 1036
             +K P    EGF IR+++LYS+A M D+A+ TF ++++L C R+EKSF A L+ LL + +
Sbjct: 152  HKKPPFSLMEGFIIRLILLYSEAGMVDKALDTFYEMDELECPRSEKSFSATLSGLLLNKR 211

Query: 1035 YDRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSY 856
            +D VH  F  +P K  +SP V  Y+I+++AFCEE +++SA  +L KME + G+KPD+ SY
Sbjct: 212  FDDVHRLFDEIPNKFDISPTVFTYDIIIRAFCEEHLLDSAFEMLGKME-KIGIKPDVVSY 270

Query: 855  NIILGGYLRIGDESKFDEIFKEILKKELNPNLGTYNHRITRFCRNKECVRARKLLDEMVS 676
            N ++ G+LR GD+++ DE+ KE+ +K   P+L TYN RI  FC++KE V+A+ LL+EM S
Sbjct: 271  NTLIDGFLRAGDQTRVDELLKEMTEKGCAPDLVTYNLRILGFCKDKESVKAQALLEEMRS 330

Query: 675  KGIKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFD 496
            +GI+PNS S+N VI GF K  + E A++V+            S TY  LI+  +E G ++
Sbjct: 331  RGIRPNSRSYNAVIFGFYKEGNLEEARRVY-ESIPKGDESPNSGTYFMLIQFEIEHGNYE 389

Query: 495  LALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVE 316
             ALE+CK SI +KW+PPF +M+ L++GLVKISKV+EAK IVE+MKK+  GSA DSW KVE
Sbjct: 390  TALELCKKSIKRKWIPPFFTMKSLIDGLVKISKVDEAKAIVEEMKKKFSGSAADSWMKVE 449

Query: 315  GILPL 301
              + L
Sbjct: 450  TTISL 454


>ref|XP_002454838.1| hypothetical protein SORBIDRAFT_04g038280 [Sorghum bicolor]
            gi|241934669|gb|EES07814.1| hypothetical protein
            SORBIDRAFT_04g038280 [Sorghum bicolor]
          Length = 419

 Score =  240 bits (613), Expect = 1e-60
 Identities = 147/415 (35%), Positives = 226/415 (54%), Gaps = 17/415 (4%)
 Frame = -1

Query: 1494 SLMASFLRNP-----RINVARILSSNFSSSAVEKPLI-SSFRKVKSSIRSEA-DPEKLAE 1336
            S  A+  R+P     R  + R+LS+    +    P   +   ++KSSIR  A  P+ LA 
Sbjct: 3    SAAAALCRSPSLLSRRHLLVRLLSTQTQLATPPTPTTPADLSRLKSSIRDAATSPDALAT 62

Query: 1335 IFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQIL---LYQEKSPVLKSEGFWIRIM 1165
            +F        F  DR LF LSV +L+ + R DL+  +L   L    SP   SEGF +R++
Sbjct: 63   LFLSGLPHPAFLADRPLFALSVHRLASAGRRDLVASVLSSSLTALPSPH-PSEGFLLRLI 121

Query: 1164 MLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGV 985
             LYS A M D ++  F  +       ++++  ALL+   ++  YDR   +F ++P ++G+
Sbjct: 122  SLYSAAGMPDHSLTVFRLVNP----PSDRALSALLSTYHDNRLYDRAVRAFNTLPAELGI 177

Query: 984  SPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFD 805
             PG+V++N++LKA      + +A+S  +KM    GV+PDI S N IL GYL  GD++ FD
Sbjct: 178  KPGLVSHNVLLKALVASGDIAAARSAFDKMPDTAGVQPDIVSCNEILKGYLSTGDDAAFD 237

Query: 804  EIFKEIL--KKELNPNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVID 631
            ++ KEI    + L PN+GTYN R+   C  +    A +LLD M + G+ PN  SFNTVI 
Sbjct: 238  QLVKEIAGPNRRLKPNVGTYNLRMAMLCSKERSFEAEELLDAMGANGVPPNRASFNTVIK 297

Query: 630  GFCKVADFESAKKVF-----XXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASI 466
            G C   +  +A  +F                  +TY+ L+  LV +  FD ALE+CK  +
Sbjct: 298  GLCNEGEVGAAMALFKRMPEVPRQKGKGVSPNFETYIMLLEALVNKNLFDPALEVCKECL 357

Query: 465  GKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILPL 301
              KW PPF++++GLV  L+K  K + A++++  M+K ++G A   WTKVE   P+
Sbjct: 358  HNKWAPPFQAVKGLVESLLKSRKAKHAREVLMAMRKAVKGDAKQEWTKVEAQFPM 412


>ref|NP_001144243.1| hypothetical protein [Zea mays] gi|195638968|gb|ACG38952.1|
            hypothetical protein [Zea mays]
            gi|413939592|gb|AFW74143.1| hypothetical protein
            ZEAMMB73_602318 [Zea mays]
          Length = 419

 Score =  238 bits (607), Expect = 5e-60
 Identities = 148/415 (35%), Positives = 224/415 (53%), Gaps = 17/415 (4%)
 Frame = -1

Query: 1494 SLMASFLRNP-----RINVARILSSNFSSSAVEKPLI-SSFRKVKSSIRSEAD-PEKLAE 1336
            S  A+  R+P     R  + R+LS+         P   +   ++KSSIR  A  P+ LA 
Sbjct: 3    SAAAALYRSPSLLSRRHLLIRLLSTQTQLVTPPTPTTPADLSRLKSSIRDAATTPDALAT 62

Query: 1335 IFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQIL---LYQEKSPVLKSEGFWIRIM 1165
            +F        F  DR LF LSV +L+ + R DL+  +L   L    SP   SEGF +R++
Sbjct: 63   LFLSGLPHPAFLADRPLFALSVHRLASAGRRDLVASVLSSSLTALPSPH-PSEGFLLRLI 121

Query: 1164 MLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGV 985
             LYS A M D ++  F  ++      ++++  ALL+   ++  YDR   +F ++P ++G+
Sbjct: 122  SLYSAAGMPDHSLAVFRLVKPA----SDRALSALLSAYHDNRLYDRTVRAFNTLPAELGI 177

Query: 984  SPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFD 805
             PG+V++N++LKA      + +A +L ++M    GV+PDI S N IL GYL  GD   FD
Sbjct: 178  KPGLVSHNVLLKALVASGDVAAAHTLFDEMPDTAGVQPDIVSCNEILKGYLNAGDADAFD 237

Query: 804  EIFKEIL--KKELNPNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVID 631
             + KEI   K+ L PN+GTYN R+   C       A +LLD M + G+ PN TSFNTVI 
Sbjct: 238  RLVKEIAGPKRRLKPNVGTYNLRMALLCSKMRSFEAEELLDVMGANGVPPNRTSFNTVIK 297

Query: 630  GFCKVADFESAKKVF-----XXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASI 466
            G C   +  +A  +F                  +TY+ L+  LV++  FD ALE+CK  +
Sbjct: 298  GLCNEGEVGAAMALFKRMPEVPRQHGKGVSPNFETYIMLLEALVKKNLFDPALEICKECL 357

Query: 465  GKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILPL 301
              KW PPF++++GLV GL+K  K + A+++   M+K ++G A   W KVE   P+
Sbjct: 358  RNKWAPPFQAVKGLVQGLLKSRKAKHAREVFMAMRKAVKGDAKQEWIKVEAQFPM 412


>ref|XP_006648168.1| PREDICTED: pentatricopeptide repeat-containing protein At1g61870,
            mitochondrial-like [Oryza brachyantha]
          Length = 422

 Score =  234 bits (598), Expect = 6e-59
 Identities = 144/397 (36%), Positives = 220/397 (55%), Gaps = 13/397 (3%)
 Frame = -1

Query: 1467 PRINVARILSSNFSSSAVEKPLISSFRKVKSSIRSEAD-PEKLAEIFQKSSDFSRFCRDR 1291
            P + + R L     S+  + P  +    +K+SIRS A  P+ LA++F        F  DR
Sbjct: 20   PALLLRRQLLLRLLSTQTQTP--ADLAHLKNSIRSAAHTPDTLADLFLSGLSHPAFLADR 77

Query: 1290 ALFDLSVRKLSRSKRFDLIEQILLYQEKSPVLK--SEGFWIRIMMLYSKARMFDQAVRTF 1117
             LF LSV +L+ + R DL+  IL     S      SEGF IR++ LYS A M D ++ TF
Sbjct: 78   PLFTLSVHRLASAGRRDLVASILSSSLTSLPAPHPSEGFLIRLISLYSAAGMPDHSLSTF 137

Query: 1116 DQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKAFCE 937
              I       ++++  ALL+   ++  YDR  ++F+++P ++G+ P VV++N++LK+   
Sbjct: 138  RIISP----PSDRALSALLSAYHDNRLYDRAIQAFRTLPAELGIKPSVVSHNVLLKSLVA 193

Query: 936  EKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEIL-----KKEL 772
               + SA++L ++M  + GV+PDI S N IL GYL   D + FD+  K+       K+ L
Sbjct: 194  NGDVASARALFDEMPVKAGVEPDIVSCNEILKGYLNTADYAAFDQFLKDNTTATAGKRRL 253

Query: 771  NPNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAKK 592
             PN+GTYN R+   C       A +LLD M +KG+ PN  SFNTVI G CK  +  +A  
Sbjct: 254  KPNVGTYNLRMAALCSKGRSFEAAELLDAMEAKGVLPNRGSFNTVIQGLCKEGEVGAAVA 313

Query: 591  VF-----XXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEG 427
            +                  S+TY+TL+  LV +G F  ALE+ K  +  KW PPF++++G
Sbjct: 314  ILKRMPEVPRPNGKGVSPNSETYITLLEALVNKGVFGPALEVFKECLVNKWAPPFQAVQG 373

Query: 426  LVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVE 316
            L+ GL+K  KV+ AK++   M+K ++G A + W KVE
Sbjct: 374  LIKGLLKSRKVKHAKEVAMAMRKVVKGDAKEEWKKVE 410


>ref|XP_006858124.1| hypothetical protein AMTR_s00062p00111890 [Amborella trichopoda]
            gi|548862227|gb|ERN19591.1| hypothetical protein
            AMTR_s00062p00111890 [Amborella trichopoda]
          Length = 398

 Score =  231 bits (588), Expect = 9e-58
 Identities = 139/398 (34%), Positives = 228/398 (57%), Gaps = 12/398 (3%)
 Frame = -1

Query: 1461 INVARILSSNFSSSAVE------KPLISSFRKVKSSI---RSEADPEKLAEIFQKSSDFS 1309
            + ++ I   N+S+S+         P ++S +K ++++   +SE DPE++ +I +++S   
Sbjct: 5    LRISAIFCRNYSASSPSILNTKGLPFLTSKQKSRAALALLKSEKDPERILQICREASLTP 64

Query: 1308 RFCRDRALFDLSVRKLSRSKRFDLIEQILLYQEKSPVLKSEGFWIRIMMLYSKARMFDQA 1129
                DR  + ++V KL+ ++ F  I + +   +K P L++E F ++ ++LY KA M DQA
Sbjct: 65   ESHLDRVAYTVAVEKLTATQSFAAIREFIEEHKKRPDLQNERFMVKAILLYGKAGMLDQA 124

Query: 1128 VRTFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLK 949
            ++TF Q+  L   RT KS  ALL+  + + +Y  V   F        + P  V YN ++K
Sbjct: 125  IQTFKQMGDLNLTRTVKSLNALLSSCIIAKKYKEVARLFDEYSKDYSIKPDTVTYNTMIK 184

Query: 948  AFCEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEILKKELN 769
            A CE    +SA +LL++M  + G KP+  SY  +L G+ R   E KFD++   +   E N
Sbjct: 185  ALCESDSSDSALALLKEM-GKKGCKPNAISYGNLLAGFYR---EEKFDKVGVVLDLMERN 240

Query: 768  ---PNLGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESA 598
               P + TYN RI   C+ K+   A  L+  MVSKG++PN+T+F  +I GFC+  + E A
Sbjct: 241  GCHPGVTTYNVRIQSLCKLKKSSEAMALIRGMVSKGVRPNTTTFYHLIYGFCREGNLEEA 300

Query: 597  KKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVN 418
            KKVF            S+ Y  L+ +L E G+++ A ++C+ S+ K WVP F+ M+ LVN
Sbjct: 301  KKVF-SEMKSRGCVPDSNCYFALLYYLCEGGDYEPAFKLCRESMEKDWVPSFKVMKSLVN 359

Query: 417  GLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILP 304
            GLVK+SK+E AK+I+ +MK++   ++ + W  VE  LP
Sbjct: 360  GLVKLSKIEAAKEIIGEMKEKFPSNS-EMWATVEQGLP 396


>ref|NP_001048609.1| Os02g0829800 [Oryza sativa Japonica Group]
            gi|48716331|dbj|BAD22943.1| membrane-associated
            salt-inducible protein-like [Oryza sativa Japonica Group]
            gi|113538140|dbj|BAF10523.1| Os02g0829800 [Oryza sativa
            Japonica Group] gi|125584252|gb|EAZ25183.1| hypothetical
            protein OsJ_08983 [Oryza sativa Japonica Group]
            gi|215769058|dbj|BAH01287.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 423

 Score =  230 bits (586), Expect = 1e-57
 Identities = 136/369 (36%), Positives = 210/369 (56%), Gaps = 13/369 (3%)
 Frame = -1

Query: 1383 VKSSIRSEAD-PEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQEK 1207
            +K+SIRS A  PE LA++F        F  DR +F LSV +L+ + R DL+  IL     
Sbjct: 47   LKNSIRSAAHTPEALADLFISGLSHPAFLADRPIFTLSVHRLASAGRRDLVASILSSSLT 106

Query: 1206 SPVLK--SEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQY 1033
            S      SEGF IR++ LYS A M D ++ TF    ++    ++++  ALL+   ++  Y
Sbjct: 107  SLPAPHPSEGFLIRLISLYSAAGMPDHSLSTF----RIVTPPSDRALSALLSAYHDNRLY 162

Query: 1032 DRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYN 853
            DR  ++F+++P ++G+ P VV++N++LK+F     + SA++L ++M ++  V+PDI S N
Sbjct: 163  DRAIQAFRTLPAELGIKPSVVSHNVLLKSFVASGDLASARALFDEMPSKADVEPDIVSCN 222

Query: 852  IILGGYLRIGDESKFDEIFKEIL-----KKELNPNLGTYNHRITRFCRNKECVRARKLLD 688
             IL GYL   D + FD+  K+       K+ L PN+ TYN R+   C       A +LLD
Sbjct: 223  EILKGYLNAADYAAFDQFLKDNTTAAGGKRRLKPNVSTYNLRMASLCSKGRSFEAAELLD 282

Query: 687  EMVSKGIKPNSTSFNTVIDGFCKVADFESAKKVF-----XXXXXXXXXXXXSDTYVTLIR 523
             M +KG+ PN  SFNTVI G CK  +  +A  +F                 S+TY+ L+ 
Sbjct: 283  AMEAKGVPPNRGSFNTVIQGLCKEGEVGAAVAIFKRMPEVPRPNGKGVLPNSETYIMLLE 342

Query: 522  HLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGS 343
             LV +G F  ALE+ K  +  KW PPF++++GL+ GL+K  K + AK++   M+K ++G 
Sbjct: 343  GLVNKGVFAPALEVFKECLQNKWAPPFQAVQGLIKGLLKSRKAKHAKEVAMAMRKVVKGD 402

Query: 342  AVDSWTKVE 316
            A + W KVE
Sbjct: 403  AKEEWKKVE 411


>ref|NP_564786.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806489|sp|Q8LE47.2|PPR87_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g61870, mitochondrial; AltName: Full=Protein
            PENTATRICOPEPTIDE REPEAT 336; Flags: Precursor
            gi|16226403|gb|AAL16159.1|AF428391_1 At1g61870/F8K4_8
            [Arabidopsis thaliana] gi|3367521|gb|AAC28506.1| Similar
            to gb|U08285 membrane-associated salt-inducible protein
            from Nicotiana tabacum. ESTs gb|T44131 and gb|T04378 come
            from this gene [Arabidopsis thaliana]
            gi|17065564|gb|AAL32936.1| Unknown protein [Arabidopsis
            thaliana] gi|32815835|gb|AAP88326.1| At1g61870
            [Arabidopsis thaliana] gi|332195777|gb|AEE33898.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 408

 Score =  228 bits (581), Expect = 6e-57
 Identities = 141/397 (35%), Positives = 225/397 (56%), Gaps = 8/397 (2%)
 Frame = -1

Query: 1470 NPRINVARILSSNFSSSAVEKPLISSFRKVKSSI---RSEADPEKLAEIFQKSSDFSRFC 1300
            N    +  + S++   S   K  ++S  K K+++   +SE DP+++ EI + +S  +  C
Sbjct: 18   NASPQIRSLSSASTILSPDSKTPLTSKEKSKAALSLLKSEKDPDRILEICRAAS-LTPDC 76

Query: 1299 R-DRALFDLSVRKLSRSKRFDLIEQILL-YQEKSPVLKSEGFWIRIMMLYSKARMFDQAV 1126
            R DR  F  +V  L+  K F  +  +L  + E  P LKSE F    ++LY++A M D ++
Sbjct: 77   RIDRIAFSAAVENLAEKKHFSAVSNLLDGFIENRPDLKSERFAAHAIVLYAQANMLDHSL 136

Query: 1125 RTFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKA 946
            R F  +E+   +RT KS  ALL   L +  Y      +  +P   G+ P +  YN ++K 
Sbjct: 137  RVFRDLEKFEISRTVKSLNALLFACLVAKDYKEAKRVYIEMPKMYGIEPDLETYNRMIKV 196

Query: 945  FCEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEI-LKKELN 769
            FCE     S+ S++ +ME + G+KP+ +S+ +++ G+     E K DE+ K + + K+  
Sbjct: 197  FCESGSASSSYSIVAEMERK-GIKPNSSSFGLMISGFYA---EDKSDEVGKVLAMMKDRG 252

Query: 768  PNLG--TYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAK 595
             N+G  TYN RI   C+ K+   A+ LLD M+S G+KPN+ +++ +I GFC   DFE AK
Sbjct: 253  VNIGVSTYNIRIQSLCKRKKSKEAKALLDGMLSAGMKPNTVTYSHLIHGFCNEDDFEEAK 312

Query: 594  KVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNG 415
            K+F            S+ Y TLI +L + G+F+ AL +CK S+ K WVP F  M+ LVNG
Sbjct: 313  KLF-KIMVNRGCKPDSECYFTLIYYLCKGGDFETALSLCKESMEKNWVPSFSIMKSLVNG 371

Query: 414  LVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILP 304
            L K SKVEEAK+++ ++K++   + V+ W +VE  LP
Sbjct: 372  LAKDSKVEEAKELIGQVKEKFTRN-VELWNEVEAALP 407


>gb|EAY72213.1| hypothetical protein OsI_00065 [Oryza sativa Indica Group]
          Length = 423

 Score =  228 bits (581), Expect = 6e-57
 Identities = 135/369 (36%), Positives = 209/369 (56%), Gaps = 13/369 (3%)
 Frame = -1

Query: 1383 VKSSIRSEAD-PEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQEK 1207
            +K+SIRS A  PE LA++F        F  DR +F LSV +L+ + R DL+  IL     
Sbjct: 47   LKNSIRSAAHTPEALADLFISGLSHPAFLADRPIFTLSVHRLASAGRRDLVASILSSSLT 106

Query: 1206 SPVLK--SEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQY 1033
            S      SEGF IR++ LYS A M D ++ TF    ++    ++++  ALL+   ++  Y
Sbjct: 107  SLPAPHPSEGFLIRLISLYSAAGMPDHSLSTF----RIVTPPSDRALSALLSAYHDNRLY 162

Query: 1032 DRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYN 853
            DR  ++F+++P ++G+ P VV++N++LK+F     + SA++L ++M ++  V+PDI S N
Sbjct: 163  DRAIQAFRTLPAELGIKPSVVSHNVLLKSFVASGDLASARALFDEMPSKADVEPDIVSCN 222

Query: 852  IILGGYLRIGDESKFDEIFKEIL-----KKELNPNLGTYNHRITRFCRNKECVRARKLLD 688
             IL GYL   D + FD+  K+       K+ L PN+ TYN R+   C       A +LLD
Sbjct: 223  EILKGYLNAADYAAFDQFLKDNTTAAGGKRRLKPNVSTYNLRMASLCSKGRSFEAAELLD 282

Query: 687  EMVSKGIKPNSTSFNTVIDGFCKVADFESAKKVF-----XXXXXXXXXXXXSDTYVTLIR 523
             M +KG+ PN  SFNTVI G CK  +  +A  +F                 S+TY+ L+ 
Sbjct: 283  AMEAKGVPPNRGSFNTVIQGLCKEGEVGAAVAIFKRMPEVPRPNGKGVLPNSETYIMLLE 342

Query: 522  HLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGS 343
             LV +G F  ALE+ K  +  KW PPF++++GL+ GL+K  K + AK++   M+K ++G 
Sbjct: 343  GLVNKGVFAPALEVFKECLQNKWAPPFQAVQGLIKGLLKSRKAKHAKEVAMAMRKVVKGD 402

Query: 342  AVDSWTKVE 316
            A + W K E
Sbjct: 403  AKEEWKKFE 411


>ref|XP_007026036.1| Pentatricopeptide repeat 336 [Theobroma cacao]
            gi|508781402|gb|EOY28658.1| Pentatricopeptide repeat 336
            [Theobroma cacao]
          Length = 398

 Score =  225 bits (574), Expect = 4e-56
 Identities = 132/365 (36%), Positives = 206/365 (56%), Gaps = 3/365 (0%)
 Frame = -1

Query: 1389 RKVKSSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQE 1210
            R   S ++SE +P+++ EI + +S       DR  F +++ KLS  K F  I+  L    
Sbjct: 39   RAALSLLKSEQNPDRILEICRAASLTPASHLDRITFSVAISKLSEGKHFQSIDTFLHELR 98

Query: 1209 KSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQYD 1030
              P L++E F    ++LY +A+M + A+  FD+    G  R+ KS  ALL   + S  Y+
Sbjct: 99   SRPDLQNERFASHSLILYGQAKMLNHALTAFDEFYNEGLCRSAKSLNALLVAGIVSKDYE 158

Query: 1029 RVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYNI 850
             V   F   P + G+ P +  YN  +KA CE     SA S+L  M+++ GV+P+  ++  
Sbjct: 159  EVKRIFVEFPKRYGIEPDLECYNSAIKAMCESGSSSSAYSILVDMKSK-GVQPNATTFGT 217

Query: 849  ILGGYLRIGDESKFDEIFKEI-LKKELNPNLG--TYNHRITRFCRNKECVRARKLLDEMV 679
            +L G+ +   E K++++ K + L KE    +G  TYN RI   C  K+   A+ LLD M+
Sbjct: 218  LLAGFYK---EEKYEDVGKVLNLMKEYGVPVGVSTYNTRIQSLCMLKKSTEAKALLDGML 274

Query: 678  SKGIKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEF 499
            S+G+KPN+ ++N +I GFCK  + E AK++F            S  Y TL+    + G+F
Sbjct: 275  SRGMKPNTVTYNNLIHGFCKEGNLEEAKRLF-KSMRNSGLEPDSQCYFTLVHFSCQGGDF 333

Query: 498  DLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKV 319
            + AL +CK S+ K WVP F SM+ LVNGL  +SKVEEAK++++K+K++   +A D W +V
Sbjct: 334  EAALSICKESMEKNWVPSFSSMKSLVNGLSSMSKVEEAKELIQKVKEKFSKNA-DLWDEV 392

Query: 318  EGILP 304
            E  LP
Sbjct: 393  EKSLP 397


>ref|XP_002263756.2| PREDICTED: pentatricopeptide repeat-containing protein At3g13150-like
            [Vitis vinifera]
          Length = 379

 Score =  225 bits (574), Expect = 4e-56
 Identities = 121/346 (34%), Positives = 213/346 (61%), Gaps = 2/346 (0%)
 Frame = -1

Query: 1389 RKVKSSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQE 1210
            + +K S  + +  +++ + F+KSSD  RF      ++ +V  L+++K+F  IE IL +Q+
Sbjct: 25   KTIKRSSSNNSSLKEMVDKFKKSSDSKRFRSRYGYYEKAVLTLAKAKKFSFIEDILEHQK 84

Query: 1209 KSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQYD 1030
            +   + +E F +R+M LY KA MF+ A + FD++ +L C RT  SF ALL+V + S ++D
Sbjct: 85   QYNEISTEVFAVRLMTLYGKAGMFEHAHKLFDELPKLNCERTVVSFNALLSVCVNSKKFD 144

Query: 1029 RVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYNI 850
            ++   F+ +P  +GV P VV+YNI++ AFCE   ++SA S+L++ME + G++PD+ ++N 
Sbjct: 145  KIDGFFQELPGNLGVVPDVVSYNIIVNAFCEMGSLDSALSVLDEME-KVGLEPDLITFNT 203

Query: 849  ILGGYLRIGDESKFDEIFKEILKKELNPNLGTYNHRITRFCRNKECVRARKLLDEMVSKG 670
            +L  + + G  +  ++I+  + K  + PN+ +YN ++           A +L+DEM + G
Sbjct: 204  LLNAFYQNGSYADGEKIWDLMKKNNVAPNVRSYNAKLRGVISENRMSEAVELIDEMKTSG 263

Query: 669  IKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLA 490
            IKP+  + N+++ GFC   + E AK+ +            + TY+TLI  LVE+G+FD+A
Sbjct: 264  IKPDVFTLNSLMKGFCNAGNLEEAKRWYSEIARNELPPVRA-TYMTLIPFLVEKGDFDMA 322

Query: 489  LEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVE--KMKK 358
             E+CK    ++W+     ++ ++ GLVK SK+EEA ++VE  K+KK
Sbjct: 323  TELCKEVCSRRWLIEPALLQQVLEGLVKESKIEEATELVELAKLKK 368


>ref|XP_003570721.1| PREDICTED: pentatricopeptide repeat-containing protein At1g80150,
            mitochondrial-like [Brachypodium distachyon]
          Length = 423

 Score =  225 bits (574), Expect = 4e-56
 Identities = 134/371 (36%), Positives = 209/371 (56%), Gaps = 11/371 (2%)
 Frame = -1

Query: 1386 KVKSSIRSEAD-PEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQIL---L 1219
            ++K+SIRS A  P+ LA +F ++     F  DR +F L+V +L+ + R DL+  IL   L
Sbjct: 48   RIKNSIRSAATGPDDLATLFLRALPNQAFLGDRPIFSLAVTRLASAGRRDLVFSILSSSL 107

Query: 1218 YQEKSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESH 1039
                +P   SEGF IR++ LY+ A M   ++ TF  ++      T++ F ALL    ++ 
Sbjct: 108  TALPAPH-PSEGFLIRLISLYAAAGMPQHSLSTFRLVKPA----TDRVFSALLAAYHDTA 162

Query: 1038 QYDRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINS 859
            Q+D    +F+ +P ++   PGVV++N++LK+      +  A+ + ++M  + GV+PDI S
Sbjct: 163  QHDLAVTAFRDLPAELSFQPGVVSHNVLLKSMVATGDVAGARQVFDEMADKAGVQPDIVS 222

Query: 858  YNIILGGYLRIGDESKFDEIFKEIL--KKELNPNLGTYNHRITRFCRNKECVRARKLLDE 685
             N +L GYL+  D + FD++FKEI   K+ L PN+ TYN R+   C       A +LLD 
Sbjct: 223  CNEVLRGYLKTADYAAFDQLFKEIAGGKRRLKPNVTTYNLRMAALCAKGRSFEAEELLDV 282

Query: 684  MVSKGIKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXXXXXXS-----DTYVTLIRH 520
            M + G+ PN  SFNTVI G CK  +  +A  +F                  +TY+ L+  
Sbjct: 283  MGANGVPPNRESFNTVIGGLCKEGEVGAAAALFKRMPEVPRPNGKGVSPNFETYIMLLEA 342

Query: 519  LVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGSA 340
            LVE+  F  ALE+CK  +  KW PPF++++GL+ GLVK  KV++AK++   M+K  +G A
Sbjct: 343  LVEKRVFSPALEVCKECLANKWAPPFQAVKGLIQGLVKSRKVKQAKELGMAMRKATKGDA 402

Query: 339  VDSWTKVEGIL 307
               W  VE  +
Sbjct: 403  KAEWENVESAI 413


>ref|XP_006468012.1| PREDICTED: pentatricopeptide repeat-containing protein At1g61870,
            mitochondrial-like [Citrus sinensis]
          Length = 402

 Score =  222 bits (566), Expect = 3e-55
 Identities = 132/386 (34%), Positives = 217/386 (56%), Gaps = 5/386 (1%)
 Frame = -1

Query: 1446 ILSSNFSSSAVEKPLISS--FRKVKSSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLS 1273
            + +S+  SS  + PL S    R   + ++SE++PEK+ EI + ++       DR  F ++
Sbjct: 22   LATSSILSSGDKTPLTSKDKTRAALTLLKSESNPEKILEICRAAALTPESHLDRLAFSIA 81

Query: 1272 VRKLSRSKRFDLIEQILLYQEKSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGC 1093
            + KLS +  F+ I Q L   +  P L++E F    ++LY +A M + AVRTF ++++   
Sbjct: 82   INKLSEANYFNGISQYLEELKTRPDLQNERFHAHSIILYGQANMTEHAVRTFKEMDEHKL 141

Query: 1092 NRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQ 913
              +  +F ALL  L  +  Y  V   F   P   G+ P +  YN V+KAFCE     SA 
Sbjct: 142  RHSVGAFNALLLALTIAKDYKEVKRVFIEFPKTYGIKPDLDTYNRVIKAFCESSDSSSAY 201

Query: 912  SLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEILKKE---LNPNLGTYNHR 742
            S+L +M+ ++ +KP+ +S+  ++ G+ +   E K++++ K +   E   +   +  YN R
Sbjct: 202  SILAEMDRKS-IKPNASSFGALVAGFYK---EEKYEDVNKVLQMMERYGMKSGVSMYNVR 257

Query: 741  ITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXX 562
            I   C+ ++C  A+ LLDEM+SKG+KPNS +++  I GFCK  +FE AKK F        
Sbjct: 258  IHSLCKLRKCAEAKALLDEMLSKGMKPNSVTYSHFIYGFCKDGNFEEAKK-FYRIMSNSG 316

Query: 561  XXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAK 382
                S  Y T++  + + G+++ AL  CK SI K WVP F +M+ LV GL  +SKV EAK
Sbjct: 317  LSPNSSVYFTMVYFMCKGGDYETALGFCKESIAKGWVPNFTTMKSLVTGLAGVSKVSEAK 376

Query: 381  KIVEKMKKRLRGSAVDSWTKVEGILP 304
            +++  +K++   + VD+W ++E  LP
Sbjct: 377  ELIGLVKEKFTKN-VDTWKEIEAGLP 401


>ref|XP_006391954.1| hypothetical protein EUTSA_v10023498mg [Eutrema salsugineum]
            gi|557088460|gb|ESQ29240.1| hypothetical protein
            EUTSA_v10023498mg [Eutrema salsugineum]
          Length = 408

 Score =  222 bits (566), Expect = 3e-55
 Identities = 129/393 (32%), Positives = 218/393 (55%), Gaps = 4/393 (1%)
 Frame = -1

Query: 1470 NPRINVARILSSNFSSSAVEKPLISSFRKVKSSI---RSEADPEKLAEIFQKSSDFSRFC 1300
            NP   +  + S++   S   K  ++S +K K+++   ++E DP+++ EI + +S      
Sbjct: 18   NPSPQIRSLSSASSILSPDSKTPLTSKQKSKAALSLLKTEKDPDRILEICRAASLTPDCH 77

Query: 1299 RDRALFDLSVRKLSRSKRFDLIEQILL-YQEKSPVLKSEGFWIRIMMLYSKARMFDQAVR 1123
             DR  F  +V  L+  K F  +  +L  + E  P L+SE F    ++LY++A M D ++R
Sbjct: 78   IDRIAFSAAVENLAEKKHFAAVTNLLDGFIETRPDLRSERFAAHAIVLYAQANMLDHSLR 137

Query: 1122 TFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKAF 943
             F+++E+L   RT KS  ALL   L +  Y      +  +P    + P +  YN ++K F
Sbjct: 138  IFNELEKLEIPRTVKSLNALLFACLVAKDYKEAKRVYMEMPKMYKIEPDLETYNRMIKVF 197

Query: 942  CEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEILKKELNPN 763
            CE     S+ S++ +ME +  +KP  +S+ +++ G+   G   +  ++   + ++ ++  
Sbjct: 198  CESGSASSSYSIIAEMERKR-IKPTSSSFGLMIAGFYHEGKNEEVGKVLAMMKERGVSVG 256

Query: 762  LGTYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAKKVFX 583
            + T+N RI   C+ K+   A+ LLD M+S G+KPNS ++  +I GFC   D + AKK+F 
Sbjct: 257  VSTHNIRIQSLCKRKKSAEAKALLDGMLSSGMKPNSVTYGHLIHGFCSEGDLDEAKKLF- 315

Query: 582  XXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKI 403
                       S+ Y TLI +L + G+F+  L +CK S+ K WVP F  M+ LVNGLVK 
Sbjct: 316  KVMVNRGCKPDSECYFTLIYYLCKGGDFETGLSLCKESMEKNWVPSFGIMKSLVNGLVKD 375

Query: 402  SKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILP 304
            SKVEEAKK++ ++K++   + V+ W +VE  LP
Sbjct: 376  SKVEEAKKLIAQVKEKFTRN-VELWNEVEAALP 407


>ref|XP_002886503.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297332344|gb|EFH62762.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 408

 Score =  221 bits (562), Expect = 9e-55
 Identities = 137/396 (34%), Positives = 220/396 (55%), Gaps = 7/396 (1%)
 Frame = -1

Query: 1470 NPRINVARILSSNFSSSAVEKPLISSFRKVKSSI---RSEADPEKLAEIFQKSSDFSRFC 1300
            N    +  + S++   S   K  ++S  K K+++   +SE DP+++ EI + +S      
Sbjct: 18   NASPQIRSLSSASTILSPDSKTPLTSKEKSKAALSLLKSEKDPDRILEICRAASLTPDCH 77

Query: 1299 RDRALFDLSVRKLSRSKRFDLIEQILL-YQEKSPVLKSEGFWIRIMMLYSKARMFDQAVR 1123
             DR  F  +V  L+  K F  +  +L  + E    LKSE F    ++LY++A M D ++R
Sbjct: 78   IDRIAFSAAVENLAEKKHFSAVSNLLDGFIENRQDLKSERFAAHAIVLYAQANMLDHSLR 137

Query: 1122 TFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKAF 943
             F  +E+    RT KS  ALL   L +  Y      +  +P   G+ P +  YN ++K F
Sbjct: 138  VFRDLEKFEIPRTVKSLNALLFACLVAKDYKEAKRVYIEMPKMYGIEPDLETYNRMIKVF 197

Query: 942  CEEKMMESAQSLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKE-ILKKELNP 766
            CE     S+ S++ +ME + G+KP+ +S+ +++ G+     E K DE+ K  ++ K+   
Sbjct: 198  CESGSASSSYSIVAEMERK-GIKPNSSSFGLMISGFY---SEDKNDEVGKVLVMMKDRGV 253

Query: 765  NLG--TYNHRITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAKK 592
            N+G  TYN RI   C+ K+   A+ LLD M+S G+KPN+ +++ +I GFC   DFE AKK
Sbjct: 254  NIGVSTYNIRIQSLCKRKKSKEAKALLDGMLSAGMKPNTVTYSHLIRGFCNEDDFEEAKK 313

Query: 591  VFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGL 412
            +F            S+ Y TLI +L + G+F+ AL +CK S+ K WVP F  M+ LVNGL
Sbjct: 314  LF-KVMVNRGCKPDSECYFTLIYYLCKGGDFETALVLCKESMEKNWVPSFSIMKSLVNGL 372

Query: 411  VKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGILP 304
             K SKV+EAK+++ ++K++   + V+ W +VE  LP
Sbjct: 373  AKDSKVDEAKELIGQVKEKFTRN-VELWNEVEAALP 407


>ref|XP_006302331.1| hypothetical protein CARUB_v10020389mg [Capsella rubella]
            gi|482571041|gb|EOA35229.1| hypothetical protein
            CARUB_v10020389mg [Capsella rubella]
          Length = 408

 Score =  220 bits (561), Expect = 1e-54
 Identities = 130/382 (34%), Positives = 208/382 (54%), Gaps = 3/382 (0%)
 Frame = -1

Query: 1440 SSNFSSSAVEKPLIS--SFRKVKSSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLSVR 1267
            +S   S   + PL S    R   S ++SE DP+++ EI + +S       DR  F  +V 
Sbjct: 29   ASTILSPDSKTPLTSREKSRAALSLLKSEKDPDRILEICRAASLTPDCHIDRIAFSAAVE 88

Query: 1266 KLSRSKRFDLIEQILL-YQEKSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCN 1090
             L+  K F  +  +L  + E  P LKSE F    ++LY++A M D ++R F  +E+    
Sbjct: 89   NLAEKKHFTAVSNLLDGFIENRPDLKSERFAAHAIVLYAQANMLDHSLRIFRDLEKYEIP 148

Query: 1089 RTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQS 910
            RT KS  ALL   L +  Y      +  +P   G+ P +  YN ++K FCE     SA S
Sbjct: 149  RTVKSLNALLFACLVAKDYKEAKRVYIEMPKMYGIEPDLETYNRMIKVFCESGSASSAYS 208

Query: 909  LLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEILKKELNPNLGTYNHRITRF 730
            ++ +ME + G+KP+ +S+ +++ G+          ++   + ++ +N  + TYN RI   
Sbjct: 209  IVAEMERK-GIKPNSSSFGLMISGFYAEDKNDDVGKVLAMMKERGVNTGVSTYNIRIQSL 267

Query: 729  CRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXXXXXX 550
            C+ K+   A+ LLD M+S G+KPN+ +++ +I GFC   D E AKK+F            
Sbjct: 268  CKRKKSKEAKALLDGMLSAGMKPNTVTYSHLIRGFCNEDDLEEAKKLF-KVMVNRGCKPD 326

Query: 549  SDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVE 370
            S+ Y TLI +L + G+F+ AL +CK S+ K WVP F  M+ LVNGL K SKV+EAK+++ 
Sbjct: 327  SECYFTLIYYLCKGGDFEAALSLCKESMEKNWVPSFSIMKSLVNGLAKDSKVDEAKELIA 386

Query: 369  KMKKRLRGSAVDSWTKVEGILP 304
            ++K++   +  + W +VE  LP
Sbjct: 387  QVKEKFTRN-TELWNEVEAALP 407


>gb|AAM62848.1| putative membrane-associated salt-inducible protein [Arabidopsis
            thaliana]
          Length = 407

 Score =  220 bits (561), Expect = 1e-54
 Identities = 130/361 (36%), Positives = 205/361 (56%), Gaps = 3/361 (0%)
 Frame = -1

Query: 1377 SSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLSVRKLSRSKRFDLIEQILLYQEKSPV 1198
            S ++SE DP+++ EI + +S       DR  F  +V  L+    F  +  +L    ++  
Sbjct: 52   SLLKSEKDPDRILEICRAASLTPDCHIDRIAFSAAVENLAEKNHFSAVSNLLDGFIENRH 111

Query: 1197 LKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGCNRTEKSFCALLTVLLESHQYDRVHE 1018
            LKSE F    ++LY++A M D ++R F  +E+   +RT KS  ALL   L +  Y     
Sbjct: 112  LKSERFAAHAIVLYAQANMLDHSLRVFRDLEKFEISRTVKSLNALLFACLVAKDYKEAKR 171

Query: 1017 SFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQSLLEKMETENGVKPDINSYNIILGG 838
             +  +P   G+ P +  YN ++K FCE     S+ S++ +ME + G+KP+ +S+ +++ G
Sbjct: 172  VYIEMPKMYGIEPDLETYNRMIKVFCESGSASSSYSIVAEMERK-GIKPNSSSFGLMISG 230

Query: 837  YLRIGDESKFDEIFKEI-LKKELNPNLG--TYNHRITRFCRNKECVRARKLLDEMVSKGI 667
            +     E K DE+ K + + K    N+G  TYN RI   C+ K+   A+ LLD M+S G+
Sbjct: 231  FYA---EDKSDEVGKVLAMMKARGVNIGVSTYNIRIQSLCKKKKSKEAKALLDGMLSAGM 287

Query: 666  KPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXXXXXXSDTYVTLIRHLVEEGEFDLAL 487
            KPN+ +++ +I GFC   DFE AKK+F            S+ Y TLI +L + G+F+ AL
Sbjct: 288  KPNTVTYSHLIHGFCNEDDFEEAKKLF-KVMVNRGCKPDSECYFTLIYYLCKGGDFETAL 346

Query: 486  EMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAKKIVEKMKKRLRGSAVDSWTKVEGIL 307
             +CK S+ K WVP F  M+ LVNGL K SKVEEAK+++ ++K++   + V+ W +VE  L
Sbjct: 347  SLCKESMEKNWVPSFSIMKSLVNGLAKDSKVEEAKELIGQVKEKFTRN-VELWNEVEAAL 405

Query: 306  P 304
            P
Sbjct: 406  P 406


>ref|XP_006449054.1| hypothetical protein CICLE_v10015479mg [Citrus clementina]
            gi|557551665|gb|ESR62294.1| hypothetical protein
            CICLE_v10015479mg [Citrus clementina]
          Length = 402

 Score =  220 bits (560), Expect = 2e-54
 Identities = 132/386 (34%), Positives = 216/386 (55%), Gaps = 5/386 (1%)
 Frame = -1

Query: 1446 ILSSNFSSSAVEKPLISS--FRKVKSSIRSEADPEKLAEIFQKSSDFSRFCRDRALFDLS 1273
            + +S+  SS  + PL S    R   + ++SE++PEK+ EI + ++       DR  F ++
Sbjct: 22   LATSSILSSGDKTPLTSKDKTRAALTLLKSESNPEKILEICRAAALTPESHLDRLAFSIA 81

Query: 1272 VRKLSRSKRFDLIEQILLYQEKSPVLKSEGFWIRIMMLYSKARMFDQAVRTFDQIEQLGC 1093
            + KLS +  F+ I Q L   +  P L++E F    ++LY +A M + AVRTF ++++   
Sbjct: 82   INKLSEANYFNGISQYLEELKTRPDLQNERFHAHSIILYGQANMTEHAVRTFKEMDEHKL 141

Query: 1092 NRTEKSFCALLTVLLESHQYDRVHESFKSVPPKIGVSPGVVAYNIVLKAFCEEKMMESAQ 913
              +  +F ALL  L  +  Y  V   F   P   G+ P +  YN V+KAFCE     SA 
Sbjct: 142  RHSVGAFNALLLALTIAKDYKEVKRVFIEFPKTYGIKPDLDTYNRVIKAFCESGDSSSAY 201

Query: 912  SLLEKMETENGVKPDINSYNIILGGYLRIGDESKFDEIFKEILKKE---LNPNLGTYNHR 742
            S+L +M+ ++ +KP+ +S+  ++ G+ +   E K++++ K +   E   +   +  YN R
Sbjct: 202  SILAEMDRKS-IKPNASSFGALVAGFYK---EEKYEDVNKVLQMMERYGMKSGVSMYNVR 257

Query: 741  ITRFCRNKECVRARKLLDEMVSKGIKPNSTSFNTVIDGFCKVADFESAKKVFXXXXXXXX 562
            I   C+ ++C  A+ LLDEM+SKG+KPNS +++  I GFCK  +FE AKK F        
Sbjct: 258  IHSLCKLRKCAEAKALLDEMLSKGMKPNSVTYSHFIYGFCKDGNFEEAKK-FYRIMSNSG 316

Query: 561  XXXXSDTYVTLIRHLVEEGEFDLALEMCKASIGKKWVPPFESMEGLVNGLVKISKVEEAK 382
                S  Y T++  + + G+++ AL  CK SI K WVP F +M+ LV GL   SKV EAK
Sbjct: 317  LSPNSSVYFTMVYFMCKGGDYETALGFCKESIEKGWVPNFSTMKSLVTGLAGASKVSEAK 376

Query: 381  KIVEKMKKRLRGSAVDSWTKVEGILP 304
            +++  +K++   + VD+W ++E  LP
Sbjct: 377  ELIGLVKEKFTKN-VDTWNEIEAGLP 401


Top