BLASTX nr result

ID: Catharanthus23_contig00014786 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00014786
         (2009 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr...   494   e-137
ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi...   474   e-131
ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi...   445   e-122
ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi...   441   e-121
ref|XP_002309173.2| pentatricopeptide repeat-containing family p...   437   e-119
gb|EOY07026.1| Pentatricopeptide repeat superfamily protein, put...   436   e-119
gb|ESW12162.1| hypothetical protein PHAVU_008G089500g [Phaseolus...   429   e-117
gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss...   421   e-115
ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi...   420   e-114
gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]     419   e-114
ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ...   418   e-114
gb|ESW10362.1| hypothetical protein PHAVU_009G202600g [Phaseolus...   416   e-113
ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi...   399   e-108
ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar...   361   7e-97
ref|XP_002879744.1| pentatricopeptide repeat-containing protein ...   360   1e-96
gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637...   358   4e-96
ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr...   357   8e-96
ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps...   348   4e-93
ref|XP_002533822.1| pentatricopeptide repeat-containing protein,...   322   3e-85
emb|CAN63706.1| hypothetical protein VITISV_013107 [Vitis vinifera]   317   9e-84

>ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina]
            gi|557531581|gb|ESR42764.1| hypothetical protein
            CICLE_v10013613mg [Citrus clementina]
          Length = 506

 Score =  494 bits (1273), Expect = e-137
 Identities = 251/468 (53%), Positives = 341/468 (72%), Gaps = 11/468 (2%)
 Frame = +2

Query: 209  NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKT----------HLLNSLIDS 358
            N +LRK RKWPLSPYK +WH T   QQA QN+KQS+ +  T          H+L+SL+ S
Sbjct: 11   NLHLRKHRKWPLSPYKAKWHQTLDQQQAKQNVKQSLTTPPTKQQQQIPKQPHILSSLLHS 70

Query: 359  FAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDS 538
            F++Y CEP P+AYHFVIK LA N S++  I  VLDHI+K E FETPEFI IDLIK Y D+
Sbjct: 71   FSIYNCEPPPEAYHFVIKTLAEN-SQFCDISSVLDHIEKRENFETPEFIFIDLIKTYADA 129

Query: 539  NRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCE-KGVLEIIPQVLIKAQLMNIRIEESC 715
            +R Q ++ LFY+IP FRC PSV+SL ALL VLC  K  ++++PQ+L+K+QLMNIRIEES 
Sbjct: 130  HRFQDSVNLFYKIPKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIEESS 189

Query: 716  FGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEE 895
            F ILI  LCRI +V +A E+LN M NDGF +DG+  S+IL ++ EQ++ S  E++ F++E
Sbjct: 190  FRILISTLCRINRVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGFVQE 249

Query: 896  MCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEED 1075
            M KLGF     D+ +VI+ L KKE   DA+  L +MK D I PDIVCY ++L+ ++++ED
Sbjct: 250  MKKLGFCFGMVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIVQED 309

Query: 1076 YVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYST 1255
            YV A+++FDELLVLGLVP+V+TYN +INGLCKQN VE GIKM+ CMEELG +PD++TY+T
Sbjct: 310  YVKAEELFDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVITYNT 369

Query: 1256 ILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKF 1435
            +L +L    +LN  REL+ +M+ KG+ LN  TY ++I GL +  +  +A  LL+E +NK 
Sbjct: 370  LLQALCKVRELNRLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEALNKG 429

Query: 1436 SVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
              + S  +FD+ ICGLCQ G++ +A E+L++M++K+V PG   WEALL
Sbjct: 430  LCTQS-SMFDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALL 476



 Score = 73.9 bits (180), Expect = 2e-10
 Identities = 79/349 (22%), Positives = 148/349 (42%), Gaps = 7/349 (2%)
 Frame = +2

Query: 368  YECEPFPKAYHFVIKVLANNPSRWDQIPQVL--DHIQKVETFETPEFILIDLIKFYGDSN 541
            + C P   + + ++ VL  N      +PQ+L    +  +   E+   ILI  +      N
Sbjct: 145  FRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIEESSFRILISTLCRI---N 201

Query: 542  RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGVLEIIPQVLIKAQLMNIRIEESCFG 721
            R+  AIE+   + N        +   +L  +CE+  L     +    ++  +     CFG
Sbjct: 202  RVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGFVQEMKKLGF---CFG 258

Query: 722  IL-----IRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCF 886
            ++     IR+L +  KV  A  +LN M +DG   D   ++ +L  +  Q++   AE +  
Sbjct: 259  MVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIVQEDYVKAEEL-- 316

Query: 887  LEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVL 1066
             +E+  LG  P+   +   I  L K+ N    IK +  M+     PD++ YN +L  L  
Sbjct: 317  FDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVITYNTLLQALCK 376

Query: 1067 EEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVT 1246
              +     ++  E+   G+V N+ TY+  I+GL  +  + E   +L      G       
Sbjct: 377  VRELNRLRELVKEMKWKGIVLNLQTYSIMIDGLASKGDIIEACGLLEEALNKGLCTQSSM 436

Query: 1247 YSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEF 1393
            +   +  L   G +  A EL+ +M  K +   +  +E ++   ++  +F
Sbjct: 437  FDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLSSVSKLDF 485


>ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Solanum lycopersicum]
          Length = 496

 Score =  474 bits (1220), Expect = e-131
 Identities = 244/482 (50%), Positives = 332/482 (68%), Gaps = 7/482 (1%)
 Frame = +2

Query: 161  LKSSLHLSTPSWSP---VHNCYLRKRRKWPLSPYKTQWHLT-FAHQQAMQNLKQSV--KS 322
            L   LH S+ S+S    ++N +LRKRRKWPLS YKT+W      HQ +MQ L +S   +S
Sbjct: 10   LHKKLHSSSHSYSARSSMNNYFLRKRRKWPLSLYKTKWQEEKLTHQLSMQKLVESTPNRS 69

Query: 323  SKTHLLNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEF 502
             KTHLL+ L+DSF+ YEC+P P AY+F++K L  NPS WD+IP +LD+I+K E FETPE+
Sbjct: 70   PKTHLLSILLDSFSAYECDPTPNAYYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEY 129

Query: 503  ILIDLIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIK 679
            I   LIKFYGDSN    A E+F+ +P +RCNPSV SL  L+ VLC+    L I+ QVL+K
Sbjct: 130  IFTYLIKFYGDSNMTHLAYEMFFTMPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVK 189

Query: 680  AQLMNIRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKE 859
            +QL+NI +EES F ILIRALCRIGK N A +LL  M + GFNLD  I S IL  M + K+
Sbjct: 190  SQLLNIWVEESTFKILIRALCRIGKTNNAVDLLKLMVDSGFNLDANICSLILSTMPDVKD 249

Query: 860  CSGAEIMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCY 1039
            C G EI   LEEM KLG+SP R D  +VI+F        DA++ L +MK+  + PD+VCY
Sbjct: 250  CVGVEIWGVLEEMRKLGYSPKRVDLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCY 309

Query: 1040 NLILDRLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEE 1219
            NL+L+ L+ E +Y NAD++FDELLVLGL P++ TYN +INGLCKQ+K+ E +++L CME+
Sbjct: 310  NLVLNGLIFEGEYSNADELFDELLVLGLNPDIVTYNVYINGLCKQDKMVEALRVLGCMED 369

Query: 1220 LGCRPDLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQ 1399
            LGC+P++ TY TIL  L   G L+  +E++ +M+ KG+QL+S  Y ++I  ++ N E ++
Sbjct: 370  LGCKPEMNTYHTILDGLCRCGMLSSVKEVLGQMKSKGLQLSSHIYGVIINCMIRNGEVDE 429

Query: 1400 AFDLLQEMINKFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
            A++LL EM++   V  S+  FD +I  LC  G   +  E+L  MS KN+ PG+ SWEA +
Sbjct: 430  AYNLLHEMVDMGFVPQSI-TFDGLIGLLCNKGSFYEVMELLSIMSTKNLVPGIRSWEAFV 488

Query: 1580 QV 1585
            QV
Sbjct: 489  QV 490


>ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Vitis vinifera]
          Length = 505

 Score =  445 bits (1145), Expect = e-122
 Identities = 230/476 (48%), Positives = 321/476 (67%), Gaps = 9/476 (1%)
 Frame = +2

Query: 179  LSTPSWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKS--------SKTH 334
            L  PS+S  +  +LRKRRKWPLSPYK  WH TF H+QAMQ LK ++ +        S + 
Sbjct: 2    LRPPSFSKTN--FLRKRRKWPLSPYKATWHETFHHRQAMQTLKNTIANQSPSPQSPSNSQ 59

Query: 335  LLNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILID 514
             L+ LIDSF +Y  +P P AY FVI  L     ++  +P +L  ++KVE FETPEFI  +
Sbjct: 60   FLSILIDSFRIYNSDPTPNAYRFVISTLTRC-RQFHHLPPLLHRLEKVEKFETPEFIFTN 118

Query: 515  LIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLM 691
            LIK YG++N  + A++LF+RIPNFRC PSV+SL ALL VLC++   L ++PQ+L+K+Q M
Sbjct: 119  LIKVYGNANMFEDAVDLFFRIPNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKSQAM 178

Query: 692  NIRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGA 871
            NIR+EES F IL+ ALCRI K NYA  +LN+M NDG+ +D ++ S IL ++ EQK  SG 
Sbjct: 179  NIRLEESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGD 238

Query: 872  EIMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLIL 1051
            E++ F+EEM KLGF P R D  +VI+FL K+    DA+    +MK D I PD V Y +IL
Sbjct: 239  EVLRFMEEMRKLGFYPGRVDCNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMIL 298

Query: 1052 DRLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCR 1231
            + +  + DY  AD +FDE+LVLG+VP++  YN +IN LCKQN +EEG++ML  M ELGC+
Sbjct: 299  NGVTADGDYEKADDLFDEMLVLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCK 358

Query: 1232 PDLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDL 1411
            PD VTY+ +L  +     L   REL  +ME++G+Q N  TY +++ GL+   E +++  L
Sbjct: 359  PDYVTYNMLLEGMSKVRDLGGMRELAREMELEGVQWNWETYRIMLDGLVGKGEIDESCSL 418

Query: 1412 LQEMINKFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
            L+EM++K+  S     FD+IIC LCQ G++ +A +++ KM  K + PG  +WEALL
Sbjct: 419  LEEMLDKY-FSCWCSTFDEIICELCQRGLVCKALQLVNKMVRKTIAPGARAWEALL 473


>ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 491

 Score =  441 bits (1134), Expect = e-121
 Identities = 221/466 (47%), Positives = 322/466 (69%), Gaps = 1/466 (0%)
 Frame = +2

Query: 185  TPSWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKTHLLNSLIDSFA 364
            T S +P    ++RK RKWP+SPY T+WH  F   QA+Q LK S  +    LL++LI SF 
Sbjct: 4    TSSLTPRSKFFVRKHRKWPVSPYNTKWHKLFNQHQALQTLKHSPLNPPQTLLSTLIHSFN 63

Query: 365  MYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNR 544
             + C+P P+AY+FV+K L    S+   IP VLD ++ +E F  PE I  +LI+FYG +NR
Sbjct: 64   TFNCDPTPEAYNFVLKTLFKT-SQLSHIPSVLDRLESIEKFHPPESIFANLIRFYGSANR 122

Query: 545  IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721
            ++ AI++F RIP FRC+PS  SL +LL VLC     L+++PQVL+ ++ M IR+EES F 
Sbjct: 123  VEDAIDVFCRIPKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSRAMGIRLEESSFR 182

Query: 722  ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901
            ILI ALCRIG V YA E++  M ++G++LD +I S +L ++ EQK   G E++ F+EEM 
Sbjct: 183  ILISALCRIGSVGYAIEIMKCMISNGYDLDVKICSLVLSSLCEQKGVGGLEVVGFVEEMK 242

Query: 902  KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081
            K+GF P   D+ +VI+ L K+    DA++ L +MK++ + PDIVCY ++L  ++   DY 
Sbjct: 243  KVGFCPGMLDYSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVLYGVIANGDYK 302

Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261
            NADKVFDELLVLGLVP+V+TYN +INGLC QN VE GIKM+ CM+ELGCRP+L+TY+ +L
Sbjct: 303  NADKVFDELLVLGLVPDVYTYNVYINGLCNQNNVEAGIKMITCMDELGCRPNLITYNLLL 362

Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441
             +L  + +L+ AREL+S+M + G+ +N  T+ +++ GL    + ++A   ++EM++KF +
Sbjct: 363  KALCKNEELSRARELVSEMTLNGVGVNLQTHIIMLDGLFCKGDVDEACIFMEEMLDKF-M 421

Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
                  +D +I GLCQ G++ +A ++L KM +KNV PG  +WEALL
Sbjct: 422  CRRCSAYDDVIYGLCQRGLVCKAMDLLLKMVDKNVVPGARAWEALL 467


>ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550335936|gb|EEE92696.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 490

 Score =  437 bits (1123), Expect = e-119
 Identities = 221/447 (49%), Positives = 310/447 (69%), Gaps = 8/447 (1%)
 Frame = +2

Query: 215  YLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV-------KSSKTHLLNSLIDSFAMYE 373
            +LRK RKWP SPYK +WH  F  QQAMQ+LKQS          +K HLL+SLI SF++Y+
Sbjct: 13   FLRKHRKWPYSPYKARWHRIFNQQQAMQSLKQSALKPPQQESPNKPHLLSSLIHSFSIYD 72

Query: 374  CEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQG 553
             EP PKA+ F+ K L    S++  IP VLDH++KVE+FE PE     LI+ YG +N+   
Sbjct: 73   VEPAPKAFDFIFKTLVKT-SQFHHIPSVLDHLEKVESFEPPESTFAYLIEVYGRTNKTHE 131

Query: 554  AIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILI 730
            AIELFYRIP FRC PSV+SL  L+ VLC     L+++P++L+K+Q+MNIR+EES F +LI
Sbjct: 132  AIELFYRIPKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKSQVMNIRVEESTFQVLI 191

Query: 731  RALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLG 910
             ALCRI KV +A E+LN M NDGF ++  I+S +L  + EQK+ +  E++ FLE++ KLG
Sbjct: 192  TALCRIRKVGFAIEMLNCMVNDGFIVNAEIYSLLLSCLCEQKDATKFEVIGFLEQLRKLG 251

Query: 911  FSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNAD 1090
            F P   D+ +VI+FL K +   DA+  L  MK D I PDI CY ++L  ++ ++DY+ AD
Sbjct: 252  FFPGMVDYSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIEDKDYLKAD 311

Query: 1091 KVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSL 1270
            ++FDELLV GLVP+ +TYN +INGLCKQN V+ GIKM+  MEELGC+P+L+TY+ ++  L
Sbjct: 312  ELFDELLVFGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLITYNMLVKQL 371

Query: 1271 VADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDS 1450
               G+L+ A EL+ +M +KG+ LN  TY ++I GL +N +  +A  L +E ++K   + S
Sbjct: 372  CKVGELSKAGELVREMGLKGIGLNMQTYRIMIDGLASNGKIVEACGLFEEALDKGLCTQS 431

Query: 1451 VLLFDKIICGLCQNGMLNQAFEVLRKM 1531
             L+FD+IICGLC   +  +A ++L KM
Sbjct: 432  -LMFDEIICGLCHRDLSCKALKLLEKM 457


>gb|EOY07026.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao]
          Length = 542

 Score =  436 bits (1120), Expect = e-119
 Identities = 222/484 (45%), Positives = 330/484 (68%), Gaps = 11/484 (2%)
 Frame = +2

Query: 161  LKSSLHLSTPSWS-----PVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSS 325
            LKS L  S  +W         N +LRK R+WP   YKT+W+ TF  +QAM + KQ V  +
Sbjct: 33   LKSELAWSDLAWPLKEMVRCRNLFLRKHRRWPHFAYKTKWNQTFTQKQAMLSFKQLVAVA 92

Query: 326  KTHL-----LNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFE 490
            + +L     L++L+ SF++Y   P P+AYHF+IK L  N   ++ IP VL H++ VE F+
Sbjct: 93   QDNLPPPILLSTLVRSFSLYNVHPTPQAYHFLIKTLIQN-LHFNHIPSVLHHLEHVEKFQ 151

Query: 491  TPEFILIDLIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQ 667
            TPE+I  DLI  YG +NRIQ A+++FYRIP FRC PS +SL +LL +LC     L+++PQ
Sbjct: 152  TPEYIFADLITTYGIANRIQDAVDIFYRIPKFRCVPSAYSLNSLLALLCRNQYSLKLVPQ 211

Query: 668  VLIKAQLMNIRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMG 847
            VL+K+ LMNIR+EES   IL+ ALCR+ KV+YA ++L  M ++G  ++ ++ S+IL ++ 
Sbjct: 212  VLLKSLLMNIRVEESTLRILVSALCRMNKVSYAIDILQRMIDEGLGVNDKVCSFILSSIC 271

Query: 848  EQKECSGAEIMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPD 1027
             + +  G ++M    E+ KLGF P   D+  +I+FL KK    DA+  L +MK   I P 
Sbjct: 272  AKADLDGEDVMGLWRELGKLGFCPAMSDYNCLIRFLVKKGRGLDALDFLNQMKSVGIKPG 331

Query: 1028 IVCYNLILDRLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLR 1207
            IV Y + L+ ++ E DY+ AD++FDELL+LGLVP+V+TYN++I+ LCKQNKVEEGIKM+ 
Sbjct: 332  IVSYTMALNGVIAEGDYMLADELFDELLMLGLVPDVYTYNAYIDALCKQNKVEEGIKMVA 391

Query: 1208 CMEELGCRPDLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNS 1387
            CMEEL C+P+++TY+ +L ++   G+++ A EL+ +M+ KG+++N V+Y ++I GL++  
Sbjct: 392  CMEELRCKPNVLTYNMLLEAICKVGEISRAMELVKEMKYKGIEMNLVSYTVIIDGLVSKG 451

Query: 1388 EFEQAFDLLQEMINKFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESW 1567
            E  +A  L++E+++K     S L FD++ICGLCQ G++ +A E+LRKM  KNV PG   W
Sbjct: 452  EILEAHGLVEEVLHKCFCHQS-LAFDEVICGLCQRGLVCEALELLRKMVAKNVSPGARGW 510

Query: 1568 EALL 1579
            EALL
Sbjct: 511  EALL 514


>gb|ESW12162.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris]
          Length = 514

 Score =  429 bits (1104), Expect = e-117
 Identities = 221/467 (47%), Positives = 312/467 (66%), Gaps = 10/467 (2%)
 Frame = +2

Query: 209  NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSS---------KTHLLNSLIDSF 361
            N YLRK RKWP SPYKT WH  F  QQAM  LKQ+                LL++L+D+F
Sbjct: 11   NKYLRKFRKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTPNLPHPFLLSTLLDAF 70

Query: 362  AMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSN 541
              Y C+P PKAY+FVIK L +  S    IP VLDH++++ETFETPEFIL+ LI+FYG S+
Sbjct: 71   KAYSCDPTPKAYYFVIKTLTST-SHLQDIPPVLDHLEQLETFETPEFILVYLIRFYGLSD 129

Query: 542  RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKG-VLEIIPQVLIKAQLMNIRIEESCF 718
            R+Q A++LF RIP FRC P+V SL  +L +LC K   L+++P++L+K+Q MNIR+EES F
Sbjct: 130  RVQDAVDLFLRIPRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVEESTF 189

Query: 719  GILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEM 898
             +LI ALCRI +V YA ++LN+M   G+ LD  I S I+ ++ EQ++ +  E +    +M
Sbjct: 190  QVLIEALCRIKRVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVIWRDM 249

Query: 899  CKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDY 1078
             KLGF P   D+ ++I+FL K+   TDA+  L + K D I PD+VCY ++L  +V E +Y
Sbjct: 250  RKLGFCPGVMDYTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVAEGEY 309

Query: 1079 VNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTI 1258
            V  +++FDE+LV GLVP+V+TYN +INGLCKQN V+E +K++  MEEL CRP++VT +T+
Sbjct: 310  VKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVTCNTL 369

Query: 1259 LPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFS 1438
            L +L   G L  AR +M +M  KG+ LN  +Y +++ GL+   E  +A  LL+EM+ K  
Sbjct: 370  LGALCVAGDLRKARGVMKEMGWKGVGLNLHSYRIMLDGLVGKGEIGEACFLLEEMLEKCF 429

Query: 1439 VSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
               S   FD II  +CQ G++ +A E+ +K+  K+  PG  +WEALL
Sbjct: 430  FPRS-STFDHIIFQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALL 475


>gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 480

 Score =  421 bits (1082), Expect = e-115
 Identities = 213/463 (46%), Positives = 322/463 (69%), Gaps = 6/463 (1%)
 Frame = +2

Query: 209  NCYLRKRRKWPL-SPYKTQWHLTFAHQQAMQNLKQSVKSS---KTHLLNSLIDSFAMYEC 376
            N +LRK RKWPL S +KT+W   F   Q M + KQ V      +   + SL+ S ++Y  
Sbjct: 7    NFFLRKHRKWPLISSHKTKWRQAFTQNQPMVSFKQLVARHNPLQPDFVPSLLQSLSLYNL 66

Query: 377  EPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGA 556
               P+AYHF+IK L +N  ++  IP +L H+Q ++ F+TPE+I   L+KFYG +NRIQ A
Sbjct: 67   HQSPQAYHFLIKTLLHN-RQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFYGKANRIQDA 124

Query: 557  IELFYRIPNFRCNPSVHSLKALLLVLC--EKGVLEIIPQVLIKAQLMNIRIEESCFGILI 730
            +++FYRIP FRC PS +SL ALL +LC  ++G L+++PQVL+ +  MNIR+EES F +L+
Sbjct: 125  VDIFYRIPQFRCFPSAYSLNALLALLCRSQRG-LKLLPQVLLNSLHMNIRLEESTFRLLV 183

Query: 731  RALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLG 910
              LCR+ KV YA E+L  M +DG  ++ ++FS++L ++  + +  G +++ F   + KLG
Sbjct: 184  CTLCRMNKVAYAIEILQRMLDDGLGVNDKVFSFVLSSVCAEGDLDGEDVIGFWRGLRKLG 243

Query: 911  FSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNAD 1090
            FSP  GD+  V++FL KK    DA   L +MK D I P I+ Y ++L+ +  E DY+ AD
Sbjct: 244  FSPAMGDYDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVLNGVTAEGDYILAD 303

Query: 1091 KVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSL 1270
            ++FDELL+LGLVPNV+TY ++I+ LCKQNKVEEGIKM+ CMEELGC+P+++ Y+T+L ++
Sbjct: 304  ELFDELLMLGLVPNVYTYKAYIDALCKQNKVEEGIKMVACMEELGCKPNVLIYNTLLRTI 363

Query: 1271 VADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDS 1450
               G+++ AREL+ +M+ KG+++N V+Y ++I GL++N E  +A  L++E+++K     S
Sbjct: 364  SKAGEISRARELVKEMKYKGIEMNWVSYTIIIDGLVSNGEILEACALVEEVLHKCIFIKS 423

Query: 1451 VLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
             L FD++ICGLCQ G++ +A E+L KM E+++ PG   WEALL
Sbjct: 424  -LTFDEVICGLCQRGLVCKARELLGKMVERSISPGARVWEALL 465


>ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Glycine max]
          Length = 499

 Score =  420 bits (1079), Expect = e-114
 Identities = 220/470 (46%), Positives = 311/470 (66%), Gaps = 13/470 (2%)
 Frame = +2

Query: 209  NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTH----------LLNSLI 352
            N YLRK +KWP SPYKT WH  F  +QAM+NLKQ+     S  H          LL++L+
Sbjct: 11   NKYLRKFKKWPHSPYKTSWHHNFGEEQAMKNLKQATLEMDSSQHPQRPNLPCPFLLSTLL 70

Query: 353  DSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYG 532
            DSF  Y  +P PKAY FV+K L +  S+   IP VL H++ +E FETPE IL+ LI+FYG
Sbjct: 71   DSFKAYSIDPTPKAYFFVLKTLTST-SQLQDIPPVLYHLEHLEKFETPESILVYLIRFYG 129

Query: 533  DSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEK-GVLEIIPQVLIKAQLMNIRIEE 709
             S+R+Q A++LF+RIP FRC P+V SL  +L +LC K   LE++P++L+K+Q MNIR+EE
Sbjct: 130  LSDRVQDAVDLFFRIPRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQHMNIRVEE 189

Query: 710  SCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFL 889
            S F +LIRALCRI +V YA ++LN M  DG+ LD +I S ++ A+ EQK+ + AE +   
Sbjct: 190  STFRVLIRALCRIKRVGYAIKMLNFMVEDGYGLDEKICSLVISALCEQKDLTSAEALVVW 249

Query: 890  EEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLE 1069
             +M KLGF P   D+ ++I+FL K+    DA+  L + K D I  D+V Y ++L  +V E
Sbjct: 250  RDMRKLGFCPGVMDYTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVLSGIVAE 309

Query: 1070 EDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTY 1249
             +YV  D++FDE+LV+GL+P+ +TYN +INGLCKQN V E ++++  MEELGC+P++VTY
Sbjct: 310  GEYVMLDELFDEMLVIGLIPDAYTYNVYINGLCKQNNVAEALQIVASMEELGCKPNVVTY 369

Query: 1250 STILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMIN 1429
            +T+L +L   G    ARELM +M  KG+ LN  TY +V+ GL+   E  ++  LL+EM+ 
Sbjct: 370  NTLLGALSVAGDFVKARELMKEMGWKGVGLNLHTYRIVLDGLVGKGEIGESCLLLEEMLE 429

Query: 1430 KFSVSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
            K     S   FD II  +CQ  +  +A E+ +K+  K+  PG  +WEALL
Sbjct: 430  KCLFPRS-STFDNIIFQMCQKDLFTEAMELTKKVVAKSFLPGASTWEALL 478


>gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]
          Length = 494

 Score =  419 bits (1076), Expect = e-114
 Identities = 215/461 (46%), Positives = 319/461 (69%), Gaps = 4/461 (0%)
 Frame = +2

Query: 209  NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKTHLLNSLIDSFAMYECEPFP 388
            N +LRK R++P+SPYKT+WH TF   QA+Q LK+    +   LL+ L++SF  Y+C P P
Sbjct: 10   NKFLRKHREFPISPYKTKWHETFNQTQALQTLKRHQNENPNRLLSLLLNSFNSYDCNPTP 69

Query: 389  KAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIELF 568
            +AYHFV+K L    S++D I  VLD I+ VE FETPE+    +I FYG  +RI+ AI++F
Sbjct: 70   EAYHFVLKTLIKT-SQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFYGFLDRIEDAIDIF 128

Query: 569  YRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALCR 745
            +RIP FRC PS +SL +LL VLC +   L  +P+VLIK++ MNIR+EE+ F ILI ALC+
Sbjct: 129  WRIPKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRDMNIRLEEASFRILITALCK 188

Query: 746  IGKVNYAAELLNHMANDGFNLDGRIFSYILKAM-GEQKEC--SGAEIMCFLEEMCKLGFS 916
            IGKV YA E+L+ M +DG+++D RI S IL  + G+ KE   +G +++  L++M K+GF 
Sbjct: 189  IGKVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDVLELLQKMEKMGFC 248

Query: 917  PNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKV 1096
            P  GD+  VI+ L +++   +A+  L +MK D + PD+VCY ++L  +V E +Y  AD++
Sbjct: 249  PRMGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHGIVAEGEYSKADEM 308

Query: 1097 FDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVA 1276
            FDE+LVLGLVP+V+TYN++INGLCKQN V+  +  +  MEELGC+P+L+TY+ IL +L  
Sbjct: 309  FDEMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPNLITYNLILRALCK 368

Query: 1277 DGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDSVL 1456
            +G+   A+EL+++M +KG +    TY +++  LL   E  +A  L++EM++K  +     
Sbjct: 369  NGEFGRAKELVAEMSLKGFEDYLQTYIIMLDVLLGKGEIVEACGLMEEMLDKL-LCRRCS 427

Query: 1457 LFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
            ++D+II GLC+ G+  +A E+L KM  KNV PG  +W+ALL
Sbjct: 428  MYDEIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALL 468


>ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355498545|gb|AES79748.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 653

 Score =  418 bits (1074), Expect = e-114
 Identities = 215/467 (46%), Positives = 312/467 (66%), Gaps = 6/467 (1%)
 Frame = +2

Query: 197  SPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNL----KQSVKSSKTHLLNSLIDSFA 364
            S   N YLRK RKWP SPYKT WH  F  QQA+Q L     Q+  ++   LL++LI SF 
Sbjct: 7    SKTANKYLRKFRKWPHSPYKTSWHHNFGEQQAIQILINAKTQTQNNNDPFLLSTLIHSFK 66

Query: 365  MYECEPFPKAYHFVIKVLAN-NPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSN 541
             Y  +P PKAY F+IK + N N S   +IP +L+H++  E FETPEFI + LI+FYG ++
Sbjct: 67   AYHTDPSPKAYFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIRFYGFND 126

Query: 542  RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKG-VLEIIPQVLIKAQLMNIRIEESCF 718
            R+Q A++LF+RIP FRC P+V SL  LL +LC K   L ++P +L+K++ M IR+EES F
Sbjct: 127  RVQDAVDLFFRIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRDMKIRLEESSF 186

Query: 719  GILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEM 898
             +LI+ALCRI +V+YA +++N M  DG+ LD +I S I+ ++ EQ + +  E +     M
Sbjct: 187  WVLIKALCRIKRVDYAIKMMNCMVEDGYCLDDKICSLIISSLCEQNDLTSVEALVVWGNM 246

Query: 899  CKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDY 1078
             KLGF P   D  ++I+FL K+    DA++ L ++K D I PDIVCY ++L  +V E DY
Sbjct: 247  RKLGFCPGVMDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVLSGIVKEGDY 306

Query: 1079 VNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTI 1258
            V  D++FDE+LVLGLVP+V+TYN +INGLCKQN  +E +K++  ME+LGC+P++VTY+T+
Sbjct: 307  VKLDELFDEILVLGLVPDVYTYNVYINGLCKQNNFDEALKIVVSMEKLGCKPNVVTYNTL 366

Query: 1259 LPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFS 1438
            L +L   G L  A+ +M +M +KG++LN  TY +++ GL+   E  +A  LL+EM+ K  
Sbjct: 367  LGALCMSGDLGKAKRVMKEMRLKGVELNLHTYRIMLDGLVGKGEIGEACVLLEEMLEKCF 426

Query: 1439 VSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
               S   FD I+  +CQ G+++ A  ++ K+  K+  PG + WEALL
Sbjct: 427  YPRS-STFDSIVHQMCQKGLISDALVLMNKIVAKSFDPGAKVWEALL 472


>gb|ESW10362.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris]
          Length = 513

 Score =  416 bits (1069), Expect = e-113
 Identities = 213/468 (45%), Positives = 311/468 (66%), Gaps = 10/468 (2%)
 Frame = +2

Query: 209  NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSS---------KTHLLNSLIDSF 361
            N YLRK RKWP SPYKT WH  F  QQAM  LKQ+                LL++LIDSF
Sbjct: 11   NKYLRKFRKWPHSPYKTSWHHNFGEQQAMHKLKQATLEMGCPQTPNLPHPFLLSTLIDSF 70

Query: 362  AMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSN 541
              Y C+P PKAY+F+IK L    S++  IP VLDH++ +E FETPEF L+ LI+FYG S+
Sbjct: 71   KSYSCDPTPKAYYFLIKTLTCT-SQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFYGLSD 129

Query: 542  RIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKG-VLEIIPQVLIKAQLMNIRIEESCF 718
            ++Q A++LF RIP FRC P+V SL  +L +LC K   L+++P++L+K+Q MNIR+EES F
Sbjct: 130  KVQDAVDLFLRIPRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVEESTF 189

Query: 719  GILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEM 898
             +LI+ALCRI +V YA ++LN+M   G+ LD  + S I+ ++ EQ++ +  E +    +M
Sbjct: 190  QVLIKALCRIKRVGYAIKMLNYMIEGGYGLDETMCSLIISSLCEQEDMTSVEALVIWRDM 249

Query: 899  CKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDY 1078
             KLGF P   D+ ++I+FL K+    DA+  L + K D I PD+VCY ++L  ++ E +Y
Sbjct: 250  RKLGFCPGIMDYTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVLSGIIAEGEY 309

Query: 1079 VNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTI 1258
            V  +++FDE+LV GLVP+V+TYN +INGLCKQN V+E +K++  MEEL C+P++VT + +
Sbjct: 310  VKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECKPNVVTCNIL 369

Query: 1259 LPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFS 1438
            L +L   G L  AR +M +M  KG++L+  +Y +++ GL+   E  +A  LL+EM+ K S
Sbjct: 370  LGALCVAGDLRKARGVMKEMGWKGVRLDLHSYRIMLDGLVGKGEIGEACFLLEEMLEK-S 428

Query: 1439 VSDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALLQ 1582
                   FD II  +CQ G++ +A E+ +K+  K+  PG  +WEALL+
Sbjct: 429  FFPRSSTFDHIIFQMCQKGLIVEAIELTKKIVAKSFVPGARAWEALLK 476


>ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Cucumis sativus]
            gi|449483740|ref|XP_004156675.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Cucumis sativus]
          Length = 491

 Score =  399 bits (1024), Expect = e-108
 Identities = 210/474 (44%), Positives = 308/474 (64%), Gaps = 2/474 (0%)
 Frame = +2

Query: 209  NCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKSSKTHLL-NSLIDSFAMYECEPF 385
            N +LRK RKWPLS +KT+WH TF   +A++ LKQ+    + HLL ++L+ SF  Y C P 
Sbjct: 12   NNFLRKHRKWPLSSHKTKWHQTFDQDEALRILKQAANPDQPHLLLSALVTSFTAYSCHPT 71

Query: 386  PKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIEL 565
            P AY+FV+K LA   S++  IP VL  +Q +E F+TPE+I +DLIK YG  NRIQ A+ L
Sbjct: 72   PNAYYFVLKTLART-SQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLYGRMNRIQDAVTL 130

Query: 566  FYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALC 742
            F RIP FRC PS  SL +LL  L      L IIP +++ +  M IR+E S F ILI ALC
Sbjct: 131  FRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMGIRLEHSTFQILITALC 190

Query: 743  RIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLGFSPN 922
            ++ KV +A EL N+M  +G+ L+ +I S IL ++ +QK+ SG  ++ FLEEM + GF P 
Sbjct: 191  KVNKVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGFLEEMRQKGFCPA 250

Query: 923  RGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKVFD 1102
              D+ +VI+F   +   +DA+  L +MK D   PDIVCY ++L+ ++ + DY  AD++FD
Sbjct: 251  VVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIADGDYKMADELFD 310

Query: 1103 ELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVADG 1282
            ELL+ GLVP+++TYN +I+GLCKQ     G++M+  ME LGC+P+++TY+ IL SL   G
Sbjct: 311  ELLLFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVITYNVILKSLCKTG 370

Query: 1283 KLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDSVLLF 1462
            +L+ AR+L SKM++KG+  N  T+ ++I GL  N E  +A  LL+EM+        +  F
Sbjct: 371  ELDEARKLRSKMQLKGLAENLRTFRIMIDGLFHNGEVIEACVLLEEMLGS-RFPPQISTF 429

Query: 1463 DKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALLQVFEFIHSPNEISAI 1624
             +I+  LC+  M+ +A E+L  M  KN  PG ++WE LL     + S +E++++
Sbjct: 430  SEILSWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILL-----LSSESELTSV 478


>ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g38420, mitochondrial; Flags: Precursor
            gi|3395430|gb|AAC28762.1| hypothetical protein
            [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 453

 Score =  361 bits (926), Expect = 7e-97
 Identities = 188/447 (42%), Positives = 294/447 (65%), Gaps = 3/447 (0%)
 Frame = +2

Query: 191  SWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTHLLNSLIDSFA 364
            SW  + N ++RK RK P S +KT+W+     + AM+ L+ ++   S    ++ +L+ SF 
Sbjct: 6    SWHRMSN-FMRKYRKIPHSSFKTKWNENLKQKYAMEELRSNLLTDSENASVMRTLLSSFQ 64

Query: 365  MYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNR 544
            ++ CEP P+AY FVIK LA + S+ + I  VL H++  E F+TPE I  D+I  YG S R
Sbjct: 65   LHNCEPTPQAYRFVIKTLAKS-SQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGR 123

Query: 545  IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721
            I+ AIE+F++IPNFRC PS ++L ALLLVL  K   LE++P++L+KA  M +R+EES FG
Sbjct: 124  IEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLEESTFG 183

Query: 722  ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901
            ILI ALCRIG+V+ A EL+ +M+ D   +D R++S +L ++ + K+ S  +++ +LE++ 
Sbjct: 184  ILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLR 243

Query: 902  KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081
            K  FSP   D+  V++FL +     + +  L +MK D ++PD+VCY ++L  ++ +EDY 
Sbjct: 244  KTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYP 303

Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261
             ADK+FDELL+LGL P+V+TYN +INGLCKQN +E  +KM+  M +LG  P++VTY+ ++
Sbjct: 304  KADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILI 363

Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441
             +LV  G L+ A+ L  +ME  G+  NS T++++I   +   E   A  LL+E  N  +V
Sbjct: 364  KALVKAGDLSRAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFN-MNV 422

Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVL 1522
                   +++I  LC+ G+++QA E+L
Sbjct: 423  FVKSSRIEEVISRLCEKGLMDQAVELL 449


>ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325583|gb|EFH56003.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 444

 Score =  360 bits (925), Expect = 1e-96
 Identities = 188/441 (42%), Positives = 291/441 (65%), Gaps = 5/441 (1%)
 Frame = +2

Query: 215  YLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTHLLNSLIDSFAMYECEPFP 388
            ++RK RK P S +KT+W+     + AM+ L+ ++   S    ++ +L+ SF ++ CEP P
Sbjct: 4    FMRKYRKIPQSSFKTKWNENLKQKYAMEELRSNLLADSENGSVMRTLVSSFQLHNCEPTP 63

Query: 389  KAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIELF 568
            +AY FVI+ LA   S+ + I  VLDH++  E F+TPE I  D+I  YG S RI+ AI++F
Sbjct: 64   QAYRFVIETLAKT-SQLENIASVLDHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIDVF 122

Query: 569  YRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALCR 745
            ++IPNFRC PS ++L ALLLVL  K   LE++P++L+KA  M +R+EES FGILI ALCR
Sbjct: 123  FKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASRMGVRLEESTFGILINALCR 182

Query: 746  IGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLGFSPNR 925
            IG+V+ A EL+ +M+ D   +D R++S +L ++ + K+ S  +++ +LE++ K  F P  
Sbjct: 183  IGEVDCATELVRYMSEDSVIVDPRLYSLLLSSVCKHKDSSCFDVIGYLEDLRKTRFLPGL 242

Query: 926  GDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKVFDE 1105
             D+  V++FL +     + +  L +MK D IDPD+VCY ++L  ++ +EDY  ADK+FDE
Sbjct: 243  RDYTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVLLGVIADEDYPKADKLFDE 302

Query: 1106 LLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVADGK 1285
            LL+LGL P+V+TYN +INGLCKQN +E  IKM+  M +LG  P++VTY+ ++  LV  G 
Sbjct: 303  LLLLGLDPDVYTYNVYINGLCKQNDIEGAIKMMSSMNKLGSEPNVVTYNIVIKGLVKAGD 362

Query: 1286 LNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEM--INKFSVSDSVLL 1459
            L+ A+ L  +MEM G+  NS TY+++I   +   E   A  LL+E   +N F  S  +  
Sbjct: 363  LSRAKTLWKEMEMNGVNRNSHTYDIMISAYIEVDEVVCAQGLLEEAFNMNLFVKSSKI-- 420

Query: 1460 FDKIICGLCQNGMLNQAFEVL 1522
             +++I  LC+ G++++A E+L
Sbjct: 421  -EEVISRLCEKGLMDKAVELL 440


>gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1|
            At2g38420 [Arabidopsis thaliana]
          Length = 444

 Score =  358 bits (920), Expect = 4e-96
 Identities = 185/439 (42%), Positives = 290/439 (66%), Gaps = 3/439 (0%)
 Frame = +2

Query: 215  YLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV--KSSKTHLLNSLIDSFAMYECEPFP 388
            ++RK RK P S +KT+W+     + AM+ L+ ++   S    ++ +L+ SF ++ CEP P
Sbjct: 4    FMRKYRKIPHSSFKTKWNENLKQKYAMEELRSNLLTDSENASVMRTLLSSFQLHNCEPTP 63

Query: 389  KAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNRIQGAIELF 568
            +AY FVIK LA + S+ + I  VL H++  E F+TPE I  D+I  YG S RI+ AIE+F
Sbjct: 64   QAYRFVIKTLAKS-SQLENISSVLYHLEVSEKFDTPESIFRDVIAAYGFSGRIEEAIEVF 122

Query: 569  YRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFGILIRALCR 745
            ++IPNFRC PS ++L ALLLVL  K   LE++P++L+KA  M +R+EES FGILI ALCR
Sbjct: 123  FKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLEESTFGILIDALCR 182

Query: 746  IGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMCKLGFSPNR 925
            IG+V+ A EL+ +M+ D   +D R++S +L ++ + K+ S  +++ +LE++ K  FSP  
Sbjct: 183  IGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGYLEDLRKTRFSPGL 242

Query: 926  GDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYVNADKVFDE 1105
             D+  V++FL +     + +  L +MK D ++PD+VCY ++L  ++ +EDY  ADK+FDE
Sbjct: 243  RDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIADEDYPKADKLFDE 302

Query: 1106 LLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTILPSLVADGK 1285
            LL+LGL P+V+TYN +INGLCKQN +E  +KM+  M +LG  P++VTY+ ++ +LV  G 
Sbjct: 303  LLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVTYNILIKALVKAGD 362

Query: 1286 LNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSVSDSVLLFD 1465
            L+ A+ L  +ME  G+  NS T++++I   +   E   A  LL+E  N  +V       +
Sbjct: 363  LSRAKTLWKEMETNGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEAFN-MNVFVKSSRIE 421

Query: 1466 KIICGLCQNGMLNQAFEVL 1522
            ++I  LC+ G+++QA E+L
Sbjct: 422  EVISRLCEKGLMDQAVELL 440


>ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum]
            gi|557112223|gb|ESQ52507.1| hypothetical protein
            EUTSA_v10017948mg [Eutrema salsugineum]
          Length = 456

 Score =  357 bits (917), Expect = 8e-96
 Identities = 189/450 (42%), Positives = 296/450 (65%), Gaps = 6/450 (1%)
 Frame = +2

Query: 191  SWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSV-----KSSKTHLLNSLID 355
            SW  + N + RK RK P S +KT+W+     + AM+ L+  +      +    +L +LI 
Sbjct: 6    SWHRMSN-FFRKYRKIPHSSFKTKWNENLKQKYAMEELRSGLIADSGSNENDGVLRTLIS 64

Query: 356  SFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGD 535
            SF ++ CEP P+AY FVIK LA   S+ + I  VL+HI+  E F+TPE I  D+I  YG 
Sbjct: 65   SFRLHNCEPTPQAYKFVIKTLAKT-SQLENIASVLNHIEISEKFDTPESIFRDVIFAYGF 123

Query: 536  SNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEES 712
            S RI+ AI++F++IPNFRC PS ++L ALL VL  K   L+++P+VL+KA  + +R+EES
Sbjct: 124  SGRIEEAIDVFFKIPNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASKLGVRLEES 183

Query: 713  CFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLE 892
              GILI ALCRIG+V+ A +L+  M++D + +D R++S +L ++ + K+ S  +++ +LE
Sbjct: 184  TLGILIDALCRIGEVDCATDLVKDMSDDCYIVDPRLYSLLLSSVCKHKDSSCFDVIGYLE 243

Query: 893  EMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEE 1072
             + K  FSP+  D+  V++FL +     + +  L +MK D I+PDIVCY +IL  ++ +E
Sbjct: 244  GLRKTRFSPDLRDYTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIILQGVIADE 303

Query: 1073 DYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYS 1252
            DY  ADK+FDELL+LGLVP+V+TYN +INGLCKQ+ +E GIKM+ CME+LG  P++VTY+
Sbjct: 304  DYKKADKLFDELLLLGLVPDVYTYNVYINGLCKQSDIECGIKMMSCMEKLGSEPNVVTYN 363

Query: 1253 TILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINK 1432
             ++ +LV  G ++ A+ +  +ME  G+  NS +Y++++   +   E   A  LL+E  ++
Sbjct: 364  ILIKALVKAGDMSRAKIIWEEMETNGVDRNSHSYDIMVNASIEADEVVCAHGLLEEAFSR 423

Query: 1433 FSVSDSVLLFDKIICGLCQNGMLNQAFEVL 1522
              V  S    +++IC LC  G++++A E+L
Sbjct: 424  SLVVKSSRT-EEVICRLCDKGLMDKAVELL 452


>ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella]
            gi|482562854|gb|EOA27044.1| hypothetical protein
            CARUB_v10023139mg [Capsella rubella]
          Length = 470

 Score =  348 bits (894), Expect = 4e-93
 Identities = 183/447 (40%), Positives = 287/447 (64%), Gaps = 3/447 (0%)
 Frame = +2

Query: 191  SWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQS--VKSSKTHLLNSLIDSFA 364
            SW  + N +LRK RK P SP+KT+W+     + AM+ L+ S    S    ++ +L+ SF 
Sbjct: 23   SWHRMSN-FLRKYRKIPHSPFKTKWNENLKQKYAMEELRSSPVADSEDGGVIRTLVSSFR 81

Query: 365  MYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILIDLIKFYGDSNR 544
            ++ CEP P+AY FVIK LA   S+ + I  VL H++  E F+TPE I  D+I  YG + R
Sbjct: 82   LHNCEPTPQAYRFVIKTLAKT-SQLENIASVLSHLEVSEKFDTPESIFRDVIAAYGFAGR 140

Query: 545  IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721
            I  AI++F++IPNFRC PS ++L ALLLVL  K   LE++P++L+KA  M +R+EES FG
Sbjct: 141  IGEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASRMGVRLEESTFG 200

Query: 722  ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901
            ILI ALC+IG+V+ A EL+ +M+ D   +D R++S +L ++ + K+ S  +++ +LE++ 
Sbjct: 201  ILIDALCKIGEVDCATELVRYMSIDCVIVDPRLYSQLLSSVCKHKDSSCFDVVGYLEDLR 260

Query: 902  KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081
            K  FSP   D+  V+ FL +     + +  L +MK D I+PDIVCY ++L  ++ + +Y 
Sbjct: 261  KTRFSPGLRDYTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVLQGVIADAEYS 320

Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261
             ADK FDELL+LGL P+V+TYN ++NGLCKQN +E  +KM+  M +LG  P+++TY+ ++
Sbjct: 321  KADKFFDELLLLGLAPDVYTYNVYMNGLCKQNDIEGALKMMSSMNKLGSEPNVITYNILI 380

Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441
             +LV  G L+ A+ L  +M + G+  NS TY+++I   +   +   A   L+E  N  +V
Sbjct: 381  KALVNAGDLSQAKTLWEEMGINGVNRNSHTYDIMISAFIEVGDVVSAQGFLEEAFN-MNV 439

Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVL 1522
                   +++I  LC  G++++A E+L
Sbjct: 440  FAKSSRTEEVISRLCDKGLMDKAVELL 466


>ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223526239|gb|EEF28557.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 373

 Score =  322 bits (826), Expect = 3e-85
 Identities = 168/346 (48%), Positives = 231/346 (66%), Gaps = 1/346 (0%)
 Frame = +2

Query: 545  IQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGV-LEIIPQVLIKAQLMNIRIEESCFG 721
            +Q AI LFYR PNFRC PSV+ L  LL VLC     L  +P+VL+K+Q MNIR+EES F 
Sbjct: 1    MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQDMNIRMEESSFR 60

Query: 722  ILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAEIMCFLEEMC 901
            +LI ALC I KV YA E+ N M NDGF++D +I S +L ++  Q + S +E+M FL E+ 
Sbjct: 61   LLINALCSINKVGYAVEMFNCMINDGFSVDSKICSLLLSSLCYQADISSSEVMRFLGELR 120

Query: 902  KLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILDRLVLEEDYV 1081
            K GF P   D+  VI FL ++    +A+  L +MKLD I PDIVCY  +L+ ++    Y 
Sbjct: 121  KFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIANGVYS 180

Query: 1082 NADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRPDLVTYSTIL 1261
             AD++FDELLV GLVP+V+TYN +I GLCKQN VE GI+M+  MEELGC+P+L+TY+ +L
Sbjct: 181  KADELFDELLVFGLVPDVYTYNVYIYGLCKQNNVEAGIEMVTSMEELGCKPNLITYNILL 240

Query: 1262 PSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLLQEMINKFSV 1441
              L  +G+ + AR+L+  M  KG+ L   TY+++I GL +  +  +A  LL+E ++K  +
Sbjct: 241  EDLCKNGEDSRARDLVRDMGSKGIGLGMQTYKVMIHGLTSGGKIVKACSLLEEALDK-GL 299

Query: 1442 SDSVLLFDKIICGLCQNGMLNQAFEVLRKMSEKNVGPGVESWEALL 1579
                L FD++I GLCQ G + +A E+L K+  KNV PGV  WE LL
Sbjct: 300  CPRGLRFDEVIYGLCQTGSICKALELLEKVVNKNVSPGVRVWETLL 345


>emb|CAN63706.1| hypothetical protein VITISV_013107 [Vitis vinifera]
          Length = 390

 Score =  317 bits (813), Expect = 9e-84
 Identities = 177/420 (42%), Positives = 244/420 (58%), Gaps = 8/420 (1%)
 Frame = +2

Query: 179  LSTPSWSPVHNCYLRKRRKWPLSPYKTQWHLTFAHQQAMQNLKQSVKS--------SKTH 334
            L  PS+S  +  +LRKRRKWPLSPYK  WH TF H+QAMQ LK ++ +        S + 
Sbjct: 4    LRPPSFSKTN--FLRKRRKWPLSPYKATWHETFHHRQAMQTLKNTIANQSPSPQSPSNSQ 61

Query: 335  LLNSLIDSFAMYECEPFPKAYHFVIKVLANNPSRWDQIPQVLDHIQKVETFETPEFILID 514
             L+ LIDSF +Y  +P P AY FVI  L     ++  +P +L  ++KVE FETPEFI  +
Sbjct: 62   FLSILIDSFRIYNSDPTPNAYRFVISTLTRC-RQFHHLPPLLHRLEKVEKFETPEFIFTN 120

Query: 515  LIKFYGDSNRIQGAIELFYRIPNFRCNPSVHSLKALLLVLCEKGVLEIIPQVLIKAQLMN 694
            LIK                                                +L+K+Q MN
Sbjct: 121  LIK------------------------------------------------ILLKSQAMN 132

Query: 695  IRIEESCFGILIRALCRIGKVNYAAELLNHMANDGFNLDGRIFSYILKAMGEQKECSGAE 874
            IR+EES F IL+ ALCRI K NYA  +LN+M NDG+ +D ++ S IL ++ EQK  SG E
Sbjct: 133  IRLEESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGDE 192

Query: 875  IMCFLEEMCKLGFSPNRGDWYDVIQFLAKKENATDAIKALRRMKLDCIDPDIVCYNLILD 1054
            ++ F+EEM KLGF P R D  +VI FL K+    DA+    +MK D I PD V Y +IL+
Sbjct: 193  VLRFMEEMRKLGFYPGRVDCNNVIXFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMILN 252

Query: 1055 RLVLEEDYVNADKVFDELLVLGLVPNVFTYNSFINGLCKQNKVEEGIKMLRCMEELGCRP 1234
             +  + DY  AD +FDE+LVLG+VP++  YN +IN LCKQN +EEG++ML  M ELGC+P
Sbjct: 253  GVTADGDYEKADDLFDEMLVLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCKP 312

Query: 1235 DLVTYSTILPSLVADGKLNLARELMSKMEMKGMQLNSVTYELVIRGLLTNSEFEQAFDLL 1414
            D V Y+ +L  +     L   REL  +ME++G+Q N  TY +++ GL+   E +++  L+
Sbjct: 313  DYVXYNMLLEGMSKVRDLGGMRELAREMELEGVQWNWETYRIMLDGLVGKGEIDESCSLV 372


Top