BLASTX nr result

ID: Forsythia23_contig00003590 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00003590
         (1381 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containi...   588   e-165
ref|XP_011088649.1| PREDICTED: pentatricopeptide repeat-containi...   541   e-151
emb|CDP14534.1| unnamed protein product [Coffea canephora]            476   e-131
ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   476   e-131
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   469   e-129
ref|XP_009629638.1| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_009761148.1| PREDICTED: pentatricopeptide repeat-containi...   461   e-127
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   457   e-126
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   455   e-125
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   436   e-119
ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   431   e-118
ref|XP_012480490.1| PREDICTED: pentatricopeptide repeat-containi...   428   e-117
ref|XP_010086846.1| hypothetical protein L484_006076 [Morus nota...   427   e-116
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   425   e-116
ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containi...   422   e-115
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   421   e-115
gb|KHG25256.1| hypothetical protein F383_08951 [Gossypium arboreum]   418   e-114
ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containi...   418   e-114
gb|KDO68195.1| hypothetical protein CISIN_1g042756mg, partial [C...   417   e-113
ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containi...   415   e-113

>ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Erythranthe guttatus] gi|604333640|gb|EYU37991.1|
            hypothetical protein MIMGU_mgv1a006093mg [Erythranthe
            guttata]
          Length = 458

 Score =  588 bits (1515), Expect = e-165
 Identities = 295/433 (68%), Positives = 348/433 (80%)
 Frame = -3

Query: 1301 MIGVCGLRLSAPPAKPAGCCRGRQYPLVFCDLTKQGHRLLSSIATAQDPSASIGLLRKFI 1122
            MIGVC ++LS   A+P      RQ P + C LTKQG RLLSSIAT++ PSA+I LLRKF+
Sbjct: 1    MIGVCSIQLSLS-ARPVSA-GFRQLPPLVCVLTKQGQRLLSSIATSEQPSAAISLLRKFV 58

Query: 1121 ASSSKHVAXXXXXXXXXXXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALLNKQE 942
            ASSSKHVA                   LAFPLY II QESWF+WN+KLVADLI+LL K E
Sbjct: 59   ASSSKHVALSTLSHLLSPSTSHPRLSSLAFPLYGIIEQESWFTWNSKLVADLISLLYKAE 118

Query: 941  RFDEAENLFSEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSV 762
            RFDEA+NLF E VSKLGFKER+LC FYCNLVDSHAKH SE+ V DSC  LKQLIL SSSV
Sbjct: 119  RFDEADNLFGETVSKLGFKERDLCTFYCNLVDSHAKHMSERGVSDSCTRLKQLILASSSV 178

Query: 761  YVKRKGYESIVSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKR 582
            YVK+KGYES+++GFC IG P++AEN +EEMR  GLKPS FE+R+L+YGYG++G  EDMKR
Sbjct: 179  YVKQKGYESMIAGFCEIGSPDKAENLMEEMRQNGLKPSAFELRTLVYGYGQMGLLEDMKR 238

Query: 581  SIIQMENEGFELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCP 402
            S+ QME EGFELDTVC NMV+SSFGA N+  DM+ WLKK+RNSGI  SIRTYNSVLNSCP
Sbjct: 239  SVGQMEKEGFELDTVCYNMVLSSFGARNEFLDMLLWLKKMRNSGIPFSIRTYNSVLNSCP 298

Query: 401  EITLMVQDLKNLPLSINELVDNLNKDEADLVMELVKSSVLDQATEWKSSELKLDLHGMHL 222
             + L+++D+K+LPLS+NELVDNL   EADLV+EL+KS VLDQ  EWKS+ELKLD+HGMHL
Sbjct: 299  TVILLLEDMKSLPLSVNELVDNLKTGEADLVLELMKSDVLDQVMEWKSTELKLDMHGMHL 358

Query: 221  SSAYLILLQWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKC 42
            S+AYLILLQWF E+++RF  GN   P EILVVCGSGKHS+ RGESPVK LAK+M+ ++KC
Sbjct: 359  STAYLILLQWFKELKVRFGDGNHETPTEILVVCGSGKHSSKRGESPVKVLAKEMVTRMKC 418

Query: 41   PMRIDRKNVGCLI 3
            P+RIDRKN+GC I
Sbjct: 419  PLRIDRKNIGCFI 431


>ref|XP_011088649.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Sesamum indicum]
          Length = 454

 Score =  541 bits (1394), Expect = e-151
 Identities = 265/429 (61%), Positives = 334/429 (77%), Gaps = 2/429 (0%)
 Frame = -3

Query: 1283 LRLSAPPAKPAGCCRGRQYPLVFCDLTKQGHRLLSSIATAQDPSASIGLLRKFIASSSKH 1104
            L LS PP   A       YP   C LT+QGHR LSS+ T QDPSA++GLLRKF++SSSKH
Sbjct: 7    LHLSPPPPPTAF----PHYPPFLCALTRQGHRFLSSLLTTQDPSAALGLLRKFVSSSSKH 62

Query: 1103 VAXXXXXXXXXXXXXXXXXXXL--AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDE 930
            VA                      AFPLY++I+QESWFSWNTKL+ADLIA L K+E FD+
Sbjct: 63   VALTTLSHLLSPSPSNSNPRLSSLAFPLYSMIKQESWFSWNTKLLADLIAFLYKEEHFDD 122

Query: 929  AENLFSEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKR 750
            AE+L +E V +L FK+R+LC FYCNLV+SHAKHKSE  V+DSC +L+ LI  +SSVYV+ 
Sbjct: 123  AEDLLTETVMRLRFKKRDLCMFYCNLVESHAKHKSEGGVLDSCTQLRHLIFLTSSVYVRH 182

Query: 749  KGYESIVSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQ 570
            + Y S+V+GFC +GLP++AEN ++EMR  GLKPS+FE+RSL+YGYG++G  EDMKRSI+Q
Sbjct: 183  RAYGSMVAGFCEVGLPDKAENLMQEMRENGLKPSVFELRSLVYGYGQMGFLEDMKRSIVQ 242

Query: 569  MENEGFELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITL 390
            +E +GFELDTV CNMV+SSFGAHN+L +M+SWLKK+   GI  S RTYNSVLNSCP I L
Sbjct: 243  VEKDGFELDTVGCNMVLSSFGAHNELLEMLSWLKKMTTLGIPFSTRTYNSVLNSCPTIIL 302

Query: 389  MVQDLKNLPLSINELVDNLNKDEADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAY 210
            M++D+KNLPLS +EL+ NL  +EA+LV+EL+KS+VLDQ  EW SSELKLD+HGMHL++AY
Sbjct: 303  MLEDMKNLPLSTDELLGNLKVEEANLVLELLKSTVLDQVMEWGSSELKLDMHGMHLTTAY 362

Query: 209  LILLQWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRI 30
            L+LLQ F E++LRF+AGN   P EI V+CG GKHS+ RGESPVK L K+++++ KCP+RI
Sbjct: 363  LVLLQCFKELKLRFLAGNHTTPTEISVICGCGKHSSTRGESPVKSLTKEIIKRTKCPLRI 422

Query: 29   DRKNVGCLI 3
            DRKNVGC I
Sbjct: 423  DRKNVGCFI 431


>emb|CDP14534.1| unnamed protein product [Coffea canephora]
          Length = 449

 Score =  476 bits (1226), Expect = e-131
 Identities = 239/409 (58%), Positives = 306/409 (74%), Gaps = 3/409 (0%)
 Frame = -3

Query: 1220 VFCDLTKQGHRLLSSIATAQDPSASIG--LLRKFIASSSKHVAXXXXXXXXXXXXXXXXX 1047
            V C L KQG R LSS+AT  + S++     LRKF+ +SSKHVA                 
Sbjct: 30   VCCSLCKQGQRFLSSLATTDESSSAAHHRSLRKFVKTSSKHVALDTLSHLLSPTTAHPHL 89

Query: 1046 XXL-AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELC 870
                A PLY II Q SWFSWN KL+AD+ AL+ KQERF EAE L  +A+ KL   +R+LC
Sbjct: 90   SYHLALPLYLIISQASWFSWNAKLLADVTALMYKQERFIEAEALILQALKKLPAHDRDLC 149

Query: 869  NFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAE 690
            NFYC+L+ S+AKH+S K V DS   LKQL+ +SSSVYV+++ YES++SG C IGLP EAE
Sbjct: 150  NFYCHLLHSNAKHRSRKGVFDSLTSLKQLLARSSSVYVQKRAYESMISGLCEIGLPGEAE 209

Query: 689  NSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSF 510
            N +EEMR VGLKPS FE +SL++ YGRLG FEDMKRS+ QME+ G ELDTVC NMV+SS 
Sbjct: 210  NLMEEMRGVGLKPSGFEFKSLVHAYGRLGLFEDMKRSVTQMEDAGVELDTVCSNMVLSSL 269

Query: 509  GAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLN 330
            G+H   S+MVSWL+++++S +S SIRTYNSVLNSCP + L++QD K +PLS+ +L+ NL+
Sbjct: 270  GSHKVFSEMVSWLRRMKDSEVSFSIRTYNSVLNSCPTLILLLQDPKTIPLSMEDLMGNLS 329

Query: 329  KDEADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQV 150
            ++EADLV ELV SSVLD+A E  S+ELKLDLHGMHLS++ LI LQW   ++LRF AG+ +
Sbjct: 330  QEEADLVRELVASSVLDEAMECNSAELKLDLHGMHLSTSCLIFLQWIDRLRLRFSAGDNM 389

Query: 149  APAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
             P +I VVCGSGKHSA RGESPVKGL ++M+ ++KCP+RIDR+N+GC +
Sbjct: 390  VPTQITVVCGSGKHSASRGESPVKGLLREMILRIKCPLRIDRRNLGCFV 438


>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  476 bits (1226), Expect = e-131
 Identities = 238/414 (57%), Positives = 314/414 (75%), Gaps = 3/414 (0%)
 Frame = -3

Query: 1235 RQYPLVFCDLTKQGHRLLSSI--ATAQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXX 1062
            R  P   C L+KQGHR LS++  A ++D SA+  LLRKF+ASSSKHVA            
Sbjct: 24   RPRPCPRCSLSKQGHRFLSTLIAADSEDISATRHLLRKFVASSSKHVALSTLSHLVSPTT 83

Query: 1061 XXXXXXXL-AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFK 885
                     A PLY  I + SWF WN+KLVADL+ALL K ERFDEAE L +E VSKLG +
Sbjct: 84   TSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGSR 143

Query: 884  ERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGL 705
            ER+LC+FY  L+ S +KH SE+ V+D C +LK ++L+SSSVY+K++GY S+V GFC IGL
Sbjct: 144  ERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGYASMVEGFCLIGL 203

Query: 704  PNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNM 525
            P +AE  +EEM+ +GLK S FE RSL+Y YG+ G   DMKR +++ME+ GF+LDTV  NM
Sbjct: 204  PRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMESMGFQLDTVSSNM 263

Query: 524  VVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINEL 345
            V++SFG+HN+LS++VS L+K+  SG+  SIRTYNSVLNSCP I+L++QDLK++PLS+ EL
Sbjct: 264  VLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQDLKSVPLSLEEL 323

Query: 344  VDNLNKDEADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFI 165
            + NL+++EA LV  LV SSVL++  +WK SELKLDLHGMHL+SAY+I+LQWF ++Q +F+
Sbjct: 324  MGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVIILQWFHQLQCKFL 383

Query: 164  AGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            A N+V P EI+VVCG+GKHS VRGESPVK L K+++ ++ CP+RIDRKN+GC I
Sbjct: 384  AENRVLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNIGCFI 437


>ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Solanum lycopersicum]
          Length = 459

 Score =  469 bits (1206), Expect = e-129
 Identities = 237/407 (58%), Positives = 309/407 (75%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1214 CDLTKQGHRLLSS-IAT-AQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXX 1041
            C L+KQGHR LS+ IAT + D SA+  LLRKF+ SSSKHVA                   
Sbjct: 31   CSLSKQGHRFLSTLIATDSDDISATRHLLRKFVGSSSKHVALSTLSHLVSPTTTSHYRLC 90

Query: 1040 L-AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNF 864
              A PLY  I + SWF WN+KLVA+L+ALL K ERFDEAE L +E+VSKLG +ER+LC+F
Sbjct: 91   SLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETLVTESVSKLGSRERDLCSF 150

Query: 863  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENS 684
            Y  L+ S +KH SE+ V+D C +LK ++L SSSVY+K++GY S+V GFC IGLP +AE  
Sbjct: 151  YSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSSVYLKQRGYASMVEGFCLIGLPRKAEEL 210

Query: 683  IEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 504
            +EEM+ +GLK S FE RSL+Y YG+ G   DMKR +++ME  GF+LDTV  NMV++SFG+
Sbjct: 211  MEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMERMGFQLDTVGSNMVLNSFGS 270

Query: 503  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKD 324
            HN+LS++VS L+K+  SG+  SIRTYNSVLNSCP I+L++QDLK++PLS+ EL+ NL+++
Sbjct: 271  HNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQDLKSVPLSLEELMGNLDEN 330

Query: 323  EADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 144
            EA LV  LV SSVL++  +WK  ELKLDLHGMHL+SAYLI+LQWF ++Q +F+A N+V P
Sbjct: 331  EAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLIILQWFHQLQCKFLAENRVLP 390

Query: 143  AEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
             EI+VVCG+GKHS VRGESPVK L K+++ ++ CP+RIDRKNVGC I
Sbjct: 391  GEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNVGCFI 437


>ref|XP_009629638.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nicotiana tomentosiformis]
          Length = 459

 Score =  468 bits (1205), Expect = e-129
 Identities = 239/406 (58%), Positives = 305/406 (75%), Gaps = 2/406 (0%)
 Frame = -3

Query: 1214 CDLTKQGHRLLSS-IAT-AQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXX 1041
            C L+KQGHR +S+ IAT + D SA+  LLRKF+ASSSKHVA                   
Sbjct: 32   CSLSKQGHRFISTLIATDSDDISATHRLLRKFVASSSKHVALSTLSHLLSPTTSHLRLSS 91

Query: 1040 LAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNFY 861
            LA PLY  I + SWF WN+KLVADL+ALL K ERFDEAE L +E VSKLG +ER+LC+FY
Sbjct: 92   LALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGGRERDLCSFY 151

Query: 860  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENSI 681
              L+ S +KHKSEK V+D C +LK  +  SSSVY+K++GY S+V  FC+IGLP +AE  I
Sbjct: 152  SQLIHSQSKHKSEKGVLDFCTKLKLFLSCSSSVYLKQQGYASMVDAFCSIGLPRDAEELI 211

Query: 680  EEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 501
            EEM+ +GLK S FE R+L+Y YG+ G F DMKR + QME+ G +LDTV  NMV++SFG+ 
Sbjct: 212  EEMKELGLKLSKFEFRALVYSYGKSGFFSDMKRIVGQMESMGLQLDTVGANMVLNSFGSQ 271

Query: 500  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKDE 321
             +LS+MVSWL+K+  SG+  SIRTYNSVLNSCP I+L++QD K++PLS+ EL+ NLN++E
Sbjct: 272  YELSEMVSWLQKMDVSGVPFSIRTYNSVLNSCPTISLLLQDPKSVPLSLEELLANLNENE 331

Query: 320  ADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 141
            A LV  LV SSVL++  +W  SELKLDLHGMH SSAY+I+LQWF ++Q +  A N+V PA
Sbjct: 332  ASLVKILVGSSVLEETMQWNPSELKLDLHGMHFSSAYVIILQWFHQLQCKLDAENRVLPA 391

Query: 140  EILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            EI VVCG+GKHS VRGESPVKGL K+++ ++ CP+RIDRKN+GC I
Sbjct: 392  EITVVCGAGKHSVVRGESPVKGLIKELLLRVGCPLRIDRKNIGCFI 437


>ref|XP_009761148.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nicotiana sylvestris]
          Length = 455

 Score =  461 bits (1187), Expect = e-127
 Identities = 235/406 (57%), Positives = 306/406 (75%), Gaps = 2/406 (0%)
 Frame = -3

Query: 1214 CDLTKQGHRLLSS-IAT-AQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXX 1041
            C L+KQGHR +S+ IAT + D SA+  LLRKF+ASSSKHVA                   
Sbjct: 32   CSLSKQGHRFISTLIATDSDDISATHRLLRKFVASSSKHVALSTLSQLLSPTTSNLRLSS 91

Query: 1040 LAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNFY 861
            LA PLY  I + SWF WN+KLVADL+ALL K ERFDEAE L +E VSKLG +ER+LC+FY
Sbjct: 92   LALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGSRERDLCSFY 151

Query: 860  CNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENSI 681
              L+ S +K KSE+ V++   +LKQ+IL SSSVY+K++GY S+V  FC+IGLP EAE  +
Sbjct: 152  SQLIHSLSKQKSERGVLNFVTKLKQVILCSSSVYLKQQGYASMVDAFCSIGLPREAEEFM 211

Query: 680  EEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAH 501
            EEM+ +GLK S FE R+L+Y YG+ G F +MKR + QM+  G +LDTV  NMV++SFG+ 
Sbjct: 212  EEMKELGLKLSKFEFRALVYSYGKSGCFSEMKRIVGQMDGLGLKLDTVGANMVLNSFGSQ 271

Query: 500  NKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKDE 321
             +LS+MVSWL+K++ S +  SIRTYNSVLNSCP I+ ++QD K+LPLS+ EL+ NLN++E
Sbjct: 272  YELSEMVSWLRKMKASDVPFSIRTYNSVLNSCPTISHLLQDPKSLPLSLEELMGNLNENE 331

Query: 320  ADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPA 141
            A LV  LV SSVL++  +W  SELKLDLHGMHLSSAY+++LQWF ++Q + +A N+V PA
Sbjct: 332  AGLVKILVGSSVLEETMQWNPSELKLDLHGMHLSSAYVVILQWFHQLQCKLVAENRVLPA 391

Query: 140  EILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            EI VVCG+GKHS VRGESPVKGL K+++ ++ CP+RIDRKN+GC I
Sbjct: 392  EITVVCGTGKHSVVRGESPVKGLIKEILLRVGCPLRIDRKNIGCFI 437


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  457 bits (1176), Expect = e-126
 Identities = 234/407 (57%), Positives = 297/407 (72%)
 Frame = -3

Query: 1223 LVFCDLTKQGHRLLSSIATAQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXX 1044
            L+ C L+KQG   LSS+A  +DPSAS  L+ KFIASSSK +A                  
Sbjct: 20   LIQCALSKQGQLFLSSVA--RDPSASNRLICKFIASSSKSIALNALSHLLSPTTTHPYLS 77

Query: 1043 XLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNF 864
             LA PLY+ I + SWFSWN KL+AD+IALL KQ +  EAE L SE + KLG +ER+L +F
Sbjct: 78   SLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLVSF 137

Query: 863  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENS 684
            YCNL+DSH+KH S + V D  + L +++ +SSSVYVK + Y+S++S  CA+GLP EAEN 
Sbjct: 138  YCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAENL 197

Query: 683  IEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 504
            IEEMR  GLKPS+FE RS++YGYGR+G  EDM+R ++QM NEGFELDTV  NMV+SS+GA
Sbjct: 198  IEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSYGA 257

Query: 503  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKD 324
            +NK S+MVSWL++++NS I  SIRTYNSVLNSCP I  ++QDLK  P +I+EL++ L  D
Sbjct: 258  YNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLKGD 317

Query: 323  EADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 144
            EA LV EL+ S VL +  EW  SE KLDLHGMHL SAYLI+LQW  E++ R  A   V P
Sbjct: 318  EALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAAEYVMP 377

Query: 143  AEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
             EI VVCGSGKHS+VRGESPVK + ++MM + + PM+IDRKN+GC +
Sbjct: 378  VEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFV 424


>ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao] gi|508705664|gb|EOX97560.1|
            Pentatricopeptide (PPR) repeat-containing protein,
            putative [Theobroma cacao]
          Length = 456

 Score =  455 bits (1170), Expect = e-125
 Identities = 239/425 (56%), Positives = 300/425 (70%), Gaps = 4/425 (0%)
 Frame = -3

Query: 1265 PAKPAGCCRGRQYPLVFCDLTKQGHRLLSSIATA---QDPSASIGLLRKFIASSSKHVAX 1095
            P +P+  C     PL     TKQGHR  SS+A      DP+ +  L++KF+ASS K +A 
Sbjct: 20   PTRPSIKCESGGVPL-----TKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIAL 74

Query: 1094 XXXXXXXXXXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLF 915
                              LAFPLYT I + SW++WN KLVA+LIALL KQ R+DE+E L 
Sbjct: 75   NALSHLLSPRNSHPHLSALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALI 134

Query: 914  SEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYES 735
            S+AVSKL F+ER+L  FYCN ++S +KH S++   D+   L +LI  SSSVYVKR+GY+S
Sbjct: 135  SQAVSKLKFRERDLVQFYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKS 194

Query: 734  IVSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEG 555
            +VS  C +  PNEAEN +EEMR  GL P+LFE R + YGYG+LG FEDM+R + +ME EG
Sbjct: 195  MVSSLCEMDRPNEAENLVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEG 254

Query: 554  FELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDL 375
            FE+DT+C NMV+SS+GA+N  S MV WL+K++   I  SIRTYNSVLNSCPEI  +VQ L
Sbjct: 255  FEVDTICSNMVLSSYGAYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGL 314

Query: 374  KNLPLSINELVDNLNKDEADLVMELVK-SSVLDQATEWKSSELKLDLHGMHLSSAYLILL 198
             ++PLS+ EL   LN+DEA LV ELVK SSVLD+A EW  SE KLDLHGMHL SAYLI+L
Sbjct: 315  DSVPLSLGELAKILNEDEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIML 374

Query: 197  QWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKN 18
            QW  EM+ RF     V PA+I +VCGSGKHS+VRGESPVK L ++MM ++K PM+IDRKN
Sbjct: 375  QWIEEMKCRFKVEECVIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKN 434

Query: 17   VGCLI 3
            +GC I
Sbjct: 435  IGCFI 439


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  436 bits (1121), Expect = e-119
 Identities = 234/439 (53%), Positives = 294/439 (66%), Gaps = 9/439 (2%)
 Frame = -3

Query: 1292 VCGLRLSAPPAKPAGCCRGRQYPLVFCD-----LTKQGHRLLSSIATA--QDPSASIGLL 1134
            +  L +  PP   + CCR RQ  L         LTKQG R LSS+A A  +D  A+  L+
Sbjct: 2    ISSLHMRIPPPWNSRCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAVTRDSKAASRLI 61

Query: 1133 RKFIASSSKHVAXXXXXXXXXXXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALL 954
             KF+ASS + +A                   LAFPLY  I +ESWF WN KLVA++IA L
Sbjct: 62   SKFVASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFL 121

Query: 953  NKQERFDEAENLFSEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQ 774
            +KQ + +EAE L  E +SKLG +EREL  FYCNL+DS  KH S++   D+   L QL+  
Sbjct: 122  DKQGQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNS 181

Query: 773  SSSVYVKRKGYESIVSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFE 594
            SSSVYVKR+  +S++SG C +G P+EAEN IEEMR  GL+PS FE + +IYGYGRLG  E
Sbjct: 182  SSSVYVKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLE 241

Query: 593  DMKRSIIQMENEGFELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVL 414
            DM+R + QME++G  +DTVC NMV+SS+G HN+LS MV WL+K+++SGI  S+RTYNSVL
Sbjct: 242  DMERIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVL 301

Query: 413  NSCPEITLMVQDL--KNLPLSINELVDNLNKDEADLVMELVKSSVLDQATEWKSSELKLD 240
            NSC  I  M+QDL   + PLSI EL + LN++E  +V EL  SSVLD+A +W S E KLD
Sbjct: 302  NSCSTIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLD 361

Query: 239  LHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQM 60
            LHGMHL SAY I+LQW  EM+ RF     V PAEI VVCGSGKHS VRGES VK + K+M
Sbjct: 362  LHGMHLGSAYFIILQWMDEMRNRFNNEKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKM 421

Query: 59   MQQLKCPMRIDRKNVGCLI 3
            M +   PMR+ R N+GC I
Sbjct: 422  MVRTSSPMRVHRNNIGCFI 440


>ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
            gi|462396130|gb|EMJ01929.1| hypothetical protein
            PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  431 bits (1108), Expect = e-118
 Identities = 222/405 (54%), Positives = 287/405 (70%), Gaps = 1/405 (0%)
 Frame = -3

Query: 1214 CDLTKQGHRLLSSIAT-AQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXXL 1038
            C +TKQG R L+ +A  A+D   +  L+ KF+ SS+K +A                   L
Sbjct: 32   CAVTKQGQRFLTKLAANARDAKVTNKLIAKFLTSSTKSIALNTLSYLLSPDTTLPHLSSL 91

Query: 1037 AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNFYC 858
            A P Y+ I + SWF WN KLVA L+ALL+KQ + +EAE L SE +SKLG +EREL  F+C
Sbjct: 92   ALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELALFHC 151

Query: 857  NLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENSIE 678
             LV+SH+K  S+     S + L QL+  SSSVYVK + +ES+VSG C +  P EA+N IE
Sbjct: 152  QLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREADNLIE 211

Query: 677  EMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAHN 498
            EMR  GLKPS+FE RS++YGYGRLG FEDM + + QMEN+G  +DT+C NMV+SS+GAH+
Sbjct: 212  EMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSYGAHS 271

Query: 497  KLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKDEA 318
            +L+ M+ WL+K+++  +  SIRTYNSVLNSC  I  M+Q+ K+ P SI EL   LN DEA
Sbjct: 272  ELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLNGDEA 331

Query: 317  DLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAE 138
             LV ELV+S+VLD+   W+  E KLDLHGMHL SAYLILL+WF  M+ RF +G  V PAE
Sbjct: 332  LLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKDVIPAE 391

Query: 137  ILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            ++V+CGSGKHS+VRGESPVKGL KQMM +++ PMRIDRKNVGC +
Sbjct: 392  VVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFV 436


>ref|XP_012480490.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Gossypium raimondii] gi|763765430|gb|KJB32684.1|
            hypothetical protein B456_005G255600 [Gossypium
            raimondii]
          Length = 458

 Score =  428 bits (1100), Expect = e-117
 Identities = 222/417 (53%), Positives = 293/417 (70%), Gaps = 9/417 (2%)
 Frame = -3

Query: 1226 PLVFCD-----LTKQGHRLLSSI---ATAQDPSASIGLLRKFIASSSKHVAXXXXXXXXX 1071
            PL+ C+     LTKQ HR  SS+   A   DP+ +  L++KF+ASS K +A         
Sbjct: 27   PLIKCESGGVPLTKQAHRFFSSLTSTAAVDDPATANRLIKKFVASSPKSIALNALSHLLS 86

Query: 1070 XXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLG 891
                      +AFPLYT I + SW++WN KLVADL+ LL+ Q + DE++ L S+ VSKL 
Sbjct: 87   PRNSHPHLSAIAFPLYTKISEASWYNWNPKLVADLVPLLDIQGKHDESQALISQVVSKLK 146

Query: 890  FKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAI 711
            FKER+L  FYCNL++S +KH+S++   D+   L +L+  SSS+YVK++GY+S+VS  C +
Sbjct: 147  FKERDLVQFYCNLIESCSKHESKQGFNDAYGYLSELVNNSSSMYVKKQGYKSMVSSLCEM 206

Query: 710  GLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCC 531
            G PNEAEN +E+M   G+KPSLFE+R ++YGYG++G FEDM+R + +ME EGF +DT+  
Sbjct: 207  GQPNEAENVVEDMIKNGVKPSLFELRFVLYGYGKMGFFEDMERMVKKMEIEGFGVDTISS 266

Query: 530  NMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSIN 351
            NM++SS+GA+N L  MV WL+K++   I  SIRTYN VLNSCP I   V+     P+S++
Sbjct: 267  NMILSSYGAYNALPKMVPWLQKMKALEIPFSIRTYNCVLNSCPMIMSFVRGSGGFPVSVS 326

Query: 350  ELVDNLNKDEADLVMELVK-SSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQL 174
            ELV+ L++DEA LV ELV+ SSVLD+A EW   ELKLDLHGMH  SAYLI+LQW  EM+ 
Sbjct: 327  ELVNVLDEDEALLVKELVESSSVLDEAMEWDDLELKLDLHGMHSGSAYLIMLQWIKEMKS 386

Query: 173  RFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            RF     V PA+I VVCG+GKHS+VRGESPVK L K MM Q+K PMRIDRKN+GC I
Sbjct: 387  RFRVKECVVPAQITVVCGTGKHSSVRGESPVKTLIKAMMVQMKSPMRIDRKNIGCFI 443


>ref|XP_010086846.1| hypothetical protein L484_006076 [Morus notabilis]
            gi|587833217|gb|EXB24044.1| hypothetical protein
            L484_006076 [Morus notabilis]
          Length = 517

 Score =  427 bits (1097), Expect = e-116
 Identities = 231/431 (53%), Positives = 292/431 (67%), Gaps = 1/431 (0%)
 Frame = -3

Query: 1292 VCGLRLSAPPAKPAGCCRGRQYPLVFCDLTKQGHRLLSSIA-TAQDPSASIGLLRKFIAS 1116
            +CGL     P + A      Q     C LTKQGHR LS+++  A + SA+  L+ KF+AS
Sbjct: 85   ICGLS----PTRSAAASSSIQ-----CALTKQGHRFLSTLSINAGNASAANKLIGKFVAS 135

Query: 1115 SSKHVAXXXXXXXXXXXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERF 936
            S K ++                    +  LY+ IR+ SWF ++ KLVA L ALL+KQ R+
Sbjct: 136  SPKSISLNALSHLLSPDTTHTHLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRY 195

Query: 935  DEAENLFSEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYV 756
             EAE L +EAVSKLG ++REL  FYC+LV+SH+K  S+     S   L QL+  SSS YV
Sbjct: 196  SEAEALIAEAVSKLGHRQRELAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYV 255

Query: 755  KRKGYESIVSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSI 576
            K + +E++V   C +  P EAE+ +EEMR+ GLKPS+FE RSL+YGYGRLG +EDM R++
Sbjct: 256  KCRAFETMVGALCTMDRPCEAESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTV 315

Query: 575  IQMENEGFELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEI 396
             QME EG  +DT+C NMV+SS+GAHN+L  MV WL+K+R S I  SIRTYNSVLN CP I
Sbjct: 316  NQMEIEGLVIDTICSNMVLSSYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTI 375

Query: 395  TLMVQDLKNLPLSINELVDNLNKDEADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSS 216
            T M+QDLK++PLS+ EL   L  DE  LVMELV SSVL++   W S E+KLDLHGMHL S
Sbjct: 376  TAMLQDLKDIPLSMYELNATLRGDEGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGS 435

Query: 215  AYLILLQWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPM 36
            AYLI+L+W  EM  RF  GN   PAE++VVCGSGKHS VRG SPVK L K+MM Q+K PM
Sbjct: 436  AYLIMLEWMEEMTRRFNDGNHGIPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPM 495

Query: 35   RIDRKNVGCLI 3
            +IDRKN GC +
Sbjct: 496  KIDRKNAGCFL 506


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  425 bits (1093), Expect = e-116
 Identities = 217/405 (53%), Positives = 283/405 (69%), Gaps = 1/405 (0%)
 Frame = -3

Query: 1214 CDLTKQGHRLLSSIAT-AQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXXL 1038
            C LTKQG R L+ +A  A +PS +  L+ KF+++S K  A                   L
Sbjct: 33   CALTKQGQRFLTKLAANAGNPSVANKLISKFLSTSPKSTALTTLSYLLSPHTAHPHLSSL 92

Query: 1037 AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNFYC 858
            A P+Y+ I + SWF WN KLVA L+ALL KQ +  ++E L SE +SKLG KEREL  F+C
Sbjct: 93   ALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEALISETISKLGNKERELVQFHC 152

Query: 857  NLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENSIE 678
             LV+SH+K  S+     +C  L QL+  SSSVYVKR+ +ES+V G CA+  P EA+  IE
Sbjct: 153  QLVESHSKMSSKCGFDRACTYLHQLLQNSSSVYVKRRAFESMVGGLCAMDRPGEADELIE 212

Query: 677  EMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAHN 498
            EMR  GLK S+FE RS++YGYGRLG FE+M + + QME +GF  DT+CCNMV+SS+GAHN
Sbjct: 213  EMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQGFGDDTICCNMVLSSYGAHN 272

Query: 497  KLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKDEA 318
            +L+ M +WL+K++ S +  S+RTYNSVLNSCP I  M+Q+ K +P S+ EL   L+ DEA
Sbjct: 273  ELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQEPKAVPCSVGELSGVLDGDEA 332

Query: 317  DLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAE 138
             +V ELV S+V+D+A  W S+E KLDLHGMHL SAYL++L+WF  M  RF +   V PAE
Sbjct: 333  LVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVMLEWFEAMGNRFKSAECVVPAE 392

Query: 137  ILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            +++VCG GKHS+VRGESPVK L K+MM Q++ PMRIDRKNVGC I
Sbjct: 393  VVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRKNVGCFI 437


>ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nelumbo nucifera]
          Length = 451

 Score =  422 bits (1084), Expect = e-115
 Identities = 218/408 (53%), Positives = 287/408 (70%), Gaps = 1/408 (0%)
 Frame = -3

Query: 1223 LVFCDLTKQGHRLLSSIATAQDPSASIG-LLRKFIASSSKHVAXXXXXXXXXXXXXXXXX 1047
            L +C L+K+GHR  +S+A A   SA+   L+RKF+ASSSK  A                 
Sbjct: 28   LPWCALSKKGHRFFTSLAAAAGDSAAANRLIRKFVASSSKSDALNALSHLISSNTTHFHL 87

Query: 1046 XXLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCN 867
              L  P+Y  I +  WF+WN KLVA +IA L+KQ + +EAE L SE+V KLGF+ER++  
Sbjct: 88   SSLVLPMYRRIAETPWFNWNPKLVASVIAYLDKQGQPEEAEALISESVQKLGFQERDVAL 147

Query: 866  FYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAEN 687
            FYC+L+DS++K +S   V +S   LKQL   SSS  + R+ YE+I+   C++ LP +AEN
Sbjct: 148  FYCDLIDSYSKQRSRIGVFESYARLKQLFSDSSSS-LSRRAYETIICSLCSVDLPRDAEN 206

Query: 686  SIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFG 507
             +EEM   G KPS FE RSL+ GYGRLG F DM+R + +ME+ G+ LDT+C NMV+SSFG
Sbjct: 207  MVEEMTISGFKPSAFEFRSLVSGYGRLGLFTDMRRVLRKMEDAGYCLDTICSNMVLSSFG 266

Query: 506  AHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNK 327
            AH++LS+M SWL+K+++S IS SIRTYNSV+NSCP IT +++DLK +PLS+ +L   L K
Sbjct: 267  AHSELSEMASWLRKMKDSNISFSIRTYNSVMNSCPTITSLLKDLKFVPLSMEDLKGRLQK 326

Query: 326  DEADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVA 147
            DE  LV +L+ SSVL  A +W  SE KLDLHGMHL++AYLI+LQW   ++ RF AGN V 
Sbjct: 327  DETLLVEQLIGSSVLMDALKWCPSEGKLDLHGMHLATAYLIMLQWVQVLRSRFSAGNWVI 386

Query: 146  PAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            P E  V+CGSGKHS+VRGESPVK L KQMM ++K PM+IDR NVGC +
Sbjct: 387  PTEFRVICGSGKHSSVRGESPVKALVKQMMVRMKSPMKIDRNNVGCFV 434


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  421 bits (1081), Expect = e-115
 Identities = 217/405 (53%), Positives = 287/405 (70%), Gaps = 3/405 (0%)
 Frame = -3

Query: 1208 LTKQGHRLLSSI---ATAQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXXL 1038
            L KQGHR LSS+   A A DPSA+   ++KF+A+S K V+                    
Sbjct: 53   LMKQGHRFLSSLSSPALAGDPSATNRHIKKFVAASPKSVSLNVLSHLLSAQTSHPHLSFF 112

Query: 1037 AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNFYC 858
            A  LY+ I + SWF WN KL+A+L+ALLNKQER  E+E L S AVS+L   ER++  FYC
Sbjct: 113  ALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSNAVSRLKSNERDIALFYC 172

Query: 857  NLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENSIE 678
            NLV+S++K  S +   ++C  L+++  +S+SVYVK + Y+S+VSG C +  P++AE+ IE
Sbjct: 173  NLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMVSGLCNMDQPHDAESVIE 232

Query: 677  EMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAHN 498
            EMR   +KP LFE +S++YGYGRLG FEDM R + +ME EG ++DTVC NMV+SS+GAHN
Sbjct: 233  EMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHN 292

Query: 497  KLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKDEA 318
             L  M SWL+K+++S + LS RTYNSVLNSCP I  +++DL + P+S++EL+  LNKDE 
Sbjct: 293  ALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDSCPVSLSELLTFLNKDEE 352

Query: 317  DLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAE 138
             LV  L +SSVLD+A EW S E KLDLHGMHLSS+YLI++QW  EM++RF  G  V PAE
Sbjct: 353  VLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWMDEMRIRFSEGKCVVPAE 412

Query: 137  ILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            I++V GSGKHS VRGESPVK L K++M +   PMRIDRKN+G  I
Sbjct: 413  IVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNIGSFI 457


>gb|KHG25256.1| hypothetical protein F383_08951 [Gossypium arboreum]
          Length = 458

 Score =  418 bits (1075), Expect = e-114
 Identities = 222/424 (52%), Positives = 294/424 (69%), Gaps = 9/424 (2%)
 Frame = -3

Query: 1247 CCRGRQYPLVFCD-----LTKQGHRLLSSI---ATAQDPSASIGLLRKFIASSSKHVAXX 1092
            C R  Q PL+ C+     LTKQ HR  SS+   A   DP+ +  L++KF+ASS K +A  
Sbjct: 21   CLRPTQ-PLIKCESGGVPLTKQAHRFFSSLISTAAVDDPATANRLIKKFVASSPKSIALN 79

Query: 1091 XXXXXXXXXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFS 912
                             +AFPLYT I + SW++WN KLVADL+ LL+ Q + DE++ L S
Sbjct: 80   ALSHLLSPRNSHPHLSAIAFPLYTKISEASWYNWNPKLVADLVPLLDIQGKHDESQALNS 139

Query: 911  EAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESI 732
            + VSKL FKER+L  FYCNL++S +KH+S++   D+   L +L+  SSS+YVK++G++S+
Sbjct: 140  QVVSKLKFKERDLVQFYCNLIESCSKHESKQGFNDAYGFLSELVNNSSSMYVKKQGFKSM 199

Query: 731  VSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGF 552
            VS  C +G PNEAEN +E+M   G+KPSLFE+R ++YGYG++G FEDM+R + +ME EGF
Sbjct: 200  VSSLCEMGQPNEAENVVEDMIKNGVKPSLFELRFVLYGYGKMGFFEDMERMVKKMEIEGF 259

Query: 551  ELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLK 372
             +DT+  NM++SS+GA+N L  MV WL+K++   I  SIRTYN VLNSCP I   V+   
Sbjct: 260  GVDTISSNMILSSYGAYNALPKMVPWLQKMKALEIPFSIRTYNCVLNSCPMIMSFVRGSG 319

Query: 371  NLPLSINELVDNLNKDEADLVMELVK-SSVLDQATEWKSSELKLDLHGMHLSSAYLILLQ 195
              P+S++ELV+ L++ EA LV ELV+ SSVLD+A EW   ELKLDLHGMH  SAYLI+LQ
Sbjct: 320  GFPVSVSELVNVLDEAEALLVKELVESSSVLDEAMEWDDLELKLDLHGMHSGSAYLIMLQ 379

Query: 194  WFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNV 15
            W  EM+ RF     V PA+I VVCG+GKHS+VRGESPVK L K MM Q+K PMRIDRKN+
Sbjct: 380  WIEEMKSRFRVEECVVPAQITVVCGTGKHSSVRGESPVKTLIKAMMVQMKSPMRIDRKNI 439

Query: 14   GCLI 3
            G  I
Sbjct: 440  GRFI 443


>ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Tarenaya hassleriana] gi|729371006|ref|XP_010548125.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g17033 [Tarenaya hassleriana]
          Length = 462

 Score =  418 bits (1074), Expect = e-114
 Identities = 218/405 (53%), Positives = 288/405 (71%), Gaps = 3/405 (0%)
 Frame = -3

Query: 1208 LTKQGHRLLSSI---ATAQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXXXL 1038
            LTKQGHR +SS+   A A D SA    +RKF+A+S K VA                   +
Sbjct: 46   LTKQGHRFISSLSSPAVAGDSSAINRQIRKFVAASPKSVALNVLSHLLSPLNSHPHLSSI 105

Query: 1037 AFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNFYC 858
            A  LY+ I +  WF WN KLVADL+ALLNKQE+F E+E+L S AVS+L   ER L  F+C
Sbjct: 106  ALNLYSEIAEAPWFDWNPKLVADLVALLNKQEQFPESESLLSAAVSRLKPNERGLALFHC 165

Query: 857  NLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENSIE 678
            NLV+S++K  S +   DS + L+++I +SSSVYVK +GY+SIVSG C +  P +AE  + 
Sbjct: 166  NLVESNSKQGSTRGFNDSYSCLREIIQRSSSVYVKSQGYKSIVSGLCNMDRPYDAERVLA 225

Query: 677  EMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGAHN 498
            EM+  G+KP LFE RS++YGYGRLG F DM R++ +ME++G ++DTVC NMV+SS+GA +
Sbjct: 226  EMKTEGIKPELFEYRSVLYGYGRLGLFFDMNRTVHEMESDGHKIDTVCSNMVLSSYGARD 285

Query: 497  KLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKDEA 318
             L +M SWL+K++  GI LSIRTYNSVLNSCP IT +++DL + P+S++EL   LN+DE 
Sbjct: 286  ALPEMGSWLQKLKGFGIPLSIRTYNSVLNSCPTITSLLKDLDSCPVSLSELTGLLNEDEM 345

Query: 317  DLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAE 138
             L  ELV+SSVLD+A EW + E KLDLHGMHLSS+YLI++QW  ++++RF  G  V P E
Sbjct: 346  LLTRELVQSSVLDEAMEWNALEGKLDLHGMHLSSSYLIMMQWMDKVRIRFEEGKHVIPVE 405

Query: 137  ILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            I++V GSGKHS VRGESPVK L K++M +   PMRIDRKN+G  I
Sbjct: 406  IVIVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNIGSFI 450


>gb|KDO68195.1| hypothetical protein CISIN_1g042756mg, partial [Citrus sinensis]
          Length = 425

 Score =  417 bits (1072), Expect = e-113
 Identities = 226/421 (53%), Positives = 283/421 (67%), Gaps = 9/421 (2%)
 Frame = -3

Query: 1292 VCGLRLSAPPAKPAGCCRGRQYPLVFCD-----LTKQGHRLLSSIATA--QDPSASIGLL 1134
            +  L +  PP   + CCR RQ  L         LTKQG R LSS+A A  +D  A+  L+
Sbjct: 2    ISSLHMRIPPPWNSRCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAVTRDSKAASRLI 61

Query: 1133 RKFIASSSKHVAXXXXXXXXXXXXXXXXXXXLAFPLYTIIRQESWFSWNTKLVADLIALL 954
             KF+ASS + +A                   LAFPLY  I +ESWF WN KLVA++IA L
Sbjct: 62   SKFVASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFL 121

Query: 953  NKQERFDEAENLFSEAVSKLGFKERELCNFYCNLVDSHAKHKSEKEVMDSCNELKQLILQ 774
            +KQ + +EAE L  E +SKLG +EREL  FYCNL+DS  KH S++   D+   L QL+  
Sbjct: 122  DKQGQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNS 181

Query: 773  SSSVYVKRKGYESIVSGFCAIGLPNEAENSIEEMRNVGLKPSLFEIRSLIYGYGRLGSFE 594
            SSSVYVKR+  +S++SG C +G P+EAEN IEEMR  GL+PS FE + +IYGYGRLG  E
Sbjct: 182  SSSVYVKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLE 241

Query: 593  DMKRSIIQMENEGFELDTVCCNMVVSSFGAHNKLSDMVSWLKKVRNSGISLSIRTYNSVL 414
            DM+R + QME++G  +DTVC NMV+SS+G HN+LS MV WL+K+++SGI  S+RTYNSVL
Sbjct: 242  DMERIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVL 301

Query: 413  NSCPEITLMVQDL--KNLPLSINELVDNLNKDEADLVMELVKSSVLDQATEWKSSELKLD 240
            NSC  I  M+QDL   + PLSI EL + LN++E  +V EL  SSVLD+A +W S E KLD
Sbjct: 302  NSCSTIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLD 361

Query: 239  LHGMHLSSAYLILLQWFGEMQLRFIAGNQVAPAEILVVCGSGKHSAVRGESPVKGLAKQM 60
            LHGMHL SAY I+LQW  EM+ RF     V PAEI VVCGSGKHS VRGES VK + K+M
Sbjct: 362  LHGMHLGSAYFIILQWMDEMRNRFNNEKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKM 421

Query: 59   M 57
            M
Sbjct: 422  M 422


>ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Malus domestica]
          Length = 461

 Score =  415 bits (1066), Expect = e-113
 Identities = 215/407 (52%), Positives = 284/407 (69%), Gaps = 1/407 (0%)
 Frame = -3

Query: 1220 VFCDLTKQGHRLLSSIAT-AQDPSASIGLLRKFIASSSKHVAXXXXXXXXXXXXXXXXXX 1044
            V C LTKQG R L+ +A  A+DP  +  L+ KF++SS K +A                  
Sbjct: 41   VQCVLTKQGQRFLTKLAANARDPKFTNKLISKFLSSSPKSIALSTLSYLLSPDSTPPHLS 100

Query: 1043 XLAFPLYTIIRQESWFSWNTKLVADLIALLNKQERFDEAENLFSEAVSKLGFKERELCNF 864
             LAFPLY+ I +ESWF WN KLVA L+ALL+ Q  + ++E L SE +SKLG +EREL  F
Sbjct: 101  SLAFPLYSKITEESWFEWNPKLVASLVALLDNQGLYSQSEALISETISKLGSRERELALF 160

Query: 863  YCNLVDSHAKHKSEKEVMDSCNELKQLILQSSSVYVKRKGYESIVSGFCAIGLPNEAENS 684
            +C L++SH+K  S+     + + L QL+  SSSVYVKR+ +ES+V G CA+  P EA+  
Sbjct: 161  HCQLLESHSKLSSKHGFDSTYSYLHQLLHNSSSVYVKRRAFESMVGGLCAMDRPQEADIL 220

Query: 683  IEEMRNVGLKPSLFEIRSLIYGYGRLGSFEDMKRSIIQMENEGFELDTVCCNMVVSSFGA 504
            IEEM   GLKPS+FE RS++YGYGRLG FE+M + + +ME +G  +DT+C NMV+SS+GA
Sbjct: 221  IEEMMVKGLKPSVFEFRSVVYGYGRLGLFEEMLKVVEKMEGQGLAVDTICSNMVLSSYGA 280

Query: 503  HNKLSDMVSWLKKVRNSGISLSIRTYNSVLNSCPEITLMVQDLKNLPLSINELVDNLNKD 324
            +++L+ MV WL+K++   +  SIRTYNSVLNSCP I  M+QD K++P SI +L   LN D
Sbjct: 281  YSELAAMVLWLRKMKILRLPFSIRTYNSVLNSCPTIMAMLQDPKDVPCSIEQLNGVLNGD 340

Query: 323  EADLVMELVKSSVLDQATEWKSSELKLDLHGMHLSSAYLILLQWFGEMQLRFIAGNQVAP 144
            E  +V ELV S+VL++   W+S E KLDLHG+HL SAYLI+L+WF  M+ RF  G  V P
Sbjct: 341  EGLVVKELVGSTVLEEVMVWESLEAKLDLHGLHLGSAYLIMLEWFEAMRHRFNCGECVIP 400

Query: 143  AEILVVCGSGKHSAVRGESPVKGLAKQMMQQLKCPMRIDRKNVGCLI 3
            AE+++VCG GKHS+VRGESPVKGL K MM ++  PMRIDRKNVGC I
Sbjct: 401  AEVVIVCGLGKHSSVRGESPVKGLVKVMMHRMGSPMRIDRKNVGCFI 447


Top