BLASTX nr result

ID: Papaver30_contig00044727 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver30_contig00044727
         (2274 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008804661.1| PREDICTED: pentatricopeptide repeat-containi...   290   3e-75
ref|XP_008804662.1| PREDICTED: pentatricopeptide repeat-containi...   290   5e-75
ref|XP_010274657.1| PREDICTED: pentatricopeptide repeat-containi...   289   7e-75
emb|CBI39461.3| unnamed protein product [Vitis vinifera]              283   4e-73
ref|XP_002269673.1| PREDICTED: pentatricopeptide repeat-containi...   283   4e-73
ref|XP_010923026.1| PREDICTED: uncharacterized protein LOC105046...   279   9e-72
ref|XP_002526313.1| conserved hypothetical protein [Ricinus comm...   279   9e-72
ref|XP_011015613.1| PREDICTED: pentatricopeptide repeat-containi...   278   1e-71
ref|XP_006478887.1| PREDICTED: pentatricopeptide repeat-containi...   278   1e-71
gb|KDO50539.1| hypothetical protein CISIN_1g047178mg [Citrus sin...   277   3e-71
ref|XP_006373907.1| hypothetical protein POPTR_0016s10300g [Popu...   277   3e-71
ref|XP_006443149.1| hypothetical protein CICLE_v10021498mg [Citr...   277   3e-71
ref|XP_011007631.1| PREDICTED: pentatricopeptide repeat-containi...   275   1e-70
ref|XP_011463739.1| PREDICTED: pentatricopeptide repeat-containi...   275   2e-70
ref|XP_004298657.1| PREDICTED: pentatricopeptide repeat-containi...   275   2e-70
ref|XP_011091155.1| PREDICTED: pentatricopeptide repeat-containi...   273   6e-70
ref|XP_012483420.1| PREDICTED: pentatricopeptide repeat-containi...   272   8e-70
ref|XP_007048491.1| Uncharacterized protein TCM_046974 [Theobrom...   269   7e-69
ref|XP_008455250.1| PREDICTED: pentatricopeptide repeat-containi...   268   2e-68
ref|XP_002323526.2| hypothetical protein POPTR_0016s10300g [Popu...   267   4e-68

>ref|XP_008804661.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190
            isoform X1 [Phoenix dactylifera]
          Length = 317

 Score =  290 bits (743), Expect = 3e-75
 Identities = 146/273 (53%), Positives = 195/273 (71%), Gaps = 14/273 (5%)
 Frame = -2

Query: 2105 IAQCCRSCYSTMVQPWRQKLST-----------QKEMKPSAEVQNNNPAIQELPKKDLSI 1959
            +A  C S Y+T    ++ + S            Q+ +K     +++N  + +  + D+  
Sbjct: 34   VAGHCCSSYATSTFGFQSRYSNDSRIVEDQVFEQRGLKAKFPSEHDNSMVNQKQRSDIEP 93

Query: 1958 IDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALE 1779
            + +++IG+ +S  +K KFL+NTLLDL++ KE +Y TLDAWVA EQ FP   LKR LI LE
Sbjct: 94   LPRQQIGKNISSAEKAKFLINTLLDLKNSKEAVYGTLDAWVAWEQNFPLAMLKRALIVLE 153

Query: 1778 REHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPK 1599
            ++ QWHRVVQV+KWMLSKGQG TMGTY QLIRALEKD+R EEAH IW +K+G+++HSVP 
Sbjct: 154  KQEQWHRVVQVVKWMLSKGQGTTMGTYEQLIRALEKDNRAEEAHKIWVKKIGHDLHSVPW 213

Query: 1598 ELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLE 1419
              CDLM+SIYY+NNM ERLVKLFK LE +DRKPP KS+V+KVADAY+ LGLLEE+  +LE
Sbjct: 214  RFCDLMLSIYYRNNMLERLVKLFKGLEEFDRKPPKKSIVRKVADAYELLGLLEEKNKLLE 273

Query: 1418 KYNYLLIEKSKEHPRKSE---KAA*KKVDKSGT 1329
            KY++L I+ S+E  RKS+   KA+ K   K+GT
Sbjct: 274  KYSHLFIKSSEERSRKSQKSKKASRKNDKKTGT 306


>ref|XP_008804662.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190
            isoform X2 [Phoenix dactylifera]
          Length = 315

 Score =  290 bits (741), Expect = 5e-75
 Identities = 145/269 (53%), Positives = 194/269 (72%), Gaps = 13/269 (4%)
 Frame = -2

Query: 2105 IAQCCRSCYSTMVQPWRQKLST-----------QKEMKPSAEVQNNNPAIQELPKKDLSI 1959
            +A  C S Y+T    ++ + S            Q+ +K     +++N  + +  + D+  
Sbjct: 34   VAGHCCSSYATSTFGFQSRYSNDSRIVEDQVFEQRGLKAKFPSEHDNSMVNQKQRSDIEP 93

Query: 1958 IDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALE 1779
            + +++IG+ +S  +K KFL+NTLLDL++ KE +Y TLDAWVA EQ FP   LKR LI LE
Sbjct: 94   LPRQQIGKNISSAEKAKFLINTLLDLKNSKEAVYGTLDAWVAWEQNFPLAMLKRALIVLE 153

Query: 1778 REHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPK 1599
            ++ QWHRVVQV+KWMLSKGQG TMGTY QLIRALEKD+R EEAH IW +K+G+++HSVP 
Sbjct: 154  KQEQWHRVVQVVKWMLSKGQGTTMGTYEQLIRALEKDNRAEEAHKIWVKKIGHDLHSVPW 213

Query: 1598 ELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLE 1419
              CDLM+SIYY+NNM ERLVKLFK LE +DRKPP KS+V+KVADAY+ LGLLEE+  +LE
Sbjct: 214  RFCDLMLSIYYRNNMLERLVKLFKGLEEFDRKPPKKSIVRKVADAYELLGLLEEKNKLLE 273

Query: 1418 KYNYLLIEKSKEHPRKSEKA--A*KKVDK 1338
            KY++L I+ S+E  RKS+K+  A +K DK
Sbjct: 274  KYSHLFIKSSEERSRKSQKSKKASRKNDK 302


>ref|XP_010274657.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera]
            gi|720059741|ref|XP_010274658.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera]
            gi|720059745|ref|XP_010274659.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera]
            gi|720059748|ref|XP_010274660.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera]
            gi|720059751|ref|XP_010274661.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Nelumbo nucifera]
          Length = 346

 Score =  289 bits (740), Expect = 7e-75
 Identities = 155/272 (56%), Positives = 191/272 (70%), Gaps = 7/272 (2%)
 Frame = -2

Query: 2120 MEVDWIAQCCR-------SCYSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLS 1962
            MEV WI Q            YST VQ       ++  +  S+EVQ  N    E    D  
Sbjct: 57   MEVGWITQLGMIKLPFGIPSYSTTVQGQIPNDCSRGAII-SSEVQLGNQTKHEDLSDDNL 115

Query: 1961 IIDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIAL 1782
             I K +IG  VS KDK+KFL+NTL DL+D KE IY  LDAWVA E+ FP  SLK++L+AL
Sbjct: 116  HIHKFQIGENVSKKDKIKFLVNTLSDLKDSKEAIYGALDAWVAWERNFPIASLKQVLLAL 175

Query: 1781 EREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVP 1602
            E+E QWHRV+QVIKWMLSKGQG T+GTY QLIRAL+ DHR EEAH  W +K+G ++HSVP
Sbjct: 176  EKEQQWHRVIQVIKWMLSKGQGNTLGTYRQLIRALDMDHRAEEAHNFWMKKIGTDLHSVP 235

Query: 1601 KELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVL 1422
             +LC LMISIYY+NNM +RLVKLFK LEA+DRKPP K++VQKVADAY+ LG  EE++ +L
Sbjct: 236  WQLCSLMISIYYRNNMLDRLVKLFKGLEAFDRKPPEKAIVQKVADAYEILGRPEEKERIL 295

Query: 1421 EKYNYLLIEKSKEHPRKSEKAA*KKVDKSGTR 1326
            +KYN+L  E  K  P++S+KA+ KK  KSG R
Sbjct: 296  DKYNHLFTETWKGKPKRSQKASQKKTRKSGER 327


>emb|CBI39461.3| unnamed protein product [Vitis vinifera]
          Length = 296

 Score =  283 bits (725), Expect = 4e-73
 Identities = 147/258 (56%), Positives = 177/258 (68%)
 Frame = -2

Query: 2099 QCCRSCYSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGK 1920
            Q   S YST  Q      S   E+       NN P   +   KD + + K +IG  VS K
Sbjct: 23   QTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNNQPMYHD-SGKDAASVHKHQIGENVSRK 81

Query: 1919 DKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVIK 1740
            DK+ FL+ TLLDL+D KE +Y  LDAWVA EQ FP  SLKR+LI LE+E QWHRV+QV+K
Sbjct: 82   DKINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASLKRVLITLEKEQQWHRVIQVVK 141

Query: 1739 WMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQN 1560
            WMLSKGQG TMGTYGQLIRAL+ DHR EEAH  W +K+G ++HSVP  LC  MIS+YY+N
Sbjct: 142  WMLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIGTDLHSVPWHLCHRMISVYYRN 201

Query: 1559 NMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKEH 1380
            NM E LVKLFK LEA+DRKP  K VV+KVADAY+ LGLLEE++ + EKY+YL  E     
Sbjct: 202  NMLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLLEEKERIFEKYDYLFTETVAGK 261

Query: 1379 PRKSEKAA*KKVDKSGTR 1326
            P+KS+K   +K  KSG R
Sbjct: 262  PKKSKKFLSEK-KKSGRR 278


>ref|XP_002269673.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Vitis vinifera]
            gi|731390622|ref|XP_010650427.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Vitis vinifera]
          Length = 300

 Score =  283 bits (725), Expect = 4e-73
 Identities = 147/258 (56%), Positives = 177/258 (68%)
 Frame = -2

Query: 2099 QCCRSCYSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGK 1920
            Q   S YST  Q      S   E+       NN P   +   KD + + K +IG  VS K
Sbjct: 27   QTLASSYSTFTQTQMSDTSNVGEVAFLGGQCNNQPMYHD-SGKDAASVHKHQIGENVSRK 85

Query: 1919 DKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVIK 1740
            DK+ FL+ TLLDL+D KE +Y  LDAWVA EQ FP  SLKR+LI LE+E QWHRV+QV+K
Sbjct: 86   DKINFLVTTLLDLKDSKEAVYGALDAWVAWEQNFPIASLKRVLITLEKEQQWHRVIQVVK 145

Query: 1739 WMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQN 1560
            WMLSKGQG TMGTYGQLIRAL+ DHR EEAH  W +K+G ++HSVP  LC  MIS+YY+N
Sbjct: 146  WMLSKGQGTTMGTYGQLIRALDMDHRAEEAHEFWVKKIGTDLHSVPWHLCHRMISVYYRN 205

Query: 1559 NMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKEH 1380
            NM E LVKLFK LEA+DRKP  K VV+KVADAY+ LGLLEE++ + EKY+YL  E     
Sbjct: 206  NMLENLVKLFKGLEAFDRKPQDKLVVKKVADAYEMLGLLEEKERIFEKYDYLFTETVAGK 265

Query: 1379 PRKSEKAA*KKVDKSGTR 1326
            P+KS+K   +K  KSG R
Sbjct: 266  PKKSKKFLSEK-KKSGRR 282


>ref|XP_010923026.1| PREDICTED: uncharacterized protein LOC105046210 [Elaeis guineensis]
          Length = 656

 Score =  279 bits (713), Expect = 9e-72
 Identities = 141/256 (55%), Positives = 182/256 (71%), Gaps = 10/256 (3%)
 Frame = -2

Query: 2096 CCRSC----------YSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKR 1947
            CCRS           YS   +    ++  Q+ +K     +++N  I +  + D+  I  +
Sbjct: 377  CCRSYASSAFGFQSRYSDDSRIVEDQVFEQRGLKAKFPSEHDNNTIDQNQRSDIEPIPGQ 436

Query: 1946 EIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQ 1767
             IG+ +S K+K KFL+NTLLDL++ KE +Y TLDAWVA EQ FP   LKR L+ LE++ Q
Sbjct: 437  RIGKNISSKEKTKFLVNTLLDLKNSKEAVYGTLDAWVAWEQNFPLGMLKRALLVLEKQEQ 496

Query: 1766 WHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCD 1587
            WHRVVQV+KW+LSKGQG TMGTY QLIRALEKD+R EEAH IW +K+ +++HSVP   CD
Sbjct: 497  WHRVVQVVKWILSKGQGTTMGTYEQLIRALEKDNRAEEAHKIWVKKISHDLHSVPWRFCD 556

Query: 1586 LMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNY 1407
            LM+SIYY+NNM ERLVKLFK LE +DRKPP KS+V+KVADAY+ LGLLEE+  +LEKY+ 
Sbjct: 557  LMLSIYYRNNMLERLVKLFKGLEEFDRKPPEKSIVRKVADAYELLGLLEEKNKLLEKYSR 616

Query: 1406 LLIEKSKEHPRKSEKA 1359
            L  + S+E  RKS KA
Sbjct: 617  LFNKSSEECSRKSRKA 632


>ref|XP_002526313.1| conserved hypothetical protein [Ricinus communis]
            gi|223534394|gb|EEF36102.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 300

 Score =  279 bits (713), Expect = 9e-72
 Identities = 145/259 (55%), Positives = 180/259 (69%), Gaps = 1/259 (0%)
 Frame = -2

Query: 2099 QCCRSCYS-TMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSG 1923
            QC    YS TMVQ    ++S +    P  E Q++         +    + K +IG+ VS 
Sbjct: 23   QCSNGRYSSTMVQA---QISNRNTPSPRPEDQDDYKTTCHNSNQSAGGVQKNQIGKNVSR 79

Query: 1922 KDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVI 1743
            K+K+ FL+ TLLDL+D KE +Y  LDAWVA E  FP  SLKR+LI LE+E QWH+VVQVI
Sbjct: 80   KEKIDFLLKTLLDLKDSKEAVYGALDAWVAWEHNFPIASLKRVLILLEKEQQWHKVVQVI 139

Query: 1742 KWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQ 1563
            KWMLSKGQG TMGTYGQLIRAL+ DHR  EAH+ W +K+G ++HSVP +LC  MIS+YY+
Sbjct: 140  KWMLSKGQGNTMGTYGQLIRALDMDHRANEAHMFWLKKIGLDLHSVPWQLCHRMISVYYR 199

Query: 1562 NNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKE 1383
            NNM E LVKLFK LEA+DRKPP KS++QKVADAY+ LG+LEE++ VL+KY  L  E  K 
Sbjct: 200  NNMLESLVKLFKGLEAFDRKPPDKSILQKVADAYEMLGMLEEKERVLQKYKDLFKETEKG 259

Query: 1382 HPRKSEKAA*KKVDKSGTR 1326
             P+KS     KK  KSG R
Sbjct: 260  RPKKSRSTLAKK--KSGER 276


>ref|XP_011015613.1| PREDICTED: pentatricopeptide repeat-containing protein At4g21190-like
            [Populus euphratica]
          Length = 294

 Score =  278 bits (712), Expect = 1e-71
 Identities = 132/207 (63%), Positives = 165/207 (79%)
 Frame = -2

Query: 1952 KREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALERE 1773
            + +IG  VS KDK+KFL+ TLLDL D K+++Y  LDAWVA EQKFP  S+K++LIALE+E
Sbjct: 60   RNQIGDNVSKKDKIKFLITTLLDLNDSKDSVYGALDAWVAWEQKFPIASIKQVLIALEKE 119

Query: 1772 HQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKEL 1593
             QWHR+VQVIKWMLSKGQG TMGTY Q IRAL+ DHR +EAH  W +K+G ++HSVP +L
Sbjct: 120  QQWHRIVQVIKWMLSKGQGTTMGTYSQFIRALDMDHRAKEAHEFWLKKIGRDLHSVPWQL 179

Query: 1592 CDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKY 1413
            C+ MISIYY+NNM E L+KLFK LEA+DR+PP KS+VQKVADAY+ LGLLEE++ VLEKY
Sbjct: 180  CNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADAYEMLGLLEEKERVLEKY 239

Query: 1412 NYLLIEKSKEHPRKSEKAA*KKVDKSG 1332
            N++ +E  K   +K   A+ KK  KSG
Sbjct: 240  NHIFVEAGKGRNKKLRNASSKKNKKSG 266


>ref|XP_006478887.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X1 [Citrus sinensis]
          Length = 288

 Score =  278 bits (712), Expect = 1e-71
 Identities = 137/230 (59%), Positives = 173/230 (75%)
 Frame = -2

Query: 2021 SAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDA 1842
            S E Q  N ++ + P+++ +      IG  V  KDK+ FL+NTLLDL++ KE +Y TLDA
Sbjct: 40   SLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDA 99

Query: 1841 WVATEQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHR 1662
            WVA EQ FP  SLK+ L+ALE+E QWHRVVQVIKWMLSKGQG+TMGT GQLIRAL+ DHR
Sbjct: 100  WVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHR 159

Query: 1661 PEEAHIIWARKVGNNMHSVPKELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVV 1482
             EEAH  W +++G ++HSVP +LC  MI+IYY+NNM ERL+KLFK LEA+DRKPP KS+V
Sbjct: 160  AEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIV 219

Query: 1481 QKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRKSEKAA*KKVDKSG 1332
            Q+VADAY+ LGLLEE++ VLEKY  L  EK K   +KS+ ++ K   K G
Sbjct: 220  QRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKGKKKKG 269


>gb|KDO50539.1| hypothetical protein CISIN_1g047178mg [Citrus sinensis]
          Length = 287

 Score =  277 bits (709), Expect = 3e-71
 Identities = 137/232 (59%), Positives = 173/232 (74%)
 Frame = -2

Query: 2021 SAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDA 1842
            S E Q  N ++ + P+++ +      IG  V  KDK+ FL+NTLLDL++ KE +Y TLDA
Sbjct: 40   SLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDA 99

Query: 1841 WVATEQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHR 1662
            WVA EQ FP  SLK+ L+ALE+E QWHRVVQVIKWMLSKGQG+TMGT GQLIRAL+ DHR
Sbjct: 100  WVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHR 159

Query: 1661 PEEAHIIWARKVGNNMHSVPKELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVV 1482
             EEAH  W +++G ++HSVP +LC  MI+IYY+NNM ERL+KLFK LEA+DRKPP KS+V
Sbjct: 160  AEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIV 219

Query: 1481 QKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRKSEKAA*KKVDKSGTR 1326
            Q+VADAY+ LGLLEE++ VLEKY  L  EK K   +KS+ ++ K      TR
Sbjct: 220  QRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKSKSSSMKGKKSGRTR 271


>ref|XP_006373907.1| hypothetical protein POPTR_0016s10300g [Populus trichocarpa]
            gi|550321203|gb|ERP51704.1| hypothetical protein
            POPTR_0016s10300g [Populus trichocarpa]
          Length = 295

 Score =  277 bits (709), Expect = 3e-71
 Identities = 131/207 (63%), Positives = 165/207 (79%)
 Frame = -2

Query: 1952 KREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALERE 1773
            + +IG  VS KDK+KFL+ TLLDL D K+++Y  LDAWVA EQKFP  S+K++LIALE+E
Sbjct: 60   RNQIGDNVSKKDKIKFLITTLLDLNDSKDSVYGALDAWVAWEQKFPIASIKQVLIALEKE 119

Query: 1772 HQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKEL 1593
             QWHR+VQVIKWMLSKGQG TMGTY Q IRAL+ DHR +EAH  W +K+G ++HSVP +L
Sbjct: 120  QQWHRIVQVIKWMLSKGQGTTMGTYAQFIRALDMDHRAKEAHEFWLKKIGRDLHSVPWQL 179

Query: 1592 CDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKY 1413
            C+ MISIYY+NNM E L+KLFK LEA+DR+PP KS+VQKVAD+Y+ LGLLEE++ VLEKY
Sbjct: 180  CNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADSYEMLGLLEEKERVLEKY 239

Query: 1412 NYLLIEKSKEHPRKSEKAA*KKVDKSG 1332
            N++ +E  K   +K   A+ KK  KSG
Sbjct: 240  NHIFVEAGKGQNKKLRNASSKKNKKSG 266


>ref|XP_006443149.1| hypothetical protein CICLE_v10021498mg [Citrus clementina]
            gi|568850372|ref|XP_006478888.1| PREDICTED:
            pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like isoform X2 [Citrus sinensis]
            gi|557545411|gb|ESR56389.1| hypothetical protein
            CICLE_v10021498mg [Citrus clementina]
          Length = 287

 Score =  277 bits (708), Expect = 3e-71
 Identities = 139/230 (60%), Positives = 174/230 (75%)
 Frame = -2

Query: 2021 SAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDA 1842
            S E Q  N ++ + P+++ +      IG  V  KDK+ FL+NTLLDL++ KE +Y TLDA
Sbjct: 40   SLEGQRTNQSVDQYPERNAASTRNFRIGENVPRKDKINFLVNTLLDLKNSKEDVYGTLDA 99

Query: 1841 WVATEQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHR 1662
            WVA EQ FP  SLK+ L+ALE+E QWHRVVQVIKWMLSKGQG+TMGT GQLIRAL+ DHR
Sbjct: 100  WVAWEQNFPVGSLKKALLALEKEQQWHRVVQVIKWMLSKGQGSTMGTCGQLIRALDMDHR 159

Query: 1661 PEEAHIIWARKVGNNMHSVPKELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVV 1482
             EEAH  W +++G ++HSVP +LC  MI+IYY+NNM ERL+KLFK LEA+DRKPP KS+V
Sbjct: 160  AEEAHKFWEKRIGIDLHSVPWQLCKSMIAIYYRNNMLERLIKLFKGLEAFDRKPPEKSIV 219

Query: 1481 QKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRKSEKAA*KKVDKSG 1332
            Q+VADAY+ LGLLEE++ VLEKY  L  EK K   +KS K++  K  KSG
Sbjct: 220  QRVADAYEVLGLLEEKERVLEKYKDLFTEKEKRSNKKS-KSSSMKGKKSG 268


>ref|XP_011007631.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic-like [Populus euphratica]
          Length = 294

 Score =  275 bits (704), Expect = 1e-70
 Identities = 131/207 (63%), Positives = 163/207 (78%)
 Frame = -2

Query: 1952 KREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALERE 1773
            + +IG  VS KDK+KFL+ TLLDL D K+ +Y  LDAWVA EQKFP  S+K++LIALE+E
Sbjct: 60   RNQIGDNVSKKDKIKFLITTLLDLNDSKDAVYGALDAWVAWEQKFPIASIKQVLIALEKE 119

Query: 1772 HQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKEL 1593
             QWHR+VQVIKWMLSKGQG TMGTY Q IRAL+ DHR +EAH  W +K+G ++HSVP +L
Sbjct: 120  QQWHRIVQVIKWMLSKGQGTTMGTYSQFIRALDMDHRAKEAHEFWLKKIGRDLHSVPWQL 179

Query: 1592 CDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKY 1413
            C+ MISIYY+NNM E L+KLFK LEA+DR+PP KS+VQKVADAY+ LGLL E++ VLEKY
Sbjct: 180  CNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVADAYEMLGLLYEKERVLEKY 239

Query: 1412 NYLLIEKSKEHPRKSEKAA*KKVDKSG 1332
            N++ +E  K   +K   A+ KK  KSG
Sbjct: 240  NHIFVEAGKGRNKKLRNASSKKNKKSG 266


>ref|XP_011463739.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Fragaria vesca subsp. vesca]
          Length = 292

 Score =  275 bits (702), Expect = 2e-70
 Identities = 142/254 (55%), Positives = 180/254 (70%), Gaps = 1/254 (0%)
 Frame = -2

Query: 2102 AQCCRSCYSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSG 1923
            AQ   S YST         +T K    S E Q++N  I+  P+K+    ++ +IG  VS 
Sbjct: 39   AQVLTSSYSTAAHAQLYHHTTGKAAV-SLEDQHSNQGIRHFPEKNAGGENRNQIGWNVSR 97

Query: 1922 KDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVI 1743
            KDKV FL+ TLLDL D KE +Y TLD WVA EQ FP   L+  LIALE+E QWHR++QVI
Sbjct: 98   KDKVNFLVKTLLDLNDSKEAVYGTLDGWVAWEQDFPIGKLRMALIALEKEQQWHRIIQVI 157

Query: 1742 KWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQ 1563
            KWMLSKGQG TMGTYGQLI AL+ D RPEEAH  W +K+G ++H+VP +LC  M+SIYY+
Sbjct: 158  KWMLSKGQGTTMGTYGQLIHALDMDQRPEEAHKFWKKKIGMDLHAVPWQLCKSMMSIYYR 217

Query: 1562 NNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIE-KSK 1386
            NNM E L+KLF+ LEA+DRKPP KS+V+KVADAY+ LG LE+++ VLEKYNYL  E +S+
Sbjct: 218  NNMLENLIKLFEGLEAFDRKPPQKSIVRKVADAYEILGRLEKKERVLEKYNYLFTEDQSR 277

Query: 1385 EHPRKSEKAA*KKV 1344
            + PRK+     KK+
Sbjct: 278  KKPRKALSKEKKKL 291


>ref|XP_004298657.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X2 [Fragaria vesca subsp. vesca]
          Length = 275

 Score =  275 bits (702), Expect = 2e-70
 Identities = 142/254 (55%), Positives = 180/254 (70%), Gaps = 1/254 (0%)
 Frame = -2

Query: 2102 AQCCRSCYSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSG 1923
            AQ   S YST         +T K    S E Q++N  I+  P+K+    ++ +IG  VS 
Sbjct: 22   AQVLTSSYSTAAHAQLYHHTTGKAAV-SLEDQHSNQGIRHFPEKNAGGENRNQIGWNVSR 80

Query: 1922 KDKVKFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVI 1743
            KDKV FL+ TLLDL D KE +Y TLD WVA EQ FP   L+  LIALE+E QWHR++QVI
Sbjct: 81   KDKVNFLVKTLLDLNDSKEAVYGTLDGWVAWEQDFPIGKLRMALIALEKEQQWHRIIQVI 140

Query: 1742 KWMLSKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQ 1563
            KWMLSKGQG TMGTYGQLI AL+ D RPEEAH  W +K+G ++H+VP +LC  M+SIYY+
Sbjct: 141  KWMLSKGQGTTMGTYGQLIHALDMDQRPEEAHKFWKKKIGMDLHAVPWQLCKSMMSIYYR 200

Query: 1562 NNMFERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIE-KSK 1386
            NNM E L+KLF+ LEA+DRKPP KS+V+KVADAY+ LG LE+++ VLEKYNYL  E +S+
Sbjct: 201  NNMLENLIKLFEGLEAFDRKPPQKSIVRKVADAYEILGRLEKKERVLEKYNYLFTEDQSR 260

Query: 1385 EHPRKSEKAA*KKV 1344
            + PRK+     KK+
Sbjct: 261  KKPRKALSKEKKKL 274


>ref|XP_011091155.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Sesamum indicum]
          Length = 277

 Score =  273 bits (697), Expect = 6e-70
 Identities = 135/244 (55%), Positives = 182/244 (74%), Gaps = 1/244 (0%)
 Frame = -2

Query: 2087 SCYSTMVQPWRQKLSTQKEMK-PSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGKDKV 1911
            S YST++     + S + +M+ P+    NN+    ++  KD   + +R+IG  VS KDK+
Sbjct: 27   SSYSTVLHSNSYQWSKEDKMEFPNHRTVNNSTT--QIQIKDFGTVTQRQIGENVSRKDKI 84

Query: 1910 KFLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVIKWML 1731
             FL++TL+DL+D KE +YSTLDAWVA E+ FP  +LK++L+ALE+E QWHR++QVIKWML
Sbjct: 85   SFLVSTLMDLQDSKEAVYSTLDAWVAWERNFPIGALKQVLVALEKEQQWHRIIQVIKWML 144

Query: 1730 SKGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQNNMF 1551
            SKGQG T GTYGQLI+AL+ DHR EEA  IW +K+  ++HSVP +LC LMIS+YY+NNM 
Sbjct: 145  SKGQGTTRGTYGQLIQALDMDHRVEEAQEIWKKKLAFDLHSVPWKLCKLMISVYYRNNML 204

Query: 1550 ERLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRK 1371
            + LVKLFK LEA+DRKPP KS+VQKVADAY+ LGL EE++ +LEKY  L +E S E  +K
Sbjct: 205  DDLVKLFKGLEAFDRKPPEKSIVQKVADAYELLGLPEEKERILEKYKDLFVESSNEKAKK 264

Query: 1370 SEKA 1359
              ++
Sbjct: 265  ISRS 268


>ref|XP_012483420.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic isoform X1 [Gossypium raimondii]
            gi|763766109|gb|KJB33324.1| hypothetical protein
            B456_006G006600 [Gossypium raimondii]
          Length = 304

 Score =  272 bits (696), Expect = 8e-70
 Identities = 137/226 (60%), Positives = 171/226 (75%)
 Frame = -2

Query: 2009 QNNNPAIQELPKKDLSIIDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVAT 1830
            Q NN      PK ++    K +IG+ VS KDK+KFL+ TLLDL+D KE IYS LDAWVA 
Sbjct: 54   QCNNQVANLYPKPNVGGQQKLQIGQNVSRKDKIKFLVTTLLDLKDSKEAIYSALDAWVAW 113

Query: 1829 EQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEA 1650
            EQ FP   LK +++ALE+EHQWHR+VQVIKWMLSKGQG TMGTYGQL+RAL+ D+R +EA
Sbjct: 114  EQNFPIGPLKNVILALEKEHQWHRIVQVIKWMLSKGQGNTMGTYGQLLRALDMDNRADEA 173

Query: 1649 HIIWARKVGNNMHSVPKELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVA 1470
            H  W +KVG ++HSVP +LC LMIS+YY+NNM E LVKLFK LEA+ RKP  KS+VQ+VA
Sbjct: 174  HQFWVKKVGADLHSVPWQLCGLMISVYYRNNMLENLVKLFKGLEAFGRKPTDKSIVQRVA 233

Query: 1469 DAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRKSEKAA*KKVDKSG 1332
            DAY+ LGLLEE++ VLEKY  +  +  K H +KS++ + KK   SG
Sbjct: 234  DAYEMLGLLEEKERVLEKYEDICTKIEKGH-KKSKQTSLKKKKDSG 278


>ref|XP_007048491.1| Uncharacterized protein TCM_046974 [Theobroma cacao]
            gi|508700752|gb|EOX92648.1| Uncharacterized protein
            TCM_046974 [Theobroma cacao]
          Length = 285

 Score =  269 bits (688), Expect = 7e-69
 Identities = 138/228 (60%), Positives = 171/228 (75%)
 Frame = -2

Query: 2009 QNNNPAIQELPKKDLSIIDKREIGRTVSGKDKVKFLMNTLLDLEDCKETIYSTLDAWVAT 1830
            Q  N A     K ++  I K +IG+ VS KDK+KFL+ TLLDL+D KE +Y  LDAWVA 
Sbjct: 54   QGGNQAENLSSKPNIGGILKHQIGQNVSRKDKIKFLVTTLLDLKDGKEAVYGALDAWVAW 113

Query: 1829 EQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEA 1650
            EQ FP   LK +++ALE+EHQWHRVVQVIKWMLSKGQG TMGTY QLIRAL+ D+R EEA
Sbjct: 114  EQNFPIGPLKNVILALEKEHQWHRVVQVIKWMLSKGQGNTMGTYVQLIRALDMDNRAEEA 173

Query: 1649 HIIWARKVGNNMHSVPKELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVA 1470
            H  W +KV  ++HSVP +LC  MIS+YY+NNM E LVKLFK LEA+DRKPP KS+VQ+VA
Sbjct: 174  HQFWLKKVSADLHSVPWQLCRQMISVYYRNNMLENLVKLFKGLEAFDRKPPEKSIVQRVA 233

Query: 1469 DAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRKSEKAA*KKVDKSGTR 1326
            DAY+ LGLLEE++ VLEKY  +  +  K H +KS++A+ K+   SG R
Sbjct: 234  DAYEMLGLLEEKERVLEKYKDIPTKTDKVH-KKSKQASSKRKKNSGRR 280


>ref|XP_008455250.1| PREDICTED: pentatricopeptide repeat-containing protein At4g18975,
            chloroplastic [Cucumis melo]
          Length = 302

 Score =  268 bits (684), Expect = 2e-68
 Identities = 142/261 (54%), Positives = 177/261 (67%), Gaps = 10/261 (3%)
 Frame = -2

Query: 2087 SCYSTMVQPWRQKLSTQKEMKPSAEVQNNNPAIQELPKKDLSIIDKREIGRTVSGKDKVK 1908
            S Y TM+Q    K    K+ K + +V N+  A+  + ++++  I K +IG  VS KDK+ 
Sbjct: 38   SWYCTMIQDQMYKQLADKDRK-NKDVDNSK-ALGHISEQNIGDIRKHKIGENVSRKDKIS 95

Query: 1907 FLMNTLLDLEDCKETIYSTLDAWVATEQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLS 1728
            FL+NTLLDL D KE +Y  LDAWVA EQ FP  SLK +L ALE+E QWHR+VQVIKWMLS
Sbjct: 96   FLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIASLKHVLAALEKEQQWHRIVQVIKWMLS 155

Query: 1727 KGQGATMGTYGQLIRALEKDHRPEEAHIIWARKVGNNMHSVPKELCDLMISIYYQNNMFE 1548
            KGQG TM  YGQLIRAL+ DHR EEAH  W  K+G+++HSVP +LC  MI+IYY+N M E
Sbjct: 156  KGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMIAIYYRNKMLE 215

Query: 1547 RLVKLFKCLEAYDRKPPGKSVVQKVADAYKSLGLLEEQKLVLEKYNYLLIEK-------- 1392
             LVKLFK LEA+ RKPP KS+VQ+VADA + LGLLEE++ VL KY YL  EK        
Sbjct: 216  DLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKQESMKKYK 275

Query: 1391 --SKEHPRKSEKAA*KKVDKS 1335
              S E P++  K+     D S
Sbjct: 276  RVSFEKPKRKRKSTKGSEDNS 296


>ref|XP_002323526.2| hypothetical protein POPTR_0016s10300g [Populus trichocarpa]
            gi|550321202|gb|EEF05287.2| hypothetical protein
            POPTR_0016s10300g [Populus trichocarpa]
          Length = 312

 Score =  267 bits (682), Expect = 4e-68
 Identities = 135/238 (56%), Positives = 170/238 (71%), Gaps = 19/238 (7%)
 Frame = -2

Query: 1952 KREIGRTVSGKDKVKFLMNT-------------------LLDLEDCKETIYSTLDAWVAT 1830
            + +IG  VS KDK+KFL+ T                   LLDL D K+++Y  LDAWVA 
Sbjct: 60   RNQIGDNVSKKDKIKFLITTVSTQNPNYQSLFICMVVFTLLDLNDSKDSVYGALDAWVAW 119

Query: 1829 EQKFPTVSLKRMLIALEREHQWHRVVQVIKWMLSKGQGATMGTYGQLIRALEKDHRPEEA 1650
            EQKFP  S+K++LIALE+E QWHR+VQVIKWMLSKGQG TMGTY Q IRAL+ DHR +EA
Sbjct: 120  EQKFPIASIKQVLIALEKEQQWHRIVQVIKWMLSKGQGTTMGTYAQFIRALDMDHRAKEA 179

Query: 1649 HIIWARKVGNNMHSVPKELCDLMISIYYQNNMFERLVKLFKCLEAYDRKPPGKSVVQKVA 1470
            H  W +K+G ++HSVP +LC+ MISIYY+NNM E L+KLFK LEA+DR+PP KS+VQKVA
Sbjct: 180  HEFWLKKIGRDLHSVPWQLCNRMISIYYRNNMLENLIKLFKGLEAFDRQPPEKSIVQKVA 239

Query: 1469 DAYKSLGLLEEQKLVLEKYNYLLIEKSKEHPRKSEKAA*KKVDKSGTRCIY*FAWLSH 1296
            D+Y+ LGLLEE++ VLEKYN++ +E  K   +K   A+ KK  KSG       A+LSH
Sbjct: 240  DSYEMLGLLEEKERVLEKYNHIFVEAGKGQNKKLRNASSKKNKKSGK------AFLSH 291

