BLASTX nr result
ID: Catharanthus23_contig00025374
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00025374
         (2133 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containi...   683   0.0
ref|XP_004237613.1| PREDICTED: pentatricopeptide repeat-containi...   673   0.0
ref|XP_006354698.1| PREDICTED: pentatricopeptide repeat-containi...   663   0.0
gb|EOX92242.1| Tetratricopeptide repeat (TPR)-like superfamily p...   656   0.0
gb|EMJ13220.1| hypothetical protein PRUPE_ppa004899mg [Prunus pe...   650   0.0
ref|XP_004504032.1| PREDICTED: pentatricopeptide repeat-containi...   649   0.0
ref|XP_004169853.1| PREDICTED: pentatricopeptide repeat-containi...   649   0.0
gb|EXB80843.1| hypothetical protein L484_020101 [Morus notabilis]     648   0.0
ref|XP_004143565.1| PREDICTED: pentatricopeptide repeat-containi...   646   0.0
ref|XP_002532046.1| pentatricopeptide repeat-containing protein,...   639   e-180
ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containi...   636   e-179
ref|XP_006428072.1| hypothetical protein CICLE_v10025440mg [Citr...   635   e-179
gb|EPS58459.1| hypothetical protein M569_16353, partial [Genlise...   633   e-178
ref|XP_002306340.2| pentatricopeptide repeat-containing family p...   628   e-177
ref|XP_006585305.1| PREDICTED: pentatricopeptide repeat-containi...   626   e-176
ref|XP_004296690.1| PREDICTED: pentatricopeptide repeat-containi...   623   e-176
ref|XP_003630096.1| Pentatricopeptide repeat-containing protein ...   623   e-176
gb|ESW31740.1| hypothetical protein PHAVU_002G263600g [Phaseolus...   616   e-173
ref|XP_006411755.1| hypothetical protein EUTSA_v10024997mg [Eutr...
608 e-171 ref|XP_006843693.1| hypothetical protein AMTR_s00007p00203740 [A... 601 e-169 >ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Vitis vinifera] gi|296082481|emb|CBI21486.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 683 bits (1763), Expect = 0.0 Identities = 339/462 (73%), Positives = 392/462 (84%) Frame = -3 Query: 2047 SPTTSTFLLGISEFYPPNSCNSCPWRSMKASIYCLGGMSTKPRKKWGSKTNKDASEADEL 1868 S + TF L S FY P + P + S + +ST+PR+K G K +K SE +EL Sbjct: 18 STLSPTFTLPSSRFYKPTRLHLPP----RPSTTVVSCVSTRPRRKPGPKPDK--SEVEEL 71 Query: 1867 VASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYV 1688 V LM+NF ++PL++TLNKYVK IRTEHCF LFEELGKT +WLQCLEVFRWMQKQRWY+ Sbjct: 72 VRVLMKNFGGERPLISTLNKYVKVIRTEHCFRLFEELGKTDKWLQCLEVFRWMQKQRWYI 131 Query: 1687 ADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFAK 1508 ADNGVYSKLIS+MGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDK+KA K Sbjct: 132 ADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKSKALIK 191 Query: 1507 ALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTFNGVM 1328 ALGYFDKMKG+ERC+PN+VTYNILLRA AQ++NV Q N LFK+L+ESI++PD FTFNGVM Sbjct: 192 ALGYFDKMKGMERCKPNIVTYNILLRAFAQAQNVNQANALFKELNESIVSPDIFTFNGVM 251 Query: 1327 DAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEK 1148 DAYGK GMI+EME VLS+MKS Q KPDIITFN+LIDSYG++QEF+KMEQVFKSLLRSKEK Sbjct: 252 DAYGKNGMIKEMESVLSRMKSNQCKPDIITFNVLIDSYGRRQEFDKMEQVFKSLLRSKEK 311 Query: 1147 PTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSRAREV 968 PTLPTFNSMITNYGKARL+EKAE+V++KMTDMGY+P+ ITYESLI MYG+CDC+SRARE+ Sbjct: 312 PTLPTFNSMITNYGKARLKEKAENVFKKMTDMGYAPNFITYESLIMMYGFCDCISRAREI 371 Query: 967 FDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAYT 788 FD++M S+KE KVSTLNAMLEVYC NGLPMEA+ L + A R P SSTYKLLYKAYT Sbjct: 372 FDEMMASKKEMKVSTLNAMLEVYCMNGLPMEADLLLERARKNRPFP-GSSTYKLLYKAYT 430 Query: 787 KAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSSQK 662 KA KEL++ LL MD DGI+PNKRFFL+ALGA GSS +SQ+ Sbjct: 431 
KADQKELLEKLLKLMDSDGILPNKRFFLEALGAFGSSPASQE 472 >ref|XP_004237613.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Solanum lycopersicum] Length = 478 Score = 673 bits (1736), Expect = 0.0 Identities = 329/467 (70%), Positives = 393/467 (84%), Gaps = 3/467 (0%) Frame = -3 Query: 2041 TTSTFLLGISEFYPPNSCNSCPWRSM--KASIYCLGG-MSTKPRKKWGSKTNKDASEADE 1871 TT F S P ++ + PW ++ K +Y + +ST+P + S +SEA E Sbjct: 2 TTLQFSSSSSVQLPSSTTTTTPWLNLQKKRPLYSIVRCVSTRPGGR-KSGYGSSSSEAQE 60 Query: 1870 LVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWY 1691 LV +MRNF++K+PLV+TL+KYVK +RTEHCFLLFE+LGKT WLQCLEVFRWMQKQRWY Sbjct: 61 LVTLVMRNFSDKKPLVSTLDKYVKLVRTEHCFLLFEQLGKTDNWLQCLEVFRWMQKQRWY 120 Query: 1690 VADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFA 1511 +ADNGVYSKLIS+MGKKGQ RMAMWLFSEMRNSGCRPDTSVYNA+I+AHLHSRDK+KA Sbjct: 121 IADNGVYSKLISVMGKKGQIRMAMWLFSEMRNSGCRPDTSVYNAVISAHLHSRDKSKALT 180 Query: 1510 KALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTFNGV 1331 KA+GYF+KMK +ERC P++VTYNILLRA AQ++NVEQV+ L KDLDESI+TPD FTFNG+ Sbjct: 181 KAMGYFEKMKEMERCSPSIVTYNILLRAFAQAKNVEQVDALLKDLDESIVTPDIFTFNGL 240 Query: 1330 MDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKE 1151 MDAYGK GMI EME VLS+MKS +LKPDIITFN+LIDSYGKKQ+F+KMEQVFKSLL+SKE Sbjct: 241 MDAYGKNGMINEMEHVLSRMKSNKLKPDIITFNILIDSYGKKQDFQKMEQVFKSLLQSKE 300 Query: 1150 KPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSRARE 971 KPT+PTFNSMITNYGKARLREK+E V +KM D+GY PS ITYE LI MYG+CDCVS+ARE Sbjct: 301 KPTIPTFNSMITNYGKARLREKSELVLEKMIDLGYKPSYITYECLIVMYGHCDCVSKARE 360 Query: 970 VFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAY 791 +FD++M+S KEKK STLN+ML+ YC NGLPMEA LF+S H+ + PIDSSTYKLLYKAY Sbjct: 361 LFDRVMESEKEKKASTLNSMLDAYCMNGLPMEAHLLFESIHSAKAFPIDSSTYKLLYKAY 420 Query: 790 TKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSSQKGTSD 650 TKA MKELV+ LL+YMD+DGIIPNK+FFLDALGA GS+ ++++ D Sbjct: 421 TKADMKELVQKLLTYMDEDGIIPNKKFFLDALGAFGSAPTNRRAVGD 467 >ref|XP_006354698.1| PREDICTED: 
pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like isoform X1 [Solanum tuberosum] gi|565376411|ref|XP_006354699.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like isoform X2 [Solanum tuberosum] gi|565376413|ref|XP_006354700.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like isoform X3 [Solanum tuberosum] Length = 457 Score = 663 bits (1711), Expect = 0.0 Identities = 321/449 (71%), Positives = 384/449 (85%), Gaps = 6/449 (1%) Frame = -3 Query: 1978 PWRSMKAS------IYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTT 1817 PW +++ I+C+ ST+P + S +SEA ELV +MRNF++K+PLV+T Sbjct: 2 PWLNLQKKRPVFSIIHCV---STRPGGR-KSGYGSSSSEAQELVTLVMRNFSDKKPLVST 57 Query: 1816 LNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKG 1637 L+KYVK +RTEHCFLLFE+LGKT WLQCLEVFRWMQKQRWY+ADNGVYSKLIS+MGKKG Sbjct: 58 LDKYVKLVRTEHCFLLFEQLGKTDNWLQCLEVFRWMQKQRWYIADNGVYSKLISVMGKKG 117 Query: 1636 QTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPN 1457 Q RMAMWLFSEMRNSGCRPDTSVYNA+I+AHLHSRDK+KA KA+GYF+KMKG+ERC P+ Sbjct: 118 QIRMAMWLFSEMRNSGCRPDTSVYNAVISAHLHSRDKSKALTKAMGYFEKMKGMERCSPS 177 Query: 1456 VVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLS 1277 +VTYNILLRA AQ++NVEQV+ L KDLDESI+TPD FTFNG+MDAYGK GMI EME +LS Sbjct: 178 IVTYNILLRAFAQAKNVEQVDALLKDLDESIVTPDIFTFNGLMDAYGKNGMINEMEHILS 237 Query: 1276 QMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKAR 1097 +MKS QLKPDIITFN+LIDSYGKKQ+F+KMEQVFKSLL+SKEKPT+PTFNSMITNYGKAR Sbjct: 238 RMKSNQLKPDIITFNILIDSYGKKQDFQKMEQVFKSLLQSKEKPTIPTFNSMITNYGKAR 297 Query: 1096 LREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLN 917 LREK+E V +KM D+GY PS ITYE LI MYG+CDCV++ARE+FD++++S EKK STLN Sbjct: 298 LREKSELVLEKMIDLGYKPSYITYECLIVMYGHCDCVAKARELFDRVIESETEKKASTLN 357 Query: 916 AMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDK 737 +ML+ YC NGLPMEA LF+S H+ + PIDSSTYKLLYKAYTKA MKELV+ LL+ MD+ Sbjct: 358 
SMLDAYCMNGLPMEAHLLFESIHSAKVFPIDSSTYKLLYKAYTKADMKELVQKLLTCMDE 417 Query: 736 DGIIPNKRFFLDALGAIGSSDSSQKGTSD 650 DGIIPNK+FFLDALGA GS+ ++++ D Sbjct: 418 DGIIPNKKFFLDALGAFGSAPTNRREVGD 446 >gb|EOX92242.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] gi|508700347|gb|EOX92243.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao] Length = 488 Score = 656 bits (1692), Expect = 0.0 Identities = 327/448 (72%), Positives = 382/448 (85%), Gaps = 1/448 (0%) Frame = -3 Query: 1960 ASIYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEH 1781 A I C+ +T+PR+K GS + + EA ELV LMR+F++K+PLV TLN+YV+ +R EH Sbjct: 39 ARISCI---TTRPRRKTGS-SKAEEPEALELVRVLMRSFSDKEPLVKTLNRYVRVVRCEH 94 Query: 1780 CFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEM 1601 CFLLFEELGKT +WLQCLEVFRWMQKQRWY+ADNG+YSKLI++MGKKGQTRMAMWLFSEM Sbjct: 95 CFLLFEELGKTDKWLQCLEVFRWMQKQRWYIADNGIYSKLITVMGKKGQTRMAMWLFSEM 154 Query: 1600 RNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAA 1421 RNSGCRPD SVYNALITAHLHSRDK+KA KA+GYF+KMKG+ERC+PN+VTYNILLRA + Sbjct: 155 RNSGCRPDVSVYNALITAHLHSRDKSKALDKAMGYFNKMKGMERCKPNIVTYNILLRAFS 214 Query: 1420 QSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDII 1241 Q+RNV+QVN LFKDL ESII PD +T+NGVMDAYGK GMIREME VLS+MKS Q KPD I Sbjct: 215 QARNVDQVNALFKDLAESIIAPDIYTYNGVMDAYGKNGMIREMESVLSRMKSNQCKPDTI 274 Query: 1240 TFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKM 1061 TFN+LIDSYGKKQEF+KMEQVFKSLLRSK+KPTLPTFNSMI NYGKARL+EKAE V++KM Sbjct: 275 TFNVLIDSYGKKQEFDKMEQVFKSLLRSKQKPTLPTFNSMIINYGKARLKEKAEHVFKKM 334 Query: 1060 TDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLP 881 TDM Y PS ITYESLI MYG+CDCVSRARE+FD +++S KE +VSTLNAMLEVYCRNGL Sbjct: 335 TDMKYVPSFITYESLIMMYGFCDCVSRAREIFDGIVNSGKEMRVSTLNAMLEVYCRNGLH 394 Query: 880 MEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLD 701 MEA+ LF++AH I DSSTYKLLYKAYTKA MK+L++ L+ M+KDGI+PNKRFFL+ Sbjct: 395 
MEADRLFENAHKMGVIR-DSSTYKLLYKAYTKANMKDLMQKLVKQMEKDGIVPNKRFFLE 453 Query: 700 ALGAIGSSDSSQKGTS-DIGDKVANKSK 620 AL A GS +S S IGD+ +K Sbjct: 454 ALEAFGSLPASPDSVSATIGDRPEKSAK 481 >gb|EMJ13220.1| hypothetical protein PRUPE_ppa004899mg [Prunus persica] Length = 486 Score = 650 bits (1676), Expect = 0.0 Identities = 327/456 (71%), Positives = 382/456 (83%), Gaps = 4/456 (0%) Frame = -3 Query: 2023 LGISEFYPPNSCNSCPWRS----MKASIYCLGGMSTKPRKKWGSKTNKDASEADELVASL 1856 L ++ + NS + PW K + + +ST+P++K G+KT + + E+V L Sbjct: 15 LAFAQIFSQNSNLTQPWLPHVLLRKRPVTRISCVSTRPKRKPGTKT--EDPDVREVVRML 72 Query: 1855 MRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNG 1676 MR+F++K+PL+ TLNKYV+ +RTEHCFLLFEELGK+ +WLQCLEVFRWMQKQRWYVADNG Sbjct: 73 MRSFSDKEPLLKTLNKYVRIVRTEHCFLLFEELGKSDEWLQCLEVFRWMQKQRWYVADNG 132 Query: 1675 VYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGY 1496 VYSKLIS+MGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALI+AHL+S+DKAKA KAL Y Sbjct: 133 VYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALISAHLNSKDKAKALDKALRY 192 Query: 1495 FDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYG 1316 FDKMKG+ERC+PN+VTYNILLRA AQSRNVE+VN LFKDLDESI +PD +T+NGVMDAYG Sbjct: 193 FDKMKGMERCQPNIVTYNILLRAFAQSRNVEKVNSLFKDLDESIASPDIYTYNGVMDAYG 252 Query: 1315 KTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLP 1136 K G IREME VLS MKS Q KPDIITFNLLIDSYGKKQ+F+KMEQVFKSL+RSKEKPTLP Sbjct: 253 KNGNIREMESVLSHMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLVRSKEKPTLP 312 Query: 1135 TFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSRAREVFDQL 956 TFNSMI NYGKARL+EKAE V++KM DM Y+PS ITYESLI MYG+CD VS+AREVFD+L Sbjct: 313 TFNSMIINYGKARLKEKAEDVFKKMIDMKYTPSFITYESLIMMYGFCDSVSKAREVFDRL 372 Query: 955 MDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGM 776 DS KE KVSTLNAML+VYC NGLP+EA+ LF + ++ P + STYKLLYKAYTKA M Sbjct: 373 ADSGKELKVSTLNAMLDVYCMNGLPVEADKLFVNGNSIGVRP-NVSTYKLLYKAYTKANM 431 Query: 775 KELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSS 668 KEL++ LL MDKDGI+PNKRFFL+ALGA SS S Sbjct: 432 KELLEKLLKCMDKDGIVPNKRFFLEALGAFFSSPGS 467 
>ref|XP_004504032.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like isoform X1 [Cicer arietinum] gi|502140047|ref|XP_004504033.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like isoform X2 [Cicer arietinum] Length = 477 Score = 649 bits (1674), Expect = 0.0 Identities = 315/437 (72%), Positives = 373/437 (85%) Frame = -3 Query: 1930 TKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGK 1751 + P + K + SE ELV L R +EK+PLVTTLNKYVK +RTEHCFLLFEELGK Sbjct: 35 SNPTRLNRKKITSERSETQELVRLLTRKISEKEPLVTTLNKYVKLVRTEHCFLLFEELGK 94 Query: 1750 TGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTS 1571 +WLQCLEVFRWMQ+QRWY+ADNGVYSKLIS+MGKKGQ R+AMWLFSEMRN+GCRPDTS Sbjct: 95 HDKWLQCLEVFRWMQRQRWYIADNGVYSKLISVMGKKGQIRLAMWLFSEMRNTGCRPDTS 154 Query: 1570 VYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNI 1391 VYNALI+AHLH+R+K+ A AKALGYF+KMKGIERC+PN+VTYNILLRA AQSRNV+QVN Sbjct: 155 VYNALISAHLHTRNKSNALAKALGYFEKMKGIERCKPNIVTYNILLRAFAQSRNVDQVNS 214 Query: 1390 LFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYG 1211 LFKDLD+S+++PD +TFNGVMDAYGK GMIREME VL++MKS Q+KPD+IT+NLLIDSYG Sbjct: 215 LFKDLDDSVVSPDIYTFNGVMDAYGKNGMIREMETVLARMKSNQVKPDLITYNLLIDSYG 274 Query: 1210 KKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSII 1031 KKQ+F+KMEQVFKSLLRSKEKP+LPTFNSMI NYGKARL++KAE+V+Q MTDMGY+PS + Sbjct: 275 KKQQFDKMEQVFKSLLRSKEKPSLPTFNSMILNYGKARLKDKAENVFQNMTDMGYTPSFV 334 Query: 1030 TYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSA 851 T+ESLI MYG+CDCVS+A E+FD L++S+ KVSTLNAML+VYC NGLP EA++LF A Sbjct: 335 THESLIYMYGFCDCVSKAVELFDGLIESKVPMKVSTLNAMLDVYCINGLPQEADSLFARA 394 Query: 850 HATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDS 671 P D+STYKLLYKAYTKA KEL+ LL +MDKDG+IPNKRFFLDALGAIGSS + Sbjct: 395 RRVNIFP-DASTYKLLYKAYTKANSKELLDKLLKHMDKDGVIPNKRFFLDALGAIGSS-T 452 Query: 670 SQKGTSDIGDKVANKSK 620 + G+++ G N K Sbjct: 453 EKSGSANAGTDSKNPQK 469 >ref|XP_004169853.1| PREDICTED: 
pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Cucumis sativus] Length = 494 Score = 649 bits (1673), Expect = 0.0 Identities = 317/436 (72%), Positives = 376/436 (86%) Frame = -3 Query: 1972 RSMKASIYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFI 1793 +++ + C+ ST+P +K+G KT D SEA+ELV ++RNF++K+PL+ TL+KYV+ + Sbjct: 35 KAVSTRVVCI---STRPSRKFGVKT--DRSEAEELVRGIIRNFSDKEPLLKTLDKYVRVM 89 Query: 1792 RTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWL 1613 RTEHCFLLFEELGK +WL+CLEVFRWMQKQRWY+ADNGVYSKLISIMGKKGQ RMAMWL Sbjct: 90 RTEHCFLLFEELGKRDKWLECLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAMWL 149 Query: 1612 FSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILL 1433 FSEMRNSGCRPDTSVYNALITAHLHS+DKAKA K L YF+KMKG+ERC+PN+VTYNIL Sbjct: 150 FSEMRNSGCRPDTSVYNALITAHLHSKDKAKALVKVLSYFEKMKGMERCKPNIVTYNILT 209 Query: 1432 RAAAQSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLK 1253 RA AQ+ V+QVN LFKDLDES+++ D +T+NGVMDAYGK G I+EMEL+L++MKS Q+K Sbjct: 210 RAFAQAAKVDQVNTLFKDLDESVVSADIYTYNGVMDAYGKNGNIKEMELMLARMKSNQIK 269 Query: 1252 PDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESV 1073 PDII+FNLLIDSYGKKQ F+KMEQVFKSLLRSKE+PTLPTFNSMITNYGKARLREKAE V Sbjct: 270 PDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSKERPTLPTFNSMITNYGKARLREKAEEV 329 Query: 1072 YQKMTDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCR 893 ++KM DMGY PS +T ESLI MYG+CDCVS+ARE+FD +++S KE +VSTLNAML+VYC Sbjct: 330 FRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAREIFDGMVNSGKEVRVSTLNAMLDVYCI 389 Query: 892 NGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKR 713 NGLP+EA+ LF+SA R P DS+TYKLLYKAYTKA KEL++ LL MDK GIIPNKR Sbjct: 390 NGLPLEADLLFESAGNMRVFP-DSTTYKLLYKAYTKADKKELLEKLLKNMDKAGIIPNKR 448 Query: 712 FFLDALGAIGSSDSSQ 665 FFLDALG IGSS +Q Sbjct: 449 FFLDALGTIGSSQENQ 464 >gb|EXB80843.1| hypothetical protein L484_020101 [Morus notabilis] Length = 485 Score = 648 bits (1672), Expect = 0.0 Identities = 320/431 (74%), Positives = 376/431 (87%), Gaps = 1/431 (0%) Frame = -3 Query: 1957 
SIYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNF-TEKQPLVTTLNKYVKFIRTEH 1781 +I C+ S K ++K + +D SEA +LV LMR+F ++K+PLV TLNKYVK +RTEH Sbjct: 38 TISCVSTQS-KSKRKLTTTAKRDDSEALDLVRLLMRSFNSDKEPLVKTLNKYVKTVRTEH 96 Query: 1780 CFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEM 1601 CFLLFEELG++ +WLQCLEVFRWMQKQRWY+ADNGVYSKLIS+MGKKGQTRMAMWLFSEM Sbjct: 97 CFLLFEELGRSDKWLQCLEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEM 156 Query: 1600 RNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAA 1421 RNS CRPDTSVYNALITAHLHS DK KA KA+GYF+KMKGIERC+PN+VTYNILLRA A Sbjct: 157 RNSSCRPDTSVYNALITAHLHSSDKVKALDKAIGYFEKMKGIERCKPNIVTYNILLRAFA 216 Query: 1420 QSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDII 1241 Q+RNV++VN LFKDLD SI++PD +T+NGVMDAYGK GMIREME VLS MKS +KPDII Sbjct: 217 QARNVQRVNSLFKDLDGSIVSPDIYTYNGVMDAYGKNGMIREMESVLSLMKSNHIKPDII 276 Query: 1240 TFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKM 1061 TFNLLIDSYGKKQEF+KMEQVFKSLLRSKE+PTLPTFNSMI NYGKAR +KAE+V++KM Sbjct: 277 TFNLLIDSYGKKQEFDKMEQVFKSLLRSKERPTLPTFNSMIINYGKARRLDKAENVFEKM 336 Query: 1060 TDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLP 881 TDMGY+PS ITYESLI MYGYCDCVSRA+++F++L++S K+ KVSTLNAML+VYC NGLP Sbjct: 337 TDMGYTPSFITYESLIMMYGYCDCVSRAQDIFNRLVESGKDIKVSTLNAMLDVYCMNGLP 396 Query: 880 MEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLD 701 MEA LF+ + +P +SSTYKLLYKAYTKA MKEL+ NLL +M+KDGI+PNKRFFL+ Sbjct: 397 MEAHKLFEDSKNIGVVP-NSSTYKLLYKAYTKANMKELLGNLLRHMEKDGIVPNKRFFLE 455 Query: 700 ALGAIGSSDSS 668 ALGA SS++S Sbjct: 456 ALGAFCSSNAS 466 >ref|XP_004143565.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Cucumis sativus] Length = 528 Score = 646 bits (1667), Expect = 0.0 Identities = 316/432 (73%), Positives = 374/432 (86%) Frame = -3 Query: 1972 RSMKASIYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFI 1793 +++ + C+ ST+P +K+G KT D SEA+ELV ++RNF++K+PL+ TL+KYV+ + Sbjct: 35 KAVSTRVVCI---STRPSRKFGVKT--DRSEAEELVRGIIRNFSDKEPLLKTLDKYVRVM 89 Query: 1792 
RTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWL 1613 RTEHCFLLFEELGK +WL+CLEVFRWMQKQRWY+ADNGVYSKLISIMGKKGQ RMAMWL Sbjct: 90 RTEHCFLLFEELGKRDKWLECLEVFRWMQKQRWYIADNGVYSKLISIMGKKGQIRMAMWL 149 Query: 1612 FSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILL 1433 FSEMRNSGCRPDTSVYNALITAHLHS+DKAKA K L YF+KMKG+ERC+PN+VTYNIL Sbjct: 150 FSEMRNSGCRPDTSVYNALITAHLHSKDKAKALVKVLSYFEKMKGMERCKPNIVTYNILT 209 Query: 1432 RAAAQSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLK 1253 RA AQ+ V+QVN LFKDLDES+++ D +T+NGVMDAYGK G I+EMEL+L++MKS Q+K Sbjct: 210 RAFAQAAKVDQVNTLFKDLDESVVSADIYTYNGVMDAYGKNGNIKEMELMLARMKSNQIK 269 Query: 1252 PDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESV 1073 PDII+FNLLIDSYGKKQ F+KMEQVFKSLLRSKE+PTLPTFNSMITNYGKARLREKAE V Sbjct: 270 PDIISFNLLIDSYGKKQLFDKMEQVFKSLLRSKERPTLPTFNSMITNYGKARLREKAEEV 329 Query: 1072 YQKMTDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCR 893 ++KM DMGY PS +T ESLI MYG+CDCVS+ARE+FD +++S KE +VSTLNAML+VYC Sbjct: 330 FRKMKDMGYDPSYVTCESLIMMYGHCDCVSKAREIFDGMVNSGKEVRVSTLNAMLDVYCI 389 Query: 892 NGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKR 713 NGLP+EA+ LF+SA R P DS+TYKLLYKAYTKA KEL++ LL MDK GIIPNKR Sbjct: 390 NGLPLEADLLFESAGNMRVFP-DSTTYKLLYKAYTKADKKELLEKLLKNMDKAGIIPNKR 448 Query: 712 FFLDALGAIGSS 677 FFLDALG IGSS Sbjct: 449 FFLDALGTIGSS 460 >ref|XP_002532046.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223528289|gb|EEF30336.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 478 Score = 639 bits (1647), Expect = e-180 Identities = 326/467 (69%), Positives = 387/467 (82%), Gaps = 3/467 (0%) Frame = -3 Query: 2059 TLRISPT-TSTFLLGISEFYPPN-SCNSCPWRSMKASIYCLGGMSTKPRKKWGSKTNKDA 1886 +L++SP +S F + + P S + C + I C+ ST+PRKK + Sbjct: 9 SLQLSPLGSSNFTNNFPQIHNPTLSWHPCK-NLRQTHITCV---STRPRKK--RFPISEE 62 Query: 1885 SEADELVASLMRNFT-EKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWM 1709 SE ++LV ++R+F+ +K PLV TL+KYV+ +RTEHCFLLFEELG+ +WLQCLEVFRWM 
Sbjct: 63 SETEDLVRYVLRSFSSDKVPLVRTLDKYVRVVRTEHCFLLFEELGRRDKWLQCLEVFRWM 122 Query: 1708 QKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRD 1529 QKQRWY+AD+GVYSKLIS+MGKKGQTRMAMWLFSEMRNSGCRPD+SVYNALITAHLHS+D Sbjct: 123 QKQRWYIADSGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDSSVYNALITAHLHSKD 182 Query: 1528 KAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDT 1349 KAKA KALGYF+KMKG++RC+PNVVTYNILLRA AQ+RNV QVN LFKDLD+SI++PD Sbjct: 183 KAKALIKALGYFEKMKGMQRCQPNVVTYNILLRAFAQARNVNQVNALFKDLDQSIVSPDI 242 Query: 1348 FTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKS 1169 +T+NGVMDAYGK GMIREME VLS+MKS Q KPDIITFNLLIDSYGKKQ+F+KMEQVFKS Sbjct: 243 YTYNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIITFNLLIDSYGKKQDFDKMEQVFKS 302 Query: 1168 LLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDC 989 LL SKE+PTLPTFNSMITNYGKAR +E AESV QKMT M Y+P+ ITYESLI MYG+CD Sbjct: 303 LLHSKERPTLPTFNSMITNYGKARQKENAESVLQKMTKMKYTPNFITYESLIMMYGFCDS 362 Query: 988 VSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYK 809 VS+ARE+FD +++S KE KVSTLNAML+VYC NGLPMEA+ LF +A +P DS+TYK Sbjct: 363 VSKAREIFDDMIESGKEVKVSTLNAMLDVYCLNGLPMEADLLFDNARNVGLLP-DSTTYK 421 Query: 808 LLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSS 668 LLYKAYTKA MK+LV+ LL +MD+DGIIPNKRFFLDALGA S +S Sbjct: 422 LLYKAYTKANMKKLVQKLLKHMDRDGIIPNKRFFLDALGAFKSLPAS 468 >ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Glycine max] Length = 503 Score = 636 bits (1640), Expect = e-179 Identities = 312/434 (71%), Positives = 371/434 (85%), Gaps = 1/434 (0%) Frame = -3 Query: 1927 KPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKT 1748 +P++K S N +A E L+ S + N +K+PL+ TLNKYVK +RT+HCFLLFEEL K Sbjct: 30 RPKRK-KSNHNSEAQELVRLLTSKISN--DKEPLLKTLNKYVKQVRTQHCFLLFEELAKH 86 Query: 1747 GQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSV 1568 WLQCLEVFRWMQKQRWY+ADNG+YSKLIS+MGKKGQTRMAMWLFSEMRN+GCRPDTSV Sbjct: 87 DNWLQCLEVFRWMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSEMRNTGCRPDTSV 146 Query: 1567 
YNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNIL 1388 YNALITAHLHSRDK KA AKA+GYF KMKG+ERC+PN+VTYNILLRA AQ+RNVEQVN L Sbjct: 147 YNALITAHLHSRDKTKALAKAIGYFQKMKGMERCKPNIVTYNILLRAFAQARNVEQVNSL 206 Query: 1387 FKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGK 1208 FKDLDESI++PD +TFNGVMDAYGK GMIREME VL++MKS Q KPD+ITFNLLIDSYGK Sbjct: 207 FKDLDESIVSPDIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNLLIDSYGK 266 Query: 1207 KQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIIT 1028 KQEF KMEQVFKSLLRSKE+ +LPTFNSMI NYGKARL++KAE V+++MTDMGY+PS +T Sbjct: 267 KQEFGKMEQVFKSLLRSKERASLPTFNSMILNYGKARLKDKAEDVFKRMTDMGYTPSFVT 326 Query: 1027 YESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAH 848 +ESLI MYG+CDCVSRA ++FD+L++S+ KVSTLNAML+VYC NGLP EA++LF+ A+ Sbjct: 327 HESLIYMYGFCDCVSRAAQLFDELVESKAHIKVSTLNAMLDVYCINGLPQEADSLFERAN 386 Query: 847 ATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSS 668 + + P DSST+KLLYKAYTKA KEL+ LL +MDKDGI+PNKRFFLDALGA+ S ++ Sbjct: 387 SIKIYP-DSSTFKLLYKAYTKANQKELLDKLLKHMDKDGIVPNKRFFLDALGAVASLPAN 445 Query: 667 QKGTSDIGD-KVAN 629 + + D K AN Sbjct: 446 SESANAATDSKTAN 459 >ref|XP_006428072.1| hypothetical protein CICLE_v10025440mg [Citrus clementina] gi|568819570|ref|XP_006464322.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Citrus sinensis] gi|557530062|gb|ESR41312.1| hypothetical protein CICLE_v10025440mg [Citrus clementina] Length = 500 Score = 635 bits (1639), Expect = e-179 Identities = 314/427 (73%), Positives = 365/427 (85%), Gaps = 1/427 (0%) Frame = -3 Query: 1930 TKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGK 1751 T+PR K G K+ + E+ ELV LMR+F++K+PLV TLNKYVK +R+EHCFLLFEELGK Sbjct: 64 TRPRSKRGRKSEE--LESKELVRVLMRSFSDKEPLVRTLNKYVKVVRSEHCFLLFEELGK 121 Query: 1750 TGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTS 1571 + +WLQCLEVFRWMQKQRWY+AD G+YSKLI++MGKKGQTR+AMWLFSEMRNSGCRPD S Sbjct: 122 SDKWLQCLEVFRWMQKQRWYIADTGIYSKLIAVMGKKGQTRLAMWLFSEMRNSGCRPDPS 181 Query: 
1570 VYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNI 1391 VYNALITAHLH+RDKAKA AKALGYF KMKG+ERC+PN+VTYNILLRA AQ+RNV+QVN Sbjct: 182 VYNALITAHLHTRDKAKALAKALGYFQKMKGMERCKPNIVTYNILLRACAQARNVDQVNA 241 Query: 1390 LFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYG 1211 LFK+LDESI+ PD +T+NGVMDAYGK GMI+EME VLS+MKS Q KPDIITFNLLIDSYG Sbjct: 242 LFKELDESILAPDIYTYNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNLLIDSYG 301 Query: 1210 KKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSII 1031 K+Q F+KMEQVFKSL+ SKEKPTLPTFNSMI NYGKARL+ KAE V+QKMT M Y+PS I Sbjct: 302 KRQAFDKMEQVFKSLMHSKEKPTLPTFNSMIINYGKARLQGKAEYVFQKMTAMKYTPSFI 361 Query: 1030 TYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSA 851 TYE +ITMYGYCD VSRARE+FD+L K+ KVSTLNAMLE YC NGLP EA+ LF+++ Sbjct: 362 TYECIITMYGYCDNVSRAREIFDELSKLGKDMKVSTLNAMLEAYCMNGLPTEADLLFENS 421 Query: 850 HATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSS-D 674 H P DSSTYKLLYKAYTKA MKELV+ LL M+++GI+PNKRFFL+AL SS Sbjct: 422 HNMGVTP-DSSTYKLLYKAYTKANMKELVQKLLKRMEQNGIVPNKRFFLEALETFSSSLA 480 Query: 673 SSQKGTS 653 SQ G++ Sbjct: 481 GSQSGSA 487 Score = 60.8 bits (146), Expect = 2e-06 Identities = 48/236 (20%), Positives = 97/236 (41%), Gaps = 31/236 (13%) Frame = -3 Query: 1672 YSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAH----------------L 1541 Y+ ++ GK G + + S M+++ C+PD +N LI ++ + Sbjct: 258 YNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFNLLIDSYGKRQAFDKMEQVFKSLM 317 Query: 1540 HSRDK---------------AKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNV 1406 HS++K A+ KA F KM + + P+ +TY ++ NV Sbjct: 318 HSKEKPTLPTFNSMIINYGKARLQGKAEYVFQKMTAM-KYTPSFITYECIITMYGYCDNV 376 Query: 1405 EQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLL 1226 + +F +L + T N +++AY G+ E +L+ + + PD T+ LL Sbjct: 377 SRAREIFDELSKLGKDMKVSTLNAMLEAYCMNGLPTEADLLFENSHNMGVTPDSSTYKLL 436 Query: 1225 IDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMT 1058 +Y K E ++++ K + ++ P F + + + ++ S +T Sbjct: 437 YKAYTKANMKELVQKLLKRMEQNGIVPNKRFFLEALETFSSSLAGSQSGSAKTDLT 492 
>gb|EPS58459.1| hypothetical protein M569_16353, partial [Genlisea aurea] Length = 415 Score = 633 bits (1632), Expect = e-178 Identities = 300/404 (74%), Positives = 357/404 (88%) Frame = -3 Query: 1888 ASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWM 1709 +SEA++LV S+MRNFT+ QPL +TLNKYVK +RT HCFL+FEELGK+ +WLQCLEVFRWM Sbjct: 6 SSEAEDLVRSVMRNFTDSQPLTSTLNKYVKLLRTAHCFLIFEELGKSDRWLQCLEVFRWM 65 Query: 1708 QKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRD 1529 QKQRWYVADNGVYSKLIS+MGK+G+TRMAMWLFSEMRNSGCRPDTSVYN+LI+AHLHSRD Sbjct: 66 QKQRWYVADNGVYSKLISVMGKQGKTRMAMWLFSEMRNSGCRPDTSVYNSLISAHLHSRD 125 Query: 1528 KAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDT 1349 K KA K L YF+KMKGIERC+PNVVTYNILLRA AQ++N+EQVN LFK+LD SII+PD Sbjct: 126 KTKALDKVLWYFEKMKGIERCQPNVVTYNILLRAFAQAKNIEQVNALFKELDGSIISPDV 185 Query: 1348 FTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKS 1169 T+NGVMDAYGK GMIREMELVLS+MKS Q+KPD+ITFNLLID+YG++QEF+KMEQVFKS Sbjct: 186 LTYNGVMDAYGKNGMIREMELVLSKMKSAQIKPDVITFNLLIDAYGRRQEFDKMEQVFKS 245 Query: 1168 LLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDC 989 L+ SKEKPT+PTFNSMITNYGKARLR+KA++ YQKM MGY PS +T+ESLI +G+CD Sbjct: 246 LMHSKEKPTVPTFNSMITNYGKARLRDKADATYQKMIGMGYKPSYVTHESLIVAFGHCDY 305 Query: 988 VSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYK 809 VSRARE+FDQ++ + K+ STLNAMLEVYC N L MEA LF+S+H T P+D STY+ Sbjct: 306 VSRAREIFDQVIGTDTVKRTSTLNAMLEVYCMNSLHMEAYMLFESSHDTEAFPVDLSTYR 365 Query: 808 LLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSS 677 +LY+A++KAGMK+LV L+ MD+DGIIPNK+FFLDALG +GSS Sbjct: 366 ILYRAFSKAGMKDLVDKLVMNMDRDGIIPNKKFFLDALGTLGSS 409 >ref|XP_002306340.2| pentatricopeptide repeat-containing family protein, partial [Populus trichocarpa] gi|550338395|gb|EEE93336.2| pentatricopeptide repeat-containing family protein, partial [Populus trichocarpa] Length = 414 Score = 628 bits (1619), Expect = e-177 Identities = 311/417 (74%), Positives = 360/417 (86%) Frame = -3 Query: 1966 
MKASIYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRT 1787 MK S + +ST+P+K+ + SEA ELV L+R+F++KQPLV TLNKYVK +RT Sbjct: 1 MKYSPTQVSCVSTRPKKR--PVPTDEKSEAQELVRVLVRSFSDKQPLVKTLNKYVKVMRT 58 Query: 1786 EHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFS 1607 EHCF+LFEELGKT +WLQCLEVFRWMQKQRWYVADNG YSKLIS+MGKKGQTRMAMWLFS Sbjct: 59 EHCFMLFEELGKTDKWLQCLEVFRWMQKQRWYVADNGCYSKLISVMGKKGQTRMAMWLFS 118 Query: 1606 EMRNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRA 1427 EMRNSGCRPDTSVYNALITAHLHS+DKAK+ KAL YF+KMK IERC+PNVVTYNI+LRA Sbjct: 119 EMRNSGCRPDTSVYNALITAHLHSKDKAKSLTKALAYFEKMKSIERCQPNVVTYNIILRA 178 Query: 1426 AAQSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPD 1247 AQ+RNV QVN LFKDL+ESI++PD +T+NGV+DAYGK GMIREME VLS+MK Q KPD Sbjct: 179 FAQARNVNQVNALFKDLEESIVSPDIYTYNGVLDAYGKNGMIREMESVLSRMKIDQCKPD 238 Query: 1246 IITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQ 1067 IITFNLLIDSYGKKQ+FEKMEQVFKSLLRSKEKPTLPTFNSMI NYGKARL++KAESV++ Sbjct: 239 IITFNLLIDSYGKKQDFEKMEQVFKSLLRSKEKPTLPTFNSMIVNYGKARLKDKAESVFK 298 Query: 1066 KMTDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNG 887 KM DM Y+PS IT+ESLI MYG CDCVS+AR++FD +++S KE KVSTLNA+L VYC NG Sbjct: 299 KMADMRYTPSFITFESLIMMYGICDCVSKARDIFDDMVESGKEVKVSTLNAVLNVYCMNG 358 Query: 886 LPMEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNK 716 L MEA L ++A + +P +SSTYKLLY+AYTKA MKELV+ LL +MDKDGIIPNK Sbjct: 359 LHMEAHILLENARSI-GVPPNSSTYKLLYRAYTKAKMKELVQKLLKHMDKDGIIPNK 414 >ref|XP_006585305.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Glycine max] Length = 526 Score = 626 bits (1615), Expect = e-176 Identities = 308/429 (71%), Positives = 363/429 (84%) Frame = -3 Query: 1927 KPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKT 1748 +P K S N +A E L+ S +R+ +K+ L+ TLNKYVK +RT+HCFLLFEELGK Sbjct: 37 RPPKSKKSNLNSEAQELVRLLTSKIRS-NDKEVLLKTLNKYVKQVRTQHCFLLFEELGKH 95 Query: 1747 GQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSV 1568 
WLQCLEVFRWMQKQRWY+ADNG+YSKLIS+MGKKGQTRMAMWLFSEMRN+GCRPDTSV Sbjct: 96 DNWLQCLEVFRWMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSEMRNTGCRPDTSV 155 Query: 1567 YNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNIL 1388 YNALITAHL SRDK KA AKA+GYF KMKG+ERC+PN+VTYNILLRA AQ+RNVEQVN L Sbjct: 156 YNALITAHLRSRDKIKALAKAIGYFQKMKGMERCKPNIVTYNILLRAFAQARNVEQVNSL 215 Query: 1387 FKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGK 1208 FKDLDESI++PD +TFNGVMDAYGK GMIREME VL++MKS Q KPD+ITFNLLIDSYGK Sbjct: 216 FKDLDESIVSPDIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDLITFNLLIDSYGK 275 Query: 1207 KQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIIT 1028 KQ F KMEQVFKSLL SKE+P+LPTFNSMI NYGKARL++KAE V++KMTDMGY+ S +T Sbjct: 276 KQAFGKMEQVFKSLLHSKERPSLPTFNSMILNYGKARLKDKAEDVFKKMTDMGYTLSFVT 335 Query: 1027 YESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAH 848 +ES+I MYG+CDCVSRA ++FD+L++S+ KVSTLNAML+VYC NGLP EA++LF+ A Sbjct: 336 HESMIYMYGFCDCVSRAAQLFDELVESKVHIKVSTLNAMLDVYCLNGLPQEADSLFERA- 394 Query: 847 ATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSS 668 + KI DSST+KLLYKAYTKA KEL+ LL +MDKDGIIPNKRFFLDALGA+ S ++ Sbjct: 395 ISIKIHPDSSTFKLLYKAYTKANQKELLDKLLKHMDKDGIIPNKRFFLDALGAVASLPAN 454 Query: 667 QKGTSDIGD 641 + + D Sbjct: 455 SESANAATD 463 >ref|XP_004296690.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Fragaria vesca subsp. 
vesca] Length = 420 Score = 623 bits (1607), Expect = e-176 Identities = 305/407 (74%), Positives = 358/407 (87%), Gaps = 1/407 (0%) Frame = -3 Query: 1870 LVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWY 1691 +V L+R+F++K+PLV TLNKYVK +RTEHCFLLFEELGK+G+WLQCLEVFRWMQKQRWY Sbjct: 1 MVRMLIRSFSDKEPLVKTLNKYVKIVRTEHCFLLFEELGKSGKWLQCLEVFRWMQKQRWY 60 Query: 1690 VADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFA 1511 VADNGVYSKLIS+MGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALI+AHL+S+DK KA Sbjct: 61 VADNGVYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALISAHLNSKDKGKALE 120 Query: 1510 KALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTFNGV 1331 K L YF+KMKG+ERC+PN+VTYNILLRA AQ+RNV++VN LFKDLDESI PD +T+NGV Sbjct: 121 KGLVYFNKMKGMERCQPNIVTYNILLRAYAQARNVDKVNSLFKDLDESIACPDIYTYNGV 180 Query: 1330 MDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKE 1151 MDAYGK GMIR+ME VLS+MKS Q KPDIITFNLLIDSYGKKQ+F+KMEQVFKSLL SKE Sbjct: 181 MDAYGKNGMIRDMESVLSRMKSNQCKPDIITFNLLIDSYGKKQQFDKMEQVFKSLLHSKE 240 Query: 1150 KPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSRARE 971 +PTLPTFNSMI NYGKARL+E+AESV+++M DM YSPS ITYESL+ MYGYCD VS+ARE Sbjct: 241 RPTLPTFNSMIINYGKARLKEQAESVFKRMIDMKYSPSFITYESLMMMYGYCDSVSKARE 300 Query: 970 VFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAY 791 +FD + +S +E KVSTLN ML+VYCRNGLPMEA+ L SA++ P + TYKLLYKAY Sbjct: 301 IFDGVAESGQEMKVSTLNVMLDVYCRNGLPMEADKLLLSANSIGIRP-NVCTYKLLYKAY 359 Query: 790 TKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGA-IGSSDSSQKGTS 653 TKA MK+L+ LL MDKDGI+PNKRFFL+ALGA + S+ +S+ G++ Sbjct: 360 TKANMKDLLDKLLKSMDKDGIVPNKRFFLEALGAFLSSTGNSESGSA 406 >ref|XP_003630096.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355524118|gb|AET04572.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 635 Score = 623 bits (1607), Expect = e-176 Identities = 310/457 (67%), Positives = 371/457 (81%) Frame = -3 Query: 2014 SEFYPPNSCNSCPWRSMKASIYCLGGMSTKPRKKWGSKTNKDASEADELVASLMRNFTEK 1835 S +YP + P+ ++ I C+ + RK+ D SE 
ELV L R ++K Sbjct: 11 SSYYPFPKIHYPPYITIPTRISCVSNPTRINRKQ-----TTDQSETQELVRLLTRKISDK 65 Query: 1834 QPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQRWYVADNGVYSKLIS 1655 +PL+ TLNKYVK +RTEHCFLLFEELGK +WLQCLEVFRWMQ+QRWY+ADNGVYSKLIS Sbjct: 66 EPLLKTLNKYVKLVRTEHCFLLFEELGKHDKWLQCLEVFRWMQRQRWYIADNGVYSKLIS 125 Query: 1654 IMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAKAFAKALGYFDKMKGI 1475 +MGKKGQ R+AMWLFSEMRN+GCRPDTSVYN+LI+AHLHSRDK+KA KALGYF+KMK Sbjct: 126 VMGKKGQIRLAMWLFSEMRNTGCRPDTSVYNSLISAHLHSRDKSKALVKALGYFEKMKTT 185 Query: 1474 ERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTFNGVMDAYGKTGMIRE 1295 ERC+PN+VTYNILLRA AQ+R+V QVN LFKDLDES ++PD +TFNGVMD YGK GMIRE Sbjct: 186 ERCKPNIVTYNILLRAFAQARDVNQVNYLFKDLDESSVSPDIYTFNGVMDGYGKNGMIRE 245 Query: 1294 MELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLRSKEKPTLPTFNSMIT 1115 ME VL +MKS Q+K D+IT+NLLIDSYGKKQ+F+KMEQVFKSL RSKEKPTLPTFNSMI Sbjct: 246 MESVLVRMKSNQVKLDLITYNLLIDSYGKKQQFDKMEQVFKSLSRSKEKPTLPTFNSMIL 305 Query: 1114 NYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSRAREVFDQLMDSRKEK 935 NYGKARL++KAE+V+Q MTDMGY+PS +T+ESLI MYG C CVS A E+FDQL++S+ Sbjct: 306 NYGKARLKDKAENVFQNMTDMGYTPSFVTHESLIHMYGLCGCVSNAVELFDQLIESKVPI 365 Query: 934 KVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLYKAYTKAGMKELVKNL 755 KVSTLNAML+VYC NGL EA++LF A + + P D++TYKLLYKAYTKA KEL+ L Sbjct: 366 KVSTLNAMLDVYCINGLQQEADSLFTRAKSIKIFP-DATTYKLLYKAYTKANSKELLDKL 424 Query: 754 LSYMDKDGIIPNKRFFLDALGAIGSSDSSQKGTSDIG 644 L MDKD +IPNKRFFLDALGAIGSS + + G+++ G Sbjct: 425 LKQMDKDSVIPNKRFFLDALGAIGSS-TEKSGSANAG 460 >gb|ESW31740.1| hypothetical protein PHAVU_002G263600g [Phaseolus vulgaris] Length = 534 Score = 616 bits (1589), Expect = e-173 Identities = 305/429 (71%), Positives = 361/429 (84%) Frame = -3 Query: 1927 KPRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKT 1748 +PRKK + N +A EA ELV L R ++K+PL+ TLNK+VK +RTEHCFLLFEELGK Sbjct: 34 RPRKK---QPNHNA-EARELVRLLTRKISDKEPLLKTLNKFVKQVRTEHCFLLFEELGKE 89 Query: 1747 GQWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSV 1568 G 
WLQC+EVFRWMQKQRWY+ADNG+YSKLIS+MGK+GQTRMAMWLFSEMRN+GCRPDTSV Sbjct: 90 GNWLQCIEVFRWMQKQRWYIADNGIYSKLISVMGKRGQTRMAMWLFSEMRNAGCRPDTSV 149 Query: 1567 YNALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNIL 1388 YNALITAHLHSRDK KA +KA+GYF KMKGIERC+PN+VTYNILLRA AQ+RN+EQV+ L Sbjct: 150 YNALITAHLHSRDKTKALSKAIGYFQKMKGIERCKPNIVTYNILLRAFAQARNLEQVSSL 209 Query: 1387 FKDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGK 1208 FKDLDES I+PD +TFNGVMDAYGK GMIREME +L+QM+S Q KPD+ITFNLLIDSYGK Sbjct: 210 FKDLDESSISPDIYTFNGVMDAYGKNGMIREMEAILAQMRSSQYKPDLITFNLLIDSYGK 269 Query: 1207 KQEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIIT 1028 KQEF KMEQVFKSLL SKE+PTL TFNSMI NYGKARL+ KAE V++KM DMGY+PS +T Sbjct: 270 KQEFGKMEQVFKSLLSSKERPTLSTFNSMILNYGKARLKNKAEDVFKKMIDMGYTPSFVT 329 Query: 1027 YESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAH 848 +ESLI MYG CDCVS A ++FD+L++S+ KVSTLNA+L+VYC NGL EA +LF+ A Sbjct: 330 HESLIFMYGLCDCVSSAVQLFDELVESKVPIKVSTLNAILDVYCLNGLQQEAHSLFERAK 389 Query: 847 ATRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSS 668 + KI DSSTYKLLY+AYTKA KEL+ LL +MD++GIIPNKRFFL+AL ++ S + Sbjct: 390 SI-KIHPDSSTYKLLYRAYTKAKQKELLDKLLEHMDENGIIPNKRFFLNALDSVASVPGN 448 Query: 667 QKGTSDIGD 641 K + D Sbjct: 449 SKSANAATD 457 >ref|XP_006411755.1| hypothetical protein EUTSA_v10024997mg [Eutrema salsugineum] gi|557112925|gb|ESQ53208.1| hypothetical protein EUTSA_v10024997mg [Eutrema salsugineum] Length = 496 Score = 608 bits (1567), Expect = e-171 Identities = 313/470 (66%), Positives = 369/470 (78%), Gaps = 4/470 (0%) Frame = -3 Query: 2059 TLRISPTTSTFLLGISEFYPPNSCNSCPWRSMKASIYCLGGMSTKPRKKWGSKTNKDASE 1880 +LR S S+F + +S R M + G +S+ R+K + + + E Sbjct: 12 SLRFSDFISSFSQETDHKWLRSSPKPGGARKMSTTTITCGAISS--RRKLAERESAER-E 68 Query: 1879 ADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTGQWLQCLEVFRWMQKQ 1700 LV SLM ++++PLV TL+KYVK +R EHCFLLFEELGK+ +WLQCLEVFRWMQKQ Sbjct: 69 NRVLVRSLMSRISDREPLVKTLDKYVKVVRCEHCFLLFEELGKSDKWLQCLEVFRWMQKQ 128 Query: 1699 
RWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVYNALITAHLHSRDKAK 1520 RWY+ADNGVYSKLIS+MGKKGQTRMAMWLFSEM+NSGCRPD SVYNALITAHLH+RDKAK Sbjct: 129 RWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMKNSGCRPDASVYNALITAHLHTRDKAK 188 Query: 1519 AFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILFKDLDESIITPDTFTF 1340 A K GYFDKMKG+ERC+PNVVTYNILLRA AQS V+QVN LFK+LD S ++PD +TF Sbjct: 189 ALEKVRGYFDKMKGMERCQPNVVTYNILLRAFAQSGKVDQVNALFKELDISAVSPDVYTF 248 Query: 1339 NGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKKQEFEKMEQVFKSLLR 1160 NGVMDAYGK GMI+EME VL++M+S + KPDIITFNLLIDSYGKKQEFEKMEQ FKSLLR Sbjct: 249 NGVMDAYGKNGMIKEMESVLTRMRSNECKPDIITFNLLIDSYGKKQEFEKMEQTFKSLLR 308 Query: 1159 SKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITYESLITMYGYCDCVSR 980 SKEKPTLPTFNSMI NYGKAR R+KAE V++KM DM Y PS ITYE +I MYGYC VSR Sbjct: 309 SKEKPTLPTFNSMIINYGKARRRDKAEWVFEKMNDMNYMPSFITYECMIMMYGYCGSVSR 368 Query: 979 AREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHATRKIPIDSSTYKLLY 800 ARE+F+++++S + K STLNAML+VYC NGL MEA+ LF SA A R P D+STYKLLY Sbjct: 369 AREMFEEVVESERVLKPSTLNAMLDVYCLNGLHMEADKLFHSASAFRVHP-DASTYKLLY 427 Query: 799 KAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGS----SDSSQK 662 KAYTKA MKE V+ L+ M+KDGI+PNKRFFL+AL GS SDS ++ Sbjct: 428 KAYTKADMKERVQMLMKKMEKDGIVPNKRFFLEALEVFGSRLPGSDSGRR 477 >ref|XP_006843693.1| hypothetical protein AMTR_s00007p00203740 [Amborella trichopoda] gi|548846061|gb|ERN05368.1| hypothetical protein AMTR_s00007p00203740 [Amborella trichopoda] Length = 494 Score = 601 bits (1550), Expect = e-169 Identities = 292/428 (68%), Positives = 354/428 (82%) Frame = -3 Query: 1924 PRKKWGSKTNKDASEADELVASLMRNFTEKQPLVTTLNKYVKFIRTEHCFLLFEELGKTG 1745 P KK S+ N + SE +ELV+ L+RN ++ +PL++TLNKYV+ IR EHCFLLFEELGK Sbjct: 58 PSKKRRSQ-NSEKSEVEELVSVLVRNSSKDKPLISTLNKYVRIIRNEHCFLLFEELGKRD 116 Query: 1744 QWLQCLEVFRWMQKQRWYVADNGVYSKLISIMGKKGQTRMAMWLFSEMRNSGCRPDTSVY 1565 WLQCLEVFRWMQKQ+WYVADNG+YSKLIS+MGKKGQTRMAMWLFSEMRNSGCRPDTSVY Sbjct: 117 NWLQCLEVFRWMQKQQWYVADNGIYSKLISVMGKKGQTRMAMWLFSEMRNSGCRPDTSVY 176 Query: 1564 
NALITAHLHSRDKAKAFAKALGYFDKMKGIERCRPNVVTYNILLRAAAQSRNVEQVNILF 1385 NALITAHLHS+DK KA AKA+GY +KMKGIERC+PN+VTYNILLRA AQSR+V QV LF Sbjct: 177 NALITAHLHSKDKTKALAKAMGYLEKMKGIERCKPNIVTYNILLRAFAQSRDVYQVETLF 236 Query: 1384 KDLDESIITPDTFTFNGVMDAYGKTGMIREMELVLSQMKSKQLKPDIITFNLLIDSYGKK 1205 DL+ + ++ D +T+NGVMDAYGK GM+ E+E +L +M+ Q +PD IT+NLLIDSYGK+ Sbjct: 237 VDLETNALSADIYTYNGVMDAYGKNGMLVELEAMLLRMRKNQCRPDTITYNLLIDSYGKR 296 Query: 1204 QEFEKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAESVYQKMTDMGYSPSIITY 1025 Q F+KMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAE V++KM D+GY PS ITY Sbjct: 297 QAFDKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLREKAEWVFRKMIDLGYKPSFITY 356 Query: 1024 ESLITMYGYCDCVSRAREVFDQLMDSRKEKKVSTLNAMLEVYCRNGLPMEAETLFKSAHA 845 ESL+ YGYCDCVSRAR++F +++DS + +VSTLN ML+VYC NGL EA+ L + A Sbjct: 357 ESLMMAYGYCDCVSRARDIFSEMIDSGMQIQVSTLNTMLDVYCMNGLTGEADLLLQYART 416 Query: 844 TRKIPIDSSTYKLLYKAYTKAGMKELVKNLLSYMDKDGIIPNKRFFLDALGAIGSSDSSQ 665 +P DSSTYKLLYKAYTKA M +L++ L+ M+KD I+PNK+FFL+ALG +GS+ + Sbjct: 417 KEVLP-DSSTYKLLYKAYTKANMMDLLEMLVKQMEKDDIVPNKKFFLEALGTLGSTQAIS 475 Query: 664 KGTSDIGD 641 S + D Sbjct: 476 TQPSSVVD 483