BLASTX nr result
ID: Glycyrrhiza24_contig00009286
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza24_contig00009286 (1708 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containi... 751 0.0 ref|XP_003630096.1| Pentatricopeptide repeat-containing protein ... 749 0.0 ref|XP_003532845.1| PREDICTED: pentatricopeptide repeat-containi... 745 0.0 ref|XP_002532046.1| pentatricopeptide repeat-containing protein,... 674 0.0 ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containi... 672 0.0 >ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Glycine max] Length = 503 Score = 751 bits (1939), Expect = 0.0 Identities = 385/494 (77%), Positives = 421/494 (85%), Gaps = 29/494 (5%) Frame = +1 Query: 151 ALLTRISCGGSGATRSKREKKTSDQSETRELVRLLKQKIS-DKEPLVKTLSKYVKLVRTE 327 A L+RISCG R KR KK++ SE +ELVRLL KIS DKEPL+KTL+KYVK VRT+ Sbjct: 20 APLSRISCGA----RPKR-KKSNHNSEAQELVRLLTSKISNDKEPLLKTLNKYVKQVRTQ 74 Query: 328 HCFLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFSE 507 HCFLLFEEL K D WLQC+EVFRWMQKQRWYIADNG+YSKLISVMGKKGQTR+AMWLFSE Sbjct: 75 HCFLLFEELAKHDNWLQCLEVFRWMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSE 134 Query: 508 MRNTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRAF 687 MRNTGCRPDTSVYN+LI+AHLHSRDK KALAKA+GYF+KMKG ERCKPNIVTYNILLRAF Sbjct: 135 MRNTGCRPDTSVYNALITAHLHSRDKTKALAKAIGYFQKMKGMERCKPNIVTYNILLRAF 194 Query: 688 AQARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPDL 867 AQARNVE VNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREME+VLARMKSN+CKPDL Sbjct: 195 AQARNVEQVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDL 254 Query: 868 ITYNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFKK 1047 IT+NLLID+YGKKQ+F KMEQVFKSLL SKE+ +LPTFNSMILNYGKARLKDKAE+VFK+ Sbjct: 255 ITFNLLIDSYGKKQEFGKMEQVFKSLLRSKERASLPTFNSMILNYGKARLKDKAEDVFKR 314 Query: 1048 MTDMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCINGL 1227 MTDMGYTPSFVTHESLIYM+GFCDCVS+A +LFD LVESK +KVSTLNAMLDVYCINGL Sbjct: 315 MTDMGYTPSFVTHESLIYMYGFCDCVSRAAQLFDELVESKAHIKVSTLNAMLDVYCINGL 374 Query: 1228 PQEADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFLD 1407 PQEADSLF+RA SIKI+PD+ST+KLLYKAYTKAN KELLDKLLKHMDK GI+PNKRFFLD Sbjct: 375 PQEADSLFERANSIKIYPDSSTFKLLYKAYTKANQKELLDKLLKHMDKDGIVPNKRFFLD 434 Query: 1408 ALGAIESLPA----------------------------NSGSANAETASNSPQDLVKNQL 1503 ALGA+ SLPA NS SANA T SN+PQ+ K Sbjct: 435 ALGAVASLPANSESANAATDSKTANSESANAATDSNTSNSKSANAATDSNNPQEFSK--- 491 Query: 1504 ET*LAIHTKNLLAH 1545 LA H KN+LAH Sbjct: 492 ---LATHVKNILAH 502 >ref|XP_003630096.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355524118|gb|AET04572.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 635 Score = 749 bits (1933), Expect = 0.0 Identities = 380/448 (84%), Positives = 403/448 (89%) Frame = +1 Query: 139 PSHVALLTRISCGGSGATRSKREKKTSDQSETRELVRLLKQKISDKEPLVKTLSKYVKLV 318 P ++ + TRISC S TR R K+T+DQSET+ELVRLL +KISDKEPL+KTL+KYVKLV Sbjct: 22 PPYITIPTRISCV-SNPTRINR-KQTTDQSETQELVRLLTRKISDKEPLLKTLNKYVKLV 79 Query: 319 RTEHCFLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWL 498 RTEHCFLLFEELGK DKWLQC+EVFRWMQ+QRWYIADNGVYSKLISVMGKKGQ RLAMWL Sbjct: 80 RTEHCFLLFEELGKHDKWLQCLEVFRWMQRQRWYIADNGVYSKLISVMGKKGQIRLAMWL 139 Query: 499 FSEMRNTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILL 678 FSEMRNTGCRPDTSVYNSLISAHLHSRDK KAL KALGYFEKMK ERCKPNIVTYNILL Sbjct: 140 FSEMRNTGCRPDTSVYNSLISAHLHSRDKSKALVKALGYFEKMKTTERCKPNIVTYNILL 199 Query: 679 RAFAQARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCK 858 RAFAQAR+V VN LFKDLDES VSPDIYTFNGVMD YGKNGMIREMESVL RMKSN+ K Sbjct: 200 RAFAQARDVNQVNYLFKDLDESSVSPDIYTFNGVMDGYGKNGMIREMESVLVRMKSNQVK 259 Query: 859 PDLITYNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENV 1038 DLITYNLLID+YGKKQQFDKMEQVFKSL SKEKPTLPTFNSMILNYGKARLKDKAENV Sbjct: 260 LDLITYNLLIDSYGKKQQFDKMEQVFKSLSRSKEKPTLPTFNSMILNYGKARLKDKAENV 319 Query: 1039 FKKMTDMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCI 1218 F+ MTDMGYTPSFVTHESLI+M+G C CVS AVELFD L+ESKVP+KVSTLNAMLDVYCI Sbjct: 320 FQNMTDMGYTPSFVTHESLIHMYGLCGCVSNAVELFDQLIESKVPIKVSTLNAMLDVYCI 379 Query: 1219 NGLPQEADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRF 1398 NGL QEADSLF RA+SIKIFPDA+TYKLLYKAYTKANSKELLDKLLK MDK +IPNKRF Sbjct: 380 NGLQQEADSLFTRAKSIKIFPDATTYKLLYKAYTKANSKELLDKLLKQMDKDSVIPNKRF 439 Query: 1399 FLDALGAIESLPANSGSANAETASNSPQ 1482 FLDALGAI S SGSANA T S+ PQ Sbjct: 440 FLDALGAIGSSTEKSGSANAGTGSSRPQ 467 >ref|XP_003532845.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Glycine max] Length = 580 Score = 745 bits (1923), Expect = 0.0 Identities = 373/444 (84%), Positives = 407/444 (91%), Gaps = 2/444 (0%) Frame = +1 Query: 151 ALLTRISCGGSGATRSKREKKTSDQSETRELVRLLKQKI--SDKEPLVKTLSKYVKLVRT 324 A L+RISCGG R + KK++ SE +ELVRLL KI +DKE L+KTL+KYVK VRT Sbjct: 80 APLSRISCGGP---RPPKSKKSNLNSEAQELVRLLTSKIRSNDKEVLLKTLNKYVKQVRT 136 Query: 325 EHCFLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFS 504 +HCFLLFEELGK D WLQC+EVFRWMQKQRWYIADNG+YSKLISVMGKKGQTR+AMWLFS Sbjct: 137 QHCFLLFEELGKHDNWLQCLEVFRWMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFS 196 Query: 505 EMRNTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRA 684 EMRNTGCRPDTSVYN+LI+AHL SRDK KALAKA+GYF+KMKG ERCKPNIVTYNILLRA Sbjct: 197 EMRNTGCRPDTSVYNALITAHLRSRDKIKALAKAIGYFQKMKGMERCKPNIVTYNILLRA 256 Query: 685 FAQARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPD 864 FAQARNVE VNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREME+VLARMKSN+CKPD Sbjct: 257 FAQARNVEQVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPD 316 Query: 865 LITYNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFK 1044 LIT+NLLID+YGKKQ F KMEQVFKSLLHSKE+P+LPTFNSMILNYGKARLKDKAE+VFK Sbjct: 317 LITFNLLIDSYGKKQAFGKMEQVFKSLLHSKERPSLPTFNSMILNYGKARLKDKAEDVFK 376 Query: 1045 KMTDMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCING 1224 KMTDMGYT SFVTHES+IYM+GFCDCVS+A +LFD LVESKV +KVSTLNAMLDVYC+NG Sbjct: 377 KMTDMGYTLSFVTHESMIYMYGFCDCVSRAAQLFDELVESKVHIKVSTLNAMLDVYCLNG 436 Query: 1225 LPQEADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFL 1404 LPQEADSLF+RA SIKI PD+ST+KLLYKAYTKAN KELLDKLLKHMDK GIIPNKRFFL Sbjct: 437 LPQEADSLFERAISIKIHPDSSTFKLLYKAYTKANQKELLDKLLKHMDKDGIIPNKRFFL 496 Query: 1405 DALGAIESLPANSGSANAETASNS 1476 DALGA+ SLPANS SANA T SN+ Sbjct: 497 DALGAVASLPANSESANAATDSNT 520 >ref|XP_002532046.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223528289|gb|EEF30336.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 478 Score = 674 bits (1740), Expect = 0.0 Identities = 332/439 (75%), Positives = 390/439 (88%), Gaps = 4/439 (0%) Frame = +1 Query: 160 TRISCGGSGATRSKREK-KTSDQSETRELVR-LLKQKISDKEPLVKTLSKYVKLVRTEHC 333 T I+C +TR ++++ S++SET +LVR +L+ SDK PLV+TL KYV++VRTEHC Sbjct: 43 THITCV---STRPRKKRFPISEESETEDLVRYVLRSFSSDKVPLVRTLDKYVRVVRTEHC 99 Query: 334 FLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFSEMR 513 FLLFEELG+ DKWLQC+EVFRWMQKQRWYIAD+GVYSKLISVMGKKGQTR+AMWLFSEMR Sbjct: 100 FLLFEELGRRDKWLQCLEVFRWMQKQRWYIADSGVYSKLISVMGKKGQTRMAMWLFSEMR 159 Query: 514 NTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRAFAQ 693 N+GCRPD+SVYN+LI+AHLHS+DK KAL KALGYFEKMKG +RC+PN+VTYNILLRAFAQ Sbjct: 160 NSGCRPDSSVYNALITAHLHSKDKAKALIKALGYFEKMKGMQRCQPNVVTYNILLRAFAQ 219 Query: 694 ARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPDLIT 873 ARNV VN+LFKDLD+SIVSPDIYT+NGVMDAYGKNGMIREMESVL+RMKSN+CKPD+IT Sbjct: 220 ARNVNQVNALFKDLDQSIVSPDIYTYNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIIT 279 Query: 874 YNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFKKMT 1053 +NLLID+YGKKQ FDKMEQVFKSLLHSKE+PTLPTFNSMI NYGKAR K+ AE+V +KMT Sbjct: 280 FNLLIDSYGKKQDFDKMEQVFKSLLHSKERPTLPTFNSMITNYGKARQKENAESVLQKMT 339 Query: 1054 DMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCINGLPQ 1233 M YTP+F+T+ESLI M+GFCD VSKA E+FD ++ES VKVSTLNAMLDVYC+NGLP Sbjct: 340 KMKYTPNFITYESLIMMYGFCDSVSKAREIFDDMIESGKEVKVSTLNAMLDVYCLNGLPM 399 Query: 1234 EADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFLDAL 1413 EAD LF AR++ + PD++TYKLLYKAYTKAN K+L+ KLLKHMD+ GIIPNKRFFLDAL Sbjct: 400 EADLLFDNARNVGLLPDSTTYKLLYKAYTKANMKKLVQKLLKHMDRDGIIPNKRFFLDAL 459 Query: 1414 GAIESLPANSGSA--NAET 1464 GA +SLPA+SG+ NA+T Sbjct: 460 GAFKSLPASSGNQQNNAKT 478 >ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Vitis vinifera] gi|296082481|emb|CBI21486.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 672 bits (1735), Expect = 0.0 Identities = 327/445 (73%), Positives = 382/445 (85%) Frame = +1 Query: 160 TRISCGGSGATRSKREKKTSDQSETRELVRLLKQKISDKEPLVKTLSKYVKLVRTEHCFL 339 T +SC + R K D+SE ELVR+L + + PL+ TL+KYVK++RTEHCF Sbjct: 46 TVVSCVSTRPRRKPGPKP--DKSEVEELVRVLMKNFGGERPLISTLNKYVKVIRTEHCFR 103 Query: 340 LFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFSEMRNT 519 LFEELGK DKWLQC+EVFRWMQKQRWYIADNGVYSKLISVMGKKGQTR+AMWLFSEMRN+ Sbjct: 104 LFEELGKTDKWLQCLEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNS 163 Query: 520 GCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRAFAQAR 699 GCRPDTSVYN+LI+AHLHSRDK KAL KALGYF+KMKG ERCKPNIVTYNILLRAFAQA+ Sbjct: 164 GCRPDTSVYNALITAHLHSRDKSKALIKALGYFDKMKGMERCKPNIVTYNILLRAFAQAQ 223 Query: 700 NVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPDLITYN 879 NV N+LFK+L+ESIVSPDI+TFNGVMDAYGKNGMI+EMESVL+RMKSN+CKPD+IT+N Sbjct: 224 NVNQANALFKELNESIVSPDIFTFNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFN 283 Query: 880 LLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFKKMTDM 1059 +LID+YG++Q+FDKMEQVFKSLL SKEKPTLPTFNSMI NYGKARLK+KAENVFKKMTDM Sbjct: 284 VLIDSYGRRQEFDKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLKEKAENVFKKMTDM 343 Query: 1060 GYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCINGLPQEA 1239 GY P+F+T+ESLI M+GFCDC+S+A E+FD ++ SK +KVSTLNAML+VYC+NGLP EA Sbjct: 344 GYAPNFITYESLIMMYGFCDCISRAREIFDEMMASKKEMKVSTLNAMLEVYCMNGLPMEA 403 Query: 1240 DSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFLDALGA 1419 D L +RAR + FP +STYKLLYKAYTKA+ KELL+KLLK MD GI+PNKRFFL+ALGA Sbjct: 404 DLLLERARKNRPFPGSSTYKLLYKAYTKADQKELLEKLLKLMDSDGILPNKRFFLEALGA 463 Query: 1420 IESLPANSGSANAETASNSPQDLVK 1494 S PA+ SA + T P++ K Sbjct: 464 FGSSPASQESAGSTTGLTRPRNSAK 488