BLASTX nr result
ID: Glycyrrhiza23_contig00012253
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza23_contig00012253 (1657 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containi... 751 0.0 ref|XP_003630096.1| Pentatricopeptide repeat-containing protein ... 749 0.0 ref|XP_003532845.1| PREDICTED: pentatricopeptide repeat-containi... 745 0.0 ref|XP_002532046.1| pentatricopeptide repeat-containing protein,... 674 0.0 ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containi... 672 0.0 >ref|XP_003524280.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Glycine max] Length = 503 Score = 751 bits (1939), Expect = 0.0 Identities = 385/494 (77%), Positives = 421/494 (85%), Gaps = 29/494 (5%) Frame = +3 Query: 156 ALLTRISCGGSGATRSKREKKTSDQSETRELVRLLKQKIS-DKEPLVKTLSKYVKLVRTE 332 A L+RISCG R KR KK++ SE +ELVRLL KIS DKEPL+KTL+KYVK VRT+ Sbjct: 20 APLSRISCGA----RPKR-KKSNHNSEAQELVRLLTSKISNDKEPLLKTLNKYVKQVRTQ 74 Query: 333 HCFLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFSE 512 HCFLLFEEL K D WLQC+EVFRWMQKQRWYIADNG+YSKLISVMGKKGQTR+AMWLFSE Sbjct: 75 HCFLLFEELAKHDNWLQCLEVFRWMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFSE 134 Query: 513 MRNTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRAF 692 MRNTGCRPDTSVYN+LI+AHLHSRDK KALAKA+GYF+KMKG ERCKPNIVTYNILLRAF Sbjct: 135 MRNTGCRPDTSVYNALITAHLHSRDKTKALAKAIGYFQKMKGMERCKPNIVTYNILLRAF 194 Query: 693 AQARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPDL 872 AQARNVE VNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREME+VLARMKSN+CKPDL Sbjct: 195 AQARNVEQVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPDL 254 Query: 873 ITYNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFKK 1052 IT+NLLID+YGKKQ+F KMEQVFKSLL SKE+ +LPTFNSMILNYGKARLKDKAE+VFK+ Sbjct: 255 ITFNLLIDSYGKKQEFGKMEQVFKSLLRSKERASLPTFNSMILNYGKARLKDKAEDVFKR 314 Query: 1053 MTDMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCINGL 1232 MTDMGYTPSFVTHESLIYM+GFCDCVS+A +LFD LVESK +KVSTLNAMLDVYCINGL Sbjct: 315 MTDMGYTPSFVTHESLIYMYGFCDCVSRAAQLFDELVESKAHIKVSTLNAMLDVYCINGL 374 Query: 1233 PQEADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFLD 1412 PQEADSLF+RA SIKI+PD+ST+KLLYKAYTKAN KELLDKLLKHMDK GI+PNKRFFLD Sbjct: 375 PQEADSLFERANSIKIYPDSSTFKLLYKAYTKANQKELLDKLLKHMDKDGIVPNKRFFLD 434 Query: 1413 ALGAIESLPA----------------------------NSGSANAETASNSPQDLVKNQL 1508 ALGA+ SLPA NS SANA T SN+PQ+ K Sbjct: 435 ALGAVASLPANSESANAATDSKTANSESANAATDSNTSNSKSANAATDSNNPQEFSK--- 491 Query: 1509 ET*LAIHTKNLLAH 1550 LA H KN+LAH Sbjct: 492 ---LATHVKNILAH 502 >ref|XP_003630096.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355524118|gb|AET04572.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 635 Score = 749 bits (1933), Expect = 0.0 Identities = 380/448 (84%), Positives = 403/448 (89%) Frame = +3 Query: 144 PSHVALLTRISCGGSGATRSKREKKTSDQSETRELVRLLKQKISDKEPLVKTLSKYVKLV 323 P ++ + TRISC S TR R K+T+DQSET+ELVRLL +KISDKEPL+KTL+KYVKLV Sbjct: 22 PPYITIPTRISCV-SNPTRINR-KQTTDQSETQELVRLLTRKISDKEPLLKTLNKYVKLV 79 Query: 324 RTEHCFLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWL 503 RTEHCFLLFEELGK DKWLQC+EVFRWMQ+QRWYIADNGVYSKLISVMGKKGQ RLAMWL Sbjct: 80 RTEHCFLLFEELGKHDKWLQCLEVFRWMQRQRWYIADNGVYSKLISVMGKKGQIRLAMWL 139 Query: 504 FSEMRNTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILL 683 FSEMRNTGCRPDTSVYNSLISAHLHSRDK KAL KALGYFEKMK ERCKPNIVTYNILL Sbjct: 140 FSEMRNTGCRPDTSVYNSLISAHLHSRDKSKALVKALGYFEKMKTTERCKPNIVTYNILL 199 Query: 684 RAFAQARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCK 863 RAFAQAR+V VN LFKDLDES VSPDIYTFNGVMD YGKNGMIREMESVL RMKSN+ K Sbjct: 200 RAFAQARDVNQVNYLFKDLDESSVSPDIYTFNGVMDGYGKNGMIREMESVLVRMKSNQVK 259 Query: 864 PDLITYNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENV 1043 DLITYNLLID+YGKKQQFDKMEQVFKSL SKEKPTLPTFNSMILNYGKARLKDKAENV Sbjct: 260 LDLITYNLLIDSYGKKQQFDKMEQVFKSLSRSKEKPTLPTFNSMILNYGKARLKDKAENV 319 Query: 1044 FKKMTDMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCI 1223 F+ MTDMGYTPSFVTHESLI+M+G C CVS AVELFD L+ESKVP+KVSTLNAMLDVYCI Sbjct: 320 FQNMTDMGYTPSFVTHESLIHMYGLCGCVSNAVELFDQLIESKVPIKVSTLNAMLDVYCI 379 Query: 1224 NGLPQEADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRF 1403 NGL QEADSLF RA+SIKIFPDA+TYKLLYKAYTKANSKELLDKLLK MDK +IPNKRF Sbjct: 380 NGLQQEADSLFTRAKSIKIFPDATTYKLLYKAYTKANSKELLDKLLKQMDKDSVIPNKRF 439 Query: 1404 FLDALGAIESLPANSGSANAETASNSPQ 1487 FLDALGAI S SGSANA T S+ PQ Sbjct: 440 FLDALGAIGSSTEKSGSANAGTGSSRPQ 467 >ref|XP_003532845.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic-like [Glycine max] Length = 580 Score = 745 bits (1923), Expect = 0.0 Identities = 373/444 (84%), Positives = 407/444 (91%), Gaps = 2/444 (0%) Frame = +3 Query: 156 ALLTRISCGGSGATRSKREKKTSDQSETRELVRLLKQKI--SDKEPLVKTLSKYVKLVRT 329 A L+RISCGG R + KK++ SE +ELVRLL KI +DKE L+KTL+KYVK VRT Sbjct: 80 APLSRISCGGP---RPPKSKKSNLNSEAQELVRLLTSKIRSNDKEVLLKTLNKYVKQVRT 136 Query: 330 EHCFLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFS 509 +HCFLLFEELGK D WLQC+EVFRWMQKQRWYIADNG+YSKLISVMGKKGQTR+AMWLFS Sbjct: 137 QHCFLLFEELGKHDNWLQCLEVFRWMQKQRWYIADNGIYSKLISVMGKKGQTRMAMWLFS 196 Query: 510 EMRNTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRA 689 EMRNTGCRPDTSVYN+LI+AHL SRDK KALAKA+GYF+KMKG ERCKPNIVTYNILLRA Sbjct: 197 EMRNTGCRPDTSVYNALITAHLRSRDKIKALAKAIGYFQKMKGMERCKPNIVTYNILLRA 256 Query: 690 FAQARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPD 869 FAQARNVE VNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREME+VLARMKSN+CKPD Sbjct: 257 FAQARNVEQVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMEAVLARMKSNQCKPD 316 Query: 870 LITYNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFK 1049 LIT+NLLID+YGKKQ F KMEQVFKSLLHSKE+P+LPTFNSMILNYGKARLKDKAE+VFK Sbjct: 317 LITFNLLIDSYGKKQAFGKMEQVFKSLLHSKERPSLPTFNSMILNYGKARLKDKAEDVFK 376 Query: 1050 KMTDMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCING 1229 KMTDMGYT SFVTHES+IYM+GFCDCVS+A +LFD LVESKV +KVSTLNAMLDVYC+NG Sbjct: 377 KMTDMGYTLSFVTHESMIYMYGFCDCVSRAAQLFDELVESKVHIKVSTLNAMLDVYCLNG 436 Query: 1230 LPQEADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFL 1409 LPQEADSLF+RA SIKI PD+ST+KLLYKAYTKAN KELLDKLLKHMDK GIIPNKRFFL Sbjct: 437 LPQEADSLFERAISIKIHPDSSTFKLLYKAYTKANQKELLDKLLKHMDKDGIIPNKRFFL 496 Query: 1410 DALGAIESLPANSGSANAETASNS 1481 DALGA+ SLPANS SANA T SN+ Sbjct: 497 DALGAVASLPANSESANAATDSNT 520 >ref|XP_002532046.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223528289|gb|EEF30336.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 478 Score = 674 bits (1740), Expect = 0.0 Identities = 332/439 (75%), Positives = 390/439 (88%), Gaps = 4/439 (0%) Frame = +3 Query: 165 TRISCGGSGATRSKREK-KTSDQSETRELVR-LLKQKISDKEPLVKTLSKYVKLVRTEHC 338 T I+C +TR ++++ S++SET +LVR +L+ SDK PLV+TL KYV++VRTEHC Sbjct: 43 THITCV---STRPRKKRFPISEESETEDLVRYVLRSFSSDKVPLVRTLDKYVRVVRTEHC 99 Query: 339 FLLFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFSEMR 518 FLLFEELG+ DKWLQC+EVFRWMQKQRWYIAD+GVYSKLISVMGKKGQTR+AMWLFSEMR Sbjct: 100 FLLFEELGRRDKWLQCLEVFRWMQKQRWYIADSGVYSKLISVMGKKGQTRMAMWLFSEMR 159 Query: 519 NTGCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRAFAQ 698 N+GCRPD+SVYN+LI+AHLHS+DK KAL KALGYFEKMKG +RC+PN+VTYNILLRAFAQ Sbjct: 160 NSGCRPDSSVYNALITAHLHSKDKAKALIKALGYFEKMKGMQRCQPNVVTYNILLRAFAQ 219 Query: 699 ARNVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPDLIT 878 ARNV VN+LFKDLD+SIVSPDIYT+NGVMDAYGKNGMIREMESVL+RMKSN+CKPD+IT Sbjct: 220 ARNVNQVNALFKDLDQSIVSPDIYTYNGVMDAYGKNGMIREMESVLSRMKSNQCKPDIIT 279 Query: 879 YNLLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFKKMT 1058 +NLLID+YGKKQ FDKMEQVFKSLLHSKE+PTLPTFNSMI NYGKAR K+ AE+V +KMT Sbjct: 280 FNLLIDSYGKKQDFDKMEQVFKSLLHSKERPTLPTFNSMITNYGKARQKENAESVLQKMT 339 Query: 1059 DMGYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCINGLPQ 1238 M YTP+F+T+ESLI M+GFCD VSKA E+FD ++ES VKVSTLNAMLDVYC+NGLP Sbjct: 340 KMKYTPNFITYESLIMMYGFCDSVSKAREIFDDMIESGKEVKVSTLNAMLDVYCLNGLPM 399 Query: 1239 EADSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFLDAL 1418 EAD LF AR++ + PD++TYKLLYKAYTKAN K+L+ KLLKHMD+ GIIPNKRFFLDAL Sbjct: 400 EADLLFDNARNVGLLPDSTTYKLLYKAYTKANMKKLVQKLLKHMDRDGIIPNKRFFLDAL 459 Query: 1419 GAIESLPANSGSA--NAET 1469 GA +SLPA+SG+ NA+T Sbjct: 460 GAFKSLPASSGNQQNNAKT 478 >ref|XP_002276540.1| PREDICTED: pentatricopeptide repeat-containing protein At4g39620, chloroplastic [Vitis vinifera] gi|296082481|emb|CBI21486.3| unnamed protein product [Vitis vinifera] Length = 489 Score = 672 bits (1735), Expect = 0.0 Identities = 327/445 (73%), Positives = 382/445 (85%) Frame = +3 Query: 165 TRISCGGSGATRSKREKKTSDQSETRELVRLLKQKISDKEPLVKTLSKYVKLVRTEHCFL 344 T +SC + R K D+SE ELVR+L + + PL+ TL+KYVK++RTEHCF Sbjct: 46 TVVSCVSTRPRRKPGPKP--DKSEVEELVRVLMKNFGGERPLISTLNKYVKVIRTEHCFR 103 Query: 345 LFEELGKDDKWLQCIEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRLAMWLFSEMRNT 524 LFEELGK DKWLQC+EVFRWMQKQRWYIADNGVYSKLISVMGKKGQTR+AMWLFSEMRN+ Sbjct: 104 LFEELGKTDKWLQCLEVFRWMQKQRWYIADNGVYSKLISVMGKKGQTRMAMWLFSEMRNS 163 Query: 525 GCRPDTSVYNSLISAHLHSRDKKKALAKALGYFEKMKGYERCKPNIVTYNILLRAFAQAR 704 GCRPDTSVYN+LI+AHLHSRDK KAL KALGYF+KMKG ERCKPNIVTYNILLRAFAQA+ Sbjct: 164 GCRPDTSVYNALITAHLHSRDKSKALIKALGYFDKMKGMERCKPNIVTYNILLRAFAQAQ 223 Query: 705 NVEGVNSLFKDLDESIVSPDIYTFNGVMDAYGKNGMIREMESVLARMKSNRCKPDLITYN 884 NV N+LFK+L+ESIVSPDI+TFNGVMDAYGKNGMI+EMESVL+RMKSN+CKPD+IT+N Sbjct: 224 NVNQANALFKELNESIVSPDIFTFNGVMDAYGKNGMIKEMESVLSRMKSNQCKPDIITFN 283 Query: 885 LLIDAYGKKQQFDKMEQVFKSLLHSKEKPTLPTFNSMILNYGKARLKDKAENVFKKMTDM 1064 +LID+YG++Q+FDKMEQVFKSLL SKEKPTLPTFNSMI NYGKARLK+KAENVFKKMTDM Sbjct: 284 VLIDSYGRRQEFDKMEQVFKSLLRSKEKPTLPTFNSMITNYGKARLKEKAENVFKKMTDM 343 Query: 1065 GYTPSFVTHESLIYMFGFCDCVSKAVELFDGLVESKVPVKVSTLNAMLDVYCINGLPQEA 1244 GY P+F+T+ESLI M+GFCDC+S+A E+FD ++ SK +KVSTLNAML+VYC+NGLP EA Sbjct: 344 GYAPNFITYESLIMMYGFCDCISRAREIFDEMMASKKEMKVSTLNAMLEVYCMNGLPMEA 403 Query: 1245 DSLFQRARSIKIFPDASTYKLLYKAYTKANSKELLDKLLKHMDKVGIIPNKRFFLDALGA 1424 D L +RAR + FP +STYKLLYKAYTKA+ KELL+KLLK MD GI+PNKRFFL+ALGA Sbjct: 404 DLLLERARKNRPFPGSSTYKLLYKAYTKADQKELLEKLLKLMDSDGILPNKRFFLEALGA 463 Query: 1425 IESLPANSGSANAETASNSPQDLVK 1499 S PA+ SA + T P++ K Sbjct: 464 FGSSPASQESAGSTTGLTRPRNSAK 488