BLASTX nr result
ID: Lithospermum22_contig00030159
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Lithospermum22_contig00030159 (1811 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002269319.2| PREDICTED: pentatricopeptide repeat-containi... 543 e-152 ref|XP_003535694.1| PREDICTED: pentatricopeptide repeat-containi... 521 e-145 ref|XP_002518643.1| pentatricopeptide repeat-containing protein,... 354 4e-95 ref|NP_172286.1| pentatricopeptide repeat-containing protein [Ar... 340 6e-91 gb|AEP33760.1| organelle transcript processing 82, partial [Caps... 337 8e-90 >ref|XP_002269319.2| PREDICTED: pentatricopeptide repeat-containing protein At2g29760, chloroplastic-like [Vitis vinifera] Length = 532 Score = 543 bits (1399), Expect = e-152 Identities = 261/453 (57%), Positives = 340/453 (75%) Frame = -3 Query: 1791 MPDKSVICCWTNLISGYAVSGQSEKALELFCSMVKDNLQPENDTMVSVLSACSSLEAAKV 1612 MPD++++ CWT+LI+G A SGQ+E+ L LF MVK+NL+PENDT+VSVLSACS LEA ++ Sbjct: 1 MPDRAMVRCWTSLIAGSAQSGQTEEVLRLFFMMVKENLRPENDTIVSVLSACSKLEAVEI 60 Query: 1611 ERWANVFDELGNDHDREKLVGDNVNVVLVYLYGKQGKFDESRERFDRISIQGKRSVLSWN 1432 E+W + E ND D D+VN VL YLYGK GK ++ +ERFD I GKRSVL WN Sbjct: 61 EKWVMILSEFINDDDTGSFGRDSVNTVLAYLYGKWGKVEKCKERFDEIVGIGKRSVLPWN 120 Query: 1431 VMIGAYVQNGCYLEGLSLYRLMMEEYDCIPNHVTMVYVLSACAHVGDIDIGTRVHSYLRT 1252 V+I AYVQNGC E LSL+R+M+E+ + PNHVTMV VLSACA VGD+D+G +H Y+++ Sbjct: 121 VIISAYVQNGCSFEALSLFRVMIEDLNLRPNHVTMVSVLSACAQVGDLDLGKWIHGYVKS 180 Query: 1251 KEHRGVLFSNKNLATALIDMYYKCGNVEKARDVFGEIRTKDIVLFNAMIMGLAVNGKGEA 1072 + + ++ SN LATALIDMY KCGN+ KA+DVF ++ +KD+V FNAMIMGLA+NG+GE Sbjct: 181 EGCKAIVESNTFLATALIDMYSKCGNLGKAKDVFEQMVSKDVVSFNAMIMGLAINGEGEE 240 Query: 1071 AFSYFSEIQELGLHPDAATFLGLLCACSHSGLLEKGRQVFKDMISKFKISPKLEHYASYI 892 A FS++QEL L P++ TFLG+LCACSHSGLL+ GRQ+F DMI F + P+LEHYA Y+ Sbjct: 241 ALRLFSKMQELSLRPNSGTFLGVLCACSHSGLLDTGRQMFLDMIPHFSVPPELEHYACYV 300 Query: 891 DLLARVGCIKEAMQVVSSMPFEPNQYVWGALLSGSLLHDQSDIAQIASSMLLKSDPGNSG 712 DLLARVG ++EA +VV+SMPF PN +VWGALL G LH + ++AQ S L+K DP NS Sbjct: 301 DLLARVGLLEEAFEVVASMPFVPNNFVWGALLQGCRLHSRLELAQDVSQKLVKVDPENSA 360 Query: 711 GYVMLSNSFASDNRWGDVSGVRGFMRDKGVTKHPGCSWIGIYNMVHEFVAGSTSHPNCEV 532 GYVM SN+ ASD +WG+VSG+R MR+KGV KHPGCSWI + +VHEF+AGS SHP + Sbjct: 361 GYVMFSNALASDQQWGEVSGLRWLMREKGVRKHPGCSWISVNRVVHEFLAGSLSHPQIDS 420 Query: 531 VHSRLEELIKEMKIPSL*ETTNLVCILYWMNNI 433 ++ L L+KEMK+ + T L +L+ N + Sbjct: 421 IYHTLNGLVKEMKVFASYHNTQLDALLHTENKV 453 >ref|XP_003535694.1| PREDICTED: pentatricopeptide repeat-containing protein At3g12770-like [Glycine max] Length = 634 Score = 521 bits (1341), Expect = e-145 Identities = 263/452 (58%), Positives = 334/452 (73%), Gaps = 1/452 (0%) Frame = -3 Query: 1809 RKLFDEMPDKSVICCWTNLISGYAVSGQSEKALELFCSMVKDNLQPENDTMVSVLSACSS 1630 RK+FDE+PDK ++ CWTNLI+G+A SG SE+ L+LF MV+ NL P++DTMVSVLSACSS Sbjct: 186 RKVFDEIPDKMLVSCWTNLITGFAQSGHSEEVLQLFQVMVRQNLLPQSDTMVSVLSACSS 245 Query: 1629 LEAAKVERWANVFDEL-GNDHDREKLVGDNVNVVLVYLYGKQGKFDESRERFDRISIQGK 1453 LE K+E+W NVF EL G+ + D+VN VLVYL+GK G+ ++SRE FDRIS GK Sbjct: 246 LEMPKIEKWVNVFLELVGDGVSTRETCHDSVNTVLVYLFGKWGRIEKSRENFDRISTSGK 305 Query: 1452 RSVLSWNVMIGAYVQNGCYLEGLSLYRLMMEEYDCIPNHVTMVYVLSACAHVGDIDIGTR 1273 SV+ WN MI AYVQNGC +EGL+L+R+M+EE PNH+TMV VLSACA +GD+ G+ Sbjct: 306 SSVVPWNAMINAYVQNGCPVEGLNLFRMMVEEETTRPNHITMVSVLSACAQIGDLSFGSW 365 Query: 1272 VHSYLRTKEHRGVLFSNKNLATALIDMYYKCGNVEKARDVFGEIRTKDIVLFNAMIMGLA 1093 VH YL + HR + SN+ LAT+LIDMY KCGN++KA+ VF +KD+VLFNAMIMGLA Sbjct: 366 VHGYLISLGHRHTIGSNQILATSLIDMYSKCGNLDKAKKVFEHTVSKDVVLFNAMIMGLA 425 Query: 1092 VNGKGEAAFSYFSEIQELGLHPDAATFLGLLCACSHSGLLEKGRQVFKDMISKFKISPKL 913 V GKGE A F +I E GL P+A TFLG L ACSHSGLL +GRQ+F+++ ++ L Sbjct: 426 VYGKGEDALRLFYKIPEFGLQPNAGTFLGALSACSHSGLLVRGRQIFRELTLSTTLT--L 483 Query: 912 EHYASYIDLLARVGCIKEAMQVVSSMPFEPNQYVWGALLSGSLLHDQSDIAQIASSMLLK 733 EH A YIDLLARVGCI+EA++VV+SMPF+PN +VWGALL G LLH + ++AQ S L++ Sbjct: 484 EHCACYIDLLARVGCIEEAIEVVTSMPFKPNNFVWGALLGGCLLHSRVELAQEVSRRLVE 543 Query: 732 SDPGNSGGYVMLSNSFASDNRWGDVSGVRGFMRDKGVTKHPGCSWIGIYNMVHEFVAGST 553 DP NS GYVML+N+ ASDN+W DVSG+R M++KGV K PG SWI + VHEF+ G Sbjct: 544 VDPDNSAGYVMLANALASDNQWSDVSGLRLEMKEKGVKKQPGSSWIIVDGAVHEFLVGCL 603 Query: 552 SHPNCEVVHSRLEELIKEMKIPSL*ETTNLVC 457 SHP E ++ L L+K MK+ S NLVC Sbjct: 604 SHPEIEGIYHTLAGLVKNMKVAS---HFNLVC 632 >ref|XP_002518643.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223542024|gb|EEF43568.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 318 Score = 354 bits (909), Expect = 4e-95 Identities = 173/283 (61%), Positives = 221/283 (78%) Frame = -3 Query: 1791 MPDKSVICCWTNLISGYAVSGQSEKALELFCSMVKDNLQPENDTMVSVLSACSSLEAAKV 1612 MP+K ++CCWT+LISG+A SG SE+ L++FC MVK+NL+PENDTMVSV SACS+LE ++ Sbjct: 1 MPEKGLLCCWTSLISGFAQSGYSEEVLKIFCLMVKENLEPENDTMVSVFSACSNLEICQI 60 Query: 1611 ERWANVFDELGNDHDREKLVGDNVNVVLVYLYGKQGKFDESRERFDRISIQGKRSVLSWN 1432 E+W V EL D + + D VN VLVYLYGK GK ++SRERFD IS GKRSVL WN Sbjct: 61 EKWLTVLVELNIDGNLKSSTCDKVNNVLVYLYGKLGKIEKSRERFDDISDSGKRSVLPWN 120 Query: 1431 VMIGAYVQNGCYLEGLSLYRLMMEEYDCIPNHVTMVYVLSACAHVGDIDIGTRVHSYLRT 1252 MI AYVQNG LE L ++ LM+++ + PNHVTMV VLSACA VG++++G VH YL+ Sbjct: 121 SMINAYVQNGYALEALCIFHLMVKDPNSRPNHVTMVSVLSACAQVGNLELGRWVHEYLKF 180 Query: 1251 KEHRGVLFSNKNLATALIDMYYKCGNVEKARDVFGEIRTKDIVLFNAMIMGLAVNGKGEA 1072 K +GVL SN LATALIDMY KCG+++KA++VF ++ +KD+V FNAMIMGLA+NG+G+ Sbjct: 181 KGRKGVLESNTFLATALIDMYSKCGSLDKAKEVFYQMVSKDVVSFNAMIMGLAINGEGQE 240 Query: 1071 AFSYFSEIQELGLHPDAATFLGLLCACSHSGLLEKGRQVFKDM 943 A FS++QELGLHP+ TFLGLL ACSHSGL ++GRQ+F DM Sbjct: 241 AVKLFSKVQELGLHPNGGTFLGLLWACSHSGLSDEGRQIFLDM 283 >ref|NP_172286.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75174869|sp|Q9LN01.1|PPR21_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g08070 gi|8778839|gb|AAF79838.1|AC026875_18 T6D22.15 [Arabidopsis thaliana] gi|332190118|gb|AEE28239.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 741 Score = 340 bits (873), Expect = 6e-91 Identities = 180/436 (41%), Positives = 274/436 (62%), Gaps = 2/436 (0%) Frame = -3 Query: 1809 RKLFDEMPDKSVICCWTNLISGYAVSGQSEKALELFCSMVKDNLQPENDTMVSVLSACSS 1630 +KLFDE+P K V+ W +ISGYA +G ++ALELF M+K N++P+ TMV+V+SAC+ Sbjct: 220 QKLFDEIPVKDVVS-WNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278 Query: 1629 LEAAKVERWANVFDELGNDHDREKLVGDNVNVV--LVYLYGKQGKFDESRERFDRISIQG 1456 + ++ R +++ + DH G N+ +V L+ LY K G+ + + F+R+ Sbjct: 279 SGSIELGRQVHLWID---DHG----FGSNLKIVNALIDLYSKCGELETACGLFERLPY-- 329 Query: 1455 KRSVLSWNVMIGAYVQNGCYLEGLSLYRLMMEEYDCIPNHVTMVYVLSACAHVGDIDIGT 1276 + V+SWN +IG Y Y E L L++ M+ + PN VTM+ +L ACAH+G IDIG Sbjct: 330 -KDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGET-PNDVTMLSILPACAHLGAIDIGR 387 Query: 1275 RVHSYLRTKEHRGVLFSNKNLATALIDMYYKCGNVEKARDVFGEIRTKDIVLFNAMIMGL 1096 +H Y+ K +GV ++ +L T+LIDMY KCG++E A VF I K + +NAMI G Sbjct: 388 WIHVYI-DKRLKGVTNAS-SLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGF 445 Query: 1095 AVNGKGEAAFSYFSEIQELGLHPDAATFLGLLCACSHSGLLEKGRQVFKDMISKFKISPK 916 A++G+ +A+F FS ++++G+ PD TF+GLL ACSHSG+L+ GR +F+ M +K++PK Sbjct: 446 AMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPK 505 Query: 915 LEHYASYIDLLARVGCIKEAMQVVSSMPFEPNQYVWGALLSGSLLHDQSDIAQIASSMLL 736 LEHY IDLL G KEA ++++ M EP+ +W +LL +H ++ + + L+ Sbjct: 506 LEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLI 565 Query: 735 KSDPGNSGGYVMLSNSFASDNRWGDVSGVRGFMRDKGVTKHPGCSWIGIYNMVHEFVAGS 556 K +P N G YV+LSN +AS RW +V+ R + DKG+ K PGCS I I ++VHEF+ G Sbjct: 566 KIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGD 625 Query: 555 TSHPNCEVVHSRLEEL 508 HP ++ LEE+ Sbjct: 626 KFHPRNREIYGMLEEM 641 Score = 86.3 bits (212), Expect = 2e-14 Identities = 65/247 (26%), Positives = 109/247 (44%), Gaps = 27/247 (10%) Frame = -3 Query: 1449 SVLSWNVMIGAYVQNGCYLEGLSLYRLMMEEYDCIPNHVTMVYVLSACAHVGDIDIGTRV 1270 ++L WN M + + + L LY M+ +PN T +VL +CA G ++ Sbjct: 98 NLLIWNTMFRGHALSSDPVSALKLYVCMIS-LGLLPNSYTFPFVLKSCAKSKAFKEGQQI 156 Query: 1269 HSYLRTKEHRGVLFSNKNLA---------------------------TALIDMYYKCGNV 1171 H ++ L+ + +L TALI Y G + Sbjct: 157 HGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYI 216 Query: 1170 EKARDVFGEIRTKDIVLFNAMIMGLAVNGKGEAAFSYFSEIQELGLHPDAATFLGLLCAC 991 E A+ +F EI KD+V +NAMI G A G + A F ++ + + PD +T + ++ AC Sbjct: 217 ENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSAC 276 Query: 990 SHSGLLEKGRQVFKDMISKFKISPKLEHYASYIDLLARVGCIKEAMQVVSSMPFEPNQYV 811 + SG +E GRQV I L+ + IDL ++ G ++ A + +P++ + Sbjct: 277 AQSGSIELGRQVHL-WIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYK-DVIS 334 Query: 810 WGALLSG 790 W L+ G Sbjct: 335 WNTLIGG 341 >gb|AEP33760.1| organelle transcript processing 82, partial [Capsella bursa-pastoris] Length = 706 Score = 337 bits (863), Expect = 8e-90 Identities = 182/436 (41%), Positives = 273/436 (62%), Gaps = 2/436 (0%) Frame = -3 Query: 1809 RKLFDEMPDKSVICCWTNLISGYAVSGQSEKALELFCSMVKDNLQPENDTMVSVLSACSS 1630 +K+FDE+P K V+ W LISGYA +G ++ALELF M+K N++P+ TMV+VLSAC+ Sbjct: 189 QKMFDEIPVKDVVS-WNALISGYAETGNYKEALELFKEMMKTNVKPDESTMVTVLSACA- 246 Query: 1629 LEAAKVERWANVFDELGNDHDREKLVGDNVNVV--LVYLYGKQGKFDESRERFDRISIQG 1456 ++A +E V + +DH G N+ +V L+ LY K G+ + + F+ +S Sbjct: 247 -QSASIELGRQVHSWI-DDHG----FGSNLKIVNALIDLYIKCGEVETASGLFEGLSY-- 298 Query: 1455 KRSVLSWNVMIGAYVQNGCYLEGLSLYRLMMEEYDCIPNHVTMVYVLSACAHVGDIDIGT 1276 + V+SWN +IG Y Y E L L++ M+ + PN VTM+ +L ACAH+G IDIG Sbjct: 299 -KDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGES-PNEVTMLSILPACAHLGAIDIGR 356 Query: 1275 RVHSYLRTKEHRGVLFSNKNLATALIDMYYKCGNVEKARDVFGEIRTKDIVLFNAMIMGL 1096 +H Y+ K +GV + +L T+LIDMY KCG++E A+ VF + + + +NAMI G Sbjct: 357 WIHVYI-DKRLKGVS-NPSSLRTSLIDMYAKCGDIEAAQQVFDSMLNRSLSSWNAMIFGF 414 Query: 1095 AVNGKGEAAFSYFSEIQELGLHPDAATFLGLLCACSHSGLLEKGRQVFKDMISKFKISPK 916 A++G+ AF FS +++ G+ PD TF+GLL ACSHSG+L+ GR +F+ M +KI+PK Sbjct: 415 AMHGRANPAFDIFSRMRKDGIEPDDITFVGLLSACSHSGMLDLGRHIFRSMTEDYKITPK 474 Query: 915 LEHYASYIDLLARVGCIKEAMQVVSSMPFEPNQYVWGALLSGSLLHDQSDIAQIASSMLL 736 LEHY IDLL G KEA ++++SM +P+ +W +LL +H ++ + + L+ Sbjct: 475 LEHYGCMIDLLGHSGLFKEAEEMINSMEMDPDGVIWCSLLKACKMHGNVELGESFAQNLI 534 Query: 735 KSDPGNSGGYVMLSNSFASDNRWGDVSGVRGFMRDKGVTKHPGCSWIGIYNMVHEFVAGS 556 K +P NSG YV+LSN +A+ RW +V+ R + DKG+ K PGCS I I ++VHEF+ G Sbjct: 535 KIEPKNSGSYVLLSNIYATAGRWNEVAKRRALLNDKGMKKVPGCSSIEIDSVVHEFIIGD 594 Query: 555 TSHPNCEVVHSRLEEL 508 HP ++ LEE+ Sbjct: 595 KLHPRNREIYGMLEEM 610 Score = 149 bits (376), Expect = 2e-33 Identities = 113/378 (29%), Positives = 185/378 (48%), Gaps = 30/378 (7%) Frame = -3 Query: 1803 LFDEMPDKSVICCWTNLISGYAVSGQSEKALELFCSMVKDNLQPENDTMVSVLSACSSLE 1624 +FD + + +++ W + G+A+S AL L+ M+ L P + T +L AC+ + Sbjct: 59 VFDSIQEPNLLI-WNTMFRGHALSSDPVSALYLYVCMISLGLVPNSYTFPFLLKACAKSK 117 Query: 1623 AAKV-ERWANVFDELGNDHDREKLVGDNVNVVLVYLYGKQGKFDESRERFDRIS------ 1465 A + ++ +LG D D V+ L+ +Y K G+ +++R+ FD+ S Sbjct: 118 AFREGQQIHGHVLKLGCDLDL------YVHTSLIAMYVKNGRXEDARKVFDQSSHRDVVS 171 Query: 1464 ----IQGKRS------------------VLSWNVMIGAYVQNGCYLEGLSLYRLMMEEYD 1351 I+G S V+SWN +I Y + G Y E L L++ MM+ + Sbjct: 172 YTALIKGYASNGYIXSAQKMFDEIPVKDVVSWNALISGYAETGNYKEALELFKEMMKT-N 230 Query: 1350 CIPNHVTMVYVLSACAHVGDIDIGTRVHSYLRTKEHRGVLFSNKNLATALIDMYYKCGNV 1171 P+ TMV VLSACA I++G +VHS++ +H SN + ALID+Y KCG V Sbjct: 231 VKPDESTMVTVLSACAQSASIELGRQVHSWI--DDHG--FGSNLKIVNALIDLYIKCGEV 286 Query: 1170 EKARDVFGEIRTKDIVLFNAMIMGLAVNGKGEAAFSYFSEIQELGLHPDAATFLGLLCAC 991 E A +F + KD++ +N +I G + A F E+ G P+ T L +L AC Sbjct: 287 ETASGLFEGLSYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGESPNEVTMLSILPAC 346 Query: 990 SHSGLLEKGRQVFKDMISKFK-ISPKLEHYASYIDLLARVGCIKEAMQVVSSMPFEPNQY 814 +H G ++ GR + + + K +S S ID+ A+ G I+ A QV SM + Sbjct: 347 AHLGAIDIGRWIHVYIDKRLKGVSNPSSLRTSLIDMYAKCGDIEAAQQVFDSM-LNRSLS 405 Query: 813 VWGALLSGSLLHDQSDIA 760 W A++ G +H +++ A Sbjct: 406 SWNAMIFGFAMHGRANPA 423