BLASTX nr result

ID: Lithospermum22_contig00022340

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum22_contig00022340
         (1401 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AFK33630.1| unknown [Lotus japonicus]                              313   9e-83
ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containi...   310   8e-82
ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containi...   298   2e-78
ref|XP_003612457.1| Pentatricopeptide repeat-containing protein ...   292   1e-76
ref|NP_174459.1| pentatricopeptide repeat-containing protein [Ar...   283   1e-73

>gb|AFK33630.1| unknown [Lotus japonicus]
          Length = 356

 Score =  313 bits (801), Expect = 9e-83
 Identities = 157/345 (45%), Positives = 222/345 (64%), Gaps = 1/345 (0%)
 Frame = -2

Query: 1229 KPNKDTIIKKTTCSDILYLMDTLNLPISLELYISLIKECTKNQDPFQAIKLSNHIMESGL 1050
            K  K    K  T S IL+LMD L  PI +++Y SLIKECT + DP  AI+L  HI  SG+
Sbjct: 2    KKKKKRKRKGATTSHILHLMDVLPFPIPIDIYTSLIKECTLSPDPQTAIELHTHIAHSGI 61

Query: 1049 KPTLYFLNKMLLMHICCGCYDRAKILFDRMPHKNLNTWAMFIAGCVENYEYNDVINMFIR 870
            KP L F+N++L+M + CG  D A  LFD MP K+ N+WA       +N +Y + I++F+ 
Sbjct: 62   KPPLSFINRILVMFVSCGLLDYACQLFDAMPVKDFNSWATLFIAYYDNADYEEAIDVFLA 121

Query: 869  LLRESKFRDRCDGSLVVSGVVICVLKACLAVGDLELGKQIHGWIFKMGYWRNMSLTSFLI 690
            +L +    +          +  C LKAC  + ++ LG Q+HGW+ K+G   ++ L+S LI
Sbjct: 122  MLHQLGMSE------FPPWICACFLKACACIENIPLGMQVHGWLLKLGTCDHVLLSSSLI 175

Query: 689  SFYRKFGWLEGRENVFDHIPLRNTSIWNARMVT-CGSEEWSEGVRLYKQMGREGVKRSKY 513
             FY +F  ++    VF+ +   NTS W A++V+ C   ++ E    +K+MGR+G+K+  Y
Sbjct: 176  RFYGRFTCVKDANAVFNKLSRHNTSTWTAKIVSGCREMDFPEVFNDFKEMGRQGIKKDTY 235

Query: 512  TFSSVLKACSKVSDGGSSGRQVHGNALKVGLDEDNYVRCGLISMYGKSGLLNEARMVHQT 333
            TFSSVLKAC K+ D G  G QVH +A+K+GL  DNYV+C LI+MYG+SGLL +A+ V +T
Sbjct: 236  TFSSVLKACGKMMDHGRCGEQVHADAMKLGLASDNYVQCSLIAMYGRSGLLRDAKQVFET 295

Query: 332  SREKRNDACWNALLTGYLQNGCCVEAIKLLYDMKAAGMQPPESLL 198
            SR +RN   WNA+L GYL+NG  +EA+K LY MKAAG++P ESLL
Sbjct: 296  SRSERNVDSWNAMLMGYLENGLYIEAVKFLYQMKAAGLKPHESLL 340


>ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Glycine max]
          Length = 423

 Score =  310 bits (793), Expect = 8e-82
 Identities = 164/401 (40%), Positives = 244/401 (60%), Gaps = 8/401 (1%)
 Frame = -2

Query: 1334 TKTSTLSTNYFPSNARIQIQQPLHK-PQHKSYQSIKKPNKDTIIKK------TTCSDILY 1176
            +K  TL +       R+ ++ P+H  P H S Q + +    T  KK       T SDIL+
Sbjct: 27   SKLRTLPSPNHQLEFRLPLRHPIHNFPNHTSPQPLTQTTTFTKKKKKKKRKGATTSDILH 86

Query: 1175 LMDTLNLPISLELYISLIKECTKNQDPFQAIKLSNHIMESGLKPTLYFLNKMLLMHICCG 996
            LM+ L  P+ +++Y SLIKECT + DP  AI+L+ HI +SG+KP L FLN++L+M + CG
Sbjct: 87   LMEALPFPVPIDIYTSLIKECTVSGDPETAIELATHISKSGIKPPLPFLNRILVMFVSCG 146

Query: 995  CYDRAKILFDRMPHKNLNTWAMFIAGCVENYEYNDVINMFIRLLRESKFRDRCDGSLVVS 816
              + A+ +FD+M  ++ NTWA       +N +Y +  N+F+ +L +    +         
Sbjct: 147  LLENARHMFDKMRVRDFNTWATLFVAYYDNTDYEEATNVFVNMLTQLGMME------FPP 200

Query: 815  GVVICVLKACLAVGDLELGKQIHGWIFKMGYWRNMSLTSFLISFYRKFGWLEGRENVFDH 636
             +  C+L+AC    ++ LG Q+HGW+ K+G   ++ L+S LI+FY +F  LE    VFD 
Sbjct: 201  WIWACLLRACACTVNVPLGMQVHGWLLKLGTCDHVLLSSSLINFYGRFTCLEDASVVFDG 260

Query: 635  IPLRNTSIWNARMVT-CGSEEWSEGVRLYKQMGREGVKRSKYTFSSVLKACSKVSDGGSS 459
            +   NT  W A++V+ C    +SE    +K+MG  GVK+  +TFSSVLKAC ++ +    
Sbjct: 261  VSRHNTLTWTAKIVSGCRERHFSEVFDDFKEMGMRGVKKDCFTFSSVLKACGRMLNQERC 320

Query: 458  GRQVHGNALKVGLDEDNYVRCGLISMYGKSGLLNEARMVHQTSREKRNDACWNALLTGYL 279
            G QVH +A+K+GL  D+YV+C LI+MYG+ GLL +A+ V + S+E+R   CWNA+L GY+
Sbjct: 321  GEQVHVDAIKLGLVSDHYVQCSLIAMYGRCGLLEDAKRVFEMSQEERKVDCWNAMLMGYI 380

Query: 278  QNGCCVEAIKLLYDMKAAGMQPPESLLKQSTAWLKSWIYGN 156
            QNG  +EA+K LY M+AAGMQP ESLLK+      S  Y N
Sbjct: 381  QNGLYIEAVKFLYQMQAAGMQPRESLLKKLRMACGSISYSN 421


>ref|XP_003631528.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Vitis vinifera]
          Length = 414

 Score =  298 bits (763), Expect = 2e-78
 Identities = 159/355 (44%), Positives = 228/355 (64%), Gaps = 8/355 (2%)
 Frame = -2

Query: 1232 KKPNKDTIIKKTTCSDILYLMDTLNLPISLELYISLIKECTKNQDPFQAIKLSNHIMESG 1053
            KK N +     +T +DIL LMD L LPI  ++Y SLIKE +   D  QA +L  HI  SG
Sbjct: 48   KKSNSNATPTTSTPTDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAHINRSG 107

Query: 1052 LKPTLYFLNKMLLMHICCGCYDRAKILFDRMP--HKNLNTWAMFIAGCVENYEYNDVINM 879
            L  +   LN++LLM++ CG    A+ +FD+M   +KN  +WA+ +A  ++N  Y + I +
Sbjct: 108  LPLSSALLNRILLMYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFL 167

Query: 878  FIRLLRESKFRDRCDGSLVV---SGVVICVLKACLAVGDLELGKQIHGWIFKMGYWRNMS 708
            F++++           ++++   + + ICVLKAC+   +L LGKQ+HGW+ K+GY  N+ 
Sbjct: 168  FVQMME-------LHSTIMLELPAWIFICVLKACVHTMNLTLGKQVHGWLLKVGYATNLF 220

Query: 707  LTSFLISFYRKFGWLEGRENVFDHIPLRNTSIWNARMVT-CGSEEWSEGVRLYKQMGREG 531
            L+ +LISFY KF  L+  + VFD    RNT IW A+MV  C  E   E +  + +MGR G
Sbjct: 221  LSCYLISFYGKFRCLDDADFVFDQTSERNTVIWTAKMVNKCQGEYMHEALVAFTEMGRAG 280

Query: 530  VKRSKYTFSSVLKACSKVSDGGSSGRQVHGNALKVGLDEDNYVRCGLISMYGKSGLLNEA 351
            VKR+++T+SSVL+AC ++ D G  GR +H + +K+GL+ D YV+CGL+ MYGK GLL EA
Sbjct: 281  VKRNEFTYSSVLRACGRMKDHGRCGRLIHASTIKLGLESDIYVQCGLVDMYGKCGLLVEA 340

Query: 350  RMVHQT--SREKRNDACWNALLTGYLQNGCCVEAIKLLYDMKAAGMQPPESLLKQ 192
            R V +T     K N  CWNA+LTGY+++G  +EAIK LY MKAAG+QP ESLL +
Sbjct: 341  RRVFETVSDTNKTNIVCWNAMLTGYIRHGLYIEAIKFLYQMKAAGIQPQESLLNE 395


>ref|XP_003612457.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355513792|gb|AES95415.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 418

 Score =  292 bits (748), Expect = 1e-76
 Identities = 158/397 (39%), Positives = 245/397 (61%), Gaps = 7/397 (1%)
 Frame = -2

Query: 1361 PTKKSTYNMTKTSTLSTNYFPSNARIQIQQPLHKPQ-----HKSYQSIKKPNKDTIIKKT 1197
            PT++   + T T++  +N+ P   R+ +++   KP+     H S Q I  P K    +K 
Sbjct: 13   PTRRRNADTTSTTSPPSNHQPHLLRLPLRRN-PKPKNLSLIHPSSQPITPPKKSKRRRKC 71

Query: 1196 -TCSDILYLMDTLNLPISLELYISLIKECTKNQDPFQAIKLSNHIMESGLKPTLYFLNKM 1020
             T S IL LMD L+ PI++++Y SL+KECT + DP  AI+L   I+  G++  L  LN++
Sbjct: 72   DTTSHILPLMDALHFPITIDIYTSLVKECTLSTDPETAIELHTQIITRGIELPLTLLNRI 131

Query: 1019 LLMHICCGCYDRAKILFDRMPHKNLNTWAMFIAGCVENYEYNDVINMFIRLLRESKFRDR 840
            L+M + CG  + A+ +FD M  ++ ++WA       EN EY + I++F+ +L +      
Sbjct: 132  LIMFVSCGLLENARRVFDVMSVRDFHSWATLFVSYYENGEYENAIDVFVSMLCQLDVM-- 189

Query: 839  CDGSLVVSGVVICVLKACLAVGDLELGKQIHGWIFKMGYWRNMSLTSFLISFYRKFGWLE 660
              G      +  C+LKAC    ++ LG Q+HG + K+G   ++ ++S LI FY +F  LE
Sbjct: 190  --GFSFPPWIWSCLLKACACTMNVPLGMQVHGCLLKLGACDHVLISSSLIRFYGRFKCLE 247

Query: 659  GRENVFDHIPLRNTSIWNARMVT-CGSEEWSEGVRLYKQMGREGVKRSKYTFSSVLKACS 483
                VF+ +   NT  W A++V+ C    +SE +  +K+MGR GVK+  +TFSSVLKAC 
Sbjct: 248  DANMVFNRVSRHNTLTWTAKIVSSCRERHFSEALGDFKKMGRVGVKKDSFTFSSVLKACG 307

Query: 482  KVSDGGSSGRQVHGNALKVGLDEDNYVRCGLISMYGKSGLLNEARMVHQTSREKRNDACW 303
            ++ + GS G QVH +A+K+GLD D+YV+C LI+MYG+SGLL +A +V + +R +RN    
Sbjct: 308  RMQNRGSCGEQVHADAIKLGLDSDSYVQCSLIAMYGRSGLLRDAELVFEMTRNERNVDSL 367

Query: 302  NALLTGYLQNGCCVEAIKLLYDMKAAGMQPPESLLKQ 192
            NA+L GY+QNG  +EA+K +Y MKAAG+QP E LL++
Sbjct: 368  NAMLMGYIQNGLYIEAVKFVYQMKAAGVQPHEPLLEK 404


>ref|NP_174459.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75169166|sp|Q9C6R9.1|PPR66_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g31790 gi|12321298|gb|AAG50719.1|AC079041_12
            hypothetical protein [Arabidopsis thaliana]
            gi|111074348|gb|ABH04547.1| At1g31790 [Arabidopsis
            thaliana] gi|332193272|gb|AEE31393.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 409

 Score =  283 bits (723), Expect = 1e-73
 Identities = 155/400 (38%), Positives = 243/400 (60%), Gaps = 9/400 (2%)
 Frame = -2

Query: 1361 PTKKSTYNMTKTSTLSTNYFPSNARIQIQQPLHKPQHKSYQS---IKKPNKDTIIKKTTC 1191
            P+   ++N   T+    N   +N  +Q+   L KP+H+  +    I++P        + C
Sbjct: 13   PSLVPSFNYNSTARSVGNDVRTNFDVQLF--LRKPKHQKSEPVVVIQQPQIQPQNPSSRC 70

Query: 1190 S--DILYLMDTLNLPISLELYISLIKECTKNQDPFQAIKLSNHIMESGLKPTLYFLNKML 1017
            S  DIL LMD+L+LP + ++Y  L KE  +  D   A +L  HIM+S ++PT+ F+N++L
Sbjct: 71   STSDILRLMDSLSLPGNEDIYSCLAKESARENDQRGAHELQVHIMKSSIRPTITFINRLL 130

Query: 1016 LMHICCGCYDRAKILFDRMPHKNLNTWAMFIAGCVENYEYNDVINMFIRLLRESKFRDRC 837
            LMH+ CG  D  + +FDRMPH++ ++WA+   GC+E  +Y D   +F+ +L+ S+     
Sbjct: 131  LMHVSCGRLDITRQMFDRMPHRDFHSWAIVFLGCIEMGDYEDAAFLFVSMLKHSQ----- 185

Query: 836  DGSLVV-SGVVICVLKACLAVGDLELGKQIHGWIFKMGYW--RNMSLTSFLISFYRKFGW 666
             G+  + S ++ CVLKAC  + D ELGKQ+H    K+G+    +  L+  LI FY +F  
Sbjct: 186  KGAFKIPSWILGCVLKACAMIRDFELGKQVHALCHKLGFIDEEDSYLSGSLIRFYGEFRC 245

Query: 665  LEGRENVFDHIPLRNTSIWNARMVTCGSE-EWSEGVRLYKQMGREGVKRSKYTFSSVLKA 489
            LE    V   +   NT  W A++     E E+ E +R + +MG  G+K++   FS+VLKA
Sbjct: 246  LEDANLVLHQLSNANTVAWAAKVTNDYREGEFQEVIRDFIEMGNHGIKKNVSVFSNVLKA 305

Query: 488  CSKVSDGGSSGRQVHGNALKVGLDEDNYVRCGLISMYGKSGLLNEARMVHQTSREKRNDA 309
            CS VSDGG SG+QVH NA+K+G + D  +RC LI MYGK G + +A  V ++S+++ + +
Sbjct: 306  CSWVSDGGRSGQQVHANAIKLGFESDCLIRCRLIEMYGKYGKVKDAEKVFKSSKDETSVS 365

Query: 308  CWNALLTGYLQNGCCVEAIKLLYDMKAAGMQPPESLLKQS 189
            CWNA++  Y+QNG  +EAIKLLY MKA G++  ++LL ++
Sbjct: 366  CWNAMVASYMQNGIYIEAIKLLYQMKATGIKAHDTLLNEA 405

