BLASTX nr result

ID: Angelica22_contig00006997

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00006997
         (2253 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containi...   711   0.0  
ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|2...   690   0.0  
ref|XP_002521193.1| conserved hypothetical protein [Ricinus comm...   682   0.0  
ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containi...   671   0.0  
ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containi...   669   0.0  

>ref|XP_002278434.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic [Vitis vinifera]
          Length = 511

 Score =  711 bits (1836), Expect = 0.0
 Identities = 364/514 (70%), Positives = 421/514 (81%), Gaps = 8/514 (1%)
 Frame = +3

Query: 333  GFLAGIDHNSKLGFT--NSFYFPQYKFVGARFCTKLKTPNSRSWF------SGTLCSLKT 488
            GF + +   ++LGFT  +SF   + + +  +F        SRS+       + T+C+ + 
Sbjct: 6    GFASSLMSPTELGFTLSSSFSIQRPRLIVPKF--------SRSFLGEYCSRATTICNHQN 57

Query: 489  HNSVVLNRGKVREFGFLKSVELDKFITSDDEDEMSEGFFEAIEELERMAREPSDVLEEMN 668
               VV  R K+REF   KSVELD+F+TSDDEDEMSEGFFEAIEELERM REPSDVLEEMN
Sbjct: 58   PRFVVPKRDKIREFRLFKSVELDQFLTSDDEDEMSEGFFEAIEELERMTREPSDVLEEMN 117

Query: 669  DRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMCSWVKKLIEE 848
            DRLS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMVS+MCSWVKKLIE 
Sbjct: 118  DRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLIEG 177

Query: 849  KNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRERDGAVLFVKEILRRGISYSDDD 1028
            +++                KP FSMIEKVISLY+EM E++ AVLFVKE+LRR I+YS+DD
Sbjct: 178  EHDVGDVVDLLVDMDCVGLKPGFSMIEKVISLYWEMEEKEKAVLFVKEVLRREIAYSEDD 237

Query: 1029 GQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAVVKELNELAK 1208
            G GH GGPTGYLAWKMM EGNY+ A+KLVI  +ES LKPEVYSYLIAMTAVVKELNE AK
Sbjct: 238  GDGHKGGPTGYLAWKMMAEGNYRGAVKLVIHLRESGLKPEVYSYLIAMTAVVKELNEFAK 297

Query: 1209 ALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSLQGVVLERLL 1388
            ALRKLK F K+GLI+ELD +NV+LIE+YQSDL+ADGVRLS WVIQ+G   L GVV ERLL
Sbjct: 298  ALRKLKGFTKSGLIAELDAENVELIEKYQSDLLADGVRLSSWVIQEGRSPLHGVVYERLL 357

Query: 1389 AMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLLTRVEATSSL 1568
            AMYICAG+GLEAERQLWEMKLVGKEAD  LYD+VLAICAS+ E++AI+RLLT +E TSS+
Sbjct: 358  AMYICAGRGLEAERQLWEMKLVGKEADRELYDIVLAICASKKEASAISRLLTGMEVTSSI 417

Query: 1569 RKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKRIQQSGNVET 1748
            R+KKTLSWLLRGYIKG H+ DA+ET+IKML+LGL P++LDRAAVLQGLR RIQQ+GNVET
Sbjct: 418  RRKKTLSWLLRGYIKGSHFDDASETIIKMLDLGLCPEYLDRAAVLQGLRNRIQQTGNVET 477

Query: 1749 YLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            YLKLCKHLSDA+LIGPCLVY+Y K+YKLWI K I
Sbjct: 478  YLKLCKHLSDANLIGPCLVYLYIKKYKLWILKTI 511


>ref|XP_002313976.1| predicted protein [Populus trichocarpa] gi|222850384|gb|EEE87931.1|
            predicted protein [Populus trichocarpa]
          Length = 500

 Score =  690 bits (1780), Expect = 0.0
 Identities = 336/456 (73%), Positives = 395/456 (86%)
 Frame = +3

Query: 483  KTHNSVVLNRGKVREFGFLKSVELDKFITSDDEDEMSEGFFEAIEELERMAREPSDVLEE 662
            K  N VV    KVREF   KSVELD+++TSDDE+EM EGFFEAIEELERM REPSD+LEE
Sbjct: 45   KRPNFVVAKTTKVREFRLFKSVELDQYVTSDDEEEMGEGFFEAIEELERMTREPSDILEE 104

Query: 663  MNDRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMCSWVKKLI 842
            MNDRLS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRVDKETMELMVS+MCSWVKKLI
Sbjct: 105  MNDRLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRVDKETMELMVSIMCSWVKKLI 164

Query: 843  EEKNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRERDGAVLFVKEILRRGISYSD 1022
            E + +                KPSFSMIEKVISLY++M +++GAV FVKE+LRRGI+YS 
Sbjct: 165  EGEQDVGDVVDLLVDMDCVGLKPSFSMIEKVISLYWDMGKKEGAVSFVKEVLRRGIAYSG 224

Query: 1023 DDGQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAVVKELNEL 1202
            DDG+G  GGPTGYL WKMM +GNY++A+KLVI  +ES LKPE+Y+YLIAMTAVVKELNE 
Sbjct: 225  DDGEGQKGGPTGYLTWKMMVDGNYRNAVKLVIHLRESGLKPEIYAYLIAMTAVVKELNEF 284

Query: 1203 AKALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSLQGVVLER 1382
            +KALRKLK + ++G+++ELD +NV+L+E+YQSDL+ADGV LS WVIQ+G P+L GVV ER
Sbjct: 285  SKALRKLKGYSRSGMVTELDAENVELVEKYQSDLLADGVCLSSWVIQEGSPALYGVVHER 344

Query: 1383 LLAMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLLTRVEATS 1562
            LLAMYICAG+GL+AERQLWEMKLVGKEADG+LYD+VLAICASQ E++A+ARLLTR+E  S
Sbjct: 345  LLAMYICAGRGLDAERQLWEMKLVGKEADGDLYDIVLAICASQKEASAVARLLTRIEVAS 404

Query: 1563 SLRKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKRIQQSGNV 1742
            S+RKKK+LSWLLRGYIKGGHY +AAET+IKML+LGLSPD+LDR AV+QGLRKRIQQ GNV
Sbjct: 405  SMRKKKSLSWLLRGYIKGGHYGEAAETLIKMLDLGLSPDYLDRVAVMQGLRKRIQQWGNV 464

Query: 1743 ETYLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            E+YLKLCK LSD +LIGP LVY+Y K+YKLWI K++
Sbjct: 465  ESYLKLCKRLSDVNLIGPSLVYLYIKKYKLWIMKLL 500


>ref|XP_002521193.1| conserved hypothetical protein [Ricinus communis]
            gi|223539607|gb|EEF41193.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 499

 Score =  682 bits (1761), Expect = 0.0
 Identities = 344/476 (72%), Positives = 400/476 (84%), Gaps = 2/476 (0%)
 Frame = +3

Query: 429  KLKTPNSRSWFSGTLCSLKTHNSVVLNRGKVR--EFGFLKSVELDKFITSDDEDEMSEGF 602
            + K  N R +   ++   K+ N VV  + K R  EF  LKSVELD++I SDDE+EMSEGF
Sbjct: 24   RYKLLNPRFFQLSSIKFPKSSNFVVAQQSKSRNREFRVLKSVELDQYIASDDEEEMSEGF 83

Query: 603  FEAIEELERMAREPSDVLEEMNDRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRV 782
            FEAIEELERM REPSDVLEEMND+LS RELQLVLVYFSQEGRDSWCALEVF+WLRKENRV
Sbjct: 84   FEAIEELERMTREPSDVLEEMNDKLSARELQLVLVYFSQEGRDSWCALEVFEWLRKENRV 143

Query: 783  DKETMELMVSLMCSWVKKLIEEKNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRE 962
            DKETMELMVS+MCSW+KKLIE ++E                KPSFSMIEKVISLY+E+ E
Sbjct: 144  DKETMELMVSIMCSWIKKLIEGEHEIGDVVDLLVDMDCVGLKPSFSMIEKVISLYWEIGE 203

Query: 963  RDGAVLFVKEILRRGISYSDDDGQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLK 1142
            ++ +V FVKE+LRR ++Y +DDG+G  GGPTGYLAWKMM +GNY+DA+KLVI F+ES LK
Sbjct: 204  KEKSVSFVKEVLRREVAYFEDDGEGQKGGPTGYLAWKMMVDGNYRDAVKLVIHFRESGLK 263

Query: 1143 PEVYSYLIAMTAVVKELNELAKALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVR 1322
            PEVYSYLIAMTAVVKELNE AKALRKLK F K+GLI+ELD +N +LIE+YQSDLIADGV 
Sbjct: 264  PEVYSYLIAMTAVVKELNEFAKALRKLKGFAKSGLIAELDAENTRLIEKYQSDLIADGVC 323

Query: 1323 LSDWVIQQGGPSLQGVVLERLLAMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAIC 1502
            LS WVIQ+G PSL GVV ERLLAMYICAG+GL+AERQLWEMKLVGK ADG+LYD+VLAIC
Sbjct: 324  LSSWVIQEGSPSLYGVVHERLLAMYICAGRGLDAERQLWEMKLVGKHADGDLYDIVLAIC 383

Query: 1503 ASQNESNAIARLLTRVEATSSLRKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDF 1682
            ASQ E++A++RLLTRVE TSSL+KKKTLSWLLRGY+KGG Y +AAE ++KML++GL PD+
Sbjct: 384  ASQKEASAVSRLLTRVEVTSSLQKKKTLSWLLRGYLKGGQYDEAAEALVKMLDMGLCPDY 443

Query: 1683 LDRAAVLQGLRKRIQQSGNVETYLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            LDR AVLQGLRKRIQQ GNVE+YL LCK LSD +LIGP LVY+Y K+YKLWI KM+
Sbjct: 444  LDRVAVLQGLRKRIQQWGNVESYLNLCKRLSDENLIGPSLVYLYIKKYKLWIMKML 499


>ref|XP_003538312.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 510

 Score =  671 bits (1731), Expect = 0.0
 Identities = 343/511 (67%), Positives = 408/511 (79%), Gaps = 4/511 (0%)
 Frame = +3

Query: 330  MGFLAGIDHNSKLGFTNSFYFPQYKFVGARFCTKLKTPNSRSWFSGTLC--SLKTHNSVV 503
            M +  G     KLGF  S   P  K     F        S  ++ G L   S K  N   
Sbjct: 1    MAYAHGFAPIFKLGFVFSSVSPSQKRHPLVFPAS-HCGYSLKFYDGVLSARSCKFKNPSF 59

Query: 504  LNRGKVREFGFLKSVELDKFITSDDE-DEMSEGFFEAIEELERMAREPSDVLEEMNDRLS 680
            + +G +R F  LKSVELD+++TSDDE DEMS+GFFEAIEELERM REPSDVLEEMNDRLS
Sbjct: 60   VKQGSIRGFRALKSVELDQYVTSDDEEDEMSDGFFEAIEELERMTREPSDVLEEMNDRLS 119

Query: 681  VRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMCSWVKKLIEEKNEX 860
             RELQLVLVYFSQ+GRDSWCALEVFDWLRKENRVDKETMELMV++MC WVKKLI+E +  
Sbjct: 120  ARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMCGWVKKLIQEHHGV 179

Query: 861  XXXXXXXXXXXXXXX-KPSFSMIEKVISLYFEMRERDGAVLFVKEILRRGISYSDDDGQG 1037
                            +P FSMIEKVISLY+EM E++GAVLFV+E+LRRGI Y ++D +G
Sbjct: 180  VGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLRRGIPYLEEDEEG 239

Query: 1038 HNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAVVKELNELAKALR 1217
            H GGPTGYLAWKMM EG+Y  A++LVI F ES LKPEVYSYL+AMTAVVKELNELAKALR
Sbjct: 240  HKGGPTGYLAWKMMAEGDYTSAVRLVIHFTESGLKPEVYSYLVAMTAVVKELNELAKALR 299

Query: 1218 KLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSLQGVVLERLLAMY 1397
            KLKSF + GL++ELD ++V+L E+YQSDL+ DGVRLS+W IQ G PSL G++ ERLLAMY
Sbjct: 300  KLKSFARTGLVAELDLEDVELTEKYQSDLLGDGVRLSNWAIQDGSPSLHGIIHERLLAMY 359

Query: 1398 ICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLLTRVEATSSLRKK 1577
            ICAG G+EAE+QLWEMKLVGKEADG+LYD+VLAICASQ ESNA ARLLTR+E  SS +KK
Sbjct: 360  ICAGHGIEAEKQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLLTRLEVASSPQKK 419

Query: 1578 KTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKRIQQSGNVETYLK 1757
            K+LSWLLRGYIKGGH+++AAET++KML+LG  P++LDRAAVLQGLRKRIQQ GN++TY++
Sbjct: 420  KSLSWLLRGYIKGGHFNEAAETIMKMLDLGFYPEYLDRAAVLQGLRKRIQQYGNLDTYVR 479

Query: 1758 LCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            LCK LSDA+LIGPCLV++Y ++YKLW+ KM+
Sbjct: 480  LCKSLSDANLIGPCLVHLYIRKYKLWVVKML 510


>ref|XP_003551233.1| PREDICTED: pentatricopeptide repeat-containing protein At2g30100,
            chloroplastic-like [Glycine max]
          Length = 508

 Score =  669 bits (1727), Expect = 0.0
 Identities = 327/463 (70%), Positives = 393/463 (84%)
 Frame = +3

Query: 462  SGTLCSLKTHNSVVLNRGKVREFGFLKSVELDKFITSDDEDEMSEGFFEAIEELERMARE 641
            S   C  K  + V    G +R F  LKSVE+D+++TS+DE  MS+GFFEAIEELERM RE
Sbjct: 48   SARSCKFKNPSFVSAKHGSLRGFRALKSVEMDQYVTSNDE--MSDGFFEAIEELERMTRE 105

Query: 642  PSDVLEEMNDRLSVRELQLVLVYFSQEGRDSWCALEVFDWLRKENRVDKETMELMVSLMC 821
            PSDVLEEMNDRLS RELQLVLVYFSQ+GRDSWCALEVFDWLRKENRVDKETMELMV++MC
Sbjct: 106  PSDVLEEMNDRLSARELQLVLVYFSQDGRDSWCALEVFDWLRKENRVDKETMELMVAIMC 165

Query: 822  SWVKKLIEEKNEXXXXXXXXXXXXXXXXKPSFSMIEKVISLYFEMRERDGAVLFVKEILR 1001
             WVKKLI++++                 +P FSMIEKVISLY+EM E++GAVLFV+E+LR
Sbjct: 166  GWVKKLIQQQHGVGDVVDLLVDMDCVGLRPGFSMIEKVISLYWEMGEKEGAVLFVEEVLR 225

Query: 1002 RGISYSDDDGQGHNGGPTGYLAWKMMEEGNYKDAIKLVIDFKESDLKPEVYSYLIAMTAV 1181
            RGI Y ++D +GH GGPTGYLAWKMM EG+Y++A++LVI F+ES LKPE+YSYL+AMTAV
Sbjct: 226  RGIPYVEEDEEGHKGGPTGYLAWKMMAEGDYRNAVRLVIRFRESGLKPEIYSYLVAMTAV 285

Query: 1182 VKELNELAKALRKLKSFRKAGLISELDGKNVQLIEEYQSDLIADGVRLSDWVIQQGGPSL 1361
            VKELNE AKALRKLK F +AGL++ELD ++V+L E+YQSD +ADGVRLS+WVIQ G PSL
Sbjct: 286  VKELNEFAKALRKLKGFTRAGLVAELDLEDVELTEKYQSDTLADGVRLSNWVIQDGSPSL 345

Query: 1362 QGVVLERLLAMYICAGQGLEAERQLWEMKLVGKEADGNLYDMVLAICASQNESNAIARLL 1541
             G+V ERLLAMYICAG G+EAERQLWEMKLVGKEADG+LYD+VLAICASQ ESNA ARLL
Sbjct: 346  HGIVHERLLAMYICAGHGIEAERQLWEMKLVGKEADGDLYDIVLAICASQKESNATARLL 405

Query: 1542 TRVEATSSLRKKKTLSWLLRGYIKGGHYSDAAETVIKMLNLGLSPDFLDRAAVLQGLRKR 1721
            TR+E  SS +KKK+LSWLLRGYIKGGH+++AAET++KML LG  P++LDRAAVLQGLRKR
Sbjct: 406  TRLEVVSSPQKKKSLSWLLRGYIKGGHFNEAAETIMKMLELGFYPEYLDRAAVLQGLRKR 465

Query: 1722 IQQSGNVETYLKLCKHLSDASLIGPCLVYMYFKRYKLWITKMI 1850
            IQQ GN++TY++LCK LSDA+LIGPCLV++Y ++YKLW+ KM+
Sbjct: 466  IQQYGNLDTYVRLCKSLSDANLIGPCLVHLYIRKYKLWVVKML 508

