BLASTX nr result

ID: Zingiber23_contig00021102 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber23_contig00021102
         (1752 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

tpg|DAA37393.1| TPA: hypothetical protein ZEAMMB73_558984 [Zea m...    78   1e-11
gb|EEC77440.1| hypothetical protein OsI_16242 [Oryza sativa Indi...    77   3e-11
ref|NP_001053035.1| Os04g0467100 [Oryza sativa Japonica Group] g...    77   3e-11
ref|XP_002454070.1| hypothetical protein SORBIDRAFT_04g024170 [S...    71   1e-09
ref|XP_006652391.1| PREDICTED: dentin sialophosphoprotein-like [...    70   3e-09
ref|XP_002447990.1| hypothetical protein SORBIDRAFT_06g019400 [S...    70   4e-09
ref|NP_001146611.1| hypothetical protein [Zea mays] gi|219888025...    69   6e-09
ref|XP_004975900.1| PREDICTED: uncharacterized protein LOC101761...    68   1e-08
ref|XP_004975899.1| PREDICTED: uncharacterized protein LOC101761...    68   1e-08
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...    68   1e-08
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...    67   2e-08
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...    66   4e-08
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...    64   3e-07
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...    62   8e-07
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...    62   8e-07
ref|XP_004952913.1| PREDICTED: uncharacterized protein DDB_G0271...    61   1e-06
gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thali...    61   1e-06

>tpg|DAA37393.1| TPA: hypothetical protein ZEAMMB73_558984 [Zea mays]
          Length = 564

 Score = 78.2 bits (191), Expect = 1e-11
 Identities = 70/233 (30%), Positives = 92/233 (39%), Gaps = 11/233 (4%)
 Frame = +3

Query: 1059 VSKAEAAVAMDKGEIRDGQEPLQVPNLTFNSTSTNHSKTEGNLPDVPVNRPVSEAEAMAV 1238
            V ++ + + +   E RD Q      +LT N   T+ +   G      ++   S  E   V
Sbjct: 365  VEESSSTLDVQASEQRDDQRE----SLTNNDVKTDAAHETGTA----ISLSTSNGEHSGV 416

Query: 1239 RAEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEPTAAVKEKKANT 1418
            + EK                C     D R+T+   P D        VEP   V E    T
Sbjct: 417  KGEK----------------CQKHETDGRDTDDFNPRD--------VEPGTKVSED--TT 450

Query: 1419 ICQESLHVRKFSV-----PQLAVLDATADPPRSSFSHSYGADLIISGPRTSSEHIPYXXX 1583
              + S HV+  SV     P  A +       R+ F  S+    I SGP T S HIPY   
Sbjct: 451  DVKSSAHVQTESVVQQNGPDSAKVTTAQTVIRNPFESSFSGPSITSGPLTPSGHIPYSGN 510

Query: 1584 XXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARKHR------WRFGLSCCK 1724
                          FAFPVLQ EWN+SPVKMAKA + R      W + + CCK
Sbjct: 511  ISLRSESSTTSTRSFAFPVLQNEWNSSPVKMAKADRRRQKEDRGWGYRILCCK 563


>gb|EEC77440.1| hypothetical protein OsI_16242 [Oryza sativa Indica Group]
          Length = 663

 Score = 76.6 bits (187), Expect = 3e-11
 Identities = 82/299 (27%), Positives = 120/299 (40%), Gaps = 22/299 (7%)
 Frame = +3

Query: 894  DPKSDENT-GVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAGVELIVSKA 1070
            D  SDE    VD+   V+ S+ L +      +    ST +D +  E       E   S++
Sbjct: 382  DILSDERKIPVDQRSPVENSDSLSDPV----DRALSSTETDGARNEDSRLDSTEASPSRS 437

Query: 1071 EAAVAMDKGEIRDGQEPLQVPNLTFN-STSTNHSKTEGNLPDVPVNRPVS-------EAE 1226
                + D+ +        QV N  +   T   H  + G  P      P+        + E
Sbjct: 438  YVQPSEDRND--------QVDNFVYGIRTDAAHGTSSGTSPLTGKTEPIDAKSENDPKCE 489

Query: 1227 AMAVR-------AEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEP 1385
              +V+        E  D T+  E  K    +  ST      TE N P D  ++ + + EP
Sbjct: 490  IDSVQDGHDFNPREANDGTNISEDNK---DSKSSTRQTGPVTEQNEP-DSAKMTM-QTEP 544

Query: 1386 TAAVKEKKANTICQESLHVRKFSVPQLAVLDATADPPRSSFSHSYGADLIISGPRTSSEH 1565
             A   E  +  +  ++  V + +    A + A  +  R+ F  S+    IISGP T S H
Sbjct: 545  VAQRNEPDSAKVTMQTESVAQPNEADSAKVTAR-NVIRNPFESSFSGPSIISGPLTPSGH 603

Query: 1566 IPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARKHR------WRFGLSCCK 1724
            IPY                 FAFPVLQ EWN+SPVKMAKA + R      W + + CCK
Sbjct: 604  IPYSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVKMAKADRRRLRRDRGWGYRILCCK 662


>ref|NP_001053035.1| Os04g0467100 [Oryza sativa Japonica Group]
            gi|113564606|dbj|BAF14949.1| Os04g0467100 [Oryza sativa
            Japonica Group] gi|116309726|emb|CAH66771.1|
            OSIGBa0115M15.9 [Oryza sativa Indica Group]
            gi|215701416|dbj|BAG92840.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 663

 Score = 76.6 bits (187), Expect = 3e-11
 Identities = 82/299 (27%), Positives = 120/299 (40%), Gaps = 22/299 (7%)
 Frame = +3

Query: 894  DPKSDENT-GVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAGVELIVSKA 1070
            D  SDE    VD+   V+ S+ L +      +    ST +D +  E       E   S++
Sbjct: 382  DILSDERKIPVDQRSPVENSDSLSDPV----DRALSSTETDGARNEDSRLDSTEASPSRS 437

Query: 1071 EAAVAMDKGEIRDGQEPLQVPNLTFN-STSTNHSKTEGNLPDVPVNRPVS-------EAE 1226
                + D+ +        QV N  +   T   H  + G  P      P+        + E
Sbjct: 438  YVQPSEDRND--------QVDNFVYGIRTDAAHGTSSGTSPLTGKTEPIDAKSENDPKCE 489

Query: 1227 AMAVR-------AEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEP 1385
              +V+        E  D T+  E  K    +  ST      TE N P D  ++ + + EP
Sbjct: 490  IDSVQDGHDFNPREANDGTNISEDNK---DSKSSTRQTGPVTEQNEP-DSAKMTM-QTEP 544

Query: 1386 TAAVKEKKANTICQESLHVRKFSVPQLAVLDATADPPRSSFSHSYGADLIISGPRTSSEH 1565
             A   E  +  +  ++  V + +    A + A  +  R+ F  S+    IISGP T S H
Sbjct: 545  VAQRNEPDSAKVTMQTESVAQPNEADSAKVTAR-NVIRNPFESSFSGPSIISGPLTPSGH 603

Query: 1566 IPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARKHR------WRFGLSCCK 1724
            IPY                 FAFPVLQ EWN+SPVKMAKA + R      W + + CCK
Sbjct: 604  IPYSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVKMAKADRRRLRRDRGWGYRILCCK 662


>ref|XP_002454070.1| hypothetical protein SORBIDRAFT_04g024170 [Sorghum bicolor]
            gi|241933901|gb|EES07046.1| hypothetical protein
            SORBIDRAFT_04g024170 [Sorghum bicolor]
          Length = 490

 Score = 71.2 bits (173), Expect = 1e-09
 Identities = 99/370 (26%), Positives = 152/370 (41%), Gaps = 36/370 (9%)
 Frame = +3

Query: 723  ITLEELLLKEGSDKDHCQADAISLDFSRNHQHCIDHEKVEKTKT---EEACSMTTTSSIV 893
            I+L+ELLL E +++               H   I+ E  EK K    EEA   T+     
Sbjct: 149  ISLQELLLLESAEESR-------------HSSTINSESSEKHKCPLHEEAIGQTSKDG-- 193

Query: 894  DPKSDENTGVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAGVELIVSKAE 1073
            DP   +    +    V T  L +E  S  P  T      DH    ++     + I     
Sbjct: 194  DPNV-QTILANTSEYVITGILSKENASGCPATTMPG---DHVAATALDVREPQKI---DR 246

Query: 1074 AAVAMDKGEIRDGQEP-LQVPNLT----FNSTSTNHSKTEGN--LPDVPVNRP----VSE 1220
                +D   + D   P   +P +T     +ST + H++T+G   L +V  + P    +S 
Sbjct: 247  YNPFVDHRSLEDTSVPECSIPGITDAASTDSTCSIHNETDGTAGLDEVETSEPGVDTLSI 306

Query: 1221 AEAMAVRAEKGDNTDFQESL--KVPSPTCDSTL---------DDHRET-----EANVPVD 1352
            + +    +EK  N D  ES+  K  +   D T           +H +      E +  +D
Sbjct: 307  SSSDIQSSEK--NNDHSESIFSKAITGAVDETAVATSSTPNSAEHSDAYGKNQEKHDEID 364

Query: 1353 EVELIVPEVEPTAAVKEKKANTICQESLHVRKFSVPQLAVLDATADPPRSSFSHSYGADL 1532
            E   I  +    AA K   + T+ Q+   V + ++P  +   A      +++  ++    
Sbjct: 365  EEHSISTD---DAASKSSTSTTLAQDGSAVEQ-TMPDSSTSTARVGN-ENTYEPNFSGPS 419

Query: 1533 IISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKA-----RKHR 1697
            I+SGP + S HI Y                 FAFPVLQREW +SPV+MAKA     R+HR
Sbjct: 420  IMSGPVSRSGHIAYSGSISLRSDSSTTSTRSFAFPVLQREWISSPVRMAKAERRRSRRHR 479

Query: 1698 -WRFGLSCCK 1724
             WR G+ CCK
Sbjct: 480  VWRKGIICCK 489


>ref|XP_006652391.1| PREDICTED: dentin sialophosphoprotein-like [Oryza brachyantha]
          Length = 624

 Score = 70.1 bits (170), Expect = 3e-09
 Identities = 80/300 (26%), Positives = 114/300 (38%), Gaps = 14/300 (4%)
 Frame = +3

Query: 867  SMTTTSSIVDPKSDENTGVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAG 1046
            S+   SS +D ++ E     +   V+ +E L    S   +    ST +D +  E      
Sbjct: 356  SVEENSSSMDVEASEENNDQRESPVENTESL----SNPVDRALSSTETDEARNEDSRLDC 411

Query: 1047 VELIVSKAEAAVAMDKGEIRDGQEPLQVPNLTFNSTS-TNHSKTE------GNLPDVPVN 1205
             E   S+++   +    +  D        N T  + S T+H  T+       N P   V+
Sbjct: 412  TEASSSRSDVQPSGHSNDQVDNLVDGIRTNATHGTGSVTSHGNTDPGDAKSDNHPKCKVD 471

Query: 1206 RPVSEAEAMAVRAEKGDNTDFQESLK-VPSPTCDSTLDDHRETEANVPVDEVELIVPEVE 1382
              V +      R E+GD  D  E  K   S T   ++    E ++     + E I P+ E
Sbjct: 472  N-VQDVHDFNPR-EEGDVIDISEDSKDSKSSTQTQSVAQQNEPDSAKVTMQTESIAPQNE 529

Query: 1383 PTAAVKEKKANTICQESLHVRKFSVPQLAVLDATADPPRSSFSHSYGADLIISGPRTSSE 1562
              +A K    N I                         R+ F  S+    I SGP T S 
Sbjct: 530  HESA-KVTARNVI-------------------------RNPFESSFSGPSITSGPLTPSG 563

Query: 1563 HIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARKHR------WRFGLSCCK 1724
            HIPY                 FAFPVLQ EWN+SPVKMAKA + R      W + + CCK
Sbjct: 564  HIPYSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVKMAKADRRRLRRDRGWGYRILCCK 623


>ref|XP_002447990.1| hypothetical protein SORBIDRAFT_06g019400 [Sorghum bicolor]
            gi|241939173|gb|EES12318.1| hypothetical protein
            SORBIDRAFT_06g019400 [Sorghum bicolor]
          Length = 562

 Score = 69.7 bits (169), Expect = 4e-09
 Identities = 110/514 (21%), Positives = 186/514 (36%), Gaps = 68/514 (13%)
 Frame = +3

Query: 387  DKAVAYLEQPDTKVLSNSEVCPVIRNSHFDEGLVSRNTVLFEHNEVKDISAIN------K 548
            DK V  ++ PDT VLS+      +++   DEG++       E    + +S IN       
Sbjct: 75   DKDVVEIKLPDT-VLSSDYGVHFVKDVCIDEGVLPDQKTSSEKQVDQKVS-INFDSSKYT 132

Query: 549  NSDLNGQVIVGPLNSIQDLKQQTAI-------EKSTEDQNSPKLIVLDVKTSTLDLVKNH 707
            N DL  ++  G   +  +LK +  I       + +T +QNS        K   L+     
Sbjct: 133  NGDLTEEISAGSTKTAHELKSEIVILPVMCDTDGNTGEQNS------SCKKHDLEDNNTA 186

Query: 708  KFSPNITLEELLLKE---GSDKDHCQ-ADAISLDFSRNHQHCIDHEKVEKTKTEEACSM- 872
              S N   EEL  K+         CQ    +  D + N    +  E   +  + +     
Sbjct: 187  DVSTNSNDEELNPKQLPCHEVAQDCQDVGGVICDSNENQDRLLTGEATHQVSSNDCYETG 246

Query: 873  ----TTTSSIVDPKSDENTGVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPH 1040
                + TS+I+      N    +  + D SE++ E  +    +  E +   +     I +
Sbjct: 247  IGIASETSNII-----HNDLPVESTAADFSEVIPEEVAVSAGLDMEGSNQVNHYNPFIAY 301

Query: 1041 AGVE----------LIVSKAEAA--VAMDKGE--------IRDGQEPLQVPNLTFNSTST 1160
              ++           IV  A  A    ++K +          +G +P+++          
Sbjct: 302  GSLDGTWESNYSLPTIVDAASIAPICPVEKTDSFSDLVNRALEGFDPIEIDEAIIEENRL 361

Query: 1161 NHSKTEGNLPDVPVNRPVSE------------------AEAMAVRAEKGDNTDFQESLKV 1286
            +  +   N  DV  +   ++                  + A+++    G+++D +     
Sbjct: 362  DSVEENSNTLDVQASEQCNDQVESLTNNDVKTDVAHEMSTAISLSTSNGEHSDVKSEQGQ 421

Query: 1287 PSPTCDSTLDDH--RETEANVPVDEVELIVPEVEPTAAVKEKKANTICQESLHVRKFSVP 1460
                    ++D   R+ E    V E             + + K++T  Q    V++    
Sbjct: 422  KHEIDGQDINDFNPRDAELGTKVSE------------DITDNKSSTPVQTESVVQQNGPD 469

Query: 1461 QLAVLDATADPPRSSFSHSYGADLIISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPV 1640
               V   T    R+ F  S+    I SGP T S HIPY                 FAFPV
Sbjct: 470  SAKVTAQTVI--RNPFESSFSGPSITSGPLTPSGHIPYSGNISLRSESSTTSTRSFAFPV 527

Query: 1641 LQREWNTSPVKMAKARKHR------WRFGLSCCK 1724
            LQ EWN+SPVKMAKA + R      W + + CCK
Sbjct: 528  LQNEWNSSPVKMAKADRRRLKEDRGWGYRILCCK 561


>ref|NP_001146611.1| hypothetical protein [Zea mays] gi|219888025|gb|ACL54387.1| unknown
            [Zea mays] gi|413937495|gb|AFW72046.1| hypothetical
            protein ZEAMMB73_832738 [Zea mays]
            gi|413937496|gb|AFW72047.1| hypothetical protein
            ZEAMMB73_832738 [Zea mays]
          Length = 480

 Score = 68.9 bits (167), Expect = 6e-09
 Identities = 93/379 (24%), Positives = 142/379 (37%), Gaps = 45/379 (11%)
 Frame = +3

Query: 723  ITLEELLLKEGSDKDHCQADAISLDFSRNHQHCIDHEKVEKTKTE-----EACSMTTTSS 887
            I+L+ELLL E +++      A++ + S  H+  ++ E+  +T  +     E     T   
Sbjct: 149  ISLQELLLLESAEESR-NTGAVNSESSEKHKCPLNEEEAAQTSKDGDPDAETVLANTFEH 207

Query: 888  IVDPKSDENTG-------------------------VDKY-PSVDTSELLREGCSTFPEM 989
            I D  S +  G                         +D+Y PSVD     RE  S  PE 
Sbjct: 208  ISDGISSKEKGSGCPATMARDQVDTATALDVREPQKLDRYNPSVDRRS--REYTSV-PEC 264

Query: 990  TFESTRSDHSGTESIPHAGVELIVSKAEAAVAMDKGE-IRDGQEPLQVPNLTFNSTSTNH 1166
            +        S T SI         + + A   + +GE +  G + L   +    S+  +H
Sbjct: 265  SIPGITDAASSTGSICSFHN---ATSSTATAGLGEGETLEPGADTLSAGSSDIGSSEKSH 321

Query: 1167 SKTEGNLPDVPVNRPVSEAEAMAVRAEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVP 1346
                G++     ++P++ A     + EK    D +  +         T DD         
Sbjct: 322  DNHSGSM----FSKPIAGAYGNGKKQEKHGQMDEEHGI--------GTADD--------- 360

Query: 1347 VDEVELIVPEVEPTAAVKEKKANTICQESLHVRKFSVP--QLAVLDATADPPRSSFSHSY 1520
                          AA     A+T+ Q+       S P  Q  V  +++   R +     
Sbjct: 361  --------------AAASTSSASTLAQDG------SAPAVQETVPGSSSSSSRPAARAGD 400

Query: 1521 GADL----IISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKAR 1688
              DL    I+SGP + S HI Y                 FAFPVLQREW +SPV+MAKA 
Sbjct: 401  DPDLSGPSIMSGPVSMSGHIAYSGNVSLRSDSSTTSTQSFAFPVLQREWTSSPVRMAKAE 460

Query: 1689 KHR-------WRFGLSCCK 1724
            + R       WR G+ CCK
Sbjct: 461  RRRSGGRHRVWRKGIICCK 479


>ref|XP_004975900.1| PREDICTED: uncharacterized protein LOC101761685 isoform X2 [Setaria
            italica]
          Length = 519

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 36/82 (43%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
 Frame = +3

Query: 1497 RSSFSHSYGADLIISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKM 1676
            R+ F  S+    I SGP T S HIPY                 FAFPVLQ EWN+SPVKM
Sbjct: 437  RNPFESSFSGPSITSGPLTPSGHIPYSGNISLRSESSTTSTRSFAFPVLQNEWNSSPVKM 496

Query: 1677 AKARKHR------WRFGLSCCK 1724
            AKA + R      W + + CCK
Sbjct: 497  AKADRRRLREDRGWGYRILCCK 518


>ref|XP_004975899.1| PREDICTED: uncharacterized protein LOC101761685 isoform X1 [Setaria
            italica]
          Length = 560

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 36/82 (43%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
 Frame = +3

Query: 1497 RSSFSHSYGADLIISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKM 1676
            R+ F  S+    I SGP T S HIPY                 FAFPVLQ EWN+SPVKM
Sbjct: 478  RNPFESSFSGPSITSGPLTPSGHIPYSGNISLRSESSTTSTRSFAFPVLQNEWNSSPVKM 537

Query: 1677 AKARKHR------WRFGLSCCK 1724
            AKA + R      W + + CCK
Sbjct: 538  AKADRRRLREDRGWGYRILCCK 559


>ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa]
            gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly
            protein gar2 [Populus trichocarpa]
          Length = 486

 Score = 67.8 bits (164), Expect = 1e-08
 Identities = 108/478 (22%), Positives = 176/478 (36%), Gaps = 25/478 (5%)
 Frame = +3

Query: 366  EDTKLYLDKAVAYLEQPDTKVLSNSEVCPVIRNSHFDEGLVSRNTVLFEHNEVKDISAIN 545
            E++  Y+DK+V   E P+  ++   E    +++   DEG+  ++  LF      D  A  
Sbjct: 69   EESVFYMDKSVMVREVPEL-IVCYKENTYHVKDICVDEGVPLQDKFLF------DTDAHK 121

Query: 546  KNSDLNGQVIVGPLNSIQDLKQQTAIEKSTEDQNSPKLIVLDVKTSTLDLVKNHKFSPNI 725
            KN       +   L S +D+  +   EKS  D   P+++    +   +DL   H   P++
Sbjct: 122  KN-------MCEFLPSERDMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDL---HLPVPDV 171

Query: 726  TLEELLLKEGSDKDHCQADAISLDFSRNH----QHCIDH--EKVEKTKTEEACSMTTTSS 887
             +     ++GS  D      +SLD    H    +  +D+  +KV    ++E  S+    S
Sbjct: 172  LISSE--EKGSKHD------LSLDCDPKHLMPTEEVMDYGTKKVTDNASKEILSLRDLLS 223

Query: 888  IVD---PKSDENTGVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAGVELI 1058
            + +     +  N        V+   LL    +   E    S  S+H G E+I   G+E  
Sbjct: 224  MSELGAKCTPANASYHNMDKVEQQSLLCPRENAILETDSASEESEHCGEETISDNGLE-- 281

Query: 1059 VSKAEAAVAMDKGEIRDGQEPLQVPNLTFNSTSTNHSKTEGNLPDVPVNRPVSEAEAMAV 1238
               A  A+       ++G                +H  TE  L    +     E+++   
Sbjct: 282  --SATLAIPTQDPAYQEG----------------DHGHTEAVLVSPTLTSAAEESDSKET 323

Query: 1239 RAEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEPTAAVKEKKANT 1418
            +        F E          S ++D     +      +        P A+ +E   N 
Sbjct: 324  KLASHALDSFSEG-------STSRIEDELPYNSKTETRSISFDNDSSAPAASARESPQNG 376

Query: 1419 ICQ----------ESLHVRKFSVPQLAVLDATADPPRSSFSHSYGADLIISGPRTSSEHI 1568
              Q          E  +  + S  QL   D       SSFS S      + G  + S  I
Sbjct: 377  ESQRLGTRIVSRFEDPNAERLSGGQLQYADG-----ESSFSSSGP----LFGLTSHSGPI 427

Query: 1569 PYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKA-RKH-----RWRFGLSCCK 1724
             Y                 FAFP+LQ EWN+SP +MAKA R+H     +W  GL CC+
Sbjct: 428  AYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPARMAKADRRHFQKPRKWMQGLLCCR 485


>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score = 67.0 bits (162), Expect = 2e-08
 Identities = 107/421 (25%), Positives = 158/421 (37%), Gaps = 53/421 (12%)
 Frame = +3

Query: 621  IEKSTEDQNSPKLIVLDVKTSTLDLVK------------NHKFSPNITLEEL---LLKEG 755
            I+K+  +   P+L VL  K +T  +VK            N  F  ++  E+L   L+ E 
Sbjct: 114  IDKNVMEPELPEL-VLCYKENTYHVVKDICVDEGVPSQENFLFDTSVDQEKLCPYLIPEK 172

Query: 756  SDKDHCQADAISLDFSRNHQHCIDHEKVEKTKTEEACSMTTTSSIVDPKSDENTGVDKYP 935
              K   Q + + LD S  +        + K      C    + +I + + D    +  Y 
Sbjct: 173  DIKSEIQKERVDLDMSTQY--------LSKNDNSFKCDSKESMAIAEIEDDAMEEIANYT 224

Query: 936  SVDT---SELLREGCSTFPEMTFESTRS-------DHSGTESIPHAGVELIVSKAEAAV- 1082
            S +T    ELL       PE+  E + S       D +   SI      ++++ A A   
Sbjct: 225  SKETFSLGELL-----LMPEVVAELSHSKSLLNSTDEAEQLSIQRPSENIVLATASACEE 279

Query: 1083 -------------AMDKGEIRDGQEPLQVPNLTFNST--STNHSKTEGNLPDVPVNRPVS 1217
                         A+D      G E  ++  LT +S+  +++H   E  L  +  +    
Sbjct: 280  SKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSDSSPKASDHGHDEVILASLAPSYATE 339

Query: 1218 EAEAMAVRAEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEPTAAV 1397
            E E  A  A            K PS T DS  D +                    PTA+ 
Sbjct: 340  EPENGAKAA------------KSPSHTLDSVSDLNSSA-----------------PTASG 370

Query: 1398 KEKKANTICQESLHVRKFSVPQLAVLDATADPPRSS---FSH---SYGADLIISGPRTSS 1559
             E+ +     E L  R  S  +    D +   P S    +SH   S+ A   +SG  + S
Sbjct: 371  GEEGSQVGGSEHLESRNSSRHE----DTSITEPFSGQLQYSHGESSFSAAGPLSGLISYS 426

Query: 1560 EHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKA-----RKHR-WRFGLSCC 1721
              I Y                 FAFP+LQ EWN+SPV+MAKA     RKHR WR GL CC
Sbjct: 427  GPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLLCC 486

Query: 1722 K 1724
            +
Sbjct: 487  R 487


>ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella]
            gi|482559818|gb|EOA24009.1| hypothetical protein
            CARUB_v10017222mg [Capsella rubella]
          Length = 455

 Score = 66.2 bits (160), Expect = 4e-08
 Identities = 104/446 (23%), Positives = 171/446 (38%), Gaps = 15/446 (3%)
 Frame = +3

Query: 432  SNSEVCPVIRNSHF-DEGLVSRNTVLFEHNEVKDISAINKNSDLNGQVIVGPLNSIQDLK 608
            +N+ VC +  ++   DE     +  + E +   D     K +       V   +S + + 
Sbjct: 43   NNNNVCELFYDTRSGDEWDKENDGNILEPHSCGDADEAGKKTRDTSHDFVAKGDSPEKVN 102

Query: 609  QQTAIEKSTEDQNSPKLIVLDVKTSTLDLVKNHKFSPNITLEELLLKEGSDKDHCQADAI 788
                ++K+    + P+++V   K ++  +VK+      + ++E  L         + D++
Sbjct: 103  PVFYMDKNVTACDLPEIVVC-YKENSYHVVKDICVDEGVPVQEKFL-------FGEKDSV 154

Query: 789  SLDFSRNHQHCIDHEKVEKT--KTEEACSMTTTSSIVDPKSD---ENTGVDKYPSVDTSE 953
                + NH   +D  KV+KT  K  E  S+  ++S VD  S+   + T  D   S   + 
Sbjct: 155  KSTTNSNHCGSVDLMKVDKTDVKPSETKSLEDSNSKVDDSSEVCNDKTVQDVEESSREAF 214

Query: 954  LLREGCSTFPEMTFESTRSDHSGTESIPHAGVELIVSKAEAAVAMDKGEIRDGQEPLQVP 1133
               EG S + +     T    S T ++  + + L V   E    + K E+    E     
Sbjct: 215  ADAEGSSNYDQEHLIVT----SPTLALKPSEISLEVESEE----ISKDEVVISSEDFLSE 266

Query: 1134 NLTFNSTSTNHSKTEGNLPDVPVNRPVSEAEAMAVRAEKG--DNTDFQESL-KVPSP-TC 1301
            +LT     +   K + +L +   NRP   +       EK   + T     L KV  P T 
Sbjct: 267  SLTLGDILSREDKQK-SLKNDNGNRPEELSPPQHQEKEKRSLETTGLDTKLEKVEEPKTA 325

Query: 1302 DSTLDDHRETEANVPVDEV-ELIVPEVEPTAAVKEKKANTICQESLHVRKFSVPQLAVLD 1478
            +  L     T    P     +L  PE E     + +  N+   + L   +F     +  +
Sbjct: 326  EENLSSASTTTVQEPNKSCNDLEKPETENHQ--QNRLVNSYEDDKLSSSRFGETSFSAAE 383

Query: 1479 ATADPPRSSFSHSYGADLIISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWN 1658
            + +                ISG  T S  I Y                 FAFP+LQ EWN
Sbjct: 384  SVS----------------ISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWN 427

Query: 1659 TSPVKMAKARKHR----WRFGLSCCK 1724
            +SPV+MAKA K R    WR  L CCK
Sbjct: 428  SSPVRMAKADKRRQKGGWRHTLLCCK 453


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 63.5 bits (153), Expect = 3e-07
 Identities = 80/314 (25%), Positives = 120/314 (38%), Gaps = 15/314 (4%)
 Frame = +3

Query: 825  DHEKVEKTKTEEACSMTTTSSIVDPKSDENTGVDKYPSVDTSELLREGCSTFPEMTFEST 1004
            + + V+ + TE+      T+  V+P S+  +  D    VD SE     C T  ++  ES+
Sbjct: 134  EKDSVKSSSTEDLTKADKTN--VNP-SESKSAEDSNTKVDDSEFCNN-CKTDRDVE-ESS 188

Query: 1005 RSDHSGTESIPHAGVELIVSKAEAAVAMDKGEIRDGQEPLQVPNLTFNSTSTNHSKTEGN 1184
            R D +  E       E ++   EA  +   G      EP +  N     +S   SK    
Sbjct: 189  REDFADAEGSSAYNQEHLIVTEEAKASPSHGLNPSEIEPDENSNDEVAISSETDSKESLT 248

Query: 1185 LPDV----PVNRPVSEAEAMAVRAEKGDNTDFQESLKVPSPTCDSTLDDHRETEANVPVD 1352
            L D+       + ++     +   E+   +  Q+  K    T  +   +  +TE   PV+
Sbjct: 249  LGDILSREDEQKSLNHGNISSDSHEEQSPSQLQDKEKRSLETA-AIETELEKTEEPKPVE 307

Query: 1353 EVELIVPEVEPTAAVKEKKANTICQ-------ESLHVRKFSVPQLAVLDATADPPRSSFS 1511
            E    +P    T     ++ N  C        E+ H +   V      D  +       S
Sbjct: 308  EK---LPSASTTTL---QEPNKTCNDPEKPETENHHQQNSLVENSYEDDKLSSSRFGETS 361

Query: 1512 HSYGADLIISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARK 1691
             S    + ISG  T S  I Y                 FAFP+LQ EWN+SPV+MAKA K
Sbjct: 362  FSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADK 421

Query: 1692 HR----WRFGLSCC 1721
             R    WR  L CC
Sbjct: 422  RRQKGGWRHTLLCC 435


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score = 62.0 bits (149), Expect = 8e-07
 Identities = 105/460 (22%), Positives = 160/460 (34%), Gaps = 6/460 (1%)
 Frame = +3

Query: 363  EEDTKLYLDKAVAYLEQPDTKVLSNSEVCPVIRNSHFDEGLVSRNTVLF-EHNEVKDISA 539
            ++D   Y+DK V   + P+  V        ++++   DEG+  +   LF E + VK  S 
Sbjct: 84   KKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVPVQEKFLFGEKDSVKSSST 143

Query: 540  INKNSDLNGQVIVGPLNSIQDLKQQTAIEKSTEDQNSPKLIVLDVKTSTLDLVKNHKFSP 719
             +             L          +  KS ED  S        K    +   +HK   
Sbjct: 144  ED-------------LMKADKTNVNPSETKSAEDSIS--------KVDDSEFCNDHKTDR 182

Query: 720  NITLEELLLKEGSDKDHCQADAISLDFSRNHQHCIDHEKVEKTKTEEACSMTTTSSIVDP 899
            ++       +E S +D   A+  S ++  N +H I  E+V+ + T        + S ++P
Sbjct: 183  DV-------EESSGEDFADAEGTSSNY--NQEHLIVTEEVKASPTHGL-----SPSEIEP 228

Query: 900  KSDENTGVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAGVELIVSKAEAA 1079
              +    V      D+ E L  G     E   +S   D+  ++S                
Sbjct: 229  DENSKDEVAISQDNDSKECLTLGDILSREDEQKSLNQDNISSDS---------------- 272

Query: 1080 VAMDKGEIRDGQEPLQVPNLTFNSTSTNHSKTEGNLPDVPVNRPVSEAEAMAVRAEKGDN 1259
                     + Q P Q+ +    S  T   +TE    + P      E +  +V       
Sbjct: 273  --------HEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQ---GEEKLSSV-----ST 316

Query: 1260 TDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEPTAAVKEKKANTICQESLH 1439
            T  QE    P+ TC+                       E E        + N + + S  
Sbjct: 317  TTSQE----PNKTCN-----------------------EPEKPETENHHQQNCLVENSYE 349

Query: 1440 VRKFSVPQLAVLDATADPPRSSFSHSYGADLI-ISGPRTSSEHIPYXXXXXXXXXXXXXX 1616
              KFS  +            +SFS    AD + ISG  T S  I Y              
Sbjct: 350  DDKFSSSRFG---------ETSFS---AADSVSISGHITYSGPIAYSGSLSVRSDASTTS 397

Query: 1617 XXXFAFPVLQREWNTSPVKMAKARKHR----WRFGLSCCK 1724
               FAFP+LQ EWN+SPV+MAKA K R    WR  L CC+
Sbjct: 398  GRSFAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLLCCR 437


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score = 62.0 bits (149), Expect = 8e-07
 Identities = 105/460 (22%), Positives = 160/460 (34%), Gaps = 6/460 (1%)
 Frame = +3

Query: 363  EEDTKLYLDKAVAYLEQPDTKVLSNSEVCPVIRNSHFDEGLVSRNTVLF-EHNEVKDISA 539
            ++D   Y+DK V   + P+  V        ++++   DEG+  +   LF E + VK  S 
Sbjct: 84   KKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVPVQEKFLFGEKDSVKSSST 143

Query: 540  INKNSDLNGQVIVGPLNSIQDLKQQTAIEKSTEDQNSPKLIVLDVKTSTLDLVKNHKFSP 719
             +             L          +  KS ED  S        K    +   +HK   
Sbjct: 144  ED-------------LMKADKTNVNPSETKSAEDSIS--------KVDDSEFCNDHKTDR 182

Query: 720  NITLEELLLKEGSDKDHCQADAISLDFSRNHQHCIDHEKVEKTKTEEACSMTTTSSIVDP 899
            ++       +E S +D   A+  S ++  N +H I  E+V+ + T        + S ++P
Sbjct: 183  DV-------EESSGEDFADAEGTSSNY--NQEHLIVTEEVKASPTHGL-----SPSEIEP 228

Query: 900  KSDENTGVDKYPSVDTSELLREGCSTFPEMTFESTRSDHSGTESIPHAGVELIVSKAEAA 1079
              +    V      D+ E L  G     E   +S   D+  ++S                
Sbjct: 229  DENSKDEVAISQDNDSKECLTLGDILSREDEQKSLNQDNISSDS---------------- 272

Query: 1080 VAMDKGEIRDGQEPLQVPNLTFNSTSTNHSKTEGNLPDVPVNRPVSEAEAMAVRAEKGDN 1259
                     + Q P Q+ +    S  T   +TE    + P      E +  +V       
Sbjct: 273  --------HEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQ---GEEKLSSV-----ST 316

Query: 1260 TDFQESLKVPSPTCDSTLDDHRETEANVPVDEVELIVPEVEPTAAVKEKKANTICQESLH 1439
            T  QE    P+ TC+                       E E        + N + + S  
Sbjct: 317  TTSQE----PNKTCN-----------------------EPEKPETENHHQQNCLVENSYE 349

Query: 1440 VRKFSVPQLAVLDATADPPRSSFSHSYGADLI-ISGPRTSSEHIPYXXXXXXXXXXXXXX 1616
              KFS  +            +SFS    AD + ISG  T S  I Y              
Sbjct: 350  DDKFSSSRFG---------ETSFS---AADSVSISGHITYSGPIAYSGSLSVRSDASTTS 397

Query: 1617 XXXFAFPVLQREWNTSPVKMAKARKHR----WRFGLSCCK 1724
               FAFP+LQ EWN+SPV+MAKA K R    WR  L CC+
Sbjct: 398  GRSFAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLLCCR 437


>ref|XP_004952913.1| PREDICTED: uncharacterized protein DDB_G0271670-like [Setaria
            italica]
          Length = 474

 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 33/70 (47%), Positives = 38/70 (54%), Gaps = 6/70 (8%)
 Frame = +3

Query: 1533 IISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARKHR----- 1697
            I+SGP + S HI Y                 FAFPVLQREW +SPV+MAKA + R     
Sbjct: 404  IMSGPLSMSGHIAYSGNVSLRSDSSTTSTRSFAFPVLQREWISSPVRMAKAERRRNRRRR 463

Query: 1698 -WRFGLSCCK 1724
             WR GL CCK
Sbjct: 464  AWRKGLICCK 473


>gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thaliana]
            gi|41059759|gb|AAR99354.1| hypothetical protein At2g03810
            [Arabidopsis thaliana]
          Length = 439

 Score = 61.2 bits (147), Expect = 1e-06
 Identities = 91/375 (24%), Positives = 141/375 (37%), Gaps = 20/375 (5%)
 Frame = +3

Query: 660  IVLDVKTSTLDLVKNHKFSPNITLEELLL-------KEGSDKDHCQADAISLDFSRNHQH 818
            IV   K +T  +VK+     ++ ++E  L       K  S +D  +AD  +++ S     
Sbjct: 103  IVACYKENTYHIVKDICVDESVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSA 162

Query: 819  CIDHEKVEKTKTEEACSMTTTSSIVDPKSDENTGVDKYPSVDTSELLREGCSTFPEMTFE 998
                + + K    E C+   T    D   +E++G D   +  TS    +      E    
Sbjct: 163  ---EDSISKVDDSEFCNDHKT----DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEVXA 215

Query: 999  STRSDHSGTESIPHAGVELIVSKAEAAVAMDKGEIRDGQEPLQVPNL--------TFNST 1154
            S     S +E  P        SK E A++ D     D +E L + ++        + N  
Sbjct: 216  SPTHGLSPSEIEPDEN-----SKDEVAISQDN----DSKECLTLGDILSREDEQKSLNQD 266

Query: 1155 STNHSKTEGNLPDVPVNRPVSEAEAMAVRAEKGDNTDFQESLKVPSPTCDSTLDDHRETE 1334
            + +    E   P    ++     E  A+  E     + ++  +  S    +T  +  +T 
Sbjct: 267  NISSDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGEEKLSSVSTTTSQEPNKT- 325

Query: 1335 ANVPVDEVELIVPEVEPTAAVKEKKANTICQESLHVRKFSVPQLAVLDATADPPRSSFSH 1514
             N P        PE E        + N + + S    KFS  +            +SFS 
Sbjct: 326  CNEPEK------PETE-----NHHQQNCLVENSYEDDKFSSSRFG---------ETSFS- 364

Query: 1515 SYGADLI-ISGPRTSSEHIPYXXXXXXXXXXXXXXXXXFAFPVLQREWNTSPVKMAKARK 1691
               AD + ISG  T S  I Y                 FAFP+LQ EWN+SPV+MAKA K
Sbjct: 365  --AADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADK 422

Query: 1692 HR----WRFGLSCCK 1724
             R    WR  L CC+
Sbjct: 423  RRQKGGWRHTLLCCR 437


Top