BLASTX nr result

ID: Catharanthus23_contig00001559 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00001559
         (1097 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006352658.1| PREDICTED: uncharacterized protein LOC102587...   248   4e-63
ref|XP_004242431.1| PREDICTED: uncharacterized protein LOC101255...   247   5e-63
ref|XP_002265251.1| PREDICTED: uncharacterized protein LOC100251...   234   5e-59
ref|XP_004303819.1| PREDICTED: uncharacterized protein LOC101314...   227   7e-57
ref|XP_002301821.2| YGGT family protein [Populus trichocarpa] gi...   224   4e-56
gb|EOX93769.1| YGGT family protein [Theobroma cacao]                  224   6e-56
ref|XP_002527246.1| conserved hypothetical protein [Ricinus comm...   220   9e-55
ref|XP_006284355.1| hypothetical protein CARUB_v10005528mg [Caps...   219   2e-54
ref|XP_002882528.1| hypothetical protein ARALYDRAFT_478063 [Arab...   216   1e-53
ref|XP_006414996.1| hypothetical protein EUTSA_v10026192mg [Eutr...   216   1e-53
gb|EPS74141.1| hypothetical protein M569_00615, partial [Genlise...   216   1e-53
ref|XP_006298360.1| hypothetical protein CARUB_v10014430mg, part...   215   3e-53
gb|AAM66973.1| unknown [Arabidopsis thaliana]                         214   5e-53
ref|XP_006437216.1| hypothetical protein CICLE_v10033520mg [Citr...   213   8e-53
gb|AFK43466.1| unknown [Lotus japonicus]                              212   2e-52
ref|XP_002867475.1| YGGT family protein [Arabidopsis lyrata subs...   211   4e-52
ref|XP_006407837.1| hypothetical protein EUTSA_v10021454mg [Eutr...   211   5e-52
ref|NP_566307.1| protein YLMG1-1 [Arabidopsis thaliana] gi|60418...   211   5e-52
gb|EMJ17134.1| hypothetical protein PRUPE_ppa011088mg [Prunus pe...   209   1e-51
ref|NP_194528.1| YGGT family protein [Arabidopsis thaliana] gi|4...   209   2e-51

>ref|XP_006352658.1| PREDICTED: uncharacterized protein LOC102587508 isoform X1 [Solanum
           tuberosum] gi|565372148|ref|XP_006352659.1| PREDICTED:
           uncharacterized protein LOC102587508 isoform X2 [Solanum
           tuberosum]
          Length = 222

 Score =  248 bits (632), Expect = 4e-63
 Identities = 150/239 (62%), Positives = 166/239 (69%), Gaps = 2/239 (0%)
 Frame = +3

Query: 84  MASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRII-ISLRKTAVPTLSLSS 260
           MASQTLILQNP       PRT    T L+    L+   SKPR I +  R  ++   SL  
Sbjct: 1   MASQTLILQNPNL-----PRT----TILSPPSTLI--YSKPRTISLFTRPNSLAAPSLHI 49

Query: 261 RKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKIVNLGLQ 440
            K K           T +    A +  L+T STRTITT  A+ L  SK++FQK+   GL 
Sbjct: 50  LKSKKFTVLASSS--TVLENPSAKSDPLITRSTRTITTFFAVTLAVSKLVFQKLSFNGLG 107

Query: 441 SNVRQSLIQSVGPAFFAAVKDQ-ATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFP 617
               QSL  S GP FFAA+++Q  TG LNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFP
Sbjct: 108 ----QSLAYSAGPMFFAALRNQPTTGGLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFP 163

Query: 618 NIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRGGY 794
           NIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNS+RG Y
Sbjct: 164 NIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSSRGSY 222


>ref|XP_004242431.1| PREDICTED: uncharacterized protein LOC101255803 isoform 1 [Solanum
           lycopersicum] gi|460393670|ref|XP_004242432.1|
           PREDICTED: uncharacterized protein LOC101255803 isoform
           2 [Solanum lycopersicum]
          Length = 222

 Score =  247 bits (631), Expect = 5e-63
 Identities = 150/243 (61%), Positives = 169/243 (69%), Gaps = 6/243 (2%)
 Frame = +3

Query: 84  MASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRK----TAVPTLS 251
           MASQTLILQNP       PRT    T L+    L+   SKPR I    +     A  +L 
Sbjct: 1   MASQTLILQNPNL-----PRT----TILSPPSTLI--CSKPRTISLFARPNSLAAPSSLI 49

Query: 252 LSSRKPKXXXXXXXXXXXTDVYRTPAS-AHSLLTGSTRTITTLAAIVLVASKVLFQKIVN 428
           L S+K             + V   P++ +  L+T STRTITT  A+ L  SK++FQK+  
Sbjct: 50  LKSKK------FTVSASSSTVLENPSTKSDPLITSSTRTITTYFAVTLAVSKLIFQKLSF 103

Query: 429 LGLQSNVRQSLIQSVGPAFFAAVKDQAT-GTLNTPFTVVAAGMAKWLDIYSGVLMVRVLL 605
            GL     QSL  S GP FFAA+++Q+T G LNTPFTVVAAGMAKWLDIYSGVLMVRVLL
Sbjct: 104 KGLG----QSLAYSAGPMFFAALRNQSTTGGLNTPFTVVAAGMAKWLDIYSGVLMVRVLL 159

Query: 606 SWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTR 785
           SWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNS+R
Sbjct: 160 SWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSSR 219

Query: 786 GGY 794
           G Y
Sbjct: 220 GSY 222


>ref|XP_002265251.1| PREDICTED: uncharacterized protein LOC100251416 [Vitis vinifera]
           gi|296089606|emb|CBI39425.3| unnamed protein product
           [Vitis vinifera]
          Length = 233

 Score =  234 bits (597), Expect = 5e-59
 Identities = 137/258 (53%), Positives = 163/258 (63%), Gaps = 18/258 (6%)
 Frame = +3

Query: 75  SSLMASQTLILQ--NPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTL 248
           +SLMASQTL+L+  NPT  +P      S H +L    P   H    R+ +  R+ A   +
Sbjct: 2   ASLMASQTLLLRTPNPTLHAPHCTPLPSFHVSLPTFSPPKSH----RLRLRFRRAAKLAI 57

Query: 249 SLSSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKIVN 428
           S S+  P                 TP       T STRT++ + A  L  SKV   KI +
Sbjct: 58  SASTTTPT----------------TP------FTDSTRTVSAILASALAVSKVFIAKIQD 95

Query: 429 LGLQ----------------SNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAK 560
           + L                  ++R S++ +VGP FFAA +D+ +G LNTP TVVAAGMAK
Sbjct: 96  IFLNLKQIDFTPNPEELAAIRDLRSSVVCAVGPLFFAAARDRPSGYLNTPLTVVAAGMAK 155

Query: 561 WLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLA 740
           WLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPP+FDTLDVSPLLA
Sbjct: 156 WLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPVFDTLDVSPLLA 215

Query: 741 FAVLGTLGSILNSTRGGY 794
           FAVLGTLGSILN++RG Y
Sbjct: 216 FAVLGTLGSILNNSRGMY 233


>ref|XP_004303819.1| PREDICTED: uncharacterized protein LOC101314968 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  227 bits (578), Expect = 7e-57
 Identities = 130/240 (54%), Positives = 152/240 (63%)
 Frame = +3

Query: 75  SSLMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSL 254
           +S+MASQ L  Q  T + P  P T+S                 P  I+ L  T  P+L L
Sbjct: 2   ASVMASQALYSQ--TLIKPPTPTTKS------------RPFRSPPSILRLTPTPTPSLPL 47

Query: 255 SSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKIVNLG 434
           S R+P              V  T  S   L    TRT+TTL A  L A K   + +++  
Sbjct: 48  SLRRP------------LSVSATVQSHRPLSDSPTRTLTTLFAFTLAAFKAASRPVIDFA 95

Query: 435 LQSNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWF 614
            Q     ++  S G  FFA++ D+ +G LNTP TVVAAG++KWLDIYSGVLMVRVLLSWF
Sbjct: 96  AQHG--PAVASSAGTLFFASISDRPSGYLNTPLTVVAAGLSKWLDIYSGVLMVRVLLSWF 153

Query: 615 PNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRGGY 794
           PNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILN++RG Y
Sbjct: 154 PNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNNSRGMY 213


>ref|XP_002301821.2| YGGT family protein [Populus trichocarpa]
           gi|550345783|gb|EEE81094.2| YGGT family protein [Populus
           trichocarpa]
          Length = 251

 Score =  224 bits (572), Expect = 4e-56
 Identities = 132/255 (51%), Positives = 157/255 (61%), Gaps = 13/255 (5%)
 Frame = +3

Query: 69  MASSLMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTL 248
           MA  L +    +L+ P   S  N   R   TA       L   S+  + ++L K   P+L
Sbjct: 1   MAMMLSSQTGAVLRAPFLPSHSNLSRRLPLTATTTTTTTLCLPSQNPLSLAL-KFPKPSL 59

Query: 249 SLSSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKIVN 428
           S +    +           T +   P  +   L GSTRT+ T+  + L  S++    I  
Sbjct: 60  SSTITPKQRIHRVHLSPQSTQI---PTESQPQLIGSTRTVATILTLALSLSRIFVTSIQK 116

Query: 429 LGLQSNV-------------RQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLD 569
             L  N+             + +L+ SVGP FFAA+KD+ TG LNTP TVVAAG+AKWLD
Sbjct: 117 FVLSHNLFPTPDQLVAIRALQSNLVHSVGPFFFAALKDRPTGYLNTPLTVVAAGLAKWLD 176

Query: 570 IYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAV 749
           IYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAV
Sbjct: 177 IYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAV 236

Query: 750 LGTLGSILNSTRGGY 794
           LGTLGSILNS+RG Y
Sbjct: 237 LGTLGSILNSSRGMY 251


>gb|EOX93769.1| YGGT family protein [Theobroma cacao]
          Length = 226

 Score =  224 bits (570), Expect = 6e-56
 Identities = 127/239 (53%), Positives = 159/239 (66%), Gaps = 2/239 (0%)
 Frame = +3

Query: 84  MASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSLSSR 263
           + SQTL+L+   +L P+NP +    +  N+            + +SL    +   + + +
Sbjct: 3   LLSQTLLLRASNYLPPRNPISPIFTSKTNS------------LALSL---PIKPSNPNQK 47

Query: 264 KPKXXXXXXXXXXXTDVYRTPA-SAHSLLTGSTRTITTLAAIVLVASKVLFQKIVNLGLQ 440
            PK           T   R P   A S L  STRT+ TL +I L A+ +  + + N  L+
Sbjct: 48  HPKFTLLASVSPSRTIPCRPPQIPAQSRLKDSTRTLKTLFSIALSATIIFTKMVQNFALK 107

Query: 441 S-NVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFP 617
           + +   +   +VGP FFA++KD+ +G LNTP TVVAAG+AKWLDIYSGVLMVRVLLSWFP
Sbjct: 108 TISQNPNAFSTVGPLFFASLKDRPSGYLNTPLTVVAAGLAKWLDIYSGVLMVRVLLSWFP 167

Query: 618 NIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRGGY 794
           NIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILN++RG Y
Sbjct: 168 NIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNNSRGMY 226


>ref|XP_002527246.1| conserved hypothetical protein [Ricinus communis]
           gi|223533339|gb|EEF35090.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 226

 Score =  220 bits (560), Expect = 9e-55
 Identities = 113/158 (71%), Positives = 128/158 (81%), Gaps = 3/158 (1%)
 Frame = +3

Query: 330 SAHSLLTGSTRTITTLAAIVLVASKVLFQKIV---NLGLQSNVRQSLIQSVGPAFFAAVK 500
           S  SLLT STRT+TT+ A+    S++   +I     L LQ+    +L  SVGP FFAA++
Sbjct: 71  SQQSLLTDSTRTVTTILALAFSLSRLFLNQISFLPKLALQNTT--NLTHSVGPLFFAAIR 128

Query: 501 DQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLN 680
           D+ +G LNTP TVVAAG+AKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLN
Sbjct: 129 DRPSGYLNTPLTVVAAGLAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLN 188

Query: 681 LFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRGGY 794
           LFRNIIPPIFDTLDVSPLLAFAVLG LGSIL+S+  GY
Sbjct: 189 LFRNIIPPIFDTLDVSPLLAFAVLGMLGSILSSSGAGY 226


>ref|XP_006284355.1| hypothetical protein CARUB_v10005528mg [Capsella rubella]
           gi|482553060|gb|EOA17253.1| hypothetical protein
           CARUB_v10005528mg [Capsella rubella]
          Length = 261

 Score =  219 bits (557), Expect = 2e-54
 Identities = 134/257 (52%), Positives = 165/257 (64%), Gaps = 6/257 (2%)
 Frame = +3

Query: 36  RPQNQETSET---PMASSLMASQTL--ILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLS 200
           RP    TS T    + +S++A+  L  IL  P    P+ P   S + +L+N         
Sbjct: 38  RPTKMATSTTNSLTLRASILANPRLPPILLRPRLSFPRKP---SFNLSLHN--------- 85

Query: 201 KPRIIISLRKTAVPTLSLSSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLA 380
            PR I+S   T+ P+  LS+ K                  TP+      + STR+ITTLA
Sbjct: 86  -PRTIVSSAVTSSPSPVLSTDK------------------TPSQFP--FSDSTRSITTLA 124

Query: 381 AIVLVASKVLFQKI-VNLGLQSNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMA 557
            +  V +K L QK+ V +   S   QS  ++  P FFA+++D+  G LNTP TVVAAG++
Sbjct: 125 LLAGVVTKSLIQKLSVAMVHLSPQIQSSFRAASPLFFASLRDRPAGYLNTPLTVVAAGLS 184

Query: 558 KWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLL 737
           KWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRN+IPP+FDTLDVSPLL
Sbjct: 185 KWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNVIPPVFDTLDVSPLL 244

Query: 738 AFAVLGTLGSILNSTRG 788
           AFAVLGTLGSILN+TRG
Sbjct: 245 AFAVLGTLGSILNNTRG 261


>ref|XP_002882528.1| hypothetical protein ARALYDRAFT_478063 [Arabidopsis lyrata subsp.
           lyrata] gi|297328368|gb|EFH58787.1| hypothetical protein
           ARALYDRAFT_478063 [Arabidopsis lyrata subsp. lyrata]
          Length = 232

 Score =  216 bits (551), Expect = 1e-53
 Identities = 128/247 (51%), Positives = 160/247 (64%), Gaps = 11/247 (4%)
 Frame = +3

Query: 81  LMASQTLILQNPTFLSPQNPRTRSVH--TALNNHHPLLGHLSKPRIIISLRKTAVPTLSL 254
           + A   L L++P FL P +  T   H  T L+            R+   L+    P+LS+
Sbjct: 1   MAAITALALRSPVFLPPSSATTPRFHGFTKLS-----------ARVFFPLKP--FPSLSI 47

Query: 255 SSRKPKXXXXXXXXXXXTDVYRTPASAH--SLLTGSTRTITTLAAIVLVASKVLFQKIVN 428
            + K K           T    T  S    S LTGSTR++ TLAA+ +  ++VL QK+ +
Sbjct: 48  QNPKSKSIRISASASPMTPTIPTEKSTTRPSTLTGSTRSLATLAALTIAVTRVLAQKL-S 106

Query: 429 LGLQSN-------VRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVL 587
           L +Q++       +R SL  + GP FFA+++D+  G LNTP TVVA G+ KWLDIYSGVL
Sbjct: 107 LAIQTSSPAIADGLRFSL-STAGPVFFASLRDRPPGYLNTPLTVVAVGIKKWLDIYSGVL 165

Query: 588 MVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGS 767
           MVRVLLSWFPNIPW+RQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGS
Sbjct: 166 MVRVLLSWFPNIPWERQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGS 225

Query: 768 ILNSTRG 788
           I++ + G
Sbjct: 226 IVHGSTG 232


>ref|XP_006414996.1| hypothetical protein EUTSA_v10026192mg [Eutrema salsugineum]
           gi|557116166|gb|ESQ56449.1| hypothetical protein
           EUTSA_v10026192mg [Eutrema salsugineum]
          Length = 221

 Score =  216 bits (550), Expect = 1e-53
 Identities = 127/240 (52%), Positives = 159/240 (66%), Gaps = 2/240 (0%)
 Frame = +3

Query: 75  SSLMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSL 254
           ++++A ++  + NP F    NP        L++H+P L  LS P           PT +L
Sbjct: 5   TNILALRSSFIINPRF----NPN-------LHHHNPNL-RLSLPH---------KPTFNL 43

Query: 255 SSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKI--VN 428
           S + PK             +   P    S  + ST++ITTLA +  + +K L QK+    
Sbjct: 44  SLQNPKTIVSAVVTSSSPTLSTKPPPQISF-SNSTKSITTLAILAGIVTKSLVQKLSAAI 102

Query: 429 LGLQSNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLS 608
           + L   ++ SL  S GP FFA+++D+  G LNTP TVVAAG++KWLDIYSGVLMVRVLLS
Sbjct: 103 VTLSPQIQASLRVS-GPLFFASIRDRPAGYLNTPLTVVAAGLSKWLDIYSGVLMVRVLLS 161

Query: 609 WFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRG 788
           WFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPP+FDTLDVSPLLAFAVLGTLGSILN++RG
Sbjct: 162 WFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPVFDTLDVSPLLAFAVLGTLGSILNNSRG 221


>gb|EPS74141.1| hypothetical protein M569_00615, partial [Genlisea aurea]
          Length = 150

 Score =  216 bits (550), Expect = 1e-53
 Identities = 114/149 (76%), Positives = 125/149 (83%), Gaps = 1/149 (0%)
 Frame = +3

Query: 342 LLTG-STRTITTLAAIVLVASKVLFQKIVNLGLQSNVRQSLIQSVGPAFFAAVKDQATGT 518
           LL+G STR +TTL A+ + A+KV   +I  L  Q  +R+ +    GPAFFAA+K  A G+
Sbjct: 3   LLSGRSTRMVTTLFAVAVTAAKVFGGRIFRLAFQ--LREPIAALAGPAFFAAIKSPA-GS 59

Query: 519 LNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNII 698
            NTP TVVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNII
Sbjct: 60  PNTPLTVVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNII 119

Query: 699 PPIFDTLDVSPLLAFAVLGTLGSILNSTR 785
           PPIFDTLDVSPLLAFAVLGTLGSILNSTR
Sbjct: 120 PPIFDTLDVSPLLAFAVLGTLGSILNSTR 148


>ref|XP_006298360.1| hypothetical protein CARUB_v10014430mg, partial [Capsella rubella]
           gi|482567069|gb|EOA31258.1| hypothetical protein
           CARUB_v10014430mg, partial [Capsella rubella]
          Length = 260

 Score =  215 bits (547), Expect = 3e-53
 Identities = 124/239 (51%), Positives = 156/239 (65%), Gaps = 9/239 (3%)
 Frame = +3

Query: 99  LILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSLSSRKPKXX 278
           L L++P  L P +    +     NN  P        R+   L+    P  SLS + PK  
Sbjct: 34  LALRSPVLLRPSSASNPTRFHGFNNLPP------PSRLFFPLK----PFPSLSIQNPKSI 83

Query: 279 XXXXXXXXXTDVYRTPASAH--SLLTGSTRTITTLAAIVLVASKVLFQKIVNLGLQSN-- 446
                    T +     S +  S LTGSTR++ TLAA+ +  ++VL QK+ +L +Q++  
Sbjct: 84  RIAASASPMTPILPAANSTNRSSTLTGSTRSLATLAALTIAVTRVLAQKL-SLAIQTSSP 142

Query: 447 -----VRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSW 611
                +R SL  + GP FFA+++D+  G LNTP TVVA G+ KWLDIYSGVLMVRVLLSW
Sbjct: 143 VIAEGLRLSL-STAGPVFFASLRDRPPGYLNTPLTVVAVGIKKWLDIYSGVLMVRVLLSW 201

Query: 612 FPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRG 788
           FPNIPW+RQPLSAIRDLCDPYLNLFRN+IPPIFDTLDVSPLLAFAVLGTLGSI++ + G
Sbjct: 202 FPNIPWERQPLSAIRDLCDPYLNLFRNVIPPIFDTLDVSPLLAFAVLGTLGSIVHGSTG 260


>gb|AAM66973.1| unknown [Arabidopsis thaliana]
          Length = 234

 Score =  214 bits (545), Expect = 5e-53
 Identities = 128/246 (52%), Positives = 158/246 (64%), Gaps = 10/246 (4%)
 Frame = +3

Query: 81  LMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSLSS 260
           + A   L L++P FL P +  T        N  P        R+   L     P  SLS 
Sbjct: 1   MAAITALTLRSPVFLLPPSSVTSPRFHGFTNQPP------PARLFFPLN----PFPSLSI 50

Query: 261 RKPKXXXXXXXXXXXTD-VYRTPASA--HSLLTGSTRTITTLAAIVLVASKVLFQKIVNL 431
           + PK           T  + +T  S    S LTGSTR++ TLAA+ +  ++VL QK+ +L
Sbjct: 51  QNPKSIRISASASPITTPILQTEKSTARSSTLTGSTRSLATLAALAIAVTRVLAQKL-SL 109

Query: 432 GLQSN-------VRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLM 590
            +Q++       +R SL  + GP FFA+++D+  G LNTP TVVA G+ KWLDIYSGVLM
Sbjct: 110 AIQTSSPVIADGLRFSL-STAGPVFFASLRDRPPGYLNTPLTVVAVGIKKWLDIYSGVLM 168

Query: 591 VRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI 770
           VRVLLSWFPNIPW+RQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI
Sbjct: 169 VRVLLSWFPNIPWERQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI 228

Query: 771 LNSTRG 788
           ++ + G
Sbjct: 229 VHGSTG 234


>ref|XP_006437216.1| hypothetical protein CICLE_v10033520mg [Citrus clementina]
           gi|557539412|gb|ESR50456.1| hypothetical protein
           CICLE_v10033520mg [Citrus clementina]
          Length = 156

 Score =  213 bits (543), Expect = 8e-53
 Identities = 103/149 (69%), Positives = 128/149 (85%)
 Frame = +3

Query: 342 LLTGSTRTITTLAAIVLVASKVLFQKIVNLGLQSNVRQSLIQSVGPAFFAAVKDQATGTL 521
           LL  STRT++T+ ++  +  K L Q +     +S++  +++++VGPAFFA ++++ +G L
Sbjct: 7   LLADSTRTVSTIFSLSGLFIKSLIQNLAPTLNKSSLHCNIVRTVGPAFFARMRERPSGYL 66

Query: 522 NTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIP 701
           NTP TVVAAG+AKWLDIYSGVL+VRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIP
Sbjct: 67  NTPLTVVAAGLAKWLDIYSGVLLVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIP 126

Query: 702 PIFDTLDVSPLLAFAVLGTLGSILNSTRG 788
           P+FDTLDVSPLLAFAVLGTLGSILN++RG
Sbjct: 127 PVFDTLDVSPLLAFAVLGTLGSILNNSRG 155


>gb|AFK43466.1| unknown [Lotus japonicus]
          Length = 219

 Score =  212 bits (539), Expect = 2e-52
 Identities = 120/236 (50%), Positives = 154/236 (65%)
 Frame = +3

Query: 72  ASSLMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLS 251
           A+  MAS  LI     F +P   + +S    +   HP       PR++ + R+T   + S
Sbjct: 3   ATMTMASARLI----AFHTPSAAKNQSPPCLIGRRHPHPPQCHLPRLLHNHRQTRTVSCS 58

Query: 252 LSSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKIVNL 431
           L++ +             + V     SAH+ L+GSTRT+TTL ++ L+ +K +   + N 
Sbjct: 59  LTTNQRN-----------SPVCEIRESAHTTLSGSTRTVTTLLSMALLCAKAI-PPLANG 106

Query: 432 GLQSNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSW 611
            +  +V  S +      FFA+++D+  G LNTP TVVAAG+ KWLDIYSGVLMVRVLLSW
Sbjct: 107 AISMSVGGSSL------FFASLRDRPEGYLNTPLTVVAAGLGKWLDIYSGVLMVRVLLSW 160

Query: 612 FPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNS 779
           FPNIPW+RQPLSAIRDLCDPYLNLFRNIIPP+FDTLDVSPLLAFAVLGTLGSIL +
Sbjct: 161 FPNIPWERQPLSAIRDLCDPYLNLFRNIIPPVFDTLDVSPLLAFAVLGTLGSILQT 216


>ref|XP_002867475.1| YGGT family protein [Arabidopsis lyrata subsp. lyrata]
           gi|297313311|gb|EFH43734.1| YGGT family protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 210

 Score =  211 bits (537), Expect = 4e-52
 Identities = 114/186 (61%), Positives = 136/186 (73%), Gaps = 4/186 (2%)
 Frame = +3

Query: 240 PTLSLSSRKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQK 419
           P+ +LS   P+           + V  T   +    + STR+ITTLA +  V +K L QK
Sbjct: 29  PSFNLSLHNPRTIVSSAVTSL-SPVLSTKPPSQFPFSDSTRSITTLALLAGVVTKSLIQK 87

Query: 420 ----IVNLGLQSNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVL 587
               IVN+  Q    Q+ I++  P FFA+++D+  G LNTP TVVAAG++KWLDIYSGVL
Sbjct: 88  LSVAIVNISPQI---QASIRTASPLFFASLRDRPAGYLNTPLTVVAAGLSKWLDIYSGVL 144

Query: 588 MVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGS 767
           MVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPP+FDTLDVSPLLAFAVLGTLGS
Sbjct: 145 MVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPVFDTLDVSPLLAFAVLGTLGS 204

Query: 768 ILNSTR 785
           ILN++R
Sbjct: 205 ILNNSR 210


>ref|XP_006407837.1| hypothetical protein EUTSA_v10021454mg [Eutrema salsugineum]
           gi|557108983|gb|ESQ49290.1| hypothetical protein
           EUTSA_v10021454mg [Eutrema salsugineum]
          Length = 228

 Score =  211 bits (536), Expect = 5e-52
 Identities = 120/236 (50%), Positives = 151/236 (63%)
 Frame = +3

Query: 81  LMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSLSS 260
           + A   L L++P +L P +      H         L   +KP   +S+R      +  SS
Sbjct: 1   MAAFTPLALRSPLYLPPPSAINPKFHCINRIPPARLFFPAKPFPSLSIRNLDSIQIKASS 60

Query: 261 RKPKXXXXXXXXXXXTDVYRTPASAHSLLTGSTRTITTLAAIVLVASKVLFQKIVNLGLQ 440
                              ++  +  S LTGSTR++ TLAA+ +  ++ L QK+ + G+ 
Sbjct: 61  ST------CTPITPLNPAAKSTTTRSSTLTGSTRSLATLAALTIALARALAQKL-SPGIG 113

Query: 441 SNVRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFPN 620
             +R SL  + GP FFA+V+D+  G LNTP TVVA G+ KWLDIYSGVLMVRVLLSWFPN
Sbjct: 114 DALRFSL-STAGPVFFASVRDRPPGYLNTPLTVVAVGIKKWLDIYSGVLMVRVLLSWFPN 172

Query: 621 IPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSILNSTRG 788
           IPW+RQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI++ + G
Sbjct: 173 IPWERQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSIVHGSTG 228


>ref|NP_566307.1| protein YLMG1-1 [Arabidopsis thaliana]
           gi|6041841|gb|AAF02150.1|AC009853_10 unknown protein
           [Arabidopsis thaliana] gi|20466762|gb|AAM20698.1|
           unknown protein [Arabidopsis thaliana]
           gi|30023676|gb|AAP13371.1| At3g07430 [Arabidopsis
           thaliana] gi|332641021|gb|AEE74542.1| YGGT family
           protein [Arabidopsis thaliana]
          Length = 232

 Score =  211 bits (536), Expect = 5e-52
 Identities = 127/246 (51%), Positives = 159/246 (64%), Gaps = 10/246 (4%)
 Frame = +3

Query: 81  LMASQTLILQNPTFLSPQNPRTRSVHTALNNHHPLLGHLSKPRIIISLRKTAVPTLSLSS 260
           + A   L L++P +L P +  +   H   N   P        R+   L     P  SLS 
Sbjct: 1   MAAITALTLRSPVYL-PSSATSPRFHGFTNQPPPA-------RLFFPLN----PFPSLSI 48

Query: 261 RKPKXXXXXXXXXXXTD-VYRTPASA--HSLLTGSTRTITTLAAIVLVASKVLFQKIVNL 431
           + PK           T  + +T  S    S LTGSTR++ TLAA+ +  ++VL QK+ +L
Sbjct: 49  QNPKSIRISASASPITTPILQTEKSTARSSTLTGSTRSLATLAALAIAVTRVLAQKL-SL 107

Query: 432 GLQSN-------VRQSLIQSVGPAFFAAVKDQATGTLNTPFTVVAAGMAKWLDIYSGVLM 590
            +Q++       +R SL  + GP FFA+++D+  G LNTP TVVA G+ KWLDIYSGVLM
Sbjct: 108 AIQTSSPVIADGLRFSL-STAGPVFFASLRDRPPGYLNTPLTVVAVGIKKWLDIYSGVLM 166

Query: 591 VRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI 770
           VRVLLSWFPNIPW+RQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI
Sbjct: 167 VRVLLSWFPNIPWERQPLSAIRDLCDPYLNLFRNIIPPIFDTLDVSPLLAFAVLGTLGSI 226

Query: 771 LNSTRG 788
           ++ + G
Sbjct: 227 VHGSTG 232


>gb|EMJ17134.1| hypothetical protein PRUPE_ppa011088mg [Prunus persica]
          Length = 224

 Score =  209 bits (533), Expect = 1e-51
 Identities = 106/146 (72%), Positives = 120/146 (82%)
 Frame = +3

Query: 357 TRTITTLAAIVLVASKVLFQKIVNLGLQSNVRQSLIQSVGPAFFAAVKDQATGTLNTPFT 536
           TRT+TTL A+ L   + L   ++  G Q     S+  + GP FFAA+ D+ +G LNTP T
Sbjct: 81  TRTLTTLFALTLAVVRNLSISLLKFGSQFG--PSIGSAAGPLFFAALGDRPSGYLNTPLT 138

Query: 537 VVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDT 716
           VVAAG++KWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDT
Sbjct: 139 VVAAGLSKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIPPIFDT 198

Query: 717 LDVSPLLAFAVLGTLGSILNSTRGGY 794
           LDVSPLLAFAVLGTLGSILN++RG Y
Sbjct: 199 LDVSPLLAFAVLGTLGSILNNSRGMY 224


>ref|NP_194528.1| YGGT family protein [Arabidopsis thaliana]
           gi|4455358|emb|CAB36768.1| putative protein [Arabidopsis
           thaliana] gi|7269653|emb|CAB79601.1| putative protein
           [Arabidopsis thaliana] gi|110741138|dbj|BAE98662.1|
           hypothetical protein [Arabidopsis thaliana]
           gi|332660017|gb|AEE85417.1| YGGT family protein
           [Arabidopsis thaliana]
          Length = 218

 Score =  209 bits (532), Expect = 2e-51
 Identities = 108/149 (72%), Positives = 122/149 (81%), Gaps = 4/149 (2%)
 Frame = +3

Query: 354 STRTITTLAAIVLVASKVLFQK----IVNLGLQSNVRQSLIQSVGPAFFAAVKDQATGTL 521
           STR+ITTL  +  V  K L QK    IVNL  Q    Q+  ++  P FFA+++D+  G L
Sbjct: 73  STRSITTLVLLAGVVIKSLIQKLSVAIVNLSPQI---QASFRTASPLFFASLRDRPAGYL 129

Query: 522 NTPFTVVAAGMAKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIP 701
           NTP TVVAAG++KWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIP
Sbjct: 130 NTPLTVVAAGLSKWLDIYSGVLMVRVLLSWFPNIPWDRQPLSAIRDLCDPYLNLFRNIIP 189

Query: 702 PIFDTLDVSPLLAFAVLGTLGSILNSTRG 788
           P+FDTLDVSPLLAFAVLGTLGSILN++RG
Sbjct: 190 PVFDTLDVSPLLAFAVLGTLGSILNNSRG 218


Top