BLASTX nr result

ID: Cocculus22_contig00025456 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00025456
         (759 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613...   164   2e-38
ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citr...   163   7e-38
gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]     162   9e-38
emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera]   155   1e-35
ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prun...   155   2e-35
ref|XP_002530358.1| conserved hypothetical protein [Ricinus comm...   154   4e-35
ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797...   152   9e-35
ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Popu...   148   2e-33
ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phas...   146   9e-33
ref|XP_006406618.1| hypothetical protein EUTSA_v10020051mg [Eutr...   146   9e-33
ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305...   142   1e-31
ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205...   142   2e-31
ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago ...   140   5e-31
ref|XP_006299498.1| hypothetical protein CARUB_v10015667mg [Caps...   137   3e-30
ref|XP_007026747.1| TATA box-binding protein-associated factor R...   132   1e-28
ref|XP_002885248.1| hypothetical protein ARALYDRAFT_479330 [Arab...   129   1e-27
ref|NP_188460.1| uncharacterized protein [Arabidopsis thaliana] ...   128   2e-27
ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260...   126   7e-27
ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cuc...   121   2e-25
ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [A...   109   9e-22

>ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613824 [Citrus sinensis]
          Length = 910

 Score =  164 bits (416), Expect = 2e-38
 Identities = 106/250 (42%), Positives = 142/250 (56%), Gaps = 5/250 (2%)
 Frame = -3

Query: 736 LLPSIASSIA--FDFVSDHPXXXXSQFTDYDNNVLQMLRCP-DNDILLFFPSGDNWDRIG 566
           LLPS ++SIA  FD V  H        +D D N L++L CP +N  + FFP+GDN D++G
Sbjct: 75  LLPSTSTSIASQFDDVGTHQHPNG-SLSDQDYNRLRLLYCPLNNTAIAFFPTGDNNDQLG 133

Query: 565 FVKLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNS-VAVGFLL 389
           F+ +S+ KGS+  V++D     F   + L  RI  I V  V +      GNS V VG+LL
Sbjct: 134 FLVISA-KGSRFDVLSDEDDAVFTVVNRLNGRIRGILVNPVEEFYSAFQGNSLVNVGYLL 192

Query: 388 ARTLYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLENGE 212
           A T+YSVHWF VKVS    S  KPV+ Y+G K F   SV  ACW+PHL EES+VLL++G+
Sbjct: 193 AFTMYSVHWFSVKVSKASESTIKPVVSYLGFKLFKTCSVVGACWSPHLPEESVVLLQSGD 252

Query: 211 LYLFDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHP 32
           L++FD+             KG R+ VSW                 ++   WL  EFSWHP
Sbjct: 253 LFMFDVNG--------RESKGKRLRVSW----------TDDDLSSSQSCAWLGVEFSWHP 294

Query: 31  QIFVVACLNA 2
           QI +VA ++A
Sbjct: 295 QILIVARMDA 304


>ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citrus clementina]
           gi|557533804|gb|ESR44922.1| hypothetical protein
           CICLE_v10000213mg [Citrus clementina]
          Length = 910

 Score =  163 bits (412), Expect = 7e-38
 Identities = 104/252 (41%), Positives = 142/252 (56%), Gaps = 7/252 (2%)
 Frame = -3

Query: 736 LLPSIASSIAFDF----VSDHPXXXXSQFTDYDNNVLQMLRCP-DNDILLFFPSGDNWDR 572
           LLPS ++SIA  F       HP       +D D N L++L CP +N  + FFP+GDN D+
Sbjct: 75  LLPSTSTSIASQFGDVGTHQHPDG---SLSDQDYNRLRLLYCPLNNTAIAFFPTGDNNDQ 131

Query: 571 IGFVKLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNS-VAVGF 395
           +GF+ +S+ KGS+  V++D     F   + L  RI  I V  V +      GNS V VG+
Sbjct: 132 LGFLVISA-KGSRFDVLSDEDDAIFMVLNRLNGRIRGILVNPVEEFDSAFQGNSLVNVGY 190

Query: 394 LLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLEN 218
           LLA T+YSVHWF VKVS    S  KPV+ Y+G K F   SV  ACW+PHL EES+VLL++
Sbjct: 191 LLAFTMYSVHWFSVKVSKASESTTKPVVSYLGFKLFKTCSVVGACWSPHLPEESVVLLQS 250

Query: 217 GELYLFDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSW 38
           G+L++FD+ +           KG R+ VSW                 ++   WL  EFSW
Sbjct: 251 GDLFMFDVNA--------RESKGKRLRVSW----------TDDDLSSSQSCAWLGVEFSW 292

Query: 37  HPQIFVVACLNA 2
           HP+I +VA ++A
Sbjct: 293 HPRILIVARMDA 304


>gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]
          Length = 1000

 Score =  162 bits (411), Expect = 9e-38
 Identities = 100/252 (39%), Positives = 144/252 (57%), Gaps = 6/252 (2%)
 Frame = -3

Query: 751 SKDPFLLPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDND-ILLFFPSGDNWD 575
           S D   LPS +SSIA  F   H     +  + + +N LQ+L CP  D  ++FFP+GDN +
Sbjct: 73  SDDSSQLPSTSSSIASVFGPHHYQDDVA--SAFSHNRLQLLHCPRTDKFIVFFPTGDNAN 130

Query: 574 RIGFVKLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLG---FDVTGNSVA 404
           ++GF+ LS +K S   V  D +G+ F        +IL IS+  V D G     + GNS  
Sbjct: 131 QVGFMLLS-IKNSCLDVRVDDNGEAFMVDCGSNHQILRISINPVVDSGSALLALGGNSSG 189

Query: 403 -VGFLLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLV 230
            +G+LLA T+YSVHW+ ++V  LG ++  P L  +G+K F    +  ACW+PH++EES++
Sbjct: 190 TIGYLLASTMYSVHWYVIEVKELGLNLH-PSLTCVGTKVFKTCCIVHACWSPHILEESII 248

Query: 229 LLENGELYLFDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKC 50
           LLE+G L+LFDL SC + +      KGTR+ VSW+                    +WL C
Sbjct: 249 LLESGALFLFDLESCLKTNTLSPHFKGTRLKVSWDDSNNSGDL------------KWLSC 296

Query: 49  EFSWHPQIFVVA 14
           EFSWHP+I +VA
Sbjct: 297 EFSWHPRILIVA 308


>emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera]
          Length = 865

 Score =  155 bits (392), Expect = 1e-35
 Identities = 94/211 (44%), Positives = 122/211 (57%), Gaps = 2/211 (0%)
 Frame = -3

Query: 640 LQMLRCPDNDILLFFPSGDNWDRIGFVKLSSLKGSKPQVVADRSGQEFRTAHCLGSRILD 461
           L +LRCP+  +L  FP+G N D+IGF+ LS +K S   V ADR+G  F +   L  RI+ 
Sbjct: 66  LHLLRCPNAAVLALFPTGVNSDQIGFLLLS-VKDSCLDVRADRNGDVFVSKKRLNHRIVQ 124

Query: 460 ISVISVADLGFDVTGNSVAVGFLLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-- 287
           I    +   G+  +GN  +VG +LA T+YSVHWF V+  N+ S   +P L Y+G K F  
Sbjct: 125 ILATPI---GYSFSGNPDSVGLVLACTMYSVHWFSVRNDNIDS---EPGLIYLGGKVFKS 178

Query: 286 CASVASACWNPHLIEESLVLLENGELYLFDLGSCSRADKFPLRLKGTRVPVSWEXXXXXX 107
           CA V SACW+PHL EE LVLLE+GEL+LFDL  C     F    KG R+ + W       
Sbjct: 179 CA-VVSACWSPHLSEECLVLLESGELFLFDLDYCCSNSNF----KGNRLKIMWHNADCSG 233

Query: 106 XXXXXXXXXGARLGEWLKCEFSWHPQIFVVA 14
                        G+WL CEFSWHP+I +VA
Sbjct: 234 D------------GKWLGCEFSWHPRILIVA 252


>ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica]
           gi|462415669|gb|EMJ20406.1| hypothetical protein
           PRUPE_ppa017292mg [Prunus persica]
          Length = 925

 Score =  155 bits (391), Expect = 2e-35
 Identities = 101/249 (40%), Positives = 138/249 (55%), Gaps = 9/249 (3%)
 Frame = -3

Query: 733 LPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPD-NDILLFFPSGDNWDRIGFVK 557
           LPS   S+A      HP    S    Y  N L+ L+CP  N +++FFP+G+N D++GF++
Sbjct: 84  LPSSVPSVASFLGPHHPKSDVSSSLLY--NRLEFLQCPQINTVVVFFPTGENSDQVGFLQ 141

Query: 556 LSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDV---TGNSVAVGFLLA 386
           L  LKGS   V  D +G  F +      RI  ISV  +   GF      G+ V +G+LLA
Sbjct: 142 LV-LKGSTFDVKVDENGGVFASRRWFSYRISRISVNPIP--GFSSLRGNGSCVTIGYLLA 198

Query: 385 RTLYSVHWFRVKVSNLG-SSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLENGE 212
            T+YSVHWF VKV + G +S  +  L ++GSK F    V  ACW+PHL+EES+VLLENG+
Sbjct: 199 STMYSVHWFIVKVGDFGPNSDSRVSLVHLGSKIFKTCCVVHACWSPHLLEESVVLLENGD 258

Query: 211 LYLFDLGSCSRAD---KFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFS 41
           L+LFDL S  +         +  GTR+ V W+                +R   WL CEFS
Sbjct: 259 LFLFDLDSRLKTPHTLNANFKFNGTRLKVPWD---------IDDGSGSSRNYRWLSCEFS 309

Query: 40  WHPQIFVVA 14
           WHP++ +VA
Sbjct: 310 WHPRLLIVA 318


>ref|XP_002530358.1| conserved hypothetical protein [Ricinus communis]
           gi|223530105|gb|EEF32019.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 912

 Score =  154 bits (388), Expect = 4e-35
 Identities = 91/219 (41%), Positives = 128/219 (58%), Gaps = 3/219 (1%)
 Frame = -3

Query: 649 NNVLQMLRCP-DNDILLFFPSGDNWDRIGFVKLSSLKGSKPQVVADRSGQEFRTAHCLGS 473
           +N LQ L CP DN +++FF +G N D++GF+ LS +   +   V D  G  F    CL  
Sbjct: 106 HNQLQFLNCPHDNSVIVFFSTGCNHDQVGFLLLS-VNDKRLCAVGDSRGGVFVANKCLNQ 164

Query: 472 RILDISVISVADLG-FDVTGNSVAVGFLLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGS 296
           RI+ I V  V D G F+   +S  VG+LL  TL+SVHWF VK+  +    ++P+L ++G 
Sbjct: 165 RIVKILVNPVVDSGYFEGNASSKIVGYLLVYTLFSVHWFCVKIGEIN---ERPILGHVGC 221

Query: 295 KQF-CASVASACWNPHLIEESLVLLENGELYLFDLGSCSRADKFPLRLKGTRVPVSWEXX 119
           K F   S+  ACW+PHLIEES+VLLENG L+LFDL S S    F    +GT++ V W+  
Sbjct: 222 KTFKSCSIVDACWSPHLIEESVVLLENGGLFLFDLNSDSSNAYF----RGTKLKVLWD-- 275

Query: 118 XXXXXXXXXXXXXGARLGEWLKCEFSWHPQIFVVACLNA 2
                         ++  +WL C+FSWHP+I +VA  +A
Sbjct: 276 ----------DLGKSKNFKWLGCQFSWHPRILIVASSDA 304


>ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797045 isoform X1 [Glycine
           max] gi|571481421|ref|XP_006588649.1| PREDICTED:
           uncharacterized protein LOC100797045 isoform X2 [Glycine
           max]
          Length = 894

 Score =  152 bits (385), Expect = 9e-35
 Identities = 100/243 (41%), Positives = 137/243 (56%), Gaps = 2/243 (0%)
 Frame = -3

Query: 736 LLPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDN-DILLFFPSGDNWDRIGFV 560
           +LPS ASS+A  F   +     S F     N L +L  P+  + ++FFP+G N D++GF 
Sbjct: 76  ILPSTASSVASLFSFPNQNDAASLFL---RNRLHLLYYPNRPNAVVFFPTGANDDKLGFF 132

Query: 559 KLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLART 380
            L+ +K S+  ++ D +G  FR +     RIL+ISV  VAD G  +   S  +G+LLA  
Sbjct: 133 ILA-VKDSRLDILLDSNGDVFRASTGSAHRILNISVNPVADSG--LFNESHVIGYLLASA 189

Query: 379 LYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLENGELYL 203
           LYSVHWF VK +++   + +P + Y+G K F    V  ACW+PH++EESLVLLENG+L+L
Sbjct: 190 LYSVHWFAVKHNSV---LDRPSVFYLGGKTFKTCPVVHACWSPHILEESLVLLENGQLFL 246

Query: 202 FDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIF 23
           FDL S    D      KGTR+ V W                      WL CEFSWHP++F
Sbjct: 247 FDLES---HDTTGAAFKGTRLKVPWNDLGFSVNNTV-----------WLSCEFSWHPRVF 292

Query: 22  VVA 14
           VVA
Sbjct: 293 VVA 295


>ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa]
           gi|222858389|gb|EEE95936.1| hypothetical protein
           POPTR_0012s03820g [Populus trichocarpa]
          Length = 906

 Score =  148 bits (373), Expect = 2e-33
 Identities = 97/243 (39%), Positives = 135/243 (55%), Gaps = 5/243 (2%)
 Frame = -3

Query: 727 SIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDND-ILLFFPSGDNWDRIGFVKLS 551
           S ASSIAF F    P            N LQ L+CP +D +++FF +G N DR+GF+ LS
Sbjct: 86  STASSIAFSF---GPQDLHFSSPLLAYNRLQFLKCPHDDTVVVFFSTGTNLDRVGFLLLS 142

Query: 550 SLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGN---SVAVGFLLART 380
            +K        D+ G  F  +  LGS+I+ + V  + D  F + GN   S + G+LL  T
Sbjct: 143 -VKDKSLVATGDQKGGIFTASKSLGSKIVRVLVNPIEDDSF-LNGNYSFSGSFGYLLVYT 200

Query: 379 LYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLENGELYL 203
           +YSV+WF VK S    S+K+PVL Y+G K F    +ASACW+P++  +S+VLLENG L+L
Sbjct: 201 MYSVNWFCVKYSE---SMKRPVLSYLGCKNFKSCGIASACWSPYIKVQSVVLLENGTLFL 257

Query: 202 FDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIF 23
           FDL     AD   +  +GT++ VSW                    G+WL CEFSWH ++ 
Sbjct: 258 FDL----EADCSDMYFRGTKLKVSWGDEGKLGD------------GKWLGCEFSWHCRVL 301

Query: 22  VVA 14
           +VA
Sbjct: 302 IVA 304


>ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris]
           gi|593199831|ref|XP_007132390.1| hypothetical protein
           PHAVU_011G090800g [Phaseolus vulgaris]
           gi|593199873|ref|XP_007132391.1| hypothetical protein
           PHAVU_011G090800g [Phaseolus vulgaris]
           gi|561005389|gb|ESW04383.1| hypothetical protein
           PHAVU_011G090800g [Phaseolus vulgaris]
           gi|561005390|gb|ESW04384.1| hypothetical protein
           PHAVU_011G090800g [Phaseolus vulgaris]
           gi|561005391|gb|ESW04385.1| hypothetical protein
           PHAVU_011G090800g [Phaseolus vulgaris]
          Length = 894

 Score =  146 bits (368), Expect = 9e-33
 Identities = 101/253 (39%), Positives = 129/253 (50%), Gaps = 7/253 (2%)
 Frame = -3

Query: 751 SKDPFLLPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDNDI-LLFFPSGDNWD 575
           S  P +LPS ASSIA  F S H       F    +N L +L  P     LL FP+G N  
Sbjct: 71  SHPPSILPSTASSIASLFSSTHQNDAAPPFL---HNRLHLLTYPHRPYALLLFPAGSNDH 127

Query: 574 RIGFVKLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGN-----S 410
           ++ F  L   K S+     D  G  F  +     RIL+ISV  VAD GF  + +     S
Sbjct: 128 KLAFFTLR-FKDSRFHTQLDTKGDVFYASTGSSHRILNISVNPVADFGFTGSDDEDDDAS 186

Query: 409 VAVGFLLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESL 233
             +G+LLA TLYSVHWF   V+     + +P +  +G K F    VA ACW+PH++EES+
Sbjct: 187 RVIGYLLATTLYSVHWF---VARHNQILDRPSVVCLGDKMFKTCPVAHACWSPHILEESV 243

Query: 232 VLLENGELYLFDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLK 53
           VLLE+G+L+LFDL  C     F    KGTR+ V W                 +    WL 
Sbjct: 244 VLLESGQLFLFDLECCGAGAGF----KGTRLKVPW--------------IDSSESKVWLS 285

Query: 52  CEFSWHPQIFVVA 14
           CEFSWHP+I VVA
Sbjct: 286 CEFSWHPRILVVA 298


>ref|XP_006406618.1| hypothetical protein EUTSA_v10020051mg [Eutrema salsugineum]
           gi|557107764|gb|ESQ48071.1| hypothetical protein
           EUTSA_v10020051mg [Eutrema salsugineum]
          Length = 852

 Score =  146 bits (368), Expect = 9e-33
 Identities = 95/241 (39%), Positives = 128/241 (53%), Gaps = 2/241 (0%)
 Frame = -3

Query: 730 PSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCP-DNDILLFFPSGDNWDRIGFVKL 554
           PS +S+I   F   HP     +   Y  N LQ+LRCP  N +L+FFP+G N D+IGFV L
Sbjct: 64  PSDSSAIEASFRIPHPNDDAERVLSY--NRLQLLRCPVKNCVLVFFPTGSNLDQIGFVLL 121

Query: 553 SSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLARTLY 374
           S+      +V+    G  F       SRIL I V  +++LG     +S+  G+++  TLY
Sbjct: 122 STGDSGAIRVMGTDEGYVFVAKERFFSRILKIFVQPISNLG----ASSMEFGYVMVYTLY 177

Query: 373 SVHWFRVKVSNLGSSVKKPVLEYMGSKQFC-ASVASACWNPHLIEESLVLLENGELYLFD 197
           S+HWF VK      S+ +PVL Y+G KQF   S+ASA W+PH   E LVLLENGE+++FD
Sbjct: 178 SIHWFSVKYDE---SLGRPVLSYLGQKQFKRCSIASASWSPHFPGECLVLLENGEVFVFD 234

Query: 196 LGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIFVV 17
           L       +   R +G ++ VSWE                     WL CEF W   IF+V
Sbjct: 235 LN-----QRHLGRFRGCKMKVSWEGQGKSVNR------------NWLGCEFGWRFGIFIV 277

Query: 16  A 14
           A
Sbjct: 278 A 278


>ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305856 [Fragaria vesca
           subsp. vesca]
          Length = 914

 Score =  142 bits (358), Expect = 1e-31
 Identities = 97/243 (39%), Positives = 132/243 (54%), Gaps = 3/243 (1%)
 Frame = -3

Query: 733 LPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPD-NDILLFFPSGDNWDRIGFVK 557
           LPS +SSIA  F+  H             N L+ L+CP  N IL+FFP+G+N D++G ++
Sbjct: 77  LPSTSSSIA-PFLGPHQYKN--DLLSSFRNRLEFLQCPKTNTILIFFPTGENSDQVGLLE 133

Query: 556 LSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNS-VAVGFLLART 380
           L  LK S   V         +  +    +IL ISV  +  L  ++TGN  V +G++LA T
Sbjct: 134 LV-LKDSTFDVKVGGLSTRCQFKY----QILRISVNPLPSLS-NLTGNGPVTIGYVLAST 187

Query: 379 LYSVHWFRVKVSNLGSSVKKPVLEYMGSKQFCAS-VASACWNPHLIEESLVLLENGELYL 203
           +YSVHWF VK+ + GS+     L Y+G + F A  V  ACW+PH+ EES+VLLENG L+L
Sbjct: 188 MYSVHWFIVKLGDFGSNSDSIRLVYVGDRVFKACCVVHACWSPHVPEESVVLLENGALFL 247

Query: 202 FDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIF 23
           FDL S  R        KGTR+ V W+                     WL CEFSWHP++ 
Sbjct: 248 FDLESRLRNTISNANFKGTRLKVLWDNNGYDSGNY-----------RWLSCEFSWHPRVL 296

Query: 22  VVA 14
           +VA
Sbjct: 297 IVA 299


>ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205354 [Cucumis sativus]
          Length = 907

 Score =  142 bits (357), Expect = 2e-31
 Identities = 98/246 (39%), Positives = 136/246 (55%), Gaps = 5/246 (2%)
 Frame = -3

Query: 736 LLPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDND-ILLFFPSGDNWDRIGFV 560
           ++PS +SS+A  F              Y  N LQ L CP++  +++FFP+G N D +GF+
Sbjct: 74  VVPSTSSSVASLFGEQQCCSDPPSVLRY--NRLQCLPCPNSSSVVVFFPTGPNSDHVGFL 131

Query: 559 KLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLART 380
            +SS  GS   V +D S   F     L  +I  I+V    + GF V  +   +GFLLA T
Sbjct: 132 VVSS-NGSGLDVQSDCSNDVFSVESELNYQIFGIAVNP--NSGF-VDDSYEDIGFLLAYT 187

Query: 379 LYSVHWFRVKVSNLGSSVKKPV-LEYMGSKQF-CASVASACWNPHLIEESLVLLENGELY 206
           +YSV WF VK   +GSS +  V L +MGSK F   SV  ACWNPHL EES+VLLE+G L+
Sbjct: 188 MYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPHLSEESVVLLEDGSLF 247

Query: 205 LFDLGSCSRADKF--PLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHP 32
           LFD+    +   +   + LKG ++ VSW+                ++  +WL CEFSWHP
Sbjct: 248 LFDMEPLLKTKDYNANVNLKGIKLKVSWD------------GLDCSKKVKWLSCEFSWHP 295

Query: 31  QIFVVA 14
           +I +VA
Sbjct: 296 RILIVA 301


>ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago truncatula]
           gi|355489812|gb|AES71015.1| hypothetical protein
           MTR_3g069120 [Medicago truncatula]
          Length = 884

 Score =  140 bits (353), Expect = 5e-31
 Identities = 96/252 (38%), Positives = 138/252 (54%), Gaps = 2/252 (0%)
 Frame = -3

Query: 751 SKDPFLLPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDND-ILLFFPSGDNWD 575
           + DP +LPS AS+IA  F S  P       + + +N +Q+L+CP+    ++ FP+G N +
Sbjct: 67  TSDPSILPSTASTIAHLFDST-PELDDDNVSHFLHNRIQLLKCPNTPKAVVIFPTGANDE 125

Query: 574 RIGFVKLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGF 395
            IGF  L  +K S  +   D  G  FR +    SRIL +SV  V +   +   + V +G+
Sbjct: 126 TIGFFMLG-VKDSLLETRLDVKGDVFRASTGSSSRILRMSVNPVTEDDSEPDSSPV-IGY 183

Query: 394 LLARTLYSVHWFRVKVSNLGSSVKKPVLEYMG-SKQFCASVASACWNPHLIEESLVLLEN 218
           +LA + YSV WF VK  NL S    P + Y+G SK F  +V  ACW+PH++EES+VLLE+
Sbjct: 184 VLASSRYSVCWFDVK-HNLSSD--SPSMSYLGRSKVFKEAVVRACWSPHILEESMVLLES 240

Query: 217 GELYLFDLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSW 38
           G+L+LFD+ +      F    KGTR+ V W                 +    WL CEFSW
Sbjct: 241 GQLFLFDVDAQGSMKTF----KGTRLRVPWN------------DSACSENKAWLSCEFSW 284

Query: 37  HPQIFVVACLNA 2
           HP+I +VA  +A
Sbjct: 285 HPRILIVARYDA 296


>ref|XP_006299498.1| hypothetical protein CARUB_v10015667mg [Capsella rubella]
           gi|482568207|gb|EOA32396.1| hypothetical protein
           CARUB_v10015667mg [Capsella rubella]
          Length = 866

 Score =  137 bits (346), Expect = 3e-30
 Identities = 96/242 (39%), Positives = 130/242 (53%), Gaps = 3/242 (1%)
 Frame = -3

Query: 730 PSIASSIAFDFVS-DHPXXXXSQFTDYDNNVLQMLRCPD-NDILLFFPSGDNWDRIGFVK 557
           PS +S+IA   +S  +P    ++   Y  N LQ L  P  N +L+FFP+G N DRIGF+ 
Sbjct: 72  PSGSSAIAAASLSVPNPPDDTAKVLSY--NRLQFLPFPSKNSVLVFFPTGTNLDRIGFLL 129

Query: 556 LSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLARTL 377
           LS+      QV+    G  F     L SRIL I V  V+    D + +SV +G++L  +L
Sbjct: 130 LSTGDSGGLQVLGSDEGDVFVATERLFSRILKILVQPVSTFAADDSSSSVELGYVLVYSL 189

Query: 376 YSVHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLENGELYLF 200
           YS+HWF V   N   S  KPVL  +G KQF    V SA W+PH+  ESLVLLENGE++LF
Sbjct: 190 YSIHWFCV---NYDESQGKPVLRNLGCKQFKMCMVVSAAWSPHITGESLVLLENGEVFLF 246

Query: 199 DLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIFV 20
           D+      ++   RL+G+++ VSWE                     WL C+F W   I+V
Sbjct: 247 DV------NQRLSRLRGSKLKVSWE-----------GQGKSVNRRSWLGCDFGWTFGIYV 289

Query: 19  VA 14
           VA
Sbjct: 290 VA 291


>ref|XP_007026747.1| TATA box-binding protein-associated factor RNA polymerase I subunit
           C, putative [Theobroma cacao]
           gi|508715352|gb|EOY07249.1| TATA box-binding
           protein-associated factor RNA polymerase I subunit C,
           putative [Theobroma cacao]
          Length = 910

 Score =  132 bits (333), Expect = 1e-28
 Identities = 88/240 (36%), Positives = 125/240 (52%), Gaps = 2/240 (0%)
 Frame = -3

Query: 727 SIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDNDI-LLFFPSGDNWDRIGFVKLS 551
           S +SSIA  F  +      +  +   +N L +L CPD +I ++FF +G N DRIGF  + 
Sbjct: 75  SASSSIASRFGLESFYDDAASSSFLSHNRLHLLHCPDQNIAVVFFTTGANHDRIGFFAVH 134

Query: 550 SLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLARTLYS 371
            ++ +  + + DR G    + +    +IL I V  V D  F+       VG+L+A TLYS
Sbjct: 135 -VQDNDFKFLGDRDGDILISHNHCNHKILRILVSPVDDDDFEENSGDSVVGYLMACTLYS 193

Query: 370 VHWFRVKVSNLGSSVKKPVLEYMGSKQF-CASVASACWNPHLIEESLVLLENGELYLFDL 194
           VHW+ VK      S K P L+Y+G K F  +S+ SAC++PHL +ES+VLLENG L+ FDL
Sbjct: 194 VHWYSVKFVK---SSKSPALDYLGCKLFKSSSIVSACFSPHLPQESMVLLENGALFFFDL 250

Query: 193 GSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIFVVA 14
            S           KG ++ V W                     +WL  EFSWHP+I +VA
Sbjct: 251 ESDVNCQIPNAYFKGNKLRVLWNDSSGSENY------------KWLGVEFSWHPRILIVA 298


>ref|XP_002885248.1| hypothetical protein ARALYDRAFT_479330 [Arabidopsis lyrata subsp.
           lyrata] gi|297331088|gb|EFH61507.1| hypothetical protein
           ARALYDRAFT_479330 [Arabidopsis lyrata subsp. lyrata]
          Length = 856

 Score =  129 bits (324), Expect = 1e-27
 Identities = 89/241 (36%), Positives = 123/241 (51%), Gaps = 2/241 (0%)
 Frame = -3

Query: 730 PSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPD-NDILLFFPSGDNWDRIGFVKL 554
           PS +S+I   F   +P    ++   Y  N LQ L  P  N +L+FFP+G N D+IGF+ L
Sbjct: 64  PSDSSAIHSSFNILNPHDDTARVLSY--NRLQFLPFPSKNSVLVFFPTGTNLDQIGFLLL 121

Query: 553 SSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLARTLY 374
           S+      QV     G  F     L  RIL I V  V+D G     +S  +G++L   LY
Sbjct: 122 STGDSGGLQVTGSDEGDVFVATERLFYRILKILVQPVSDFGAYKCSSSGELGYVLVYCLY 181

Query: 373 SVHWFRVKVSNLGSSVKKPVLEYMGSKQFCA-SVASACWNPHLIEESLVLLENGELYLFD 197
           S+HW+ VK      S  KPVL  +GSKQF    + SA W+PH+  E L+LL+NGE+++FD
Sbjct: 182 SIHWYCVKYD---ESQGKPVLRNLGSKQFKRFMIVSASWSPHVTGECLLLLDNGEVFVFD 238

Query: 196 LGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIFVV 17
           L      ++   RL+G ++ VSWE                     WL CEF W   I++V
Sbjct: 239 L------NQRHCRLRGCKLKVSWESQGKSVNK------------SWLGCEFGWRVGIYIV 280

Query: 16  A 14
           A
Sbjct: 281 A 281


>ref|NP_188460.1| uncharacterized protein [Arabidopsis thaliana]
           gi|11994094|dbj|BAB01097.1| unnamed protein product
           [Arabidopsis thaliana] gi|332642560|gb|AEE76081.1|
           uncharacterized protein AT3G18310 [Arabidopsis thaliana]
          Length = 873

 Score =  128 bits (322), Expect = 2e-27
 Identities = 88/242 (36%), Positives = 123/242 (50%), Gaps = 3/242 (1%)
 Frame = -3

Query: 730 PSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPD-NDILLFFPSGDNWDRIGFVKL 554
           PS +S+I   F   +P     +   Y  N LQ L  P  N +L+FFP+G N D+IGF+ L
Sbjct: 77  PSDSSAINSSFKISNPHDDTVRVLSY--NRLQFLPFPSKNSVLVFFPTGTNLDQIGFLLL 134

Query: 553 SSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLG-FDVTGNSVAVGFLLARTL 377
           S       QV     G  F     L SRIL I V  V+D G +  + +S  +G++L  +L
Sbjct: 135 SYGDSGGLQVTGSDEGDVFVATERLFSRILKILVQPVSDFGAYKCSSSSGELGYVLVYSL 194

Query: 376 YSVHWFRVKVSNLGSSVKKPVLEYMGSKQFCASV-ASACWNPHLIEESLVLLENGELYLF 200
           YS+HW+ VK      S  KPVL  +G KQF   V  SA W+PH+  E L+LL+NGE+++F
Sbjct: 195 YSIHWYCVKYD---ESQGKPVLRNLGCKQFKRFVIVSASWSPHVTGECLLLLDNGEVFVF 251

Query: 199 DLGSCSRADKFPLRLKGTRVPVSWEXXXXXXXXXXXXXXXGARLGEWLKCEFSWHPQIFV 20
           DL       +   R++G ++ VSWE                     WL CEF W   +++
Sbjct: 252 DL------SQRHCRVRGCKLKVSWESQGKSVNK------------SWLGCEFGWRVGVYI 293

Query: 19  VA 14
           VA
Sbjct: 294 VA 295


>ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260775 [Solanum
           lycopersicum]
          Length = 907

 Score =  126 bits (317), Expect = 7e-27
 Identities = 91/260 (35%), Positives = 136/260 (52%), Gaps = 19/260 (7%)
 Frame = -3

Query: 736 LLPSIASSIAFDF---VSDHPXXXXSQFTDYDNNVLQMLRCPD-------NDILLFFPSG 587
           +L S ASSIA +F   VSD         T ++ N +Q L  P+       N I+   P+G
Sbjct: 85  MLFSTASSIATEFSPQVSD---------TIHNFNSIQFLPLPNFGENSKPNSIIGISPTG 135

Query: 586 DNWDRIGFVKLSSLKGSKPQVVADR--SGQEFRTA-HCLGSRILDISVISVADLGFDVTG 416
           +N+D++G   L S      Q VA +  +G       H L  RIL + V  V+++    + 
Sbjct: 136 ENYDQVGLFMLCS---EDTQFVAKKFKNGTSILVHNHKLNFRILRLLVNPVSEIDDSCSS 192

Query: 415 NSVAVGFLLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGSKQ---FCASVAS-ACWNPHL 248
           + +  G+LL  TLYSVHW+ VK+   G   +  +L+Y+GS     F   + S ACW+PHL
Sbjct: 193 SCITFGYLLVCTLYSVHWYSVKIGVKGD--ENVMLDYVGSADRNLFKGGIVSHACWSPHL 250

Query: 247 IEESLVLLENGELYLFDLGSCSRADKFPLR--LKGTRVPVSWEXXXXXXXXXXXXXXXGA 74
            EE +V+L+NGE++LFD+GSC ++  F     L+G ++ V W+                 
Sbjct: 251 REECVVMLKNGEMFLFDMGSCGKSQAFCASDVLQGKKLQVLWDKLD-------------- 296

Query: 73  RLGEWLKCEFSWHPQIFVVA 14
           R   W+ CEFSWHP+I +VA
Sbjct: 297 RDEHWVTCEFSWHPRILIVA 316


>ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cucumis sativus]
          Length = 862

 Score =  121 bits (304), Expect = 2e-25
 Identities = 86/209 (41%), Positives = 119/209 (56%), Gaps = 5/209 (2%)
 Frame = -3

Query: 736 LLPSIASSIAFDFVSDHPXXXXSQFTDYDNNVLQMLRCPDND-ILLFFPSGDNWDRIGFV 560
           ++PS +SS+A  F              Y  N LQ L CP++  +++FFP+G N D +GF+
Sbjct: 69  VVPSTSSSVASLFGEQQCYSDPPSVLRY--NRLQCLPCPNSSSVVVFFPTGPNSDHVGFL 126

Query: 559 KLSSLKGSKPQVVADRSGQEFRTAHCLGSRILDISVISVADLGFDVTGNSVAVGFLLART 380
            +SS  GS   V +D S   F     L  +I  I+V    + GF V  +   +GFLLA T
Sbjct: 127 VVSS-NGSGLDVQSDCSNDVFSVESELNYQIFGIAVNP--NSGF-VDDSYEDIGFLLAYT 182

Query: 379 LYSVHWFRVKVSNLGSSVKKPV-LEYMGSKQF-CASVASACWNPHLIEESLVLLENGELY 206
           +YSV WF VK   +GSS +  V L +MGSK F   SV  ACWNPHL EES+VLLE+G L+
Sbjct: 183 MYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPHLSEESVVLLEDGSLF 242

Query: 205 LFDLGSCSRADKF--PLRLKGTRVPVSWE 125
           LFD+    +   +   + LKG ++ VSW+
Sbjct: 243 LFDMEPLLKTKDYNANVNLKGIKLKVSWD 271


>ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [Amborella trichopoda]
           gi|548846551|gb|ERN05827.1| hypothetical protein
           AMTR_s00006p00260920 [Amborella trichopoda]
          Length = 929

 Score =  109 bits (273), Expect = 9e-22
 Identities = 73/213 (34%), Positives = 100/213 (46%), Gaps = 3/213 (1%)
 Frame = -3

Query: 646 NVLQMLRCPDNDILLFFPSGDNWDRIGFV--KLSSLKGSKPQVVADRSGQEFRTAHCLGS 473
           N L +L C + + LL FPSG+N DR+  V  +     G    +V D     F  +    +
Sbjct: 104 NPLHLLTCRNGEFLLLFPSGENSDRLACVVGRRERDNGGGFSLVKD---SVFLLSPSFKN 160

Query: 472 RILDISVISVADLGFDV-TGNSVAVGFLLARTLYSVHWFRVKVSNLGSSVKKPVLEYMGS 296
           RI+ +SVIS AD        +    GF+L  + Y VHW RV V N       P+ + + S
Sbjct: 161 RIIRVSVISTADCASSSEVCDQFTEGFVLLCSHYEVHWLRVGVRN-----STPLSQNLAS 215

Query: 295 KQFCASVASACWNPHLIEESLVLLENGELYLFDLGSCSRADKFPLRLKGTRVPVSWEXXX 116
             F   VA ACW+P+L EES VLL NGEL L+DL  C      P++ KG  V  +     
Sbjct: 216 ATFKNQVAHACWSPYLPEESAVLLVNGELRLYDLNYCVGVKNLPVKFKGELVSKNLGSLI 275

Query: 115 XXXXXXXXXXXXGARLGEWLKCEFSWHPQIFVV 17
                            +W  CEF WHP++ +V
Sbjct: 276 SRESD-----------NDWFCCEFGWHPRVLIV 297

