BLASTX nr result

ID: Rauwolfia21_contig00011233 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00011233
         (2125 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267544.1| PREDICTED: uncharacterized protein LOC100254...   463   e-127
ref|XP_004512681.1| PREDICTED: uncharacterized protein LOC101496...   448   e-123
ref|XP_003549166.1| PREDICTED: uncharacterized protein LOC100814...   445   e-122
gb|ESW24513.1| hypothetical protein PHAVU_004G137100g [Phaseolus...   443   e-121
ref|XP_003619781.1| hypothetical protein MTR_6g068920 [Medicago ...   437   e-119
ref|XP_006351538.1| PREDICTED: UPF0415 protein C7orf25 homolog [...   434   e-118
ref|XP_004234327.1| PREDICTED: UPF0415 protein C7orf25 homolog [...   432   e-118
gb|EOY09850.1| Uncharacterized protein isoform 1 [Theobroma cacao]    431   e-118
ref|XP_004147991.1| PREDICTED: uncharacterized protein LOC101214...   418   e-114
gb|EMJ11151.1| hypothetical protein PRUPE_ppa004801mg [Prunus pe...   410   e-111
ref|XP_002305036.2| hypothetical protein POPTR_0004s06580g [Popu...   409   e-111
ref|XP_006478143.1| PREDICTED: uncharacterized protein LOC102624...   401   e-109
ref|XP_002519590.1| conserved hypothetical protein [Ricinus comm...   390   e-105
ref|XP_004299451.1| PREDICTED: uncharacterized protein LOC101308...   382   e-103
ref|NP_565063.1| uncharacterized protein [Arabidopsis thaliana] ...   361   6e-97
ref|XP_006848656.1| hypothetical protein AMTR_s00171p00062290 [A...   358   7e-96
ref|XP_002887480.1| hypothetical protein ARALYDRAFT_895197 [Arab...   356   3e-95
ref|XP_006302279.1| hypothetical protein CARUB_v10020322mg [Caps...   345   6e-92
ref|XP_006390563.1| hypothetical protein EUTSA_v10018586mg [Eutr...   339   2e-90
emb|CBI26409.3| unnamed protein product [Vitis vinifera]              308   5e-81

>ref|XP_002267544.1| PREDICTED: uncharacterized protein LOC100254610 [Vitis vinifera]
          Length = 457

 Score =  463 bits (1191), Expect = e-127
 Identities = 244/406 (60%), Positives = 303/406 (74%), Gaps = 3/406 (0%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKAD-SAVTNIYVDIVCCLNGKP 178
            VNI HLE+V+HIL+QPFITGVSRV K  PLS +     K+D  A   +Y+DIVC LN  P
Sbjct: 58   VNISHLEAVVHILEQPFITGVSRVCKLFPLSPTIGNGEKSDCGAAKGVYLDIVCTLNRNP 117

Query: 179  VWFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHK 358
            VWFIVSDRNPKY+SW   S NKGL+ RIQ VL AA+SS TL+P+S+ILFFS GLD+ + +
Sbjct: 118  VWFIVSDRNPKYVSWDECSGNKGLRTRIQQVLDAARSSLTLKPSSVILFFSNGLDQCICE 177

Query: 359  KLQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKS-SVSAISTPR 535
            KLQ EFGA +  + FP     FL+E E +WIN+ AR YR AC+LEI++   S S +    
Sbjct: 178  KLQGEFGAYECAVEFPDCSFDFLEEPESEWINVFARSYRGACILEIKVDHVSPSVLVYDV 237

Query: 536  KERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGM-ICCVPDTEIGLLKELPSGHMQLVNF 712
            K+         +  KH+  ++ LG SF SL+ GM  CC+    +  L     G   L+NF
Sbjct: 238  KDSPPDAVGTQIPEKHI--DISLGASFSSLILGMKFCCLHAEGVETLL----GQDDLINF 291

Query: 713  DTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIA 892
            DTTAL+A+VSGISNGG +KLLAA E+E+R RFKGNY FV+AQV SE  +PIHVEL G+ +
Sbjct: 292  DTTALIAVVSGISNGGTEKLLAAPETEMRLRFKGNYKFVIAQVLSEIQNPIHVELSGLTS 351

Query: 893  GKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMK 1072
            GK G+IC++V++EF+ELVS+CGGPNEKLRA+ +L  L VVPD PS+R+M LPTTRKLA+K
Sbjct: 352  GKRGIICETVHSEFKELVSMCGGPNEKLRADQLLKCLMVVPDSPSARMMGLPTTRKLALK 411

Query: 1073 NKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            NKVVFGTGD++HAPTLTANMAFVRA+SQTGMSLFTIEHRPRAL G+
Sbjct: 412  NKVVFGTGDYWHAPTLTANMAFVRAISQTGMSLFTIEHRPRALTGN 457


>ref|XP_004512681.1| PREDICTED: uncharacterized protein LOC101496834 [Cicer arietinum]
          Length = 509

 Score =  448 bits (1153), Expect = e-123
 Identities = 240/444 (54%), Positives = 300/444 (67%), Gaps = 41/444 (9%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGHLE+V+HILQQPFI+GVSRV K +PLS S     + DSA+ +I+VD+VC LNG PV
Sbjct: 66   VNIGHLEAVVHILQQPFISGVSRVCKSIPLSPSVTREDRQDSALKDIHVDVVCILNGMPV 125

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDRNP+YISW     +KGLK RIQ VL AA+S+ TLRP+S+I+FF+ G+   V+ K
Sbjct: 126  WIIVSDRNPQYISWSECHKSKGLKLRIQQVLAAAKSNLTLRPSSVIIFFANGIATHVYDK 185

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSVSAI------ 523
            L DEFGA+++ L FP F    L+E+E  W+N++AR  R+ACVLEI I    + +      
Sbjct: 186  LLDEFGASEIRLEFPVFSSKMLEETEGDWVNVIARSCRDACVLEINIVDDKNVVPNLECN 245

Query: 524  -------STPRK--------------------------ERAVTQSSNPLSLKHVAVNVKL 604
                   S+P K                          ER++ ++             K 
Sbjct: 246  VESSTVDSSPVKFSVGKADETRLHCLEENAINRGSSQIERSIDKAETRPQRSQEEFETKF 305

Query: 605  GDSFCSLVSGMICCVPDTEIGLLKELPS--GHMQLVNFDTTALVAIVSGISNGGIDKLLA 778
            GD+FCS++ GM     D E     E     G   LVNFDTTAL+A VSGISNGG +KLLA
Sbjct: 306  GDTFCSVIMGMKLSSLDDENSESTEPRKLLGGSDLVNFDTTALIAFVSGISNGGTEKLLA 365

Query: 779  AGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIAGKGGVICQSVYAEFQELVSICG 958
              E+ELR RFKGN+DFV+ QV SE  +PIHVE   V+ GK G+IC+SV +EF+ELV +CG
Sbjct: 366  TPETELRQRFKGNFDFVIGQVMSELQNPIHVEFGRVLCGKHGIICESVLSEFKELVLMCG 425

Query: 959  GPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLTANMAF 1138
            GPNEKLRA+ +++ L+VVPD PS R+M LPTTRKLA+KNK+VFGTGD++HAPTLTANMAF
Sbjct: 426  GPNEKLRADKLISCLRVVPDTPSERVMGLPTTRKLALKNKIVFGTGDYWHAPTLTANMAF 485

Query: 1139 VRAVSQTGMSLFTIEHRPRALIGD 1210
             RAVSQTGMSL TIEHRPRAL GD
Sbjct: 486  ARAVSQTGMSLSTIEHRPRALTGD 509


>ref|XP_003549166.1| PREDICTED: uncharacterized protein LOC100814429 isoform X1 [Glycine
            max] gi|571530136|ref|XP_006599676.1| PREDICTED:
            uncharacterized protein LOC100814429 isoform X2 [Glycine
            max] gi|571530141|ref|XP_006599677.1| PREDICTED:
            uncharacterized protein LOC100814429 isoform X3 [Glycine
            max]
          Length = 507

 Score =  445 bits (1144), Expect = e-122
 Identities = 240/449 (53%), Positives = 298/449 (66%), Gaps = 46/449 (10%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGHLE+V+HILQQPF+TGVSRV KP+PLS S     +  S + NI+VD+VC LN KPV
Sbjct: 64   VNIGHLETVVHILQQPFVTGVSRVCKPIPLSPSVSSEERRHSPLNNIHVDVVCTLNKKPV 123

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDRNP+YISW     +KGLK RI++VL AAQS+ TLRP+S+ILFF+ GL   ++ K
Sbjct: 124  WIIVSDRNPEYISWDRCHKSKGLKLRIEEVLAAAQSNLTLRPSSVILFFANGLPTQIYNK 183

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEI--------------- 496
            L+DEFG +++ L F  F    L+E+E  WIN++AR YR ACVLEI               
Sbjct: 184  LRDEFGGSEIWLDFSVFSSDMLEETEGDWINVIARSYRNACVLEINPADGKDVEGTKSGC 243

Query: 497  ----------EIKSSVSAISTPRK--------------ERAVTQSSNPLSLKHVAVNVKL 604
                      +++ SV    T  +              E +V ++          V + L
Sbjct: 244  SVQGSTVDSSQLEPSVGKAETQPQLVDENARSGDSFHLELSVDEAETQAQPTEENVRINL 303

Query: 605  GDSFCSLVSGMI-------CCVPDTEIGLLKELPSGHMQLVNFDTTALVAIVSGISNGGI 763
            G  FCS++ GM         C       LL     G   LVNFDTTAL+A+VSGISNGG 
Sbjct: 304  GVMFCSILMGMKLSSMESKACESSNPGNLL-----GETDLVNFDTTALIALVSGISNGGT 358

Query: 764  DKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIAGKGGVICQSVYAEFQEL 943
            +KLLA  ES +R RFKGNYDFV+ Q+ SE  +PIH+E   ++ GK G+IC+SV  EF+EL
Sbjct: 359  EKLLATPESGMRQRFKGNYDFVIGQITSEIQNPIHMEFDRILRGKNGLICESVLTEFKEL 418

Query: 944  VSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLT 1123
            VS+CGGPNEKLRA+ ++N L+VVPD PS R+M LP+TRKLA+KNKVVFGTGD +HAPTLT
Sbjct: 419  VSMCGGPNEKLRADWLINCLRVVPDTPSERMMGLPSTRKLALKNKVVFGTGDHWHAPTLT 478

Query: 1124 ANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            ANMAFVRAVSQTGMSL TIEHRPRAL GD
Sbjct: 479  ANMAFVRAVSQTGMSLSTIEHRPRALTGD 507


>gb|ESW24513.1| hypothetical protein PHAVU_004G137100g [Phaseolus vulgaris]
          Length = 504

 Score =  443 bits (1139), Expect = e-121
 Identities = 241/443 (54%), Positives = 305/443 (68%), Gaps = 40/443 (9%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGHLE+V+H+LQQPFITGVSRV KP+PLS S     +   ++ +I+VD+VC LN KPV
Sbjct: 64   VNIGHLEAVVHMLQQPFITGVSRVCKPIPLSPSVSSEERC--SLKHIHVDVVCTLNRKPV 121

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDRNP+YISW     +KGLK RIQ+VL AAQS+ TLRP+S+ILFF+ GL   ++ K
Sbjct: 122  WIIVSDRNPEYISWDRCRKSKGLKLRIQEVLAAAQSNLTLRPSSVILFFANGLATHIYNK 181

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIE-------------- 499
            L+DE GA++++L F  F    L+E+E  WIN++AR YR +CVLEI               
Sbjct: 182  LRDELGASEIKLDFSVFSSDVLEETEGDWINVIARSYRNSCVLEINPAVGRDVVPKSGCS 241

Query: 500  -----IKSSVSAISTPRKERAVTQ-SSNP----------------LSLKHVAVNV--KLG 607
                 + SS   +S  + E    Q   NP                +  + V  NV   LG
Sbjct: 242  VRGSAVDSSQIDLSVGKTETQPQQFEENPRIGECFHLELLVDEAKIQPRPVEENVGTNLG 301

Query: 608  DSFCSLVSGMICCVPDTEI--GLLKELPSGHMQLVNFDTTALVAIVSGISNGGIDKLLAA 781
            D+FCS++ GM     + +I   +      G + LVNFDTTAL+A+VSGISNGG  KLLA 
Sbjct: 302  DTFCSILMGMKPSSMENKIFESMKSRNLLGEIDLVNFDTTALIALVSGISNGGTKKLLAT 361

Query: 782  GESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIAGKGGVICQSVYAEFQELVSICGG 961
             ESE+R RFKGN+DFV+ Q+ SE  +PIH+E   ++ GK G+IC+SV  EF+ELVS+CGG
Sbjct: 362  PESEIRQRFKGNFDFVIGQIMSEIQNPIHIEFGRILHGKNGLICESVLVEFKELVSMCGG 421

Query: 962  PNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLTANMAFV 1141
            PNEKLRA+ ++++L+VVPD PS R+M LPTTRKLA+KNKVVFGTGD +HAPTLTANMAFV
Sbjct: 422  PNEKLRADRLIDYLRVVPDTPSERMMGLPTTRKLALKNKVVFGTGDHWHAPTLTANMAFV 481

Query: 1142 RAVSQTGMSLFTIEHRPRALIGD 1210
            RAVSQTGMSL TIEHRPRAL GD
Sbjct: 482  RAVSQTGMSLSTIEHRPRALTGD 504


>ref|XP_003619781.1| hypothetical protein MTR_6g068920 [Medicago truncatula]
            gi|355494796|gb|AES75999.1| hypothetical protein
            MTR_6g068920 [Medicago truncatula]
          Length = 511

 Score =  437 bits (1123), Expect = e-119
 Identities = 238/448 (53%), Positives = 305/448 (68%), Gaps = 45/448 (10%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGHLE+V+HILQQPFI+GVSRV K +PLS S     +  S++ +I+VD+VC LNGKPV
Sbjct: 69   VNIGHLEAVVHILQQPFISGVSRVCKSIPLSPSVSREERHSSSLKDIHVDVVCILNGKPV 128

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDRNPKYISW+    +KGLK RIQ VL AA+S+ TLRP+S+I+FF+ G+   V+ K
Sbjct: 129  WIIVSDRNPKYISWNECHKSKGLKLRIQQVLAAAKSNLTLRPSSVIIFFANGISSNVYDK 188

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSV--------- 514
            L+DEFGA++++L F  F    L+E+E  WIN++AR YR+A VLEI +             
Sbjct: 189  LRDEFGASEIQLEFSVFSSNMLEETECDWINVIARSYRDARVLEINVAGDKDVFLNSGCS 248

Query: 515  ---SAISTPRKERAVTQSSNPLSLK--------------------------HVAVNVKLG 607
               S++++ + E +V +    L L                              +  KLG
Sbjct: 249  VEGSSVNSSQVEFSVEKPETRLHLLDENTINGGSSQLECSIDKAETRPQLIQEGIETKLG 308

Query: 608  DSFCSLVSGM-ICCVPDTEI------GLLKELPSGHMQLVNFDTTALVAIVSGISNGGID 766
            D+FCS+++ M +  + D+         LL E       LVNFDTTAL+A VSGISNGG +
Sbjct: 309  DTFCSVITRMKLSSLDDSNYESTGPTNLLDE-----SDLVNFDTTALIAFVSGISNGGTE 363

Query: 767  KLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIAGKGGVICQSVYAEFQELV 946
            KLLA  E ELR RFKGN+DFV+ Q+ SE  +PIHVE   V+ GK G+IC+SV +EF+ELV
Sbjct: 364  KLLATPEIELRQRFKGNFDFVIGQIMSELQNPIHVEFGKVLCGKLGIICESVLSEFKELV 423

Query: 947  SICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLTA 1126
             +CGGPNEKLRA+ ++N L+VV D PS R+M LPTTRKLA+KNKVVFGTGD++ APTLTA
Sbjct: 424  LMCGGPNEKLRADKLINCLRVVSDTPSERMMGLPTTRKLALKNKVVFGTGDYYRAPTLTA 483

Query: 1127 NMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            NMAFVRAVSQTGMSL +IEHRPRAL GD
Sbjct: 484  NMAFVRAVSQTGMSLSSIEHRPRALTGD 511


>ref|XP_006351538.1| PREDICTED: UPF0415 protein C7orf25 homolog [Solanum tuberosum]
          Length = 457

 Score =  434 bits (1115), Expect = e-118
 Identities = 235/413 (56%), Positives = 294/413 (71%), Gaps = 11/413 (2%)
 Frame = +2

Query: 5    NIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPVW 184
            NIG+LE+V+HILQ P +T VSRV KP+ +SS             ++Y+D++C  NG P+W
Sbjct: 65   NIGYLEAVVHILQHPLVTAVSRVCKPISISSK-----------LSVYIDVICSFNGNPIW 113

Query: 185  FIVSDRNPKYISWHGSSAN---KGLKKRIQDVLQAA-QSSDTLRPTSIILFFSRGLDRIV 352
            FIVSDRNP+YISW  S      KGL+ +I +++ AA +SS T+RP+SIILFFS GL   +
Sbjct: 114  FIVSDRNPRYISWEDSEKIRNCKGLRSKIVELMFAASESSVTVRPSSIILFFSNGLQSCI 173

Query: 353  HKKLQDEFGAADLELAFPYFDCIFLDESEDQ-WINILARCYREACVLEIEIKSSVSAIST 529
             + L+ EFGA DL   F  FDC F DE ED+  +++L R +  AC+LEI++ S  S+   
Sbjct: 174  LEDLRGEFGATDLGFGFCDFDCEFYDELEDEDSVSVLGRSFERACILEIKVGSFSSSRDV 233

Query: 530  PRKER---AVTQSSNPLSLKH---VAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSG 691
              + +    +T  S  L   H    + +V LGDSFC+LVS +      +  GL  E    
Sbjct: 234  KLQGKDGETLTDLSGSLGKLHSDDASKDVNLGDSFCALVSAL-----RSWSGLDVE---- 284

Query: 692  HMQLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHV 871
              +LVNFDTTALVAIVSGISNGGID++LA  ESELRSRFK NY+F++ QVNSE   PIH+
Sbjct: 285  EAELVNFDTTALVAIVSGISNGGIDRILATPESELRSRFKVNYEFMIGQVNSEIKKPIHM 344

Query: 872  ELIGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPT 1051
            EL+  I  K GV+C+SV +EFQELVS+CGGP EK RA H LNHL+VVPDCPS R+MSLPT
Sbjct: 345  ELMPSILQKRGVVCESVCSEFQELVSMCGGPKEKSRAEHFLNHLRVVPDCPSERLMSLPT 404

Query: 1052 TRKLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            TRKLA+KNKV FGTGD++HAPT+TANMAF RAVSQTGMSLFTIEHRPRALIGD
Sbjct: 405  TRKLALKNKVAFGTGDYWHAPTITANMAFARAVSQTGMSLFTIEHRPRALIGD 457


>ref|XP_004234327.1| PREDICTED: UPF0415 protein C7orf25 homolog [Solanum lycopersicum]
          Length = 461

 Score =  432 bits (1112), Expect = e-118
 Identities = 238/421 (56%), Positives = 294/421 (69%), Gaps = 19/421 (4%)
 Frame = +2

Query: 5    NIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPVW 184
            NIG+LE+V+HILQ P +T VSRV K +  S S KL         ++Y+D++C  NG PVW
Sbjct: 62   NIGYLEAVVHILQNPLVTAVSRVCKTI--SVSLKL---------SVYIDVICSFNGNPVW 110

Query: 185  FIVSDRNPKYISWHGSSAN---KGLKKRIQD-VLQAAQSSDTLRPTSIILFFSRGLDRIV 352
            FIVSDRNP+YISW  S      KGL+ +I + V  A++SS T+RP+SIILFFS GL   +
Sbjct: 111  FIVSDRNPRYISWEDSGEIRNCKGLRSKIVELVFAASESSVTVRPSSIILFFSNGLQSCI 170

Query: 353  HKKLQDEFGAADLELAFPYFDCIFLDESEDQ-WINILARCYREACVLEIEI-----KSSV 514
              KL+ EFGA DL   F  FDC F DE ED+ W+++L R +  AC+LEI+I      S++
Sbjct: 171  LDKLRGEFGATDLGFGFSDFDCEFYDELEDEDWVSVLGRSFERACILEIKIGSFSSSSAI 230

Query: 515  SAISTPRKERA--------VTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVP-DTEIG 667
            S  ST  K +         ++ SS  +     + +V LGDSFC+LVS +      D E  
Sbjct: 231  SDSSTDVKLQGKDGETLTDLSGSSGKMHSDDASNDVNLGDSFCTLVSALRSWSGFDVE-- 288

Query: 668  LLKELPSGHMQLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNS 847
                      +LVNFDTTALVAIVSGISNG ID++LA  ESELRSRFK NY+F++ QVNS
Sbjct: 289  --------EAELVNFDTTALVAIVSGISNGSIDRILATPESELRSRFKVNYEFMIGQVNS 340

Query: 848  ETLSPIHVELIGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPS 1027
            E   PIH+EL+  I  K G++C+SV AEFQELVS+CGGPNEK RA H LNHL+VVPDCPS
Sbjct: 341  EMKKPIHMELMPSILQKRGIVCESVCAEFQELVSMCGGPNEKSRAEHFLNHLRVVPDCPS 400

Query: 1028 SRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIG 1207
             R+MSLPTTRKLA+KNKV FGTGD++HAPT+TANMAF RAVSQTGMSL TIEHRPRAL+G
Sbjct: 401  ERLMSLPTTRKLALKNKVAFGTGDYWHAPTITANMAFARAVSQTGMSLVTIEHRPRALVG 460

Query: 1208 D 1210
            D
Sbjct: 461  D 461


>gb|EOY09850.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 463

 Score =  431 bits (1108), Expect = e-118
 Identities = 232/408 (56%), Positives = 288/408 (70%), Gaps = 5/408 (1%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGHLE+++HILQQPFIT VSRV KP+PL  S      + S+   I+V IVC LN  PV
Sbjct: 65   VNIGHLEAIVHILQQPFITAVSRVCKPLPLPFSNTNKNDSSSSSNPIHVHIVCTLNKNPV 124

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDRNP YISW+ S   KG K RIQ VL AAQS++TLRP SI+LFFS GL   +H+K
Sbjct: 125  WIIVSDRNPNYISWYPSKKTKGFKSRIQQVLDAAQSTNTLRPFSIVLFFSNGLTNFIHQK 184

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSVSAISTPR-- 535
            LQDEFGA+ L L F  FD  F +E E +WIN++ R Y+EAC+LEI++   V  +++    
Sbjct: 185  LQDEFGASKLALEFSDFD--FCEEFEGEWINVIPRSYKEACILEIKVDRVVDDVASSEHR 242

Query: 536  -KERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLK--ELPSGHMQLV 706
             K+  V          +  +N+ LG+SF +LVS M       ++G  K  + P G    V
Sbjct: 243  TKDPLVNVLPPECQGGNAYLNLGLGNSFSALVSQM------KKVGSTKVEDFP-GEDDFV 295

Query: 707  NFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGV 886
            NFDTTAL+A+VSGISNG  ++LL   E ELR RFKGNY+FV+AQ  SE  +PIH  L   
Sbjct: 296  NFDTTALIALVSGISNGCAEELLNKPEVELRHRFKGNYEFVIAQAMSEIQNPIHGGLSAA 355

Query: 887  IAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLA 1066
            IAGK G+IC+SV +EF+ELV +CGG NEK RA+ +L  L +V D PS R+M LPTTRKLA
Sbjct: 356  IAGKRGIICESVLSEFKELVLMCGGANEKSRADQLLKCLLIVRDSPSERLMGLPTTRKLA 415

Query: 1067 MKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            +KNK+VFGTGD++HAPTLTANMAFVRAV+QT MSLFTIEHRPRAL G+
Sbjct: 416  LKNKIVFGTGDYWHAPTLTANMAFVRAVAQTAMSLFTIEHRPRALTGN 463


>ref|XP_004147991.1| PREDICTED: uncharacterized protein LOC101214095 [Cucumis sativus]
            gi|449494348|ref|XP_004159521.1| PREDICTED:
            uncharacterized LOC101214095 [Cucumis sativus]
          Length = 458

 Score =  418 bits (1074), Expect = e-114
 Identities = 215/408 (52%), Positives = 281/408 (68%), Gaps = 5/408 (1%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            +NIGHLE+++HILQ P +TG+SRV KP+P SSS +           +YVDI+C LN  PV
Sbjct: 62   LNIGHLEAIVHILQHPSVTGISRVCKPIPSSSSSQA----------VYVDIICTLNRNPV 111

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDR P+YISW+    +KGLK R+++V+ AA+S   L P SIILFFS GLD+ + ++
Sbjct: 112  WVIVSDRKPRYISWYKGHRSKGLKSRLEEVIDAARSLHALEPCSIILFFSHGLDQFILER 171

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSVSAISTPRKE 541
            L+DEF A +    F  FD  F  E +  WIN+L R Y EACVLEI++      +++    
Sbjct: 172  LRDEFKATEFHFNFSDFDFAF-SEIDGDWINVLPRSYEEACVLEIKVNDRNCGVTSSNYN 230

Query: 542  RAVTQSS-NPLSLKHVAVNVKLGDSFCSLVSGM----ICCVPDTEIGLLKELPSGHMQLV 706
              V  S  +   + +    +  GDSFCS+V  M    +  + D E    ++L  G   L+
Sbjct: 231  SKVCSSGVDEPEILNNNTEIDFGDSFCSVVMAMKPNPMNGIEDMESANFEKLLGGDSDLI 290

Query: 707  NFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGV 886
            NFDTTAL+A+VSGISNG   KLL+  E+ELR ++K NYDFV+ Q  SE   PI VEL  +
Sbjct: 291  NFDTTALIALVSGISNGCAAKLLSIPENELRQKYKSNYDFVIGQAMSEIKKPILVELSSL 350

Query: 887  IAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLA 1066
            ++GK G+ICQS ++EF+EL+++CGGPNEK RANH+L H+ VV D  S R+  LPTTRKLA
Sbjct: 351  LSGKRGIICQSAHSEFKELITMCGGPNEKSRANHLLKHIMVVLDMVSKRMTCLPTTRKLA 410

Query: 1067 MKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            +KNKVVFGTGD+++APTLTANM+FVRAVSQTGMSLFT EHRPRAL GD
Sbjct: 411  LKNKVVFGTGDYWNAPTLTANMSFVRAVSQTGMSLFTFEHRPRALTGD 458


>gb|EMJ11151.1| hypothetical protein PRUPE_ppa004801mg [Prunus persica]
          Length = 491

 Score =  410 bits (1054), Expect = e-111
 Identities = 236/427 (55%), Positives = 295/427 (69%), Gaps = 24/427 (5%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSS--SQKLLRKADSAVTNIYVDIVCCLNGK 175
            VNIGHLE+V+H+LQQPFITGVSRV KP+PLS+       +K D  +  I+VDIVC L+  
Sbjct: 70   VNIGHLEAVVHLLQQPFITGVSRVCKPIPLSTLAPHPHGQKTDPCL-KIHVDIVCTLHRN 128

Query: 176  PVWFIVSDRNPKYISWHGSSA--------NKGLKKRIQDVLQAAQSSDTLRPTSIILFFS 331
            PVW IVSDRNPKYISW GSS         +KGLK RIQ V  AA+S+  L+P+S+ILFFS
Sbjct: 129  PVWIIVSDRNPKYISWSGSSCGSPYKRDKSKGLKLRIQQVTAAARSAVALKPSSVILFFS 188

Query: 332  R--GLDRIVHKKLQDEFGAADLELAFPYFDCIF-LDESEDQWINIL-ARCYREACVLEIE 499
               GL  IV  KL+DEFGA + +L FP  D  F L +   +W N+L AR Y+EAC  EI+
Sbjct: 189  NRNGLSSIVCDKLKDEFGATEFQLDFPVLDFNFDLSKEAGEWTNVLVARTYQEACAFEIK 248

Query: 500  IKSSVSAISTPRK--------ERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPD 655
            +  + + + +           E A T+     S +H     +   +F +L+S M      
Sbjct: 249  VSDTRNTVLSSESDVKDSSLGEAADTEKDPSDSTEHT----EFCRAFSNLISRMEFYSLY 304

Query: 656  TEIGLLKELPS--GHMQLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFV 829
             + G   ++ S  G  +L+NFDTTAL+A+VSGISNGG  KLLA  ESELR RFKGNY+FV
Sbjct: 305  LKNGESAQVGSLLGQSELINFDTTALIALVSGISNGGTPKLLATPESELRQRFKGNYEFV 364

Query: 830  LAQVNSETLSPIHVELIGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKV 1009
            + QV SE  +PI VE    I+GK  +IC+SV +EF+ELV +CGGPNEKLRA+ +LN L V
Sbjct: 365  IGQVMSEIQNPILVEFGRTISGKRVIICESVRSEFKELVLMCGGPNEKLRASQLLNCLTV 424

Query: 1010 VPDCPSSRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHR 1189
            VPD PS R+M+LPTTRKLA+KNKVVFGTGD++ APT+TANMAFVRA+SQTGMSLFTIEHR
Sbjct: 425  VPDSPSERMMNLPTTRKLALKNKVVFGTGDYWCAPTVTANMAFVRAISQTGMSLFTIEHR 484

Query: 1190 PRALIGD 1210
            PRAL GD
Sbjct: 485  PRALTGD 491


>ref|XP_002305036.2| hypothetical protein POPTR_0004s06580g [Populus trichocarpa]
            gi|550340460|gb|EEE85547.2| hypothetical protein
            POPTR_0004s06580g [Populus trichocarpa]
          Length = 474

 Score =  409 bits (1052), Expect = e-111
 Identities = 228/418 (54%), Positives = 289/418 (69%), Gaps = 15/418 (3%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTN-IYVDIVCCLNGKP 178
            VNIGHLE+VIHILQQP ITGVSRV KP+P S   +  +K +S   N ++VDIVC LN  P
Sbjct: 61   VNIGHLEAVIHILQQPCITGVSRVCKPIPSSLPNR--KKIESPTKNAVHVDIVCTLNKNP 118

Query: 179  VWFIVSDRNPKYISW-HGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVH 355
            VW IVSDRNP+Y+SW     ++KGLK R++ VL AAQS+  ++P SI+LFFS G+   V+
Sbjct: 119  VWIIVSDRNPRYVSWFRDGKSSKGLKFRLEQVLGAAQSTQIMKPCSIVLFFSHGISDFVN 178

Query: 356  KKLQDEFGAADLELAFPYFDCIFLDESED-QWINILA--RCYREACVLEIEI---KSSVS 517
            +KL++EFGA  L L F  FD    +E E  +WIN++A  R ++EACV EI++   K +  
Sbjct: 179  EKLREEFGAWQLGLEFALFDFDLCEELEGGEWINVVANARSFQEACVFEIKVGGTKENTV 238

Query: 518  AISTPRKERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGHM 697
              S    ER+++ +   L +        L D F SL+S M   +   ++  +  +  G  
Sbjct: 239  LGSKYGVERSLSLNPTGLEMMEEVTKENLDDGFDSLISEMKLSL--MKVKSVDVVGPGDF 296

Query: 698  -------QLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETL 856
                     +N DTTAL+AIVSGISNG  +KLLA  E ELR RFKGNY+FV+ QV SE  
Sbjct: 297  IGDDDGDDFINLDTTALIAIVSGISNGCTEKLLATPEDELRKRFKGNYEFVIVQVKSEIQ 356

Query: 857  SPIHVELIGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRI 1036
            +PI  E+ GVI GK G+IC+SV +EF++LVS+CGGPNEKLRA+ +L  L VVPD PS R+
Sbjct: 357  NPILAEMAGVIQGKRGIICESVLSEFKQLVSMCGGPNEKLRADKILKCLMVVPDSPSERM 416

Query: 1037 MSLPTTRKLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            M +PTTRKLA+KNKVVFGTGD + APTLTANMAFVRAVSQTGM LFTIEHRPRAL GD
Sbjct: 417  MGVPTTRKLALKNKVVFGTGDHWRAPTLTANMAFVRAVSQTGMPLFTIEHRPRALTGD 474


>ref|XP_006478143.1| PREDICTED: uncharacterized protein LOC102624608 isoform X1 [Citrus
            sinensis]
          Length = 439

 Score =  401 bits (1031), Expect = e-109
 Identities = 225/406 (55%), Positives = 284/406 (69%), Gaps = 4/406 (0%)
 Frame = +2

Query: 5    NIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPVW 184
            NIGHLESV++ILQQPFITGVSRV K +             +   +++VDI+C L   P+W
Sbjct: 58   NIGHLESVVYILQQPFITGVSRVCKSIK------------NGFKSVHVDIICTLYKTPLW 105

Query: 185  FIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKKL 364
             IVSDRNP+Y+SW GS   KGLK R+++VL AA SS +L+P S++LFFS G+   V+ +L
Sbjct: 106  IIVSDRNPRYVSWDGSDKGKGLKLRVEEVLAAAVSSPSLKPCSVVLFFSNGVGEFVNDRL 165

Query: 365  QDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSVSAISTPRKER 544
              E+GAA+  L+    D  F +E ED W+N+L+R +R+AC+LEI++    SA S+   E 
Sbjct: 166  ISEYGAAEFNLS----DFEFFEEVEDGWVNVLSRLFRDACMLEIKLNFCGSAASS--SEC 219

Query: 545  AVTQSSNPL---SLKHVAVNVKLGDSFCSLVSGM-ICCVPDTEIGLLKELPSGHMQLVNF 712
             V +S   +   +L+     V LG +F SL+S M  CC    E+ L   L  G+  L+NF
Sbjct: 220  GVKRSIGDVPGFTLQEKNKKVSLGGAFGSLLSQMKFCC----EVELDHLLNEGN--LINF 273

Query: 713  DTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIA 892
            DTTAL+A+VSGISNG  +K LA  + ELR RFKGN  FV+AQ  SE  +PI  EL GVIA
Sbjct: 274  DTTALIALVSGISNGCAEKFLATPDIELRQRFKGNTQFVIAQALSEIQTPIDKELGGVIA 333

Query: 893  GKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMK 1072
            GK G+IC+SV +EF+ELVS+CGGPNEKLRA+ +L  L VV D PS RI+ LPTTRKLA+K
Sbjct: 334  GKKGIICESVLSEFKELVSMCGGPNEKLRADELLMCLMVVHDSPSVRIVGLPTTRKLALK 393

Query: 1073 NKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            NK VFGTGD + APTLTANMAFVRAVSQTGMSL TIEHRPRAL GD
Sbjct: 394  NKAVFGTGDHWRAPTLTANMAFVRAVSQTGMSLCTIEHRPRALTGD 439


>ref|XP_002519590.1| conserved hypothetical protein [Ricinus communis]
            gi|223541248|gb|EEF42801.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 425

 Score =  390 bits (1001), Expect = e-105
 Identities = 217/403 (53%), Positives = 272/403 (67%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGHLE+VIH+L+ PF++GVSRV K +  + S K           I+VD+VC  N  PV
Sbjct: 67   VNIGHLEAVIHLLEHPFVSGVSRVCKSIKTTHSSK----------TIHVDVVCIFNKNPV 116

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
            W IVSDRNPKYISWH        K RI+ +L  A+SS  ++PTSI++FF+RGLD  V +K
Sbjct: 117  WIIVSDRNPKYISWHDC-----FKLRIERLLAEARSSQIIKPTSILVFFARGLDDFVFEK 171

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSVSAISTPRKE 541
            L+ EFGA ++EL F         + ED WIN+    Y+++  +EI++  + S+ +    E
Sbjct: 172  LKYEFGAFEIELGF---------DLEDGWINVTDTPYQDSMFIEIKVDGTTSSRNAVL-E 221

Query: 542  RAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGHMQLVNFDTT 721
             A  +  + L L+         DSF SL+SG                      LVNFDTT
Sbjct: 222  CAFVEKFDGLELQEEDT---ADDSFTSLISGF----------------RYDGDLVNFDTT 262

Query: 722  ALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIAGKG 901
            AL+AIVSGISNG  +KLLAA E +LR RFKGN++FV+ QV SE  +PIHVE+  +I GKG
Sbjct: 263  ALIAIVSGISNGCREKLLAAPEIQLRQRFKGNFEFVVGQVLSEIQNPIHVEMADIIHGKG 322

Query: 902  GVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMKNKV 1081
            G+IC+SV +EF+ELVS+CGGPNEKLRA+ +L  L VVPD PS R+M LPTTRKLA+KNKV
Sbjct: 323  GIICESVLSEFKELVSLCGGPNEKLRADKILKSLMVVPDSPSERMMCLPTTRKLALKNKV 382

Query: 1082 VFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            VFGTGD + APTLTANMAFVRAVSQTGMSL TIEHRPRAL GD
Sbjct: 383  VFGTGDHWRAPTLTANMAFVRAVSQTGMSLLTIEHRPRALTGD 425


>ref|XP_004299451.1| PREDICTED: uncharacterized protein LOC101308575 [Fragaria vesca
            subsp. vesca]
          Length = 449

 Score =  382 bits (982), Expect = e-103
 Identities = 213/411 (51%), Positives = 275/411 (66%), Gaps = 8/411 (1%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGH+E+V+HIL  P ITGVSRV KP+P+           ++  N +VDIVC LN  PV
Sbjct: 56   VNIGHIETVVHILHHPLITGVSRVCKPIPMCHK--------TSSQNGHVDIVCTLNRNPV 107

Query: 182  WFIVSDRNPKYISWHGSSANK--GLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVH 355
            W IVSDRNP YI+W+    NK  GL+ R+Q +  AA S+  LRP+S+ILFFS GL   + 
Sbjct: 108  WIIVSDRNPNYITWNEPQTNKTKGLQCRVQQLTAAAASAVALRPSSVILFFSHGLSSFLS 167

Query: 356  KKLQDEFGAADLELAFPYFDCIFLDESEDQWINIL-ARCYREACVLEIEIKSSVSAISTP 532
             KL+ EF A  ++L  P F    ++E  D WI++L AR Y+EA V EI++          
Sbjct: 168  DKLKHEFEATQVQLCHPGFRFDLVEEEGD-WIDVLVARTYQEAAVFEIKVGDV------- 219

Query: 533  RKERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTE-----IGLLKELPSGHM 697
             K+  V  S + + +  +A N  L +    L  G    V   +     +  ++    G  
Sbjct: 220  -KDDDVLSSVSDVKVSPMAANWDLLEDTTLLFPGFYSAVSKMQLFSFDVENMETAKRGEC 278

Query: 698  QLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVEL 877
             +VNFDTTAL+A+VS ISNGG +KLLA  ESELR RFKGNY+FV+ QV SE  +PI V+L
Sbjct: 279  DVVNFDTTALIALVSAISNGGTEKLLATPESELRQRFKGNYEFVIGQVMSEIQNPILVKL 338

Query: 878  IGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTR 1057
               I+GK G+ C++V++EF+ELVS+ GGPNEKLRA+H+L +L+VVPD PS R+MSLPTTR
Sbjct: 339  GSAISGKRGITCETVHSEFKELVSMYGGPNEKLRASHLLKYLRVVPDSPSKRMMSLPTTR 398

Query: 1058 KLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            KLA+K K+VFGTGD++ APT TANMAFVRAVSQ GMSL TIEHRPRAL GD
Sbjct: 399  KLALKCKIVFGTGDYWCAPTATANMAFVRAVSQAGMSLCTIEHRPRALTGD 449


>ref|NP_565063.1| uncharacterized protein [Arabidopsis thaliana]
            gi|11120791|gb|AAG30971.1|AC012396_7 hypothetical protein
            [Arabidopsis thaliana] gi|14334538|gb|AAK59677.1| unknown
            protein [Arabidopsis thaliana] gi|21436329|gb|AAM51334.1|
            unknown protein [Arabidopsis thaliana]
            gi|332197331|gb|AEE35452.1| uncharacterized protein
            AT1G73380 [Arabidopsis thaliana]
          Length = 434

 Score =  361 bits (927), Expect = 6e-97
 Identities = 203/408 (49%), Positives = 267/408 (65%), Gaps = 5/408 (1%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGH+ESV+ ILQ P ITGVSRV KP+PL             +  ++VD+VC L   PV
Sbjct: 62   VNIGHIESVVRILQLPSITGVSRVCKPIPLP------------IGGVHVDLVCTLGKVPV 109

Query: 182  WFIVSDRNPKYISWHGSS-ANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHK 358
            W IVSDRNP+YISW+G    +KGL+ RI+ +L AA S+ TL+P+S+ILFF+ GL   V++
Sbjct: 110  WIIVSDRNPRYISWNGDRHGSKGLRSRIEQILAAANSTTTLKPSSVILFFANGLPSSVYE 169

Query: 359  KLQDEFGAADLELAFPY---FDCIFLDESEDQWINIL-ARCYREACVLEIEIKSSVSAIS 526
            KL+DEFGA   +  F      D   LD+ + +W+N++  R Y+EA  +EI++     +++
Sbjct: 170  KLKDEFGAVYFDFGFDSDSDSDISMLDDFDCEWVNVVRTRSYKEAVSIEIKLIDQCDSLA 229

Query: 527  TPRKERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGHMQLV 706
            +P  E  V      LS K         D+F +++S M     D               L+
Sbjct: 230  SPETEVLVQAEVTELSQK---------DAFSTVISSMRLLGEDC--------------LI 266

Query: 707  NFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGV 886
            NFDTTALVA+VSGISNG  ++L+   E EL  +FKGN  FV+AQ  SE   P  V++  V
Sbjct: 267  NFDTTALVALVSGISNGCAERLVDMPEIELEEKFKGNTVFVIAQARSEIEKPGLVKVGTV 326

Query: 887  IAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLA 1066
            ++GK G++C+SV++EF+ELVS+  GPNEKLRA  +L  L VV D PS R+MSLPTTRKLA
Sbjct: 327  LSGKRGIVCKSVFSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLPTTRKLA 386

Query: 1067 MKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            MKNK VFGTGD + APTLTANMAFVRAV+Q+GMSL TI+H PRAL GD
Sbjct: 387  MKNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTIDHSPRALTGD 434


>ref|XP_006848656.1| hypothetical protein AMTR_s00171p00062290 [Amborella trichopoda]
            gi|548852007|gb|ERN10237.1| hypothetical protein
            AMTR_s00171p00062290 [Amborella trichopoda]
          Length = 480

 Score =  358 bits (918), Expect = 7e-96
 Identities = 200/429 (46%), Positives = 280/429 (65%), Gaps = 28/429 (6%)
 Frame = +2

Query: 5    NIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPVW 184
            NIG+LES+++ILQQPFI  ++RV K VPLSS      KA+     +++DI+C  +G PVW
Sbjct: 58   NIGYLESIVYILQQPFINSITRVCKSVPLSSLHGKKFKANPK--GVHIDIICTYHGNPVW 115

Query: 185  FIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKKL 364
            FIVSDRNPKYISW  S  +KGL+ R+  V+QAA SS TL+P  +I FF+ GLD +V +KL
Sbjct: 116  FIVSDRNPKYISWAYSHRSKGLRSRLHSVIQAANSSLTLQPAFVIFFFANGLDEVVPQKL 175

Query: 365  QDEFGAADLELAFPYFDCIFLDESEDQWINIL-------------ARCYREACVLEIEI- 502
             DE+ A ++   F +F+    +E ED+W+NI              +R Y+ A V +  + 
Sbjct: 176  IDEYKALEIGKEFNHFEVSIFEELEDEWVNITFNRKTFDGDLKDSSRQYQGARVFQFAVN 235

Query: 503  ---KSSVSAISTPRKERAVTQSSNPLSLKHVAVNVKLGD--------SFCSLVSGMICCV 649
               K      +    E +VT SS+  +         LG+        +FCSL+S M   +
Sbjct: 236  CLEKDDEGRRACVHSESSVTMSSSLGNTMMGFHKNLLGNDPEVLGNAAFCSLISTMQSSL 295

Query: 650  PD---TEIGLLKELPSGHMQLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNY 820
             D   TE+G+     +    ++NFDTTAL+A+VSGIS+GGI+++L A E ++R RFK N+
Sbjct: 296  LDGDGTEMGV-----TVGENMINFDTTALIALVSGISSGGIEQILKAPEDDMRKRFKSNF 350

Query: 821  DFVLAQVNSETLSPIHVELIGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNH 1000
             FV+AQV SE  +PI  EL  +I+ +  +IC+SV +EF+EL+++CGGP EK+RA+ +L  
Sbjct: 351  AFVMAQVKSEIENPILEELGCLISCRKVIICESVCSEFKELIAMCGGPGEKMRADRLLQC 410

Query: 1001 LKVVPDCPSSRIMSLPTTRKLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTI 1180
            L V+ D PS+R++ LPTTRK+ +KNK++FGTGD + APTLTANM FVRAV QTGMSL T+
Sbjct: 411  LVVIKDNPSARVVGLPTTRKIGLKNKIIFGTGDQWRAPTLTANMGFVRAVFQTGMSLMTL 470

Query: 1181 EHRPRALIG 1207
            EHRPRAL G
Sbjct: 471  EHRPRALTG 479


>ref|XP_002887480.1| hypothetical protein ARALYDRAFT_895197 [Arabidopsis lyrata subsp.
            lyrata] gi|297333321|gb|EFH63739.1| hypothetical protein
            ARALYDRAFT_895197 [Arabidopsis lyrata subsp. lyrata]
          Length = 433

 Score =  356 bits (913), Expect = 3e-95
 Identities = 202/411 (49%), Positives = 266/411 (64%), Gaps = 8/411 (1%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGH+ESV+ ILQ P +TGVSRV KP+PL             +  ++VD+VC L   PV
Sbjct: 62   VNIGHIESVVRILQLPSVTGVSRVCKPIPLP------------IGGVHVDLVCTLGKVPV 109

Query: 182  WFIVSDRNPKYISWHGSS-ANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHK 358
            W IVSDRNP+YISW G    +KGL+ RI+ +L AA S+ TL+P+S+ILFF+ GL   +++
Sbjct: 110  WIIVSDRNPRYISWSGDRHGSKGLRSRIEQILAAANSTTTLKPSSVILFFANGLPCSIYE 169

Query: 359  KLQDEFGAADLELAFPYF------DCIFLDESEDQWINIL-ARCYREACVLEIEIKSSVS 517
            KL+DEFGAA     F +F      D   LD+ + +W+N++  R Y+EA  +EI++     
Sbjct: 170  KLKDEFGAAH----FDFFGLDSDSDISMLDDFDCEWVNVVRTRSYKEAVSVEIKLIDQCD 225

Query: 518  AISTPRKERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGHM 697
            ++++P  E  V +    LS K         D F S++S M     D              
Sbjct: 226  SLASPETEVLVQEDVTELSQK---------DVFSSVISSMRLLGEDC------------- 263

Query: 698  QLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVEL 877
             L+NFDTTALVA+VSGISNG  ++++   E EL  +FKGN  FV+AQ  SE   P  V++
Sbjct: 264  -LINFDTTALVALVSGISNGCAERIVHTPEIELEEKFKGNTVFVIAQARSEIEKPGLVKM 322

Query: 878  IGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTR 1057
              V++GK G++C+SV +EF+ELVS+  GPNEKLRA  +L  L VV D PS R+MSLPTTR
Sbjct: 323  GSVLSGKRGIVCKSVLSEFKELVSMYAGPNEKLRAEQLLKSLMVVNDNPSERVMSLPTTR 382

Query: 1058 KLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            KLAMKNK VFGTGD + APTLTANMAFVRAV+Q+GMSL T +H PRAL GD
Sbjct: 383  KLAMKNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTNDHSPRALTGD 433


>ref|XP_006302279.1| hypothetical protein CARUB_v10020322mg [Capsella rubella]
            gi|482570989|gb|EOA35177.1| hypothetical protein
            CARUB_v10020322mg [Capsella rubella]
          Length = 431

 Score =  345 bits (884), Expect = 6e-92
 Identities = 199/407 (48%), Positives = 261/407 (64%), Gaps = 4/407 (0%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGH+ESV+ ILQ P ITGVSRV KP+PL             +  ++VD+VC L   PV
Sbjct: 62   VNIGHIESVVRILQLPSITGVSRVCKPIPLP------------IGGVHVDLVCTLGKVPV 109

Query: 182  WFIVSDRNPKYISWHGSS-ANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHK 358
            W IVSDRNP+YISW G    +KGL  RI+ +L AA SS TL+P+S+ILFF+ GL  ++++
Sbjct: 110  WIIVSDRNPRYISWSGDRHGSKGLSLRIEQILAAAHSSTTLKPSSVILFFANGLPSLIYE 169

Query: 359  KLQDEFGAADLELAFPY-FDCIFLDES-EDQWINIL-ARCYREACVLEIEIKSSVSAIST 529
            KL+DEFGA          F+C   DE+ E +W++++  R Y+EA  +EI++        +
Sbjct: 170  KLRDEFGAVYFNFDVGSDFEC---DETVEGEWVHVVRTRSYKEAVSVEIKLIDHQCDSPS 226

Query: 530  PRKERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGHMQLVN 709
               E  V      LS K VA        F S++S M     D                +N
Sbjct: 227  TEPEVVVQAEVPELSQKEVA--------FSSVISSMRLLGEDC--------------FIN 264

Query: 710  FDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVI 889
            FDTTALVA+VSGISNG  ++++   E+EL  +FKGN  FV+AQ  SE  +P+ V++  V+
Sbjct: 265  FDTTALVALVSGISNGCAERIVDMPETELEEKFKGNTVFVIAQARSEIENPVLVKMRTVV 324

Query: 890  AGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAM 1069
            +GK G++C+SV +EF+ELVS+  GPNEK RA  +L  L VV D P+ R+ SLPTTRKLAM
Sbjct: 325  SGKRGIVCESVLSEFKELVSMYAGPNEKRRAEQLLKSLMVVNDSPTDRVKSLPTTRKLAM 384

Query: 1070 KNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            KNK VFGTGD + APTLTANMAFVRAV+Q+GMSL TI+H PRAL GD
Sbjct: 385  KNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTIDHSPRALTGD 431


>ref|XP_006390563.1| hypothetical protein EUTSA_v10018586mg [Eutrema salsugineum]
            gi|557086997|gb|ESQ27849.1| hypothetical protein
            EUTSA_v10018586mg [Eutrema salsugineum]
          Length = 433

 Score =  339 bits (870), Expect = 2e-90
 Identities = 194/412 (47%), Positives = 260/412 (63%), Gaps = 9/412 (2%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNIGH+ESV+ ILQ P ITGVSRV KP+PL             + +++VD+VC L   PV
Sbjct: 62   VNIGHIESVVRILQLPSITGVSRVCKPLPLP------------IGSVHVDLVCTLGKAPV 109

Query: 182  WFIVSDRNPKYISWHGSS-ANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHK 358
            W IVSDRNP+YISW+G     KGL+ RI+ VL AA S+ TL+P+S+ILFF+ GL   +++
Sbjct: 110  WIIVSDRNPRYISWNGDRHGGKGLRSRIEHVLAAAHSTTTLKPSSLILFFANGLPSSIYE 169

Query: 359  KLQDEFGAADLELAFPY------FDCIFLDESEDQWINIL-ARCYREACVLEIE-IKSSV 514
            KL++EFGA   +L          FDC   +  E  W+ ++ +R Y+EA  +EI+ I    
Sbjct: 170  KLKEEFGAVSFDLGLDSDTSMLDFDCE--ETMEGDWVVVVGSRSYKEAISVEIKLIIDEC 227

Query: 515  SAISTPRKERAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGH 694
             +++ P  E  V               V   D F  ++S +                 G 
Sbjct: 228  DSLAFPEPEVVVE------------AEVSQNDGFSRVISSLRL--------------QGE 261

Query: 695  MQLVNFDTTALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVE 874
              L+NFDTTALVA+VSGI+NG  ++++   + EL  +FKGN  FV+AQ  SE   P+ V+
Sbjct: 262  DCLINFDTTALVALVSGITNGCAERIVDMPQIELEQKFKGNTVFVIAQAESEIEKPVLVK 321

Query: 875  LIGVIAGKGGVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTT 1054
            +  +++GK G++C+SV +EF+ELVS+  GPNEK RA H+L  L VV D P+ R+M LPTT
Sbjct: 322  MATLLSGKRGIVCKSVLSEFKELVSMYAGPNEKHRAEHLLKSLMVVNDNPTERVMGLPTT 381

Query: 1055 RKLAMKNKVVFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            RKLAMKNK VFGTGD + APTLTANMAFVRAV+Q+GMSL TI+H PRAL GD
Sbjct: 382  RKLAMKNKTVFGTGDRWGAPTLTANMAFVRAVAQSGMSLSTIDHSPRALTGD 433


>emb|CBI26409.3| unnamed protein product [Vitis vinifera]
          Length = 332

 Score =  308 bits (790), Expect = 5e-81
 Identities = 188/403 (46%), Positives = 234/403 (58%)
 Frame = +2

Query: 2    VNIGHLESVIHILQQPFITGVSRVSKPVPLSSSQKLLRKADSAVTNIYVDIVCCLNGKPV 181
            VNI HLE+V+HIL+QPFITG                            V  VC L     
Sbjct: 58   VNISHLEAVVHILEQPFITG----------------------------VSRVCKLT---- 85

Query: 182  WFIVSDRNPKYISWHGSSANKGLKKRIQDVLQAAQSSDTLRPTSIILFFSRGLDRIVHKK 361
                                     RIQ VL AA+SS TL+P+S+ILFFS GLD+ + +K
Sbjct: 86   -------------------------RIQQVLDAARSSLTLKPSSVILFFSNGLDQCICEK 120

Query: 362  LQDEFGAADLELAFPYFDCIFLDESEDQWINILARCYREACVLEIEIKSSVSAISTPRKE 541
            LQ EFGA +                           YR AC+LEI++             
Sbjct: 121  LQGEFGAYE--------------------------SYRGACILEIKV------------- 141

Query: 542  RAVTQSSNPLSLKHVAVNVKLGDSFCSLVSGMICCVPDTEIGLLKELPSGHMQLVNFDTT 721
                   + +  KH+ +         SLV  ++                G   L+NFDTT
Sbjct: 142  -------DHIPEKHIDI---------SLVETLL----------------GQDDLINFDTT 169

Query: 722  ALVAIVSGISNGGIDKLLAAGESELRSRFKGNYDFVLAQVNSETLSPIHVELIGVIAGKG 901
            AL+A+VSGISNGG +KLLAA E+E+R RFKGNY FV+AQV SE  +PIHVEL G+ +GK 
Sbjct: 170  ALIAVVSGISNGGTEKLLAAPETEMRLRFKGNYKFVIAQVLSEIQNPIHVELSGLTSGKR 229

Query: 902  GVICQSVYAEFQELVSICGGPNEKLRANHVLNHLKVVPDCPSSRIMSLPTTRKLAMKNKV 1081
            G+IC++V++EF+ELVS+CGGPNEKLRA+ +L  L VVPD PS+R+M LPTTRKLA+KNKV
Sbjct: 230  GIICETVHSEFKELVSMCGGPNEKLRADQLLKCLMVVPDSPSARMMGLPTTRKLALKNKV 289

Query: 1082 VFGTGDFFHAPTLTANMAFVRAVSQTGMSLFTIEHRPRALIGD 1210
            VFGTGD++HAPTLTANMAFVRA+SQTGMSLFTIEHRPRAL G+
Sbjct: 290  VFGTGDYWHAPTLTANMAFVRAISQTGMSLFTIEHRPRALTGN 332


Top