BLASTX nr result

ID: Catharanthus22_contig00030119 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00030119
         (1014 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277810.1| PREDICTED: uncharacterized protein LOC100254...   206   2e-50
emb|CAN61665.1| hypothetical protein VITISV_037831 [Vitis vinifera]   204   3e-50
ref|XP_006338562.1| PREDICTED: uncharacterized protein LOC102592...   199   1e-48
ref|XP_004232296.1| PREDICTED: uncharacterized protein LOC101250...   191   4e-46
ref|XP_006345057.1| PREDICTED: uncharacterized protein LOC102596...   184   4e-44
emb|CAN65788.1| hypothetical protein VITISV_037591 [Vitis vinifera]   178   3e-42
ref|XP_003631260.1| PREDICTED: uncharacterized protein LOC100852...   174   4e-41
gb|EOY02405.1| Uncharacterized protein TCM_016889 [Theobroma cacao]   174   6e-41
ref|XP_002524645.1| conserved hypothetical protein [Ricinus comm...   173   8e-41
emb|CAN80034.1| hypothetical protein VITISV_019834 [Vitis vinifera]   172   2e-40
ref|XP_006376855.1| hypothetical protein POPTR_0012s08240g [Popu...   171   5e-40
ref|XP_004236105.1| PREDICTED: uncharacterized protein LOC101245...   169   1e-39
ref|XP_002282787.1| PREDICTED: uncharacterized protein LOC100243...   167   5e-39
ref|XP_006579758.1| PREDICTED: uncharacterized protein LOC102667...   166   2e-38
ref|XP_006374520.1| hypothetical protein POPTR_0015s08750g [Popu...   166   2e-38
ref|XP_006603809.1| PREDICTED: uncharacterized protein LOC102669...   164   4e-38
gb|EMJ16960.1| hypothetical protein PRUPE_ppa009500mg [Prunus pe...   164   7e-38
gb|EOY20727.1| Uncharacterized protein TCM_012072 [Theobroma cacao]   163   1e-37
ref|XP_006341323.1| PREDICTED: uncharacterized protein LOC102604...   157   6e-36
ref|XP_006439821.1| hypothetical protein CICLE_v10021025mg [Citr...   157   6e-36

>ref|XP_002277810.1| PREDICTED: uncharacterized protein LOC100254241 [Vitis vinifera]
          Length = 254

 Score =  206 bits (523), Expect = 2e-50
 Identities = 114/257 (44%), Positives = 159/257 (61%), Gaps = 3/257 (1%)
 Frame = -2

Query: 896 CPTFNSYSSDKLA-IAARVTDEMRTESNTAGEKQNDDDDFEFSLVREDPEIYYTGQIQPI 720
           CP+FNSYSSD+LA +AA+V +E       + +++ D+D+FEF+ V    E+    QI P+
Sbjct: 11  CPSFNSYSSDRLADVAAKVGEE----ECASDDRRVDEDNFEFTFVSTG-EVLRETQIGPV 65

Query: 719 FPIFNRDLLFDDSEGNNNNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXENV 540
           FP+FNRDLL +D              G++  +R PLK LF                 E +
Sbjct: 66  FPVFNRDLLVED--------------GEAKTLRFPLKKLFMEERDGMSSSSSDADELEGI 111

Query: 539 PPGTYCVWRPKTNSESPNRCKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTP-KVDH 363
           P GTYCVW+PK   ++P RCKKS+STGSAS+RW+F DLLRRSNS+GKDS+VFLTP K + 
Sbjct: 112 PEGTYCVWQPKKVPDTPERCKKSSSTGSASKRWRFRDLLRRSNSEGKDSFVFLTPSKREE 171

Query: 362 REDKSDKIERSKVVLKRSPGRSN-XXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSY 186
           + + +DK + SK        ++N                 HE FY++NRA+KEGDK++S+
Sbjct: 172 KAEITDKSDHSKETRNSVHAKTNVAGKAKPKVISGEKASPHEIFYMKNRAMKEGDKRQSF 231

Query: 185 LPYRQDLVGFFASVNSL 135
           LPYR+DLVGFFA+V+ +
Sbjct: 232 LPYRRDLVGFFANVSRM 248


>emb|CAN61665.1| hypothetical protein VITISV_037831 [Vitis vinifera]
          Length = 254

 Score =  204 bits (520), Expect = 3e-50
 Identities = 114/257 (44%), Positives = 158/257 (61%), Gaps = 3/257 (1%)
 Frame = -2

Query: 896 CPTFNSYSSDKLA-IAARVTDEMRTESNTAGEKQNDDDDFEFSLVREDPEIYYTGQIQPI 720
           CP+FNSYSSD+LA +AA+V +E       + +++ D+D+FEF+ V    E+    QI P+
Sbjct: 11  CPSFNSYSSDRLADVAAKVGEE----ECASDDRRVDEDNFEFTFVSTG-EVLRETQIGPV 65

Query: 719 FPIFNRDLLFDDSEGNNNNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXENV 540
           FP+FNRDLL +D              G++  +R PLK LF                 E +
Sbjct: 66  FPVFNRDLLVED--------------GEAKTLRFPLKKLFMEERDGMSSSSSDADELEGI 111

Query: 539 PPGTYCVWRPKTNSESPNRCKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTP-KVDH 363
           P GTYCVW+PK    +P RCKKS+STGSAS+RW+F DLLRRSNS+GKDS+VFLTP K + 
Sbjct: 112 PEGTYCVWQPKKVPXTPERCKKSSSTGSASKRWRFRDLLRRSNSEGKDSFVFLTPSKREE 171

Query: 362 REDKSDKIERSKVVLKRSPGRSN-XXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSY 186
           + + +DK + SK        ++N                 HE FY++NRA+KEGDK++S+
Sbjct: 172 KAEITDKSDHSKETRNSVHAKTNVAGKAKPKVISGEKASPHEIFYMKNRAMKEGDKRQSF 231

Query: 185 LPYRQDLVGFFASVNSL 135
           LPYR+DLVGFFA+V+ +
Sbjct: 232 LPYRRDLVGFFANVSRM 248


>ref|XP_006338562.1| PREDICTED: uncharacterized protein LOC102592047 [Solanum tuberosum]
          Length = 276

 Score =  199 bits (507), Expect = 1e-48
 Identities = 124/281 (44%), Positives = 167/281 (59%), Gaps = 21/281 (7%)
 Frame = -2

Query: 914 QEELCVCPTFNSYSSDKLA-IAARVTDEMRTESNTAGEKQND---DDDFEFSLVREDPE- 750
           QEE    P+F+SYSS+++A IA +++DE++ +S    E  +    D+DFEFSLV E+PE 
Sbjct: 2   QEEEYFSPSFSSYSSNRVAEIAGKISDEIKCDSELIEENVDGAEGDEDFEFSLVCENPED 61

Query: 749 ----IYYTGQIQPIFPIFNRDLLFDD------SEGNNNNNKQSSDEGQSSEIRIPLKNLF 600
                 +  QIQPI+P+FNRDLL +D       EG N      S E  +S +++ LK+LF
Sbjct: 62  SVGVFPFDRQIQPIYPVFNRDLLLNDVSYDVDREGVNGE----SSENVNSSVQVSLKDLF 117

Query: 599 XXXXXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESPNRCKKSNSTGSASRRWKFLDLLR 420
                            E+VPPGTYCVW+P     SP+RCKKSNSTGSA +RW   DL+R
Sbjct: 118 LEEREPLSSSSSEVDELESVPPGTYCVWKPNITEPSPSRCKKSNSTGSAFKRWNIRDLMR 177

Query: 419 RSNSDGKDSYVFLTPKVDHREDKS---DKIERSKVVLK-RSPGRSNXXXXXXXXXXXXXX 252
           RSNSDGKDS+VFLTP+   + + S   D  E SKV  K ++ G S+              
Sbjct: 178 RSNSDGKDSFVFLTPEKGLKSETSKAKDSAEASKVAGKLKAKGNSSSGNKASSMTD---- 233

Query: 251 XAHEAFYVRNRAIKEGDK--KKSYLPYRQDLVGFFASVNSL 135
                 Y+RN+A KE DK  +KSYLPYR+DLVGFFA+ +++
Sbjct: 234 -----VYLRNQAAKEMDKNRRKSYLPYRRDLVGFFANASNI 269


>ref|XP_004232296.1| PREDICTED: uncharacterized protein LOC101250946 [Solanum
           lycopersicum]
          Length = 273

 Score =  191 bits (485), Expect = 4e-46
 Identities = 119/278 (42%), Positives = 165/278 (59%), Gaps = 18/278 (6%)
 Frame = -2

Query: 914 QEELCVCPTFNSYSSDKLA-IAARVTDEMRTESNTAGEK---QNDDDDFEFSLVREDPE- 750
           QE+    P+F+SYSS+++A IA +++DE++ +S    E     + D+DFEFSLV E+PE 
Sbjct: 2   QEDEFFSPSFSSYSSNRVAEIAGKISDEIKRDSQVVEENVDGADGDEDFEFSLVCENPED 61

Query: 749 -IYYTGQIQPIFPIFNRDLLFDD------SEGNNNNNKQSSDEGQSSEIRIPLKNLFXXX 591
            +      +PI+P+FNRDL+ +D       EG N      S E  +S +++ LK+LF   
Sbjct: 62  SVGVFPFDRPIYPVFNRDLIPNDVSYGVDREGVNGE----SSENVNSSVQVSLKDLFLEE 117

Query: 590 XXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESPNRCKKSNSTGSASRRWKFLDLLRRSN 411
                         E+VPPGTYCVW+P     SP+RCKKSNSTGSA +RW   DL+RRSN
Sbjct: 118 REPLSSSSSEVDELESVPPGTYCVWKPNITEPSPSRCKKSNSTGSAFKRWNIRDLMRRSN 177

Query: 410 SDGKDSYVFLTPKVDHREDKS---DKIERSKVVLK-RSPGRSNXXXXXXXXXXXXXXXAH 243
           SDGKDS++FLTP+   R + S   D  E SK+  K ++ G S+                 
Sbjct: 178 SDGKDSFLFLTPEKGLRNETSKAKDSAEASKIAGKLKAKGNSSSGNKTSSMTD------- 230

Query: 242 EAFYVRNRAIKEGDK--KKSYLPYRQDLVGFFASVNSL 135
              Y+RN+A KE DK  +KSYLPYR+DLVGFFAS +++
Sbjct: 231 --VYLRNQAAKEMDKNRRKSYLPYRRDLVGFFASASNI 266


>ref|XP_006345057.1| PREDICTED: uncharacterized protein LOC102596863 [Solanum tuberosum]
          Length = 295

 Score =  184 bits (468), Expect = 4e-44
 Identities = 117/292 (40%), Positives = 153/292 (52%), Gaps = 41/292 (14%)
 Frame = -2

Query: 893 PTFNSYSSDKLA-IAARVTDEMRTESNT---------------------------AGEKQ 798
           P+FNSYS+  LA IA RV  E R E+ T                            G + 
Sbjct: 11  PSFNSYSNKNLAEIADRVVQEFRAENGTDEEFFVDDDGGLSCFESGRRLNEKEEEQGNED 70

Query: 797 NDDDDFEFSLVREDP-------EIYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSSDEG 639
            D+D+FEFS V++         EI+Y GQI+PI+P+FNRDLL D   G +N+N  S +  
Sbjct: 71  EDEDEFEFSFVKQSEISPVAADEIFYNGQIRPIYPLFNRDLLSDFMNGRSNSNSTSEEIS 130

Query: 638 QSS--EIRIPLKNLFXXXXXXXXXXXXXXXXXE--NVPPGTYCVWRPKTNSESPNRCKKS 471
            +    IR+PL+ LF                 +   +P GTYC WRPK   +S   CKKS
Sbjct: 131 SAGPKSIRLPLRKLFMEEEREANFSCSSSEADDLEGIPEGTYCTWRPKPEEKSAGSCKKS 190

Query: 470 NSTGSASRRWKFLDLLRRSNSDGKDSYVFLTPKVDHREDKSDK--IERSKVVLKRSPGRS 297
           NSTGS S+RWKF DLL RSNSDGKD++VFLTP    +E+K++K  I+ +  V  +S G  
Sbjct: 191 NSTGS-SKRWKFRDLLYRSNSDGKDTFVFLTPSFRKKENKAEKTTIDNAPKVTAKSKG-- 247

Query: 296 NXXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVN 141
                                YV+N     G+K+K+YLPYRQDLVGFFA+ N
Sbjct: 248 ----------IPVADGYPAMHYVKNGG---GEKRKTYLPYRQDLVGFFANTN 286


>emb|CAN65788.1| hypothetical protein VITISV_037591 [Vitis vinifera]
          Length = 268

 Score =  178 bits (452), Expect = 3e-42
 Identities = 115/275 (41%), Positives = 147/275 (53%), Gaps = 18/275 (6%)
 Frame = -2

Query: 905 LCVCPTFNSYSSDKLA-IAARVTDEM-RTESNTAGEKQND------DDDFEFSLVREDPE 750
           L + P+FNSY+S +LA IAAR+ +E+  ++ N   E + D      DDD EF+ V  +PE
Sbjct: 7   LGLSPSFNSYNSGRLAEIAARIIEELGESDGNDDEEVKEDEVDNDGDDDSEFAFVWREPE 66

Query: 749 --------IYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSSDEGQSSEIRIPLKNLFXX 594
                   I+Y GQI+P+FPIFNRDLL  + +    N + S      + IR PL+ L   
Sbjct: 67  TSPISADEIFYNGQIRPVFPIFNRDLLLXEGQ----NQEVSVKPPTPASIRRPLRKLLIE 122

Query: 593 XXXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESPNRCKKSNSTGSASRRWKFLDLLRRS 414
                          E VPPGTYCVW PK     P RC+KSNSTGS S+RWKF D L RS
Sbjct: 123 ERGTDSCSSSEADELEGVPPGTYCVWTPKVAESPPARCRKSNSTGS-SKRWKFRDFLHRS 181

Query: 413 NSDGKDSYVFLTP--KVDHREDKSDKIERSKVVLKRSPGRSNXXXXXXXXXXXXXXXAHE 240
           NSDGKD++VFLTP   V  + +K       K   K   G +                   
Sbjct: 182 NSDGKDTFVFLTPNSSVKKKAEKEAPSGAGKPKPKAVAGENATSANRKN----------- 230

Query: 239 AFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
                  A KEGD+++S+LPYRQDLVGFFA+VN L
Sbjct: 231 ----NTAAKKEGDRRRSFLPYRQDLVGFFANVNGL 261


>ref|XP_003631260.1| PREDICTED: uncharacterized protein LOC100852726 [Vitis vinifera]
          Length = 339

 Score =  174 bits (442), Expect = 4e-41
 Identities = 101/252 (40%), Positives = 142/252 (56%), Gaps = 17/252 (6%)
 Frame = -2

Query: 839 DEMRTESNTAGEKQNDDDDFEFSLVREDPE--------IYYTGQIQPIFPIFNRDLLFDD 684
           +E   E     E++ D+ DFEFS VR DP+        I++ GQI+P+FP+FNRDLLF D
Sbjct: 88  EEEEEEEEEEEEEEEDNGDFEFSFVRADPDSSPISADDIFFNGQIRPMFPLFNRDLLFAD 147

Query: 683 SEGNNNNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXEN-VPPGTYCVWRPK 507
           S   +     S   G++S +R PL+ LF                    V P TYC W  K
Sbjct: 148 SHEGD-----SEASGKASPLRPPLRKLFVEKSEHPSSSSSSESDELEGVAPETYCAWSEK 202

Query: 506 TNSESPNRCKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTP--------KVDHREDK 351
               SP  CKKS+STG  S+ W+F DL+ RSNSDGKD++VFL P        + D + ++
Sbjct: 203 AVEASPGACKKSSSTGF-SKFWRFRDLVLRSNSDGKDAFVFLDPSAAKAKTVQTDEKVER 261

Query: 350 SDKIERSKVVLKRSPGRSNXXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQ 171
           + KIE+++V +++     +               AHE+ Y++NRA++EG ++KSYLPYRQ
Sbjct: 262 AKKIEKTEVSVEKI-NSGDGKVGGGKVKGEKRVSAHESLYIKNRALREGGRRKSYLPYRQ 320

Query: 170 DLVGFFASVNSL 135
           DLVGFF +VN L
Sbjct: 321 DLVGFFTNVNGL 332


>gb|EOY02405.1| Uncharacterized protein TCM_016889 [Theobroma cacao]
          Length = 264

 Score =  174 bits (440), Expect = 6e-41
 Identities = 114/274 (41%), Positives = 149/274 (54%), Gaps = 10/274 (3%)
 Frame = -2

Query: 926 MHENQEELCVCPTFNSYSSDK--LAIAARVTDEMRTESNTAGEKQNDDDDFEFSLVREDP 753
           M  +  EL  CP+FN YS DK  + IAA+VT + +++         DD++FEF  + E+ 
Sbjct: 1   MQNDTSELSFCPSFNCYSDDKKLVDIAAKVTRDFKSDDVL------DDEEFEFFNLWEN- 53

Query: 752 EIYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXX 573
               T Q    FPIFNRDLL      N    K   D+     IRIPL++LF         
Sbjct: 54  ----TDQTSS-FPIFNRDLLL-----NGEEEKGGDDDDAEEAIRIPLRDLFIGDGDLPSS 103

Query: 572 XXXXXXXXEN-VPPGTYCVWRPKTNSES-PNRCKKSNSTGSASRRWKFLDLLRRSNSDGK 399
                      VP GTYCVW PK ++ES PNRCKKS STGS S+RW+  DLL+RSNSDGK
Sbjct: 104 SSSSEADELEGVPTGTYCVWTPKQSAESSPNRCKKSRSTGSCSKRWRLKDLLKRSNSDGK 163

Query: 398 ----DSYVFLTPKVDHREDKSDKIERSKVVLKRSPGR--SNXXXXXXXXXXXXXXXAHEA 237
                S + L   ++  ++ + K    K+  K +  +  +                AHEA
Sbjct: 164 VSSSSSSLSLPSFLNFEKNSTGKKHEEKLSEKTATTKKKAQGEVQAKKTKRVEKLSAHEA 223

Query: 236 FYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
           FYVRN+A KEGDK++SYLPYRQDLVG FA+V+ L
Sbjct: 224 FYVRNKASKEGDKRRSYLPYRQDLVGIFANVHGL 257


>ref|XP_002524645.1| conserved hypothetical protein [Ricinus communis]
           gi|223536006|gb|EEF37664.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 283

 Score =  173 bits (439), Expect = 8e-41
 Identities = 122/290 (42%), Positives = 161/290 (55%), Gaps = 40/290 (13%)
 Frame = -2

Query: 890 TFNSYSSD-KLA-IAARVTDEMR---------------------TESNTAGEKQNDDDDF 780
           +FNSYSS+ +LA IAA+VT E R                      E +++ ++  +DDDF
Sbjct: 9   SFNSYSSNHQLADIAAKVTKEERQGNEFLEEASIFTVPRHPNGGVEESSSSDEDENDDDF 68

Query: 779 EFSLVREDPEIYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSSDEGQSSEIRIPLKNLF 600
           EF LVR +P+   T    PIFP+FNRDLL D      NN ++  D G    IR+PLK LF
Sbjct: 69  EFVLVRANPDGNETAF--PIFPLFNRDLLLD----YENNKEEHHDHGDM--IRLPLKKLF 120

Query: 599 XXXXXXXXXXXXXXXXXE--NVPPGTYCVWRPK----TNSESPNRCKKSNSTGSASR--R 444
                            E   V PGTYCVW P     + S SP+RCKKSNSTGS+S+  R
Sbjct: 121 NDDRDPPSSSSSSSEADELEGVSPGTYCVWTPSKFSSSPSPSPSRCKKSNSTGSSSKQQR 180

Query: 443 WKFLDLL--RRSNSDGKDSYVFLTPKVDHREDKS-------DKIERSKVVLKRSPGRSNX 291
           W+  DLL  +RS+SDGK+S++FL    DH  + +       +KIE+SK +  +   +   
Sbjct: 181 WRLRDLLHLKRSSSDGKESFIFLN--TDHNNNNNNNNKKLEEKIEKSKSIKGKGKDK--- 235

Query: 290 XXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVN 141
                         AHE FYVR++A+KEGDK++SYLPYRQ LVGFFA+VN
Sbjct: 236 -----------IASAHEVFYVRSKALKEGDKRRSYLPYRQGLVGFFANVN 274


>emb|CAN80034.1| hypothetical protein VITISV_019834 [Vitis vinifera]
          Length = 402

 Score =  172 bits (436), Expect = 2e-40
 Identities = 102/256 (39%), Positives = 142/256 (55%), Gaps = 21/256 (8%)
 Frame = -2

Query: 839 DEMRTESNTAGEKQNDDDDFEFSLVREDPE--------IYYTGQIQPIFPIFNRDLLFDD 684
           +E   E     E++ D+ D EFS VR DP+        I++ GQI+P+FP+FNRDLLF D
Sbjct: 90  EEEEEEEEEEEEEEEDNXDXEFSFVRADPDSSPISADDIFFNGQIRPMFPLFNRDLLFAD 149

Query: 683 SEGNNNNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXEN-VPPGTYCVWRPK 507
           S   +     S   G +S +R PL+ LF                    V PGTYC W  K
Sbjct: 150 SHEGD-----SEASGXASPLRPPLRKLFVEKSEHPSSSSSSESDELEGVAPGTYCAWSEK 204

Query: 506 TNSESPNRCKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTP--------KVDHREDK 351
               SP  CKKS+STG  S+ W+F DL+ RSNSDGKD++VFL P        + D + ++
Sbjct: 205 AVEASPGACKKSSSTGF-SKFWRFRDLVLRSNSDGKDAFVFLDPSAAKAKTVQTDEKVER 263

Query: 350 SDKIERSKVVLKR---SPGR-SNXXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYL 183
           + KIE+++V +++     G+                  AHE+ Y++NRA++EG ++KSYL
Sbjct: 264 AXKIEKTEVSVEKINSGDGKVGGGKVKGKGVKGEKRVSAHESLYIKNRALREGGRRKSYL 323

Query: 182 PYRQDLVGFFASVNSL 135
           PYRQDLVGFF +VN L
Sbjct: 324 PYRQDLVGFFTNVNGL 339


>ref|XP_006376855.1| hypothetical protein POPTR_0012s08240g [Populus trichocarpa]
           gi|550326646|gb|ERP54652.1| hypothetical protein
           POPTR_0012s08240g [Populus trichocarpa]
          Length = 304

 Score =  171 bits (432), Expect = 5e-40
 Identities = 109/294 (37%), Positives = 147/294 (50%), Gaps = 41/294 (13%)
 Frame = -2

Query: 893 PTFNSYSSDKLA-IAARVTDEMRTES--------------------------NTAGEKQN 795
           P+FNSYSS+KLA IAARV  E   ES                          N   E + 
Sbjct: 12  PSFNSYSSNKLAEIAARVVQEFTNESEQVEGANNNIFSWQEQGGEEKSNHPQNDNEEGEE 71

Query: 794 DDDDFEFSLV-REDPE--------IYYTGQIQPIFPIFNRDLLFDDSEG-NNNNNKQSSD 645
           ++DDFEF+++ R +P+        I+Y GQI+P +P+FN  LL DD E    +    +S 
Sbjct: 72  EEDDFEFAVLSRPEPQFPPISADDIFYNGQIRPFYPLFNTKLLLDDQEFLPRSKTATNST 131

Query: 644 EGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESPNRCKKSNS 465
           +      R+PL+ LF                 +++ PGTYCVW PK    SP  CKKS+S
Sbjct: 132 QDAKKPNRLPLRKLFYEDRETFSCSSSEADDIDSLEPGTYCVWTPKKEEGSPGSCKKSSS 191

Query: 464 TGSASRRWKFLDLLRRSNSDGKDSYVFLTPKVDHREDKSDKIERSKVVLKRSPGRSN--- 294
           TGS S+RWKF D + RSNSDGKD++VFL P      +K   +   ++      G  N   
Sbjct: 192 TGSNSKRWKFKDFIHRSNSDGKDTFVFLMP-----NNKKSGLHHQRLDSDDQDGNHNKQG 246

Query: 293 -XXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
                             E +YVR+   KEGDK++SYLPYR DLVGF ++VN +
Sbjct: 247 TEKRKEAKGAGGGLFQFQEHYYVRS---KEGDKRRSYLPYRPDLVGFLSNVNGV 297


>ref|XP_004236105.1| PREDICTED: uncharacterized protein LOC101245934 [Solanum
           lycopersicum]
          Length = 289

 Score =  169 bits (429), Expect = 1e-39
 Identities = 111/288 (38%), Positives = 148/288 (51%), Gaps = 39/288 (13%)
 Frame = -2

Query: 893 PTFNSYSSDKLA-IAARVTDEMRTESNTAGE-------------------------KQND 792
           P+FNSYS+  LA IA RV  E R E+ T  E                            D
Sbjct: 11  PSFNSYSNKNLAEIADRVVQEFRAENGTDEEFFIDDDGGLSCFESGGTLDEKEEEQGNED 70

Query: 791 DDDFEFSLVREDP-------EIYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSSD--EG 639
           +D+FEFS V++         EI+Y GQI+PI+P+FNRDLL D   G + +N  S +    
Sbjct: 71  EDEFEFSFVKQSEISPVAADEIFYNGQIRPIYPLFNRDLLSDFMNGRSKSNSTSEEISSA 130

Query: 638 QSSEIRIPLKNLFXXXXXXXXXXXXXXXXXE--NVPPGTYCVWRPKTNSESPNRCKKSNS 465
           +   IR+PL+ LF                 +   +P GTYC W PK   +S   CKKS+S
Sbjct: 131 RPKSIRLPLRKLFMEEEREGNFSCSSSEADDLEGIPEGTYCTWTPKPEEKSAGSCKKSSS 190

Query: 464 TGSASRRWKFLDLLRRSNSDGKDSYVFLTPKVDHREDKSDK--IERSKVVLKRSPGRSNX 291
           TGS S+RWKF DLL RSNS+GKD++VFLT     +E+K++K  I+ +  V  +S G    
Sbjct: 191 TGS-SKRWKFRDLLYRSNSEGKDTFVFLTHSCRKKENKAEKTTIDNAAKVTAKSKG---- 245

Query: 290 XXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFAS 147
                              YV+N     G+K+K+YLPYRQDLVGFFA+
Sbjct: 246 --------IPVADGCPAMHYVKNGG---GEKRKTYLPYRQDLVGFFAT 282


>ref|XP_002282787.1| PREDICTED: uncharacterized protein LOC100243388 [Vitis vinifera]
          Length = 309

 Score =  167 bits (424), Expect = 5e-39
 Identities = 118/310 (38%), Positives = 146/310 (47%), Gaps = 53/310 (17%)
 Frame = -2

Query: 905 LCVCPTFNSYSSDKLA-IAARVTDEM-RTESNTAGEKQND-------------------- 792
           L + P+FNSY+S +LA IAAR+ +E+  +E  T  E  ND                    
Sbjct: 13  LGLSPSFNSYNSGRLAEIAARIIEELGESEFQTEIEAGNDLEESIPSLQVKEGDEESALF 72

Query: 791 ---------------------DDDFEFSLVREDPE--------IYYTGQIQPIFPIFNRD 699
                                DDD EF+ V  +PE        I+Y GQI+P+FPIFNRD
Sbjct: 73  SEVKSGNDDEEVKEDEVDNDGDDDSEFAFVWREPETSPISADEIFYNGQIRPVFPIFNRD 132

Query: 698 LLFDDSEGNNNNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXENVPPGTYCV 519
           LL     G   N + S      + IR PL+ L                  E VPPGTYCV
Sbjct: 133 LLL----GEGQNQEVSVKPPTPASIRRPLRKLLIEERGTDSCSSSEADELEGVPPGTYCV 188

Query: 518 WRPKTNSESPNRCKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTP--KVDHREDKSD 345
           W PK     P RC+KSNSTGS S+RWKF D L RSNSDGKD++VFLTP   V  + +K  
Sbjct: 189 WTPKVAESPPARCRKSNSTGS-SKRWKFRDFLHRSNSDGKDTFVFLTPNSSVKKKAEKEA 247

Query: 344 KIERSKVVLKRSPGRSNXXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDL 165
                K   K   G +                          A KEGD+++S+LPYRQDL
Sbjct: 248 PSGAGKPKPKAVAGENATSANRKN---------------NTAAKKEGDRRRSFLPYRQDL 292

Query: 164 VGFFASVNSL 135
           VGFFA+VN L
Sbjct: 293 VGFFANVNGL 302


>ref|XP_006579758.1| PREDICTED: uncharacterized protein LOC102667078 [Glycine max]
          Length = 268

 Score =  166 bits (419), Expect = 2e-38
 Identities = 112/274 (40%), Positives = 147/274 (53%), Gaps = 7/274 (2%)
 Frame = -2

Query: 935 IEKMHENQEELCVCPTFNSYSSDKLA-IAARVT--DEMRTESNTAGEKQNDDDDFEFSLV 765
           +++  E+  E  VCP+F++YSS+ L  IA +VT  D++R E N        D DFEF   
Sbjct: 1   MQREEESVMESLVCPSFSAYSSNTLDDIADQVTRNDDVRFEEN--------DTDFEFVAF 52

Query: 764 REDPE-IYYTGQIQPIFPIFNRDLLFDDSE-GNNNNNKQSSDEGQSSEIRIPLKNLFXXX 591
           RE  + ++      P FPIF+RDL    +E G     + S +E     I+I L  L    
Sbjct: 53  REVADGVFVDDNATPAFPIFDRDLATATAEDGGEVKRRVSGEEDDVVAIQITLGKLLMDD 112

Query: 590 XXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESP-NRCKKSNSTGSA-SRRWKFLDLLRR 417
                         ENVPPGTYCVW P+    +P + C+KS STGS+ S+RWK LDLLRR
Sbjct: 113 SSPSCSSSEVEDELENVPPGTYCVWTPRKAPPAPASPCRKSKSTGSSLSKRWKLLDLLRR 172

Query: 416 SNSDGKDSYVFLTPKVDHREDKSDKIERSKVVLKRSPGRSNXXXXXXXXXXXXXXXAHEA 237
           SNS+GK+S VFLTP   +   K  K E  K  L  S G                  AHEA
Sbjct: 173 SNSEGKESVVFLTPSSVNSAKKGTKSETGKKSLASSGGGEK------RIAAVPAVSAHEA 226

Query: 236 FYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
            YVRNR ++   K++SYLPYRQDLVGF  ++N +
Sbjct: 227 LYVRNREMRREVKRRSYLPYRQDLVGFCVNLNPM 260


>ref|XP_006374520.1| hypothetical protein POPTR_0015s08750g [Populus trichocarpa]
           gi|550322333|gb|ERP52317.1| hypothetical protein
           POPTR_0015s08750g [Populus trichocarpa]
          Length = 316

 Score =  166 bits (419), Expect = 2e-38
 Identities = 109/300 (36%), Positives = 148/300 (49%), Gaps = 47/300 (15%)
 Frame = -2

Query: 893 PTFNSYSSDKLA-IAARVTDEMRTESNTA--------------GEKQN------------ 795
           P+FNSYSS KLA IAARV  E  +ES+                GE+ N            
Sbjct: 15  PSFNSYSSSKLAEIAARVVLEFTSESDQPEDSNNNIFSWRVHEGEENNHPQNDSELGEND 74

Query: 794 -------DDDDFEFSLV-REDPE--------IYYTGQIQPIFPIFNRDLLFDDSEGNNNN 663
                  DDDDFEF+++ + +P+        I+Y GQI+P +P+FN  LL DD +    +
Sbjct: 75  HEEEEEEDDDDFEFAVLSKPEPQFSPISADDIFYNGQIRPFYPLFNTKLLLDDQDSLPKS 134

Query: 662 NKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESPNR 483
              ++ +      R+PLK LF                 ++  PGTYCVW  K    S   
Sbjct: 135 KTATNTQDNKKPNRLPLKKLFFEDRETFSCSSSEADDIDSAEPGTYCVWTAKKEEGSLGS 194

Query: 482 CKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTP----KVDHREDKSDKIERSKVVLK 315
           CKKS+STGS S+RWKF DLL RSNSDGKD++VFLTP       H+   SD  +       
Sbjct: 195 CKKSSSTGSNSKRWKFKDLLHRSNSDGKDTFVFLTPNNKKSGGHKRFGSDDHDGKN--NN 252

Query: 314 RSPGRSNXXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
           ++  + +                 + FYV+    KEGDK++SYLPYR DLVGF ++V  +
Sbjct: 253 KNINKGSTEKRKEVKGAGGLFELQQQFYVKG---KEGDKRRSYLPYRPDLVGFMSNVKGV 309


>ref|XP_006603809.1| PREDICTED: uncharacterized protein LOC102669230 [Glycine max]
          Length = 326

 Score =  164 bits (416), Expect = 4e-38
 Identities = 108/277 (38%), Positives = 153/277 (55%), Gaps = 7/277 (2%)
 Frame = -2

Query: 944 QRRIEKMHENQEELCVCPTFNSYSSDKLAIAAR--VTDEMRTESNTAGEKQNDDDDFEFS 771
           +++++K+ E+  E  VCP+F++YSS+ L   A   + +++R + N        D DFEF 
Sbjct: 56  EKKMQKVEESVMESLVCPSFSAYSSNTLDDIADQVIRNDVRFQEN--------DTDFEFV 107

Query: 770 LVREDPE-IYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSS-DEGQSSEIRIPLKNLFX 597
             R+  + ++      P FPIF+RDL     E +    +++S +E   + I+I L  L  
Sbjct: 108 AFRKVADGVFVDNNATPAFPIFDRDLATAAEEDSGEGKRRASGEEDDVAAIQITLGKLLM 167

Query: 596 XXXXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESP-NRCKKSNSTGSAS-RRWKFLDLL 423
                           ENVPPGTYCVW P+  S +P   C+KS STGS+S +RWK LDLL
Sbjct: 168 DDSSASCSSSEVEDELENVPPGTYCVWTPRKASPAPATPCRKSKSTGSSSSKRWKLLDLL 227

Query: 422 RRSNSDGKDSYVFLTP-KVDHREDKSDKIERSKVVLKRSPGRSNXXXXXXXXXXXXXXXA 246
           RRSNS+GK+S VFLTP  V+  + K  K E      K+SP  S                A
Sbjct: 228 RRSNSEGKESVVFLTPSSVNSAKKKGTKSETG----KKSPASSG--GGEKRIVAVPAVSA 281

Query: 245 HEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
           HEA YVRNR ++   K++SYLPYRQDLVG   ++NS+
Sbjct: 282 HEALYVRNREMRREVKRRSYLPYRQDLVGLCVNLNSM 318


>gb|EMJ16960.1| hypothetical protein PRUPE_ppa009500mg [Prunus persica]
          Length = 290

 Score =  164 bits (414), Expect = 7e-38
 Identities = 118/294 (40%), Positives = 163/294 (55%), Gaps = 32/294 (10%)
 Frame = -2

Query: 920 ENQEELCVCPTFNSYSSDKLA-IAARVTDE------MRTESNTAGEKQNDD-------DD 783
           E+ E+  VCP+FNSYSSDKLA +AA+V  E      +   + +A +K++DD       DD
Sbjct: 4   EDFEQGLVCPSFNSYSSDKLADVAAKVCREFDNLNLLHKSNYSAEQKEHDDHRNDDVDDD 63

Query: 782 FEF-SLVREDPEIYYT-GQIQPIFPIFNRDLLFDDSEGN-----NNNNKQSSDEGQSSEI 624
           FEF S      ++++   QI P+FP+FNRDLL D S+ +     NN      +E +    
Sbjct: 64  FEFVSFQSSGSQVFFDDNQIGPVFPVFNRDLLLDRSQRDLAAAPNNKEVVEEEEDEDDAA 123

Query: 623 RIPLKNLFXXXXXXXXXXXXXXXXXENVPPGTYCVWRPKT--NSESPNRCK-KSNSTG-S 456
            +P  +                   ++VP GTYCVW PK+    ++  +CK KS STG S
Sbjct: 124 ALPSSS------------SSDVDELDSVPQGTYCVWMPKSVVAQDARGKCKIKSKSTGTS 171

Query: 455 ASRRWKFLDLLRRSNSDG--KDSYVFLTPKVDHREDKSDKIERSKVVLKRS---PGRSNX 291
           +SRRW   DLLRRSNS+   KDS+VFLTP     +  ++  E  K + K S    G  + 
Sbjct: 172 SSRRWSIKDLLRRSNSESGSKDSFVFLTPLSSSSKKAAE--EEPKEIKKSSGSGSGPGSG 229

Query: 290 XXXXXXXXXXXXXXAHEAFYVRNRAI-KEG-DKKKSYLPYRQDLVGFFASVNSL 135
                         AHEAFYVRN+ + K+G +K++SYLPYRQDLVGFFASVN++
Sbjct: 230 GGPNKPKGSKAVSMAHEAFYVRNKTVAKDGYNKRRSYLPYRQDLVGFFASVNAM 283


>gb|EOY20727.1| Uncharacterized protein TCM_012072 [Theobroma cacao]
          Length = 298

 Score =  163 bits (412), Expect = 1e-37
 Identities = 109/297 (36%), Positives = 149/297 (50%), Gaps = 44/297 (14%)
 Frame = -2

Query: 893 PTFNSYSSDKLA-IAARVTDEMRTESNTA----------------------------GEK 801
           P+FN+YSS +LA IAARV +E R ES  +                             E+
Sbjct: 16  PSFNTYSSGRLAEIAARVVEEFRQESGDSCQDDIYETWPPQQKQNPQLQQQVIEEEDNEE 75

Query: 800 QNDDDDFEFSLVREDP--------EIYYTGQIQPIFPIFNRDLLFDDSEGNNNNNKQSSD 645
           + +DDDFEF+ V  +P        EI++ GQI+P +P+FN +LL  D +  +     S+ 
Sbjct: 76  EEEDDDFEFAFVCREPETSPISADEIFHNGQIRPTYPLFNTNLLLSDDQTPDGKTVDSTH 135

Query: 644 E--GQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXENVPPGTYCVWRPKTNS-----ESPN 486
               +    R+PL+ L                  E V PG+YCVW+PK  S     ESP 
Sbjct: 136 PFVSKPGPRRLPLRKLMSEERETTSCSSSEADELEGVTPGSYCVWKPKGGSSGNDQESPG 195

Query: 485 RCKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTPKVDHREDKSDKIERSKVVLKRSP 306
           RCKKSNSTGS S+RWK  DLL RSNSDGKD++VFL P    + +K+     +K +    P
Sbjct: 196 RCKKSNSTGS-SKRWKLRDLLYRSNSDGKDTFVFLAPS---KREKTSNTNGNKAM--EIP 249

Query: 305 GRSNXXXXXXXXXXXXXXXAHEAFYVRNRAIKEGDKKKSYLPYRQDLVGFFASVNSL 135
           G+                   E      R +K GDK++S+LPYRQDLVG F++V+ L
Sbjct: 250 GKFQAA---------------EEHCGGTRNLKPGDKRRSFLPYRQDLVGLFSNVHGL 291


>ref|XP_006341323.1| PREDICTED: uncharacterized protein LOC102604349 [Solanum tuberosum]
          Length = 276

 Score =  157 bits (397), Expect = 6e-36
 Identities = 111/282 (39%), Positives = 150/282 (53%), Gaps = 19/282 (6%)
 Frame = -2

Query: 938 RIEKMHENQEELCVCPTFNSYS--SDKLAIAARVTDEMRTESNTAGEKQNDDDDFEFSLV 765
           R   M +  E+  +C +FN++   SD   IAA+++DE + E   +      ++DFEFSLV
Sbjct: 3   RKTNMKQVGEDEYLCLSFNNFGLLSD---IAAKISDEFQAEEKDS--LNGGEEDFEFSLV 57

Query: 764 REDP-----EIYYTGQI---QPIFPIFNRDLLFDDSEGNNNNNKQSSDEGQSSEIRIPLK 609
            E P     E  Y GQ    QPIFP+FN DLL  DS+  + +            I IPLK
Sbjct: 58  SEKPDNSIAEFIYDGQTKFQQPIFPLFNCDLLLSDSDLKDVDKS----------ILIPLK 107

Query: 608 NLFXXXXXXXXXXXXXXXXXENVPPGTYCVWRPKTNSESPNRCKKSNSTGSASRRW-KFL 432
            LF                   +P GTYC+W+PK +  SP +CKKS STG  S+RW +  
Sbjct: 108 KLFLEKSESSASSEAEELE--TIPAGTYCMWKPKISEPSPGKCKKSKSTGFVSKRWPRIR 165

Query: 431 DLLRRSNSDGK--DSYVFLTPKVDHREDKSDKIERSKVVLKRS----PGRSNXXXXXXXX 270
           DLLRRSNS+GK  DS+VFL PK    E++S K + S  V+K +      +++        
Sbjct: 166 DLLRRSNSEGKEEDSFVFLKPK-KTIENESAKTKNSSEVVKTAGKFKQTKTSTSGGEKVL 224

Query: 269 XXXXXXXAHEAFYVRNRAIKEGD--KKKSYLPYRQDLVGFFA 150
                      +YVRNRA KE D  K+KSYLPYR++L+GFFA
Sbjct: 225 PPPPPASQQAVYYVRNRADKEADKNKRKSYLPYRKELIGFFA 266


>ref|XP_006439821.1| hypothetical protein CICLE_v10021025mg [Citrus clementina]
           gi|557542083|gb|ESR53061.1| hypothetical protein
           CICLE_v10021025mg [Citrus clementina]
          Length = 337

 Score =  157 bits (397), Expect = 6e-36
 Identities = 114/321 (35%), Positives = 161/321 (50%), Gaps = 70/321 (21%)
 Frame = -2

Query: 893 PTFNSYSSDKLA-IAARVTDEMRTES----------NTAGEKQNDDD------------- 786
           P+F+SYSS  LA IAARV +E R +           N   E+Q ++D             
Sbjct: 17  PSFSSYSSANLAEIAARVVEEFRQQEPEYSEDIFDINWERERQQEEDRRGSSLDDDVVLF 76

Query: 785 -----------------DFEFSLVREDP---------EIYYTGQIQPIFPIFNRDLLFDD 684
                            +FEF++V             EI++ GQI+P++P+FNRDLL  +
Sbjct: 77  SQQKHEEEEEEEEEEEEEFEFAVVCNKKQECSSITADEIFFNGQIKPLYPLFNRDLLLYN 136

Query: 683 SEGNN----------NNNKQSSDEGQSSEIRIPLKNLFXXXXXXXXXXXXXXXXXE---N 543
            + NN          +    ++  G+SS  R+PL  L                  +   N
Sbjct: 137 YDQNNAAAPSPAPAASPTTTTTTTGRSSN-RLPLGKLMSEERETSVSCSCSSSEADDLDN 195

Query: 542 VPPGTYCVWRPKTN--SESPNR-CKKSNSTGSASRRWKFLDLLRRSNSDGKDSYVFLTPK 372
           + PGTYCVW PK +  S+SP R CKKS+STGS S+RWKF DLL RSNSDGKD++VF TP 
Sbjct: 196 LTPGTYCVWTPKPSKESQSPGRSCKKSHSTGSNSKRWKFRDLLYRSNSDGKDTFVFFTPL 255

Query: 371 V--DHREDKSD--KIERSKVVLKRSPGRSNXXXXXXXXXXXXXXXAHEAFYVRNRAIKEG 204
           V  +H   K+   +    +    +   +S+                ++  YVRNR++KE 
Sbjct: 256 VTGNHHNKKATAARHRHHRHEDSQDYNKSSSRSATGTGKAADKATVNKEHYVRNRSVKEE 315

Query: 203 DKKKSYLPYRQDLVGFFASVN 141
           DKK+S+LPYR+DLVGFF++VN
Sbjct: 316 DKKRSFLPYRKDLVGFFSNVN 336


Top