BLASTX nr result

ID: Forsythia23_contig00022083 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00022083
         (556 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166...   108   1e-21
ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583...   102   8e-20
ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260...   100   5e-19
ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260...   100   5e-19
ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260...   100   5e-19
ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236...    98   2e-18
ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236...    98   2e-18
ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116...    98   2e-18
emb|CDP05166.1| unnamed protein product [Coffea canephora]             97   4e-18
ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family prot...    94   3e-17
gb|EYU19567.1| hypothetical protein MIMGU_mgv1a011273mg [Erythra...    94   5e-17
ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family prot...    94   5e-17
ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family prot...    94   5e-17
gb|KHF98668.1| hypothetical protein F383_11887 [Gossypium arboreum]    93   8e-17
gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium r...    91   3e-16
ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765...    91   3e-16
gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]    91   3e-16
ref|XP_012486505.1| PREDICTED: uncharacterized protein LOC105800...    90   5e-16
ref|XP_002513675.1| conserved hypothetical protein [Ricinus comm...    86   1e-14
ref|XP_008439268.1| PREDICTED: uncharacterized protein LOC103484...    85   2e-14

>ref|XP_011084440.1| PREDICTED: uncharacterized protein LOC105166690 [Sesamum indicum]
          Length = 479

 Score =  108 bits (271), Expect = 1e-21
 Identities = 52/97 (53%), Positives = 68/97 (70%), Gaps = 1/97 (1%)
 Frame = -1

Query: 550 RDLMLMNRDVSSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVL 371
           +DL   N D   +++  ET NE   +   GEG +  QK RT+SLGSSKDFNF+N KGE+ 
Sbjct: 383 KDLTTKNADSCREHNDGETTNEVPEIPLDGEGGELHQKQRTVSLGSSKDFNFNNAKGEIP 442

Query: 370 DKSTINCEWWTSDIDARKEL-RQNNWTFFPMLQPGVS 263
           +KS+INCEWWT++   RKEL  +N+W+FFPMLQ G S
Sbjct: 443 EKSSINCEWWTNEKVVRKELGPRNSWSFFPMLQSGAS 479


>ref|XP_006353896.1| PREDICTED: uncharacterized protein LOC102583548 [Solanum tuberosum]
          Length = 470

 Score =  102 bits (255), Expect = 8e-20
 Identities = 52/94 (55%), Positives = 63/94 (67%), Gaps = 1/94 (1%)
 Frame = -1

Query: 541 MLMNRDVSSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKS 362
           +L N   S    A E    +   AS+   D+C +KHR I+ GSSKDF+FDN K EVL+K 
Sbjct: 377 LLANEMKSGSSMAEEKTYGSPRKASESGEDQCHRKHRNITFGSSKDFDFDNVKIEVLEKD 436

Query: 361 TINCEWWTSDIDARKELR-QNNWTFFPMLQPGVS 263
           +I+CEWWTSD  A KE   QNNWTFFP+LQPGVS
Sbjct: 437 SIDCEWWTSDKAAGKESGIQNNWTFFPVLQPGVS 470


>ref|XP_010317637.1| PREDICTED: uncharacterized protein LOC101260903 isoform X3 [Solanum
           lycopersicum]
          Length = 469

 Score =  100 bits (248), Expect = 5e-19
 Identities = 50/87 (57%), Positives = 60/87 (68%), Gaps = 1/87 (1%)
 Frame = -1

Query: 520 SSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWW 341
           S    A E    +   AS+   D+C +KHR I+ GSSKDF+FDN K EVL+K +I+CEWW
Sbjct: 383 SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 442

Query: 340 TSDIDARKELR-QNNWTFFPMLQPGVS 263
           TSD  A KE   QNNWTFFP+LQPGVS
Sbjct: 443 TSDKAAVKESGIQNNWTFFPVLQPGVS 469


>ref|XP_010317636.1| PREDICTED: uncharacterized protein LOC101260903 isoform X1 [Solanum
           lycopersicum]
          Length = 476

 Score =  100 bits (248), Expect = 5e-19
 Identities = 50/87 (57%), Positives = 60/87 (68%), Gaps = 1/87 (1%)
 Frame = -1

Query: 520 SSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWW 341
           S    A E    +   AS+   D+C +KHR I+ GSSKDF+FDN K EVL+K +I+CEWW
Sbjct: 390 SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 449

Query: 340 TSDIDARKELR-QNNWTFFPMLQPGVS 263
           TSD  A KE   QNNWTFFP+LQPGVS
Sbjct: 450 TSDKAAVKESGIQNNWTFFPVLQPGVS 476


>ref|XP_004234428.1| PREDICTED: uncharacterized protein LOC101260903 isoform X2 [Solanum
           lycopersicum]
          Length = 470

 Score =  100 bits (248), Expect = 5e-19
 Identities = 50/87 (57%), Positives = 60/87 (68%), Gaps = 1/87 (1%)
 Frame = -1

Query: 520 SSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWW 341
           S    A E    +   AS+   D+C +KHR I+ GSSKDF+FDN K EVL+K +I+CEWW
Sbjct: 384 SGSSMAEEKTYGSPRKASESGEDECHRKHRNITFGSSKDFDFDNVKIEVLEKDSIDCEWW 443

Query: 340 TSDIDARKELR-QNNWTFFPMLQPGVS 263
           TSD  A KE   QNNWTFFP+LQPGVS
Sbjct: 444 TSDKAAVKESGIQNNWTFFPVLQPGVS 470


>ref|XP_009788654.1| PREDICTED: uncharacterized protein LOC104236433 isoform X2
           [Nicotiana sylvestris]
          Length = 442

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 48/87 (55%), Positives = 59/87 (67%), Gaps = 1/87 (1%)
 Frame = -1

Query: 520 SSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWW 341
           SS     E  +     AS+   D+C +KHR I+ GSSKDF+FDN K EVL+K +++CEWW
Sbjct: 356 SSSSIVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDFDNVKIEVLEKHSVDCEWW 415

Query: 340 TSDIDARKELR-QNNWTFFPMLQPGVS 263
           TSD    KE   QNNWTFFP+LQPGVS
Sbjct: 416 TSDKATGKESSIQNNWTFFPVLQPGVS 442


>ref|XP_009788653.1| PREDICTED: uncharacterized protein LOC104236433 isoform X1
           [Nicotiana sylvestris]
          Length = 470

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 48/87 (55%), Positives = 59/87 (67%), Gaps = 1/87 (1%)
 Frame = -1

Query: 520 SSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWW 341
           SS     E  +     AS+   D+C +KHR I+ GSSKDF+FDN K EVL+K +++CEWW
Sbjct: 384 SSSSIVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDFDNVKIEVLEKHSVDCEWW 443

Query: 340 TSDIDARKELR-QNNWTFFPMLQPGVS 263
           TSD    KE   QNNWTFFP+LQPGVS
Sbjct: 444 TSDKATGKESSIQNNWTFFPVLQPGVS 470


>ref|XP_009625236.1| PREDICTED: uncharacterized protein LOC104116142 [Nicotiana
           tomentosiformis]
          Length = 470

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 47/87 (54%), Positives = 60/87 (68%), Gaps = 1/87 (1%)
 Frame = -1

Query: 520 SSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWW 341
           SS  +  E  +     AS+   D+C +KHR I+ GSSKDF+FDN K EVL++ +++CEWW
Sbjct: 384 SSSSNVEEKTDGLPEKASERGDDQCHRKHRNITFGSSKDFDFDNVKIEVLEEDSVDCEWW 443

Query: 340 TSDIDARKELR-QNNWTFFPMLQPGVS 263
           TSD    KE   QNNWTFFP+LQPGVS
Sbjct: 444 TSDKATGKESSIQNNWTFFPVLQPGVS 470


>emb|CDP05166.1| unnamed protein product [Coffea canephora]
          Length = 452

 Score = 97.1 bits (240), Expect = 4e-18
 Identities = 46/67 (68%), Positives = 53/67 (79%), Gaps = 1/67 (1%)
 Frame = -1

Query: 460 EGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWWTSDIDARKEL-RQNNWTFFP 284
           EG +C + +RT SLGSSKDFNFDN K E  DKSTI+CEWWT++  A KEL  +N WTFFP
Sbjct: 386 EGKQCLKNNRTFSLGSSKDFNFDNMKQESPDKSTIDCEWWTNETAAAKELGSKNKWTFFP 445

Query: 283 MLQPGVS 263
           MLQPGVS
Sbjct: 446 MLQPGVS 452


>ref|XP_007038760.1| Hydroxyproline-rich glycoprotein family protein [Theobroma cacao]
           gi|508776005|gb|EOY23261.1| Hydroxyproline-rich
           glycoprotein family protein [Theobroma cacao]
          Length = 540

 Score = 94.4 bits (233), Expect = 3e-17
 Identities = 47/82 (57%), Positives = 59/82 (71%), Gaps = 1/82 (1%)
 Frame = -1

Query: 508 SARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWWTSDI 329
           ++ ETV +ASG A   E +   QKHR+++LGS K+FNFDNTKGE  DK TI  EWW ++ 
Sbjct: 461 TSNETVEKASGKA---EEEHSYQKHRSVTLGSIKEFNFDNTKGEASDKPTIRSEWWANEK 517

Query: 328 DARKELRQ-NNWTFFPMLQPGV 266
            ARKE R  N+WTFFPM +PGV
Sbjct: 518 FARKEARPGNSWTFFPMFRPGV 539


>gb|EYU19567.1| hypothetical protein MIMGU_mgv1a011273mg [Erythranthe guttata]
           gi|604299725|gb|EYU19568.1| hypothetical protein
           MIMGU_mgv1a011273mg [Erythranthe guttata]
          Length = 287

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 45/82 (54%), Positives = 59/82 (71%), Gaps = 2/82 (2%)
 Frame = -1

Query: 502 RETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTK-GEVLDKSTINCEWWTSDID 326
           +ETV+E  G+    EG+  D KHRTIS GSSKDFNF+N K  EV +KS+++CEWW ++  
Sbjct: 206 KETVSEVRGVPLDSEGEDLDLKHRTISFGSSKDFNFNNAKEEEVSEKSSVDCEWWINENG 265

Query: 325 ARKELR-QNNWTFFPMLQPGVS 263
             KEL  +N+W+FFPMLQ G S
Sbjct: 266 VTKELSPRNSWSFFPMLQSGAS 287


>ref|XP_007038766.1| Hydroxyproline-rich glycoprotein family protein isoform 2
           [Theobroma cacao] gi|508776011|gb|EOY23267.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           2 [Theobroma cacao]
          Length = 489

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 51/98 (52%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
 Frame = -1

Query: 550 RDLMLMNRDVSSKYSARETVNEASGMAS-QGEGDKCDQKHRTISLGSSKDFNFDNTKGEV 374
           RD +  + + S +   RET NE    AS + E +   QKHR+++LGS K+FNFDNTKGE 
Sbjct: 392 RDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEA 451

Query: 373 LDKSTINCEWWTSDIDARKELRQ-NNWTFFPMLQPGVS 263
            DK TI  EWW ++  A KE R  N+WTFFPMLQP VS
Sbjct: 452 SDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 489


>ref|XP_007038765.1| Hydroxyproline-rich glycoprotein family protein isoform 1
           [Theobroma cacao] gi|508776010|gb|EOY23266.1|
           Hydroxyproline-rich glycoprotein family protein isoform
           1 [Theobroma cacao]
          Length = 485

 Score = 93.6 bits (231), Expect = 5e-17
 Identities = 51/98 (52%), Positives = 64/98 (65%), Gaps = 2/98 (2%)
 Frame = -1

Query: 550 RDLMLMNRDVSSKYSARETVNEASGMAS-QGEGDKCDQKHRTISLGSSKDFNFDNTKGEV 374
           RD +  + + S +   RET NE    AS + E +   QKHR+++LGS K+FNFDNTKGE 
Sbjct: 388 RDGIKKDLESSCELFIRETSNETVEKASGEAEEEHSYQKHRSVTLGSIKEFNFDNTKGEA 447

Query: 373 LDKSTINCEWWTSDIDARKELRQ-NNWTFFPMLQPGVS 263
            DK TI  EWW ++  A KE R  N+WTFFPMLQP VS
Sbjct: 448 SDKPTIRSEWWANEKVAGKEARPGNSWTFFPMLQPEVS 485


>gb|KHF98668.1| hypothetical protein F383_11887 [Gossypium arboreum]
          Length = 489

 Score = 92.8 bits (229), Expect = 8e-17
 Identities = 48/94 (51%), Positives = 64/94 (68%), Gaps = 2/94 (2%)
 Frame = -1

Query: 550 RDLMLMNRDVSSKYSARETVNEASGMAS-QGEGDKCDQKHRTISLGSSKDFNFDNTKGEV 374
           RD M  + + S K  +RET NE    AS + E + C QKHR+++LGS K+FNFD+TKGE 
Sbjct: 395 RDGMKKDLESSCKLFSRETSNETVEKASGESEEEHCYQKHRSVTLGSIKEFNFDSTKGEA 454

Query: 373 LDKSTINCEWWTSDIDARKELRQ-NNWTFFPMLQ 275
            DK +I  EWW ++  A KE++  NNW+FFPMLQ
Sbjct: 455 SDKPSIRSEWWANEKVAGKEVKPGNNWSFFPMLQ 488


>gb|KJB52747.1| hypothetical protein B456_008G275500 [Gossypium raimondii]
          Length = 464

 Score = 90.9 bits (224), Expect = 3e-16
 Identities = 42/82 (51%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
 Frame = -1

Query: 505 ARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWWTSDID 326
           A+  + +   ++ + E D C QKHR+++LGS K+FNFDN KGE  +K T+  EWW ++  
Sbjct: 383 AQGRIEKDEKVSGEAEEDHCYQKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWANEKV 442

Query: 325 ARKELRQ-NNWTFFPMLQPGVS 263
           A KE R  NNWTFFPMLQP VS
Sbjct: 443 AGKEARPGNNWTFFPMLQPEVS 464


>ref|XP_012440132.1| PREDICTED: uncharacterized protein LOC105765522 [Gossypium
           raimondii] gi|763785675|gb|KJB52746.1| hypothetical
           protein B456_008G275500 [Gossypium raimondii]
          Length = 465

 Score = 90.9 bits (224), Expect = 3e-16
 Identities = 42/82 (51%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
 Frame = -1

Query: 505 ARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWWTSDID 326
           A+  + +   ++ + E D C QKHR+++LGS K+FNFDN KGE  +K T+  EWW ++  
Sbjct: 384 AQGRIEKDEKVSGEAEEDHCYQKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWANEKV 443

Query: 325 ARKELRQ-NNWTFFPMLQPGVS 263
           A KE R  NNWTFFPMLQP VS
Sbjct: 444 AGKEARPGNNWTFFPMLQPEVS 465


>gb|KHG09821.1| hypothetical protein F383_13171 [Gossypium arboreum]
          Length = 465

 Score = 90.9 bits (224), Expect = 3e-16
 Identities = 42/82 (51%), Positives = 56/82 (68%), Gaps = 1/82 (1%)
 Frame = -1

Query: 505 ARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKSTINCEWWTSDID 326
           A+  + +   ++ + E D C QKHR+++LGS K+FNFDN KGE  +K T+  EWW ++  
Sbjct: 384 AQGRIEKDEKVSGEAEEDHCYQKHRSVTLGSIKEFNFDNRKGEASEKPTVRSEWWANEKV 443

Query: 325 ARKELRQ-NNWTFFPMLQPGVS 263
           A KE R  NNWTFFPMLQP VS
Sbjct: 444 AGKEARPGNNWTFFPMLQPEVS 465


>ref|XP_012486505.1| PREDICTED: uncharacterized protein LOC105800123 [Gossypium
           raimondii] gi|763770082|gb|KJB37297.1| hypothetical
           protein B456_006G198500 [Gossypium raimondii]
           gi|763770083|gb|KJB37298.1| hypothetical protein
           B456_006G198500 [Gossypium raimondii]
          Length = 478

 Score = 90.1 bits (222), Expect = 5e-16
 Identities = 46/95 (48%), Positives = 63/95 (66%), Gaps = 2/95 (2%)
 Frame = -1

Query: 553 WRDLMLMNRDVSSKYSARETVNEASGMAS-QGEGDKCDQKHRTISLGSSKDFNFDNTKGE 377
           +RD M  + + S K  +RET NE    AS + E + C QKHR+++LGS K+FNFD+ KGE
Sbjct: 383 YRDGMTKDLESSCKLFSRETSNETVEKASGESEEEHCYQKHRSVTLGSIKEFNFDSAKGE 442

Query: 376 VLDKSTINCEWWTSDIDARKELRQ-NNWTFFPMLQ 275
             D  +I  EWW ++  A KE++  NNW+FFPMLQ
Sbjct: 443 ASDNPSIRSEWWANEKVAGKEVKPGNNWSFFPMLQ 477


>ref|XP_002513675.1| conserved hypothetical protein [Ricinus communis]
           gi|223547583|gb|EEF49078.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 510

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 45/95 (47%), Positives = 66/95 (69%), Gaps = 1/95 (1%)
 Frame = -1

Query: 544 LMLMNRDVSSKYSARETVNEASGMASQGEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDK 365
           +++ + ++ +  ++ ET  + SG   + E + C +KHR+I+LGS K+FNFDN+K EV DK
Sbjct: 420 MLMTDENLPTGETSGETPEKPSG---EMEEEHCYRKHRSITLGSIKEFNFDNSK-EVPDK 475

Query: 364 STINCEWWTSDIDARKELR-QNNWTFFPMLQPGVS 263
            +IN EWW ++  A KE R  NNWTFFP+LQP VS
Sbjct: 476 PSINSEWWANETIAGKEARPANNWTFFPLLQPEVS 510


>ref|XP_008439268.1| PREDICTED: uncharacterized protein LOC103484098 [Cucumis melo]
          Length = 497

 Score = 85.1 bits (209), Expect = 2e-14
 Identities = 41/94 (43%), Positives = 61/94 (64%), Gaps = 4/94 (4%)
 Frame = -1

Query: 532 NRDVSSKYSARETVNEASGMASQ---GEGDKCDQKHRTISLGSSKDFNFDNTKGEVLDKS 362
           N+++S +    E  +  + MA +   GE D+C Q  R ++LGS K+FNFD TKGEV + +
Sbjct: 404 NKELSREAETCEFFDIKTSMAPEKTPGEDDQCYQNQRAVTLGSFKEFNFDQTKGEVHNTA 463

Query: 361 TINCEWWTSD-IDARKELRQNNWTFFPMLQPGVS 263
           +I  EWW ++ +  ++    NNWTFFP+LQPGVS
Sbjct: 464 SIGAEWWANEKVGVKEASPGNNWTFFPLLQPGVS 497


Top