BLASTX nr result

ID: Forsythia21_contig00023823 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00023823
         (450 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CDP01750.1| unnamed protein product [Coffea canephora]             97   5e-18
ref|XP_002528523.1| conserved hypothetical protein [Ricinus comm...    89   1e-15
ref|XP_007046438.1| Uncharacterized protein TCM_000023 [Theobrom...    83   8e-14
gb|KDO71601.1| hypothetical protein CISIN_1g043132mg [Citrus sin...    77   4e-12
gb|EYU30912.1| hypothetical protein MIMGU_mgv1a023162mg [Erythra...    74   5e-11
ref|XP_006370753.1| hypothetical protein POPTR_0001s46090g, part...    73   8e-11
gb|EYU30910.1| hypothetical protein MIMGU_mgv1a021709mg [Erythra...    71   2e-10
gb|KHN03125.1| hypothetical protein glysoja_040209 [Glycine soja]      70   4e-10
ref|XP_007158040.1| hypothetical protein PHAVU_002G119100g [Phas...    70   5e-10
ref|XP_002317632.1| hypothetical protein POPTR_0011s14830g [Popu...    70   7e-10
ref|XP_012458451.1| PREDICTED: uncharacterized protein LOC105779...    69   9e-10
gb|KHN35934.1| hypothetical protein glysoja_003057 [Glycine soja]      69   9e-10
ref|XP_007203185.1| hypothetical protein PRUPE_ppa025168mg [Prun...    68   2e-09
gb|KHN41798.1| hypothetical protein glysoja_003538 [Glycine soja]      64   3e-08
ref|XP_003631927.1| PREDICTED: uncharacterized protein LOC100854...    64   3e-08
gb|KCW53312.1| hypothetical protein EUGRSUZ_J02566 [Eucalyptus g...    64   4e-08
gb|KHN38849.1| hypothetical protein glysoja_014841 [Glycine soja]      62   2e-07
ref|XP_003629053.1| hypothetical protein MTR_8g072610 [Medicago ...    60   7e-07

>emb|CDP01750.1| unnamed protein product [Coffea canephora]
          Length = 142

 Score = 96.7 bits (239), Expect = 5e-18
 Identities = 50/109 (45%), Positives = 67/109 (61%), Gaps = 1/109 (0%)
 Frame = -2

Query: 326 CYGVKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITXXXXXXXRNY 147
           C+G++GFRIN RRFSV R+R KF+YL +L  +W+ SY NAL S+++N+            
Sbjct: 23  CHGIRGFRINPRRFSVQRLRTKFLYLLKLFNKWRFSYGNALRSMRKNLTRNSSCKRNSGS 82

Query: 146 NKDDNIHM-ELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
            +   + M E+P  Y+      D  R RS   +NSF SEAIADCLDFIK
Sbjct: 83  GRKSLVVMKEVPYTYTMG----DSSRLRSYARTNSFYSEAIADCLDFIK 127


>ref|XP_002528523.1| conserved hypothetical protein [Ricinus communis]
           gi|223532025|gb|EEF33835.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 127

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 50/117 (42%), Positives = 68/117 (58%), Gaps = 2/117 (1%)
 Frame = -2

Query: 347 YDRVRRRC--YGVKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITX 174
           Y ++ RR   YG +GFR+N +RFSV R+R +F+YLF+LL RWK SY +A+ SLKR++   
Sbjct: 9   YSKIGRRYHGYGGRGFRLNCKRFSVQRLRARFVYLFKLLSRWKSSYGHAVQSLKRSM--- 65

Query: 173 XXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
                    ++   I        S   +     R R+   SNSF SEAIADCL+FIK
Sbjct: 66  ---------SRSSGIKRNTSSRRSLVVEVSSDCRMRTFGRSNSFYSEAIADCLEFIK 113


>ref|XP_007046438.1| Uncharacterized protein TCM_000023 [Theobroma cacao]
           gi|508698699|gb|EOX90595.1| Uncharacterized protein
           TCM_000023 [Theobroma cacao]
          Length = 146

 Score = 82.8 bits (203), Expect = 8e-14
 Identities = 56/127 (44%), Positives = 68/127 (53%), Gaps = 8/127 (6%)
 Frame = -2

Query: 359 SKMDYDRVRRRCYGV--KGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKR- 189
           S M Y+RV RRC G   KGFR+N RRFSV  +R +F YLFRLL RW+ SY  AL  +K+ 
Sbjct: 3   SHMRYNRVGRRCQGSGNKGFRLNPRRFSVQGLRARFFYLFRLLSRWRTSYGRALRLIKKL 62

Query: 188 --NI--ITXXXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFR-SLTHSNSFCSEAIA 24
             NI   +          +   +  +  P             R R SL  SNSF SEAIA
Sbjct: 63  GINIGNSSIKRDNSSGRVSSSSSRTLVTPKELQLPLPNSTTTRLRPSLGRSNSFYSEAIA 122

Query: 23  DCLDFIK 3
           DCL+FIK
Sbjct: 123 DCLEFIK 129


>gb|KDO71601.1| hypothetical protein CISIN_1g043132mg [Citrus sinensis]
          Length = 153

 Score = 77.0 bits (188), Expect = 4e-12
 Identities = 49/120 (40%), Positives = 68/120 (56%), Gaps = 12/120 (10%)
 Frame = -2

Query: 326 CYGVKGFRINL---RRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITXXXXXXX 156
           C   KGFR+N+   +RFSV  +R +F+YLFRLL RW+ SY  AL SL   I++       
Sbjct: 23  CCKGKGFRVNVSKTKRFSVQGLRARFVYLFRLLSRWRFSYGRALKSL---ILSKKEKGIF 79

Query: 155 RNYNKDDNIHME--LPCAYSYSSDRHDQYR-------FRSLTHSNSFCSEAIADCLDFIK 3
            N  ++++      +P   S SS  H+ ++        RS   SNSF +EAIADCL+FIK
Sbjct: 80  VNIKRNNSSSKRNLVPNVNSMSSQGHEYHQPSVGCRSMRSFGRSNSFYAEAIADCLEFIK 139


>gb|EYU30912.1| hypothetical protein MIMGU_mgv1a023162mg [Erythranthe guttata]
          Length = 137

 Score = 73.6 bits (179), Expect = 5e-11
 Identities = 49/123 (39%), Positives = 71/123 (57%), Gaps = 4/123 (3%)
 Frame = -2

Query: 359 SKMDYDRVRRRCYGVKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSL-KRNI 183
           S+ +Y++  +RC G  G ++N RRF V R+R  FM + R+L RWK  Y N +  L  RN+
Sbjct: 8   SERNYEKTSKRCCG--GIKLNPRRFCVQRLRTNFMNIVRVLRRWKNYYKNGVRKLITRNL 65

Query: 182 ITXXXXXXXRNYNKDDNIHMELPCAYSYSS---DRHDQYRFRSLTHSNSFCSEAIADCLD 12
            T       R   + D++   +   Y YSS   + +++ R   + HSNSF SEAIADCL+
Sbjct: 66  GT-------RKKERIDDVEF-VNYYYYYSSNNNNNNNKRRVPIMEHSNSFSSEAIADCLE 117

Query: 11  FIK 3
           FIK
Sbjct: 118 FIK 120


>ref|XP_006370753.1| hypothetical protein POPTR_0001s46090g, partial [Populus
           trichocarpa] gi|550350000|gb|ERP67322.1| hypothetical
           protein POPTR_0001s46090g, partial [Populus trichocarpa]
          Length = 141

 Score = 72.8 bits (177), Expect = 8e-11
 Identities = 46/105 (43%), Positives = 55/105 (52%), Gaps = 1/105 (0%)
 Frame = -2

Query: 314 KGFRINL-RRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITXXXXXXXRNYNKD 138
           +GFR+   RRFSV R R +F  LFR L RW+ SY  A+  LKR  +           +K 
Sbjct: 26  RGFRLKYPRRFSVQRFRARFFCLFRFLSRWRSSYGQAVQYLKRG-MGRDSGIKRCGSSKR 84

Query: 137 DNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
             +     C Y    D H  Y  RS   SNSF SEAIADCL+FIK
Sbjct: 85  VLVMDATSCHYMEKGDEH--YSCRSFGRSNSFYSEAIADCLEFIK 127


>gb|EYU30910.1| hypothetical protein MIMGU_mgv1a021709mg [Erythranthe guttata]
          Length = 139

 Score = 71.2 bits (173), Expect = 2e-10
 Identities = 47/123 (38%), Positives = 69/123 (56%), Gaps = 4/123 (3%)
 Frame = -2

Query: 359 SKMDYDRVR-RRCYGVKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSL-KRN 186
           S+ +Y++   +RC G  G ++N RRF V R+R  FM + R+L RWK  Y N +  L  RN
Sbjct: 8   SRRNYEKTSCKRCCG--GIKLNPRRFCVQRLRTNFMNIVRVLRRWKNYYKNGVRKLITRN 65

Query: 185 IITXXXXXXXRNYNKDDNIHMELPCAYSYS--SDRHDQYRFRSLTHSNSFCSEAIADCLD 12
           + T           + D++       Y YS  ++ +++ R   + HSNSF SEAIADCL+
Sbjct: 66  LGTRKK-------ERIDDVEFVYYNYYYYSKNNNNNNKRRVAIMEHSNSFSSEAIADCLE 118

Query: 11  FIK 3
           FIK
Sbjct: 119 FIK 121


>gb|KHN03125.1| hypothetical protein glysoja_040209 [Glycine soja]
          Length = 145

 Score = 70.5 bits (171), Expect = 4e-10
 Identities = 39/104 (37%), Positives = 55/104 (52%)
 Frame = -2

Query: 314 KGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITXXXXXXXRNYNKDD 135
           +GFR+N R+F V R+R++F +  RL   WK SY  A+  LK+ +            N  +
Sbjct: 24  RGFRLNPRKFYVLRLRKRFNFFLRLFDSWKLSYGEAIQLLKKMV----CRKSGLKRNNSN 79

Query: 134 NIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
           N    L  +       H+  + RS   SNSF +EAIADCL+FIK
Sbjct: 80  NSTRSLVRSREEKIKDHEDCKMRSCGRSNSFYAEAIADCLEFIK 123


>ref|XP_007158040.1| hypothetical protein PHAVU_002G119100g [Phaseolus vulgaris]
           gi|561031455|gb|ESW30034.1| hypothetical protein
           PHAVU_002G119100g [Phaseolus vulgaris]
          Length = 142

 Score = 70.1 bits (170), Expect = 5e-10
 Identities = 48/125 (38%), Positives = 64/125 (51%), Gaps = 6/125 (4%)
 Frame = -2

Query: 359 SKMDYDRVRR-----RCYG-VKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSS 198
           S M Y+RV        C+G  +GFR+NLRR    R+R++F +L  L  RWK SY+ AL  
Sbjct: 2   SHMSYNRVSSGRGSSSCHGKCRGFRLNLRRLYFLRLRKRFTFLLNLFDRWKLSYAQALQL 61

Query: 197 LKRNIITXXXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADC 18
           LK+  +        RNY+      +         SD     R  S   +NSF +EAIADC
Sbjct: 62  LKK--VFRRKSGFKRNYSNSSRSGLVRDERIKGQSDS----RLSSYGRNNSFYAEAIADC 115

Query: 17  LDFIK 3
           L+FIK
Sbjct: 116 LEFIK 120


>ref|XP_002317632.1| hypothetical protein POPTR_0011s14830g [Populus trichocarpa]
           gi|222860697|gb|EEE98244.1| hypothetical protein
           POPTR_0011s14830g [Populus trichocarpa]
          Length = 129

 Score = 69.7 bits (169), Expect = 7e-10
 Identities = 50/126 (39%), Positives = 61/126 (48%), Gaps = 7/126 (5%)
 Frame = -2

Query: 359 SKMDYDRVRRRCYG------VKGFRINL-RRFSVGRIRRKFMYLFRLLGRWKCSYSNALS 201
           S M Y RV +R +        +GFR+   RRFSV R+R           RW+ SY  A+ 
Sbjct: 2   SHMRYTRVGKRSHHRHGASCTRGFRLKYPRRFSVQRLR----------ARWRSSYGRAVQ 51

Query: 200 SLKRNIITXXXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIAD 21
            LKR +            N+   +     C Y    D  DQYRFRS   SNSF SEAIAD
Sbjct: 52  YLKRGVNRNSSIIERCGSNERGFMMDATSCHYMGKVD--DQYRFRSFGRSNSFYSEAIAD 109

Query: 20  CLDFIK 3
           CL+FIK
Sbjct: 110 CLEFIK 115


>ref|XP_012458451.1| PREDICTED: uncharacterized protein LOC105779221 [Gossypium
           raimondii] gi|763808209|gb|KJB75111.1| hypothetical
           protein B456_012G025100 [Gossypium raimondii]
          Length = 130

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 49/123 (39%), Positives = 65/123 (52%), Gaps = 3/123 (2%)
 Frame = -2

Query: 362 FSKMDYDRVRRRCYGVKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNI 183
           F  M Y RV +R  G++     LRRFSV  +R +F+YLF  + RW+ SY  AL S+ +  
Sbjct: 3   FRHMGYTRVGKRSKGLR-----LRRFSVQGLRARFLYLFNQISRWRSSYGRALRSIIKKT 57

Query: 182 ---ITXXXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLD 12
              I        R++    N H+ L  + S  S         SL HSNSF SEAI+DCL+
Sbjct: 58  GGDIMAIRNNSSRSWRSRTN-HVPLTNSCSLRS---------SLGHSNSFYSEAISDCLE 107

Query: 11  FIK 3
           FIK
Sbjct: 108 FIK 110


>gb|KHN35934.1| hypothetical protein glysoja_003057 [Glycine soja]
          Length = 142

 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 47/123 (38%), Positives = 62/123 (50%), Gaps = 6/123 (4%)
 Frame = -2

Query: 353 MDYDRVRR-----RCYG-VKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLK 192
           M Y+RV        C+G  +GFR+NLRR    R+R++F +L RL   WK SYS AL  LK
Sbjct: 4   MSYNRVSSGRGSSSCHGKCRGFRLNLRRLYFLRLRKRFTFLLRLFDMWKLSYSQALQLLK 63

Query: 191 RNIITXXXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLD 12
           +  +        RNY+      +            H   R  S   +NSF +EAIADCL+
Sbjct: 64  K--VLRRKSGFKRNYSNSSRSGL----VRDERIKGHADCRVSSYGRNNSFYAEAIADCLE 117

Query: 11  FIK 3
           FIK
Sbjct: 118 FIK 120


>ref|XP_007203185.1| hypothetical protein PRUPE_ppa025168mg [Prunus persica]
           gi|462398716|gb|EMJ04384.1| hypothetical protein
           PRUPE_ppa025168mg [Prunus persica]
          Length = 146

 Score = 68.2 bits (165), Expect = 2e-09
 Identities = 45/114 (39%), Positives = 63/114 (55%), Gaps = 8/114 (7%)
 Frame = -2

Query: 320 GVKGFRINLRRFSVGRI-RRKFMYLFRLLGRWKCSYSNALSSLKRNIITXXXXXXXR--- 153
           G +GFR+N RRFSV R+ R +F+ LFR L   +CSY  AL SLK+ +             
Sbjct: 22  GSRGFRLNPRRFSVSRLLRARFVCLFRFL---RCSYGQALQSLKKGMSRSSRPSAGSGPS 78

Query: 152 NYNKDDNIHMELPCAYSYS----SDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
           N  ++++    +    ++     S+  D  R RS   SNSF +EAIADCL+FIK
Sbjct: 79  NIKRNNSSSRRILVTETHQNKARSEPADYCRLRSFARSNSFYAEAIADCLEFIK 132


>gb|KHN41798.1| hypothetical protein glysoja_003538 [Glycine soja]
          Length = 143

 Score = 64.3 bits (155), Expect = 3e-08
 Identities = 44/124 (35%), Positives = 60/124 (48%), Gaps = 7/124 (5%)
 Frame = -2

Query: 353 MDYDRVRR------RCYG-VKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSL 195
           M Y+RV         C+G  +GFR+NLRR    R+R++F ++  +   WK SYS AL  L
Sbjct: 4   MSYNRVSSGRASSSSCHGKCRGFRLNLRRLYFLRLRKRFTFILSIFDSWKLSYSQALQLL 63

Query: 194 KRNIITXXXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCL 15
           K   +        RNY+      +            H   R  S   +NSF +EAIADCL
Sbjct: 64  KN--VFRRKSGFKRNYSNSSRSGL----VRDEGIKGHTDCRASSYGRNNSFYAEAIADCL 117

Query: 14  DFIK 3
           +FIK
Sbjct: 118 EFIK 121


>ref|XP_003631927.1| PREDICTED: uncharacterized protein LOC100854263 [Vitis vinifera]
          Length = 125

 Score = 64.3 bits (155), Expect = 3e-08
 Identities = 51/118 (43%), Positives = 66/118 (55%), Gaps = 3/118 (2%)
 Frame = -2

Query: 347 YDRVRRRCY-GVKGFRINLRRFSVG-RIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITX 174
           Y RV RR + G +GFR+N +RFSV  R+R + MYLFRL G+       A+ SLKR I   
Sbjct: 8   YKRVGRRSWHGGRGFRLNPKRFSVLIRVRVRLMYLFRLCGQ-------AVESLKRGISRS 60

Query: 173 XXXXXXRNYNKDD-NIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
                  + ++   ++ ME       S D     RFRS   +NSF SEAIADCL+FIK
Sbjct: 61  RSTKKGSDSSRSRMSLMME-------SGD----CRFRSFGRTNSFYSEAIADCLEFIK 107


>gb|KCW53312.1| hypothetical protein EUGRSUZ_J02566 [Eucalyptus grandis]
          Length = 169

 Score = 63.9 bits (154), Expect = 4e-08
 Identities = 42/112 (37%), Positives = 57/112 (50%), Gaps = 7/112 (6%)
 Frame = -2

Query: 317 VKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNII-----TXXXXXXXR 153
           ++GFRIN RR SV R+R +F  LFR+L R + SY  AL  LK +       +        
Sbjct: 34  IRGFRINPRRLSVKRLRVRFASLFRVLSRCRFSYGQALKLLKSSCFFCRSRSRGWGGGGI 93

Query: 152 NYNKDDNIHMELPCAYSYSSDRHDQY--RFRSLTHSNSFCSEAIADCLDFIK 3
             +  +     L  A+  +      Y  + R  + SNSF SEAIADCL+FIK
Sbjct: 94  KRSPSNGSRRNLVVAHVKAEPGRADYGSKLRCYSRSNSFYSEAIADCLEFIK 145


>gb|KHN38849.1| hypothetical protein glysoja_014841 [Glycine soja]
          Length = 141

 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 36/104 (34%), Positives = 51/104 (49%)
 Frame = -2

Query: 314 KGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITXXXXXXXRNYNKDD 135
           +GFR+N R+  V R+R++F +   L   WK SY  A+  LK+ +          + N   
Sbjct: 24  RGFRLNPRKLYVLRLRKRFNFFLGLFDSWKLSYGEAIQLLKKVVCRKSGLKRNNSNNSTR 83

Query: 134 NIHMELPCAYSYSSDRHDQYRFRSLTHSNSFCSEAIADCLDFIK 3
           +   E            D  + RS   SNSF +EAIADCL+FIK
Sbjct: 84  SFVRE------EKIKGQDDCKMRSFGRSNSFYAEAIADCLEFIK 121


>ref|XP_003629053.1| hypothetical protein MTR_8g072610 [Medicago truncatula]
           gi|355523075|gb|AET03529.1| hypothetical protein
           MTR_8g072610 [Medicago truncatula]
          Length = 125

 Score = 59.7 bits (143), Expect = 7e-07
 Identities = 44/119 (36%), Positives = 60/119 (50%), Gaps = 2/119 (1%)
 Frame = -2

Query: 353 MDYDRVRRRCYGVKGFRINLRRFSVGRIRRKFMYLFRLLGRWKCSYSNALSSLKRNIITX 174
           M Y+RV       KGFR+N R+F V R+R++F +  RL    K SY +AL  LK+ +   
Sbjct: 4   MSYNRVSNGSN--KGFRLNPRKFYVLRLRKRFNFFLRLFNNLKLSYGDALQMLKK-VFCR 60

Query: 173 XXXXXXRNYNKDDNIHMELPCAYSYSSDRHDQY-RFR-SLTHSNSFCSEAIADCLDFIK 3
                  N ++   +  E           HD Y + R S   SNSF +EAI DCL+FIK
Sbjct: 61  KIGFKRNNSSRRSLVRDE-------EVKGHDDYWKMRSSYGRSNSFYAEAIEDCLEFIK 112


Top