BLASTX nr result

ID: Mentha25_contig00037338

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00037338
         (702 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37877.1| hypothetical protein MIMGU_mgv1a000071mg [Mimulus...   171   2e-40
ref|XP_002276690.1| PREDICTED: uncharacterized protein LOC100248...   139   7e-31
emb|CBI37935.3| unnamed protein product [Vitis vinifera]              124   3e-26
ref|XP_002305483.2| hypothetical protein POPTR_0004s17490g [Popu...   112   9e-23
ref|XP_006487402.1| PREDICTED: uncharacterized protein LOC102615...   106   7e-21
ref|XP_006487400.1| PREDICTED: uncharacterized protein LOC102615...   106   7e-21
ref|XP_007200948.1| hypothetical protein PRUPE_ppa000049mg [Prun...   105   1e-20
ref|XP_002529253.1| conserved hypothetical protein [Ricinus comm...   103   4e-20
gb|EXB36837.1| hypothetical protein L484_003222 [Morus notabilis]     100   5e-19
ref|XP_006423585.1| hypothetical protein CICLE_v10030126mg, part...    98   2e-18
ref|XP_006367335.1| PREDICTED: uncharacterized protein LOC102601...    94   3e-17
ref|XP_006282534.1| hypothetical protein CARUB_v10003970mg [Caps...    91   4e-16
ref|XP_007041938.1| Urb2/Npa2, putative isoform 5 [Theobroma cac...    90   8e-16
ref|XP_007041937.1| Urb2/Npa2, putative isoform 4 [Theobroma cac...    90   8e-16
ref|XP_007041936.1| Urb2/Npa2, putative isoform 3 [Theobroma cac...    90   8e-16
ref|XP_007041935.1| Urb2/Npa2, putative isoform 2 [Theobroma cac...    90   8e-16
ref|XP_007041934.1| Urb2/Npa2, putative isoform 1 [Theobroma cac...    90   8e-16
ref|NP_194744.2| uncharacterized protein [Arabidopsis thaliana] ...    89   1e-15
emb|CAB43850.1| hypothetical protein [Arabidopsis thaliana] gi|7...    89   1e-15
ref|XP_004150076.1| PREDICTED: uncharacterized protein LOC101208...    89   2e-15

>gb|EYU37877.1| hypothetical protein MIMGU_mgv1a000071mg [Mimulus guttatus]
          Length = 1929

 Score =  171 bits (434), Expect = 2e-40
 Identities = 102/229 (44%), Positives = 141/229 (61%)
 Frame = +3

Query: 9    SLNSYQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFPLPWLLKSMSVAIEFQHAFP 188
            S N YQIF+LLVTCRK L  LA+AS K +V+GS    C  PLPWLLKS+S  I  Q+ FP
Sbjct: 1170 SHNPYQIFRLLVTCRKVLPTLALASGKVNVSGSLK--CSLPLPWLLKSLSAVIGVQNTFP 1227

Query: 189  EDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISKQSDLSE 368
            ED AFEA+ A+FS L Y+S+ +L+A+++Q+ H I S++S R  RR        K+ +L  
Sbjct: 1228 EDNAFEAKVAIFSMLHYTSYAWLLASKDQFHHEIGSILSDRKLRR--------KRKNLKP 1279

Query: 369  GDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKLSSIL 548
            G +   P  +    +S+++L   L++++ KS TTF++  + K      G  DLNKLSS +
Sbjct: 1280 GTVE--PDISECNLQSVLQLTDTLDENMHKSLTTFKDEFLHK------GCQDLNKLSSTI 1331

Query: 549  ACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            ACFQGLLWG+AS      T+D+ S     S   K+M RI S V   M+F
Sbjct: 1332 ACFQGLLWGLAS------TLDNKSFRMKLSNNTKMMTRINSSVHSCMNF 1374


>ref|XP_002276690.1| PREDICTED: uncharacterized protein LOC100248664 [Vitis vinifera]
          Length = 2129

 Score =  139 bits (351), Expect = 7e-31
 Identities = 74/236 (31%), Positives = 144/236 (61%), Gaps = 7/236 (2%)
 Frame = +3

Query: 9    SLNSYQIFQLLVTCRKALQILAVASTKDDVNGSRS------PLCLFPLPWLLKSMSVAIE 170
            S N Y++++L ++CR+ L+ L +A  ++ +  S+S      P   FP+ WLLKS+SV + 
Sbjct: 1270 SHNHYELYRLFLSCRRTLKHLIMAFCEEKMEASQSSLTSIFPEVSFPVLWLLKSVSVMVG 1329

Query: 171  FQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSV-RNARRQKAADRIS 347
             QH F ED A + R   FS +D +S+VFLM +++Q+ H ++  ++V ++   Q  +D + 
Sbjct: 1330 LQHTFSEDRASQFRYMSFSLMDQTSYVFLMFSKSQFSHVVHFSMNVKKSCAEQLNSDLVH 1389

Query: 348  KQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDL 527
            ++S L+E D  S   +   A ++++ +A+AL++  +    + ++A  +K++E   G++DL
Sbjct: 1390 EESHLTETDPCSDSSKAVDAWKNVVLVAEALKEQTENLLISLKDALCNKRVE--VGTVDL 1447

Query: 528  NKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            N+LSS+++CFQG +WG+ASA       + +  M+   +  +  +++  C++V+ DF
Sbjct: 1448 NRLSSLVSCFQGFMWGLASAMNHIDVKECDDEMKLLKWKNEPFSKLNLCINVFTDF 1503


>emb|CBI37935.3| unnamed protein product [Vitis vinifera]
          Length = 1831

 Score =  124 bits (311), Expect = 3e-26
 Identities = 71/235 (30%), Positives = 131/235 (55%), Gaps = 6/235 (2%)
 Frame = +3

Query: 9    SLNSYQIFQLLVTCRKALQILAVASTKDDVNGSRS------PLCLFPLPWLLKSMSVAIE 170
            S N Y++++L ++CR+ L+ L +A  ++ +  S+S      P   FP+ WLLKS+SV + 
Sbjct: 1103 SHNHYELYRLFLSCRRTLKHLIMAFCEEKMEASQSSLTSIFPEVSFPVLWLLKSVSVMVG 1162

Query: 171  FQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISK 350
             QH F ED A + R   FS +D +S+VFLM +++Q+ H                      
Sbjct: 1163 LQHTFSEDRASQFRYMSFSLMDQTSYVFLMFSKSQFSHV--------------------- 1201

Query: 351  QSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLN 530
             S L+E D  S   +   A ++++ +A+AL++  +    + ++A  +K++E   G++DLN
Sbjct: 1202 -SHLTETDPCSDSSKAVDAWKNVVLVAEALKEQTENLLISLKDALCNKRVE--VGTVDLN 1258

Query: 531  KLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +LSS+++CFQG +WG+ASA       + +  M+   +  +  +++  C++V+ DF
Sbjct: 1259 RLSSLVSCFQGFMWGLASAMNHIDVKECDDEMKLLKWKNEPFSKLNLCINVFTDF 1313


>ref|XP_002305483.2| hypothetical protein POPTR_0004s17490g [Populus trichocarpa]
            gi|550341234|gb|EEE85994.2| hypothetical protein
            POPTR_0004s17490g [Populus trichocarpa]
          Length = 2070

 Score =  112 bits (281), Expect = 9e-23
 Identities = 66/235 (28%), Positives = 128/235 (54%), Gaps = 6/235 (2%)
 Frame = +3

Query: 9    SLNSYQIFQLLVTCRKALQILAVASTKDDVNGSRSPLC------LFPLPWLLKSMSVAIE 170
            S   Y++ +LLV CR+AL+ L +A  ++ V  + S L       +  + WL +S+SV   
Sbjct: 1244 SHKQYELLRLLVACRRALKCLIMAYCEEKVRTTHSALIPVLFEDVHSVLWLSRSVSVVFR 1303

Query: 171  FQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISK 350
             Q    ED A E    +FS +D++S+VFL  ++ Q   A+ S+++ +    Q  +D   +
Sbjct: 1304 LQETLSEDKACEVADMIFSLMDHTSYVFLTLSKYQCPSAV-SIIAEKPYTEQLNSDVTQE 1362

Query: 351  QSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLN 530
            QS ++E            +C+S++ +A++L++  Q    + ++A  ++K       +D N
Sbjct: 1363 QSSVNESLPCLDTSNDVESCKSVILIAESLKEQAQDLIISLKDAHCNEKSSDEI-DVDWN 1421

Query: 531  KLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            KLSS+++CF G +WG+ASA   +   DS+ + +   +  +++++I  C++ + DF
Sbjct: 1422 KLSSMVSCFSGFMWGLASALDHSNATDSDYKAKLLRWKCEVISKISHCINAFADF 1476


>ref|XP_006487402.1| PREDICTED: uncharacterized protein LOC102615643 isoform X3 [Citrus
            sinensis]
          Length = 1811

 Score =  106 bits (265), Expect = 7e-21
 Identities = 72/241 (29%), Positives = 127/241 (52%), Gaps = 11/241 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP------LPWLLKSMS 158
            GSL S   Y++F+L V+CR+ L+ + +AS +D    S+S L          + WL KSM 
Sbjct: 1240 GSLFSNKYYELFRLFVSCRRTLKNIIMASCEDKTECSQSSLIPMLSEGSDFVLWLFKSMV 1299

Query: 159  VAIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVV-SVRNARRQKAA 335
            + I  Q A  + +  E R  +FS +D +SH+FL  ++  +  A+NS + S ++ + Q ++
Sbjct: 1300 LVIGLQEAVSDHLFHEIRDMIFSLMDLTSHIFLTLSKLHFSSALNSFIFSQKDFKEQSSS 1359

Query: 336  DRISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAG 515
            D  S  S+L E        +   A + I+ + + LE+  Q    +  +A  +     +  
Sbjct: 1360 DVASGNSNLKESSSRVDSSKDVDAWKCILFVLENLEEQAQSILMSVEDALCEGNSGILLK 1419

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKL-MARIKSCVDVYMD 692
             ++LNKLSS+++CF G+LWG+AS   VN      S    S ++  + +++I   ++V+ D
Sbjct: 1420 DVNLNKLSSVVSCFNGILWGLASV--VNHINAEKSDKVKSLWWKSIHISKINHSINVFSD 1477

Query: 693  F 695
            F
Sbjct: 1478 F 1478


>ref|XP_006487400.1| PREDICTED: uncharacterized protein LOC102615643 isoform X1 [Citrus
            sinensis] gi|568868198|ref|XP_006487401.1| PREDICTED:
            uncharacterized protein LOC102615643 isoform X2 [Citrus
            sinensis]
          Length = 2093

 Score =  106 bits (265), Expect = 7e-21
 Identities = 72/241 (29%), Positives = 127/241 (52%), Gaps = 11/241 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP------LPWLLKSMS 158
            GSL S   Y++F+L V+CR+ L+ + +AS +D    S+S L          + WL KSM 
Sbjct: 1240 GSLFSNKYYELFRLFVSCRRTLKNIIMASCEDKTECSQSSLIPMLSEGSDFVLWLFKSMV 1299

Query: 159  VAIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVV-SVRNARRQKAA 335
            + I  Q A  + +  E R  +FS +D +SH+FL  ++  +  A+NS + S ++ + Q ++
Sbjct: 1300 LVIGLQEAVSDHLFHEIRDMIFSLMDLTSHIFLTLSKLHFSSALNSFIFSQKDFKEQSSS 1359

Query: 336  DRISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAG 515
            D  S  S+L E        +   A + I+ + + LE+  Q    +  +A  +     +  
Sbjct: 1360 DVASGNSNLKESSSRVDSSKDVDAWKCILFVLENLEEQAQSILMSVEDALCEGNSGILLK 1419

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKL-MARIKSCVDVYMD 692
             ++LNKLSS+++CF G+LWG+AS   VN      S    S ++  + +++I   ++V+ D
Sbjct: 1420 DVNLNKLSSVVSCFNGILWGLASV--VNHINAEKSDKVKSLWWKSIHISKINHSINVFSD 1477

Query: 693  F 695
            F
Sbjct: 1478 F 1478


>ref|XP_007200948.1| hypothetical protein PRUPE_ppa000049mg [Prunus persica]
            gi|462396348|gb|EMJ02147.1| hypothetical protein
            PRUPE_ppa000049mg [Prunus persica]
          Length = 2128

 Score =  105 bits (263), Expect = 1e-20
 Identities = 62/231 (26%), Positives = 120/231 (51%), Gaps = 6/231 (2%)
 Frame = +3

Query: 21   YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLF-----PLPWLLKSMSVAIEFQHAF 185
            +++F+L V+CRKAL+ + +A      +   S   +F     P+ WL KS+   +  + + 
Sbjct: 1277 HELFRLFVSCRKALKYIILACEGKTADSQTSHTLVFFEDSFPILWLYKSVYAVVGLEESL 1336

Query: 186  PEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAIN-SVVSVRNARRQKAADRISKQSDL 362
            P+D        + S +D++ +VFL  ++ Q  HA++ S V+  NA        + + S L
Sbjct: 1337 PKDNCRPVSDMILSLMDHTFYVFLTLSKYQSNHAVHFSKVAELNA------GLVHEHSSL 1390

Query: 363  SEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKLSS 542
            SE D+     +   A +S+  +AK+L++ +Q      ++A  + K+      L+LNK SS
Sbjct: 1391 SESDMCLDSSDYIEAWKSVTIIAKSLKEQMQSLLVNLKDALCNGKVGIGVDGLNLNKFSS 1450

Query: 543  ILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +++C  G LWG+A       +  S+ ++ SS   ++ ++ +  C+DV+ +F
Sbjct: 1451 LISCISGFLWGLACFVNHTDSRSSDHKVNSSRQKLEPISELHLCIDVFAEF 1501


>ref|XP_002529253.1| conserved hypothetical protein [Ricinus communis]
            gi|223531289|gb|EEF33131.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 2057

 Score =  103 bits (258), Expect = 4e-20
 Identities = 64/240 (26%), Positives = 130/240 (54%), Gaps = 9/240 (3%)
 Frame = +3

Query: 3    TGSLNSYQIFQLL---VTCRKALQILAVASTKDDVNGSRSPLC------LFPLPWLLKSM 155
            TG+++SY +F+LL   ++CR+AL+ L +A +++    S S +       LF + WL KS+
Sbjct: 1251 TGAMSSYNLFELLRLLISCRRALKYLVMALSEEKTITSHSSVTPVLSEGLFSVLWLFKSV 1310

Query: 156  SVAIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAA 335
             + +  Q  F +D + E    +FS +D++S++FL  +++    AI S++S    + Q   
Sbjct: 1311 FMVVGLQETFSKDDSDEIGEMIFSLMDHTSYLFLELSKHSCTCAIRSIISKEPHKEQTNV 1370

Query: 336  DRISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAG 515
              + + S  +E D       +    ++I+ +A++L++  Q      ++A  ++K+     
Sbjct: 1371 RSVQEVSTSNESDSRVDSWGSDKGWKNILVMAESLKEQTQGLLIYLKDALCNEKLGNGVD 1430

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
             ++LN LSS+++   G LWG++SA      +DS+ ++E      +  ++I  C++V+ DF
Sbjct: 1431 LVNLNNLSSMVSWISGFLWGVSSALNHTNKIDSD-KVEILKLNFEPSSQIGLCINVFTDF 1489


>gb|EXB36837.1| hypothetical protein L484_003222 [Morus notabilis]
          Length = 2053

 Score =  100 bits (249), Expect = 5e-19
 Identities = 63/241 (26%), Positives = 121/241 (50%), Gaps = 9/241 (3%)
 Frame = +3

Query: 3    TGSLNSYQIFQLLVTCRKALQILAVASTKDDVNGSRSPLC-LFP-----LPWLLKSMSVA 164
            + S   Y++ +L + CRK ++ + +AS ++    S++ L  ++P     + WL KS+   
Sbjct: 1218 SNSHKGYKLLRLFLCCRKVMKYIIMASCEEKTGASQTSLTQMYPGKSLSVMWLFKSLYAV 1277

Query: 165  IEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNA-RRQKAADR 341
            +  Q    +D   +    +FS LD++ +VFL   +  + HA+ SV + +N+   Q  A  
Sbjct: 1278 VGIQELLSKDSGTQVDNTIFSLLDHTLYVFLTLNQYHFNHAVQSVKNPQNSCNEQHNAGV 1337

Query: 342  ISKQSDL--SEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAG 515
              +QSDL  S+  L SC          +  +AK+L + +Q      ++   D+ +  +  
Sbjct: 1338 NYEQSDLTGSKRCLSSCSYVEP--WNGVFCVAKSLREQMQSLLIPLKDVLCDENVGVLTN 1395

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
             ++LN+ SS+++CF G LWG+AS         S+ ++  S +  K    I  C++V+ +F
Sbjct: 1396 VVNLNRFSSVISCFSGFLWGLASVMKQTDVRSSDHKVILSWWKEKSNTEINLCINVFEEF 1455

Query: 696  A 698
            +
Sbjct: 1456 S 1456


>ref|XP_006423585.1| hypothetical protein CICLE_v10030126mg, partial [Citrus clementina]
            gi|557525519|gb|ESR36825.1| hypothetical protein
            CICLE_v10030126mg, partial [Citrus clementina]
          Length = 2119

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 66/229 (28%), Positives = 118/229 (51%), Gaps = 8/229 (3%)
 Frame = +3

Query: 33   QLLVTCRKALQILAVASTKDDVNGSRSPLCLFP------LPWLLKSMSVAIEFQHAFPED 194
            +L V+CR+ L+ + +AS +D    S+S L          + WL KSM + I  Q A  + 
Sbjct: 1314 RLFVSCRRTLKNIIMASCEDKTECSQSSLIPMLSEGSDFVLWLFKSMVLVIGLQEAVSDH 1373

Query: 195  VAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVV-SVRNARRQKAADRISKQSDLSEG 371
            +  E R  +FS +D +SH+FL  ++  +  A+NS++ S ++   Q ++D  S  S+L E 
Sbjct: 1374 LFHEIRDMIFSLVDLTSHIFLTLSKLHFSRALNSLIFSPKDFTEQSSSDVASGNSNLKES 1433

Query: 372  DLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKLSSILA 551
                   +   A + I+ + + LE+  Q    +   A  +     +   ++LNKLSS+++
Sbjct: 1434 SSRVDSSKDVDAWKCILFVLENLEEQAQSILMSVENALCEGNSGILLKDVNLNKLSSVVS 1493

Query: 552  CFQGLLWGIASASGVNRTVDSNSRMESSSYYVKL-MARIKSCVDVYMDF 695
            CF G+LWG+AS   VN      S    S ++  + +++I   ++V+ DF
Sbjct: 1494 CFNGILWGLASV--VNHINAEKSDKVKSIWWKSIHISKINLSINVFSDF 1540


>ref|XP_006367335.1| PREDICTED: uncharacterized protein LOC102601821 [Solanum tuberosum]
          Length = 2086

 Score = 94.4 bits (233), Expect = 3e-17
 Identities = 61/225 (27%), Positives = 121/225 (53%), Gaps = 4/225 (1%)
 Frame = +3

Query: 21   YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLF----PLPWLLKSMSVAIEFQHAFP 188
            Y++ +LLVTCR+  + L +AS K          CL     P+ WLLKS+S    F     
Sbjct: 1248 YELLRLLVTCRRTFKNLLMASCKGKKGHQSLLACLLSERSPVFWLLKSLSAVTGFLSVIS 1307

Query: 189  EDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISKQSDLSE 368
            ++ + + +  +FS +D++S + L   ++Q++ AI ++ + ++     ++    K++ L E
Sbjct: 1308 QETSPQLKHMIFSLMDHTSFILLTLFKDQFE-AIFALTAGKSYGGAISSVDGHKETVLRE 1366

Query: 369  GDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKLSSIL 548
                S   + + A  S+  +A  L    Q+   +   A V++K++ +AG  +++K+S ++
Sbjct: 1367 NGPRSDFSDNNNAWRSVSSVAGTLTRHAQELLDSLNLAVVNRKVDDLAGLQEMDKVSPLV 1426

Query: 549  ACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDV 683
            +CFQG L G+ SA        S++ +ES+S+ +K+   I++C D+
Sbjct: 1427 SCFQGFLCGLVSAMDSLDIKRSSTLIESTSHNLKMKPCIETCADL 1471


>ref|XP_006282534.1| hypothetical protein CARUB_v10003970mg [Capsella rubella]
            gi|482551239|gb|EOA15432.1| hypothetical protein
            CARUB_v10003970mg [Capsella rubella]
          Length = 1963

 Score = 90.9 bits (224), Expect = 4e-16
 Identities = 69/233 (29%), Positives = 110/233 (47%), Gaps = 2/233 (0%)
 Frame = +3

Query: 3    TGSLNSYQIFQLLVTCRKALQILAVASTKDDVNGSRSPLC--LFPLPWLLKSMSVAIEFQ 176
            TG + +  +F L +TCRK L+ + + S    +  S+ PL   L    WL KS   A+  Q
Sbjct: 1191 TGDMQN--LFSLFITCRKTLKSILIVSCDKVLGASKLPLSDSLLLASWLFKSAQAAVTCQ 1248

Query: 177  HAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISKQS 356
                 D   +AR  +FS +D++S++F   ++NQ+  A+              +D     S
Sbjct: 1249 MNIRNDFTGKARDTVFSLMDHTSYMFQTVSKNQFSKAL------------PLSDGQLISS 1296

Query: 357  DLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKL 536
            +LSEG       +  +  ES+ E A+ L   L     TFR    D+K  +   +L LNKL
Sbjct: 1297 ELSEGT-----GQVDLIFESLTEQAETL---LNALIVTFR----DEKTAFECENLILNKL 1344

Query: 537  SSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            + I ACF GLLWG+ASA    R +  N +     +  +  +++   + V  +F
Sbjct: 1345 APIFACFSGLLWGLASAVS-QRDMHKNHQNTKLKWKSEQFSKLSCIIHVLSNF 1396


>ref|XP_007041938.1| Urb2/Npa2, putative isoform 5 [Theobroma cacao]
            gi|508705873|gb|EOX97769.1| Urb2/Npa2, putative isoform 5
            [Theobroma cacao]
          Length = 1387

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 62/240 (25%), Positives = 120/240 (50%), Gaps = 10/240 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP-----LPWLLKSMSV 161
            G+L+S   Y++FQL V CR+ L+ + +AS ++ + GS S L         + WL KS+S 
Sbjct: 866  GALSSNGCYELFQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSFVIWLFKSVST 925

Query: 162  AIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSV-VSVRNARRQKAAD 338
             I       ED   E    +F  +D++S+VF   ++ Q+  A++ +  S +  ++Q  + 
Sbjct: 926  VIGVLDTMMEDCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSG 985

Query: 339  RISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDK-KIEYMAG 515
             +  +S L++    S   + S A  S+   A+ L++  +      + A  D  K+     
Sbjct: 986  VVGDESILNQPGSCSNYLKDSEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNK 1045

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +++ NK+S  ++CF G LWG+ASA             +   +  + ++++  C++V++DF
Sbjct: 1046 AVNTNKMSFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDF 1105


>ref|XP_007041937.1| Urb2/Npa2, putative isoform 4 [Theobroma cacao]
            gi|508705872|gb|EOX97768.1| Urb2/Npa2, putative isoform 4
            [Theobroma cacao]
          Length = 1533

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 62/240 (25%), Positives = 120/240 (50%), Gaps = 10/240 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP-----LPWLLKSMSV 161
            G+L+S   Y++FQL V CR+ L+ + +AS ++ + GS S L         + WL KS+S 
Sbjct: 866  GALSSNGCYELFQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSFVIWLFKSVST 925

Query: 162  AIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSV-VSVRNARRQKAAD 338
             I       ED   E    +F  +D++S+VF   ++ Q+  A++ +  S +  ++Q  + 
Sbjct: 926  VIGVLDTMMEDCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSG 985

Query: 339  RISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDK-KIEYMAG 515
             +  +S L++    S   + S A  S+   A+ L++  +      + A  D  K+     
Sbjct: 986  VVGDESILNQPGSCSNYLKDSEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNK 1045

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +++ NK+S  ++CF G LWG+ASA             +   +  + ++++  C++V++DF
Sbjct: 1046 AVNTNKMSFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDF 1105


>ref|XP_007041936.1| Urb2/Npa2, putative isoform 3 [Theobroma cacao]
            gi|508705871|gb|EOX97767.1| Urb2/Npa2, putative isoform 3
            [Theobroma cacao]
          Length = 1777

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 62/240 (25%), Positives = 120/240 (50%), Gaps = 10/240 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP-----LPWLLKSMSV 161
            G+L+S   Y++FQL V CR+ L+ + +AS ++ + GS S L         + WL KS+S 
Sbjct: 1256 GALSSNGCYELFQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSFVIWLFKSVST 1315

Query: 162  AIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSV-VSVRNARRQKAAD 338
             I       ED   E    +F  +D++S+VF   ++ Q+  A++ +  S +  ++Q  + 
Sbjct: 1316 VIGVLDTMMEDCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSG 1375

Query: 339  RISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDK-KIEYMAG 515
             +  +S L++    S   + S A  S+   A+ L++  +      + A  D  K+     
Sbjct: 1376 VVGDESILNQPGSCSNYLKDSEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNK 1435

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +++ NK+S  ++CF G LWG+ASA             +   +  + ++++  C++V++DF
Sbjct: 1436 AVNTNKMSFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDF 1495


>ref|XP_007041935.1| Urb2/Npa2, putative isoform 2 [Theobroma cacao]
            gi|508705870|gb|EOX97766.1| Urb2/Npa2, putative isoform 2
            [Theobroma cacao]
          Length = 2065

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 62/240 (25%), Positives = 120/240 (50%), Gaps = 10/240 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP-----LPWLLKSMSV 161
            G+L+S   Y++FQL V CR+ L+ + +AS ++ + GS S L         + WL KS+S 
Sbjct: 1256 GALSSNGCYELFQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSFVIWLFKSVST 1315

Query: 162  AIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSV-VSVRNARRQKAAD 338
             I       ED   E    +F  +D++S+VF   ++ Q+  A++ +  S +  ++Q  + 
Sbjct: 1316 VIGVLDTMMEDCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSG 1375

Query: 339  RISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDK-KIEYMAG 515
             +  +S L++    S   + S A  S+   A+ L++  +      + A  D  K+     
Sbjct: 1376 VVGDESILNQPGSCSNYLKDSEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNK 1435

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +++ NK+S  ++CF G LWG+ASA             +   +  + ++++  C++V++DF
Sbjct: 1436 AVNTNKMSFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDF 1495


>ref|XP_007041934.1| Urb2/Npa2, putative isoform 1 [Theobroma cacao]
            gi|508705869|gb|EOX97765.1| Urb2/Npa2, putative isoform 1
            [Theobroma cacao]
          Length = 2090

 Score = 89.7 bits (221), Expect = 8e-16
 Identities = 62/240 (25%), Positives = 120/240 (50%), Gaps = 10/240 (4%)
 Frame = +3

Query: 6    GSLNS---YQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFP-----LPWLLKSMSV 161
            G+L+S   Y++FQL V CR+ L+ + +AS ++ + GS S L         + WL KS+S 
Sbjct: 1280 GALSSNGCYELFQLFVACRRTLKNIIMASCEEKIEGSLSSLLSVAEGSSFVIWLFKSVST 1339

Query: 162  AIEFQHAFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSV-VSVRNARRQKAAD 338
             I       ED   E    +F  +D++S+VF   ++ Q+  A++ +  S +  ++Q  + 
Sbjct: 1340 VIGVLDTMMEDCLPEFELKIFLLMDHTSYVFFAISKYQFGQAVHFIGNSEKPCKKQPYSG 1399

Query: 339  RISKQSDLSEGDLLSCPKETSVACESIMELAKALEDDLQKSFTTFREASVDK-KIEYMAG 515
             +  +S L++    S   + S A  S+   A+ L++  +      + A  D  K+     
Sbjct: 1400 VVGDESILNQPGSCSNYLKDSEALRSLSITAENLKEQAESLLDPLKGALDDNAKVGDGNK 1459

Query: 516  SLDLNKLSSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            +++ NK+S  ++CF G LWG+ASA             +   +  + ++++  C++V++DF
Sbjct: 1460 AVNTNKMSFAISCFGGFLWGLASALNQGDEKSGEVNAKYLRWKCEPLSKLNICINVFLDF 1519


>ref|NP_194744.2| uncharacterized protein [Arabidopsis thaliana]
            gi|332660326|gb|AEE85726.1| uncharacterized protein
            AT4G30150 [Arabidopsis thaliana]
          Length = 2009

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 68/225 (30%), Positives = 111/225 (49%), Gaps = 2/225 (0%)
 Frame = +3

Query: 27   IFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFPL--PWLLKSMSVAIEFQHAFPEDVA 200
            +F L  TCRK L+ +A+ S    +  ++ PL    L   WL KS   A   Q  F  DV 
Sbjct: 1241 LFSLFSTCRKTLKSIAMISCDKVLGATKLPLSDSSLLASWLFKSAQAAT-CQVRFRNDVT 1299

Query: 201  FEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISKQSDLSEGDLL 380
             +AR ALFS +D++S++FL  ++ Q+  A+              +D     S++SEG   
Sbjct: 1300 GKARDALFSLMDHTSYMFLTVSKYQFSKAL------------PFSDEKLISSEISEGT-- 1345

Query: 381  SCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKLSSILACFQ 560
                + ++  E++ E A+ L + L+ +F        D+K  +   SL LNKL+ I +CF 
Sbjct: 1346 ---GQANLIIENLTEQAETLLNALRATFR-------DEKTAFKCESLILNKLTPIFSCFS 1395

Query: 561  GLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            GLLWG+ASA   NR +  N +     +  +  +++   + V  +F
Sbjct: 1396 GLLWGLASAVS-NRDMQKNHQNAKLRWKSEQFSKLSRIIHVLSNF 1439


>emb|CAB43850.1| hypothetical protein [Arabidopsis thaliana]
            gi|7269915|emb|CAB81008.1| hypothetical protein
            [Arabidopsis thaliana]
          Length = 1966

 Score = 89.4 bits (220), Expect = 1e-15
 Identities = 68/225 (30%), Positives = 111/225 (49%), Gaps = 2/225 (0%)
 Frame = +3

Query: 27   IFQLLVTCRKALQILAVASTKDDVNGSRSPLCLFPL--PWLLKSMSVAIEFQHAFPEDVA 200
            +F L  TCRK L+ +A+ S    +  ++ PL    L   WL KS   A   Q  F  DV 
Sbjct: 1241 LFSLFSTCRKTLKSIAMISCDKVLGATKLPLSDSSLLASWLFKSAQAAT-CQVRFRNDVT 1299

Query: 201  FEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISKQSDLSEGDLL 380
             +AR ALFS +D++S++FL  ++ Q+  A+              +D     S++SEG   
Sbjct: 1300 GKARDALFSLMDHTSYMFLTVSKYQFSKAL------------PFSDEKLISSEISEGT-- 1345

Query: 381  SCPKETSVACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKLSSILACFQ 560
                + ++  E++ E A+ L + L+ +F        D+K  +   SL LNKL+ I +CF 
Sbjct: 1346 ---GQANLIIENLTEQAETLLNALRATFR-------DEKTAFKCESLILNKLTPIFSCFS 1395

Query: 561  GLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVDVYMDF 695
            GLLWG+ASA   NR +  N +     +  +  +++   + V  +F
Sbjct: 1396 GLLWGLASAVS-NRDMQKNHQNAKLRWKSEQFSKLSRIIHVLSNF 1439


>ref|XP_004150076.1| PREDICTED: uncharacterized protein LOC101208263 [Cucumis sativus]
          Length = 1981

 Score = 88.6 bits (218), Expect = 2e-15
 Identities = 55/228 (24%), Positives = 113/228 (49%), Gaps = 6/228 (2%)
 Frame = +3

Query: 15   NSYQIFQLLVTCRKALQILAVASTKDDVNGSRSPLCL-----FPLPWLLKSMSVAIEFQH 179
            N +++ +L  +CRKAL+ +  A   +  NG  S + +     FP  WL KS+S+  + Q 
Sbjct: 1165 NKFELLKLFASCRKALKYIFRAYC-EAANGQSSSVPILSENQFPFLWLFKSLSLVNQIQE 1223

Query: 180  AFPEDVAFEARAALFSFLDYSSHVFLMATRNQYQHAINSVVSVRNARRQKAADRISKQSD 359
              PE    + +  +FS +D++ ++FL  ++ Q++ A+ + V V    +++  D      D
Sbjct: 1224 VSPEGTDRQIKDIIFSLMDHTLYLFLTTSKYQFKEALCTSVKVNKPCKEQPQDVC---QD 1280

Query: 360  LSEGDLLSCPKETSV-ACESIMELAKALEDDLQKSFTTFREASVDKKIEYMAGSLDLNKL 536
            L++GD L      SV  C S ++++ +L++ ++    + ++++    +       D+ K 
Sbjct: 1281 LNDGDDLCLDSIHSVEVCSSAIQMSNSLKEQVESELISLKKSNF--AVGDAKNRADICKF 1338

Query: 537  SSILACFQGLLWGIASASGVNRTVDSNSRMESSSYYVKLMARIKSCVD 680
            +S+ +C  G LWG+AS          N  M S     +  + + +C++
Sbjct: 1339 NSLASCLNGFLWGLASVDDHTDLRKGNHHMRSMKLKREYSSELNNCMN 1386

