BLASTX nr result

ID: Catharanthus22_contig00009194 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00009194
         (1205 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006357258.1| PREDICTED: uncharacterized protein LOC102597...   357   5e-96
ref|XP_004238763.1| PREDICTED: uncharacterized protein LOC101255...   347   7e-93
ref|XP_006438571.1| hypothetical protein CICLE_v10032129mg [Citr...   333   1e-88
ref|XP_006483250.1| PREDICTED: uncharacterized protein LOC102616...   330   5e-88
gb|EMJ25322.1| hypothetical protein PRUPE_ppa015818mg [Prunus pe...   325   2e-86
ref|XP_003631283.1| PREDICTED: uncharacterized protein LOC100853...   317   8e-84
ref|XP_002311878.1| predicted protein [Populus trichocarpa] gi|5...   313   7e-83
ref|XP_004297341.1| PREDICTED: uncharacterized protein LOC101291...   313   9e-83
ref|XP_002520148.1| conserved hypothetical protein [Ricinus comm...   311   3e-82
gb|EXB63806.1| hypothetical protein L484_021078 [Morus notabilis...   308   2e-81
gb|EOY00205.1| Uncharacterized protein TCM_009967 [Theobroma cacao]   308   4e-81
ref|XP_004135199.1| PREDICTED: uncharacterized protein LOC101204...   305   2e-80
gb|ESW29572.1| hypothetical protein PHAVU_002G080900g [Phaseolus...   295   3e-77
ref|XP_004489960.1| PREDICTED: uncharacterized protein LOC101489...   290   6e-76
ref|XP_006395772.1| hypothetical protein EUTSA_v10004613mg [Eutr...   290   8e-76
ref|NP_001240072.1| uncharacterized protein LOC100813905 [Glycin...   287   7e-75
gb|EPS63886.1| hypothetical protein M569_10896, partial [Genlise...   286   2e-74
ref|XP_006291472.1| hypothetical protein CARUB_v10017608mg [Caps...   281   3e-73
ref|NP_178363.1| uncharacterized protein [Arabidopsis thaliana] ...   278   2e-72
gb|AAT68342.1| hypothetical protein At2g02590 [Arabidopsis thali...   278   2e-72

>ref|XP_006357258.1| PREDICTED: uncharacterized protein LOC102597342 [Solanum tuberosum]
          Length = 313

 Score =  357 bits (916), Expect = 5e-96
 Identities = 195/312 (62%), Positives = 222/312 (71%), Gaps = 1/312 (0%)
 Frame = +3

Query: 153  MAYSLARPWMLLTLTPGKSHFKFTPPSPRICFPSFPDQQIRIKFKSFPTFYTHNHLGHFE 332
            MA SL+    LLT +   S+  F+   P+I   S  ++QI +K +SFP   T +HLG   
Sbjct: 1    MATSLSHSPQLLTFSYRNSNPSFS--FPKIHSFSHQNRQIHLKTQSFPILQTFSHLGR-- 56

Query: 333  TSSFPIARAIRVASND-FLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAK 509
                 + R IR + +D FLE IEE+E +L ++E P KFL WVL WASVS+G+FAVSG+AK
Sbjct: 57   -----VQRVIRASDDDSFLEVIEEEEGLLANEEKPLKFLFWVLLWASVSVGLFAVSGDAK 111

Query: 510  AAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTV 689
            AAADSIRAS FGVK A +LR  GWPDEAVVFALATLPVIELRGAIPVGYWLQLKPT+LTV
Sbjct: 112  AAADSIRASGFGVKVANSLRSSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTVLTV 171

Query: 690  LSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLF 869
            LSVLGNMVPVPFI+LYLK  A FLAG NKSAS  LD LFERAK KAGPV+EFQWLGLMLF
Sbjct: 172  LSVLGNMVPVPFIVLYLKKLAIFLAGTNKSASKLLDLLFERAKDKAGPVKEFQWLGLMLF 231

Query: 870  VAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXX 1049
            VAVPFPGTGAWTGAI+AS+LDM FWS +SA                    KYA       
Sbjct: 232  VAVPFPGTGAWTGAIIASVLDMPFWSAVSANFVGVVLAGLLVNLLVNLGLKYAIITGIIL 291

Query: 1050 XXXSTFMWSILR 1085
               STFMWSILR
Sbjct: 292  FIISTFMWSILR 303


>ref|XP_004238763.1| PREDICTED: uncharacterized protein LOC101255587 [Solanum
            lycopersicum]
          Length = 314

 Score =  347 bits (889), Expect = 7e-93
 Identities = 192/307 (62%), Positives = 217/307 (70%), Gaps = 7/307 (2%)
 Frame = +3

Query: 186  LTLTPGKSHFKFTPPSPRICFP---SFPDQ--QIRIKFKSFPTFYTHNHLGHFETSSFPI 350
            L+ +P    F +   +P   FP   SF  Q  +I++K +SFP   T +HLG        +
Sbjct: 5    LSHSPQLWTFSYRNTNPSFSFPKIHSFLHQNPKIQLKTQSFPILQTFSHLGR-------V 57

Query: 351  ARAIRVASND-FLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAA-DS 524
             R IR +S+D FLE IEE+E +L + E P KFL WVL WASVS+G+FAVSG+AKAAA DS
Sbjct: 58   QRLIRASSSDSFLEVIEEEEGLLANDEKPLKFLFWVLLWASVSVGLFAVSGDAKAAAADS 117

Query: 525  IRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLG 704
            IRAS FGVK A ALR  GWPDEAVVFALATLPVIELRGAIPVGYWLQLKP++LTVLSVLG
Sbjct: 118  IRASGFGVKVANALRSSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPSVLTVLSVLG 177

Query: 705  NMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPF 884
            NMVPVPFI+LYLK  A FLAG NKSAS  LD LFERAK KAGPV+EFQWLGLMLFVAVPF
Sbjct: 178  NMVPVPFIVLYLKKLAIFLAGTNKSASKLLDLLFERAKDKAGPVKEFQWLGLMLFVAVPF 237

Query: 885  PGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXST 1064
            PGTGAWTGAI+AS+LDM FWS +SA                    KYA          ST
Sbjct: 238  PGTGAWTGAIIASVLDMPFWSAVSANFVGVVLAGLLVNLLVNLGLKYAIITGIILFIIST 297

Query: 1065 FMWSILR 1085
            FMWSILR
Sbjct: 298  FMWSILR 304


>ref|XP_006438571.1| hypothetical protein CICLE_v10032129mg [Citrus clementina]
            gi|557540767|gb|ESR51811.1| hypothetical protein
            CICLE_v10032129mg [Citrus clementina]
          Length = 322

 Score =  333 bits (853), Expect = 1e-88
 Identities = 175/270 (64%), Positives = 200/270 (74%), Gaps = 1/270 (0%)
 Frame = +3

Query: 279  KFKSFPTFYTHNHLGHFETSSFPIARAIRVASNDFLETIEEKEKIL-VSKETPTKFLLWV 455
            K K F TF +  HLG   +S FP   +   +S+ F + I E+E+IL V++ETP KFLLWV
Sbjct: 45   KSKPFSTFQSRRHLGPLVSSCFPTRASF--SSDMFPDNITEEERILPVTEETPLKFLLWV 102

Query: 456  LFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELR 635
            +FWAS+S+  F+ SG+A AA DSIRAS+ G+K ATALR  GWPDEAVVFALATLPV+ELR
Sbjct: 103  VFWASLSLVWFSTSGDANAAVDSIRASAIGLKIATALRRSGWPDEAVVFALATLPVLELR 162

Query: 636  GAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERA 815
            GAIPVGYW+QLKP LLTVLSVLGNMVPVPFIILYLK FA+FLAGKN+SAS FLD LF++A
Sbjct: 163  GAIPVGYWMQLKPVLLTVLSVLGNMVPVPFIILYLKKFASFLAGKNRSASQFLDMLFQKA 222

Query: 816  KAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXX 995
            K KAGPVEEFQWLGLMLFVAVPFPGTGAWTGA +A+ILDM FWS LSA            
Sbjct: 223  KEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAFIAAILDMPFWSALSANFFGVVIAGLLV 282

Query: 996  XXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                    KYA          STFMWS LR
Sbjct: 283  NLLVNLGLKYAIVTGAILFIISTFMWSTLR 312


>ref|XP_006483250.1| PREDICTED: uncharacterized protein LOC102616695 [Citrus sinensis]
          Length = 322

 Score =  330 bits (847), Expect = 5e-88
 Identities = 174/270 (64%), Positives = 199/270 (73%), Gaps = 1/270 (0%)
 Frame = +3

Query: 279  KFKSFPTFYTHNHLGHFETSSFPIARAIRVASNDFLETIEEKEKIL-VSKETPTKFLLWV 455
            K K F TF +  HLG   +S FP   +   +S+ F + I E+E+IL V++ETP KFLLWV
Sbjct: 45   KSKPFSTFQSRRHLGPLVSSCFPTRASF--SSDMFPDNITEEERILPVTEETPLKFLLWV 102

Query: 456  LFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELR 635
            +FWAS+S+  F+ SG+A AA DSIRAS+ G+K ATALR   WPDEAVVFALATLPV+ELR
Sbjct: 103  VFWASLSLVWFSTSGDANAAVDSIRASAIGLKIATALRRSSWPDEAVVFALATLPVLELR 162

Query: 636  GAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERA 815
            GAIPVGYW+QLKP LLTVLSVLGNMVPVPFIILYLK FA+FLAGKN+SAS FLD LF++A
Sbjct: 163  GAIPVGYWMQLKPVLLTVLSVLGNMVPVPFIILYLKKFASFLAGKNRSASQFLDMLFQKA 222

Query: 816  KAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXX 995
            K KAGPVEEFQWLGLMLFVAVPFPGTGAWTGA +A+ILDM FWS LSA            
Sbjct: 223  KEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAFIAAILDMPFWSALSANFFGVVIAGLLV 282

Query: 996  XXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                    KYA          STFMWS LR
Sbjct: 283  NLLVNLGLKYAIVTGAILFIISTFMWSTLR 312


>gb|EMJ25322.1| hypothetical protein PRUPE_ppa015818mg [Prunus persica]
          Length = 325

 Score =  325 bits (833), Expect = 2e-86
 Identities = 184/304 (60%), Positives = 211/304 (69%), Gaps = 6/304 (1%)
 Frame = +3

Query: 192  LTPGKSHFKFTPPSPRICFPSFPDQQIRIKFKSFP--TFYTHNHLGHFETSSFPIARAIR 365
            L+ GK+ F+F+P   R   PS     I+  F S     F T + L     +S     A R
Sbjct: 16   LSLGKTRFRFSPKHGR---PSIA-HSIQPPFNSNADLNFQTLSPLNPLLANSPLSHAATR 71

Query: 366  VASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAA----DSIRA 533
            V+S+ FL+  E+ + + V +E P KF+ WVL WASVS+ +FA SG+A AAA    DSIRA
Sbjct: 72   VSSHGFLDKDEKDDILPVFEERPVKFVFWVLVWASVSLALFAASGDANAAAAAAADSIRA 131

Query: 534  SSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMV 713
            SSFG+K A+ALRG GWPDEAVVFALATLPVIELRGAIPVGYWLQLKP +LTVLSVLGNMV
Sbjct: 132  SSFGLKIASALRGSGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPVMLTVLSVLGNMV 191

Query: 714  PVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGT 893
            PVPFIILYLK FA+FLAGKNK+A+ FLD LF RAK KAGPVEEFQWLGLMLFVAVPFPGT
Sbjct: 192  PVPFIILYLKRFASFLAGKNKAAARFLDILFVRAKEKAGPVEEFQWLGLMLFVAVPFPGT 251

Query: 894  GAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMW 1073
            GAWTGAI+ASILDM FW+ +SA                    KYA          STFMW
Sbjct: 252  GAWTGAIIASILDMPFWAAVSANFFGVVLAGLLVNLLVNLGLKYAIITGIILFIISTFMW 311

Query: 1074 SILR 1085
            SILR
Sbjct: 312  SILR 315


>ref|XP_003631283.1| PREDICTED: uncharacterized protein LOC100853229 [Vitis vinifera]
            gi|296086436|emb|CBI32025.3| unnamed protein product
            [Vitis vinifera]
          Length = 311

 Score =  317 bits (811), Expect = 8e-84
 Identities = 171/277 (61%), Positives = 193/277 (69%), Gaps = 2/277 (0%)
 Frame = +3

Query: 261  DQQIRIKFKSFPTFYTHN--HLGHFETSSFPIARAIRVASNDFLETIEEKEKILVSKETP 434
            D Q R+ FK  P+    N  H  H  T S P +   + + ++FL+ + + E        P
Sbjct: 32   DNQHRL-FKPNPSLALRNSRHSRHPLTISPPHSTPAQASPDEFLDKVGDFEG------PP 84

Query: 435  TKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALAT 614
             KFL WVLFWAS+S+  FA SG+A AA DSIRASSFG+K A+ALR  GWPDEAVV ALAT
Sbjct: 85   VKFLFWVLFWASLSVAWFAASGDANAATDSIRASSFGLKVASALRSSGWPDEAVVVALAT 144

Query: 615  LPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFL 794
            LPVIELRGAIPVGYW+QLKP  LT+LSVLGNM+PVPFIILYLK FATFLAGKNKSAS FL
Sbjct: 145  LPVIELRGAIPVGYWMQLKPATLTILSVLGNMIPVPFIILYLKRFATFLAGKNKSASRFL 204

Query: 795  DKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXX 974
            D LFE+AK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGAI+ASILDM FW  +SA     
Sbjct: 205  DMLFEKAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWPAVSANFFGV 264

Query: 975  XXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                           KYA          STFMWS+LR
Sbjct: 265  VLAGLLVNLLVNLGLKYAIVTGVILFFISTFMWSVLR 301


>ref|XP_002311878.1| predicted protein [Populus trichocarpa]
            gi|566189003|ref|XP_006378162.1| hypothetical protein
            POPTR_0010s04340g [Populus trichocarpa]
            gi|550329033|gb|ERP55959.1| hypothetical protein
            POPTR_0010s04340g [Populus trichocarpa]
          Length = 310

 Score =  313 bits (803), Expect = 7e-83
 Identities = 163/267 (61%), Positives = 190/267 (71%)
 Frame = +3

Query: 285  KSFPTFYTHNHLGHFETSSFPIARAIRVASNDFLETIEEKEKILVSKETPTKFLLWVLFW 464
            KS P+F   + L      S   + + R +SN F +T ++KE +   +  P KFL WV FW
Sbjct: 43   KSKPSFLAFHRLDGLRFLSS--STSTRASSNGFFDTTQDKEILPSFEPKPAKFLFWVAFW 100

Query: 465  ASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAI 644
            AS+S+  FA SG+A AA DSI+AS FG+K ATA R LGWPDEAVVFALATLPV+ELRGAI
Sbjct: 101  ASLSLVWFAASGDANAAVDSIKASGFGLKIATAFRRLGWPDEAVVFALATLPVLELRGAI 160

Query: 645  PVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAK 824
            PVGYW+QLKP +LT+LSV+GNMVPVPFIILYLK FA+FLAG+N+ AS FLD LFE AK K
Sbjct: 161  PVGYWMQLKPIMLTILSVVGNMVPVPFIILYLKPFASFLAGRNQPASRFLDMLFENAKEK 220

Query: 825  AGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXX 1004
            +GPV+EFQWLGLMLFVAVPFPGTGAWTGAI+ASILDM FWS +SA               
Sbjct: 221  SGPVKEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSAVSANFCGVVLAGLLVNLL 280

Query: 1005 XXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                 KYA          STFMWSILR
Sbjct: 281  VNLGLKYATITGIILFFISTFMWSILR 307


>ref|XP_004297341.1| PREDICTED: uncharacterized protein LOC101291815 [Fragaria vesca
            subsp. vesca]
          Length = 314

 Score =  313 bits (802), Expect = 9e-83
 Identities = 165/235 (70%), Positives = 176/235 (74%), Gaps = 5/235 (2%)
 Frame = +3

Query: 396  EEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAA-----DSIRASSFGVKGAT 560
            EE E + + +E P KF LWVLFWASVS+  FA SG+A AAA     DSIRASSFGVK A 
Sbjct: 70   EEDEVLSIFEEKPVKFGLWVLFWASVSLAWFAASGDANAAANAAAADSIRASSFGVKIAN 129

Query: 561  ALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYL 740
            ALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQL P +LTVL+VLGNMVPVP IILYL
Sbjct: 130  ALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLTPVMLTVLAVLGNMVPVPIIILYL 189

Query: 741  KSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVA 920
            K FATFLAGKN + S FLD LFE+AK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGAI+A
Sbjct: 190  KRFATFLAGKNNATSRFLDLLFEKAKKKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIIA 249

Query: 921  SILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
            SILDM FWS +SA                    KYA          STFMWSILR
Sbjct: 250  SILDMPFWSAVSANFFGVVLAGLLVNLLVNLGLKYAIVTGIALFFISTFMWSILR 304


>ref|XP_002520148.1| conserved hypothetical protein [Ricinus communis]
           gi|223540640|gb|EEF42203.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 401

 Score =  311 bits (797), Expect = 3e-82
 Identities = 165/263 (62%), Positives = 197/263 (74%), Gaps = 3/263 (1%)
 Frame = +3

Query: 180 MLLTLTPGKSHFKFTPPSPRIC--FPSFPDQ-QIRIKFKSFPTFYTHNHLGHFETSSFPI 350
           +LL+ +  K++ +F P   +    +PS   + Q  +K K F +F T N       S+ P 
Sbjct: 8   LLLSASFRKTYLRFLPNHVKNLNLYPSIAQKKQSFVKSKPFLSFQTVNFPSCNPLSAAPF 67

Query: 351 ARAIRVASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIR 530
                 +S+ FL + E++E +   +E P KFL WV+FWASVS+  FAVS +A AA DSI+
Sbjct: 68  TTTRASSSHGFLNSAEDEEILPSFEEKPVKFLFWVVFWASVSLAWFAVSRDANAAVDSIK 127

Query: 531 ASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNM 710
           ASSFG+K A +LRGLGWPDEAVVFALATLPVIELRGAIPVGYW+QLKP +LTVLSV GNM
Sbjct: 128 ASSFGLKIANSLRGLGWPDEAVVFALATLPVIELRGAIPVGYWMQLKPLILTVLSVAGNM 187

Query: 711 VPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPG 890
           VPVPFIILYLK FA+FLAG+N+SAS FLD LFE AK KA PVEEFQWLGLMLFVAVPFPG
Sbjct: 188 VPVPFIILYLKRFASFLAGRNQSASRFLDMLFENAKQKADPVEEFQWLGLMLFVAVPFPG 247

Query: 891 TGAWTGAIVASILDMTFWSGLSA 959
           TGAWTGAI+ASILDM FW  +SA
Sbjct: 248 TGAWTGAIIASILDMPFWPAVSA 270


>gb|EXB63806.1| hypothetical protein L484_021078 [Morus notabilis]
            gi|587990949|gb|EXC75168.1| hypothetical protein
            L484_000412 [Morus notabilis]
          Length = 323

 Score =  308 bits (790), Expect = 2e-81
 Identities = 174/303 (57%), Positives = 200/303 (66%), Gaps = 9/303 (2%)
 Frame = +3

Query: 204  KSHFKFTPPS--PRICFP--SFPDQQIRIKFKSFPTFYTHNHLGHFETSSFPIARAIRVA 371
            K H + +P    P I FP  S P+        +  TF T  HL     SS     +   +
Sbjct: 21   KIHRRISPNHKVPTIIFPKKSLPNPN------ALATFQTSPHLKPPRASS----SSSYSS 70

Query: 372  SNDFLETIEEKEKILV-----SKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRAS 536
            S    +T EE +  ++       + P KF  WVLFWAS+S+  FA S +A AAADSI+AS
Sbjct: 71   SGGLHDTAEENDTDIIITASFDHQRPVKFAFWVLFWASLSLLWFATSKDANAAADSIKAS 130

Query: 537  SFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVP 716
            SFG+K A ALRG GWPDEAVVFALATLP++ELRGAIPVGYW+QLKP +LTVLSVLGNMVP
Sbjct: 131  SFGLKIANALRGSGWPDEAVVFALATLPLLELRGAIPVGYWMQLKPVVLTVLSVLGNMVP 190

Query: 717  VPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTG 896
            VPFIILYLKSFA+FLAGKNK+AS  +D LF+ AKAKAGPVEEFQWLGLMLFVAVPFPGTG
Sbjct: 191  VPFIILYLKSFASFLAGKNKTASRLIDLLFKNAKAKAGPVEEFQWLGLMLFVAVPFPGTG 250

Query: 897  AWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWS 1076
            AWTGA +A+ILDM FWSG SA                    KYA          STFMWS
Sbjct: 251  AWTGAFIAAILDMPFWSGFSANFIGVVLAGLLVNLLVNLGLKYAIITGIILFFVSTFMWS 310

Query: 1077 ILR 1085
            ILR
Sbjct: 311  ILR 313


>gb|EOY00205.1| Uncharacterized protein TCM_009967 [Theobroma cacao]
          Length = 316

 Score =  308 bits (788), Expect = 4e-81
 Identities = 176/317 (55%), Positives = 207/317 (65%), Gaps = 6/317 (1%)
 Frame = +3

Query: 153  MAYSLARPWMLLTLTPGKSHFKFTPPSPRICFPSFPDQQIRIKFKSFPTFYTHNHLGHFE 332
            MA S +    LL L P          +PRI FP+    QI         F + +   +++
Sbjct: 1    MAASASAATSLLVLAPS-----LRKTNPRI-FPT----QIHWPTTRSKQFLSRSKFQNWQ 50

Query: 333  TSSFPIARAIRVASNDFLETIE---EKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGE 503
                P+    R +SN FL+T     EKE +   +E P KFL WV+ WAS+S+  FA S +
Sbjct: 51   RFPLPLT-ITRASSNVFLDTAHTSREKEILPTFEEKPVKFLFWVVLWASLSLVWFAASSD 109

Query: 504  AKA---AADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKP 674
            A A   AADSIRASSFG+K A+ALRG GWPDEAVVF LATLP++ELRGAIPVGYW+QLKP
Sbjct: 110  ANASAAAADSIRASSFGLKIASALRGSGWPDEAVVFTLATLPILELRGAIPVGYWMQLKP 169

Query: 675  TLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWL 854
             LLT+LS+LGNMVPVPFIILYLK FATFLAG+N+SAS  L+ +FE+AK KAGPVEEFQWL
Sbjct: 170  RLLTILSILGNMVPVPFIILYLKRFATFLAGRNQSASGLLNMIFEKAKEKAGPVEEFQWL 229

Query: 855  GLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXX 1034
            GLMLFVAVPFPGTGAWTG I+ASILDM FWS +SA                    KYA  
Sbjct: 230  GLMLFVAVPFPGTGAWTGGIIASILDMPFWSAVSANFFGVVLAGLLVNLLVNMGLKYAIV 289

Query: 1035 XXXXXXXXSTFMWSILR 1085
                    STFMWSILR
Sbjct: 290  TGIILFFISTFMWSILR 306


>ref|XP_004135199.1| PREDICTED: uncharacterized protein LOC101204187 [Cucumis sativus]
            gi|449478468|ref|XP_004155326.1| PREDICTED:
            uncharacterized LOC101204187 [Cucumis sativus]
          Length = 315

 Score =  305 bits (782), Expect = 2e-80
 Identities = 166/254 (65%), Positives = 184/254 (72%), Gaps = 7/254 (2%)
 Frame = +3

Query: 345  PIARAIRV-------ASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGE 503
            P++R  R+       +SN FLE   + E I   +E P K LL VLFWAS+S+  FA SG+
Sbjct: 55   PVSRTSRIIRTVPRSSSNGFLE---DDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASGD 111

Query: 504  AKAAADSIRASSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLL 683
            AKAA DSIRAS+FG+K A+AL+  GWP EAVVFALATLPVIELRGAIPVGYW+QLKP  L
Sbjct: 112  AKAAVDSIRASNFGLKIASALQNSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPVAL 171

Query: 684  TVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLM 863
            TVLSVLGNMVPVPFIILYLK FATFLAG+N SAS FLD LF+RAK KA PVEEFQWLGLM
Sbjct: 172  TVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLM 231

Query: 864  LFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXX 1043
            LFVAVPFPGTGAWTGAI+ASILDM FWSG+SA                    K A     
Sbjct: 232  LFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLGLKEAIVTGV 291

Query: 1044 XXXXXSTFMWSILR 1085
                 STFMWSILR
Sbjct: 292  ILFIISTFMWSILR 305


>gb|ESW29572.1| hypothetical protein PHAVU_002G080900g [Phaseolus vulgaris]
          Length = 317

 Score =  295 bits (754), Expect = 3e-77
 Identities = 155/278 (55%), Positives = 187/278 (67%), Gaps = 4/278 (1%)
 Frame = +3

Query: 264  QQIRIKFKSFPTFYTHNHLGHFETS----SFPIARAIRVASNDFLETIEEKEKILVSKET 431
            Q+ R   KS  +F T +   HF  S      P+ +  R +S++  +  +E E++L+S E 
Sbjct: 31   QKGRESLKSNFSFSTLHGSPHFRPSIAISPSPLTQT-RASSDECFDPADEAERLLLSGEK 89

Query: 432  PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611
            P KF  WV+FWAS+S+  FAVS +A AA DSI+AS FG+  A +LR LGWPD  VVF LA
Sbjct: 90   PVKFAFWVIFWASLSLAWFAVSKDANAAVDSIKASGFGLNIANSLRKLGWPDGVVVFTLA 149

Query: 612  TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791
            TLPV+ELRGAIPVGYW+QL PT LT+LS+LGNMVPVPFI+LYLK FA+FLA ++   S  
Sbjct: 150  TLPVLELRGAIPVGYWMQLNPTTLTILSILGNMVPVPFIVLYLKRFASFLAARSSYVSRL 209

Query: 792  LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971
            LD LFE AK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGA +A+ILDM FW+ +SA    
Sbjct: 210  LDMLFENAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAFIAAILDMPFWAAVSANFFG 269

Query: 972  XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                            KYA          STFMWSILR
Sbjct: 270  VVFAGLLVNLIVNLGLKYAIITGIILFFVSTFMWSILR 307


>ref|XP_004489960.1| PREDICTED: uncharacterized protein LOC101489688 [Cicer arietinum]
          Length = 320

 Score =  290 bits (743), Expect = 6e-76
 Identities = 148/242 (61%), Positives = 175/242 (72%)
 Frame = +3

Query: 360  IRVASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASS 539
            IRV+S + L+ I+E E++++  E P KF +WV+FWAS+S+  FA S +A AA DSI+AS 
Sbjct: 66   IRVSSVECLDAIDEPERLMLYDEKPVKFAIWVIFWASMSLAWFAYSKDANAAVDSIKASG 125

Query: 540  FGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPV 719
            FG+K A +LR  G PD  VVF LATLPV+ELRGAIPVGYWLQL P  LTV+S++GNMVPV
Sbjct: 126  FGLKIANSLRKFGLPDWVVVFTLATLPVLELRGAIPVGYWLQLNPATLTVVSIIGNMVPV 185

Query: 720  PFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGA 899
            PFIILYLK FA+FLA K+ SAS FLD LF+ AK KAGPVEEFQWLGLMLFVAVPFPGTGA
Sbjct: 186  PFIILYLKRFASFLASKSPSASRFLDILFKNAKEKAGPVEEFQWLGLMLFVAVPFPGTGA 245

Query: 900  WTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSI 1079
            W+GAI+ASILDM FW  +SA                    KYA          STFMW+I
Sbjct: 246  WSGAIIASILDMPFWIAVSANFFGVVFAGLLVNLLVNLGLKYAIITGIVLFFVSTFMWTI 305

Query: 1080 LR 1085
            LR
Sbjct: 306  LR 307


>ref|XP_006395772.1| hypothetical protein EUTSA_v10004613mg [Eutrema salsugineum]
            gi|557092411|gb|ESQ33058.1| hypothetical protein
            EUTSA_v10004613mg [Eutrema salsugineum]
          Length = 318

 Score =  290 bits (742), Expect = 8e-76
 Identities = 153/242 (63%), Positives = 177/242 (73%), Gaps = 3/242 (1%)
 Frame = +3

Query: 369  ASNDFLETIEEKEKILVSKE---TPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASS 539
            +SN FL   EE+E+I+        P KF + V+FWAS S+  FA SG+AKAAADSI++SS
Sbjct: 66   SSNGFLGKTEEEEEIIKLPSIGVNPLKFAICVVFWASFSLLWFARSGDAKAAADSIKSSS 125

Query: 540  FGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPV 719
            FG++ A  LR  GWPDEAVVFALATLPVIELRGAIPVGYW+QLKPT+LT  SVLGNMVPV
Sbjct: 126  FGLRIAATLRRFGWPDEAVVFALATLPVIELRGAIPVGYWMQLKPTVLTFFSVLGNMVPV 185

Query: 720  PFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGA 899
            P IILYLK FA+FLAGK+++AS  L+ LF+RAK KAGPVEEFQWLGLMLFVAVPFPGTGA
Sbjct: 186  PVIILYLKKFASFLAGKSRTASKLLEILFKRAKEKAGPVEEFQWLGLMLFVAVPFPGTGA 245

Query: 900  WTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSI 1079
            WTGAI+ASILDM FWS +S+                    K A          STFMWS+
Sbjct: 246  WTGAIIASILDMPFWSAVSSNFCGVVLAGLLVNFLVNLGLKEAIVAGIALFFVSTFMWSV 305

Query: 1080 LR 1085
            LR
Sbjct: 306  LR 307


>ref|NP_001240072.1| uncharacterized protein LOC100813905 [Glycine max]
            gi|255635459|gb|ACU18082.1| unknown [Glycine max]
          Length = 321

 Score =  287 bits (734), Expect = 7e-75
 Identities = 160/304 (52%), Positives = 193/304 (63%), Gaps = 11/304 (3%)
 Frame = +3

Query: 207  SHFKFTPP----SPRICFPSFPDQQIRIKFKSFPTFYTHNHLGHFET------SSFPIAR 356
            S F F  P    SP    P    Q+ +   KS  +F T N   HF        S+  + R
Sbjct: 10   SPFHFRKPHNRVSPLNAHPLILIQKGKQSLKSNFSFSTLNASPHFRPPIAIAPSTLTLTR 69

Query: 357  AIRVASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADS-IRA 533
            A   +S++  +   E +++L+S+E P  F  WV+FWAS+S+  FAVS +A AA +S I+A
Sbjct: 70   AS--SSDECFDPAGEAQRLLLSEEKPVNFAFWVIFWASLSLAWFAVSRDANAAVESSIKA 127

Query: 534  SSFGVKGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMV 713
            S FG   A +LR LGWPD  VVF LATLPV+ELRGAIPVGYW+QL P  LTVLS+LGNMV
Sbjct: 128  SGFGFNIANSLRKLGWPDWVVVFTLATLPVLELRGAIPVGYWMQLNPVTLTVLSILGNMV 187

Query: 714  PVPFIILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGT 893
            PVPFI+LYLK  A+F+A ++ SAS FLD LFE AK KAGPVEEFQWLGLMLFVAVPFPGT
Sbjct: 188  PVPFIVLYLKKIASFVAARSPSASRFLDMLFENAKEKAGPVEEFQWLGLMLFVAVPFPGT 247

Query: 894  GAWTGAIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMW 1073
            GAWTGA +ASILDM FW+ +SA                    KYA          STFMW
Sbjct: 248  GAWTGAFIASILDMPFWAAVSANFFGVVFAGLLVNLLVNLGLKYAIITGVILFFVSTFMW 307

Query: 1074 SILR 1085
            S+LR
Sbjct: 308  SVLR 311


>gb|EPS63886.1| hypothetical protein M569_10896, partial [Genlisea aurea]
          Length = 243

 Score =  286 bits (731), Expect = 2e-74
 Identities = 151/239 (63%), Positives = 172/239 (71%)
 Frame = +3

Query: 369  ASNDFLETIEEKEKILVSKETPTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGV 548
            AS    E IE+ E  +     P KFLL VL WASVSIG +A SG+AKAA+DSIRAS FG+
Sbjct: 2    ASRGLTEYIEKAEPDV--DVNPAKFLLMVLLWASVSIGFYAFSGDAKAASDSIRASGFGI 59

Query: 549  KGATALRGLGWPDEAVVFALATLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFI 728
            K A+ALR  GWP+EA+VF+LATLPVIELRGAIPVGYWL LKP  LT+LS+LGNMVPVPFI
Sbjct: 60   KVASALRASGWPNEAIVFSLATLPVIELRGAIPVGYWLHLKPLTLTLLSILGNMVPVPFI 119

Query: 729  ILYLKSFATFLAGKNKSASHFLDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTG 908
            +LYLK  AT+L   NK++S FL+ L +RAK KAGPVEEFQWLGLMLFVAVPFPGTGAWTG
Sbjct: 120  LLYLKKLATYLTSDNKTSS-FLEMLLKRAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTG 178

Query: 909  AIVASILDMTFWSGLSAXXXXXXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
            AIVAS+LDM FW G+SA                    K+A          STFMW ILR
Sbjct: 179  AIVASVLDMPFWEGVSANLAGVVLAGLLVNLLVNLGVKHAIFTGVLLFGFSTFMWRILR 237


>ref|XP_006291472.1| hypothetical protein CARUB_v10017608mg [Capsella rubella]
            gi|482560179|gb|EOA24370.1| hypothetical protein
            CARUB_v10017608mg [Capsella rubella]
          Length = 332

 Score =  281 bits (720), Expect = 3e-73
 Identities = 143/218 (65%), Positives = 165/218 (75%)
 Frame = +3

Query: 432  PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611
            P KF + V+ WAS+S+  FA SG+AKAA DSI++SSFG++ A  LR  GWPDEAVVFALA
Sbjct: 104  PVKFAVCVVLWASLSLLWFARSGDAKAATDSIKSSSFGLRIAATLRRFGWPDEAVVFALA 163

Query: 612  TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791
            TLPVIELRGAIPVGYW+QLKPT+LT  SVLGNMVPVPFI+LYLK FA+FLAGK+++AS  
Sbjct: 164  TLPVIELRGAIPVGYWMQLKPTVLTFFSVLGNMVPVPFIVLYLKKFASFLAGKSQTASKL 223

Query: 792  LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971
            LD LF+RAK KAGPVEEFQWLGLMLFVAVPFPGTGAWTGAI+ASIL+M FWS +S+    
Sbjct: 224  LDILFKRAKEKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILEMPFWSAVSSNFCG 283

Query: 972  XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                            K A          STFMWS+LR
Sbjct: 284  VVLAGLLVNLLVNLGLKQAIVAGIALFFVSTFMWSVLR 321


>ref|NP_178363.1| uncharacterized protein [Arabidopsis thaliana]
            gi|3184280|gb|AAC18927.1| putative transport protein
            [Arabidopsis thaliana] gi|21554968|gb|AAM63742.1|
            putative transport protein [Arabidopsis thaliana]
            gi|61742564|gb|AAX55103.1| hypothetical protein At2g02590
            [Arabidopsis thaliana] gi|110741534|dbj|BAE98716.1|
            putative transport protein [Arabidopsis thaliana]
            gi|330250508|gb|AEC05602.1| uncharacterized protein
            AT2G02590 [Arabidopsis thaliana]
          Length = 324

 Score =  278 bits (712), Expect = 2e-72
 Identities = 141/218 (64%), Positives = 165/218 (75%)
 Frame = +3

Query: 432  PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611
            P KF + V+ WAS S+  FA SG+AKAA DSI++SSFG++ A+ LR  GWPDEAVVFALA
Sbjct: 96   PVKFAICVVLWASFSLLWFARSGDAKAATDSIKSSSFGLRIASTLRRFGWPDEAVVFALA 155

Query: 612  TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791
            TLPVIELRGAIPVGYW+QLKP +LT  SVLGNMVPVPFI+LYLK+FA+F+AGK+++AS  
Sbjct: 156  TLPVIELRGAIPVGYWMQLKPVVLTSFSVLGNMVPVPFIVLYLKTFASFVAGKSQTASKL 215

Query: 792  LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971
            LD LF+RAK KAGPVEEF+WLGLMLFVAVPFPGTGAWTGAI+ASILDM FWS +S+    
Sbjct: 216  LDILFKRAKEKAGPVEEFKWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSAVSSNFCG 275

Query: 972  XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                            K A          STFMWS+LR
Sbjct: 276  VVLAGLLVNLLVNLGLKQAIVAGIALFFVSTFMWSVLR 313


>gb|AAT68342.1| hypothetical protein At2g02590 [Arabidopsis thaliana]
          Length = 324

 Score =  278 bits (712), Expect = 2e-72
 Identities = 141/218 (64%), Positives = 165/218 (75%)
 Frame = +3

Query: 432  PTKFLLWVLFWASVSIGIFAVSGEAKAAADSIRASSFGVKGATALRGLGWPDEAVVFALA 611
            P KF + V+ WAS S+  FA SG+AKAA DSI++SSFG++ A+ LR  GWPDEAVVFALA
Sbjct: 96   PVKFAICVVLWASFSLLWFARSGDAKAATDSIKSSSFGLRIASTLRRFGWPDEAVVFALA 155

Query: 612  TLPVIELRGAIPVGYWLQLKPTLLTVLSVLGNMVPVPFIILYLKSFATFLAGKNKSASHF 791
            TLPVIELRGAIPVGYW+QLKP +LT  SVLGNMVPVPFI+LYLK+FA+F+AGK+++AS  
Sbjct: 156  TLPVIELRGAIPVGYWMQLKPVVLTSFSVLGNMVPVPFIVLYLKTFASFVAGKSQTASKL 215

Query: 792  LDKLFERAKAKAGPVEEFQWLGLMLFVAVPFPGTGAWTGAIVASILDMTFWSGLSAXXXX 971
            LD LF+RAK KAGPVEEF+WLGLMLFVAVPFPGTGAWTGAI+ASILDM FWS +S+    
Sbjct: 216  LDILFKRAKEKAGPVEEFKWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSAVSSNFCG 275

Query: 972  XXXXXXXXXXXXXXXXKYAXXXXXXXXXXSTFMWSILR 1085
                            K A          STFMWS+LR
Sbjct: 276  VVLAGLLVNLLVNLGLKQAIVAGIALFFVSTFMWSVLR 313


Top