BLASTX nr result

ID: Akebia23_contig00007701 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00007701
         (1475 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006359817.1| PREDICTED: uncharacterized protein LOC102587...   324   5e-86
ref|XP_004237791.1| PREDICTED: uncharacterized protein LOC101267...   323   9e-86
ref|XP_002276129.1| PREDICTED: uncharacterized protein LOC100245...   319   2e-84
emb|CAN75809.1| hypothetical protein VITISV_004630 [Vitis vinifera]   319   2e-84
ref|XP_006493111.1| PREDICTED: uncharacterized protein LOC102609...   314   7e-83
ref|XP_004138945.1| PREDICTED: uncharacterized protein LOC101210...   313   1e-82
ref|XP_007202319.1| hypothetical protein PRUPE_ppa008110mg [Prun...   313   2e-82
ref|XP_002533350.1| conserved hypothetical protein [Ricinus comm...   312   3e-82
gb|EYU34242.1| hypothetical protein MIMGU_mgv1a010657mg [Mimulus...   309   2e-81
ref|XP_006410313.1| hypothetical protein EUTSA_v10016862mg [Eutr...   309   2e-81
ref|XP_007028457.1| Thioredoxin superfamily protein isoform 1 [T...   308   4e-81
ref|XP_006380250.1| hypothetical protein POPTR_0007s00360g [Popu...   304   6e-80
ref|XP_002309712.2| hypothetical protein POPTR_0007s00360g [Popu...   304   6e-80
ref|XP_003555189.1| PREDICTED: uncharacterized protein LOC100793...   303   1e-79
ref|XP_007142732.1| hypothetical protein PHAVU_007G012300g [Phas...   302   3e-79
ref|XP_002881198.1| hypothetical protein ARALYDRAFT_902219 [Arab...   301   6e-79
ref|XP_004287186.1| PREDICTED: uncharacterized protein LOC101305...   298   3e-78
ref|NP_180743.1| mesophyll-cell RNAi library line 7-like protein...   298   4e-78
ref|XP_006294510.1| hypothetical protein CARUB_v10023543mg [Caps...   297   7e-78
gb|AAO89195.1| hypothetical protein [Arabidopsis thaliana]            296   2e-77

>ref|XP_006359817.1| PREDICTED: uncharacterized protein LOC102587975 [Solanum tuberosum]
          Length = 342

 Score =  324 bits (831), Expect = 5e-86
 Identities = 172/288 (59%), Positives = 203/288 (70%), Gaps = 3/288 (1%)
 Frame = -1

Query: 1277 LEVGSERRRGTRSTKLLRASNEVVEETVFDDGNKKERKG---KSNNKXXXXXXXXXXXXX 1107
            L V   R++G++++KL         E +   GN +E  G   +S++              
Sbjct: 63   LMVHGLRQKGSKASKL---------EELLKFGNGEEDDGDESESDSDGGPKRDDYFHMDE 113

Query: 1106 XXXXEWRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGF 927
                EWRRKIR VI M+P VEEE DP+E+ +KMQKLLADYPLVV              GF
Sbjct: 114  DERREWRRKIRDVIKMSPDVEEEVDPVERRQKMQKLLADYPLVVDEEDPDWPEDADGRGF 173

Query: 926  NLGQFFNKITIKNVXXXXXXXXXXKELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVF 747
            NL QFFNKI+IKNV           ELVWQDD+YIRP+KD+TT EWEETV+KDISPLIV 
Sbjct: 174  NLDQFFNKISIKNVKKDDDENDDDNELVWQDDDYIRPVKDLTTAEWEETVYKDISPLIVL 233

Query: 746  VHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIF 567
            VHNRYKRPKENE VRD+LEKA+HI WN RLPSPRCVAIDAVVE+DLVS LKV+VFPE+IF
Sbjct: 234  VHNRYKRPKENEMVRDELEKAIHIIWNCRLPSPRCVAIDAVVEVDLVSALKVSVFPELIF 293

Query: 566  TKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPS 423
            TKAGKILYREK  RTADELS++MAFFYYGA KPPCL  + ++QE IP+
Sbjct: 294  TKAGKILYREKVSRTADELSKMMAFFYYGAAKPPCLSGIENSQELIPT 341


>ref|XP_004237791.1| PREDICTED: uncharacterized protein LOC101267802 [Solanum
            lycopersicum]
          Length = 342

 Score =  323 bits (829), Expect = 9e-86
 Identities = 173/288 (60%), Positives = 202/288 (70%), Gaps = 3/288 (1%)
 Frame = -1

Query: 1277 LEVGSERRRGTRSTKL---LRASNEVVEETVFDDGNKKERKGKSNNKXXXXXXXXXXXXX 1107
            L V   R +G++++KL   L+  N   EE   DDG++ E      +K             
Sbjct: 63   LTVQGLRLKGSKASKLEELLKFGNG--EE---DDGDESESDSDGGHKRDDYFHMDEDERR 117

Query: 1106 XXXXEWRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGF 927
                 WRRKIR VI M+P VEEE DP+E+ +KMQKLLADYPLVV              GF
Sbjct: 118  E----WRRKIRDVIKMSPDVEEEVDPVERRQKMQKLLADYPLVVDEEDPDWPEDADGRGF 173

Query: 926  NLGQFFNKITIKNVXXXXXXXXXXKELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVF 747
            NL QFFNKI+IKNV           ELVWQDD+YIRP+KD+TT EWEETV+KDISPLIV 
Sbjct: 174  NLDQFFNKISIKNVKKDDDENDDDNELVWQDDDYIRPVKDLTTAEWEETVYKDISPLIVL 233

Query: 746  VHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIF 567
            VHNRYKRPKENE  RD+LEKA+HI WN RLPSPRCVAIDAVVE+DLVS LKV+VFPE+IF
Sbjct: 234  VHNRYKRPKENEMARDELEKAIHIIWNCRLPSPRCVAIDAVVEVDLVSALKVSVFPELIF 293

Query: 566  TKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPS 423
            TKAGKILYREK  RTADELS++MAFFYYGA KPPCL  + ++QE IP+
Sbjct: 294  TKAGKILYREKVSRTADELSKMMAFFYYGAAKPPCLSSIENSQELIPT 341


>ref|XP_002276129.1| PREDICTED: uncharacterized protein LOC100245762 [Vitis vinifera]
            gi|296084374|emb|CBI24762.3| unnamed protein product
            [Vitis vinifera]
          Length = 328

 Score =  319 bits (817), Expect = 2e-84
 Identities = 162/228 (71%), Positives = 179/228 (78%), Gaps = 3/228 (1%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WR KIRQVID NP VEEE DP+ K + MQKLLADYPLVV              GFNL QF
Sbjct: 98   WRSKIRQVIDGNPDVEEEMDPVLKRRMMQKLLADYPLVVEEDDPNWPEDADGRGFNLDQF 157

Query: 911  FNKITIKNVXXXXXXXXXXK---ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVH 741
            F+KITIKNV              E+VWQDDNYIRPIKDI T EWEETV KDISPLIV VH
Sbjct: 158  FDKITIKNVKKDKGDDDNYDSEDEIVWQDDNYIRPIKDIKTAEWEETVLKDISPLIVLVH 217

Query: 740  NRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTK 561
            NRYKRPKEN+K+R++LEKAVHI WN RLPSPRCVAIDAVVE+DLVS L+V+VFPE+IFTK
Sbjct: 218  NRYKRPKENQKIREELEKAVHIIWNCRLPSPRCVAIDAVVEVDLVSALQVSVFPEVIFTK 277

Query: 560  AGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPSRP 417
            AGKIL+REK I+TADELS+IMAFFYYGA KPPCL+  GD+QEAIP  P
Sbjct: 278  AGKILHREKVIQTADELSKIMAFFYYGAAKPPCLNGTGDSQEAIPLVP 325


>emb|CAN75809.1| hypothetical protein VITISV_004630 [Vitis vinifera]
          Length = 324

 Score =  319 bits (817), Expect = 2e-84
 Identities = 162/228 (71%), Positives = 179/228 (78%), Gaps = 3/228 (1%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WR KIRQVID NP VEEE DP+ K + MQKLLADYPLVV              GFNL QF
Sbjct: 94   WRSKIRQVIDGNPDVEEEMDPVLKRRMMQKLLADYPLVVEEDDPNWPEDADGRGFNLDQF 153

Query: 911  FNKITIKNVXXXXXXXXXXK---ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVH 741
            F+KITIKNV              E+VWQDDNYIRPIKDI T EWEETV KDISPLIV VH
Sbjct: 154  FDKITIKNVKKDKGDDDNYDSEDEIVWQDDNYIRPIKDIKTAEWEETVLKDISPLIVLVH 213

Query: 740  NRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTK 561
            NRYKRPKEN+K+R++LEKAVHI WN RLPSPRCVAIDAVVE+DLVS L+V+VFPE+IFTK
Sbjct: 214  NRYKRPKENQKIREELEKAVHIIWNCRLPSPRCVAIDAVVEVDLVSALQVSVFPEVIFTK 273

Query: 560  AGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPSRP 417
            AGKIL+REK I+TADELS+IMAFFYYGA KPPCL+  GD+QEAIP  P
Sbjct: 274  AGKILHREKVIQTADELSKIMAFFYYGAAKPPCLNGTGDSQEAIPLVP 321


>ref|XP_006493111.1| PREDICTED: uncharacterized protein LOC102609941 [Citrus sinensis]
          Length = 351

 Score =  314 bits (804), Expect = 7e-83
 Identities = 160/225 (71%), Positives = 175/225 (77%), Gaps = 2/225 (0%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WRRKIR+VI  +P VEEESDPIEK KKMQKLLADYPLV+              GF+L QF
Sbjct: 120  WRRKIREVIAQSPDVEEESDPIEKKKKMQKLLADYPLVMEEDDPDWPEDADGWGFSLSQF 179

Query: 911  FNKITIKNVXXXXXXXXXXKE--LVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVHN 738
            F+KITIKNV           E  +VWQDDNYIRPIKDI T EWEE VFKDISPLIV VHN
Sbjct: 180  FDKITIKNVKKDEDDENYDSENEIVWQDDNYIRPIKDIKTAEWEEAVFKDISPLIVLVHN 239

Query: 737  RYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTKA 558
            RYKRPKENE++RD LEKAVHI WN RLPSPRCVA+DA VE DLVS L+V+VFPE+IFTKA
Sbjct: 240  RYKRPKENERIRDGLEKAVHIIWNCRLPSPRCVAVDANVEHDLVSALQVSVFPEVIFTKA 299

Query: 557  GKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPS 423
            GKILYREKA RTADELS+IMAFFYYGA KP CL  +  +QE IPS
Sbjct: 300  GKILYREKATRTADELSKIMAFFYYGAAKPDCLSSIESSQEMIPS 344


>ref|XP_004138945.1| PREDICTED: uncharacterized protein LOC101210591 [Cucumis sativus]
          Length = 347

 Score =  313 bits (803), Expect = 1e-82
 Identities = 158/227 (69%), Positives = 177/227 (77%), Gaps = 4/227 (1%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WR KIR+VID NP+VEEE D +E+  KMQKLLADYPLVV              GFNLGQF
Sbjct: 118  WREKIRKVIDTNPNVEEEIDNMERRIKMQKLLADYPLVVEEEDPDWPEDADGWGFNLGQF 177

Query: 911  FNKITIKNVXXXXXXXXXXK----ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFV 744
            F+KITIKN                E+VWQDDNYIRPIKDIT +EWEE VFKDISPLI+FV
Sbjct: 178  FDKITIKNKKKDDKYDDDKDDTDNEVVWQDDNYIRPIKDITISEWEEAVFKDISPLIIFV 237

Query: 743  HNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFT 564
            HNRYKRPKENEKVR++LEKA+HI WN  LPSPRCVA+DAVVE +LV+ L+V+ FPEIIFT
Sbjct: 238  HNRYKRPKENEKVREELEKAIHIIWNCNLPSPRCVAVDAVVECNLVTALQVSAFPEIIFT 297

Query: 563  KAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPS 423
            KAGKILYREK    ADELS+IMAFFYYGA KPPCL++VGD QEAIPS
Sbjct: 298  KAGKILYREKGFVNADELSKIMAFFYYGAAKPPCLNDVGDYQEAIPS 344


>ref|XP_007202319.1| hypothetical protein PRUPE_ppa008110mg [Prunus persica]
            gi|462397850|gb|EMJ03518.1| hypothetical protein
            PRUPE_ppa008110mg [Prunus persica]
          Length = 344

 Score =  313 bits (801), Expect = 2e-82
 Identities = 154/225 (68%), Positives = 176/225 (78%), Gaps = 3/225 (1%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WR KIRQV+D NP VEEE DPIE+TKK+Q+LLA+YPLVV              GF L QF
Sbjct: 112  WRSKIRQVLDTNPDVEEELDPIERTKKVQQLLANYPLVVEEDDPEWPDDADGRGFKLDQF 171

Query: 911  FNKITIKNVXXXXXXXXXXK---ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVH 741
            F+KITIKN               E+VWQDDNYIRPIKD+ T EWEE VFKDISPLI+ VH
Sbjct: 172  FDKITIKNNTARKDDNDNDDSDNEIVWQDDNYIRPIKDVVTAEWEEAVFKDISPLIILVH 231

Query: 740  NRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTK 561
            NRYKRPKENEK+R++LEKAVHI WN +LPSPRC+A+DAV E DLVS LKV+VFPEIIFTK
Sbjct: 232  NRYKRPKENEKIRNELEKAVHIIWNCKLPSPRCIAVDAVTEHDLVSALKVSVFPEIIFTK 291

Query: 560  AGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIP 426
            AGKILYREKAIR+ DELS++MAFFYYGA +PPCL+ +GD QE IP
Sbjct: 292  AGKILYREKAIRSGDELSKVMAFFYYGAARPPCLNGIGDRQEPIP 336


>ref|XP_002533350.1| conserved hypothetical protein [Ricinus communis]
            gi|223526815|gb|EEF29035.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 351

 Score =  312 bits (799), Expect = 3e-82
 Identities = 156/227 (68%), Positives = 177/227 (77%), Gaps = 2/227 (0%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WRR IR+VID +P +EEE +  +K  +MQKLLADYPLVV              GFNLGQF
Sbjct: 123  WRRNIREVIDKHPDIEEELNAEDKKIRMQKLLADYPLVVEEDDPDWPEDSDGWGFNLGQF 182

Query: 911  FNKITIKNVXXXXXXXXXXKE--LVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVHN 738
            FNKITIKN            E  +VWQDDNYIRPIKDITT +WEETVFKDI+PLI+ VHN
Sbjct: 183  FNKITIKNKKKDDDDENYDSENEIVWQDDNYIRPIKDITTADWEETVFKDINPLIILVHN 242

Query: 737  RYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTKA 558
            RYKRPKENEK RD+LEKAV+I WN RLPSPRCVA+DAVVE DLVS LKV++FPEIIFTKA
Sbjct: 243  RYKRPKENEKARDELEKAVNIIWNCRLPSPRCVAVDAVVETDLVSALKVSIFPEIIFTKA 302

Query: 557  GKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPSRP 417
            GKILYRE+A RTADE S+IMA+FYYGA KPPCL  +G++QE IPS P
Sbjct: 303  GKILYRERATRTADEFSKIMAYFYYGAGKPPCLSGIGESQELIPSVP 349


>gb|EYU34242.1| hypothetical protein MIMGU_mgv1a010657mg [Mimulus guttatus]
          Length = 306

 Score =  309 bits (792), Expect = 2e-81
 Identities = 160/257 (62%), Positives = 185/257 (71%), Gaps = 1/257 (0%)
 Frame = -1

Query: 1190 DDGNKKERKGKSNNKXXXXXXXXXXXXXXXXXEWRRKIRQVIDMNPHVEEESDPIEKTKK 1011
            DD +++E K   N                    WRRKIR+VID  P +EEE D +EK KK
Sbjct: 54   DDDDEEEEKSSGNRNEDSFVMNPEERKE-----WRRKIREVIDKAPDIEEEIDLVEKRKK 108

Query: 1010 MQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNVXXXXXXXXXXK-ELVWQD 834
            MQ LLA YPLVV              GFNLGQFFNKI+IKNV          + E+VWQD
Sbjct: 109  MQNLLAQYPLVVDEEDPDWPEDADGWGFNLGQFFNKISIKNVKKEDDEGYDSENEVVWQD 168

Query: 833  DNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLP 654
            D+YI+PIKDIT+ EWEE +FKD SPL+VFVHNRYKRPKENEK+RD+LEKAVHI WN RLP
Sbjct: 169  DDYIQPIKDITSAEWEEAIFKDFSPLVVFVHNRYKRPKENEKIRDELEKAVHIIWNCRLP 228

Query: 653  SPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILYREKAIRTADELSRIMAFFYYGAT 474
            SPRC AIDA +ELDLVS L+V+VFPE+IFTKAGKILYREKAIRTADELS+IMAFFY+GA 
Sbjct: 229  SPRCYAIDANLELDLVSALQVSVFPELIFTKAGKILYREKAIRTADELSKIMAFFYFGAA 288

Query: 473  KPPCLDEVGDNQEAIPS 423
            KPPCL+ V   +E IP+
Sbjct: 289  KPPCLNGVDHIEEPIPT 305


>ref|XP_006410313.1| hypothetical protein EUTSA_v10016862mg [Eutrema salsugineum]
            gi|557111482|gb|ESQ51766.1| hypothetical protein
            EUTSA_v10016862mg [Eutrema salsugineum]
          Length = 354

 Score =  309 bits (792), Expect = 2e-81
 Identities = 174/340 (51%), Positives = 215/340 (63%), Gaps = 15/340 (4%)
 Frame = -1

Query: 1397 TRFQCLFPSLKEASSTLIPFLNH-KHTNFKIIMSSRIKFPILEVGSERRRGTRSTKLLRA 1221
            T+F C       + S+ +P  +H K   F++  S    F  +++ S +    RS + ++A
Sbjct: 7    TQFTCPLKENGFSFSSAVPASSHFKRYPFELASSRHECFGSVKIVSSKGNVMRSRRNVKA 66

Query: 1220 SNEVVE-------ETVFDDGNKKERKGKSNNKXXXXXXXXXXXXXXXXXEWRRKIRQVID 1062
               V +            D   +E + K +                   EWR+KIR+VID
Sbjct: 67   FGLVDKLGKKSWRREEESDSEDEEDEAKKDTSSKRLGDEASLDDPEERREWRKKIREVID 126

Query: 1061 MNPHVEEES-DPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNV 885
             +P +EEE  D +EK +KMQKLLADYPLVV              GF+  QFFNKITIKN 
Sbjct: 127  KHPDIEEEEIDLVEKRRKMQKLLADYPLVVNEEDPNWPDDADGWGFSFNQFFNKITIKNE 186

Query: 884  XXXXXXXXXXK------ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRP 723
                             E+VWQDDNYIRPIKD+TT EWE++VFKDISPL+VFVHNRYKRP
Sbjct: 187  KKDDDDDEDDNGDDSHKEIVWQDDNYIRPIKDLTTAEWEDSVFKDISPLMVFVHNRYKRP 246

Query: 722  KENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILY 543
            KENEK R++LEKA+H+ WN  LPSPRCVA+DAVVE DLVS L+V+VFPEIIFTKAGKILY
Sbjct: 247  KENEKFREELEKAIHVIWNCGLPSPRCVAVDAVVETDLVSALQVSVFPEIIFTKAGKILY 306

Query: 542  REKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPS 423
            REK IRTADELS+IMAFFYYGA KPPCL+ V ++QE IPS
Sbjct: 307  REKGIRTADELSKIMAFFYYGAAKPPCLNGVVNSQEQIPS 346


>ref|XP_007028457.1| Thioredoxin superfamily protein isoform 1 [Theobroma cacao]
            gi|590634748|ref|XP_007028458.1| Thioredoxin superfamily
            protein isoform 1 [Theobroma cacao]
            gi|590634752|ref|XP_007028459.1| Thioredoxin superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508717062|gb|EOY08959.1| Thioredoxin superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508717063|gb|EOY08960.1| Thioredoxin superfamily
            protein isoform 1 [Theobroma cacao]
            gi|508717064|gb|EOY08961.1| Thioredoxin superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 330

 Score =  308 bits (789), Expect = 4e-81
 Identities = 172/319 (53%), Positives = 204/319 (63%), Gaps = 10/319 (3%)
 Frame = -1

Query: 1349 LIPFLNHKHTNFKIIMSSRIKFPILEV--GSERRRGTRSTKLLR------ASNEVVEETV 1194
            L P   +K   F +  S+    P L    GS  R G R+  ++       A  E+ +   
Sbjct: 15   LSPAKENKLKPFLLSSSANPNLPKLPFWRGSSIRNGVRTHGMVEEIGKKFAGRELSDSDD 74

Query: 1193 FDDGNKKERKGKSNNKXXXXXXXXXXXXXXXXXEWRRKIRQVIDMNPHVEEESDPIEKTK 1014
             DD +   +KG+  +                   WR KIR V+  +P ++EE DP+EK  
Sbjct: 75   EDDDDSSTKKGEMGDAYHFDDDERRE--------WRAKIRDVLCKHPEIQEELDPVEKLN 126

Query: 1013 KMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNVXXXXXXXXXXKE--LVW 840
            +MQKLLADYPLVV              GFNLGQFF+KITIKN            E  +VW
Sbjct: 127  QMQKLLADYPLVVDEDDPDWPEDADGWGFNLGQFFDKITIKNAKKDKDDEDYDSENEVVW 186

Query: 839  QDDNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSR 660
            QDDNYIRPIK I   EWEETVFKDISPLI+ VHNRYKRPKENE+V D+LEKAVH+ WN  
Sbjct: 187  QDDNYIRPIKQIKIAEWEETVFKDISPLIILVHNRYKRPKENERVWDELEKAVHVIWNCS 246

Query: 659  LPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILYREKAIRTADELSRIMAFFYYG 480
            LPSPRCVA+DAVVE  LVS LKV+VFPE+IFTKAGKILYRE+AIRTADELS++MAFFYYG
Sbjct: 247  LPSPRCVAVDAVVEDALVSALKVSVFPELIFTKAGKILYREQAIRTADELSKMMAFFYYG 306

Query: 479  ATKPPCLDEVGDNQEAIPS 423
            A KPPCLD VG++QE IPS
Sbjct: 307  AAKPPCLDCVGNSQEMIPS 325


>ref|XP_006380250.1| hypothetical protein POPTR_0007s00360g [Populus trichocarpa]
            gi|550333815|gb|ERP58047.1| hypothetical protein
            POPTR_0007s00360g [Populus trichocarpa]
          Length = 356

 Score =  304 bits (779), Expect = 6e-80
 Identities = 175/317 (55%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
 Frame = -1

Query: 1331 HKHTNFKIIMS--SRIKFPILEVGSERRRGTRSTKLLRASNEVVEETVFDDGNKKERKGK 1158
            +K T +  + S  S +    L+  S+   G   + + RAS    E  + DDG KK R+  
Sbjct: 32   YKPTTYTEVSSTCSCLSSKFLKYPSQLHGGGFQSSISRASRRD-ERVLSDDGKKKRREEF 90

Query: 1157 SNN--------KXXXXXXXXXXXXXXXXXEWRRKIRQVIDMNPHVEE--ESDPIEKTKKM 1008
            S +                          EWR KIR+V+  +P V+E  E D  EK  +M
Sbjct: 91   SESDDDDDDYSSIKGKVNDPYLMDAEERREWRMKIREVMKKHPDVDEDEELDSEEKRMRM 150

Query: 1007 QKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNVXXXXXXXXXXK--ELVWQD 834
            +KLLADYPL+V              GF L QFFNKITIKN              E+VWQD
Sbjct: 151  EKLLADYPLIVDEDDPDWPEDADGRGFGLDQFFNKITIKNKKKDDDDENYDSDKEIVWQD 210

Query: 833  DNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLP 654
            D+YIRPIKDITT  WEE VFKDISPLIV VHNRYKRPKENE +RD LEKAVHI WN RLP
Sbjct: 211  DDYIRPIKDITTAGWEEAVFKDISPLIVLVHNRYKRPKENENIRDALEKAVHIIWNCRLP 270

Query: 653  SPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILYREKAIRTADELSRIMAFFYYGAT 474
            SPRCVAIDAVVE DLVS LKV+VFPEIIFTKAGKILYREKAIRTADE S+IMA+FYYGA 
Sbjct: 271  SPRCVAIDAVVETDLVSALKVSVFPEIIFTKAGKILYREKAIRTADEFSKIMAYFYYGAG 330

Query: 473  KPPCLDEVGDNQEAIPS 423
            KPPCL+++GD+QE IPS
Sbjct: 331  KPPCLNDIGDSQELIPS 347


>ref|XP_002309712.2| hypothetical protein POPTR_0007s00360g [Populus trichocarpa]
            gi|550333814|gb|EEE90162.2| hypothetical protein
            POPTR_0007s00360g [Populus trichocarpa]
          Length = 355

 Score =  304 bits (779), Expect = 6e-80
 Identities = 175/317 (55%), Positives = 203/317 (64%), Gaps = 14/317 (4%)
 Frame = -1

Query: 1331 HKHTNFKIIMS--SRIKFPILEVGSERRRGTRSTKLLRASNEVVEETVFDDGNKKERKGK 1158
            +K T +  + S  S +    L+  S+   G   + + RAS    E  + DDG KK R+  
Sbjct: 32   YKPTTYTEVSSTCSCLSSKFLKYPSQLHGGGFQSSISRASRRD-ERVLSDDGKKKRREEF 90

Query: 1157 SNN--------KXXXXXXXXXXXXXXXXXEWRRKIRQVIDMNPHVEE--ESDPIEKTKKM 1008
            S +                          EWR KIR+V+  +P V+E  E D  EK  +M
Sbjct: 91   SESDDDDDDYSSIKGKVNDPYLMDAEERREWRMKIREVMKKHPDVDEDEELDSEEKRMRM 150

Query: 1007 QKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNVXXXXXXXXXXK--ELVWQD 834
            +KLLADYPL+V              GF L QFFNKITIKN              E+VWQD
Sbjct: 151  EKLLADYPLIVDEDDPDWPEDADGRGFGLDQFFNKITIKNKKKDDDDENYDSDKEIVWQD 210

Query: 833  DNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLP 654
            D+YIRPIKDITT  WEE VFKDISPLIV VHNRYKRPKENE +RD LEKAVHI WN RLP
Sbjct: 211  DDYIRPIKDITTAGWEEAVFKDISPLIVLVHNRYKRPKENENIRDALEKAVHIIWNCRLP 270

Query: 653  SPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILYREKAIRTADELSRIMAFFYYGAT 474
            SPRCVAIDAVVE DLVS LKV+VFPEIIFTKAGKILYREKAIRTADE S+IMA+FYYGA 
Sbjct: 271  SPRCVAIDAVVETDLVSALKVSVFPEIIFTKAGKILYREKAIRTADEFSKIMAYFYYGAG 330

Query: 473  KPPCLDEVGDNQEAIPS 423
            KPPCL+++GD+QE IPS
Sbjct: 331  KPPCLNDIGDSQELIPS 347


>ref|XP_003555189.1| PREDICTED: uncharacterized protein LOC100793560 [Glycine max]
          Length = 333

 Score =  303 bits (776), Expect = 1e-79
 Identities = 170/326 (52%), Positives = 213/326 (65%), Gaps = 5/326 (1%)
 Frame = -1

Query: 1385 CLFPSLKEASSTLIPFLNHKHTNFKIIMSSRIKFPILEVGSERRRGTRSTKLLRASNEVV 1206
            C  P  K  + TL PF  +  +   I + SR++    + G  ++RG        AS++  
Sbjct: 18   CCLPKHKPTNFTL-PFKLNGDSCRSIRIPSRVQALKSDGGKWKKRGQE------ASSDTD 70

Query: 1205 EETVFDDGNKKERKGKSNNKXXXXXXXXXXXXXXXXXEWRRKIRQVIDMNPHVEEESDPI 1026
            ++   DD +  +R  K++                   EWRR IRQV+D  P VEEE DP+
Sbjct: 71   DDD--DDDDAPQRFNKND---------PYLMSPEERLEWRRNIRQVLDRKPDVEEELDPL 119

Query: 1025 EKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNVXXXXXXXXXXK-- 852
            EK KK++KLL DYPLVV              GF+LGQFF+KITIKN              
Sbjct: 120  EKKKKLEKLLEDYPLVVDEDDPDWPEDADGWGFSLGQFFDKITIKNKKKDDDDDDDNDDV 179

Query: 851  ---ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRPKENEKVRDDLEKAV 681
               E++WQDDNYIRPIKDI T EWEETVFKDISPLI+ VHNRYKRPKENEK+RD+LEKAV
Sbjct: 180  DRPEIMWQDDNYIRPIKDIKTAEWEETVFKDISPLIILVHNRYKRPKENEKIRDELEKAV 239

Query: 680  HIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILYREKAIRTADELSRI 501
            HI WN RLPSPRCVAIDAVVE +LV+ L+V++FPEIIFTKAGKIL+R+KAIR+A+E S++
Sbjct: 240  HIIWNCRLPSPRCVAIDAVVETELVAALQVSIFPEIIFTKAGKILFRDKAIRSAEEWSKV 299

Query: 500  MAFFYYGATKPPCLDEVGDNQEAIPS 423
            MA+FYYGA KP CL+ +  +QE IPS
Sbjct: 300  MAYFYYGAAKPSCLNSLTYSQENIPS 325


>ref|XP_007142732.1| hypothetical protein PHAVU_007G012300g [Phaseolus vulgaris]
            gi|561015922|gb|ESW14726.1| hypothetical protein
            PHAVU_007G012300g [Phaseolus vulgaris]
          Length = 335

 Score =  302 bits (773), Expect = 3e-79
 Identities = 177/336 (52%), Positives = 217/336 (64%), Gaps = 3/336 (0%)
 Frame = -1

Query: 1421 MVTLQLTK-TRFQCLFPSLKEASSTLIPFLNHKHTNFKIIMSSRIKFPILEVGSERRRGT 1245
            +VT ++TK +   C  PS    + TL   LN    +  I ++SR++  +   G + +RG 
Sbjct: 14   IVTSRVTKQSSISCFLPSNALFNFTLP--LNLDGNSRTIRLTSRVQ-ALKSDGGKWKRGK 70

Query: 1244 RSTKLLRASNEVVEETVFDDGNKKERKGKSNNKXXXXXXXXXXXXXXXXXEWRRKIRQVI 1065
                   AS++  ++   DD +K    G +NN                   WRR IRQV+
Sbjct: 71   E------ASSDSDDDDDNDD-DKYAPTGSANNDPYLMSPEERLE-------WRRAIRQVL 116

Query: 1064 DMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQFFNKITIKNV 885
               P VEEE DP EK KKMQKL+ DYPLVV              GF++GQFF+KITIKN 
Sbjct: 117  VKKPDVEEEIDPEEKKKKMQKLMEDYPLVVEEDDPNWPEDADGRGFSMGQFFDKITIKNE 176

Query: 884  XXXXXXXXXXK--ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVFVHNRYKRPKENE 711
                         E+VWQDDNYIRPIKDI T EWEETVFKDISPLIV VHNRY+RPKENE
Sbjct: 177  KKDDDNDDDVDHPEIVWQDDNYIRPIKDIKTAEWEETVFKDISPLIVLVHNRYRRPKENE 236

Query: 710  KVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIFTKAGKILYREKA 531
            K+RD+LEKAVHI WN RLPSPRCVAIDAVVE +LV  L+V+VFPEIIFTKAGKIL+R+K 
Sbjct: 237  KIRDELEKAVHIIWNCRLPSPRCVAIDAVVETELVDALQVSVFPEIIFTKAGKILFRDKD 296

Query: 530  IRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPS 423
            IRTA+E S++MA+FYYGA KPPCL+ +  +QE IPS
Sbjct: 297  IRTAEEWSKVMAYFYYGAAKPPCLNNMTFSQENIPS 332


>ref|XP_002881198.1| hypothetical protein ARALYDRAFT_902219 [Arabidopsis lyrata subsp.
            lyrata] gi|297327037|gb|EFH57457.1| hypothetical protein
            ARALYDRAFT_902219 [Arabidopsis lyrata subsp. lyrata]
          Length = 348

 Score =  301 bits (770), Expect = 6e-79
 Identities = 154/229 (67%), Positives = 173/229 (75%), Gaps = 7/229 (3%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEES--DPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLG 918
            WR+ IR+VID +P +EEE   D +EK +KMQKLLADYPLVV              GF+  
Sbjct: 113  WRKTIREVIDKHPDIEEEEEIDMVEKRRKMQKLLADYPLVVNEEDPNWPEDADGWGFSFN 172

Query: 917  QFFNKITIKNVXXXXXXXXXXK-----ELVWQDDNYIRPIKDITTNEWEETVFKDISPLI 753
            QFFNKITIKN                 E+VWQDDNYIRPIKD+TT EWEETVFKDISPL+
Sbjct: 173  QFFNKITIKNEKKDVDDEDNEGDDSEKEIVWQDDNYIRPIKDLTTAEWEETVFKDISPLM 232

Query: 752  VFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEI 573
            V VHNRYKRPKENEK R++LEKA+ + WN  LPSPRCVA+DAVVE DLVS LKV VFPEI
Sbjct: 233  VLVHNRYKRPKENEKFREELEKAIQVIWNCGLPSPRCVAVDAVVETDLVSALKVCVFPEI 292

Query: 572  IFTKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIP 426
            IFTKAGKILYREK IRTADELS+IMAFFYYGA KPPCL+ V ++QE IP
Sbjct: 293  IFTKAGKILYREKGIRTADELSKIMAFFYYGAAKPPCLNGVVNSQEQIP 341


>ref|XP_004287186.1| PREDICTED: uncharacterized protein LOC101305502 [Fragaria vesca
            subsp. vesca]
          Length = 297

 Score =  298 bits (764), Expect = 3e-78
 Identities = 149/234 (63%), Positives = 174/234 (74%), Gaps = 5/234 (2%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEEESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLGQF 912
            WR KIRQV+D NP +EEE DP E++KK+Q LL +YPLVV              GF L  F
Sbjct: 63   WRDKIRQVLDKNPDLEEELDPSERSKKVQDLLTNYPLVVDEDDPEWPDDADGRGFKLDNF 122

Query: 911  FNKITIKNVXXXXXXXXXXK-----ELVWQDDNYIRPIKDITTNEWEETVFKDISPLIVF 747
            F+KITIKN                 E+VWQDDNYIRPIKD+TT+EWE+TVFKDISPL++ 
Sbjct: 123  FDKITIKNNPRKKNENDDDDDDSDDEIVWQDDNYIRPIKDVTTSEWEDTVFKDISPLVIL 182

Query: 746  VHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFPEIIF 567
            VHNRYKRPKENE++R +LEKAV I WN RLPSPRCVAIDAV E  LVS L+V+V+PE+IF
Sbjct: 183  VHNRYKRPKENERIRTELEKAVQIIWNCRLPSPRCVAIDAVTEHYLVSALQVSVYPELIF 242

Query: 566  TKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIPSRPKQQQ 405
            TKAGKILYREK IR+ DELS++MAFFYYGA KPPCL+ +GD QE IPS P   Q
Sbjct: 243  TKAGKILYREKEIRSGDELSKVMAFFYYGAAKPPCLNGIGDRQEEIPSIPINAQ 296


>ref|NP_180743.1| mesophyll-cell RNAi library line 7-like protein [Arabidopsis
            thaliana] gi|4887752|gb|AAD32288.1| hypothetical protein
            [Arabidopsis thaliana] gi|330253498|gb|AEC08592.1|
            mesophyll-cell RNAi library line 7-like protein
            [Arabidopsis thaliana]
          Length = 350

 Score =  298 bits (763), Expect = 4e-78
 Identities = 153/231 (66%), Positives = 173/231 (74%), Gaps = 9/231 (3%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEE--ESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLG 918
            WR+ IR+VID +P +EE  E D +EK +KMQKLLADYPLVV              GF+  
Sbjct: 113  WRKTIREVIDKHPDIEEDEEIDMVEKRRKMQKLLADYPLVVNEEDPNWPEDADGWGFSFN 172

Query: 917  QFFNKITIKNVXXXXXXXXXXKE-------LVWQDDNYIRPIKDITTNEWEETVFKDISP 759
            QFFNKITIKN            E       +VWQDDNYIRPIKD+TT EWEE VFKDISP
Sbjct: 173  QFFNKITIKNEKKEEEDDDEDSEGDDSEKEIVWQDDNYIRPIKDLTTAEWEEAVFKDISP 232

Query: 758  LIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFP 579
            L+V VHNRYKRPKENEK R++LEKA+ + WN  LPSPRCVA+DAVVE DLVS LKV+VFP
Sbjct: 233  LMVLVHNRYKRPKENEKFREELEKAIQVIWNCGLPSPRCVAVDAVVETDLVSALKVSVFP 292

Query: 578  EIIFTKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIP 426
            EIIFTKAGKILYREK IRTADELS+IMAFFYYGA KPPCL+ V ++QE IP
Sbjct: 293  EIIFTKAGKILYREKGIRTADELSKIMAFFYYGAAKPPCLNGVVNSQEQIP 343


>ref|XP_006294510.1| hypothetical protein CARUB_v10023543mg [Capsella rubella]
            gi|482563218|gb|EOA27408.1| hypothetical protein
            CARUB_v10023543mg [Capsella rubella]
          Length = 348

 Score =  297 bits (761), Expect = 7e-78
 Identities = 152/231 (65%), Positives = 174/231 (75%), Gaps = 9/231 (3%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVE--EESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLG 918
            WR+ IR+V+D +P +E  EE D +EK +KMQKLLADYPLVV              GF+  
Sbjct: 111  WRKTIREVMDKHPDIEDEEEIDMVEKRRKMQKLLADYPLVVNEEDPNWPEDAEGWGFSFN 170

Query: 917  QFFNKITIKNVXXXXXXXXXXK-------ELVWQDDNYIRPIKDITTNEWEETVFKDISP 759
            QFFNKITIKN                   E+VWQDDNYIRPIKD+TT EWEETVFKDISP
Sbjct: 171  QFFNKITIKNEKKEDDDDDDDDQGEDSEKEIVWQDDNYIRPIKDLTTAEWEETVFKDISP 230

Query: 758  LIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFP 579
            L+V VHNRYKRPKENEK R++LEKA+ + WN  LPSPRCVA+DAVVE DLVS L+V+VFP
Sbjct: 231  LMVLVHNRYKRPKENEKFREELEKAIQMIWNCGLPSPRCVAVDAVVETDLVSALQVSVFP 290

Query: 578  EIIFTKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIP 426
            EIIFTKAGKILYREK IRTADELS+IMAFFYYGA KPPCL+ V ++QE IP
Sbjct: 291  EIIFTKAGKILYREKGIRTADELSKIMAFFYYGAAKPPCLNVVDNSQEQIP 341


>gb|AAO89195.1| hypothetical protein [Arabidopsis thaliana]
          Length = 350

 Score =  296 bits (757), Expect = 2e-77
 Identities = 152/231 (65%), Positives = 172/231 (74%), Gaps = 9/231 (3%)
 Frame = -1

Query: 1091 WRRKIRQVIDMNPHVEE--ESDPIEKTKKMQKLLADYPLVVXXXXXXXXXXXXXXGFNLG 918
            WR+ IR+VID +P +EE  E D +EK +KMQKLLADYPLVV              GF+  
Sbjct: 113  WRKTIREVIDKHPDIEEDEEIDMVEKXRKMQKLLADYPLVVNEEDPNWPEDADGWGFSFN 172

Query: 917  QFFNKITIKNVXXXXXXXXXXKE-------LVWQDDNYIRPIKDITTNEWEETVFKDISP 759
            QFFNKITIKN            E       +VWQDDNYIRPIKD+TT EWEE VFKDISP
Sbjct: 173  QFFNKITIKNEKKEEEDDDEDSEGDDSEKEIVWQDDNYIRPIKDLTTAEWEEAVFKDISP 232

Query: 758  LIVFVHNRYKRPKENEKVRDDLEKAVHIFWNSRLPSPRCVAIDAVVELDLVSVLKVTVFP 579
            L+V VHNRYKRPKENEK R++LEKA+ + WN  LPSPRCVA+DAVVE DLVS LKV+VFP
Sbjct: 233  LMVLVHNRYKRPKENEKFREELEKAIQVIWNCGLPSPRCVAVDAVVETDLVSALKVSVFP 292

Query: 578  EIIFTKAGKILYREKAIRTADELSRIMAFFYYGATKPPCLDEVGDNQEAIP 426
            EIIFTKAG ILYREK IRTADELS+IMAFFYYGA KPPCL+ V ++QE IP
Sbjct: 293  EIIFTKAGXILYREKGIRTADELSKIMAFFYYGAAKPPCLNGVVNSQEQIP 343

