BLASTX nr result

ID: Achyranthes23_contig00023459 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00023459
         (2188 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   682   0.0  
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   680   0.0  
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   677   0.0  
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   672   0.0  
ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   672   0.0  
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   668   0.0  
gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   666   0.0  
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   661   0.0  
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   647   0.0  
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   645   0.0  
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   645   0.0  
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   645   0.0  
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   644   0.0  
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   643   0.0  
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     641   0.0  
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   621   e-175
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   612   e-172
ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217...   604   e-170
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   594   e-167
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 593   e-166

>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  682 bits (1759), Expect = 0.0
 Identities = 335/477 (70%), Positives = 392/477 (82%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            N+KA +C ICE SN ASIC  CVN  L+E  TLLKSL+ RRD LY RL++ LVAKGKAD+
Sbjct: 2    NKKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            QLNWR++QNEK+  L+EKL R++EQL+ GK K+EK S DLKVRY +LD +++M+EK+R E
Sbjct: 62   QLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNRAE 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK YPN+ICTQSLGHMAI S+ LHKQSV++KQICKLFPQR V +D +++DG +GQYD 
Sbjct: 122  QLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQ 181

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            IC ARLP+GL+PHSVPSEELAASLGYM+QLLNLVV +L +P LHNSGFAGSCSR+WQRDS
Sbjct: 182  ICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQRDS 241

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YWDARPSSRSNEYPLFIPRQN CST GEN                  R+P+L+ S S SF
Sbjct: 242  YWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSTSF 301

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N++ AS HS+ETH DLQKGI+LLKKS+AC+TAYCYNSLCLDVP++ASTFEAFAKLLA LS
Sbjct: 302  NYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLS 361

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMACSRS KQVQ+LN+SVWN+NS I S+T+LES +  P   N+ D NLP
Sbjct: 362  SSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNLP 421

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            SS  SFLYAT   D GK+ESL DGWDLVEHP FPPPPS+TED+EHWTRAM  DATKK
Sbjct: 422  SSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  680 bits (1754), Expect = 0.0
 Identities = 342/477 (71%), Positives = 391/477 (81%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            N+K+  C ICE SN ASICTVCVN  LNEY TLLKSL+ RRD LYSRL++ LVAKGKAD+
Sbjct: 2    NKKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            QLNWR+ QNEK+  L+EKL R++EQL   KAK EKMSSDL  +YGLL+ S++ LEK+RV+
Sbjct: 62   QLNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVD 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK +PNLICTQSLGHMAITS+ LH  SV VKQICKLFPQR V V+ +KKDG +GQYD 
Sbjct: 122  QLEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQ 181

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICNARLPRGL+PHS+PSEELAASLGYM+QLLNLVVH+L  PALHNSGFAGSCSR+WQRDS
Sbjct: 182  ICNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDS 241

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YW+ARPSSRSNEYPLFIPRQ  CST GEN                  R+ RL+ S S SF
Sbjct: 242  YWNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSF 301

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N++ ASPHS+ETH DLQKGI+L+KKS+ACVTAY YN LCLDVP++ASTFEAFAKLLA LS
Sbjct: 302  NYNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLS 361

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMACSRS KQVQ+LNKSVWNVNS+I SST++ES +A     N+ D NL 
Sbjct: 362  SSKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNLR 421

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +S  SFL+A    DAGK+ESL DGWDLVEHP FPPPPS+TED+EHWTRAMF DATKK
Sbjct: 422  NSATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  677 bits (1747), Expect = 0.0
 Identities = 333/477 (69%), Positives = 390/477 (81%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            N+KA +C ICE SN ASIC  CVN  L+E  TLLKSL+ RRD LY RL++ LVAKGKAD+
Sbjct: 2    NKKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            QLNWR++QNEK+  L+EKL R++EQL+ GK K+EK S DLK RY +LD +++M+EK+R E
Sbjct: 62   QLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNRAE 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK YPN+ICTQSLGHMAI S+ LHKQSV++KQICKLFPQR V +D +++DG +GQYD 
Sbjct: 122  QLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQ 181

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            IC ARLP+GL+PHSVPSEELAASLGYM+QLLNLVV +L +P LHNSGFAGSCSR+WQRDS
Sbjct: 182  ICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQRDS 241

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YWDARPSSRSNEYPLFIPRQN CST GEN                  R+P+L+ S S SF
Sbjct: 242  YWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSASF 301

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N++ AS HS+ETH DLQKGI+LLKKS+AC+TAYCYNSLCLDVP++ASTFEAFAKLLA LS
Sbjct: 302  NYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLS 361

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
             SKEVRSV SLKMACSRS KQVQ+LN+SVWN+NS I S+T+LES +  P   N+ D NLP
Sbjct: 362  LSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNLP 421

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            SS  SFLYAT   D GK+ESL DGWDLVEHP FPPPPS+TED+EHWTRAM  DATKK
Sbjct: 422  SSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  672 bits (1735), Expect = 0.0
 Identities = 332/477 (69%), Positives = 393/477 (82%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            ++KA +C IC+ SN ASIC VCVN  LNEY +LLKSL+ RRD LYS+L + L AK KAD+
Sbjct: 3    SKKASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKADD 62

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            QLNW+I+QNEK+  LKEKL R++EQL  GKAK+E++S DLKV+YG+L+ ++ MLEK+RVE
Sbjct: 63   QLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNRVE 122

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            +LEK YPNLICTQSLG MAITS+RLHKQSV++KQICKLFPQR V +D + +DG  GQYD+
Sbjct: 123  KLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQYDL 182

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICN  LPRGL+PHSVPSE+LAASLGYM+QLLNLVVH+L  PALHNSGFAGSCSR+WQRDS
Sbjct: 183  ICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDS 242

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YW+ARPSSRSNEYPLFIPRQN CST G+N                  R+PRL+ SGS SF
Sbjct: 243  YWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSNSF 302

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N+S AS H++ETH DLQ GI+LLKKS+AC+TA+CYNSLCLDVP++ASTFEAF+KLLA LS
Sbjct: 303  NYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLATLS 362

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            S+KEVRSV SLKMACSRSSKQ QQLNKSVWNVNS + SS +LES +  P   N+ D NLP
Sbjct: 363  STKEVRSVFSLKMACSRSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLPLTKNLSDHNLP 422

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            SS  SFL+AT   D GK+ESL + WDLVEHP FPPPPS+TED+EHWTRAMF DATK+
Sbjct: 423  SSAASFLFATEMPDIGKNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKR 479


>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  672 bits (1733), Expect = 0.0
 Identities = 336/476 (70%), Positives = 386/476 (81%), Gaps = 1/476 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            RK  SC ICE SNLASIC VCVN  LNEY T LKS + RRD LY RL++ LVAKGKAD+Q
Sbjct: 3    RKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKADDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
            +NWR++QNEK+  L+EKL   +EQ   GKAK+EKMS+DLK++YGLL+ + +MLEK+RVEQ
Sbjct: 63   INWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            LEK YPNLICTQ+LG MAITS+R HKQSV++KQICKLFPQR V +D +KKDG +  YD I
Sbjct: 123  LEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYDQI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CN RLPR L+PHSVPS+ELAASLGYM+QLLNLVV++L  PALHNSGFAGSCSR+WQR+SY
Sbjct: 183  CNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRESY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISFN 1071
            W+ RPSSRSNEYPLFIPRQN CST GEN                  RKPRLE SGS SFN
Sbjct: 243  WNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSSFN 302

Query: 1070 FSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILSS 891
            +S AS HS+ETH DLQKGI+LLKKS+AC+T YCY+SLCLDVP++ASTFEAFAKLLAILSS
Sbjct: 303  YSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAILSS 362

Query: 890  SKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLPS 711
            SKEVRSV SLKMACSRS KQVQQLNKS+WN+NS I SST+LES +  P   N+ D NLP+
Sbjct: 363  SKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNLPN 422

Query: 710  STPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            S  SFLY T   D GK+ESL + WDLVEH  FPPPPS+TEDIEHWTRAM  DATKK
Sbjct: 423  SAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  668 bits (1723), Expect = 0.0
 Identities = 335/478 (70%), Positives = 386/478 (80%), Gaps = 1/478 (0%)
 Frame = -3

Query: 1973 TNRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKAD 1794
            TN+K+ +C ICE SNLASIC VCVN  LN+Y   LK+L+ RRD LYSRL+D LVAKGKAD
Sbjct: 2    TNKKSSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKAD 61

Query: 1793 EQLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRV 1614
            +QLNWRI+Q+EK+  L+EKL R +EQL  GKAK+EK S DLKV+YG+L+ + +MLEK+R 
Sbjct: 62   DQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNRA 121

Query: 1613 EQLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYD 1434
            EQLEK YPNLICTQSLGHMAITS+RLHKQSV++KQICKLFPQR V VD  +K+G  GQYD
Sbjct: 122  EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQYD 181

Query: 1433 VICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRD 1254
             ICNA LPRGL+PHSVPSEELAASLGYM+QLLNLVV +L  PALHNSGFAGSCSR+WQRD
Sbjct: 182  QICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQRD 241

Query: 1253 SYWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSIS 1077
            SYWDARPSSRSNEYPLFIPRQN CST GEN                  RKPRL+ SGS S
Sbjct: 242  SYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSSS 301

Query: 1076 FNFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAIL 897
            FN+S AS HS+ETH DLQ+GI+LLKKS+AC+TAYCYNSLCLDVPS+ASTFEAFAKLL+ L
Sbjct: 302  FNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLSTL 361

Query: 896  SSSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNL 717
            SSSKEV SV SLKMACSRS KQVQQLNKSVWNVNS I S+T+L+S +      N  + N+
Sbjct: 362  SSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHTMTMTKNFYENNI 421

Query: 716  PSSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            P+   SFL +T   D GK+E   +GWDLVEHP   PPPS++EDIEHWTRAMF D TK+
Sbjct: 422  PNYATSFLSSTEMSDVGKNECTIEGWDLVEHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478


>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  666 bits (1719), Expect = 0.0
 Identities = 332/477 (69%), Positives = 385/477 (80%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            NRK+ +C ICE SNLAS+C +CVN  L EY + LK+L+ RRD LYSRLT+ LVAKGKAD+
Sbjct: 3    NRKSSNCAICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKADD 62

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            QLNWR++QNEK+  L+EKL   +EQL  GKAK+EK S DLKV+ G+L+ +  +LEK+R E
Sbjct: 63   QLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNRAE 122

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK YPN ICTQ+LGHMAITS+RLHKQSV++KQICKLFPQR V VD  +KD   GQYD 
Sbjct: 123  QLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQYDQ 182

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICNA LPRGL+PHSVPSEELAASLGYM+QLLNLVV +L  PALHNSGFAGSCSR+WQRDS
Sbjct: 183  ICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQRDS 242

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YWDARPSSRSNEYPLFIPRQN CST GEN                  RKP L+ SGS SF
Sbjct: 243  YWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSSSF 302

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N++ AS HS+ETH DLQ+GI+LLKKS+AC+TAYCYNSLCLDVPS+ASTFEAFAKLLA LS
Sbjct: 303  NYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLS 362

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEV SV SLKMACSRS KQVQQLNKSVWNVNS I S+T+L+S +A     N+ + NLP
Sbjct: 363  SSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHAMTMTKNLYEYNLP 422

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +   S L +T   D+GK+ESL +GWDLVEHP FPPPPS++EDIEHWTRAMF DA +K
Sbjct: 423  TYATSSLCSTELSDSGKNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479


>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  661 bits (1705), Expect = 0.0
 Identities = 331/477 (69%), Positives = 381/477 (79%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            N+K+  C ICE SN ASIC +CVN  LNEY TLLKSL  RRD LYS+L+  L+AKGKAD+
Sbjct: 2    NKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            Q NWR+ QNEK+   +EKL R +EQL  GKAK+EK+S DLK + G+L+ ++N+LEK+R+E
Sbjct: 62   QFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRME 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK YPNLICTQSLGHMAITS+ LHKQSV++KQICKLFPQR V VD ++   F+GQYD 
Sbjct: 122  QLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQ 179

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICNARLPRGL+PHSV SEELAASLGYM+QLLNLV H+L  P LHN+GFAGSCSR+WQRDS
Sbjct: 180  ICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDS 239

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YW+A PSSRSNEYPLFIPRQN CST  EN                  R+P L+ + S SF
Sbjct: 240  YWNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSF 299

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N+S  SPHS+ETH DLQKG++LLKKS+ACVTAYCYN LCLDVPSD STFEAFAKLL+ LS
Sbjct: 300  NYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLS 359

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV +LKMACSRS KQVQ+LNKSVWNVNS I SS +LES +A   M N  D NLP
Sbjct: 360  SSKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLP 419

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +S  SFL+ATG  D GK+ES  DGWDLVEHP FPPPPS+ EDIEHWTRAMF DATKK
Sbjct: 420  NSAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  647 bits (1668), Expect = 0.0
 Identities = 321/477 (67%), Positives = 388/477 (81%), Gaps = 2/477 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            RK  +C ICE SN ASIC++CVN  LNEY T LKSL+ RRD LYS+L++ LV KGK D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKGDDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             N+ ++QNEK+  LKEKL R++EQ+  G+AK+E +S+DLK +YGLL+ + + LEK+RVEQ
Sbjct: 63   ENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            LEK YPNLICTQSLGH+AITS+RLHKQSV++KQICKLFPQR V ++ + +DG +GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQYDQI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CNARLPR L+PHSVPSEEL+ASLGYM+QLLNLVVH+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGEN--XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            WDARPSSRSNEYPLFIPRQN CST GEN                   ++ RL+ SG+ +F
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGNSNF 302

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N+S AS HS++TH DLQKGI+LLKKS+AC+TAYCYNSLCLD PS+ASTFE+FAKLLA LS
Sbjct: 303  NYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLATLS 362

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMA SR+ KQVQQLNKSVWN+NS+I S+T+LES ++ P      +  LP
Sbjct: 363  SSKEVRSVFSLKMAQSRTCKQVQQLNKSVWNMNSVISSTTLLESAHSVPT--TRIENYLP 420

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            SST SFLYAT   D GK+E L +GWD++EHP FPPPPS++ED+EHWTRAMF DA +K
Sbjct: 421  SSTASFLYATDLND-GKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 476


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  645 bits (1665), Expect = 0.0
 Identities = 320/481 (66%), Positives = 385/481 (80%), Gaps = 7/481 (1%)
 Frame = -3

Query: 1964 KACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQL 1785
            K   CGICE SNL S+CT+CVN  LNEY T+LKSL+ RR+ L  +L++ L+AKGKAD+QL
Sbjct: 4    KTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKADDQL 63

Query: 1784 NWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQL 1605
            +WR+ +NEK+  L+EKL + +EQ++ GKAK+EKMS DLKV+Y LL  +  MLEK+R EQL
Sbjct: 64   SWRVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQL 123

Query: 1604 EKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVIC 1425
            EK YPNLICTQ+LGHMAITS+ LHKQSV+VKQICKLFPQR V +D DKKDG +GQYD IC
Sbjct: 124  EKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSIC 183

Query: 1424 NARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSYW 1245
            NARLP+GL+PHSVPS+EL+ASLGYM+QLLNLV+  +C PALHNSGFAGSCSR+WQRDSYW
Sbjct: 184  NARLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQRDSYW 243

Query: 1244 DARPSSRSNEYPLFIPRQNCCSTGGEN-------XXXXXXXXXXXXXXXXXRKPRLEPSG 1086
            DARPSSRS EYPLFIPRQN CS+GGE                         RKPRL+ S 
Sbjct: 244  DARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRLDSSS 303

Query: 1085 SISFNFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLL 906
            S SFN++ AS HS+ETH DLQKGIALLKKS+AC+TAYCYN+LCL+VP++ASTFE FA+LL
Sbjct: 304  SSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARLL 363

Query: 905  AILSSSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCD 726
            A LSSSKEVRSV SLKM+ SR+SKQVQ LNKSVWNV+S   SST++ESG+  P + N  +
Sbjct: 364  ATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHV-PVLRNTFE 422

Query: 725  RNLPSSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATK 546
              LPSS+ + +YAT   DA ++E+L + WDL+EHP FPPPPS TED+EHWTRAMF DATK
Sbjct: 423  NALPSSSGNLIYATEVSDARRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATK 482

Query: 545  K 543
            K
Sbjct: 483  K 483


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  645 bits (1663), Expect = 0.0
 Identities = 331/508 (65%), Positives = 381/508 (75%), Gaps = 32/508 (6%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            N+K+  C ICE SN ASIC +CVN  LNEY TLLKSL  RRD LYS+L+  L+AKGKAD+
Sbjct: 2    NKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            Q NWR+ QNEK+   +EKL R +EQL  GKAK+EK+S DLK + G+L+ ++N+LEK+R+E
Sbjct: 62   QFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRME 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK YPNLICTQSLGHMAITS+ LHKQSV++KQICKLFPQR V VD ++   F+GQYD 
Sbjct: 122  QLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQ 179

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICNARLPRGL+PHSV SEELAASLGYM+QLLNLV H+L  P LHN+GFAGSCSR+WQRDS
Sbjct: 180  ICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDS 239

Query: 1250 YWDARPSSR-------------------------------SNEYPLFIPRQNCCSTGGEN 1164
            YW+A PSSR                               SNEYPLFIPRQN CST  EN
Sbjct: 240  YWNACPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSEN 299

Query: 1163 -XXXXXXXXXXXXXXXXXRKPRLEPSGSISFNFSCASPHSMETHNDLQKGIALLKKSIAC 987
                              R+P L+ + S SFN+S  SPHS+ETH DLQKG++LLKKS+AC
Sbjct: 300  SWTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVAC 359

Query: 986  VTAYCYNSLCLDVPSDASTFEAFAKLLAILSSSKEVRSVMSLKMACSRSSKQVQQLNKSV 807
            VTAYCYN LCLDVPSD STFEAFAKLL+ LSSSKEVRSV +LKMACSRS KQVQ+LNKSV
Sbjct: 360  VTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFNLKMACSRSCKQVQKLNKSV 419

Query: 806  WNVNSMIGSSTMLESGNAQPAMINMCDRNLPSSTPSFLYATGTVDAGKHESLFDGWDLVE 627
            WNVNS I SS +LES +A   M N  D NLP+S  SFL+ATG  D GK+ES  DGWDLVE
Sbjct: 420  WNVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-GKNESFIDGWDLVE 478

Query: 626  HPKFPPPPSETEDIEHWTRAMFTDATKK 543
            HP FPPPPS+ EDIEHWTRAMF DATKK
Sbjct: 479  HPTFPPPPSQVEDIEHWTRAMFIDATKK 506


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine
            max]
          Length = 475

 Score =  645 bits (1663), Expect = 0.0
 Identities = 319/476 (67%), Positives = 384/476 (80%), Gaps = 1/476 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            RK  +C ICE SN ASIC++CVN  LNEY T LK L+ RRD LY +L++ LV KGK D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGDDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             NWR++Q+EK+  LKEKL +++EQ+  G+AK+E MS+DLK++YGLL+ + + LEK+RVEQ
Sbjct: 63   ANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            LEK YPNLICTQSLGH+AITS+ LHK+SV++KQICKLFPQR V ++ +++DG +GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYDQI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CNARLPR L+PHSVPSEEL+ SLGYM+QLLNLV+H+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISFN 1071
            WDARPSSRSNEYPLFIPRQN CST GEN                  R+ RL+ SGS SFN
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTSFN 302

Query: 1070 FSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILSS 891
            +S AS HS++TH DLQKGI+LLKKS+ C+TAYCYNSLCLDVPS+ASTFEAFAKLLA L+S
Sbjct: 303  YSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATLAS 362

Query: 890  SKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLPS 711
            SKEVRSV SLKMA SR+ KQVQQLNKSVWN+NS I S+T+LES ++ P      +  LPS
Sbjct: 363  SKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPT--TRIENYLPS 420

Query: 710  STPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            ST SFLYA    D GK+E L +GWD+VEHP FPPPPS++ED+EHWTRAMF DA  K
Sbjct: 421  STGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  644 bits (1662), Expect = 0.0
 Identities = 320/482 (66%), Positives = 386/482 (80%), Gaps = 7/482 (1%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            RK   CGICE SNL S+CT+CVN  LNEY T+LKSL+ RR+ L  +L++ L+AKGKAD+Q
Sbjct: 3    RKTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKADDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
            L+WR+ +NEK+  L+EKL + +EQ++ GKAK+EKMS DLKV+Y LL  +  MLEK+R EQ
Sbjct: 63   LSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            LEK YPNLICTQ+LGHMAITS+ LHKQSV+VKQICKLFPQR V +D DKKDG +GQYD I
Sbjct: 123  LEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CNARLP+GL+PHSVPS+EL+ASLGYM+QLLNLVV  +C PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGE-------NXXXXXXXXXXXXXXXXXRKPRLEPS 1089
            WDARPSSRS EYPLFIPRQN CS+GGE       +                 RKPRL+ S
Sbjct: 243  WDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRLDSS 302

Query: 1088 GSISFNFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKL 909
             S SFN++ AS HS+ETH DLQKGIALLKKS+AC+TAYCYN+LCL+VP++ASTFE FA+L
Sbjct: 303  SSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARL 362

Query: 908  LAILSSSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMC 729
            LA LSSSKEVRSV SLKM+ SR+SKQVQ LNKSVWNV+S   SST++ESG+      N  
Sbjct: 363  LATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHVPR---NTF 419

Query: 728  DRNLPSSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDAT 549
            +++LPSS  + +YAT   + G++E+L + WDL+EHP FPPPPS TED+EHWTRAMF DAT
Sbjct: 420  EKSLPSSGGNLMYATEVSNVGRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDAT 479

Query: 548  KK 543
            KK
Sbjct: 480  KK 481


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  643 bits (1659), Expect = 0.0
 Identities = 320/476 (67%), Positives = 384/476 (80%), Gaps = 1/476 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            RK  +C ICE SN ASIC++CVN  LNEY T LK L+ RRD LYS+L++ LV KGK D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKGDDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             NWR++Q+EK+  LKEKL + +EQ+  G+AK+E  S+DLK++YGLL+ + + LEK+RVEQ
Sbjct: 63   ANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            LEK YPNLICTQSLGH+AITS+RLHKQSV++KQICKLFPQR V ++ ++ DG  GQ+D I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQFDQI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CNARLPR L+P SVPSEEL+ SLGYM+QLLNL+VH+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISFN 1071
            WDARPSSRSNEYPLFIPRQN CSTGGEN                  R+ RL+ SGS SFN
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSSSFN 302

Query: 1070 FSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILSS 891
            +S AS HS++TH DLQKGI+LLKKS+AC+TAYCYNSLCLDVPS+ASTFEAFAKLLA LSS
Sbjct: 303  YSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLSS 362

Query: 890  SKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLPS 711
            SKEVRSV SLKM  SR+ KQVQQLNKSVWN+NS I S+T+LES ++ P      +  LPS
Sbjct: 363  SKEVRSVFSLKMPRSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPT--TRIENYLPS 420

Query: 710  STPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +T SFLYAT +   GK+E L +GWD+VEHP FPPPPS++ED+EHWTRAMF DA +K
Sbjct: 421  ATASFLYATDS--DGKNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474


>gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]
          Length = 478

 Score =  641 bits (1653), Expect = 0.0
 Identities = 317/477 (66%), Positives = 383/477 (80%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            NRK+ SC +CE SNL SIC++CVN  L ++Y +LKS +  RD LYSRL + L+AKGKAD+
Sbjct: 2    NRKSTSCALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            Q+ WR+ QNEK+  L+EK  R++E+L  GKAK+E+M  DLKV+ G+L+ +++MLE +R+E
Sbjct: 62   QVGWRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENNRME 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK YPN ICTQ+LGHMAITS+RLHKQSV++KQICKLFP R V +D ++K+G   QYD 
Sbjct: 122  QLEKFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQYDQ 181

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICNARLPRG++PHSV SEEL ASLGYM+QLLNL+V  L  PALHNSGFAGS SR+WQRDS
Sbjct: 182  ICNARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQRDS 241

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YWDARPSSRSNEYPLFIPRQN CST  EN                  RK RL+ SGS SF
Sbjct: 242  YWDARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGSNSF 301

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N+S ASPHS+ETH DLQKGI+LLKKS+AC+T YCYNSLCLDVPS+ASTFEAFAKLLA LS
Sbjct: 302  NYSSASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLATLS 361

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKE+RSV S+K ACSRS+KQVQQLNKSVWNVNS   S+T+L+S +   +M N+ + NLP
Sbjct: 362  SSKELRSVCSIKSACSRSNKQVQQLNKSVWNVNSAFASTTLLDSAHTVASMKNIGENNLP 421

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +   SFLYAT + DAGK+E + +GWDL+EHP FPPPPS+ ED+EHWTRAMF DATKK
Sbjct: 422  NPATSFLYATES-DAGKNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDATKK 477


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  621 bits (1602), Expect = e-175
 Identities = 311/486 (63%), Positives = 378/486 (77%), Gaps = 14/486 (2%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            RK+ +C ICE  N  SIC+VCVN  LNEY + LKSL+ RRD LYS+L++ LV KGK D+Q
Sbjct: 3    RKSTNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKGDDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             NWR++++EK+   +EKL   +EQ+  G+AK++ MS+DLK++YG+L+ + +MLEK+RVEQ
Sbjct: 63   TNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            LEK YPNLICTQSLGH+AITS+RLHKQSV++KQICKLFPQR V ++ +K D  +GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQYDQI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CNARLPR L+PHSVPSEEL+ASLGYM+QLLNLV H+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSR-------------SNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXR 1110
            WDARPSSR             SNEYPLFIPRQN CST GEN                  R
Sbjct: 243  WDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASMESDR 302

Query: 1109 KPRLEPSGSISFNFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDAST 930
            +PRL+ SGS SFN+S AS HS+++H DLQKGI+LLKKS+AC+TAYCYNSLC D+PS+AST
Sbjct: 303  RPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSEAST 362

Query: 929  FEAFAKLLAILSSSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQ 750
            FEAFAKLLA LSSSKEVRSV SLKMA SR+ KQVQQLNKSVWN+NS   S+T+LES ++ 
Sbjct: 363  FEAFAKLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSANSSTTLLESTHSV 422

Query: 749  PAMINMCDRNLPSSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTR 570
            P      +  +P+S  SFLY T + D  K E L +GWD+VEHP  PPPPS++ED+EHWTR
Sbjct: 423  PT--TRIENYMPNSAASFLYPTDSSDR-KSECLIEGWDIVEHPTLPPPPSQSEDVEHWTR 479

Query: 569  AMFTDA 552
            AMF DA
Sbjct: 480  AMFIDA 485


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  612 bits (1577), Expect = e-172
 Identities = 304/477 (63%), Positives = 374/477 (78%), Gaps = 2/477 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            +++ +C ICE +N ASIC+VCVN  L EY TLLKSL+ RRD LYS+L++ L AKGKAD+Q
Sbjct: 3    KRSSNCAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKADDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             NW+++QNEK+ GLK  L R +EQ+  GKAK+E+ S DLK++YG+LD +++ LE+ RVEQ
Sbjct: 63   KNWKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            +EK +PNLICTQSLGHMAI+S+RLHKQSV++KQ+CKLFPQR V  D + ++G  GQY++I
Sbjct: 123  VEKYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CN+RLP+GL+PHS+PSEELAASLG M+QLLNLVVH+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNSRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGENXXXXXXXXXXXXXXXXXRK--PRLEPSGSISF 1074
            WDARPS+RSNEYPLFIPRQN CST  EN                  +   RL+ +G  SF
Sbjct: 243  WDARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRNSF 302

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N+S ASPHS+E+H DLQKGIALLKKS+AC+TAYCYNSLCL+VP +ASTFEAFAKLLA LS
Sbjct: 303  NYSSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLS 362

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMA SRS KQ QQLNKS+WN +S+I SS +  S   + A  N      P
Sbjct: 363  SSKEVRSVFSLKMASSRSCKQAQQLNKSIWNAHSVISSSILESSHLPRNASYNQD----P 418

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +S  S+L  T   +  K   + +GWDLVEHPK+PPPPS++ED+EHWTRAMF DA KK
Sbjct: 419  NSAASYLSGTELSEIRKSNDM-NGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 474


>ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217421 [Cucumis sativus]
            gi|449524750|ref|XP_004169384.1| PREDICTED:
            uncharacterized LOC101217421 [Cucumis sativus]
          Length = 476

 Score =  604 bits (1558), Expect = e-170
 Identities = 310/477 (64%), Positives = 371/477 (77%), Gaps = 1/477 (0%)
 Frame = -3

Query: 1970 NRKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADE 1791
            NRK C+C ICE SN ASICT CVN  LN+Y + LKSLR RRD LYSRL+D LVAKGKAD+
Sbjct: 2    NRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADD 61

Query: 1790 QLNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVE 1611
            QLNWR+ +NEK+  L+EKL R+REQL  GKA++E  S DL+++Y +L+ ++++LEK R+E
Sbjct: 62   QLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLESARSVLEKQRLE 121

Query: 1610 QLEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDV 1431
            QLEK+YP+LI T++LGHMAITS+RLHKQSV++KQ+CKLFPQR V V  +K+ G    +D 
Sbjct: 122  QLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQ 181

Query: 1430 ICNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDS 1251
            ICN  LPR L+PHSV   EL+ASLGYM+QLLNLVV  L  PALH SGFAGSCSR+WQRDS
Sbjct: 182  ICNVSLPRSLDPHSVEPYELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDS 241

Query: 1250 YWDARPSSRSNEYPLFIPRQNCCSTGGEN-XXXXXXXXXXXXXXXXXRKPRLEPSGSISF 1074
            YW+A PSSRSNEYP+F+PRQ+ CST GEN                  RKP+L    + SF
Sbjct: 242  YWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRSF 301

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
            N+S ASPHS+E+H DLQKGIALLKKS+ACVTAY YNSL LDVPS+ASTFEAFAKLLA LS
Sbjct: 302  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLS 361

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMA SRS+K +Q+  KS WNVNS I SS + ESG++Q    N  + NLP
Sbjct: 362  SSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNS-IASSMLFESGHSQIMKTNY-ESNLP 419

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            SS  S+LYAT   D GK++S  +GWDLVEHP FPPPPS+ EDIEHWTRAM  DATK+
Sbjct: 420  SSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ 476


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  594 bits (1531), Expect = e-167
 Identities = 300/477 (62%), Positives = 364/477 (76%), Gaps = 2/477 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            +++ +C IC+ +N   ICT CVN  L EY TLLKSL+ RRD L SR  + L +KGKAD+Q
Sbjct: 3    KRSSNCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKADDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             NWR++QNEKI  LK+KL   +E +  GK K+E+ SSDLKV+YG+LD +++ LEK RVEQ
Sbjct: 63   KNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            +EK +PNLICTQSLGHMAI+S+RLHKQSV+VKQICKLFP R V  D + ++G   QYDVI
Sbjct: 123  VEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYDVI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CN+RLP GL+PHS+PSEELA SLGYM+QLLNLVVH+L  PALH+SGFAGSCSR+WQRDSY
Sbjct: 183  CNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGENXXXXXXXXXXXXXXXXXRK--PRLEPSGSISF 1074
            WD R S+RSNEYPLFIPR+N CST  EN                  +  PRL+  GS SF
Sbjct: 243  WDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSNSF 302

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
             +S ASPHS+E+H DLQKGIALLKKS+AC+TAYCYNSLCL+VP +ASTFEAFAKLLA LS
Sbjct: 303  KYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLS 362

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMA SRS KQ QQLNKS+WN +S+I SS++LES +         D N P
Sbjct: 363  SSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSVI-SSSLLESAHLPRNTSYNQDPNSP 421

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +S     Y + T  + +  +  +GWDLVEHPK+PPPPS++ED+EHWTRAMF DA KK
Sbjct: 422  AS-----YLSATELSTRKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  593 bits (1529), Expect = e-166
 Identities = 300/477 (62%), Positives = 364/477 (76%), Gaps = 2/477 (0%)
 Frame = -3

Query: 1967 RKACSCGICECSNLASICTVCVNSSLNEYYTLLKSLRRRRDQLYSRLTDKLVAKGKADEQ 1788
            +++ +C IC+ +N   ICT CVN  L EY TLLKSL+ RRD L SR  + L +KGKAD+Q
Sbjct: 3    KRSSNCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKADDQ 62

Query: 1787 LNWRIVQNEKIFGLKEKLCRTREQLNHGKAKLEKMSSDLKVRYGLLDFSKNMLEKHRVEQ 1608
             NWR++QNEKI  LK+KL   +E +  GK K+E+ SSDLKV+YG+LD +++ LEK RVEQ
Sbjct: 63   KNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQ 122

Query: 1607 LEKSYPNLICTQSLGHMAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGFNGQYDVI 1428
            +EK +PNLICTQSLGHMAI+S+RLHKQSV+VKQICKLFP R V  D + ++G   QYDVI
Sbjct: 123  VEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYDVI 182

Query: 1427 CNARLPRGLNPHSVPSEELAASLGYMIQLLNLVVHSLCLPALHNSGFAGSCSRLWQRDSY 1248
            CN+RLP GL+PHS+PSEELA SLGYM+QLLNLVVH+L  PALH+SGFAGSCSR+WQRDSY
Sbjct: 183  CNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSY 242

Query: 1247 WDARPSSRSNEYPLFIPRQNCCSTGGENXXXXXXXXXXXXXXXXXRK--PRLEPSGSISF 1074
            WD R S+RSNEYPLFIPR+N CST  EN                  +  PRL+  GS SF
Sbjct: 243  WDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSNSF 302

Query: 1073 NFSCASPHSMETHNDLQKGIALLKKSIACVTAYCYNSLCLDVPSDASTFEAFAKLLAILS 894
             +S ASPHS+E+H DLQKGIALLKKS+AC+TAYCYNSLCL+VP +ASTFEAFAKLLA LS
Sbjct: 303  MYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLS 362

Query: 893  SSKEVRSVMSLKMACSRSSKQVQQLNKSVWNVNSMIGSSTMLESGNAQPAMINMCDRNLP 714
            SSKEVRSV SLKMA SRS KQ QQLNKS+WN +S+I SS++LES +         D N P
Sbjct: 363  SSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSVI-SSSLLESAHLPRNTSYNQDPNSP 421

Query: 713  SSTPSFLYATGTVDAGKHESLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 543
            +S     Y + T  + +  +  +GWDLVEHPK+PPPPS++ED+EHWTRAMF DA KK
Sbjct: 422  AS-----YLSATELSTRKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473

