BLASTX nr result

ID: Achyranthes23_contig00018112 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Achyranthes23_contig00018112
         (1775 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   531   0.0  
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   521   0.0  
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   541   0.0  
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   516   0.0  
gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus pe...   534   e-180
ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   520   e-180
gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 ...   517   e-180
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   517   e-179
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     513   e-176
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   520   e-176
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   511   e-176
gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus...   515   e-175
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   513   e-175
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   516   e-175
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   501   e-174
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   504   e-170
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   504   e-166
ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217...   488   e-164
ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido...   486   e-160
gb|AAL59980.1| unknown protein [Arabidopsis thaliana]                 486   e-160

>ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis]
            gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase
            II, putative [Ricinus communis]
          Length = 478

 Score =  531 bits (1368), Expect(2) = 0.0
 Identities = 267/365 (73%), Positives = 309/365 (84%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            N+K  CC IC+ SN ASICT+CVN  LNEY T LKS ++ RD LYSRL++ LVAKGKAD+
Sbjct: 2    NKKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            QLNWR+ QNEKL  ++EKL R++EQL   KAK EKMSSDL  +YGLLE S++ALEK+RV+
Sbjct: 62   QLNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRVD 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK +PN ICTQSLGH AITS+ LH  SV VKQICKLFPQR V V+ +KKDG +GQYD 
Sbjct: 122  QLEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYDQ 181

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICNARLPRGL+PHSIPSEELAASLG MVQLLNLVVH+L  PALHNSGFAGSCSR+WQRDS
Sbjct: 182  ICNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDS 241

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YW+ARPS+RSNEYPLFIPRQ   STSGENSW++R SS+F V S+ESER+ +L+ S S SF
Sbjct: 242  YWNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSSF 301

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +Y+ ASPHS E H DLQKGI+L+KKSVACVTAY YN L LDVP++ASTFEAFAKLLATLS
Sbjct: 302  NYNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATLS 361

Query: 1188 SSKEL 1202
            SSKE+
Sbjct: 362  SSKEV 366



 Score =  145 bits (367), Expect(2) = 0.0
 Identities = 67/101 (66%), Positives = 79/101 (78%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQ+LNKSVWNVNS+I SST++E  H+     N  D N  +SA SFL+AN+  D  
Sbjct: 378  RSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNLRNSATSFLFANEISDAG 437

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+L DGWDLVEHP FPPPPS+TED+EHWTRAMF DATKK
Sbjct: 438  KNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478


>ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus
            sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED:
            uncharacterized protein LOC102626964 isoform X2 [Citrus
            sinensis]
          Length = 478

 Score =  521 bits (1341), Expect(2) = 0.0
 Identities = 256/365 (70%), Positives = 306/365 (83%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            N+K   C IC+ SN ASIC  CVN  L+E  T LKS ++ RD LY RL++ LVAKGKAD+
Sbjct: 2    NKKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            QLNWR+ QNEKL  ++EKL R++EQLS GK K+EK S DLKVRY +L+ +++ +EK+R E
Sbjct: 62   QLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNRAE 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK YPN ICTQSLGH AI S+ LHKQSV++KQICKLFPQR V +D +++DG +GQYD 
Sbjct: 122  QLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQ 181

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            IC ARLP+GL+PHS+PSEELAASLG MVQLLNLVV +L +P LHNSGFAGSCSR+WQRDS
Sbjct: 182  ICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQRDS 241

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YWDARPS+RSNEYPLFIPRQN  STSGENSW++R SS+F V S+ESER+P+L+ S S SF
Sbjct: 242  YWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSTSF 301

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +Y+ AS HS E H DLQKGI+LLKKSVAC+TAYCYNSL LDVP++ASTFEAFAKLLATLS
Sbjct: 302  NYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLS 361

Query: 1188 SSKEL 1202
            SSKE+
Sbjct: 362  SSKEV 366



 Score =  146 bits (369), Expect(2) = 0.0
 Identities = 67/101 (66%), Positives = 78/101 (77%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQ+LN+SVWN+NS I S+T+LE  H  P   N  D N PSSA SFLYA +  D+ 
Sbjct: 378  RSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNLPSSAASFLYATEMSDIG 437

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+L DGWDLVEHP FPPPPS+TED+EHWTRAM  DATKK
Sbjct: 438  KNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca
            subsp. vesca]
          Length = 478

 Score =  541 bits (1394), Expect(2) = 0.0
 Identities = 269/366 (73%), Positives = 312/366 (85%), Gaps = 1/366 (0%)
 Frame = +3

Query: 108  TNRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKAD 287
            TN+K   C IC+ SNLASIC +CVN  LN+Y  +LK+ ++ RD LYSRL+D LVAKGKAD
Sbjct: 2    TNKKSSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKAD 61

Query: 288  EQLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRV 467
            +QLNWRI Q+EKL+ ++EKL R +EQL  GKAK+EK S DLKV+YG+LE + + LEK+R 
Sbjct: 62   DQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNRA 121

Query: 468  EQLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYD 647
            EQLEK YPN ICTQSLGH AITS+RLHKQSV++KQICKLFPQR V VD  +K+G  GQYD
Sbjct: 122  EQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQYD 181

Query: 648  VICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRD 827
             ICNA LPRGL+PHS+PSEELAASLG MVQLLNLVV +L  PALHNSGFAGSCSR+WQRD
Sbjct: 182  QICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQRD 241

Query: 828  SYWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSIS 1004
            SYWDARPS+RSNEYPLFIPRQN  STSGENSWS+R SS+F V S+ESERKP+L+ SGS S
Sbjct: 242  SYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSSS 301

Query: 1005 FSYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATL 1184
            F+YS AS HS E H DLQ+GI+LLKKSVAC+TAYCYNSL LDVPS+ASTFEAFAKLL+TL
Sbjct: 302  FNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLSTL 361

Query: 1185 SSSKEL 1202
            SSSKE+
Sbjct: 362  SSSKEV 367



 Score =  122 bits (305), Expect(2) = 0.0
 Identities = 59/101 (58%), Positives = 72/101 (71%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQQLNKSVWNVNS I S+T+L+  H+     N  + N P+ A SFL + +  DV 
Sbjct: 379  RSCKQVQQLNKSVWNVNSAISSTTLLDSAHTMTMTKNFYENNIPNYATSFLSSTEMSDVG 438

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E   +GWDLVEHP   PPPS++EDIEHWTRAMF D TK+
Sbjct: 439  KNECTIEGWDLVEHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478


>ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina]
            gi|567883029|ref|XP_006434073.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883031|ref|XP_006434074.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|567883033|ref|XP_006434075.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536194|gb|ESR47312.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536195|gb|ESR47313.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536196|gb|ESR47314.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
            gi|557536197|gb|ESR47315.1| hypothetical protein
            CICLE_v10001001mg [Citrus clementina]
          Length = 478

 Score =  516 bits (1329), Expect(2) = 0.0
 Identities = 254/365 (69%), Positives = 304/365 (83%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            N+K   C IC+ SN ASIC  CVN  L+E  T LKS ++ RD LY RL++ LVAKGKAD+
Sbjct: 2    NKKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            QLNWR+ QNEKL  ++EKL R++EQLS GK K+EK S DLK RY +L+ +++ +EK+R E
Sbjct: 62   QLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNRAE 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK YPN ICTQSLGH AI S+ LHKQSV++KQICKLFPQR V +D +++DG +GQYD 
Sbjct: 122  QLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYDQ 181

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            IC ARLP+GL+PHS+PSEELAASLG MVQLLNLVV +L +P LHNSGFAGSCSR+WQRDS
Sbjct: 182  ICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQRDS 241

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YWDARPS+RSNEYPLFIPRQN  STSGENSW++R SS+F V S+ESER+P+L+ S S SF
Sbjct: 242  YWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSASF 301

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +Y+ AS HS E H DLQKGI+LLKKSVAC+TAYCYNSL LDVP++ASTFEAFAKLLATLS
Sbjct: 302  NYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATLS 361

Query: 1188 SSKEL 1202
             SKE+
Sbjct: 362  LSKEV 366



 Score =  146 bits (369), Expect(2) = 0.0
 Identities = 67/101 (66%), Positives = 78/101 (77%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQ+LN+SVWN+NS I S+T+LE  H  P   N  D N PSSA SFLYA +  D+ 
Sbjct: 378  RSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNLPSSAASFLYATEMSDIG 437

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+L DGWDLVEHP FPPPPS+TED+EHWTRAM  DATKK
Sbjct: 438  KNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478


>gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica]
            gi|462422646|gb|EMJ26909.1| hypothetical protein
            PRUPE_ppa005050mg [Prunus persica]
          Length = 479

 Score =  534 bits (1375), Expect(2) = e-180
 Identities = 265/365 (72%), Positives = 307/365 (84%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            NRK   C IC+ SNLAS+C +CVN  L EY ++LK+ ++ RD LYSRLT+ LVAKGKAD+
Sbjct: 3    NRKSSNCAICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKADD 62

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            QLNWR+ QNEKL+ ++EKL   +EQL  GKAK+EK S DLKV+ G+LE +   LEK+R E
Sbjct: 63   QLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNRAE 122

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK YPNFICTQ+LGH AITS+RLHKQSV++KQICKLFPQR V VD  +KD   GQYD 
Sbjct: 123  QLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQYDQ 182

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICNA LPRGL+PHS+PSEELAASLG MVQLLNLVV +L  PALHNSGFAGSCSR+WQRDS
Sbjct: 183  ICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQRDS 242

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YWDARPS+RSNEYPLFIPRQN  STSGENSWS+R SS+F V S++SERKP L+ SGS SF
Sbjct: 243  YWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSSSF 302

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +Y+ AS HS E H DLQ+GI+LLKKSVAC+TAYCYNSL LDVPS+ASTFEAFAKLLATLS
Sbjct: 303  NYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLS 362

Query: 1188 SSKEL 1202
            SSKE+
Sbjct: 363  SSKEV 367



 Score =  128 bits (321), Expect(2) = e-180
 Identities = 60/101 (59%), Positives = 74/101 (73%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQQLNKSVWNVNS I S+T+L+  H+     N  + N P+ A S L + +  D  
Sbjct: 379  RSCKQVQQLNKSVWNVNSAISSTTLLDSAHAMTMTKNLYEYNLPTYATSSLCSTELSDSG 438

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+L +GWDLVEHP FPPPPS++EDIEHWTRAMF DA +K
Sbjct: 439  KNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479


>ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera]
            gi|302141899|emb|CBI19102.3| unnamed protein product
            [Vitis vinifera]
          Length = 478

 Score =  520 bits (1340), Expect(2) = e-180
 Identities = 257/364 (70%), Positives = 306/364 (84%), Gaps = 1/364 (0%)
 Frame = +3

Query: 114  RKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQ 293
            RK   C IC+ SNLASIC +CVN  LNEY T+LKSS+  RD LY RL++ LVAKGKAD+Q
Sbjct: 3    RKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKADDQ 62

Query: 294  LNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQ 473
            +NWR+ QNEKL  ++EKL   +EQ  +GKAK+EKMS+DLK++YGLLE + + LEK+RVEQ
Sbjct: 63   INWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRVEQ 122

Query: 474  LEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVI 653
            LEK YPN ICTQ+LG  AITS+R HKQSV++KQICKLFPQR V +D +KKDG +  YD I
Sbjct: 123  LEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYDQI 182

Query: 654  CNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSY 833
            CN RLPR L+PHS+PS+ELAASLG MVQLLNLVV++L  PALHNSGFAGSCSR+WQR+SY
Sbjct: 183  CNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRESY 242

Query: 834  WDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISFS 1010
            W+ RPS+RSNEYPLFIPRQN  ST+GENSWSER SS+F + S+ES+RKP+LE SGS SF+
Sbjct: 243  WNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSSFN 302

Query: 1011 YSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLSS 1190
            YS AS HS E H DLQKGI+LLKKSVAC+T YCY+SL LDVP++ASTFEAFAKLLA LSS
Sbjct: 303  YSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAILSS 362

Query: 1191 SKEL 1202
            SKE+
Sbjct: 363  SKEV 366



 Score =  140 bits (354), Expect(2) = e-180
 Identities = 65/101 (64%), Positives = 76/101 (75%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQQLNKS+WN+NS I SST+LE  H+ P   N  D N P+SA SFLY  +  D+ 
Sbjct: 378  RSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNLPNSAASFLYTTEMSDIG 437

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+L + WDLVEH  FPPPPS+TEDIEHWTRAM  DATKK
Sbjct: 438  KNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478


>gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao]
          Length = 479

 Score =  517 bits (1331), Expect(2) = e-180
 Identities = 254/365 (69%), Positives = 309/365 (84%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            ++K   C ICD SN ASIC +CVN  LNEY + LKS ++ RD LYS+L + L AK KAD+
Sbjct: 3    SKKASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKADD 62

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            QLNW+I QNEKL  +KEKL R++EQL+ GKAK+E++S DLKV+YG+LE ++  LEK+RVE
Sbjct: 63   QLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNRVE 122

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            +LEK YPN ICTQSLG  AITS+RLHKQSV++KQICKLFPQR V +D + +DG  GQYD+
Sbjct: 123  KLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQYDL 182

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICN  LPRGL+PHS+PSE+LAASLG MVQLLNLVVH+L  PALHNSGFAGSCSR+WQRDS
Sbjct: 183  ICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDS 242

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YW+ARPS+RSNEYPLFIPRQN  STSG+NSW++R SS+F V S+ESER+P+L+ SGS SF
Sbjct: 243  YWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSNSF 302

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +YS AS H+ E H DLQ GI+LLKKSVAC+TA+CYNSL LDVP++ASTFEAF+KLLATLS
Sbjct: 303  NYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLATLS 362

Query: 1188 SSKEL 1202
            S+KE+
Sbjct: 363  STKEV 367



 Score =  143 bits (360), Expect(2) = e-180
 Identities = 66/101 (65%), Positives = 77/101 (76%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RSSKQ QQLNKSVWNVNS + SS +LE  H  P   N  D N PSSA SFL+A +  D+ 
Sbjct: 379  RSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLPLTKNLSDHNLPSSAASFLFATEMPDIG 438

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+L + WDLVEHP FPPPPS+TED+EHWTRAMF DATK+
Sbjct: 439  KNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKR 479


>ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|566157047|ref|XP_006386388.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|566157050|ref|XP_006386389.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|222843996|gb|EEE81543.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344610|gb|ERP64185.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
            gi|550344611|gb|ERP64186.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 475

 Score =  517 bits (1331), Expect(2) = e-179
 Identities = 254/365 (69%), Positives = 301/365 (82%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            N+K  CC IC+ SN ASIC +CVN  LNEY T LKS  + RD LYS+L+  L+AKGKAD+
Sbjct: 2    NKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            Q NWR+ QNEKL   +EKLHR +EQL+ GKAK+EK+S DLK + G+LE ++N LEK+R+E
Sbjct: 62   QFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRME 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK YPN ICTQSLGH AITS+ LHKQSV++KQICKLFPQR V VD ++   ++GQYD 
Sbjct: 122  QLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQ 179

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICNARLPRGL+PHS+ SEELAASLG MVQLLNLV H+L  P LHN+GFAGSCSR+WQRDS
Sbjct: 180  ICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDS 239

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YW+A PS+RSNEYPLFIPRQN  STS ENSW+++ SS+F V S+ESER+P L+ + S SF
Sbjct: 240  YWNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNSF 299

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +YS  SPHS E H DLQKG++LLKKSVACVTAYCYN L LDVPSD STFEAFAKLL+TLS
Sbjct: 300  NYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLS 359

Query: 1188 SSKEL 1202
            SSKE+
Sbjct: 360  SSKEV 364



 Score =  140 bits (353), Expect(2) = e-179
 Identities = 68/101 (67%), Positives = 76/101 (75%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQ+LNKSVWNVNS I SS +LE  H+   M NT D N P+SA SFL+A    D  
Sbjct: 376  RSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-G 434

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+  DGWDLVEHP FPPPPS+ EDIEHWTRAMF DATKK
Sbjct: 435  KNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475


>gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]
          Length = 478

 Score =  513 bits (1320), Expect(2) = e-176
 Identities = 253/365 (69%), Positives = 305/365 (83%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            NRK   C +C+ SNL SIC++CVN  L ++Y  LKS+++HRD LYSRL + L+AKGKAD+
Sbjct: 2    NRKSTSCALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            Q+ WR++QNEKL  ++EK  R++E+L  GKAK+E+M  DLKV+ G+LE +++ LE +R+E
Sbjct: 62   QVGWRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENNRME 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK YPNFICTQ+LGH AITS+RLHKQSV++KQICKLFP R V +D ++K+G   QYD 
Sbjct: 122  QLEKFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQYDQ 181

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICNARLPRG++PHS+ SEEL ASLG MVQLLNL+V  L  PALHNSGFAGS SR+WQRDS
Sbjct: 182  ICNARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQRDS 241

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YWDARPS+RSNEYPLFIPRQN  STS ENSWS+R SS+F V S+ESERK +L+ SGS SF
Sbjct: 242  YWDARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGSNSF 301

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +YS ASPHS E H DLQKGI+LLKKSVAC+T YCYNSL LDVPS+ASTFEAFAKLLATLS
Sbjct: 302  NYSSASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLATLS 361

Query: 1188 SSKEL 1202
            SSKEL
Sbjct: 362  SSKEL 366



 Score =  136 bits (342), Expect(2) = e-176
 Identities = 62/101 (61%), Positives = 78/101 (77%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS+KQVQQLNKSVWNVNS   S+T+L+  H+  +M N  + N P+ A SFLYA ++ D  
Sbjct: 378  RSNKQVQQLNKSVWNVNSAFASTTLLDSAHTVASMKNIGENNLPNPATSFLYATES-DAG 436

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E + +GWDL+EHP FPPPPS+ ED+EHWTRAMF DATKK
Sbjct: 437  KNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDATKK 477


>ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine
            max]
          Length = 475

 Score =  520 bits (1339), Expect(2) = e-176
 Identities = 253/364 (69%), Positives = 306/364 (84%), Gaps = 1/364 (0%)
 Frame = +3

Query: 114  RKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQ 293
            RK   C IC+ SN ASIC++CVN  LNEY T+LK  +  RD LY +L++ LV KGK D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGDDQ 62

Query: 294  LNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQ 473
             NWR+ Q+EKL  +KEKL +++EQ++ G+AK+E MS+DLK++YGLLE + + LEK+RVEQ
Sbjct: 63   ANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRVEQ 122

Query: 474  LEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVI 653
            LEK YPN ICTQSLGH AITS+ LHK+SV++KQICKLFPQR V ++ +++DG +GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYDQI 182

Query: 654  CNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSY 833
            CNARLPR L+PHS+PSEEL+ SLG MVQLLNLV+H+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 834  WDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISFS 1010
            WDARPS+RSNEYPLFIPRQN  ST GENSWSER SS+F V SVESER+ +L+ SGS SF+
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTSFN 302

Query: 1011 YSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLSS 1190
            YS AS HS + H DLQKGI+LLKKSV C+TAYCYNSL LDVPS+ASTFEAFAKLLATL+S
Sbjct: 303  YSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATLAS 362

Query: 1191 SKEL 1202
            SKE+
Sbjct: 363  SKEV 366



 Score =  127 bits (320), Expect(2) = e-176
 Identities = 65/102 (63%), Positives = 77/102 (75%), Gaps = 1/102 (0%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNF-PSSAPSFLYANDTIDV 1438
            R+ KQVQQLNKSVWN+NS I S+T+LE  HS P    TR +N+ PSS  SFLYA D  D 
Sbjct: 378  RTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPT---TRIENYLPSSTGSFLYAADLSD- 433

Query: 1439 RKHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
             K+E L +GWD+VEHP FPPPPS++ED+EHWTRAMF DA  K
Sbjct: 434  GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475


>ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum
            tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED:
            uncharacterized protein LOC102590673 isoform X2 [Solanum
            tuberosum]
          Length = 483

 Score =  511 bits (1317), Expect(2) = e-176
 Identities = 251/369 (68%), Positives = 302/369 (81%), Gaps = 7/369 (1%)
 Frame = +3

Query: 117  KPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQL 296
            K  CCGIC+ SNL S+CTLCVN  LNEY T LKS +  R+ L  +L++ L+AKGKAD+QL
Sbjct: 4    KTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKADDQL 63

Query: 297  NWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQL 476
            +WR+ +NEKL  ++EKL + +EQ+S GKAK+EKMS DLKV+Y LL  +   LEK+R EQL
Sbjct: 64   SWRVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQL 123

Query: 477  EKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVIC 656
            EK YPN ICTQ+LGH AITS+ LHKQSV+VKQICKLFPQR V +D DKKDG +GQYD IC
Sbjct: 124  EKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSIC 183

Query: 657  NARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSYW 836
            NARLP+GL+PHS+PS+EL+ASLG MVQLLNLV+  +C PALHNSGFAGSCSR+WQRDSYW
Sbjct: 184  NARLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQRDSYW 243

Query: 837  DARPSTRSNEYPLFIPRQNCFSTSGENSWSER-------SSHFEVPSVESERKPKLEPSG 995
            DARPS+RS EYPLFIPRQN  S+ GE SW +R       SS+F V S+ES+RKP+L+ S 
Sbjct: 244  DARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRLDSSS 303

Query: 996  SISFSYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLL 1175
            S SF+Y+ AS HS E H DLQKGIALLKKSVAC+TAYCYN+L L+VP++ASTFE FA+LL
Sbjct: 304  SSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARLL 363

Query: 1176 ATLSSSKEL 1202
            ATLSSSKE+
Sbjct: 364  ATLSSSKEV 372



 Score =  135 bits (339), Expect(2) = e-176
 Identities = 62/101 (61%), Positives = 77/101 (76%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            R+SKQVQ LNKSVWNV+S   SST++E GH  P + NT +   PSS+ + +YA +  D R
Sbjct: 384  RASKQVQPLNKSVWNVDSAGSSSTLMESGHV-PVLRNTFENALPSSSGNLIYATEVSDAR 442

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            ++ENL + WDL+EHP FPPPPS TED+EHWTRAMF DATKK
Sbjct: 443  RNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 483


>gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris]
            gi|561014256|gb|ESW13117.1| hypothetical protein
            PHAVU_008G169200g [Phaseolus vulgaris]
          Length = 476

 Score =  515 bits (1326), Expect(2) = e-175
 Identities = 253/365 (69%), Positives = 308/365 (84%), Gaps = 2/365 (0%)
 Frame = +3

Query: 114  RKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQ 293
            RK   C IC+ SN ASIC++CVN  LNEY T+LKS +  RD LYS+L++ LV KGK D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKGDDQ 62

Query: 294  LNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQ 473
             N+ + QNEKL  +KEKLHR++EQ++ G+AK+E +S+DLK +YGLLE + + LEK+RVEQ
Sbjct: 63   ENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNRVEQ 122

Query: 474  LEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVI 653
            LEK YPN ICTQSLGH AITS+RLHKQSV++KQICKLFPQR V ++ + +DG +GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQYDQI 182

Query: 654  CNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSY 833
            CNARLPR L+PHS+PSEEL+ASLG MVQLLNLVVH+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 834  WDARPSTRSNEYPLFIPRQNCFSTSGENSWS--ERSSHFEVPSVESERKPKLEPSGSISF 1007
            WDARPS+RSNEYPLFIPRQN  ST+GENSWS  + SS+F V S+ESE++ +L+ SG+ +F
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGNSNF 302

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +YS AS HS + H DLQKGI+LLKKSVAC+TAYCYNSL LD PS+ASTFE+FAKLLATLS
Sbjct: 303  NYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLATLS 362

Query: 1188 SSKEL 1202
            SSKE+
Sbjct: 363  SSKEV 367



 Score =  130 bits (327), Expect(2) = e-175
 Identities = 64/102 (62%), Positives = 79/102 (77%), Gaps = 1/102 (0%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNF-PSSAPSFLYANDTIDV 1438
            R+ KQVQQLNKSVWN+NS+I S+T+LE  HS P    TR +N+ PSS  SFLYA D  D 
Sbjct: 379  RTCKQVQQLNKSVWNMNSVISSTTLLESAHSVPT---TRIENYLPSSTASFLYATDLND- 434

Query: 1439 RKHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
             K+E L +GWD++EHP FPPPPS++ED+EHWTRAMF DA +K
Sbjct: 435  GKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 476


>ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum
            lycopersicum]
          Length = 481

 Score =  513 bits (1321), Expect(2) = e-175
 Identities = 253/370 (68%), Positives = 303/370 (81%), Gaps = 7/370 (1%)
 Frame = +3

Query: 114  RKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQ 293
            RK  CCGIC+ SNL S+CTLCVN  LNEY T LKS +  R+ L  +L++ L+AKGKAD+Q
Sbjct: 3    RKTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKADDQ 62

Query: 294  LNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQ 473
            L+WR+ +NEKL  ++EKL + +EQ+S GKAK+EKMS DLKV+Y LL  +   LEK+R EQ
Sbjct: 63   LSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQ 122

Query: 474  LEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVI 653
            LEK YPN ICTQ+LGH AITS+ LHKQSV+VKQICKLFPQR V +D DKKDG +GQYD I
Sbjct: 123  LEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSI 182

Query: 654  CNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSY 833
            CNARLP+GL+PHS+PS+EL+ASLG MVQLLNLVV  +C PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQRDSY 242

Query: 834  WDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-------SSHFEVPSVESERKPKLEPS 992
            WDARPS+RS EYPLFIPRQN  S+ GE SW +R       SS+F V S+ES+RKP+L+ S
Sbjct: 243  WDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRLDSS 302

Query: 993  GSISFSYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKL 1172
             S SF+Y+ AS HS E H DLQKGIALLKKSVAC+TAYCYN+L L+VP++ASTFE FA+L
Sbjct: 303  SSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFARL 362

Query: 1173 LATLSSSKEL 1202
            LATLSSSKE+
Sbjct: 363  LATLSSSKEV 372



 Score =  130 bits (327), Expect(2) = e-175
 Identities = 61/101 (60%), Positives = 76/101 (75%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            R+SKQVQ LNKSVWNV+S   SST++E GH      NT +K+ PSS  + +YA +  +V 
Sbjct: 384  RASKQVQPLNKSVWNVDSAGSSSTLMESGHVPR---NTFEKSLPSSGGNLMYATEVSNVG 440

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            ++ENL + WDL+EHP FPPPPS TED+EHWTRAMF DATKK
Sbjct: 441  RNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 481


>ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max]
          Length = 474

 Score =  516 bits (1329), Expect(2) = e-175
 Identities = 254/364 (69%), Positives = 304/364 (83%), Gaps = 1/364 (0%)
 Frame = +3

Query: 114  RKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQ 293
            RK   C IC+ SN ASIC++CVN  LNEY T+LK  +  RD LYS+L++ LV KGK D+Q
Sbjct: 3    RKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKGDDQ 62

Query: 294  LNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQ 473
             NWR+ Q+EKL  +KEKL + +EQ++ G+AK+E  S+DLK++YGLLE + + LEK+RVEQ
Sbjct: 63   ANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNRVEQ 122

Query: 474  LEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVI 653
            LEK YPN ICTQSLGH AITS+RLHKQSV++KQICKLFPQR V ++ ++ DG  GQ+D I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQFDQI 182

Query: 654  CNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSY 833
            CNARLPR L+P S+PSEEL+ SLG MVQLLNL+VH+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 834  WDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISFS 1010
            WDARPS+RSNEYPLFIPRQN  ST GENSWSER SS+F V S+ESER+ +L+ SGS SF+
Sbjct: 243  WDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSSSFN 302

Query: 1011 YSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLSS 1190
            YS AS HS + H DLQKGI+LLKKSVAC+TAYCYNSL LDVPS+ASTFEAFAKLLATLSS
Sbjct: 303  YSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATLSS 362

Query: 1191 SKEL 1202
            SKE+
Sbjct: 363  SKEV 366



 Score =  127 bits (318), Expect(2) = e-175
 Identities = 63/102 (61%), Positives = 78/102 (76%), Gaps = 1/102 (0%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNF-PSSAPSFLYANDTIDV 1438
            R+ KQVQQLNKSVWN+NS I S+T+LE  HS P    TR +N+ PS+  SFLYA D+   
Sbjct: 378  RTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPT---TRIENYLPSATASFLYATDSDG- 433

Query: 1439 RKHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
             K+E L +GWD+VEHP FPPPPS++ED+EHWTRAMF DA +K
Sbjct: 434  -KNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474


>ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa]
            gi|550344612|gb|ERP64187.1| hypothetical protein
            POPTR_0002s09150g [Populus trichocarpa]
          Length = 506

 Score =  501 bits (1289), Expect(2) = e-174
 Identities = 254/396 (64%), Positives = 301/396 (76%), Gaps = 32/396 (8%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            N+K  CC IC+ SN ASIC +CVN  LNEY T LKS  + RD LYS+L+  L+AKGKAD+
Sbjct: 2    NKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            Q NWR+ QNEKL   +EKLHR +EQL+ GKAK+EK+S DLK + G+LE ++N LEK+R+E
Sbjct: 62   QFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRME 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK YPN ICTQSLGH AITS+ LHKQSV++KQICKLFPQR V VD ++   ++GQYD 
Sbjct: 122  QLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYDQ 179

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICNARLPRGL+PHS+ SEELAASLG MVQLLNLV H+L  P LHN+GFAGSCSR+WQRDS
Sbjct: 180  ICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDS 239

Query: 831  YWDARPSTR-------------------------------SNEYPLFIPRQNCFSTSGEN 917
            YW+A PS+R                               SNEYPLFIPRQN  STS EN
Sbjct: 240  YWNACPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSEN 299

Query: 918  SWSER-SSHFEVPSVESERKPKLEPSGSISFSYSCASPHSTEIHNDLQKGIALLKKSVAC 1094
            SW+++ SS+F V S+ESER+P L+ + S SF+YS  SPHS E H DLQKG++LLKKSVAC
Sbjct: 300  SWTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVAC 359

Query: 1095 VTAYCYNSLGLDVPSDASTFEAFAKLLATLSSSKEL 1202
            VTAYCYN L LDVPSD STFEAFAKLL+TLSSSKE+
Sbjct: 360  VTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEV 395



 Score =  140 bits (353), Expect(2) = e-174
 Identities = 68/101 (67%), Positives = 76/101 (75%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQVQ+LNKSVWNVNS I SS +LE  H+   M NT D N P+SA SFL+A    D  
Sbjct: 407  RSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-G 465

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+E+  DGWDLVEHP FPPPPS+ EDIEHWTRAMF DATKK
Sbjct: 466  KNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 506


>ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula]
            gi|355516236|gb|AES97859.1| hypothetical protein
            MTR_5g061040 [Medicago truncatula]
          Length = 501

 Score =  504 bits (1299), Expect(2) = e-170
 Identities = 250/377 (66%), Positives = 304/377 (80%), Gaps = 14/377 (3%)
 Frame = +3

Query: 114  RKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQ 293
            RK   C IC+  N  SIC++CVN  LNEY ++LKS +  RD LYS+L++ LV KGK D+Q
Sbjct: 3    RKSTNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKGDDQ 62

Query: 294  LNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQ 473
             NWR+ ++EKL   +EKL   +EQ++ G+AK++ MS+DLK++YG+LE + + LEK+RVEQ
Sbjct: 63   TNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNRVEQ 122

Query: 474  LEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVI 653
            LEK YPN ICTQSLGH AITS+RLHKQSV++KQICKLFPQR V ++ +K D  +GQYD I
Sbjct: 123  LEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQYDQI 182

Query: 654  CNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSY 833
            CNARLPR L+PHS+PSEEL+ASLG MVQLLNLV H+L  PALHNSGFAGSCSR+WQRDSY
Sbjct: 183  CNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQRDSY 242

Query: 834  WDARPSTR-------------SNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESER 971
            WDARPS+R             SNEYPLFIPRQN  STSGENSWSE+ SS+F V S+ES+R
Sbjct: 243  WDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASMESDR 302

Query: 972  KPKLEPSGSISFSYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDAST 1151
            +P+L+ SGS SF+YS AS HS + H DLQKGI+LLKKSVAC+TAYCYNSL  D+PS+AST
Sbjct: 303  RPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSEAST 362

Query: 1152 FEAFAKLLATLSSSKEL 1202
            FEAFAKLLATLSSSKE+
Sbjct: 363  FEAFAKLLATLSSSKEV 379



 Score =  123 bits (309), Expect(2) = e-170
 Identities = 62/99 (62%), Positives = 75/99 (75%), Gaps = 1/99 (1%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNF-PSSAPSFLYANDTIDV 1438
            R+ KQVQQLNKSVWN+NS   S+T+LE  HS P    TR +N+ P+SA SFLY  D+ D 
Sbjct: 391  RTCKQVQQLNKSVWNMNSANSSTTLLESTHSVPT---TRIENYMPNSAASFLYPTDSSD- 446

Query: 1439 RKHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDA 1555
            RK E L +GWD+VEHP  PPPPS++ED+EHWTRAMF DA
Sbjct: 447  RKSECLIEGWDIVEHPTLPPPPSQSEDVEHWTRAMFIDA 485


>ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum]
            gi|557098297|gb|ESQ38733.1| hypothetical protein
            EUTSA_v10028627mg [Eutrema salsugineum]
          Length = 474

 Score =  504 bits (1297), Expect(2) = e-166
 Identities = 248/360 (68%), Positives = 304/360 (84%), Gaps = 2/360 (0%)
 Frame = +3

Query: 129  CGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQLNWRI 308
            C IC+ +N ASIC++CVN  L EY T LKS +T RD LYS+L++ L AKGKAD+Q NW++
Sbjct: 8    CAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKADDQKNWKL 67

Query: 309  AQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQLEKSY 488
             QNEKL G+K  L R +EQ++ GKAK+E+ S DLK++YG+L+ +++ LE+ RVEQ+EK +
Sbjct: 68   IQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRVEQVEKYF 127

Query: 489  PNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVICNARL 668
            PN ICTQSLGH AI+S+RLHKQSV++KQ+CKLFPQR V  D + ++G  GQY++ICN+RL
Sbjct: 128  PNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLICNSRL 187

Query: 669  PRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSYWDARP 848
            P+GL+PHSIPSEELAASLG MVQLLNLVVH+L  PALHNSGFAGSCSR+WQRDSYWDARP
Sbjct: 188  PKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWDARP 247

Query: 849  STRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERK-PKLEPSGSISFSYSCA 1022
            STRSNEYPLFIPRQN  STS ENSW+++ SS+F V S+ES+RK  +L+ +G  SF+YS A
Sbjct: 248  STRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRNSFNYSSA 307

Query: 1023 SPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLSSSKEL 1202
            SPHS E H DLQKGIALLKKSVAC+TAYCYNSL L+VP +ASTFEAFAKLLATLSSSKE+
Sbjct: 308  SPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSSKEV 367



 Score =  111 bits (277), Expect(2) = e-166
 Identities = 55/101 (54%), Positives = 73/101 (72%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQ QQLNKS+WN +S+I SS++LE  H        +D   P+SA S+L   +  ++R
Sbjct: 379  RSCKQAQQLNKSIWNAHSVI-SSSILESSHLPRNASYNQD---PNSAASYLSGTELSEIR 434

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K  ++ +GWDLVEHPK+PPPPS++ED+EHWTRAMF DA KK
Sbjct: 435  KSNDM-NGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 474


>ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217421 [Cucumis sativus]
            gi|449524750|ref|XP_004169384.1| PREDICTED:
            uncharacterized LOC101217421 [Cucumis sativus]
          Length = 476

 Score =  488 bits (1256), Expect(2) = e-164
 Identities = 246/365 (67%), Positives = 296/365 (81%), Gaps = 1/365 (0%)
 Frame = +3

Query: 111  NRKPCCCGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADE 290
            NRK C C IC+ SN ASICT CVN  LN+Y ++LKS R  RD LYSRL+D LVAKGKAD+
Sbjct: 2    NRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKADD 61

Query: 291  QLNWRIAQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVE 470
            QLNWR+ +NEKL  ++EKL R+REQL  GKA++E  S DL+++Y +LE +++ LEK R+E
Sbjct: 62   QLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLESARSVLEKQRLE 121

Query: 471  QLEKSYPNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDV 650
            QLEK+YP+ I T++LGH AITS+RLHKQSV++KQ+CKLFPQR V V  +K+ G    +D 
Sbjct: 122  QLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFDQ 181

Query: 651  ICNARLPRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDS 830
            ICN  LPR L+PHS+   EL+ASLG MVQLLNLVV  L  PALH SGFAGSCSR+WQRDS
Sbjct: 182  ICNVSLPRSLDPHSVEPYELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRDS 241

Query: 831  YWDARPSTRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERKPKLEPSGSISF 1007
            YW+A PS+RSNEYP+F+PRQ+  STSGENSWS++ SS+F V S+ESERKP+L    + SF
Sbjct: 242  YWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRSF 301

Query: 1008 SYSCASPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLS 1187
            +YS ASPHS E H DLQKGIALLKKSVACVTAY YNSL LDVPS+ASTFEAFAKLLATLS
Sbjct: 302  NYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATLS 361

Query: 1188 SSKEL 1202
            SSKE+
Sbjct: 362  SSKEV 366



 Score =  120 bits (301), Expect(2) = e-164
 Identities = 59/101 (58%), Positives = 72/101 (71%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS+K +Q+  KS WNVNS I SS + E GHSQ  M    + N PSSA S+LYA +  D  
Sbjct: 378  RSTKHIQKPIKSTWNVNS-IASSMLFESGHSQ-IMKTNYESNLPSSASSYLYATEFSDTG 435

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+++  +GWDLVEHP FPPPPS+ EDIEHWTRAM  DATK+
Sbjct: 436  KNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ 476


>ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana]
            gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis
            thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA
            polymerase II protein [Arabidopsis thaliana]
          Length = 473

 Score =  486 bits (1251), Expect(2) = e-160
 Identities = 245/360 (68%), Positives = 291/360 (80%), Gaps = 2/360 (0%)
 Frame = +3

Query: 129  CGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQLNWRI 308
            C ICD +N   ICT CVN  L EY T LKS +T RD L SR  + L +KGKAD+Q NWR+
Sbjct: 8    CAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKADDQKNWRL 67

Query: 309  AQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQLEKSY 488
             QNEK+  +K+KL   +E ++ GK K+E+ SSDLKV+YG+L+ +++ LEK RVEQ+EK +
Sbjct: 68   IQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQVEKYF 127

Query: 489  PNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVICNARL 668
            PN ICTQSLGH AI+S+RLHKQSV+VKQICKLFP R V  D + ++G   QYDVICN+RL
Sbjct: 128  PNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYDVICNSRL 187

Query: 669  PRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSYWDARP 848
            P GL+PHSIPSEELA SLG MVQLLNLVVH+L  PALH+SGFAGSCSR+WQRDSYWD R 
Sbjct: 188  PSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSYWDGRT 247

Query: 849  STRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERK-PKLEPSGSISFSYSCA 1022
            STRSNEYPLFIPR+N  STS ENSW+++ SS+F V S+ES+RK P+L+  GS SF YS A
Sbjct: 248  STRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSNSFKYSSA 307

Query: 1023 SPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLSSSKEL 1202
            SPHS E H DLQKGIALLKKSVAC+TAYCYNSL L+VP +ASTFEAFAKLLATLSSSKE+
Sbjct: 308  SPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSSKEV 367



 Score =  109 bits (272), Expect(2) = e-160
 Identities = 56/101 (55%), Positives = 73/101 (72%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQ QQLNKS+WN +S+I SS++LE  H      NT     P+S  S+L A + +  R
Sbjct: 379  RSGKQAQQLNKSIWNAHSVI-SSSLLESAHLPR---NTSYNQDPNSPASYLSATE-LSTR 433

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+ ++ +GWDLVEHPK+PPPPS++ED+EHWTRAMF DA KK
Sbjct: 434  KNNDM-NGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473


>gb|AAL59980.1| unknown protein [Arabidopsis thaliana]
          Length = 473

 Score =  486 bits (1250), Expect(2) = e-160
 Identities = 245/360 (68%), Positives = 291/360 (80%), Gaps = 2/360 (0%)
 Frame = +3

Query: 129  CGICDCSNLASICTLCVNSSLNEYYTTLKSSRTHRDQLYSRLTDKLVAKGKADEQLNWRI 308
            C ICD +N   ICT CVN  L EY T LKS +T RD L SR  + L +KGKAD+Q NWR+
Sbjct: 8    CAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKADDQKNWRL 67

Query: 309  AQNEKLLGMKEKLHRTREQLSNGKAKLEKMSSDLKVRYGLLEFSKNALEKHRVEQLEKSY 488
             QNEK+  +K+KL   +E ++ GK K+E+ SSDLKV+YG+L+ +++ LEK RVEQ+EK +
Sbjct: 68   IQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQVEKYF 127

Query: 489  PNFICTQSLGHKAITSDRLHKQSVIVKQICKLFPQRVVKVDNDKKDGYNGQYDVICNARL 668
            PN ICTQSLGH AI+S+RLHKQSV+VKQICKLFP R V  D + ++G   QYDVICN+RL
Sbjct: 128  PNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYDVICNSRL 187

Query: 669  PRGLNPHSIPSEELAASLGCMVQLLNLVVHSLCLPALHNSGFAGSCSRVWQRDSYWDARP 848
            P GL+PHSIPSEELA SLG MVQLLNLVVH+L  PALH+SGFAGSCSR+WQRDSYWD R 
Sbjct: 188  PSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSYWDGRT 247

Query: 849  STRSNEYPLFIPRQNCFSTSGENSWSER-SSHFEVPSVESERK-PKLEPSGSISFSYSCA 1022
            STRSNEYPLFIPR+N  STS ENSW+++ SS+F V S+ES+RK P+L+  GS SF YS A
Sbjct: 248  STRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSNSFMYSSA 307

Query: 1023 SPHSTEIHNDLQKGIALLKKSVACVTAYCYNSLGLDVPSDASTFEAFAKLLATLSSSKEL 1202
            SPHS E H DLQKGIALLKKSVAC+TAYCYNSL L+VP +ASTFEAFAKLLATLSSSKE+
Sbjct: 308  SPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSSKEV 367



 Score =  109 bits (272), Expect(2) = e-160
 Identities = 56/101 (55%), Positives = 73/101 (72%)
 Frame = +2

Query: 1262 RSSKQVQQLNKSVWNVNSMIGSSTMLEGGHSQPAMINTRDKNFPSSAPSFLYANDTIDVR 1441
            RS KQ QQLNKS+WN +S+I SS++LE  H      NT     P+S  S+L A + +  R
Sbjct: 379  RSGKQAQQLNKSIWNAHSVI-SSSLLESAHLPR---NTSYNQDPNSPASYLSATE-LSTR 433

Query: 1442 KHENLFDGWDLVEHPKFPPPPSETEDIEHWTRAMFTDATKK 1564
            K+ ++ +GWDLVEHPK+PPPPS++ED+EHWTRAMF DA KK
Sbjct: 434  KNNDM-NGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473

