BLASTX nr result

ID: Catharanthus23_contig00009590 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009590
         (1990 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343336.1| PREDICTED: uncharacterized protein LOC102605...   563   e-158
ref|XP_006343335.1| PREDICTED: uncharacterized protein LOC102605...   563   e-158
ref|XP_006340829.1| PREDICTED: uncharacterized protein LOC102592...   561   e-157
ref|XP_004234515.1| PREDICTED: uncharacterized protein LOC101265...   559   e-156
ref|XP_004232557.1| PREDICTED: uncharacterized protein LOC101254...   556   e-155
ref|XP_002521705.1| poly(A) polymerase, putative [Ricinus commun...   517   e-144
gb|EMJ04079.1| hypothetical protein PRUPE_ppa015787mg, partial [...   514   e-143
gb|EOX90681.1| Polynucleotide adenylyltransferase family protein...   497   e-138
ref|XP_006467069.1| PREDICTED: uncharacterized protein LOC102627...   495   e-137
ref|XP_006425301.1| hypothetical protein CICLE_v10025049mg [Citr...   491   e-136
ref|XP_004287737.1| PREDICTED: uncharacterized protein LOC101302...   477   e-132
ref|XP_002307206.2| hypothetical protein POPTR_0005s10310g [Popu...   471   e-130
ref|XP_003516528.1| PREDICTED: uncharacterized protein LOC100794...   451   e-124
ref|XP_004146209.1| PREDICTED: uncharacterized protein LOC101212...   443   e-121
gb|ESW29983.1| hypothetical protein PHAVU_002G115200g [Phaseolus...   437   e-120
ref|XP_006828719.1| hypothetical protein AMTR_s00001p00028400 [A...   420   e-114
ref|XP_006299875.1| hypothetical protein CARUB_v10016083mg [Caps...   416   e-113
ref|XP_003612324.1| Poly(A) polymerase [Medicago truncatula] gi|...   415   e-113
ref|NP_179349.2| polynucleotide adenylyltransferase family prote...   413   e-112
ref|XP_002884069.1| polynucleotide adenylyltransferase family pr...   408   e-111

>ref|XP_006343336.1| PREDICTED: uncharacterized protein LOC102605830 isoform X2 [Solanum
            tuberosum]
          Length = 767

 Score =  563 bits (1452), Expect = e-158
 Identities = 311/653 (47%), Positives = 421/653 (64%), Gaps = 19/653 (2%)
 Frame = -1

Query: 1990 QCSPDMGSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGG 1811
            Q S D G  SV E G+I+ SKW+++DSR  GI+RSMIP S  ++LK+L   GFEAYLVGG
Sbjct: 60   QVSADSGIESVVEAGQIDFSKWRKLDSRNFGISRSMIPPSPRVVLKILHGEGFEAYLVGG 119

Query: 1810 CVRDLLLNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVA 1631
            CVRDL+LNR+PKDFD+IT A L QIKK+F RCEIVGR FPICRVH+KG++VEVSSF+TVA
Sbjct: 120  CVRDLILNRIPKDFDIITNATLRQIKKQFRRCEIVGRIFPICRVHVKGSIVEVSSFDTVA 179

Query: 1630 ENGKGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRS 1451
            ++ + +E   +P+MPKGC  KDF  W N MHRDFT+NSLFF+PFVN+IYDYANAM D+RS
Sbjct: 180  KHAEKEEAPLVPKMPKGCPEKDFILWNNSMHRDFTINSLFFNPFVNRIYDYANAMQDLRS 239

Query: 1450 GKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQME 1271
             KLRTLVPA +SF EDC            L  SF+KE E+                I ME
Sbjct: 240  LKLRTLVPAHLSFGEDCARILRGLRLAARLNLSFTKEIEDAMHELSSAIMSLSNSRIMME 299

Query: 1270 LDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLC 1091
            L+YM+S+GAAE SL LLQRY++L+I+LPF   +LT+Q   +  ++ +MLMKLFS+LD+  
Sbjct: 300  LNYMMSYGAAEPSLSLLQRYNILEIVLPFHGTYLTQQASKQLGKSSVMLMKLFSSLDQSV 359

Query: 1090 SCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVF 911
            +C +PSH  +WVA+   H+ALI  PQ   V+LTFASVL+H NWK+ V+FA +H+E A V+
Sbjct: 360  TCGQPSHDSVWVALLVFHMALITHPQHVFVILTFASVLYHANWKEAVKFAEKHSEDAAVY 419

Query: 910  IPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVS 731
             PE  D   SI +DEL++++ QLA+ V+ S+++LT+ DSL E M  KFPG+PCSGLVFVS
Sbjct: 420  GPEFSDSQGSISEDELAKKVAQLAVQVQKSINILTDRDSLLEAM-SKFPGAPCSGLVFVS 478

Query: 730  KKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTL----- 566
             KMG  V+ MF++L+ +VT+L +++    I+Y SLG+GN+ E RF+LGK+ILDT+     
Sbjct: 479  NKMGRAVELMFDILVKDVTSLKTRKDVYRIDYVSLGKGNMCETRFLLGKVILDTIIPRVT 538

Query: 565  -GCGVALEEVH-----DKEEKDDLHAPGSHQKREIFVEKF-----PTVLNTRERDPLGKQ 419
             G  V  EE H     D ++K+D       Q  E+   +F       +LN   +      
Sbjct: 539  QGVKVIKEEKHILLGVDGQQKEDASHENFLQNLELMKPEFVSDDDDNLLNENYQLQHDIF 598

Query: 418  EKKRSTLPADSEQVQVVSKKQKLMVNEEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKD 239
            EKKR      S+ V+ V+KKQK++  E  +                  E+ +++Q L  D
Sbjct: 599  EKKRLNHEGCSDLVEAVTKKQKIVAAEHYE------------------ELAVKKQDLIDD 640

Query: 238  YNCSNKGKKRQGMLAEDIGRKFQNYTIKMQEKDRSRQL---EIKSVLETVKYQ 89
             +C          L++++ R     T+K + K    QL   EI+ +LE VKYQ
Sbjct: 641  TDC----------LSQELDRVRDTVTMK-KIKGEDAQLPKDEIRMLLEEVKYQ 682


>ref|XP_006343335.1| PREDICTED: uncharacterized protein LOC102605830 isoform X1 [Solanum
            tuberosum]
          Length = 771

 Score =  563 bits (1452), Expect = e-158
 Identities = 311/653 (47%), Positives = 421/653 (64%), Gaps = 19/653 (2%)
 Frame = -1

Query: 1990 QCSPDMGSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGG 1811
            Q S D G  SV E G+I+ SKW+++DSR  GI+RSMIP S  ++LK+L   GFEAYLVGG
Sbjct: 64   QVSADSGIESVVEAGQIDFSKWRKLDSRNFGISRSMIPPSPRVVLKILHGEGFEAYLVGG 123

Query: 1810 CVRDLLLNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVA 1631
            CVRDL+LNR+PKDFD+IT A L QIKK+F RCEIVGR FPICRVH+KG++VEVSSF+TVA
Sbjct: 124  CVRDLILNRIPKDFDIITNATLRQIKKQFRRCEIVGRIFPICRVHVKGSIVEVSSFDTVA 183

Query: 1630 ENGKGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRS 1451
            ++ + +E   +P+MPKGC  KDF  W N MHRDFT+NSLFF+PFVN+IYDYANAM D+RS
Sbjct: 184  KHAEKEEAPLVPKMPKGCPEKDFILWNNSMHRDFTINSLFFNPFVNRIYDYANAMQDLRS 243

Query: 1450 GKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQME 1271
             KLRTLVPA +SF EDC            L  SF+KE E+                I ME
Sbjct: 244  LKLRTLVPAHLSFGEDCARILRGLRLAARLNLSFTKEIEDAMHELSSAIMSLSNSRIMME 303

Query: 1270 LDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLC 1091
            L+YM+S+GAAE SL LLQRY++L+I+LPF   +LT+Q   +  ++ +MLMKLFS+LD+  
Sbjct: 304  LNYMMSYGAAEPSLSLLQRYNILEIVLPFHGTYLTQQASKQLGKSSVMLMKLFSSLDQSV 363

Query: 1090 SCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVF 911
            +C +PSH  +WVA+   H+ALI  PQ   V+LTFASVL+H NWK+ V+FA +H+E A V+
Sbjct: 364  TCGQPSHDSVWVALLVFHMALITHPQHVFVILTFASVLYHANWKEAVKFAEKHSEDAAVY 423

Query: 910  IPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVS 731
             PE  D   SI +DEL++++ QLA+ V+ S+++LT+ DSL E M  KFPG+PCSGLVFVS
Sbjct: 424  GPEFSDSQGSISEDELAKKVAQLAVQVQKSINILTDRDSLLEAM-SKFPGAPCSGLVFVS 482

Query: 730  KKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTL----- 566
             KMG  V+ MF++L+ +VT+L +++    I+Y SLG+GN+ E RF+LGK+ILDT+     
Sbjct: 483  NKMGRAVELMFDILVKDVTSLKTRKDVYRIDYVSLGKGNMCETRFLLGKVILDTIIPRVT 542

Query: 565  -GCGVALEEVH-----DKEEKDDLHAPGSHQKREIFVEKF-----PTVLNTRERDPLGKQ 419
             G  V  EE H     D ++K+D       Q  E+   +F       +LN   +      
Sbjct: 543  QGVKVIKEEKHILLGVDGQQKEDASHENFLQNLELMKPEFVSDDDDNLLNENYQLQHDIF 602

Query: 418  EKKRSTLPADSEQVQVVSKKQKLMVNEEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKD 239
            EKKR      S+ V+ V+KKQK++  E  +                  E+ +++Q L  D
Sbjct: 603  EKKRLNHEGCSDLVEAVTKKQKIVAAEHYE------------------ELAVKKQDLIDD 644

Query: 238  YNCSNKGKKRQGMLAEDIGRKFQNYTIKMQEKDRSRQL---EIKSVLETVKYQ 89
             +C          L++++ R     T+K + K    QL   EI+ +LE VKYQ
Sbjct: 645  TDC----------LSQELDRVRDTVTMK-KIKGEDAQLPKDEIRMLLEEVKYQ 686


>ref|XP_006340829.1| PREDICTED: uncharacterized protein LOC102592062 [Solanum tuberosum]
          Length = 724

 Score =  561 bits (1445), Expect = e-157
 Identities = 328/701 (46%), Positives = 434/701 (61%), Gaps = 39/701 (5%)
 Frame = -1

Query: 1990 QCSPDMGSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGG 1811
            Q S +MGS SV + G I+ SKW+++ SR+ GI+ S+IP  +W +LK LQ+ GFEAYLVGG
Sbjct: 39   QHSANMGSGSVVDAGEIDFSKWRKLYSRDFGISNSLIPAYAWTVLKSLQSGGFEAYLVGG 98

Query: 1810 CVRDLLLNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVA 1631
            CVRDL+LN++PKDFDVITTA L QIK++F+R  IVGRRFPICRVH+ G++VEVSSF+T A
Sbjct: 99   CVRDLILNKIPKDFDVITTARLPQIKRRFHRAIIVGRRFPICRVHVNGSIVEVSSFDTRA 158

Query: 1630 ENGKGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRS 1451
            +     E+  +P+MPK    KDF  W++ MHRDFTVNSLFFDP VN IYDYA+A++D++S
Sbjct: 159  KPTGASEKLSVPKMPKKFHQKDFILWKDSMHRDFTVNSLFFDPSVNTIYDYADAIVDLKS 218

Query: 1450 GKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQME 1271
             +LRTLVPAQ+SFEEDC            LK SFSKE E                 + ME
Sbjct: 219  LQLRTLVPAQLSFEEDCARILRGMRLAARLKLSFSKEIEIAMHKLSPSIMILNKSRLMME 278

Query: 1270 LDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLC 1091
            ++YMLS+GAAE SL LLQRY +L++LLPF AAHL +Q + +  ++ +MLMKLFS+LD+L 
Sbjct: 279  VNYMLSYGAAEPSLSLLQRYSILEMLLPFHAAHLAQQSNKQLSESSVMLMKLFSSLDQLV 338

Query: 1090 SCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVF 911
            +CDRPSH  LWVA+ A HLALI  PQ A VVLTFASVL+HGNWK+GV FAR+H++AA ++
Sbjct: 339  TCDRPSHDSLWVALLAFHLALITDPQGAFVVLTFASVLYHGNWKEGVNFARRHSDAASIY 398

Query: 910  IPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVS 731
             PEI D   SI DDEL ER+T+LA+LV++S+D+LT+ DSLQE M  KFPGSPCSGLVFVS
Sbjct: 399  APEISDSQGSISDDELVERVTELAVLVQNSLDILTDKDSLQEAM-SKFPGSPCSGLVFVS 457

Query: 730  KKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGV- 554
            K M   V+ +F+VLI +V +L ++R   EI+Y  LG+G  RE RFVLGK+ILDT+  GV 
Sbjct: 458  KNMRKVVEVIFDVLIGDVRSLKTRRNSFEIDYTLLGKGQTRETRFVLGKVILDTIAPGVI 517

Query: 553  ---------------ALEEVHDKEE--------------KDDLHAPGSHQKREIFVEKFP 461
                            L +   KE               +DD     S + +    EK  
Sbjct: 518  PGSEVMKHTLVKVDAQLSKNAPKENMDGNVKLKKRKFACEDDNQPNASSEIKHNRDEKLN 577

Query: 460  TVLNTRERDPLGKQEKKRSTLPADSEQVQ--------VVSKKQKLMVNEEIQSVKGRSKN 305
            TV        + KQ K+++    D+E +         V  KK K    E+ +  +  ++ 
Sbjct: 578  TV--------IAKQSKEKAVEKHDNESLSQKLDNGDAVTKKKSK---GEQSELPQNGTRM 626

Query: 304  QVVASNEWLKEVKMEQQGLSKDYNCSNKGKKRQGMLAEDIGRKFQNYTIKMQEKDRSRQL 125
             + A    +   K EQ+G +K    S    K   +   D G   + + +K +      Q 
Sbjct: 627  VLEAMKHPVPSHKKEQKGGNKYQQTSANHLKL--IHNVDNGATTKQHKLKEKRDSSQEQK 684

Query: 124  EIKS-VLETVKYQLVPEPLAEDQKPVNKKIAKKIRSRLTRR 5
            E KS  L + K+ L    LA D+K    K   +  S L RR
Sbjct: 685  EEKSESLHSRKHDLKGGKLASDEKS-KPKAGNQTLSDLFRR 724


>ref|XP_004234515.1| PREDICTED: uncharacterized protein LOC101265314 [Solanum
            lycopersicum]
          Length = 769

 Score =  559 bits (1440), Expect = e-156
 Identities = 293/590 (49%), Positives = 399/590 (67%), Gaps = 17/590 (2%)
 Frame = -1

Query: 1990 QCSPDMGSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGG 1811
            Q S D G+ SV E G+I+ SKW+++DSR  GI+RSMIP S  ++LK+L   GFEAYLVGG
Sbjct: 59   QVSTDSGNESVVEAGQIDFSKWRKLDSRNFGISRSMIPPSPQVVLKILHGEGFEAYLVGG 118

Query: 1810 CVRDLLLNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVA 1631
            CVRDL+LNR+PKDFD+IT A L QIKK+F RCEIVGR FPICRVH+KG++VEVSSF+TVA
Sbjct: 119  CVRDLILNRIPKDFDIITNARLRQIKKQFRRCEIVGRIFPICRVHVKGSIVEVSSFDTVA 178

Query: 1630 ENGKGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRS 1451
            ++ + +E   +P++PKGC  KDF  W N MHRDFT+NSLFF+PF N+IYDYANAM D+RS
Sbjct: 179  KHAEKEEAPLVPEIPKGCPEKDFILWNNSMHRDFTINSLFFNPFANRIYDYANAMQDLRS 238

Query: 1450 GKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQME 1271
             KLRTLVPA +SF ED             L  SF+KE E+                I ME
Sbjct: 239  LKLRTLVPAHLSFGEDSARILRGLRLAARLNLSFTKEIEDAMHELSSAIMSLSKSRIMME 298

Query: 1270 LDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLC 1091
            L+YM+S+GAAE SL LLQRY++L+I+LPF   +LT+Q   +S ++ +MLMKLFS+LD+L 
Sbjct: 299  LNYMMSYGAAEPSLSLLQRYNILEIVLPFHGTYLTQQASKQSGKSSVMLMKLFSSLDQLV 358

Query: 1090 SCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVF 911
            SC +PSH  +WVA+ A H+ALI  PQ   V+LTFASVL+H NWK+ V+FA +H+E A V+
Sbjct: 359  SCGQPSHDSVWVALLAFHMALITHPQHVFVILTFASVLYHANWKEAVKFAEKHSEDAAVY 418

Query: 910  IPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVS 731
             PE+ D + SI +DEL++++ QLA+ V+ S+++L + DSL E M  KFPG+PCSGLVFVS
Sbjct: 419  GPELSDSHGSISEDELAKKVAQLAVQVQKSINILADRDSLLEAM-SKFPGAPCSGLVFVS 477

Query: 730  KKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVA 551
             KMG TV+ MF++L+ +VT+L +++    I+Y SLG+GN+ E RF+LGK+ILDT+   V 
Sbjct: 478  NKMGRTVELMFDILVKDVTSLKTRKDVYRIDYISLGKGNMCETRFLLGKVILDTIVPRVT 537

Query: 550  LEEVHDKEEKDDLHAPGSHQKREIFVEKF--------PTVLNTRERDPLGKQ-------- 419
             +    KE K  L      QK +     F        P  ++  + D L  +        
Sbjct: 538  QDVKMIKEGKHILLGVDGQQKEDASHRNFLENLELMKPEFVSDDDDDNLLNENYQLQHNI 597

Query: 418  -EKKRSTLPADSEQVQVVSKKQKLMVNEEIQSVKGRSKNQVVASNEWLKE 272
             EKKR      S+ V+ V+KKQK++  E  + + G+ K  ++   ++L +
Sbjct: 598  FEKKRLNHEGCSDLVEAVTKKQKIVAAEHYEEMAGK-KQDLIDDTDFLSQ 646


>ref|XP_004232557.1| PREDICTED: uncharacterized protein LOC101254858 [Solanum
            lycopersicum]
          Length = 810

 Score =  556 bits (1432), Expect = e-155
 Identities = 281/479 (58%), Positives = 360/479 (75%)
 Frame = -1

Query: 1990 QCSPDMGSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGG 1811
            Q S +MGS S+ + G I+ SKW+++ SRE GI+ SMIP  +W +LK LQ+ GFEAYLVGG
Sbjct: 36   QHSANMGSGSLVDTGEIDFSKWRKLYSREFGISNSMIPAYAWTVLKGLQSGGFEAYLVGG 95

Query: 1810 CVRDLLLNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVA 1631
            CVRDL+LN++PKDFDVITTA L QIK+ F+R  I+GRRFPICRVHIKG++VEVSSF+T A
Sbjct: 96   CVRDLILNKIPKDFDVITTARLPQIKRCFHRAIIIGRRFPICRVHIKGSIVEVSSFDTKA 155

Query: 1630 ENGKGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRS 1451
            +     ++  +P+MPK    KDF  W++ MHRDFTVNSLFFDP VN IYDYANA++D++S
Sbjct: 156  KPIGESKKLPIPKMPKKFHQKDFILWKDSMHRDFTVNSLFFDPSVNTIYDYANAIVDLKS 215

Query: 1450 GKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQME 1271
             +LRTLVPA +SFEEDC            LK SFSKE E                 + ME
Sbjct: 216  LQLRTLVPAHLSFEEDCARILRGLRLAARLKLSFSKEIEIAMHKLSSSIMILNKSRLMME 275

Query: 1270 LDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLC 1091
            ++YMLS+GAAE SL LLQRY++L +LLPF AAHL +Q + +  ++ +MLMKLFS+LD+L 
Sbjct: 276  VNYMLSYGAAEPSLSLLQRYNILGMLLPFHAAHLAQQSNKQLSESSVMLMKLFSSLDQLV 335

Query: 1090 SCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVF 911
            +CDRPSH  LWVA+ A HLALIN PQ A VVLTFASVL+HG+WK+GV+FAR+H++AA ++
Sbjct: 336  TCDRPSHDSLWVALLAFHLALINDPQGAFVVLTFASVLYHGDWKEGVKFARRHSDAASIY 395

Query: 910  IPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVS 731
            +PEI D   SI DDEL ER+T+LA+LV++S+D+LT+ DSLQE M  KFPGSPCSGLVFVS
Sbjct: 396  VPEISDSQGSISDDELVERVTELAVLVQNSLDILTDKDSLQEAM-SKFPGSPCSGLVFVS 454

Query: 730  KKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGV 554
            K M   V+ +F+VL  +V +L ++R   EI+Y  LG+G  RE RFVLGK+ILDT+  GV
Sbjct: 455  KNMRKVVEVIFDVLTEDVRSLKTRRNSFEIDYTLLGKGQTRETRFVLGKVILDTIAPGV 513


>ref|XP_002521705.1| poly(A) polymerase, putative [Ricinus communis]
            gi|223539096|gb|EEF40692.1| poly(A) polymerase, putative
            [Ricinus communis]
          Length = 675

 Score =  517 bits (1332), Expect = e-144
 Identities = 282/585 (48%), Positives = 379/585 (64%), Gaps = 12/585 (2%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            I+ISKWK++++  +GI RSMIP S W++LK+L N GFEAYLVGGCVRDLLLNR+PKDFDV
Sbjct: 51   IDISKWKKINASAVGIKRSMIPPSPWLVLKILHNKGFEAYLVGGCVRDLLLNRIPKDFDV 110

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPK 1583
            ITTA L Q+KK+F+RCEIVGRRFPICRVH+KG+VVEVSSFETVA++ +GKEE  + Q P 
Sbjct: 111  ITTAKLKQVKKQFHRCEIVGRRFPICRVHVKGSVVEVSSFETVAQHNEGKEEVLISQKPS 170

Query: 1582 GCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEED 1403
            GC+ +DF RWRN MHRDFT+NSLFFDPF+N+I+DYAN M D+   KLRT++PA++SF+ED
Sbjct: 171  GCNGRDFIRWRNSMHRDFTINSLFFDPFMNQIFDYANGMADLSFLKLRTVIPARLSFQED 230

Query: 1402 CXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDL 1223
            C            L  S SK+TE+                I MEL+YMLS+GAAE+++ L
Sbjct: 231  CARILRGLRIAGRLGLSISKDTESAIRKLSSSVKSLDKARIMMELNYMLSYGAAESTIYL 290

Query: 1222 LQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIFA 1043
            LQR++LL++ LPF AA+L++Q         +MLMKLF NLD L SCDRP    LWV + A
Sbjct: 291  LQRFNLLELFLPFHAAYLSQQAGETFSLGSVMLMKLFFNLDTLVSCDRPCTSSLWVGLLA 350

Query: 1042 VHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDEL 863
             H AL+ +PQ ALV   FASVL+HG WK GV+FAR++A+    F PEI    +   D+EL
Sbjct: 351  FHQALVTNPQDALVSWVFASVLYHGKWKDGVEFARENAKMQVKFAPEISGFSEFKSDEEL 410

Query: 862  SERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLIC 683
            +E ++ LA LV+DSVD L +TD+L ++M R F  +  SGLVFVSKK+ + V  +FNVL+ 
Sbjct: 411  AEEVSHLASLVQDSVDALMDTDTLAQSMSR-FGVTSSSGLVFVSKKIANDVAQLFNVLVD 469

Query: 682  EVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGV------------ALEEV 539
            +V +  ++R+   I+Y  LG+GN  E RFVLGK+IL+TL  G+             +EE 
Sbjct: 470  DVESYKTERESFMIDYYLLGKGNQHETRFVLGKVILETLSGGLTKGVEVAEDGPKVIEEK 529

Query: 538  HDKEEKDDLHAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKK 359
            HD +  D +       K EI     P +          K   KR  +   S   + V+ K
Sbjct: 530  HDSKLSDLVKDYMVEWKEEI-----PVLSPLDHEHSQKKTGNKRKLVMTKSFYEEKVATK 584

Query: 358  QKLMVNEEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSN 224
            + ++ N+     K   K Q +     L E++ ++  LS++   SN
Sbjct: 585  EDVLKNKSEAVAK---KPQKILKITQLPELEKKKHHLSENSGTSN 626


>gb|EMJ04079.1| hypothetical protein PRUPE_ppa015787mg, partial [Prunus persica]
          Length = 669

 Score =  514 bits (1324), Expect = e-143
 Identities = 296/642 (46%), Positives = 404/642 (62%), Gaps = 9/642 (1%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            I+ SKWK++DSR LGI  SM+   SWI+LK+LQ+ GFEAYLVGGCVRDL+L R+PKDFDV
Sbjct: 2    IDTSKWKKLDSRNLGIKPSMVSQPSWIVLKILQSEGFEAYLVGGCVRDLILKRIPKDFDV 61

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPK 1583
            ITTA L QIK++F R EIVGRRFPICRVH+KG+V+EVSSFETVA++  GK+E   P  P 
Sbjct: 62   ITTANLKQIKRQFYRAEIVGRRFPICRVHVKGSVIEVSSFETVAKHA-GKKEADSPCRPP 120

Query: 1582 GCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEED 1403
            GCD KDF RWRN MHRDFT+NSLFFDPF NKIYDYAN M+D+RS KLRTL  A++SFEED
Sbjct: 121  GCDKKDFIRWRNSMHRDFTINSLFFDPFANKIYDYANGMVDLRSLKLRTLGSAKLSFEED 180

Query: 1402 CXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDL 1223
            C            L  S SKETE                 I MEL+YMLS+GAAE S  L
Sbjct: 181  CARILRGLRIAARLSLSISKETETAMHRLSSSILKLDKSRIMMELNYMLSYGAAEPSFCL 240

Query: 1222 LQRYHLLDILLPFQAAHLTEQDHN-RSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIF 1046
            L R+ LL ILLP  AA+   Q  N ++ Q+  MLMKLFS+LD++ SCDRPS   LWV + 
Sbjct: 241  LWRFDLLKILLPLHAAYFDRQSKNMKTAQSSTMLMKLFSSLDKVVSCDRPSESTLWVGLL 300

Query: 1045 AVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDE 866
            A HLAL+N+PQ ALVVLTFASVL+H  W++GV+F+R +AE    ++PEI+   +   D  
Sbjct: 301  AFHLALVNNPQDALVVLTFASVLYHEEWEEGVKFSRDNAEGIVNYVPEILCSSEFKSDKV 360

Query: 865  LSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLI 686
            L++ ++QLA  V DS+  LT T+SL E+M R +P  PCSGLVFV KKM   V ++F  L+
Sbjct: 361  LAKEVSQLASFVLDSISALTATESLIESMSR-YPVFPCSGLVFVPKKMAEEVAEIFKGLV 419

Query: 685  CEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVAL-EEVHDKEEKDDLH 509
              + + N  R+  +I+Y SLG+G + E+RFV+GK+IL+T+  G+   +EV  + +   L 
Sbjct: 420  -NIKSYNKGRKSFQIDYHSLGKGYLSEVRFVIGKVILETMNSGILQGKEVVQELDYHLLL 478

Query: 508  APGSHQKREIFVEKF---PTVLNTRERD-PLGKQEKKRSTLPADSEQVQVVSK---KQKL 350
             P + + ++   +K        N  E+D  + KQ           E + V+     K+K 
Sbjct: 479  IPDTPEVKQEMSKKLKLKEQKCNLFEQDTAIDKQGVVELCQAPQRELIAVLGNMLAKRKF 538

Query: 349  MVNEEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSNKGKKRQGMLAEDIGRKFQ 170
             + EE    +G  KNQ ++ +E     K + +G  K +      +K Q  +  D+  K Q
Sbjct: 539  QLPEE----EGIKKNQELSEDE-----KFQNKGEKKMH--LKTIEKLQECMPPDMATKQQ 587

Query: 169  NYTIKMQEKDRSRQLEIKSVLETVKYQLVPEPLAEDQKPVNK 44
               +K     +   +++K++ +    +++ E + ED++  +K
Sbjct: 588  LNKVKKHNSSQEDTIKLKNMFQKKTQKVISEEIVEDRQVTDK 629


>gb|EOX90681.1| Polynucleotide adenylyltransferase family protein, putative
            [Theobroma cacao]
          Length = 714

 Score =  497 bits (1280), Expect = e-138
 Identities = 281/590 (47%), Positives = 380/590 (64%), Gaps = 5/590 (0%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            I+ SKWK++ + ++GIT SMI + SWI+L  L+  GFEAYLVGGCVRDLLL R+PKDFDV
Sbjct: 52   IDTSKWKKIQASKVGITGSMISLPSWIVLNTLRKEGFEAYLVGGCVRDLLLKRIPKDFDV 111

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPK 1583
            ITTA L QIKKKF+R EIVGRRFPICRVHIKG V+EVSSFETVA++ + K +     +P 
Sbjct: 112  ITTANLKQIKKKFHRAEIVGRRFPICRVHIKGFVIEVSSFETVAKHDEDKAKALSSLIPN 171

Query: 1582 GCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEED 1403
            GCD KD  RWRN M+RDFT+NSLFFDPF  KIYDY + M D++S KL+T++PA +SF+ED
Sbjct: 172  GCDEKDLIRWRNSMNRDFTINSLFFDPFTFKIYDYNSGMSDLKSLKLQTIIPAHLSFQED 231

Query: 1402 CXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDL 1223
            C            L  SFSK+TE+                + +EL+YMLS+GAAE+S+ L
Sbjct: 232  CARILRGLRIAARLGLSFSKDTESAMHNLSSSIEGLDKFRLMLELNYMLSYGAAESSIYL 291

Query: 1222 LQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIFA 1043
            LQR++LL+ILLPFQAA++   +H +S QN +MLMKLF NLD+L SCD P+   LW+ +  
Sbjct: 292  LQRFNLLNILLPFQAAYI---NHQKSTQNSMMLMKLFFNLDKLVSCDHPADSSLWIGLLI 348

Query: 1042 VHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDEL 863
             HLAL+N+PQ ALVV TFASVL+HG WK+GV+F+R+H +    F+PEI    ++  D++L
Sbjct: 349  FHLALLNNPQDALVVWTFASVLYHGKWKEGVEFSREHTKVGVKFVPEISGFSETKSDEDL 408

Query: 862  SERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLIC 683
            ++ ++Q A LV+DSV  LTET SL E+M R +  SPCSGLVFV KK       +F++++ 
Sbjct: 409  AKEVSQFASLVQDSVCALTETSSLFESMSR-YSFSPCSGLVFVPKKTARDAAKIFDLMVD 467

Query: 682  EVTTL--NSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVALEEVH-DKEEKDDL 512
            ++ +     +R+   INY  LG+G+ RE R+VLGKIIL+T+  G   E       EKD L
Sbjct: 468  DIESFVNGRQRESPGINYHLLGKGDPRETRYVLGKIILETMKDGRLGEGTRIANGEKDHL 527

Query: 511  HAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLM-VNEE 335
                        +EK     N        K++KKR     + E  Q + KKQK +  N  
Sbjct: 528  QPK--------VIEK-----NLANNQLPLKKDKKRVPSLLNPEAKQGLPKKQKSVDSNHN 574

Query: 334  IQSVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSNKGKK-RQGMLAED 188
            I  +    KNQ+  + E  +++  + Q L + Y  S +     QG + E+
Sbjct: 575  ISELYAAIKNQL--AKEEFQDLAKKHQKLVEAYKFSEQETSLMQGNILEE 622


>ref|XP_006467069.1| PREDICTED: uncharacterized protein LOC102627987 [Citrus sinensis]
          Length = 689

 Score =  495 bits (1275), Expect = e-137
 Identities = 275/558 (49%), Positives = 364/558 (65%), Gaps = 5/558 (0%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            +++S WK +DSR LGITR+MIP  SW++LK+L++ GF+AYLVGGCVRDLLL RVPKDFDV
Sbjct: 48   VDVSNWKTVDSRNLGITRAMIPQPSWVVLKILKSQGFQAYLVGGCVRDLLLRRVPKDFDV 107

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPK 1583
            ITTA L QI+++F+R EI+GRRFPICRVHIKG+V+EVSSFETVA++G+GKE   L Q+P 
Sbjct: 108  ITTANLKQIRRQFHRSEIIGRRFPICRVHIKGSVIEVSSFETVAKHGEGKETVLLSQIPY 167

Query: 1582 GCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEED 1403
            GCD  D  RWRN +HRDFT+NSLFFDPF+NKIYDYAN + D+R  KLRTL+PA +SF ED
Sbjct: 168  GCDEIDLVRWRNSIHRDFTINSLFFDPFLNKIYDYANGISDLRCLKLRTLIPAYLSFTED 227

Query: 1402 CXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDL 1223
            C            L  SF K+ +                 I MEL+YMLS+GAAE+S+ L
Sbjct: 228  CARILRGLRIAARLGLSFCKDIDTAMHSLSSSIERLDKSRIMMELNYMLSYGAAESSICL 287

Query: 1222 LQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIFA 1043
            L+RY+LL ILLPF AA+L +Q    + +NP+MLM+LF NLD+L SCDRP+   LWV + +
Sbjct: 288  LRRYNLLKILLPFHAAYLDQQAGKITAENPMMLMRLFFNLDKLVSCDRPADYTLWVGLLS 347

Query: 1042 VHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDE- 866
             H AL++ PQ A VV  FASVL+HG WK+GV+FAR HA+    F+PE I G+  I  DE 
Sbjct: 348  FHQALVSDPQDAFVVWVFASVLYHGKWKEGVKFARDHAKEPVKFVPE-ISGFSEIESDEQ 406

Query: 865  LSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLI 686
            L+ ++T+LA+ V+D V+ LT+                 SG VFVSKK+   V+ +F+VL+
Sbjct: 407  LAVKVTELALSVQDCVNDLTKAS---------------SGYVFVSKKIERNVQQIFDVLV 451

Query: 685  CEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCG-VALEEVHDKEEKDDLH 509
              + + NS ++   I+Y+ LG+GN+ E RFVLGKIIL T+  G VA EE  D+EE  ++ 
Sbjct: 452  NSIESYNSGKRSHIIDYDMLGKGNLVETRFVLGKIILKTISGGLVAGEEEIDEEEMPEVL 511

Query: 508  APGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLMVNEEIQ 329
               S       VE +           L K+ +KR   P+ +E     +KK K    E+  
Sbjct: 512  DKDS-------VENY-----------LAKKNRKRGLQPSSAELKLKTAKKCKW--TEKFS 551

Query: 328  SVK---GRSKNQVVASNE 284
            S+      +K  VV   E
Sbjct: 552  SINHELSMNKEDVVPKEE 569


>ref|XP_006425301.1| hypothetical protein CICLE_v10025049mg [Citrus clementina]
            gi|557527291|gb|ESR38541.1| hypothetical protein
            CICLE_v10025049mg [Citrus clementina]
          Length = 689

 Score =  491 bits (1264), Expect = e-136
 Identities = 273/558 (48%), Positives = 363/558 (65%), Gaps = 5/558 (0%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            +++S WK +DSR LGITR+MIP  SW++LK+L++ GF+AYLVGGCVRDLLL RVPKDFDV
Sbjct: 48   VDVSNWKTVDSRNLGITRAMIPQPSWVVLKILKSQGFQAYLVGGCVRDLLLRRVPKDFDV 107

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPK 1583
            ITTA L QI+++F+R EI+GRRFPICRVHIKG+V+EVSSFETVA++G+GKE   L Q+P 
Sbjct: 108  ITTANLKQIRRQFHRSEIIGRRFPICRVHIKGSVIEVSSFETVAKHGEGKETVLLSQIPY 167

Query: 1582 GCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEED 1403
            GCD  D  RWRN +HRDFT+NSLFFDPF+NKIYDYAN + D+R  KLRTL+PA +SF ED
Sbjct: 168  GCDEIDLVRWRNSIHRDFTINSLFFDPFLNKIYDYANGISDLRCLKLRTLIPAYLSFTED 227

Query: 1402 CXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDL 1223
            C            L  SF K+ +                 I MEL+YMLS+GAAE+S+ L
Sbjct: 228  CARILRGLRIAARLGLSFCKDIDTAMHSLSSSIERLDKSRIMMELNYMLSYGAAESSICL 287

Query: 1222 LQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIFA 1043
            L+RY+LL ILLPF AA+L +Q    + +NP+MLM+LF NLD+L SCDRP+   LWV + +
Sbjct: 288  LRRYNLLKILLPFHAAYLDQQAGKITAENPMMLMRLFFNLDKLVSCDRPADYTLWVGLLS 347

Query: 1042 VHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDE- 866
             H AL++ PQ A +V  FASVL+HG WK+GV+FAR  A+    F+PE I G+  I  DE 
Sbjct: 348  FHQALVSDPQDAFLVWVFASVLYHGKWKEGVKFARDRAKEPVKFVPE-ISGFSEIESDEQ 406

Query: 865  LSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLI 686
            L+ ++T+LA+ V+D V+ LT+                 SG VFVSKK+   V+ +F+VL+
Sbjct: 407  LAVKVTELALSVQDCVNDLTKAS---------------SGYVFVSKKIERNVQQIFDVLV 451

Query: 685  CEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCG-VALEEVHDKEEKDDLH 509
              + + NS ++   I+Y+ LG+GN+ E RFVLGKIIL T+  G VA EE  D+EE  ++ 
Sbjct: 452  NSIESYNSGKRSHIIDYDMLGKGNLVETRFVLGKIILKTISGGLVAGEEEIDEEEMPEVL 511

Query: 508  APGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLMVNEEIQ 329
               S       VE +           L K+ +KR   P+ +E     +KK K    E+  
Sbjct: 512  DKDS-------VENY-----------LAKKNRKRGLQPSSAELKLKTAKKCKW--TEKFS 551

Query: 328  SVK---GRSKNQVVASNE 284
            S+      +K  VV   E
Sbjct: 552  SINHELSMNKEDVVPKEE 569


>ref|XP_004287737.1| PREDICTED: uncharacterized protein LOC101302293 [Fragaria vesca
            subsp. vesca]
          Length = 699

 Score =  477 bits (1228), Expect = e-132
 Identities = 276/641 (43%), Positives = 394/641 (61%), Gaps = 5/641 (0%)
 Frame = -1

Query: 1984 SPDMGSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCV 1805
            +P    + V E G I++SKWK++DSR  GITR+MIP SS+ +L++L+  GFEAYLVGGCV
Sbjct: 35   APQPRPKMVPEEGLIDMSKWKKIDSRVFGITRAMIPDSSYHVLRILRGRGFEAYLVGGCV 94

Query: 1804 RDLLLNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAEN 1625
            RDL+L RVPKDFDVITTA L +IK++F+R  IVG RFPIC V+IKG+ +EVSSFETVA N
Sbjct: 95   RDLILKRVPKDFDVITTANLKEIKRQFHRSRIVGHRFPICMVNIKGSWIEVSSFETVANN 154

Query: 1624 GKGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGK 1445
               K E  + ++PKGC  KDF RWRN MHRDFT+NSLFFDP  NKIYDYAN M D+ S K
Sbjct: 155  HSDK-EVTISEIPKGCGKKDFIRWRNSMHRDFTINSLFFDPISNKIYDYANGMADLNSLK 213

Query: 1444 LRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELD 1265
            LR+LVPA++SF+EDC            L+ S SKETE                 I ME++
Sbjct: 214  LRSLVPAKLSFKEDCARILRGLRIAARLRLSISKETETAIHKCSSSILTLTTSRIMMEMN 273

Query: 1264 YMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSC 1085
            YMLS+GAAE SL LL R++LL +LLP  AA+L +Q   +  QN  MLMKLFSNLD++ + 
Sbjct: 274  YMLSYGAAEPSLCLLWRFNLLKLLLPIHAAYLDQQSIRKFPQNSTMLMKLFSNLDKVVTV 333

Query: 1084 DRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIP 905
            DRPS   LWVA+  VH+AL++ PQ ALVV +FAS+L+HG  ++G++ AR  A+    ++P
Sbjct: 334  DRPSDCSLWVALLVVHMALVSHPQDALVVFSFASILYHGGCEKGLKSARDDAQVTVDYLP 393

Query: 904  EIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKK 725
            EI        +++L + +++ A LV DS+  LT T+SL E M  K+P +PCSG+VFV K+
Sbjct: 394  EISIPSACKSEEQLEKEVSRFASLVLDSIAALTATESLVEGM-SKYPETPCSGVVFVPKR 452

Query: 724  MGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGV--- 554
                V ++F VL  ++ + N+KR+   I+Y  L +G + E  FVLGKIIL+T+G G+   
Sbjct: 453  TAQGVAEIFRVLADDIKSYNTKRKSYVIDYPLLQKGFLCETSFVLGKIILETMGSGILKG 512

Query: 553  --ALEEVHDKEEKDDLHAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQ 380
               ++E +   E++ +    +  K+ +     P +     ++   K  +K+  +  +   
Sbjct: 513  EEVVQEDYGHLEQESIKEKCNKGKKRVHKSDTPELKQESAKE--RKLIEKKCRVLKEEMD 570

Query: 379  VQVVSKKQKLMVNEEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSNKGKKRQGM 200
            ++     + L   E+I SV G  +          K+ K    G SK      K +K +  
Sbjct: 571  IEKPEVLETLSPEEDIFSVLGPVE----------KKHKSGINGQSK-----RKMEKERQC 615

Query: 199  LAEDIGRKFQNYTIKMQEKDRSRQLEIKSVLETVKYQLVPE 77
            ++ D   K Q  T+K    ++   L ++++LE +  + +PE
Sbjct: 616  ISPDKTMKQQIKTVKKNNLNQKETLNLETILEDIS-ENIPE 655


>ref|XP_002307206.2| hypothetical protein POPTR_0005s10310g [Populus trichocarpa]
            gi|550338545|gb|EEE94202.2| hypothetical protein
            POPTR_0005s10310g [Populus trichocarpa]
          Length = 848

 Score =  471 bits (1212), Expect = e-130
 Identities = 273/643 (42%), Positives = 400/643 (62%), Gaps = 1/643 (0%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            ++ SKW+++++R  GITRSMIP + W +LK+L+  GFEAYLVGGCVRDLLLNRVPKDFDV
Sbjct: 53   VDRSKWRKVNARYHGITRSMIPDAPWTVLKLLRVGGFEAYLVGGCVRDLLLNRVPKDFDV 112

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPK 1583
            ITTA L QIKKKF+R  IVGRRFPIC VH+KG+V+EVSSFET A+  + KE+  L QM +
Sbjct: 113  ITTANLQQIKKKFHRAHIVGRRFPICIVHVKGSVIEVSSFETSAQQCQEKEKVLLSQMRR 172

Query: 1582 GCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEED 1403
             CD KDF  W+N M RDFT+NSLFFDPF+N+IYDYAN M D+RS KL+TL+PA++SF+ED
Sbjct: 173  SCDEKDFLLWKNSMQRDFTINSLFFDPFMNRIYDYANGMEDVRSLKLQTLIPARLSFQED 232

Query: 1402 CXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDL 1223
            C            L  S SK+TE                 I+MEL+YMLS+GAAE+++ L
Sbjct: 233  CARILRGIRIAGRLGLSISKDTETAICKLQSSVKSLNKDRIKMELNYMLSYGAAESTILL 292

Query: 1222 LQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIFA 1043
            LQR+HLL I LPF AA+L EQ    S Q   MLMKL  +LD++ S DRP    LWV + A
Sbjct: 293  LQRFHLLKIFLPFHAAYLHEQADEVSAQGSTMLMKLLYSLDKIVSSDRPCDCSLWVGLLA 352

Query: 1042 VHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDE- 866
             H AL+ +PQ A V+  FAS+L+ G W++GV+FAR++A+    F+PE I G+  I  DE 
Sbjct: 353  FHQALVLNPQDAFVIWAFASILYCGTWQEGVKFARENAKVEGRFVPE-ISGFSEIKSDEK 411

Query: 865  LSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLI 686
            L+E ++QLA LV+D+V+  T+  SL E++ R +   P    VFVSKK+G     +F++  
Sbjct: 412  LAEEVSQLASLVQDAVNAFTDEISLSESLSR-YLDPPLDVFVFVSKKIGEHAGLLFHMQS 470

Query: 685  CEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVALEEVHDKEEKDDLHA 506
            CE      +R+  +I+Y+ L +G++ E RFVLGK+IL TL  G  L +   +  K++L  
Sbjct: 471  CEY-----RRESFKIDYDLLVKGDLYETRFVLGKVILKTLSGG--LVQGGKEIIKEEL-- 521

Query: 505  PGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLMVNEEIQS 326
                  + + V   PT+ +  +   + K+ K+   L  D ++++ V +K+K+ +    + 
Sbjct: 522  ------KVVKVNHEPTLSDLAKDGRVVKKVKEHVLLSFDEQKIEKVKQKEKVKMKCSSEQ 575

Query: 325  VKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSNKGKKRQGMLAEDIGRKFQNYTIKMQE 146
                +K +V   +  ++  K +++   K  +   +  K+Q  + ++  +       KM E
Sbjct: 576  NINSTKEEVDLKDASMEIAKKQRKVEEKLCSPLQESDKKQVAVGDEEHQHRAKKHRKMVE 635

Query: 145  KDRSRQLEIKSVLETVKYQLVPEPLAEDQKPVNKKIAKKIRSR 17
            K +  +L  K  + ++K ++V   L +    + K++ K + +R
Sbjct: 636  KVKRHELCYKETINSMKEEVV---LQDTPMEIAKELRKVVDTR 675


>ref|XP_003516528.1| PREDICTED: uncharacterized protein LOC100794882 [Glycine max]
          Length = 714

 Score =  451 bits (1160), Expect = e-124
 Identities = 278/631 (44%), Positives = 374/631 (59%), Gaps = 7/631 (1%)
 Frame = -1

Query: 1948 GRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDF 1769
            GRI++SKWK +D+ ELGIT SMI   S  +LK+L+  GFE+YLVGGCVRDLLLNR PKDF
Sbjct: 49   GRIDVSKWKTLDAEELGITSSMISYPSQFVLKLLRRKGFESYLVGGCVRDLLLNRTPKDF 108

Query: 1768 DVITTAALTQIKKKFN---RCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFL 1598
            DVITTA L +++ +F    R E+VGRRFPIC VHIKG+VVEV+SFETVA     KE+F  
Sbjct: 109  DVITTAKLMEVRAQFRGLARAEVVGRRFPICLVHIKGSVVEVTSFETVARTSNRKEQFLY 168

Query: 1597 PQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQV 1418
              +PK  + KD  R +N + RDFT+NSLF+DPF NKIYDY + M D+RS KL T++PAQ+
Sbjct: 169  SLLPKCSNKKDLFRCKNSLRRDFTINSLFYDPFANKIYDYTDGMADLRSLKLETVIPAQM 228

Query: 1417 SFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAE 1238
            SF+ED             L  S S+ETE                 I +EL+YMLS+GAAE
Sbjct: 229  SFKEDPGRILRGFRIAARLGLSLSRETEAAMWKYSSLVKSLDKNKIMIELNYMLSYGAAE 288

Query: 1237 ASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLW 1058
             SL LL ++ LL+ LLP  AA+L EQ          MLMKLF  LD L +CDRP    LW
Sbjct: 289  PSLHLLWKFKLLEFLLPVHAAYLDEQAIKEDAPASNMLMKLFFYLDNLVACDRPCDCTLW 348

Query: 1057 VAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSI 878
            V + A HL L+N+PQ ALVV  FASVL+HG W++G++FA++HA+    F PEI     SI
Sbjct: 349  VGLLAFHLTLVNNPQDALVVWAFASVLYHGEWEKGIKFAKEHAKMYVNFAPEI--RTSSI 406

Query: 877  Y--DDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKD 704
            Y  D+E+++ +T+LA LV  S+  L E++SL ++M R +P  P S ++FV KK G     
Sbjct: 407  YKSDEEIAKAVTKLASLVMHSIPALVESNSLLQSMSR-YPSFPQSDMIFVPKKAGKLASA 465

Query: 703  MFNVLICEVTTLNS-KRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVALEEVHDKE 527
            +F +L  +V    + +R+  +INY  LG+G++ E+ FVLGKI+L+T+  G   +    + 
Sbjct: 466  IFKMLASDVEFYKTERRKNSKINYGMLGKGHLSEIAFVLGKIVLETMSSGTVGDGEDSEA 525

Query: 526  EKDDLHAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLM 347
             +  L   G+   +EI   + P ++N       G  E    ++P +SE  Q  SKK+KL+
Sbjct: 526  GQCHLKTEGT---KEIAQSQLPDLVNHEVAAMNG--EGHLLSIP-NSECRQGKSKKRKLV 579

Query: 346  VNEEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSNKGKKRQGMLAEDIGRKFQN 167
             N  I   K  S NQ ++     KE K EQQ L        K  ++  M  ED   K +N
Sbjct: 580  KNRCIAKKKMSSGNQELSEKFEYKENKEEQQKLV-------KLSQKVDMSTEDSLPKKKN 632

Query: 166  YTIKMQEKDRSRQLEI-KSVLETVKYQLVPE 77
               K    DR +     KS L   K+    E
Sbjct: 633  DHRKQLISDRKKITSANKSFLHQAKHMKTDE 663


>ref|XP_004146209.1| PREDICTED: uncharacterized protein LOC101212579 [Cucumis sativus]
          Length = 810

 Score =  443 bits (1139), Expect = e-121
 Identities = 270/651 (41%), Positives = 376/651 (57%), Gaps = 11/651 (1%)
 Frame = -1

Query: 1939 EISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDVI 1760
            ++ KW +++ R  G+TRSMIP SSW +L+VL   GFEAYLVGGCVRDLLL RVPKDFDVI
Sbjct: 76   DMPKWNKINGRAFGLTRSMIPSSSWKVLEVLHREGFEAYLVGGCVRDLLLRRVPKDFDVI 135

Query: 1759 TTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLPQMPKG 1580
            TTA LTQI   F R  IVGRRFPIC VHI+G++ EVSSF+T A++ +  +     Q+PK 
Sbjct: 136  TTAGLTQIHNLFCRSRIVGRRFPICMVHIRGSITEVSSFDTAAKHSEENKITAHSQIPKK 195

Query: 1579 CDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVSFEEDC 1400
            CD KD  RWRN M RDFT+NSLFFDPF N IYDYA  M D+RS KLRTL+PA +SF+ DC
Sbjct: 196  CDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKLDC 255

Query: 1399 XXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEASLDLL 1220
                        L  S SKETE                 + MEL+YMLS+GAA  SL LL
Sbjct: 256  ARILRGLRIAARLGLSISKETETAIHKFSPSITSLDKSRLMMELNYMLSYGAAVPSLYLL 315

Query: 1219 QRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWVAIFAV 1040
            QR+ LL  LLPF AA+L +Q   +S  + +MLMKLF NLD+L SC  PS+  +WVA+ A 
Sbjct: 316  QRFKLLGSLLPFHAAYLDKQGIEKSSLSSVMLMKLFFNLDKLVSCAHPSNCNIWVALLAF 375

Query: 1039 HLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIYDDELS 860
            HLAL+N+PQ +LVVL FA+ L+HG W +GV +AR+ +       PEI        +++L+
Sbjct: 376  HLALVNNPQNSLVVLAFAATLYHGEWNEGVNYAREKSLVEINLRPEITRSAKFKSEEKLA 435

Query: 859  ERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGSTVKDMFNVLICE 680
            E +T+ A+ V+  +  LT  D L E M   FP S  SGLVFVS K    V  +F VL   
Sbjct: 436  EGVTRFALKVQGCIAALTSKDCLLEAM-STFPASSNSGLVFVSNKTARDVAIIFEVLAKH 494

Query: 679  VTTLNSKRQGLEINYESLGRG-NVRELRFVLGKIILDTLGCGV--ALEEVHDKEEKDDLH 509
            V +   +++  +I+Y+ LG+G  +RE R+VLGKIIL+TL   +    E + D+ +   + 
Sbjct: 495  VKSYKDEKKDFKIDYKRLGKGLFLRENRYVLGKIILETLEDAILQGNENIPDRNQNLRID 554

Query: 508  APGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLMVNEEIQ 329
            AP          E   + +    ++ L K  KK    P+ SE     +KK KL+  E   
Sbjct: 555  APTK--------ETSDSPVADLVQEQLVKGNKKVRKRPSVSEVELKANKKYKLVRKE--- 603

Query: 328  SVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNCSNKGKKRQGMLAEDIGRKFQNYTIK-- 155
               G   ++VV +   +   +M ++G+          ++    + E   RK  +  ++  
Sbjct: 604  ---GSISDKVVENGRCINMTEMYKKGVEGSQLPLAPMEESMEPILE--SRKCHHLEVRAT 658

Query: 154  --MQEKDRSRQLEIKSVLETVKYQLVPE----PLAEDQKPVNKKIAKKIRS 20
              M+E   S   E+K ++    +Q V +    P+  + + ++K   ++++S
Sbjct: 659  ENMRENPESMGNEVKKIIPKKAFQKVTKELLHPVEINPRKMDKVAGQEVKS 709


>gb|ESW29983.1| hypothetical protein PHAVU_002G115200g [Phaseolus vulgaris]
          Length = 730

 Score =  437 bits (1125), Expect = e-120
 Identities = 254/573 (44%), Positives = 355/573 (61%), Gaps = 5/573 (0%)
 Frame = -1

Query: 1945 RIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFD 1766
            RI IS+ K +D++E G+T SMI  SS  +LK+L++ GFE+YLVGGCVRDL+LNR PKDFD
Sbjct: 57   RIGISEGKTLDAKEFGVTSSMISHSSMFVLKLLRSKGFESYLVGGCVRDLILNRTPKDFD 116

Query: 1765 VITTAALTQIKKKFNR---CEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFFLP 1595
            VITTA L +++K+  R    E+VGRRFPIC VHIKG+VVEV+SFETVA+   GKE+F   
Sbjct: 117  VITTAKLMEVRKQLRRSAHAEVVGRRFPICLVHIKGSVVEVTSFETVAQTSNGKEQFLYS 176

Query: 1594 QMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQVS 1415
             +PK  + KD  R +N + RDFT+NSLF+DPF NKIYDY N M D+++ KL T++PAQ+S
Sbjct: 177  LLPKCSNKKDLFRCKNSLRRDFTINSLFYDPFANKIYDYTNGMADLKTLKLETVIPAQLS 236

Query: 1414 FEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAAEA 1235
            F+ED             L  S S+E E                 I +EL YMLS+GAAE 
Sbjct: 237  FKEDPGRILRGFRITARLGLSISREIEAAIWTYSSLVKTLDKSRIMIELKYMLSYGAAEP 296

Query: 1234 SLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCLWV 1055
            SL LL ++ LL+ LLP  AA+L EQ      Q   ML+KLF +LD+L +CDRP    LW+
Sbjct: 297  SLRLLWKFKLLEFLLPVHAAYLDEQAIEEDAQASNMLLKLFFHLDKLVACDRPCDCTLWI 356

Query: 1054 AIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDSIY 875
             + A HLAL+N+PQ A+VV  FASVL+HG WK+GV+FA++ A  +  F+PEI        
Sbjct: 357  GLLAFHLALVNNPQDAIVVWAFASVLYHGEWKEGVKFAKEQARMSVNFVPEIRKSNLYKS 416

Query: 874  DDELSERITQLAILVKDSVDVLTETDSLQETM-VRKFPGSPCSGLVFVSKKMGSTVKDMF 698
            D+E++  +T+LA LV  S+  L E +SL++ + + ++P  P SG+VFVS+K G+    +F
Sbjct: 417  DEEIAIAVTKLASLVIHSISPLVEKNSLRQFLSISRYPSFPQSGMVFVSRKAGNLAHAIF 476

Query: 697  NVLICEVTTLNS-KRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVALEEVHDKEEK 521
              L  +     S +R  L+INY+ LG+G + E  FVLGKI+L+T+  G+  +    +  +
Sbjct: 477  KKLASDGKFYKSGRRTDLKINYDMLGKGQLSETGFVLGKIVLETMSSGIVGDGKDPEAGQ 536

Query: 520  DDLHAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADSEQVQVVSKKQKLMVN 341
              L   G+    EI   + P ++N +     G  E +  ++P +S   Q  +KK+KL+ N
Sbjct: 537  CHLKTKGT---EEIGPSQHPDLVNHQVASMDG--EGQLLSIP-NSACGQEKNKKRKLVEN 590

Query: 340  EEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSK 242
              +   K  S N  ++     KE K EQQ L K
Sbjct: 591  RCVARKKMGSGNHELSEKFKCKENKKEQQKLVK 623


>ref|XP_006828719.1| hypothetical protein AMTR_s00001p00028400 [Amborella trichopoda]
            gi|548833698|gb|ERM96135.1| hypothetical protein
            AMTR_s00001p00028400 [Amborella trichopoda]
          Length = 528

 Score =  420 bits (1079), Expect = e-114
 Identities = 220/477 (46%), Positives = 305/477 (63%)
 Frame = -1

Query: 1972 GSRSVFEPGRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLL 1793
            G    F+    + S WK +DSR +G+ + M+    W I+KVLQ  G+++YLVGGCVRDLL
Sbjct: 38   GENLCFQNDCFDTSTWKVVDSRTVGVNKWMVASPVWTIMKVLQRKGYDSYLVGGCVRDLL 97

Query: 1792 LNRVPKDFDVITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGK 1613
            L + PKDFDV+TTA L +I++ F+ C IVGRRFP+C V+I+G+ VEVSSFET     +  
Sbjct: 98   LGKTPKDFDVVTTAGLNKIRRTFHDCLIVGRRFPVCHVNIQGSTVEVSSFETTDPFAQ-T 156

Query: 1612 EEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTL 1433
            E      +P GC+ KDF RW++CM RDFT+N LF+DPF N IYDYAN M D+   KLRTL
Sbjct: 157  ESNLDSHVPSGCNHKDFIRWKDCMRRDFTINGLFYDPFANTIYDYANGMKDLSMCKLRTL 216

Query: 1432 VPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLS 1253
             PA  SF EDC            L   FSKET +                + +EL+YML+
Sbjct: 217  KPAHASFREDCARILRGFRIAARLNLVFSKETASAIRDLSSSIAMLSKARLLLELNYMLA 276

Query: 1252 FGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPS 1073
            FGAAEASL LL ++ LL+I+LP  AA+LT Q    S  NP MLM LF+NLD+L + +RPS
Sbjct: 277  FGAAEASLQLLWKFRLLEIVLPLHAAYLTHQAKILSDGNPNMLMGLFANLDKLLASNRPS 336

Query: 1072 HGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIID 893
            +  LW+ + A HLAL++ PQ+ALVV TF+S LFHGNW + V   RQ+A+    ++PEI++
Sbjct: 337  NTNLWIGLLAFHLALLSQPQEALVVWTFSSFLFHGNWSKAVSIGRQNAQVGVHYLPEILE 396

Query: 892  GYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVSKKMGST 713
              ++  D  LS  ++Q A+ V  S+D ++E D+  ET V   P SPC GL  +SK+M + 
Sbjct: 397  SKETKTDALLSSEVSQFALKVISSIDAMSEPDASLET-VSDLPLSPCLGLALISKRMANQ 455

Query: 712  VKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVALEE 542
            V  +F V+  +     + ++  EI+Y+ L +G++ ++RFVLGK+I++TL  G  L E
Sbjct: 456  VAMIFEVVRSD-RNPRAVQESSEIDYKLLRQGDLEQVRFVLGKVIMETLKAGAPLIE 511


>ref|XP_006299875.1| hypothetical protein CARUB_v10016083mg [Capsella rubella]
            gi|482568584|gb|EOA32773.1| hypothetical protein
            CARUB_v10016083mg [Capsella rubella]
          Length = 763

 Score =  416 bits (1069), Expect = e-113
 Identities = 216/496 (43%), Positives = 324/496 (65%), Gaps = 20/496 (4%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            ++ SKWK++ + + GI  SMIP SS  +L++L+  GF+AYLVGGCVRDL+L+RVPKD+DV
Sbjct: 54   VDTSKWKKVRASDAGIRNSMIPDSSMNVLRLLRRQGFDAYLVGGCVRDLILHRVPKDYDV 113

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKG--------KEE 1607
            ITTA L QI++ F+R +++G+RFPIC V + G+++EVSSF+TVA +  G        KE+
Sbjct: 114  ITTANLKQIRRLFHRAQVIGKRFPICHVWMGGSIIEVSSFDTVAHSDSGSENDLDKSKEK 173

Query: 1606 FFLP------------QMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAML 1463
            + +P            ++  GCD +D  RWRN + RDFT+NSLF+DPF  KIYDY N M 
Sbjct: 174  YDVPLDIEADKNNSLFKLYSGCDVRDCNRWRNSLQRDFTINSLFYDPFDFKIYDYTNGME 233

Query: 1462 DIRSGKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXX 1283
            D+   KLRTLVPA +SF+EDC            L  S SK+ E                 
Sbjct: 234  DLTDLKLRTLVPAHLSFKEDCARILRGLRIAARLGLSLSKDVETAIPEFVSSVANLGQFR 293

Query: 1282 IQMELDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNL 1103
            + ME++YML++GAA  S+ LL ++ LL +LLPFQAA+L +Q    S+   LML++LFSN+
Sbjct: 294  LIMEMNYMLAYGAAAPSILLLMKFKLLHVLLPFQAAYL-DQASETSLSTSLMLVRLFSNM 352

Query: 1102 DRLCSCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEA 923
            D+L SCD+P+   LW+A+ A H+AL+ +PQ+A+VV  FA++L+H NW++ V+FAR H  +
Sbjct: 353  DKLVSCDQPADSKLWIAVLAFHIALVRNPQEAIVVRAFAALLYHRNWRKAVEFARGHETS 412

Query: 922  AHVFIPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGL 743
               + PE++       D++L+E +++   ++KD+  VLT+ ++L+E +   +P    SGL
Sbjct: 413  VVGYTPEVLKSLRKRSDEDLAEAVSEFICILKDTQYVLTDIEALREALY-LYPDFKFSGL 471

Query: 742  VFVSKKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLG 563
            VF+ KK G  V + F   + +V    S+++G  I+Y SLG+GN  E+RFVLGKIILDT+ 
Sbjct: 472  VFIPKKKGRDVAEGF-ARLSDVEYYESEKEGFSIDYLSLGKGNSCEVRFVLGKIILDTIT 530

Query: 562  CGVALEEVHDKEEKDD 515
             G+ +E ++  ++K +
Sbjct: 531  EGIVIEPLNSVKKKQN 546


>ref|XP_003612324.1| Poly(A) polymerase [Medicago truncatula] gi|355513659|gb|AES95282.1|
            Poly(A) polymerase [Medicago truncatula]
          Length = 675

 Score =  415 bits (1066), Expect = e-113
 Identities = 246/598 (41%), Positives = 349/598 (58%), Gaps = 11/598 (1%)
 Frame = -1

Query: 1948 GRIEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDF 1769
            GRI+IS WK  D+ +LG+T SMI   S  +LK+LQ+ G ++YLVGGCVRDL+LNR PKDF
Sbjct: 14   GRIDISNWKTFDAWKLGVTGSMISKPSHFVLKLLQDKGLKSYLVGGCVRDLVLNRTPKDF 73

Query: 1768 DVITTAALTQIKKKF----NRCEIVGRRFPICRVHIKGTVVEVSSFETVAENGKGKEEFF 1601
            DV+TTA L ++K+ F    +R ++VGRRFP+C VH++G+V+EV+SF T +E  K  +   
Sbjct: 74   DVVTTAKLIEVKRLFRRFGHRADVVGRRFPVCLVHMQGSVIEVTSFHTESETPKAMKNVL 133

Query: 1600 LPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRSGKLRTLVPAQ 1421
               MPK  + ++    +N + RDFT+NSLF+DPF NKIYDYAN M D+RS KL T++PAQ
Sbjct: 134  HSLMPKCKNKENRFLCKNSLRRDFTINSLFYDPFANKIYDYANGMADLRSLKLETVIPAQ 193

Query: 1420 VSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQMELDYMLSFGAA 1241
            +SF+ED             L  S S+E E                 + +E++YMLS+GAA
Sbjct: 194  ISFKEDPGRILRGFRIAARLGLSLSREIEAAIWTCSSLVEDLNKDRMMIEMNYMLSYGAA 253

Query: 1240 EASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLCSCDRPSHGCL 1061
            E SL LL ++ LL  LLP QAA+L EQ      Q+  MLMKLF ++D L  C RPS   L
Sbjct: 254  EPSLRLLWKFKLLQFLLPVQAAYLDEQATKEDAQDSNMLMKLFFHMDNLVGCGRPSDCTL 313

Query: 1060 WVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVFIPEIIDGYDS 881
            W+ + A HLAL+N+PQ ALVV  FASVL+HG+W+ G++FA++HA+ +  F PEI      
Sbjct: 314  WIGLLAFHLALVNNPQDALVVWAFASVLYHGDWEGGIKFAKEHAKMSVNFEPEIKRSSIC 373

Query: 880  IYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVF-VSKKMGSTVKD 704
              D++++E +T+LA LV DS+  L   +SL +++ R +P  P   +V  VSKK G  V +
Sbjct: 374  KSDEDIAEAVTKLASLVIDSIHPLVNIESLSQSLSR-YPSVPPPHMVLVVSKKTGKAVSE 432

Query: 703  MFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVALEEVHDKEE 524
            +F VL+ ++    S+R+  +INY+ LG G+  E RFVLGKI+L T+  G+         +
Sbjct: 433  IFEVLVNDIKFYKSERKSAKINYDMLGSGHTSETRFVLGKIVLQTMQSGII-------GD 485

Query: 523  KDDLHAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTLPADS-EQVQVVSKKQKLM 347
             D       H          P    T++   L  +E KR  L   + E      KKQKL 
Sbjct: 486  ADGFGTEKCH----------PDTEGTKDFGQLVTREDKRKVLSTPNLEHRPQKLKKQKLA 535

Query: 346  VN---EEIQSVKGRSKNQVVASNEWLKEVKMEQQGLSKDYNC--SNKGKKRQGMLAED 188
             N   EE ++           + E  K VK+ Q+      N    NK  KR+ ++ ++
Sbjct: 536  ENACIEEQKTGLDEFCKYKETNEEHQKPVKLHQEVHLSMVNSMPKNKSNKRKQLINDE 593


>ref|NP_179349.2| polynucleotide adenylyltransferase family protein [Arabidopsis
            thaliana] gi|330251561|gb|AEC06655.1| polynucleotide
            adenylyltransferase family protein [Arabidopsis thaliana]
          Length = 757

 Score =  413 bits (1061), Expect = e-112
 Identities = 221/513 (43%), Positives = 328/513 (63%), Gaps = 21/513 (4%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            ++ SKWK++ + + GI  SMIP SS  +L++L+  GF+AYLVGGCVRDL+LNRVPKD+DV
Sbjct: 54   VDTSKWKKVRASDAGIKNSMIPESSMNVLRLLRRQGFDAYLVGGCVRDLILNRVPKDYDV 113

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENG------------- 1622
            ITTA L QI++ F+R +++G+RFPIC V + G+++EVSSF+TVA +              
Sbjct: 114  ITTADLKQIRRLFHRAQVIGKRFPICHVWMGGSIIEVSSFDTVAHSDSDLEKSKEKSGVS 173

Query: 1621 ---KGKEEFFLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAMLDIRS 1451
               K  +   L +M  G D KD  RWRN + RDFT+NSLF++PF   IYDYAN M D+  
Sbjct: 174  LDTKANKNNSLFKMYSGWDIKDCKRWRNSLQRDFTINSLFYNPFDFTIYDYANGMEDLTD 233

Query: 1450 GKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXXIQME 1271
             KLRTLVPA +SF+EDC            L  S SK+ +                 + ME
Sbjct: 234  LKLRTLVPAHLSFKEDCARILRGLRIAARLGLSLSKDVKTAIPEFVSSVANLDQFRLIME 293

Query: 1270 LDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNLDRLC 1091
            ++YML++GAA  S+ LL ++ LL +LLPFQAA+L +Q    S+ + LML++LFSN+D+L 
Sbjct: 294  MNYMLAYGAAAPSILLLMKFKLLHVLLPFQAAYL-DQASKTSLSSSLMLVRLFSNMDKLV 352

Query: 1090 SCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEAAHVF 911
            SCD+P+   LW+A+ A H+AL+ +PQ+A+VV  FA++L+HGNW + V+FAR+H  +   +
Sbjct: 353  SCDQPADPKLWIAVLAFHIALVRNPQEAIVVRAFAALLYHGNWSKAVEFAREHETSVIGY 412

Query: 910  IPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGLVFVS 731
             PE+        D++L+E +++   L+KD+  VLT+ ++L+E +   +P    SGLVF+ 
Sbjct: 413  APEVSKSSRKRSDEDLAEAVSEFTCLLKDTQYVLTDKEALREALY-LYPDFKFSGLVFIP 471

Query: 730  KKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLGCGVA 551
            KK G  V + F + + +V +  S+++G  I+Y  LG+GN  E+RFVLGKIILDT+  G  
Sbjct: 472  KKKGRDVAEGF-MRLSDVESYESQKEGFSIDYVLLGKGNPCEVRFVLGKIILDTITEGTV 530

Query: 550  LEEVHDKEEKDD-----LHAPGSHQKREIFVEK 467
            +E ++  ++K       + A    +K E+FV K
Sbjct: 531  IEPLNSVKKKQSTRNHIVPAACLEKKDELFVSK 563


>ref|XP_002884069.1| polynucleotide adenylyltransferase family protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329909|gb|EFH60328.1| polynucleotide
            adenylyltransferase family protein [Arabidopsis lyrata
            subsp. lyrata]
          Length = 761

 Score =  408 bits (1048), Expect = e-111
 Identities = 238/602 (39%), Positives = 360/602 (59%), Gaps = 35/602 (5%)
 Frame = -1

Query: 1942 IEISKWKRMDSRELGITRSMIPVSSWIILKVLQNSGFEAYLVGGCVRDLLLNRVPKDFDV 1763
            ++ SKWK++ + + GI  SMIP SS  +L++L+  GF+AYLVGGCVRDL+L+RVPKD+DV
Sbjct: 54   VDTSKWKKVRASDAGIRNSMIPESSMNVLRLLRRQGFDAYLVGGCVRDLILHRVPKDYDV 113

Query: 1762 ITTAALTQIKKKFNRCEIVGRRFPICRVHIKGTVVEVSSFETVAENG--------KGKEE 1607
            ITTA L QI++ F+R +++G+RFPIC V + G+++EVSSF+TVA +         K KE+
Sbjct: 114  ITTANLKQIRRLFHRAQVIGKRFPICHVWMGGSIIEVSSFDTVAHSDSEHEDDLEKSKEK 173

Query: 1606 F------------FLPQMPKGCDFKDFARWRNCMHRDFTVNSLFFDPFVNKIYDYANAML 1463
                          L  M  G D KD  RWRN + RDFT+NSLF++PF  KIYDYAN M 
Sbjct: 174  SGVSLDTEANKNNSLFTMYSGWDVKDCNRWRNSLQRDFTINSLFYNPFELKIYDYANGME 233

Query: 1462 DIRSGKLRTLVPAQVSFEEDCXXXXXXXXXXXXLKFSFSKETENXXXXXXXXXXXXXXXX 1283
            D+   KLRTLVPA +SF+EDC            L  S SK+ E                 
Sbjct: 234  DLTDLKLRTLVPAHLSFKEDCARILRGLRIAARLGLSLSKDIETAIPEFVSSVANLDQFR 293

Query: 1282 IQMELDYMLSFGAAEASLDLLQRYHLLDILLPFQAAHLTEQDHNRSIQNPLMLMKLFSNL 1103
            + ME++YML++GAA  S+ LL ++ LL +LLPFQAA+L +Q    S+ + LML++LFSN+
Sbjct: 294  LIMEMNYMLAYGAAAPSILLLMKFKLLHVLLPFQAAYL-DQASETSLSSSLMLVRLFSNM 352

Query: 1102 DRLCSCDRPSHGCLWVAIFAVHLALINSPQQALVVLTFASVLFHGNWKQGVQFARQHAEA 923
            D+L SCD+P+   LW+A+ A H+AL+ +PQ+A+VV  FA++L+H NW + V+FAR+H  +
Sbjct: 353  DKLVSCDQPADPKLWIAVLAFHIALVRNPQEAIVVRAFAALLYHRNWSKAVKFAREHETS 412

Query: 922  AHVFIPEIIDGYDSIYDDELSERITQLAILVKDSVDVLTETDSLQETMVRKFPGSPCSGL 743
               + PE+        D++L+E +++   L+KD+  VLT+ ++L+E +   +P    SGL
Sbjct: 413  VVGYAPEVSKFSRKRSDEDLAEAVSEFTCLLKDTQYVLTDIEALREALY-LYPDFKFSGL 471

Query: 742  VFVSKKMGSTVKDMFNVLICEVTTLNSKRQGLEINYESLGRGNVRELRFVLGKIILDTLG 563
            VF+ K+ G  V +     + +V +  SK++G  I+Y  LG+GN  E+RFVLGKIILDT+ 
Sbjct: 472  VFIPKRKGRDVAEGL-ARLSDVESYESKKEGFSIDYLLLGKGNPCEVRFVLGKIILDTIT 530

Query: 562  CGVALEEVHDKEEKDD-----LHAPGSHQKREIFVEKFPTVLNTRERDPLGKQEKKRSTL 398
             G+ +E ++  ++K       + A    +K E+FV K              K+E    T 
Sbjct: 531  EGIVIEPLNSVKKKQSTSNQIVSAACLEKKDELFVTK------------SSKEENNNHTP 578

Query: 397  PADSEQVQVV--------SKKQKLMVNEEI--QSVKGRSKNQVVASNEWLKEVKMEQQGL 248
              DS    V+          +QK+    E+  +++ G +KNQ  +  + LK  + ++  +
Sbjct: 579  VYDSNASSVLKILKRTRKESEQKIDQETEVCPRTLSGPAKNQDQSVVQKLKRRRSKEAQV 638

Query: 247  SK 242
            S+
Sbjct: 639  SE 640

