BLASTX nr result

ID: Catharanthus23_contig00007083 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00007083
         (2671 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI29694.3| unnamed protein product [Vitis vinifera]              498   e-138
ref|XP_006349314.1| PREDICTED: regulator of nonsense transcripts...   470   e-129
ref|XP_006492554.1| PREDICTED: regulator of nonsense transcripts...   467   e-128
ref|XP_004230442.1| PREDICTED: uncharacterized protein LOC101264...   467   e-128
ref|XP_006431585.1| hypothetical protein CICLE_v10000901mg [Citr...   463   e-127
ref|XP_006385051.1| hypothetical protein POPTR_0004s23450g [Popu...   461   e-126
ref|XP_006339922.1| PREDICTED: regulator of nonsense transcripts...   454   e-124
ref|XP_002328787.1| predicted protein [Populus trichocarpa] gi|5...   454   e-124
ref|XP_006389505.1| hypothetical protein POPTR_0022s00460g [Popu...   450   e-123
gb|EMJ23015.1| hypothetical protein PRUPE_ppa004923mg [Prunus pe...   439   e-120
gb|EOX97031.1| Smg-4/UPF3 family protein, putative isoform 1 [Th...   432   e-118
gb|EMJ23014.1| hypothetical protein PRUPE_ppa004923mg [Prunus pe...   421   e-115
gb|ESW20566.1| hypothetical protein PHAVU_006G219800g [Phaseolus...   395   e-107
ref|XP_006598794.1| PREDICTED: regulator of nonsense transcripts...   393   e-106
ref|XP_004485448.1| PREDICTED: regulator of nonsense transcripts...   393   e-106
ref|XP_006592654.1| PREDICTED: regulator of nonsense transcripts...   389   e-105
ref|XP_004248850.1| PREDICTED: uncharacterized protein LOC101263...   381   e-103
gb|EOX97032.1| Smg-4/UPF3 family protein, putative isoform 2, pa...   377   e-101
ref|XP_004485447.1| PREDICTED: regulator of nonsense transcripts...   372   e-100
gb|EOY26871.1| Smg-4/UPF3 family protein, putative isoform 2 [Th...   345   5e-92

>emb|CBI29694.3| unnamed protein product [Vitis vinifera]
          Length = 519

 Score =  498 bits (1283), Expect = e-138
 Identities = 281/521 (53%), Positives = 337/521 (64%), Gaps = 17/521 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKGPLDRTKVV+RHLPPT+S+++ +EQ+D+ F GRY  V +RPGK SQK QSYSRAY+DF
Sbjct: 1    MKGPLDRTKVVVRHLPPTISEAAFLEQIDTVFKGRYTLVKFRPGKNSQKRQSYSRAYLDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DVIEFAEFFDGHVFVNEKGTQFKT VEYAPSQR+PK W KKDGREGTI +DPEY+E
Sbjct: 61   KRPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRIPKHWPKKDGREGTIFKDPEYME 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            F+E LAKPVENLPSAEIQL           KD PIVTPLMD+VRQKRAAK   RRS+SNG
Sbjct: 121  FVELLAKPVENLPSAEIQLERREAERAGAVKDTPIVTPLMDFVRQKRAAKGVSRRSLSNG 180

Query: 1759 KSTKRVSGAATG-IXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K ++R SG+++G                 + MYVLRD+AK TS KD+ST+++V K+DDQ 
Sbjct: 181  KLSRRASGSSSGNPSLGSSKRGSEKRRLSTTMYVLRDTAKSTSAKDKSTFILVPKRDDQL 240

Query: 1582 LLDKPRN-------DGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424
            L DK  N       + LEEE G S  V                     +   QQN  SPV
Sbjct: 241  LSDKSVNLAAGGGAEALEEESGVSGAVDAGKKKVLLLKGKEREISHHLL---QQNVTSPV 297

Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPP 1247
            KN + +   KQNQ           ILLNKD R               SNL+K+KRPPRPP
Sbjct: 298  KNILGANAPKQNQRREGSGRIIRSILLNKDARQSQSSMFQTEQQSQASNLEKEKRPPRPP 357

Query: 1246 SLHLLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR-XXXXXXXXX 1073
             + L  K+TNGA DD+   ND H F ++KQ++R +NK+RPDRGVW PLRR          
Sbjct: 358  HIQLASKETNGAQDDKVVGNDVHSFVSEKQDKRTRNKDRPDRGVWTPLRRSDGSHASDES 417

Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896
                          EG+H E RS+++ AR GE K   SGRG HS+LDNG++KH  RRG  
Sbjct: 418  LSSSASQPTSSDFPEGSHGEMRSDMSNARSGEVKALGSGRGGHSALDNGSHKHSGRRGPT 477

Query: 895  H-IKDSDGS---AEGKSLRRGGS-CYGSHEKQVWVQKSSSG 788
            H +KD+DGS   +EGK  +RG +  YGSHEKQVWVQKSSSG
Sbjct: 478  HSVKDADGSSIVSEGKHSKRGSAPGYGSHEKQVWVQKSSSG 518


>ref|XP_006349314.1| PREDICTED: regulator of nonsense transcripts UPF3-like [Solanum
            tuberosum]
          Length = 483

 Score =  470 bits (1209), Expect = e-129
 Identities = 270/519 (52%), Positives = 325/519 (62%), Gaps = 14/519 (2%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKGPLDR+KVVLRHLPPT+SQS L++QVDSRFAGRYNW  + PGK+SQK Q+YSRAYI+F
Sbjct: 1    MKGPLDRSKVVLRHLPPTISQSMLLDQVDSRFAGRYNWFCFLPGKSSQKHQTYSRAYIEF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            K P+DVIEFAEFFDGHVFVNEKGTQFKT VEYAPSQRVPK WSKKDGREGTIL+DPEYLE
Sbjct: 61   KMPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKHWSKKDGREGTILKDPEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLEF++KP+ENLPSAEIQL           KD PIVTPLMDY+RQKRAAKSG R+S++NG
Sbjct: 121  FLEFISKPIENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYIRQKRAAKSGARKSIANG 180

Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            + T+R SG +TG                 + MYVLRDS+K  SGKD+ TY++  K+DDQQ
Sbjct: 181  RPTRRTSGTSTGSPSSSASKRSSEKRRASTTMYVLRDSSKAGSGKDK-TYILAPKRDDQQ 239

Query: 1582 LLDKPRN-------DGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAAS-P 1427
              +K          + +EEE G +A V                      P+ Q+  AS  
Sbjct: 240  RAEKSGTSAAGSVANAVEEETGGAADVGKKKILLLKEKEN---------PNNQRREASGR 290

Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPP 1247
            +   I   + +QNQ               +D                  +DKDK+PPRPP
Sbjct: 291  IIRSILLKDARQNQAPSAS---------QQDKH---------------RVDKDKKPPRPP 326

Query: 1246 SLHLLQKDTNGAPDDRTPN-DFHGFHTDKQERRPKNKERPDRGVWAPLRR--XXXXXXXX 1076
            S+ L Q++TNGA +D+    D H  HT+KQE+R + ++RPDRGVW PLRR          
Sbjct: 327  SVQLFQRETNGANEDKVLGADLHIVHTEKQEKRTRIRDRPDRGVWTPLRRSDSLHASDES 386

Query: 1075 XXXXXXXXXXXXXXAEGNHAETRSEITGARG-EFKHRESGRGSHSSLDNGTYKHGARRGS 899
                           EG+  ET+  +  ARG EF+   SGR S+SS DNGTYKHG RRG 
Sbjct: 387  LSSSASQSSEVPDFVEGSQGETKHGLANARGAEFRPMGSGRNSYSSFDNGTYKHGGRRGM 446

Query: 898  AHIKDSDGSAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
                D     EGK LRRGG S YG+HEKQVWVQKSSSG+
Sbjct: 447  R--DDGISVGEGKPLRRGGPSSYGTHEKQVWVQKSSSGT 483


>ref|XP_006492554.1| PREDICTED: regulator of nonsense transcripts UPF3-like [Citrus
            sinensis]
          Length = 514

 Score =  467 bits (1202), Expect = e-128
 Identities = 276/522 (52%), Positives = 335/522 (64%), Gaps = 17/522 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKGPLDRTKVV+R+LPP ++Q +  EQ+D  F GRYNWVS+R GKTSQK QS +RAY+DF
Sbjct: 1    MKGPLDRTKVVVRNLPPAITQPAFTEQIDGAFGGRYNWVSFRQGKTSQKHQSCARAYLDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            K+P+DV+EFAEFF+GHVFVNEKG QFKT VEYAPSQRVPKQWSKKDGREGT+L+DPEYLE
Sbjct: 61   KKPEDVLEFAEFFNGHVFVNEKGVQFKTIVEYAPSQRVPKQWSKKDGREGTLLKDPEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLEF++KPVENLPSAEIQL           K+  IVTPLMD+VRQKRAAK+G RR +SNG
Sbjct: 121  FLEFISKPVENLPSAEIQLERREAERAGAAKEALIVTPLMDFVRQKRAAKAGPRRLLSNG 180

Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K ++R SG++TG                 + MYVLRD+AK +SGKD+STY++V K+DDQ 
Sbjct: 181  KLSRRASGSSTGSPASGSSKRGSDKKKASTTMYVLRDTAKNSSGKDKSTYILVPKRDDQD 240

Query: 1582 LLDKPRNDG--------LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427
              DKP +          LEE G  + +                I  +S   S QQ+A+  
Sbjct: 241  -FDKPVSSSSATGSEVVLEESGVPANSDGGKKKVLLLKGKEREISQVSGSVSHQQSAS-- 297

Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPP 1247
            VKN ISS  +KQNQ           ILLNKD R              SNL+KDKRPPRP 
Sbjct: 298  VKNIISSPALKQNQRRENSGRIIRGILLNKDAR-QNQASGLHSEQQISNLEKDKRPPRPS 356

Query: 1246 SLHLLQKDTNGAPDDRT-PNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXX 1079
             +HL+ KDTNG  DD+   ND    H++KQERR +NK+RPDR  W  LRR          
Sbjct: 357  HVHLVMKDTNGVSDDKVIVND---LHSEKQERRTRNKDRPDRAAWT-LRRSDGSYQSDES 412

Query: 1078 XXXXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRG 902
                           +EGN  + + +++  R GE K    GR SHSS+DNG+++H  RRG
Sbjct: 413  LSSSASQLSLSAVDSSEGNLGDGKFDLSNMRSGEVKAVGGGRSSHSSVDNGSHRHIGRRG 472

Query: 901  SAHIKD--SDGSAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
              H+KD  S   +EGK LRRGG S YGSHEKQVWVQKSSSGS
Sbjct: 473  PTHVKDDSSPVMSEGKPLRRGGASGYGSHEKQVWVQKSSSGS 514


>ref|XP_004230442.1| PREDICTED: uncharacterized protein LOC101264766 [Solanum
            lycopersicum]
          Length = 485

 Score =  467 bits (1201), Expect = e-128
 Identities = 269/518 (51%), Positives = 323/518 (62%), Gaps = 13/518 (2%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKGPLDR+KVVLRHLPPT+SQS L++QVDSRFAGRYNW  + PGK+SQK Q+YSRAYI+F
Sbjct: 1    MKGPLDRSKVVLRHLPPTISQSMLLDQVDSRFAGRYNWFCFLPGKSSQKHQTYSRAYIEF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DVIEFAEFFDGHVFVNEKGTQFKT VEYAPSQRVP+ WSKKDGREGTIL+DPEYLE
Sbjct: 61   KRPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPQHWSKKDGREGTILKDPEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLEF++KP+ENLPSAEIQL           KD PIVTPLMDY+RQKRAAKSG R+S++NG
Sbjct: 121  FLEFISKPIENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYIRQKRAAKSGARKSIANG 180

Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            + T+R SG + G                 + MYVLRDS+K  SGKD+ TY++  K+DDQQ
Sbjct: 181  RPTRRASGTSAGSPSSSASKRSSEKRRASTTMYVLRDSSKAGSGKDK-TYILAPKRDDQQ 239

Query: 1582 LLDKPRN-------DGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424
              +K          + +EEE G +A V                      P+ Q+  A   
Sbjct: 240  RAEKSGTSAPGSVANAVEEETGGAADVGKKKILLLKEKEKEN-------PNNQRREA--- 289

Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPPS 1244
                 SG + ++             +L KD R                +DKDK+PPRPPS
Sbjct: 290  -----SGRIIRS-------------ILLKDAR--QNQAPSASQQEKHRVDKDKKPPRPPS 329

Query: 1243 LHLLQKDTNGAPDDRTPN-DFHGFHTDKQERRPKNKERPDRGVWAPLRR--XXXXXXXXX 1073
            + L Q++TNGA +DR    D H  HT+KQE+R + ++RPDRGVW PLRR           
Sbjct: 330  VQLFQRETNGANEDRVLGADLHVVHTEKQEKRTRIRDRPDRGVWTPLRRSDSLHASDESL 389

Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896
                          EG+  ET+  +  AR  EF+   SGR SHSS DNGTYKHG RRG  
Sbjct: 390  SSSASQSSEVPDFVEGSPGETKHGLVNARVAEFRPMGSGRNSHSSFDNGTYKHGGRRGMR 449

Query: 895  HIKDSDGSAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
               D     EGK LRRGG S Y +HEKQVWVQKSSSG+
Sbjct: 450  --DDGISVGEGKPLRRGGPSSYNTHEKQVWVQKSSSGT 485


>ref|XP_006431585.1| hypothetical protein CICLE_v10000901mg [Citrus clementina]
            gi|557533707|gb|ESR44825.1| hypothetical protein
            CICLE_v10000901mg [Citrus clementina]
          Length = 514

 Score =  463 bits (1192), Expect = e-127
 Identities = 276/523 (52%), Positives = 335/523 (64%), Gaps = 18/523 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKGPLDRTKVV+R+LPP ++Q +  EQ+D  F GRYNWVS+R GKTSQK QS +RAY+DF
Sbjct: 1    MKGPLDRTKVVVRNLPPAITQPAFTEQIDGAFGGRYNWVSFRQGKTSQKHQSCARAYLDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            K+P+DV+EFAEFF+GHVFVNEKG QFKT VEYAPSQRVPKQWSKKDGREGT+L+DPEYLE
Sbjct: 61   KKPEDVLEFAEFFNGHVFVNEKGVQFKTIVEYAPSQRVPKQWSKKDGREGTLLKDPEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLEF++KPVENLPSAEIQL           K+  IVTPLMD+VRQKRAAK+G RR +SNG
Sbjct: 121  FLEFISKPVENLPSAEIQLERREAERAGAAKEALIVTPLMDFVRQKRAAKAGPRRLLSNG 180

Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K ++R SG++TG                 + MYVLRD+AK +SGKD+STY++V K+DDQ 
Sbjct: 181  KLSRRASGSSTGSPASGSSKRGSDKKKASTTMYVLRDTAKNSSGKDKSTYILVPKRDDQD 240

Query: 1582 LLDKPRNDG--------LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427
              DKP +          LEE G  + +                I  +S   S QQ+A+  
Sbjct: 241  -FDKPVSSSSATGSEVVLEESGVPANSDGGKKKVLLLKGKEREISQVSGSVSHQQSAS-- 297

Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPP 1247
            VK  ISS  +KQNQ           ILLNKD R              SNL+KDKRPPRP 
Sbjct: 298  VKTIISSPALKQNQRRENSGRIIRGILLNKDAR-QNQASGLHSEQQISNLEKDKRPPRPS 356

Query: 1246 SLHLLQKDTNGAPDDRT-PNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXX 1079
             + L+ KDTNG  DD+   ND    H++KQERR +NK+RPDR  W  LRR          
Sbjct: 357  HVQLVMKDTNGVSDDKVIVND---LHSEKQERRTRNKDRPDRAAWT-LRRSDGSYQSDES 412

Query: 1078 XXXXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRG 902
                           +EGN  + + +++  R GE K    GR SHSS+DNG+++H  RRG
Sbjct: 413  LSSSASQLSLSAVDSSEGNLGDGKFDLSNMRSGEVKAVGGGRSSHSSVDNGSHRHIGRRG 472

Query: 901  SAHIKDSDGS---AEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
              H+KD DGS   +EGK LRRGG S YGSHEKQVWVQKSSSGS
Sbjct: 473  PTHVKD-DGSPVMSEGKPLRRGGASGYGSHEKQVWVQKSSSGS 514


>ref|XP_006385051.1| hypothetical protein POPTR_0004s23450g [Populus trichocarpa]
            gi|550341819|gb|ERP62848.1| hypothetical protein
            POPTR_0004s23450g [Populus trichocarpa]
          Length = 520

 Score =  461 bits (1185), Expect = e-126
 Identities = 270/516 (52%), Positives = 326/516 (63%), Gaps = 16/516 (3%)
 Frame = -1

Query: 2293 GPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKR 2114
            G  D+TKVV+RHLPP +SQ   +EQ+D  F+GRYNW+SYRPG  SQK QSYSRAYIDFKR
Sbjct: 5    GQSDKTKVVVRHLPPGISQPMFVEQIDVAFSGRYNWLSYRPGNNSQKHQSYSRAYIDFKR 64

Query: 2113 PDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFL 1934
            P+DVI+FAEFF+GH+FVNEKGTQFK  VEY+PSQRVPKQWSKKDGREGTI +DPEYLEFL
Sbjct: 65   PEDVIDFAEFFNGHIFVNEKGTQFKAIVEYSPSQRVPKQWSKKDGREGTISKDPEYLEFL 124

Query: 1933 EFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKS 1754
            E +AKPVENLPSAEIQL           KD PIVTPLMD+VRQKR AK+G RR +SNGK 
Sbjct: 125  ELIAKPVENLPSAEIQLERREAERAGAAKDAPIVTPLMDFVRQKRVAKNGPRRILSNGKL 184

Query: 1753 TKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLD 1574
            ++R  G+ +                 + MYVLRD+AK TSGKD+STYV V K+DDQQL +
Sbjct: 185  SRRAGGSGSP-SSSSLKRGSEKKRISTTMYVLRDTAKSTSGKDKSTYVHVPKRDDQQLSN 243

Query: 1573 KPRNDG------LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNF 1415
                        LE+E   S  T                I  ++   S QQ+ +S  +N 
Sbjct: 244  AVTLGSGSGTAVLEDESVVSGITDSGKKKILLLKGKEKEISLVTGTMSQQQSISSSDRNI 303

Query: 1414 ISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLH 1238
            ISS  +K +Q           ILLNKD+R               SNL+K+KRPPRPP   
Sbjct: 304  ISSTALK-SQRRETSGRMIRSILLNKDSRHIRSSGVHSEPQMQTSNLEKEKRPPRPPHAQ 362

Query: 1237 LLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXXXXX 1070
            L  KD NG PDD+   ND HGF  +KQE+R +NK+RPDRGVW PLRR             
Sbjct: 363  LGLKDANGTPDDKVVGNDLHGFPNEKQEKRTRNKDRPDRGVWTPLRRSDGSYASDESLLS 422

Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH 893
                        ++GNH + + +    R GE K   SGRG+HSSLDNG++KH  RRG +H
Sbjct: 423  SASQSTQSVFDSSQGNHGDVKVDSLNLRSGEVKVLGSGRGNHSSLDNGSHKHFGRRGPSH 482

Query: 892  I-KDSDGS-AEGKSLRRGGSC-YGSHEKQVWVQKSS 794
            I +D+DGS  E K+ +RGGS  YGSHEKQVWVQKS+
Sbjct: 483  IVRDADGSTVEAKTPKRGGSSGYGSHEKQVWVQKST 518


>ref|XP_006339922.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1
            [Solanum tuberosum] gi|565345688|ref|XP_006339923.1|
            PREDICTED: regulator of nonsense transcripts UPF3-like
            isoform X2 [Solanum tuberosum]
            gi|565345690|ref|XP_006339924.1| PREDICTED: regulator of
            nonsense transcripts UPF3-like isoform X3 [Solanum
            tuberosum]
          Length = 508

 Score =  454 bits (1167), Expect = e-124
 Identities = 270/514 (52%), Positives = 319/514 (62%), Gaps = 15/514 (2%)
 Frame = -1

Query: 2281 RTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDDV 2102
            RTKVVLRHLPPTLSQS L+E VDSRFAGRYNW ++RP KTS K QSYS+AYIDF+  +DV
Sbjct: 5    RTKVVLRHLPPTLSQSMLLEHVDSRFAGRYNWFTFRPAKTSLKHQSYSKAYIDFRNMEDV 64

Query: 2101 IEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFLA 1922
             EFAEFFDGH+FVNEKGTQFKT VEYAPSQRVPK W KKD REGTIL+DP Y+EFLEFLA
Sbjct: 65   TEFAEFFDGHMFVNEKGTQFKTIVEYAPSQRVPKHWLKKDAREGTILKDPAYMEFLEFLA 124

Query: 1921 KPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKRV 1742
            KPVENLPSAEIQL           KD PIVTPLMDYVRQKRA KSG RRS+SNGKS+K V
Sbjct: 125  KPVENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYVRQKRAVKSGARRSISNGKSSKSV 184

Query: 1741 SGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKPRN 1562
             G ++                 + MYV RDS+K  + KD+S Y+++ K+ DQQL  K  +
Sbjct: 185  GGTSSRSPSSTASRRGSEKRTSTTMYVQRDSSKAGNSKDKS-YILLPKRGDQQLSVKSGS 243

Query: 1561 -------DGLEEEGGSSATV-PXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFISS 1406
                   D +E E G S T                  P++S     QQN +S +KN  S 
Sbjct: 244  SAPGSEIDVVEGEIGRSVTADSGKKKILLLKGKEKEGPNVSGGSLAQQNVSSALKNSPSL 303

Query: 1405 GNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPPSLHLLQK 1226
              +KQNQ           ILL KD R                 DKD RPPRPPS+ L QK
Sbjct: 304  SALKQNQRQEASGRIIRSILL-KDARQNQSAFQSDQIQ-----DKDMRPPRPPSMQLFQK 357

Query: 1225 DTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR--XXXXXXXXXXXXXXX 1055
            DT+GA +D+   N+ H  H +KQERR +N++RPDRGVWAPLRR                 
Sbjct: 358  DTSGANEDKVVGNEKHVVHIEKQERRSRNRDRPDRGVWAPLRRADSSQASNGSLSSGIPQ 417

Query: 1054 XXXXXXXAEGNHAETRSEITGARG-EFKHRESGRGSHSSLDNGTYKHGARRGSAHIKDSD 878
                    EG   E ++++  ARG EF+   SGR SHSS DNG YKHG RRG   ++D  
Sbjct: 418  SSQVREFVEGGQGELKNDLPIARGTEFRPIGSGRNSHSSADNGNYKHGGRRG---LRDVA 474

Query: 877  GSA--EGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
            G++  EGK +++GG S Y S EKQVWVQKSSSGS
Sbjct: 475  GTSIGEGKPVKKGGTSAYSSLEKQVWVQKSSSGS 508


>ref|XP_002328787.1| predicted protein [Populus trichocarpa]
            gi|566168252|ref|XP_006385052.1| Smg-4/UPF3 family
            protein [Populus trichocarpa] gi|550341820|gb|ERP62849.1|
            Smg-4/UPF3 family protein [Populus trichocarpa]
          Length = 527

 Score =  454 bits (1167), Expect = e-124
 Identities = 270/523 (51%), Positives = 326/523 (62%), Gaps = 23/523 (4%)
 Frame = -1

Query: 2293 GPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKR 2114
            G  D+TKVV+RHLPP +SQ   +EQ+D  F+GRYNW+SYRPG  SQK QSYSRAYIDFKR
Sbjct: 5    GQSDKTKVVVRHLPPGISQPMFVEQIDVAFSGRYNWLSYRPGNNSQKHQSYSRAYIDFKR 64

Query: 2113 PDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFL 1934
            P+DVI+FAEFF+GH+FVNEKGTQFK  VEY+PSQRVPKQWSKKDGREGTI +DPEYLEFL
Sbjct: 65   PEDVIDFAEFFNGHIFVNEKGTQFKAIVEYSPSQRVPKQWSKKDGREGTISKDPEYLEFL 124

Query: 1933 EFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKS 1754
            E +AKPVENLPSAEIQL           KD PIVTPLMD+VRQKR AK+G RR +SNGK 
Sbjct: 125  ELIAKPVENLPSAEIQLERREAERAGAAKDAPIVTPLMDFVRQKRVAKNGPRRILSNGKL 184

Query: 1753 TKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLD 1574
            ++R  G+ +                 + MYVLRD+AK TSGKD+STYV V K+DDQQL +
Sbjct: 185  SRRAGGSGSP-SSSSLKRGSEKKRISTTMYVLRDTAKSTSGKDKSTYVHVPKRDDQQLSN 243

Query: 1573 KPRNDG------LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNF 1415
                        LE+E   S  T                I  ++   S QQ+ +S  +N 
Sbjct: 244  AVTLGSGSGTAVLEDESVVSGITDSGKKKILLLKGKEKEISLVTGTMSQQQSISSSDRNI 303

Query: 1414 ISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLH 1238
            ISS  +K +Q           ILLNKD+R               SNL+K+KRPPRPP   
Sbjct: 304  ISSTALK-SQRRETSGRMIRSILLNKDSRHIRSSGVHSEPQMQTSNLEKEKRPPRPPHAQ 362

Query: 1237 LLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXXXXX 1070
            L  KD NG PDD+   ND HGF  +KQE+R +NK+RPDRGVW PLRR             
Sbjct: 363  LGLKDANGTPDDKVVGNDLHGFPNEKQEKRTRNKDRPDRGVWTPLRRSDGSYASDESLLS 422

Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH 893
                        ++GNH + + +    R GE K   SGRG+HSSLDNG++KH  RRG +H
Sbjct: 423  SASQSTQSVFDSSQGNHGDVKVDSLNLRSGEVKVLGSGRGNHSSLDNGSHKHFGRRGPSH 482

Query: 892  I-KDSDGS-AEGKSLRRGGSC-YGSHE-------KQVWVQKSS 794
            I +D+DGS  E K+ +RGGS  YGSHE       KQVWVQKS+
Sbjct: 483  IVRDADGSTVEAKTPKRGGSSGYGSHEVCSLDSQKQVWVQKST 525


>ref|XP_006389505.1| hypothetical protein POPTR_0022s00460g [Populus trichocarpa]
            gi|550312328|gb|ERP48419.1| hypothetical protein
            POPTR_0022s00460g [Populus trichocarpa]
          Length = 511

 Score =  450 bits (1158), Expect = e-123
 Identities = 269/513 (52%), Positives = 319/513 (62%), Gaps = 10/513 (1%)
 Frame = -1

Query: 2293 GPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKR 2114
            G  D+TKVV+RHLPP +SQ   +EQ+D  F+GRYNW+SYRPGK+SQK QS SRAYIDFKR
Sbjct: 4    GQSDKTKVVVRHLPPGVSQPMFVEQIDLAFSGRYNWLSYRPGKSSQKHQSCSRAYIDFKR 63

Query: 2113 PDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFL 1934
            PDDVI+FAEFF+GH+FVNEKGTQFK  VEYAPSQ VPKQWSKKDGREGTIL+DPEYLEFL
Sbjct: 64   PDDVIDFAEFFNGHLFVNEKGTQFKAIVEYAPSQHVPKQWSKKDGREGTILKDPEYLEFL 123

Query: 1933 EFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKS 1754
            EF+AKPVENLPSAEIQL           KD PIVTPLM+++RQKRAAKSG RR +SNGK 
Sbjct: 124  EFIAKPVENLPSAEIQLERREAERAGVAKDAPIVTPLMEFIRQKRAAKSGPRRILSNGKP 183

Query: 1753 TKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLD 1574
            ++R  G+ +                 + MYVLRD+ K TSGK++S Y  V K DD+QL  
Sbjct: 184  SRRAGGSGSP-SSSSSKRGSEKKRASTTMYVLRDTVKGTSGKEKSIYAQVPKLDDRQLSK 242

Query: 1573 K-PRNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFISSGNV 1397
                  G   E     T                      I S QQ+ +   +N ISS  +
Sbjct: 243  AVTLGSGSGTEVSEEETAVSGITGTGKKKILLLKGKEKEI-SLQQSISPSDRNIISSTAL 301

Query: 1396 KQNQXXXXXXXXXXXILLNKDN-RXXXXXXXXXXXXXXSNLDKDKRPPRPPSLHLLQKDT 1220
            K +Q           ILLNKD+ R              SNL+KDKRPPRPP   L+ KD 
Sbjct: 302  K-SQRHESSGRVIKSILLNKDSRRIQSSGVQSEPQMQTSNLEKDKRPPRPPHA-LVLKDA 359

Query: 1219 NGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXXXXXXXXXXX 1043
            NG PDD+   ND HGF  +KQERR +NK+RPDR VW  LRR                   
Sbjct: 360  NGTPDDKVVGNDLHGFPNEKQERRTRNKDRPDRVVWT-LRRSEGSYASDESLSSSAYLST 418

Query: 1042 XXXAEG---NHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH-IKDSD 878
                +    NH + +++    R GE K   SGR +HSSLDNG++KH  RRG  H ++D+D
Sbjct: 419  QSGFDSSQVNHGDVKADTLNLRSGEVKALGSGRSNHSSLDNGSHKHSGRRGPPHPVRDAD 478

Query: 877  GS-AEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
            GS  EGKSL+RGG S YGSHEKQVWVQKSSSGS
Sbjct: 479  GSTVEGKSLKRGGASGYGSHEKQVWVQKSSSGS 511


>gb|EMJ23015.1| hypothetical protein PRUPE_ppa004923mg [Prunus persica]
          Length = 485

 Score =  439 bits (1129), Expect = e-120
 Identities = 268/521 (51%), Positives = 317/521 (60%), Gaps = 16/521 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            +K  LDRTKVVLRHLPP++SQ+SL+EQ+D  F+GRYNWV++RPGK SQK  SYSRAYID 
Sbjct: 2    LKDQLDRTKVVLRHLPPSISQTSLVEQIDVFFSGRYNWVAFRPGKRSQKNPSYSRAYIDL 61

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DVIEFAEFFDGH+FVNEKG+QFK  VEYAPSQRVPKQWSKKDGREGTI RDPEYLE
Sbjct: 62   KRPEDVIEFAEFFDGHLFVNEKGSQFKVIVEYAPSQRVPKQWSKKDGREGTIFRDPEYLE 121

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLEFLAKP ENLPSAEIQL           KD PIVTPLMD+VRQKRA+K+G RRS++NG
Sbjct: 122  FLEFLAKPAENLPSAEIQLERREAERSGAGKDAPIVTPLMDFVRQKRASKAGSRRSLTNG 181

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K+++R  G ++                 SA MYVLRD+ K TS KD+STY++V K+DDQQ
Sbjct: 182  KTSRRAGGPSSRSPSLATSKRGSERKRNSATMYVLRDARKNTSAKDKSTYILVPKRDDQQ 241

Query: 1582 -------LLDKPRNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424
                   L        LEEE G S                  I H+ A  S QQ  +S  
Sbjct: 242  PSEKSVTLASAAGTHVLEEESGVSGADAVKKKILLLKGKEREITHVPANMSQQQ--SSSA 299

Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPP 1247
            KN   +  +KQN            ILLNKD R               SN D+DKRPPR  
Sbjct: 300  KNMGGTIALKQNLRRQENGRIIRGILLNKDARQSQSSGIYSAQQIQTSNSDRDKRPPRSQ 359

Query: 1246 SLHLLQKDTNGAPD-DRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXX 1070
             + L+ KDTNGAPD +   ND HG  ++KQE+R +NKERPDR VW PL R          
Sbjct: 360  HVQLILKDTNGAPDYNIVGNDLHGICSEKQEKRIRNKERPDRVVWTPLNR---------- 409

Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDN--GTYKHGARRGSA 896
                         +G+ A   S             + + +HS LD+  G +KH  RRG+ 
Sbjct: 410  ------------LDGSSASDES----------LSSAFQPAHSLLDSSEGCHKHHGRRGTT 447

Query: 895  H-IKDSDGS---AEGKSLRRGGSCYGSHEKQVWVQKSSSGS 785
            H +KD DGS    EGK  +RG   YGSHEKQVWVQKSSSGS
Sbjct: 448  HGVKDLDGSPVAGEGKHSKRG---YGSHEKQVWVQKSSSGS 485


>gb|EOX97031.1| Smg-4/UPF3 family protein, putative isoform 1 [Theobroma cacao]
          Length = 514

 Score =  432 bits (1110), Expect = e-118
 Identities = 264/524 (50%), Positives = 323/524 (61%), Gaps = 19/524 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKG LDRTKV+LRHLPP ++++ L+EQVD+ F+GRYNW+S+RPGK+SQK QSYSRAYIDF
Sbjct: 1    MKGALDRTKVILRHLPPAITEAMLVEQVDTAFSGRYNWLSFRPGKSSQKHQSYSRAYIDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KR +DV+EFAEFF+GHVFVNEKGTQFKT VEYAPSQRVPK+ SKKDGREGTIL+D EYLE
Sbjct: 61   KRSEDVLEFAEFFNGHVFVNEKGTQFKTIVEYAPSQRVPKRSSKKDGREGTILKDLEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLE L KPVENLPSAEIQL           KD PIVTPLMD+VRQKRAAK G RRS+SNG
Sbjct: 121  FLECLGKPVENLPSAEIQLERKEAERAGVPKDTPIVTPLMDFVRQKRAAKGGSRRSLSNG 180

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K ++R  G++ G                S  MYVLRDS K  SGKD+STY++V+K+D+QQ
Sbjct: 181  KLSRRAGGSSGGTPSSASSKRGSEKRRGSTTMYVLRDSLKNASGKDKSTYILVSKRDEQQ 240

Query: 1582 LLDKP--------RNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427
            L DK              EE G    T                I  ++     QQN  SP
Sbjct: 241  LSDKHVALASSMGTEISEEESGVPGITDAVKKKVLLLKGKEKEISPVAGNVLHQQNVTSP 300

Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXS-NLDKDKRPPRP 1250
            +K  + S   KQN             LLNKD R              + NL+KD+RPPR 
Sbjct: 301  IKTILGSTPTKQNSRREGRMIRGI--LLNKDARQNQSSGVQSEQQIRTSNLEKDRRPPRH 358

Query: 1249 PSLHLLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXX 1082
               HL+ KDTN A DD+   ND HG  ++K ERR +NK+RPDRGVW  LRR         
Sbjct: 359  SHSHLVLKDTNTASDDKVVGNDLHG--SEKPERRCRNKDRPDRGVWT-LRRSDGSYASDE 415

Query: 1081 XXXXXXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDNGTY-KHGARR 905
                             EG + +T+ +++  R   + +  G G +SSLDNG++ KH +RR
Sbjct: 416  SMSSSASQSALIPLDPLEGTYGDTKVDLSNVR-SVQVKTVGSGRNSSLDNGSHNKHVSRR 474

Query: 904  GSAHIKDSDGS---AEGKSLRRG-GSCYGSHEKQVWVQKSSSGS 785
            G+     +DGS   ++GK  +RG  + YGSHEKQVWVQKSSSGS
Sbjct: 475  GAV----ADGSSVMSDGKPGKRGCAAGYGSHEKQVWVQKSSSGS 514


>gb|EMJ23014.1| hypothetical protein PRUPE_ppa004923mg [Prunus persica]
          Length = 482

 Score =  421 bits (1083), Expect = e-115
 Identities = 259/518 (50%), Positives = 310/518 (59%), Gaps = 16/518 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            +K  LDRTKVVLRHLPP++SQ+SL+EQ+D  F+GRYNWV++RPGK SQK  SYSRAYID 
Sbjct: 2    LKDQLDRTKVVLRHLPPSISQTSLVEQIDVFFSGRYNWVAFRPGKRSQKNPSYSRAYIDL 61

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DVIEFAEFFDGH+FVNEKG+QFK  VEYAPSQRVPKQWSKKDGREGTI RDPEYLE
Sbjct: 62   KRPEDVIEFAEFFDGHLFVNEKGSQFKVIVEYAPSQRVPKQWSKKDGREGTIFRDPEYLE 121

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLEFLAKP ENLPSAEIQL           KD PIVTPLMD+VRQKRA+K+G RRS++NG
Sbjct: 122  FLEFLAKPAENLPSAEIQLERREAERSGAGKDAPIVTPLMDFVRQKRASKAGSRRSLTNG 181

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K+++R  G ++                 SA MYVLRD+ K TS KD+STY++V K+DDQQ
Sbjct: 182  KTSRRAGGPSSRSPSLATSKRGSERKRNSATMYVLRDARKNTSAKDKSTYILVPKRDDQQ 241

Query: 1582 -------LLDKPRNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424
                   L        LEEE G S                  I H+ A  S QQ  +S  
Sbjct: 242  PSEKSVTLASAAGTHVLEEESGVSGADAVKKKILLLKGKEREITHVPANMSQQQ--SSSA 299

Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPP 1247
            KN   +  +KQN            ILLNKD R               SN D+DKRPPR  
Sbjct: 300  KNMGGTIALKQNLRRQENGRIIRGILLNKDARQSQSSGIYSAQQIQTSNSDRDKRPPRSQ 359

Query: 1246 SLHLLQKDTNGAPD-DRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXX 1070
             + L+ KDTNGAPD +   ND HG  ++KQE+R +NKERPDR VW PL R          
Sbjct: 360  HVQLILKDTNGAPDYNIVGNDLHGICSEKQEKRIRNKERPDRVVWTPLNR---------- 409

Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDN--GTYKHGARRGSA 896
                         +G+ A   S             + + +HS LD+  G +KH  RRG+ 
Sbjct: 410  ------------LDGSSASDES----------LSSAFQPAHSLLDSSEGCHKHHGRRGTT 447

Query: 895  H-IKDSDGS---AEGKSLRRGGSCYGSHEKQVWVQKSS 794
            H +KD DGS    EGK  +RG   YGSHE  VW+ + S
Sbjct: 448  HGVKDLDGSPVAGEGKHSKRG---YGSHECDVWLLEPS 482


>gb|ESW20566.1| hypothetical protein PHAVU_006G219800g [Phaseolus vulgaris]
            gi|561021796|gb|ESW20567.1| hypothetical protein
            PHAVU_006G219800g [Phaseolus vulgaris]
          Length = 513

 Score =  395 bits (1016), Expect = e-107
 Identities = 250/522 (47%), Positives = 308/522 (59%), Gaps = 17/522 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKG LDRTKVVLRHLPP+LS+++L+ Q+DS FA RYNW+S+RP K SQK  SYSRAYIDF
Sbjct: 1    MKGSLDRTKVVLRHLPPSLSEAALLAQIDSAFADRYNWLSFRPAKVSQKHISYSRAYIDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRPDDVI FAEFF+GHVFVNEKG+QFK  VEYAPSQRVP+QWSKKDGR+GTI +D EYLE
Sbjct: 61   KRPDDVILFAEFFNGHVFVNEKGSQFKVIVEYAPSQRVPRQWSKKDGRDGTIYKDSEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLE LAKPVENLPSAEIQL           KD PI+TPLMD+VRQKRAAK G RRS+SNG
Sbjct: 121  FLELLAKPVENLPSAEIQLEKREAERSGAAKDTPIITPLMDFVRQKRAAK-GPRRSLSNG 179

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQL 1580
            K ++R + +                   + MYV R   K ++ KDRS Y +V  Q DQ +
Sbjct: 180  KVSRRGTSSNGSPSSGTSRRGSGKKRVSATMYVARHPGKNSTMKDRSIYTLVPSQGDQHI 239

Query: 1579 LDKPRN----DG---LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQ--NAASP 1427
             +K  N    DG   L+E G S  +                I  +S + S  Q  N  S 
Sbjct: 240  SNKSSNVASSDGKQTLDENGFSGNSDSGKKKILLLKGKEREIIAVSDLDSMSQHHNVISS 299

Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRP 1250
             K  + +  +KQNQ           IL  K+ R               SNL+KDK+ PRP
Sbjct: 300  AKEIVGATVLKQNQRQEGSGRIIRSILSKKELRQSQSSRALSEQQIQTSNLEKDKQSPRP 359

Query: 1249 PSLHLLQKDTNGAPDDRT-PNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073
              + L+ K  NG PD++    D H F +++QER  ++K+RPDRGVW              
Sbjct: 360  IQVQLILKGMNGTPDNKIGVLDSHVF-SERQERHIRHKDRPDRGVWTSCSN------GAD 412

Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896
                          EG+HA+ + ++   R GE K     R SHSS +NG  KH  RRG  
Sbjct: 413  ESFPSAAFSQVDPLEGSHADLKHDMPNTRSGEVKSLGGVRTSHSS-ENGFNKHFGRRGPT 471

Query: 895  H-IKDSDG---SAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
            H +KD DG   S+EGK  RR G + YGS+EKQVWVQK+SSG+
Sbjct: 472  HGVKDVDGYSVSSEGKHPRRSGTTAYGSNEKQVWVQKASSGT 513


>ref|XP_006598794.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1
            [Glycine max] gi|571524272|ref|XP_006598795.1| PREDICTED:
            regulator of nonsense transcripts UPF3-like isoform X2
            [Glycine max]
          Length = 512

 Score =  393 bits (1010), Expect = e-106
 Identities = 250/522 (47%), Positives = 310/522 (59%), Gaps = 17/522 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKG LDRTKVVLRHLPP++S+++L+ Q+D+ FAGRYNW+S+RPGK SQK  SYSRAYIDF
Sbjct: 1    MKGALDRTKVVLRHLPPSISEAALLAQIDAAFAGRYNWLSFRPGKISQKHISYSRAYIDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DVI FAEFF+GHVFVNEKG+QFK  VEYAPSQRVP+QWSKKDGR+GTI +D EYLE
Sbjct: 61   KRPEDVILFAEFFNGHVFVNEKGSQFKVIVEYAPSQRVPRQWSKKDGRDGTIYKDSEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLE LAKPVENLPSAEIQL           KD PI+TPLMD+VRQKRAAK G RR +SNG
Sbjct: 121  FLELLAKPVENLPSAEIQLEKREAERSGAAKDIPIITPLMDFVRQKRAAK-GPRRLLSNG 179

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K ++R   ++ G                SA MYV RD  K ++ KD+ST  +V KQ DQ 
Sbjct: 180  KVSQRAGTSSNGSPSSVTSRRGSGKKRVSATMYVARDPGKNSTIKDKST--LVPKQGDQH 237

Query: 1582 LLDKPRNDG-------LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQ--NAAS 1430
            L DK  N         L+E G S                   I  +S + S  Q  N  S
Sbjct: 238  LSDKASNMASSDANLTLDENGVSGNHDAGKKKVLLLKGKEREIITVSDLDSMSQHHNVTS 297

Query: 1429 PVKNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPR 1253
              K  + S  +KQ+Q           IL  K+ R               SNL+K+K+PPR
Sbjct: 298  SAKMIVGSTVLKQSQRHEGSGRIIRSILSKKELRQSQYSRALSEQQIQTSNLEKEKQPPR 357

Query: 1252 PPSLHLLQKDTNGAPDDRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073
            P  + L+ K +NG P+++         +++QER  ++K+RPDRGVW              
Sbjct: 358  PLHVQLILKGSNGTPENKIGVHDSHVSSERQERHVRHKDRPDRGVWT------SRSNGAD 411

Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896
                          EG+HA+ + +   AR GE K   S R SHSS +NG  KH  RRG +
Sbjct: 412  DSFSSSASSQVDPLEGSHADLKHDTPNARSGEVKSLGSVRTSHSS-ENGFNKHFGRRGPS 470

Query: 895  H-IKDSDG---SAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
            H +KD DG   S+EGK  RR   S YGS+EKQVWVQK+SSG+
Sbjct: 471  HGVKDVDGYSVSSEGKHPRRSSTSAYGSNEKQVWVQKASSGT 512


>ref|XP_004485448.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X2
            [Cicer arietinum] gi|502076758|ref|XP_004485449.1|
            PREDICTED: regulator of nonsense transcripts UPF3-like
            isoform X3 [Cicer arietinum]
            gi|502076762|ref|XP_004485450.1| PREDICTED: regulator of
            nonsense transcripts UPF3-like isoform X4 [Cicer
            arietinum]
          Length = 510

 Score =  393 bits (1009), Expect = e-106
 Identities = 248/518 (47%), Positives = 301/518 (58%), Gaps = 18/518 (3%)
 Frame = -1

Query: 2284 DRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDD 2105
            DRTKVV+RHLPPT+S+ SL   +D  F+GRYNW+S+RP K S K  S+SRAYIDF +P+D
Sbjct: 3    DRTKVVVRHLPPTISEDSLSSLIDGSFSGRYNWLSFRPAKISPKHTSFSRAYIDFNKPED 62

Query: 2104 VIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFL 1925
            VIEFAEFF+GHVFVNEKGTQFK +VEYAPSQRVPKQWSKKDGR+GTI +DPEYLEFLE L
Sbjct: 63   VIEFAEFFNGHVFVNEKGTQFKVTVEYAPSQRVPKQWSKKDGRDGTIYKDPEYLEFLELL 122

Query: 1924 AKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKR 1745
            AKPVENLPSAEIQL           KD PIVTPLMD+VRQKRAAK G RR  SNGK T+R
Sbjct: 123  AKPVENLPSAEIQLEKREAERSGAGKDVPIVTPLMDFVRQKRAAK-GPRRLSSNGKVTRR 181

Query: 1744 VSGAATG-IXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKP 1568
                + G                 + MYV RD  K ++ KD+STY++V +Q DQ L +K 
Sbjct: 182  TGTPSNGSSSSAPSRRGSARKRVSTTMYVARDPGKNSTVKDKSTYILVPRQGDQHLSNKS 241

Query: 1567 RN----DG---LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFI 1412
             N    DG    +E G   S                      S   S   +  S  K  +
Sbjct: 242  SNIASSDGNPTFDENGIAGSNDAGKKVLLLKGKEREIITASDSDSMSQHHSITSSAKTIL 301

Query: 1411 SSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLHL 1235
            +S  +KQNQ           IL NKD R               SNL+K+K+P RP  + L
Sbjct: 302  NSTALKQNQRHEGSGRIIKSILSNKDLRQNQSSRAYSERQLQTSNLEKEKQPTRPLHVQL 361

Query: 1234 LQKDTNGAPDDRTPNDFHGFH--TDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXXXXX 1061
            + K T+GAP++R     HG H  +++QERR + K+RPDRG+W                  
Sbjct: 362  ILKGTDGAPENRI--TVHGLHVSSERQERRFRQKDRPDRGIWT------SRSNGGDESLS 413

Query: 1060 XXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH-IK 887
                      EG HAE + +   AR GE K   S R SHSS +NG  KH  RRG  H +K
Sbjct: 414  SSASSQVDPLEGGHAELKHDTRSARSGEVKSFGSLRASHSS-ENGFNKHFGRRGPIHGVK 472

Query: 886  DSDG---SAEGKSLRR-GGSCYGSHEKQVWVQKSSSGS 785
            D DG   S+EGK  R+   S YGS+EKQVWVQK+SSG+
Sbjct: 473  DVDGYSVSSEGKHPRKPSSSAYGSNEKQVWVQKASSGT 510


>ref|XP_006592654.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1
            [Glycine max] gi|571493781|ref|XP_006592655.1| PREDICTED:
            regulator of nonsense transcripts UPF3-like isoform X2
            [Glycine max]
          Length = 514

 Score =  389 bits (1000), Expect = e-105
 Identities = 246/522 (47%), Positives = 312/522 (59%), Gaps = 17/522 (3%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKG LDRTKVVLRHLPP++S+++L+ Q+D+ FAGRYNW+S+RPGK SQK  S+SRAYIDF
Sbjct: 1    MKGALDRTKVVLRHLPPSISEAALLSQIDAAFAGRYNWLSFRPGKISQKHMSFSRAYIDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DVI FAEFF+GHVFVN KG+QFK  VEYAPSQRVP+QWSKKD R+GTI +D EYLE
Sbjct: 61   KRPEDVILFAEFFNGHVFVNVKGSQFKVIVEYAPSQRVPRQWSKKDLRDGTIYKDSEYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FLE LAKPVENLPSAEIQL           KD PI+TPLMD+VRQKRAAK G RR +SNG
Sbjct: 121  FLELLAKPVENLPSAEIQLEKREAERSGAAKDIPIITPLMDFVRQKRAAK-GPRRPLSNG 179

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583
            K ++R   ++ G                SA MYV RD  K ++ KD+S+Y +V KQDDQ 
Sbjct: 180  KVSRRAGTSSNGGPSSATSRRGSGKKRVSATMYVARDPGKSSTIKDKSSYTLVPKQDDQH 239

Query: 1582 LLDKPRN----DG---LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQ--NAAS 1430
            L +K  N    DG   L+E G S                   I  +S + S  Q  N  S
Sbjct: 240  LPNKASNMASSDGNQTLDENGVSGNHDAGKKKVLLLKGKEREIITVSDLDSMSQHHNVTS 299

Query: 1429 PVKNFISSGNVKQNQXXXXXXXXXXXILLNKD-NRXXXXXXXXXXXXXXSNLDKDKRPPR 1253
              K  + S  +KQ+Q           IL  K+ ++              SNL+K+K+PPR
Sbjct: 300  SAKTVVGSTVLKQSQRHEGSGRIIRSILSKKELHQSQSSRALSEQKILTSNLEKEKQPPR 359

Query: 1252 PPSLHLLQKDTNGAPDDRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073
            P  + L+ K +NG P+++         +++QER  ++K+RPDRGVW              
Sbjct: 360  PLHVQLILKGSNGTPENKIGVHDSHVSSERQERHVRHKDRPDRGVWT------SRFNGAD 413

Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGARG-EFKHRESGRGSHSSLDNGTYKHGARRGSA 896
                          EG+ A+ + ++  AR  E K   S R SHSS +NG  KH  RRG +
Sbjct: 414  VSFSSPASSQVDPLEGSQADLKHDMPNARSVEVKSFGSVRTSHSS-ENGFNKHFGRRGPS 472

Query: 895  H-IKDSDG---SAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785
            + +KD DG   S+EGK  RR   S YGS+EKQVWVQK+SSGS
Sbjct: 473  YGVKDVDGYSVSSEGKHPRRSSTSAYGSNEKQVWVQKASSGS 514


>ref|XP_004248850.1| PREDICTED: uncharacterized protein LOC101263168 [Solanum
            lycopersicum]
          Length = 438

 Score =  381 bits (979), Expect = e-103
 Identities = 220/403 (54%), Positives = 256/403 (63%), Gaps = 9/403 (2%)
 Frame = -1

Query: 2281 RTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDDV 2102
            RTKVVLRHLPPTLSQS L+E VDSRFAGRYNW ++RP KTS K QSYS+AYIDF+  +DV
Sbjct: 5    RTKVVLRHLPPTLSQSMLLEHVDSRFAGRYNWFNFRPAKTSLKHQSYSKAYIDFRNMEDV 64

Query: 2101 IEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFLA 1922
             EFAEFFDGH+FVNEKGTQFKT VEYAPSQRVPK W KKD REGTIL+DP Y+EFLEFLA
Sbjct: 65   TEFAEFFDGHMFVNEKGTQFKTIVEYAPSQRVPKHWLKKDAREGTILKDPAYMEFLEFLA 124

Query: 1921 KPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKRV 1742
            KPVENLPSAEIQL           KD PIVTPLMDYVRQKRA  SG R+S+SNGKS+K V
Sbjct: 125  KPVENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYVRQKRAVTSGARKSISNGKSSKSV 184

Query: 1741 SGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKPRN 1562
             G ++                 + MYV RDS+KV + KD+S Y++ +K   QQL DK   
Sbjct: 185  GGTSSRSPSSTASRRGSEKRTSTTMYVQRDSSKVGNSKDKS-YILASKCGYQQLSDKSSA 243

Query: 1561 -------DGLEEEGGSSATV-PXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFISS 1406
                   D +E E G S T                  P++S     QQN +S +KN  S 
Sbjct: 244  SAPGSWIDVVEGEIGRSVTSDSGKKKILLLKGKEKESPNVSGGSLAQQNVSSALKNSPSL 303

Query: 1405 GNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPPSLHLLQK 1226
              +K NQ           ILL KD R                 DKD RPPRPPS+ L QK
Sbjct: 304  SALKLNQHQEVGGRIIRSILL-KDARQNQSAFQSDQIQ-----DKDMRPPRPPSMQLFQK 357

Query: 1225 DTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR 1100
            DT+GA +D+   N+ H  H +KQERR +N++RPDRGVWAPLRR
Sbjct: 358  DTSGANEDKVVGNEKHVVHIEKQERRSRNRDRPDRGVWAPLRR 400


>gb|EOX97032.1| Smg-4/UPF3 family protein, putative isoform 2, partial [Theobroma
            cacao]
          Length = 440

 Score =  377 bits (968), Expect = e-101
 Identities = 223/418 (53%), Positives = 264/418 (63%), Gaps = 18/418 (4%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MKG LDRTKV+LRHLPP ++++ L+EQVD+ F+GRYNW+S+RPGK+SQK QSYSRAYIDF
Sbjct: 1    MKGALDRTKVILRHLPPAITEAMLVEQVDTAFSGRYNWLSFRPGKSSQKHQSYSRAYIDF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILR------ 1958
            KR +DV+EFAEFF+GHVFVNEKGTQFKT VEYAPSQRVPK+ SKKDGREGTIL+      
Sbjct: 61   KRSEDVLEFAEFFNGHVFVNEKGTQFKTIVEYAPSQRVPKRSSKKDGREGTILKVFLDEH 120

Query: 1957 -DPEYLEFLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGG 1781
             D EYLEFLE L KPVENLPSAEIQL           KD PIVTPLMD+VRQKRAAK G 
Sbjct: 121  LDLEYLEFLECLGKPVENLPSAEIQLERKEAERAGVPKDTPIVTPLMDFVRQKRAAKGGS 180

Query: 1780 RRSVSNGKSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMV 1604
            RRS+SNGK ++R  G++ G                S  MYVLRDS K  SGKD+STY++V
Sbjct: 181  RRSLSNGKLSRRAGGSSGGTPSSASSKRGSEKRRGSTTMYVLRDSLKNASGKDKSTYILV 240

Query: 1603 TKQDDQQLLDKP--------RNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPST 1448
            +K+D+QQL DK              EE G    T                I  ++     
Sbjct: 241  SKRDEQQLSDKHVALASSMGTEISEEESGVPGITDAVKKKVLLLKGKEKEISPVAGNVLH 300

Query: 1447 QQNAASPVKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXS-NLDK 1271
            QQN  SP+K  + S   KQN             LLNKD R              + NL+K
Sbjct: 301  QQNVTSPIKTILGSTPTKQNSRREGRMIRGI--LLNKDARQNQSSGVQSEQQIRTSNLEK 358

Query: 1270 DKRPPRPPSLHLLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR 1100
            D+RPPR    HL+ KDTN A DD+   ND HG  ++K ERR +NK+RPDRGVW  LRR
Sbjct: 359  DRRPPRHSHSHLVLKDTNTASDDKVVGNDLHG--SEKPERRCRNKDRPDRGVWT-LRR 413


>ref|XP_004485447.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1
            [Cicer arietinum]
          Length = 517

 Score =  372 bits (954), Expect = e-100
 Identities = 238/506 (47%), Positives = 289/506 (57%), Gaps = 18/506 (3%)
 Frame = -1

Query: 2284 DRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDD 2105
            DRTKVV+RHLPPT+S+ SL   +D  F+GRYNW+S+RP K S K  S+SRAYIDF +P+D
Sbjct: 3    DRTKVVVRHLPPTISEDSLSSLIDGSFSGRYNWLSFRPAKISPKHTSFSRAYIDFNKPED 62

Query: 2104 VIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFL 1925
            VIEFAEFF+GHVFVNEKGTQFK +VEYAPSQRVPKQWSKKDGR+GTI +DPEYLEFLE L
Sbjct: 63   VIEFAEFFNGHVFVNEKGTQFKVTVEYAPSQRVPKQWSKKDGRDGTIYKDPEYLEFLELL 122

Query: 1924 AKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKR 1745
            AKPVENLPSAEIQL           KD PIVTPLMD+VRQKRAAK G RR  SNGK T+R
Sbjct: 123  AKPVENLPSAEIQLEKREAERSGAGKDVPIVTPLMDFVRQKRAAK-GPRRLSSNGKVTRR 181

Query: 1744 VSGAATG-IXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKP 1568
                + G                 + MYV RD  K ++ KD+STY++V +Q DQ L +K 
Sbjct: 182  TGTPSNGSSSSAPSRRGSARKRVSTTMYVARDPGKNSTVKDKSTYILVPRQGDQHLSNKS 241

Query: 1567 RN----DG---LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFI 1412
             N    DG    +E G   S                      S   S   +  S  K  +
Sbjct: 242  SNIASSDGNPTFDENGIAGSNDAGKKVLLLKGKEREIITASDSDSMSQHHSITSSAKTIL 301

Query: 1411 SSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLHL 1235
            +S  +KQNQ           IL NKD R               SNL+K+K+P RP  + L
Sbjct: 302  NSTALKQNQRHEGSGRIIKSILSNKDLRQNQSSRAYSERQLQTSNLEKEKQPTRPLHVQL 361

Query: 1234 LQKDTNGAPDDRTPNDFHGFH--TDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXXXXX 1061
            + K T+GAP++R     HG H  +++QERR + K+RPDRG+W                  
Sbjct: 362  ILKGTDGAPENRI--TVHGLHVSSERQERRFRQKDRPDRGIWT------SRSNGGDESLS 413

Query: 1060 XXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH-IK 887
                      EG HAE + +   AR GE K   S R SHSS +NG  KH  RRG  H +K
Sbjct: 414  SSASSQVDPLEGGHAELKHDTRSARSGEVKSFGSLRASHSS-ENGFNKHFGRRGPIHGVK 472

Query: 886  DSDG---SAEGKSLRR-GGSCYGSHE 821
            D DG   S+EGK  R+   S YGS+E
Sbjct: 473  DVDGYSVSSEGKHPRKPSSSAYGSNE 498


>gb|EOY26871.1| Smg-4/UPF3 family protein, putative isoform 2 [Theobroma cacao]
          Length = 487

 Score =  345 bits (886), Expect = 5e-92
 Identities = 217/519 (41%), Positives = 282/519 (54%), Gaps = 14/519 (2%)
 Frame = -1

Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120
            MK PL RTKVV+RHLPP+++QS L  Q+D RF+ RYNW S+R GK+S K Q YSRAYI+F
Sbjct: 1    MKEPLRRTKVVIRHLPPSVTQSFLFSQIDDRFSDRYNWFSFRLGKSSHKHQRYSRAYINF 60

Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940
            KRP+DV EFAEFFDGHVFVNEKGTQFK  VEYAPSQRVPK  +KKDGREGTI +DP+YLE
Sbjct: 61   KRPEDVFEFAEFFDGHVFVNEKGTQFKAIVEYAPSQRVPKPGTKKDGREGTIFKDPDYLE 120

Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760
            FL+ +AKPV+NLPSAEIQL           K+ P++TPLM +VRQKRAA+SG +  V+  
Sbjct: 121  FLKLIAKPVDNLPSAEIQLERKEVELSGAPKETPVITPLMAFVRQKRAAESGTQGPVTRR 180

Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQL 1580
            K  ++   A+TG                   Y+L+DS K T  KD+S + + +KQ+DQ +
Sbjct: 181  KIGRKAGAASTG-----KSGSSSKRGSEKKKYILKDSVKGTHHKDKSKFFVASKQEDQPV 235

Query: 1579 ----LDKPRNDGLEEEGGSSATV-----PXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427
                 +K  N  +    G    +                     PH+    S QQ ++SP
Sbjct: 236  PSVGKEKRENGTVYGIDGPVTGITLTADSGKKKILLLKPKDQEAPHVPQGASEQQGSSSP 295

Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKD--NRXXXXXXXXXXXXXXSNLDKDKRPPR 1253
            V N   S   KQ+Q           ILL+ +                   NLD  KRPPR
Sbjct: 296  VANSPGSTAPKQSQRREAGGRLIRSILLSNEASQNQPLAGVKPQQKTQTMNLDNVKRPPR 355

Query: 1252 PPSLHLLQKDTNGAPDDRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073
            P +  L                  G  ++K E+R +NK+R DRGVWAPLR          
Sbjct: 356  PANTRL------------------GSGSEKHEKRIRNKDRLDRGVWAPLR-------GSD 390

Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDNGTYKHGARRGSAH 893
                         ++   A + S     +G+  +  SGR   S  +NG+ +H  RR +A+
Sbjct: 391  VSQASEERFSPSMSQSAQASSNSIEGEMKGDIPNGRSGRNVPS--ENGSNRHFDRRSAAY 448

Query: 892  IKDSDG---SAEGKSLRRGGSCYGSHEKQVWVQKSSSGS 785
                DG   S+E KS +RG +  G+HEKQ+WVQKSSSGS
Sbjct: 449  NIKDDGSVISSESKSSKRGATGSGAHEKQIWVQKSSSGS 487


Top