BLASTX nr result
ID: Catharanthus23_contig00007083
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00007083 (2671 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI29694.3| unnamed protein product [Vitis vinifera] 498 e-138 ref|XP_006349314.1| PREDICTED: regulator of nonsense transcripts... 470 e-129 ref|XP_006492554.1| PREDICTED: regulator of nonsense transcripts... 467 e-128 ref|XP_004230442.1| PREDICTED: uncharacterized protein LOC101264... 467 e-128 ref|XP_006431585.1| hypothetical protein CICLE_v10000901mg [Citr... 463 e-127 ref|XP_006385051.1| hypothetical protein POPTR_0004s23450g [Popu... 461 e-126 ref|XP_006339922.1| PREDICTED: regulator of nonsense transcripts... 454 e-124 ref|XP_002328787.1| predicted protein [Populus trichocarpa] gi|5... 454 e-124 ref|XP_006389505.1| hypothetical protein POPTR_0022s00460g [Popu... 450 e-123 gb|EMJ23015.1| hypothetical protein PRUPE_ppa004923mg [Prunus pe... 439 e-120 gb|EOX97031.1| Smg-4/UPF3 family protein, putative isoform 1 [Th... 432 e-118 gb|EMJ23014.1| hypothetical protein PRUPE_ppa004923mg [Prunus pe... 421 e-115 gb|ESW20566.1| hypothetical protein PHAVU_006G219800g [Phaseolus... 395 e-107 ref|XP_006598794.1| PREDICTED: regulator of nonsense transcripts... 393 e-106 ref|XP_004485448.1| PREDICTED: regulator of nonsense transcripts... 393 e-106 ref|XP_006592654.1| PREDICTED: regulator of nonsense transcripts... 389 e-105 ref|XP_004248850.1| PREDICTED: uncharacterized protein LOC101263... 381 e-103 gb|EOX97032.1| Smg-4/UPF3 family protein, putative isoform 2, pa... 377 e-101 ref|XP_004485447.1| PREDICTED: regulator of nonsense transcripts... 372 e-100 gb|EOY26871.1| Smg-4/UPF3 family protein, putative isoform 2 [Th... 345 5e-92 >emb|CBI29694.3| unnamed protein product [Vitis vinifera] Length = 519 Score = 498 bits (1283), Expect = e-138 Identities = 281/521 (53%), Positives = 337/521 (64%), Gaps = 17/521 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKGPLDRTKVV+RHLPPT+S+++ +EQ+D+ F GRY V +RPGK SQK QSYSRAY+DF Sbjct: 1 MKGPLDRTKVVVRHLPPTISEAAFLEQIDTVFKGRYTLVKFRPGKNSQKRQSYSRAYLDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DVIEFAEFFDGHVFVNEKGTQFKT VEYAPSQR+PK W KKDGREGTI +DPEY+E Sbjct: 61 KRPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRIPKHWPKKDGREGTIFKDPEYME 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 F+E LAKPVENLPSAEIQL KD PIVTPLMD+VRQKRAAK RRS+SNG Sbjct: 121 FVELLAKPVENLPSAEIQLERREAERAGAVKDTPIVTPLMDFVRQKRAAKGVSRRSLSNG 180 Query: 1759 KSTKRVSGAATG-IXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K ++R SG+++G + MYVLRD+AK TS KD+ST+++V K+DDQ Sbjct: 181 KLSRRASGSSSGNPSLGSSKRGSEKRRLSTTMYVLRDTAKSTSAKDKSTFILVPKRDDQL 240 Query: 1582 LLDKPRN-------DGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424 L DK N + LEEE G S V + QQN SPV Sbjct: 241 LSDKSVNLAAGGGAEALEEESGVSGAVDAGKKKVLLLKGKEREISHHLL---QQNVTSPV 297 Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPP 1247 KN + + KQNQ ILLNKD R SNL+K+KRPPRPP Sbjct: 298 KNILGANAPKQNQRREGSGRIIRSILLNKDARQSQSSMFQTEQQSQASNLEKEKRPPRPP 357 Query: 1246 SLHLLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR-XXXXXXXXX 1073 + L K+TNGA DD+ ND H F ++KQ++R +NK+RPDRGVW PLRR Sbjct: 358 HIQLASKETNGAQDDKVVGNDVHSFVSEKQDKRTRNKDRPDRGVWTPLRRSDGSHASDES 417 Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896 EG+H E RS+++ AR GE K SGRG HS+LDNG++KH RRG Sbjct: 418 LSSSASQPTSSDFPEGSHGEMRSDMSNARSGEVKALGSGRGGHSALDNGSHKHSGRRGPT 477 Query: 895 H-IKDSDGS---AEGKSLRRGGS-CYGSHEKQVWVQKSSSG 788 H +KD+DGS +EGK +RG + YGSHEKQVWVQKSSSG Sbjct: 478 HSVKDADGSSIVSEGKHSKRGSAPGYGSHEKQVWVQKSSSG 518 >ref|XP_006349314.1| PREDICTED: regulator of nonsense transcripts UPF3-like [Solanum tuberosum] Length = 483 Score = 470 bits (1209), Expect = e-129 Identities = 270/519 (52%), Positives = 325/519 (62%), Gaps = 14/519 (2%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKGPLDR+KVVLRHLPPT+SQS L++QVDSRFAGRYNW + PGK+SQK Q+YSRAYI+F Sbjct: 1 MKGPLDRSKVVLRHLPPTISQSMLLDQVDSRFAGRYNWFCFLPGKSSQKHQTYSRAYIEF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 K P+DVIEFAEFFDGHVFVNEKGTQFKT VEYAPSQRVPK WSKKDGREGTIL+DPEYLE Sbjct: 61 KMPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPKHWSKKDGREGTILKDPEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLEF++KP+ENLPSAEIQL KD PIVTPLMDY+RQKRAAKSG R+S++NG Sbjct: 121 FLEFISKPIENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYIRQKRAAKSGARKSIANG 180 Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 + T+R SG +TG + MYVLRDS+K SGKD+ TY++ K+DDQQ Sbjct: 181 RPTRRTSGTSTGSPSSSASKRSSEKRRASTTMYVLRDSSKAGSGKDK-TYILAPKRDDQQ 239 Query: 1582 LLDKPRN-------DGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAAS-P 1427 +K + +EEE G +A V P+ Q+ AS Sbjct: 240 RAEKSGTSAAGSVANAVEEETGGAADVGKKKILLLKEKEN---------PNNQRREASGR 290 Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPP 1247 + I + +QNQ +D +DKDK+PPRPP Sbjct: 291 IIRSILLKDARQNQAPSAS---------QQDKH---------------RVDKDKKPPRPP 326 Query: 1246 SLHLLQKDTNGAPDDRTPN-DFHGFHTDKQERRPKNKERPDRGVWAPLRR--XXXXXXXX 1076 S+ L Q++TNGA +D+ D H HT+KQE+R + ++RPDRGVW PLRR Sbjct: 327 SVQLFQRETNGANEDKVLGADLHIVHTEKQEKRTRIRDRPDRGVWTPLRRSDSLHASDES 386 Query: 1075 XXXXXXXXXXXXXXAEGNHAETRSEITGARG-EFKHRESGRGSHSSLDNGTYKHGARRGS 899 EG+ ET+ + ARG EF+ SGR S+SS DNGTYKHG RRG Sbjct: 387 LSSSASQSSEVPDFVEGSQGETKHGLANARGAEFRPMGSGRNSYSSFDNGTYKHGGRRGM 446 Query: 898 AHIKDSDGSAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 D EGK LRRGG S YG+HEKQVWVQKSSSG+ Sbjct: 447 R--DDGISVGEGKPLRRGGPSSYGTHEKQVWVQKSSSGT 483 >ref|XP_006492554.1| PREDICTED: regulator of nonsense transcripts UPF3-like [Citrus sinensis] Length = 514 Score = 467 bits (1202), Expect = e-128 Identities = 276/522 (52%), Positives = 335/522 (64%), Gaps = 17/522 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKGPLDRTKVV+R+LPP ++Q + EQ+D F GRYNWVS+R GKTSQK QS +RAY+DF Sbjct: 1 MKGPLDRTKVVVRNLPPAITQPAFTEQIDGAFGGRYNWVSFRQGKTSQKHQSCARAYLDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 K+P+DV+EFAEFF+GHVFVNEKG QFKT VEYAPSQRVPKQWSKKDGREGT+L+DPEYLE Sbjct: 61 KKPEDVLEFAEFFNGHVFVNEKGVQFKTIVEYAPSQRVPKQWSKKDGREGTLLKDPEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLEF++KPVENLPSAEIQL K+ IVTPLMD+VRQKRAAK+G RR +SNG Sbjct: 121 FLEFISKPVENLPSAEIQLERREAERAGAAKEALIVTPLMDFVRQKRAAKAGPRRLLSNG 180 Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K ++R SG++TG + MYVLRD+AK +SGKD+STY++V K+DDQ Sbjct: 181 KLSRRASGSSTGSPASGSSKRGSDKKKASTTMYVLRDTAKNSSGKDKSTYILVPKRDDQD 240 Query: 1582 LLDKPRNDG--------LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427 DKP + LEE G + + I +S S QQ+A+ Sbjct: 241 -FDKPVSSSSATGSEVVLEESGVPANSDGGKKKVLLLKGKEREISQVSGSVSHQQSAS-- 297 Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPP 1247 VKN ISS +KQNQ ILLNKD R SNL+KDKRPPRP Sbjct: 298 VKNIISSPALKQNQRRENSGRIIRGILLNKDAR-QNQASGLHSEQQISNLEKDKRPPRPS 356 Query: 1246 SLHLLQKDTNGAPDDRT-PNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXX 1079 +HL+ KDTNG DD+ ND H++KQERR +NK+RPDR W LRR Sbjct: 357 HVHLVMKDTNGVSDDKVIVND---LHSEKQERRTRNKDRPDRAAWT-LRRSDGSYQSDES 412 Query: 1078 XXXXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRG 902 +EGN + + +++ R GE K GR SHSS+DNG+++H RRG Sbjct: 413 LSSSASQLSLSAVDSSEGNLGDGKFDLSNMRSGEVKAVGGGRSSHSSVDNGSHRHIGRRG 472 Query: 901 SAHIKD--SDGSAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 H+KD S +EGK LRRGG S YGSHEKQVWVQKSSSGS Sbjct: 473 PTHVKDDSSPVMSEGKPLRRGGASGYGSHEKQVWVQKSSSGS 514 >ref|XP_004230442.1| PREDICTED: uncharacterized protein LOC101264766 [Solanum lycopersicum] Length = 485 Score = 467 bits (1201), Expect = e-128 Identities = 269/518 (51%), Positives = 323/518 (62%), Gaps = 13/518 (2%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKGPLDR+KVVLRHLPPT+SQS L++QVDSRFAGRYNW + PGK+SQK Q+YSRAYI+F Sbjct: 1 MKGPLDRSKVVLRHLPPTISQSMLLDQVDSRFAGRYNWFCFLPGKSSQKHQTYSRAYIEF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DVIEFAEFFDGHVFVNEKGTQFKT VEYAPSQRVP+ WSKKDGREGTIL+DPEYLE Sbjct: 61 KRPEDVIEFAEFFDGHVFVNEKGTQFKTIVEYAPSQRVPQHWSKKDGREGTILKDPEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLEF++KP+ENLPSAEIQL KD PIVTPLMDY+RQKRAAKSG R+S++NG Sbjct: 121 FLEFISKPIENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYIRQKRAAKSGARKSIANG 180 Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 + T+R SG + G + MYVLRDS+K SGKD+ TY++ K+DDQQ Sbjct: 181 RPTRRASGTSAGSPSSSASKRSSEKRRASTTMYVLRDSSKAGSGKDK-TYILAPKRDDQQ 239 Query: 1582 LLDKPRN-------DGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424 +K + +EEE G +A V P+ Q+ A Sbjct: 240 RAEKSGTSAPGSVANAVEEETGGAADVGKKKILLLKEKEKEN-------PNNQRREA--- 289 Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPPS 1244 SG + ++ +L KD R +DKDK+PPRPPS Sbjct: 290 -----SGRIIRS-------------ILLKDAR--QNQAPSASQQEKHRVDKDKKPPRPPS 329 Query: 1243 LHLLQKDTNGAPDDRTPN-DFHGFHTDKQERRPKNKERPDRGVWAPLRR--XXXXXXXXX 1073 + L Q++TNGA +DR D H HT+KQE+R + ++RPDRGVW PLRR Sbjct: 330 VQLFQRETNGANEDRVLGADLHVVHTEKQEKRTRIRDRPDRGVWTPLRRSDSLHASDESL 389 Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896 EG+ ET+ + AR EF+ SGR SHSS DNGTYKHG RRG Sbjct: 390 SSSASQSSEVPDFVEGSPGETKHGLVNARVAEFRPMGSGRNSHSSFDNGTYKHGGRRGMR 449 Query: 895 HIKDSDGSAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 D EGK LRRGG S Y +HEKQVWVQKSSSG+ Sbjct: 450 --DDGISVGEGKPLRRGGPSSYNTHEKQVWVQKSSSGT 485 >ref|XP_006431585.1| hypothetical protein CICLE_v10000901mg [Citrus clementina] gi|557533707|gb|ESR44825.1| hypothetical protein CICLE_v10000901mg [Citrus clementina] Length = 514 Score = 463 bits (1192), Expect = e-127 Identities = 276/523 (52%), Positives = 335/523 (64%), Gaps = 18/523 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKGPLDRTKVV+R+LPP ++Q + EQ+D F GRYNWVS+R GKTSQK QS +RAY+DF Sbjct: 1 MKGPLDRTKVVVRNLPPAITQPAFTEQIDGAFGGRYNWVSFRQGKTSQKHQSCARAYLDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 K+P+DV+EFAEFF+GHVFVNEKG QFKT VEYAPSQRVPKQWSKKDGREGT+L+DPEYLE Sbjct: 61 KKPEDVLEFAEFFNGHVFVNEKGVQFKTIVEYAPSQRVPKQWSKKDGREGTLLKDPEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLEF++KPVENLPSAEIQL K+ IVTPLMD+VRQKRAAK+G RR +SNG Sbjct: 121 FLEFISKPVENLPSAEIQLERREAERAGAAKEALIVTPLMDFVRQKRAAKAGPRRLLSNG 180 Query: 1759 KSTKRVSGAATGI-XXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K ++R SG++TG + MYVLRD+AK +SGKD+STY++V K+DDQ Sbjct: 181 KLSRRASGSSTGSPASGSSKRGSDKKKASTTMYVLRDTAKNSSGKDKSTYILVPKRDDQD 240 Query: 1582 LLDKPRNDG--------LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427 DKP + LEE G + + I +S S QQ+A+ Sbjct: 241 -FDKPVSSSSATGSEVVLEESGVPANSDGGKKKVLLLKGKEREISQVSGSVSHQQSAS-- 297 Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPP 1247 VK ISS +KQNQ ILLNKD R SNL+KDKRPPRP Sbjct: 298 VKTIISSPALKQNQRRENSGRIIRGILLNKDAR-QNQASGLHSEQQISNLEKDKRPPRPS 356 Query: 1246 SLHLLQKDTNGAPDDRT-PNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXX 1079 + L+ KDTNG DD+ ND H++KQERR +NK+RPDR W LRR Sbjct: 357 HVQLVMKDTNGVSDDKVIVND---LHSEKQERRTRNKDRPDRAAWT-LRRSDGSYQSDES 412 Query: 1078 XXXXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRG 902 +EGN + + +++ R GE K GR SHSS+DNG+++H RRG Sbjct: 413 LSSSASQLSLSAVDSSEGNLGDGKFDLSNMRSGEVKAVGGGRSSHSSVDNGSHRHIGRRG 472 Query: 901 SAHIKDSDGS---AEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 H+KD DGS +EGK LRRGG S YGSHEKQVWVQKSSSGS Sbjct: 473 PTHVKD-DGSPVMSEGKPLRRGGASGYGSHEKQVWVQKSSSGS 514 >ref|XP_006385051.1| hypothetical protein POPTR_0004s23450g [Populus trichocarpa] gi|550341819|gb|ERP62848.1| hypothetical protein POPTR_0004s23450g [Populus trichocarpa] Length = 520 Score = 461 bits (1185), Expect = e-126 Identities = 270/516 (52%), Positives = 326/516 (63%), Gaps = 16/516 (3%) Frame = -1 Query: 2293 GPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKR 2114 G D+TKVV+RHLPP +SQ +EQ+D F+GRYNW+SYRPG SQK QSYSRAYIDFKR Sbjct: 5 GQSDKTKVVVRHLPPGISQPMFVEQIDVAFSGRYNWLSYRPGNNSQKHQSYSRAYIDFKR 64 Query: 2113 PDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFL 1934 P+DVI+FAEFF+GH+FVNEKGTQFK VEY+PSQRVPKQWSKKDGREGTI +DPEYLEFL Sbjct: 65 PEDVIDFAEFFNGHIFVNEKGTQFKAIVEYSPSQRVPKQWSKKDGREGTISKDPEYLEFL 124 Query: 1933 EFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKS 1754 E +AKPVENLPSAEIQL KD PIVTPLMD+VRQKR AK+G RR +SNGK Sbjct: 125 ELIAKPVENLPSAEIQLERREAERAGAAKDAPIVTPLMDFVRQKRVAKNGPRRILSNGKL 184 Query: 1753 TKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLD 1574 ++R G+ + + MYVLRD+AK TSGKD+STYV V K+DDQQL + Sbjct: 185 SRRAGGSGSP-SSSSLKRGSEKKRISTTMYVLRDTAKSTSGKDKSTYVHVPKRDDQQLSN 243 Query: 1573 KPRNDG------LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNF 1415 LE+E S T I ++ S QQ+ +S +N Sbjct: 244 AVTLGSGSGTAVLEDESVVSGITDSGKKKILLLKGKEKEISLVTGTMSQQQSISSSDRNI 303 Query: 1414 ISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLH 1238 ISS +K +Q ILLNKD+R SNL+K+KRPPRPP Sbjct: 304 ISSTALK-SQRRETSGRMIRSILLNKDSRHIRSSGVHSEPQMQTSNLEKEKRPPRPPHAQ 362 Query: 1237 LLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXXXXX 1070 L KD NG PDD+ ND HGF +KQE+R +NK+RPDRGVW PLRR Sbjct: 363 LGLKDANGTPDDKVVGNDLHGFPNEKQEKRTRNKDRPDRGVWTPLRRSDGSYASDESLLS 422 Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH 893 ++GNH + + + R GE K SGRG+HSSLDNG++KH RRG +H Sbjct: 423 SASQSTQSVFDSSQGNHGDVKVDSLNLRSGEVKVLGSGRGNHSSLDNGSHKHFGRRGPSH 482 Query: 892 I-KDSDGS-AEGKSLRRGGSC-YGSHEKQVWVQKSS 794 I +D+DGS E K+ +RGGS YGSHEKQVWVQKS+ Sbjct: 483 IVRDADGSTVEAKTPKRGGSSGYGSHEKQVWVQKST 518 >ref|XP_006339922.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1 [Solanum tuberosum] gi|565345688|ref|XP_006339923.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X2 [Solanum tuberosum] gi|565345690|ref|XP_006339924.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X3 [Solanum tuberosum] Length = 508 Score = 454 bits (1167), Expect = e-124 Identities = 270/514 (52%), Positives = 319/514 (62%), Gaps = 15/514 (2%) Frame = -1 Query: 2281 RTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDDV 2102 RTKVVLRHLPPTLSQS L+E VDSRFAGRYNW ++RP KTS K QSYS+AYIDF+ +DV Sbjct: 5 RTKVVLRHLPPTLSQSMLLEHVDSRFAGRYNWFTFRPAKTSLKHQSYSKAYIDFRNMEDV 64 Query: 2101 IEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFLA 1922 EFAEFFDGH+FVNEKGTQFKT VEYAPSQRVPK W KKD REGTIL+DP Y+EFLEFLA Sbjct: 65 TEFAEFFDGHMFVNEKGTQFKTIVEYAPSQRVPKHWLKKDAREGTILKDPAYMEFLEFLA 124 Query: 1921 KPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKRV 1742 KPVENLPSAEIQL KD PIVTPLMDYVRQKRA KSG RRS+SNGKS+K V Sbjct: 125 KPVENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYVRQKRAVKSGARRSISNGKSSKSV 184 Query: 1741 SGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKPRN 1562 G ++ + MYV RDS+K + KD+S Y+++ K+ DQQL K + Sbjct: 185 GGTSSRSPSSTASRRGSEKRTSTTMYVQRDSSKAGNSKDKS-YILLPKRGDQQLSVKSGS 243 Query: 1561 -------DGLEEEGGSSATV-PXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFISS 1406 D +E E G S T P++S QQN +S +KN S Sbjct: 244 SAPGSEIDVVEGEIGRSVTADSGKKKILLLKGKEKEGPNVSGGSLAQQNVSSALKNSPSL 303 Query: 1405 GNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPPSLHLLQK 1226 +KQNQ ILL KD R DKD RPPRPPS+ L QK Sbjct: 304 SALKQNQRQEASGRIIRSILL-KDARQNQSAFQSDQIQ-----DKDMRPPRPPSMQLFQK 357 Query: 1225 DTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR--XXXXXXXXXXXXXXX 1055 DT+GA +D+ N+ H H +KQERR +N++RPDRGVWAPLRR Sbjct: 358 DTSGANEDKVVGNEKHVVHIEKQERRSRNRDRPDRGVWAPLRRADSSQASNGSLSSGIPQ 417 Query: 1054 XXXXXXXAEGNHAETRSEITGARG-EFKHRESGRGSHSSLDNGTYKHGARRGSAHIKDSD 878 EG E ++++ ARG EF+ SGR SHSS DNG YKHG RRG ++D Sbjct: 418 SSQVREFVEGGQGELKNDLPIARGTEFRPIGSGRNSHSSADNGNYKHGGRRG---LRDVA 474 Query: 877 GSA--EGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 G++ EGK +++GG S Y S EKQVWVQKSSSGS Sbjct: 475 GTSIGEGKPVKKGGTSAYSSLEKQVWVQKSSSGS 508 >ref|XP_002328787.1| predicted protein [Populus trichocarpa] gi|566168252|ref|XP_006385052.1| Smg-4/UPF3 family protein [Populus trichocarpa] gi|550341820|gb|ERP62849.1| Smg-4/UPF3 family protein [Populus trichocarpa] Length = 527 Score = 454 bits (1167), Expect = e-124 Identities = 270/523 (51%), Positives = 326/523 (62%), Gaps = 23/523 (4%) Frame = -1 Query: 2293 GPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKR 2114 G D+TKVV+RHLPP +SQ +EQ+D F+GRYNW+SYRPG SQK QSYSRAYIDFKR Sbjct: 5 GQSDKTKVVVRHLPPGISQPMFVEQIDVAFSGRYNWLSYRPGNNSQKHQSYSRAYIDFKR 64 Query: 2113 PDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFL 1934 P+DVI+FAEFF+GH+FVNEKGTQFK VEY+PSQRVPKQWSKKDGREGTI +DPEYLEFL Sbjct: 65 PEDVIDFAEFFNGHIFVNEKGTQFKAIVEYSPSQRVPKQWSKKDGREGTISKDPEYLEFL 124 Query: 1933 EFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKS 1754 E +AKPVENLPSAEIQL KD PIVTPLMD+VRQKR AK+G RR +SNGK Sbjct: 125 ELIAKPVENLPSAEIQLERREAERAGAAKDAPIVTPLMDFVRQKRVAKNGPRRILSNGKL 184 Query: 1753 TKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLD 1574 ++R G+ + + MYVLRD+AK TSGKD+STYV V K+DDQQL + Sbjct: 185 SRRAGGSGSP-SSSSLKRGSEKKRISTTMYVLRDTAKSTSGKDKSTYVHVPKRDDQQLSN 243 Query: 1573 KPRNDG------LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNF 1415 LE+E S T I ++ S QQ+ +S +N Sbjct: 244 AVTLGSGSGTAVLEDESVVSGITDSGKKKILLLKGKEKEISLVTGTMSQQQSISSSDRNI 303 Query: 1414 ISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLH 1238 ISS +K +Q ILLNKD+R SNL+K+KRPPRPP Sbjct: 304 ISSTALK-SQRRETSGRMIRSILLNKDSRHIRSSGVHSEPQMQTSNLEKEKRPPRPPHAQ 362 Query: 1237 LLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXXXXXX 1070 L KD NG PDD+ ND HGF +KQE+R +NK+RPDRGVW PLRR Sbjct: 363 LGLKDANGTPDDKVVGNDLHGFPNEKQEKRTRNKDRPDRGVWTPLRRSDGSYASDESLLS 422 Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH 893 ++GNH + + + R GE K SGRG+HSSLDNG++KH RRG +H Sbjct: 423 SASQSTQSVFDSSQGNHGDVKVDSLNLRSGEVKVLGSGRGNHSSLDNGSHKHFGRRGPSH 482 Query: 892 I-KDSDGS-AEGKSLRRGGSC-YGSHE-------KQVWVQKSS 794 I +D+DGS E K+ +RGGS YGSHE KQVWVQKS+ Sbjct: 483 IVRDADGSTVEAKTPKRGGSSGYGSHEVCSLDSQKQVWVQKST 525 >ref|XP_006389505.1| hypothetical protein POPTR_0022s00460g [Populus trichocarpa] gi|550312328|gb|ERP48419.1| hypothetical protein POPTR_0022s00460g [Populus trichocarpa] Length = 511 Score = 450 bits (1158), Expect = e-123 Identities = 269/513 (52%), Positives = 319/513 (62%), Gaps = 10/513 (1%) Frame = -1 Query: 2293 GPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKR 2114 G D+TKVV+RHLPP +SQ +EQ+D F+GRYNW+SYRPGK+SQK QS SRAYIDFKR Sbjct: 4 GQSDKTKVVVRHLPPGVSQPMFVEQIDLAFSGRYNWLSYRPGKSSQKHQSCSRAYIDFKR 63 Query: 2113 PDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFL 1934 PDDVI+FAEFF+GH+FVNEKGTQFK VEYAPSQ VPKQWSKKDGREGTIL+DPEYLEFL Sbjct: 64 PDDVIDFAEFFNGHLFVNEKGTQFKAIVEYAPSQHVPKQWSKKDGREGTILKDPEYLEFL 123 Query: 1933 EFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKS 1754 EF+AKPVENLPSAEIQL KD PIVTPLM+++RQKRAAKSG RR +SNGK Sbjct: 124 EFIAKPVENLPSAEIQLERREAERAGVAKDAPIVTPLMEFIRQKRAAKSGPRRILSNGKP 183 Query: 1753 TKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLD 1574 ++R G+ + + MYVLRD+ K TSGK++S Y V K DD+QL Sbjct: 184 SRRAGGSGSP-SSSSSKRGSEKKRASTTMYVLRDTVKGTSGKEKSIYAQVPKLDDRQLSK 242 Query: 1573 K-PRNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFISSGNV 1397 G E T I S QQ+ + +N ISS + Sbjct: 243 AVTLGSGSGTEVSEEETAVSGITGTGKKKILLLKGKEKEI-SLQQSISPSDRNIISSTAL 301 Query: 1396 KQNQXXXXXXXXXXXILLNKDN-RXXXXXXXXXXXXXXSNLDKDKRPPRPPSLHLLQKDT 1220 K +Q ILLNKD+ R SNL+KDKRPPRPP L+ KD Sbjct: 302 K-SQRHESSGRVIKSILLNKDSRRIQSSGVQSEPQMQTSNLEKDKRPPRPPHA-LVLKDA 359 Query: 1219 NGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXXXXXXXXXXX 1043 NG PDD+ ND HGF +KQERR +NK+RPDR VW LRR Sbjct: 360 NGTPDDKVVGNDLHGFPNEKQERRTRNKDRPDRVVWT-LRRSEGSYASDESLSSSAYLST 418 Query: 1042 XXXAEG---NHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH-IKDSD 878 + NH + +++ R GE K SGR +HSSLDNG++KH RRG H ++D+D Sbjct: 419 QSGFDSSQVNHGDVKADTLNLRSGEVKALGSGRSNHSSLDNGSHKHSGRRGPPHPVRDAD 478 Query: 877 GS-AEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 GS EGKSL+RGG S YGSHEKQVWVQKSSSGS Sbjct: 479 GSTVEGKSLKRGGASGYGSHEKQVWVQKSSSGS 511 >gb|EMJ23015.1| hypothetical protein PRUPE_ppa004923mg [Prunus persica] Length = 485 Score = 439 bits (1129), Expect = e-120 Identities = 268/521 (51%), Positives = 317/521 (60%), Gaps = 16/521 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 +K LDRTKVVLRHLPP++SQ+SL+EQ+D F+GRYNWV++RPGK SQK SYSRAYID Sbjct: 2 LKDQLDRTKVVLRHLPPSISQTSLVEQIDVFFSGRYNWVAFRPGKRSQKNPSYSRAYIDL 61 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DVIEFAEFFDGH+FVNEKG+QFK VEYAPSQRVPKQWSKKDGREGTI RDPEYLE Sbjct: 62 KRPEDVIEFAEFFDGHLFVNEKGSQFKVIVEYAPSQRVPKQWSKKDGREGTIFRDPEYLE 121 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLEFLAKP ENLPSAEIQL KD PIVTPLMD+VRQKRA+K+G RRS++NG Sbjct: 122 FLEFLAKPAENLPSAEIQLERREAERSGAGKDAPIVTPLMDFVRQKRASKAGSRRSLTNG 181 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K+++R G ++ SA MYVLRD+ K TS KD+STY++V K+DDQQ Sbjct: 182 KTSRRAGGPSSRSPSLATSKRGSERKRNSATMYVLRDARKNTSAKDKSTYILVPKRDDQQ 241 Query: 1582 -------LLDKPRNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424 L LEEE G S I H+ A S QQ +S Sbjct: 242 PSEKSVTLASAAGTHVLEEESGVSGADAVKKKILLLKGKEREITHVPANMSQQQ--SSSA 299 Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPP 1247 KN + +KQN ILLNKD R SN D+DKRPPR Sbjct: 300 KNMGGTIALKQNLRRQENGRIIRGILLNKDARQSQSSGIYSAQQIQTSNSDRDKRPPRSQ 359 Query: 1246 SLHLLQKDTNGAPD-DRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXX 1070 + L+ KDTNGAPD + ND HG ++KQE+R +NKERPDR VW PL R Sbjct: 360 HVQLILKDTNGAPDYNIVGNDLHGICSEKQEKRIRNKERPDRVVWTPLNR---------- 409 Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDN--GTYKHGARRGSA 896 +G+ A S + + +HS LD+ G +KH RRG+ Sbjct: 410 ------------LDGSSASDES----------LSSAFQPAHSLLDSSEGCHKHHGRRGTT 447 Query: 895 H-IKDSDGS---AEGKSLRRGGSCYGSHEKQVWVQKSSSGS 785 H +KD DGS EGK +RG YGSHEKQVWVQKSSSGS Sbjct: 448 HGVKDLDGSPVAGEGKHSKRG---YGSHEKQVWVQKSSSGS 485 >gb|EOX97031.1| Smg-4/UPF3 family protein, putative isoform 1 [Theobroma cacao] Length = 514 Score = 432 bits (1110), Expect = e-118 Identities = 264/524 (50%), Positives = 323/524 (61%), Gaps = 19/524 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKG LDRTKV+LRHLPP ++++ L+EQVD+ F+GRYNW+S+RPGK+SQK QSYSRAYIDF Sbjct: 1 MKGALDRTKVILRHLPPAITEAMLVEQVDTAFSGRYNWLSFRPGKSSQKHQSYSRAYIDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KR +DV+EFAEFF+GHVFVNEKGTQFKT VEYAPSQRVPK+ SKKDGREGTIL+D EYLE Sbjct: 61 KRSEDVLEFAEFFNGHVFVNEKGTQFKTIVEYAPSQRVPKRSSKKDGREGTILKDLEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLE L KPVENLPSAEIQL KD PIVTPLMD+VRQKRAAK G RRS+SNG Sbjct: 121 FLECLGKPVENLPSAEIQLERKEAERAGVPKDTPIVTPLMDFVRQKRAAKGGSRRSLSNG 180 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K ++R G++ G S MYVLRDS K SGKD+STY++V+K+D+QQ Sbjct: 181 KLSRRAGGSSGGTPSSASSKRGSEKRRGSTTMYVLRDSLKNASGKDKSTYILVSKRDEQQ 240 Query: 1582 LLDKP--------RNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427 L DK EE G T I ++ QQN SP Sbjct: 241 LSDKHVALASSMGTEISEEESGVPGITDAVKKKVLLLKGKEKEISPVAGNVLHQQNVTSP 300 Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXS-NLDKDKRPPRP 1250 +K + S KQN LLNKD R + NL+KD+RPPR Sbjct: 301 IKTILGSTPTKQNSRREGRMIRGI--LLNKDARQNQSSGVQSEQQIRTSNLEKDRRPPRH 358 Query: 1249 PSLHLLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR---XXXXXX 1082 HL+ KDTN A DD+ ND HG ++K ERR +NK+RPDRGVW LRR Sbjct: 359 SHSHLVLKDTNTASDDKVVGNDLHG--SEKPERRCRNKDRPDRGVWT-LRRSDGSYASDE 415 Query: 1081 XXXXXXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDNGTY-KHGARR 905 EG + +T+ +++ R + + G G +SSLDNG++ KH +RR Sbjct: 416 SMSSSASQSALIPLDPLEGTYGDTKVDLSNVR-SVQVKTVGSGRNSSLDNGSHNKHVSRR 474 Query: 904 GSAHIKDSDGS---AEGKSLRRG-GSCYGSHEKQVWVQKSSSGS 785 G+ +DGS ++GK +RG + YGSHEKQVWVQKSSSGS Sbjct: 475 GAV----ADGSSVMSDGKPGKRGCAAGYGSHEKQVWVQKSSSGS 514 >gb|EMJ23014.1| hypothetical protein PRUPE_ppa004923mg [Prunus persica] Length = 482 Score = 421 bits (1083), Expect = e-115 Identities = 259/518 (50%), Positives = 310/518 (59%), Gaps = 16/518 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 +K LDRTKVVLRHLPP++SQ+SL+EQ+D F+GRYNWV++RPGK SQK SYSRAYID Sbjct: 2 LKDQLDRTKVVLRHLPPSISQTSLVEQIDVFFSGRYNWVAFRPGKRSQKNPSYSRAYIDL 61 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DVIEFAEFFDGH+FVNEKG+QFK VEYAPSQRVPKQWSKKDGREGTI RDPEYLE Sbjct: 62 KRPEDVIEFAEFFDGHLFVNEKGSQFKVIVEYAPSQRVPKQWSKKDGREGTIFRDPEYLE 121 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLEFLAKP ENLPSAEIQL KD PIVTPLMD+VRQKRA+K+G RRS++NG Sbjct: 122 FLEFLAKPAENLPSAEIQLERREAERSGAGKDAPIVTPLMDFVRQKRASKAGSRRSLTNG 181 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K+++R G ++ SA MYVLRD+ K TS KD+STY++V K+DDQQ Sbjct: 182 KTSRRAGGPSSRSPSLATSKRGSERKRNSATMYVLRDARKNTSAKDKSTYILVPKRDDQQ 241 Query: 1582 -------LLDKPRNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPV 1424 L LEEE G S I H+ A S QQ +S Sbjct: 242 PSEKSVTLASAAGTHVLEEESGVSGADAVKKKILLLKGKEREITHVPANMSQQQ--SSSA 299 Query: 1423 KNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPP 1247 KN + +KQN ILLNKD R SN D+DKRPPR Sbjct: 300 KNMGGTIALKQNLRRQENGRIIRGILLNKDARQSQSSGIYSAQQIQTSNSDRDKRPPRSQ 359 Query: 1246 SLHLLQKDTNGAPD-DRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXX 1070 + L+ KDTNGAPD + ND HG ++KQE+R +NKERPDR VW PL R Sbjct: 360 HVQLILKDTNGAPDYNIVGNDLHGICSEKQEKRIRNKERPDRVVWTPLNR---------- 409 Query: 1069 XXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDN--GTYKHGARRGSA 896 +G+ A S + + +HS LD+ G +KH RRG+ Sbjct: 410 ------------LDGSSASDES----------LSSAFQPAHSLLDSSEGCHKHHGRRGTT 447 Query: 895 H-IKDSDGS---AEGKSLRRGGSCYGSHEKQVWVQKSS 794 H +KD DGS EGK +RG YGSHE VW+ + S Sbjct: 448 HGVKDLDGSPVAGEGKHSKRG---YGSHECDVWLLEPS 482 >gb|ESW20566.1| hypothetical protein PHAVU_006G219800g [Phaseolus vulgaris] gi|561021796|gb|ESW20567.1| hypothetical protein PHAVU_006G219800g [Phaseolus vulgaris] Length = 513 Score = 395 bits (1016), Expect = e-107 Identities = 250/522 (47%), Positives = 308/522 (59%), Gaps = 17/522 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKG LDRTKVVLRHLPP+LS+++L+ Q+DS FA RYNW+S+RP K SQK SYSRAYIDF Sbjct: 1 MKGSLDRTKVVLRHLPPSLSEAALLAQIDSAFADRYNWLSFRPAKVSQKHISYSRAYIDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRPDDVI FAEFF+GHVFVNEKG+QFK VEYAPSQRVP+QWSKKDGR+GTI +D EYLE Sbjct: 61 KRPDDVILFAEFFNGHVFVNEKGSQFKVIVEYAPSQRVPRQWSKKDGRDGTIYKDSEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLE LAKPVENLPSAEIQL KD PI+TPLMD+VRQKRAAK G RRS+SNG Sbjct: 121 FLELLAKPVENLPSAEIQLEKREAERSGAAKDTPIITPLMDFVRQKRAAK-GPRRSLSNG 179 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQL 1580 K ++R + + + MYV R K ++ KDRS Y +V Q DQ + Sbjct: 180 KVSRRGTSSNGSPSSGTSRRGSGKKRVSATMYVARHPGKNSTMKDRSIYTLVPSQGDQHI 239 Query: 1579 LDKPRN----DG---LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQ--NAASP 1427 +K N DG L+E G S + I +S + S Q N S Sbjct: 240 SNKSSNVASSDGKQTLDENGFSGNSDSGKKKILLLKGKEREIIAVSDLDSMSQHHNVISS 299 Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRP 1250 K + + +KQNQ IL K+ R SNL+KDK+ PRP Sbjct: 300 AKEIVGATVLKQNQRQEGSGRIIRSILSKKELRQSQSSRALSEQQIQTSNLEKDKQSPRP 359 Query: 1249 PSLHLLQKDTNGAPDDRT-PNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073 + L+ K NG PD++ D H F +++QER ++K+RPDRGVW Sbjct: 360 IQVQLILKGMNGTPDNKIGVLDSHVF-SERQERHIRHKDRPDRGVWTSCSN------GAD 412 Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896 EG+HA+ + ++ R GE K R SHSS +NG KH RRG Sbjct: 413 ESFPSAAFSQVDPLEGSHADLKHDMPNTRSGEVKSLGGVRTSHSS-ENGFNKHFGRRGPT 471 Query: 895 H-IKDSDG---SAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 H +KD DG S+EGK RR G + YGS+EKQVWVQK+SSG+ Sbjct: 472 HGVKDVDGYSVSSEGKHPRRSGTTAYGSNEKQVWVQKASSGT 513 >ref|XP_006598794.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1 [Glycine max] gi|571524272|ref|XP_006598795.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X2 [Glycine max] Length = 512 Score = 393 bits (1010), Expect = e-106 Identities = 250/522 (47%), Positives = 310/522 (59%), Gaps = 17/522 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKG LDRTKVVLRHLPP++S+++L+ Q+D+ FAGRYNW+S+RPGK SQK SYSRAYIDF Sbjct: 1 MKGALDRTKVVLRHLPPSISEAALLAQIDAAFAGRYNWLSFRPGKISQKHISYSRAYIDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DVI FAEFF+GHVFVNEKG+QFK VEYAPSQRVP+QWSKKDGR+GTI +D EYLE Sbjct: 61 KRPEDVILFAEFFNGHVFVNEKGSQFKVIVEYAPSQRVPRQWSKKDGRDGTIYKDSEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLE LAKPVENLPSAEIQL KD PI+TPLMD+VRQKRAAK G RR +SNG Sbjct: 121 FLELLAKPVENLPSAEIQLEKREAERSGAAKDIPIITPLMDFVRQKRAAK-GPRRLLSNG 179 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K ++R ++ G SA MYV RD K ++ KD+ST +V KQ DQ Sbjct: 180 KVSQRAGTSSNGSPSSVTSRRGSGKKRVSATMYVARDPGKNSTIKDKST--LVPKQGDQH 237 Query: 1582 LLDKPRNDG-------LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQ--NAAS 1430 L DK N L+E G S I +S + S Q N S Sbjct: 238 LSDKASNMASSDANLTLDENGVSGNHDAGKKKVLLLKGKEREIITVSDLDSMSQHHNVTS 297 Query: 1429 PVKNFISSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPR 1253 K + S +KQ+Q IL K+ R SNL+K+K+PPR Sbjct: 298 SAKMIVGSTVLKQSQRHEGSGRIIRSILSKKELRQSQYSRALSEQQIQTSNLEKEKQPPR 357 Query: 1252 PPSLHLLQKDTNGAPDDRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073 P + L+ K +NG P+++ +++QER ++K+RPDRGVW Sbjct: 358 PLHVQLILKGSNGTPENKIGVHDSHVSSERQERHVRHKDRPDRGVWT------SRSNGAD 411 Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSA 896 EG+HA+ + + AR GE K S R SHSS +NG KH RRG + Sbjct: 412 DSFSSSASSQVDPLEGSHADLKHDTPNARSGEVKSLGSVRTSHSS-ENGFNKHFGRRGPS 470 Query: 895 H-IKDSDG---SAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 H +KD DG S+EGK RR S YGS+EKQVWVQK+SSG+ Sbjct: 471 HGVKDVDGYSVSSEGKHPRRSSTSAYGSNEKQVWVQKASSGT 512 >ref|XP_004485448.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X2 [Cicer arietinum] gi|502076758|ref|XP_004485449.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X3 [Cicer arietinum] gi|502076762|ref|XP_004485450.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X4 [Cicer arietinum] Length = 510 Score = 393 bits (1009), Expect = e-106 Identities = 248/518 (47%), Positives = 301/518 (58%), Gaps = 18/518 (3%) Frame = -1 Query: 2284 DRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDD 2105 DRTKVV+RHLPPT+S+ SL +D F+GRYNW+S+RP K S K S+SRAYIDF +P+D Sbjct: 3 DRTKVVVRHLPPTISEDSLSSLIDGSFSGRYNWLSFRPAKISPKHTSFSRAYIDFNKPED 62 Query: 2104 VIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFL 1925 VIEFAEFF+GHVFVNEKGTQFK +VEYAPSQRVPKQWSKKDGR+GTI +DPEYLEFLE L Sbjct: 63 VIEFAEFFNGHVFVNEKGTQFKVTVEYAPSQRVPKQWSKKDGRDGTIYKDPEYLEFLELL 122 Query: 1924 AKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKR 1745 AKPVENLPSAEIQL KD PIVTPLMD+VRQKRAAK G RR SNGK T+R Sbjct: 123 AKPVENLPSAEIQLEKREAERSGAGKDVPIVTPLMDFVRQKRAAK-GPRRLSSNGKVTRR 181 Query: 1744 VSGAATG-IXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKP 1568 + G + MYV RD K ++ KD+STY++V +Q DQ L +K Sbjct: 182 TGTPSNGSSSSAPSRRGSARKRVSTTMYVARDPGKNSTVKDKSTYILVPRQGDQHLSNKS 241 Query: 1567 RN----DG---LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFI 1412 N DG +E G S S S + S K + Sbjct: 242 SNIASSDGNPTFDENGIAGSNDAGKKVLLLKGKEREIITASDSDSMSQHHSITSSAKTIL 301 Query: 1411 SSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLHL 1235 +S +KQNQ IL NKD R SNL+K+K+P RP + L Sbjct: 302 NSTALKQNQRHEGSGRIIKSILSNKDLRQNQSSRAYSERQLQTSNLEKEKQPTRPLHVQL 361 Query: 1234 LQKDTNGAPDDRTPNDFHGFH--TDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXXXXX 1061 + K T+GAP++R HG H +++QERR + K+RPDRG+W Sbjct: 362 ILKGTDGAPENRI--TVHGLHVSSERQERRFRQKDRPDRGIWT------SRSNGGDESLS 413 Query: 1060 XXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH-IK 887 EG HAE + + AR GE K S R SHSS +NG KH RRG H +K Sbjct: 414 SSASSQVDPLEGGHAELKHDTRSARSGEVKSFGSLRASHSS-ENGFNKHFGRRGPIHGVK 472 Query: 886 DSDG---SAEGKSLRR-GGSCYGSHEKQVWVQKSSSGS 785 D DG S+EGK R+ S YGS+EKQVWVQK+SSG+ Sbjct: 473 DVDGYSVSSEGKHPRKPSSSAYGSNEKQVWVQKASSGT 510 >ref|XP_006592654.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1 [Glycine max] gi|571493781|ref|XP_006592655.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X2 [Glycine max] Length = 514 Score = 389 bits (1000), Expect = e-105 Identities = 246/522 (47%), Positives = 312/522 (59%), Gaps = 17/522 (3%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKG LDRTKVVLRHLPP++S+++L+ Q+D+ FAGRYNW+S+RPGK SQK S+SRAYIDF Sbjct: 1 MKGALDRTKVVLRHLPPSISEAALLSQIDAAFAGRYNWLSFRPGKISQKHMSFSRAYIDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DVI FAEFF+GHVFVN KG+QFK VEYAPSQRVP+QWSKKD R+GTI +D EYLE Sbjct: 61 KRPEDVILFAEFFNGHVFVNVKGSQFKVIVEYAPSQRVPRQWSKKDLRDGTIYKDSEYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FLE LAKPVENLPSAEIQL KD PI+TPLMD+VRQKRAAK G RR +SNG Sbjct: 121 FLELLAKPVENLPSAEIQLEKREAERSGAAKDIPIITPLMDFVRQKRAAK-GPRRPLSNG 179 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMVTKQDDQQ 1583 K ++R ++ G SA MYV RD K ++ KD+S+Y +V KQDDQ Sbjct: 180 KVSRRAGTSSNGGPSSATSRRGSGKKRVSATMYVARDPGKSSTIKDKSSYTLVPKQDDQH 239 Query: 1582 LLDKPRN----DG---LEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQ--NAAS 1430 L +K N DG L+E G S I +S + S Q N S Sbjct: 240 LPNKASNMASSDGNQTLDENGVSGNHDAGKKKVLLLKGKEREIITVSDLDSMSQHHNVTS 299 Query: 1429 PVKNFISSGNVKQNQXXXXXXXXXXXILLNKD-NRXXXXXXXXXXXXXXSNLDKDKRPPR 1253 K + S +KQ+Q IL K+ ++ SNL+K+K+PPR Sbjct: 300 SAKTVVGSTVLKQSQRHEGSGRIIRSILSKKELHQSQSSRALSEQKILTSNLEKEKQPPR 359 Query: 1252 PPSLHLLQKDTNGAPDDRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073 P + L+ K +NG P+++ +++QER ++K+RPDRGVW Sbjct: 360 PLHVQLILKGSNGTPENKIGVHDSHVSSERQERHVRHKDRPDRGVWT------SRFNGAD 413 Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGARG-EFKHRESGRGSHSSLDNGTYKHGARRGSA 896 EG+ A+ + ++ AR E K S R SHSS +NG KH RRG + Sbjct: 414 VSFSSPASSQVDPLEGSQADLKHDMPNARSVEVKSFGSVRTSHSS-ENGFNKHFGRRGPS 472 Query: 895 H-IKDSDG---SAEGKSLRRGG-SCYGSHEKQVWVQKSSSGS 785 + +KD DG S+EGK RR S YGS+EKQVWVQK+SSGS Sbjct: 473 YGVKDVDGYSVSSEGKHPRRSSTSAYGSNEKQVWVQKASSGS 514 >ref|XP_004248850.1| PREDICTED: uncharacterized protein LOC101263168 [Solanum lycopersicum] Length = 438 Score = 381 bits (979), Expect = e-103 Identities = 220/403 (54%), Positives = 256/403 (63%), Gaps = 9/403 (2%) Frame = -1 Query: 2281 RTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDDV 2102 RTKVVLRHLPPTLSQS L+E VDSRFAGRYNW ++RP KTS K QSYS+AYIDF+ +DV Sbjct: 5 RTKVVLRHLPPTLSQSMLLEHVDSRFAGRYNWFNFRPAKTSLKHQSYSKAYIDFRNMEDV 64 Query: 2101 IEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFLA 1922 EFAEFFDGH+FVNEKGTQFKT VEYAPSQRVPK W KKD REGTIL+DP Y+EFLEFLA Sbjct: 65 TEFAEFFDGHMFVNEKGTQFKTIVEYAPSQRVPKHWLKKDAREGTILKDPAYMEFLEFLA 124 Query: 1921 KPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKRV 1742 KPVENLPSAEIQL KD PIVTPLMDYVRQKRA SG R+S+SNGKS+K V Sbjct: 125 KPVENLPSAEIQLERKEAERAGSAKDAPIVTPLMDYVRQKRAVTSGARKSISNGKSSKSV 184 Query: 1741 SGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKPRN 1562 G ++ + MYV RDS+KV + KD+S Y++ +K QQL DK Sbjct: 185 GGTSSRSPSSTASRRGSEKRTSTTMYVQRDSSKVGNSKDKS-YILASKCGYQQLSDKSSA 243 Query: 1561 -------DGLEEEGGSSATV-PXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFISS 1406 D +E E G S T P++S QQN +S +KN S Sbjct: 244 SAPGSWIDVVEGEIGRSVTSDSGKKKILLLKGKEKESPNVSGGSLAQQNVSSALKNSPSL 303 Query: 1405 GNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXSNLDKDKRPPRPPSLHLLQK 1226 +K NQ ILL KD R DKD RPPRPPS+ L QK Sbjct: 304 SALKLNQHQEVGGRIIRSILL-KDARQNQSAFQSDQIQ-----DKDMRPPRPPSMQLFQK 357 Query: 1225 DTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR 1100 DT+GA +D+ N+ H H +KQERR +N++RPDRGVWAPLRR Sbjct: 358 DTSGANEDKVVGNEKHVVHIEKQERRSRNRDRPDRGVWAPLRR 400 >gb|EOX97032.1| Smg-4/UPF3 family protein, putative isoform 2, partial [Theobroma cacao] Length = 440 Score = 377 bits (968), Expect = e-101 Identities = 223/418 (53%), Positives = 264/418 (63%), Gaps = 18/418 (4%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MKG LDRTKV+LRHLPP ++++ L+EQVD+ F+GRYNW+S+RPGK+SQK QSYSRAYIDF Sbjct: 1 MKGALDRTKVILRHLPPAITEAMLVEQVDTAFSGRYNWLSFRPGKSSQKHQSYSRAYIDF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILR------ 1958 KR +DV+EFAEFF+GHVFVNEKGTQFKT VEYAPSQRVPK+ SKKDGREGTIL+ Sbjct: 61 KRSEDVLEFAEFFNGHVFVNEKGTQFKTIVEYAPSQRVPKRSSKKDGREGTILKVFLDEH 120 Query: 1957 -DPEYLEFLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGG 1781 D EYLEFLE L KPVENLPSAEIQL KD PIVTPLMD+VRQKRAAK G Sbjct: 121 LDLEYLEFLECLGKPVENLPSAEIQLERKEAERAGVPKDTPIVTPLMDFVRQKRAAKGGS 180 Query: 1780 RRSVSNGKSTKRVSGAATGIXXXXXXXXXXXXXXXSA-MYVLRDSAKVTSGKDRSTYVMV 1604 RRS+SNGK ++R G++ G S MYVLRDS K SGKD+STY++V Sbjct: 181 RRSLSNGKLSRRAGGSSGGTPSSASSKRGSEKRRGSTTMYVLRDSLKNASGKDKSTYILV 240 Query: 1603 TKQDDQQLLDKP--------RNDGLEEEGGSSATVPXXXXXXXXXXXXXXIPHISAIPST 1448 +K+D+QQL DK EE G T I ++ Sbjct: 241 SKRDEQQLSDKHVALASSMGTEISEEESGVPGITDAVKKKVLLLKGKEKEISPVAGNVLH 300 Query: 1447 QQNAASPVKNFISSGNVKQNQXXXXXXXXXXXILLNKDNRXXXXXXXXXXXXXXS-NLDK 1271 QQN SP+K + S KQN LLNKD R + NL+K Sbjct: 301 QQNVTSPIKTILGSTPTKQNSRREGRMIRGI--LLNKDARQNQSSGVQSEQQIRTSNLEK 358 Query: 1270 DKRPPRPPSLHLLQKDTNGAPDDR-TPNDFHGFHTDKQERRPKNKERPDRGVWAPLRR 1100 D+RPPR HL+ KDTN A DD+ ND HG ++K ERR +NK+RPDRGVW LRR Sbjct: 359 DRRPPRHSHSHLVLKDTNTASDDKVVGNDLHG--SEKPERRCRNKDRPDRGVWT-LRR 413 >ref|XP_004485447.1| PREDICTED: regulator of nonsense transcripts UPF3-like isoform X1 [Cicer arietinum] Length = 517 Score = 372 bits (954), Expect = e-100 Identities = 238/506 (47%), Positives = 289/506 (57%), Gaps = 18/506 (3%) Frame = -1 Query: 2284 DRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDFKRPDD 2105 DRTKVV+RHLPPT+S+ SL +D F+GRYNW+S+RP K S K S+SRAYIDF +P+D Sbjct: 3 DRTKVVVRHLPPTISEDSLSSLIDGSFSGRYNWLSFRPAKISPKHTSFSRAYIDFNKPED 62 Query: 2104 VIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLEFLEFL 1925 VIEFAEFF+GHVFVNEKGTQFK +VEYAPSQRVPKQWSKKDGR+GTI +DPEYLEFLE L Sbjct: 63 VIEFAEFFNGHVFVNEKGTQFKVTVEYAPSQRVPKQWSKKDGRDGTIYKDPEYLEFLELL 122 Query: 1924 AKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNGKSTKR 1745 AKPVENLPSAEIQL KD PIVTPLMD+VRQKRAAK G RR SNGK T+R Sbjct: 123 AKPVENLPSAEIQLEKREAERSGAGKDVPIVTPLMDFVRQKRAAK-GPRRLSSNGKVTRR 181 Query: 1744 VSGAATG-IXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQLLDKP 1568 + G + MYV RD K ++ KD+STY++V +Q DQ L +K Sbjct: 182 TGTPSNGSSSSAPSRRGSARKRVSTTMYVARDPGKNSTVKDKSTYILVPRQGDQHLSNKS 241 Query: 1567 RN----DG---LEEEG-GSSATVPXXXXXXXXXXXXXXIPHISAIPSTQQNAASPVKNFI 1412 N DG +E G S S S + S K + Sbjct: 242 SNIASSDGNPTFDENGIAGSNDAGKKVLLLKGKEREIITASDSDSMSQHHSITSSAKTIL 301 Query: 1411 SSGNVKQNQXXXXXXXXXXXILLNKDNR-XXXXXXXXXXXXXXSNLDKDKRPPRPPSLHL 1235 +S +KQNQ IL NKD R SNL+K+K+P RP + L Sbjct: 302 NSTALKQNQRHEGSGRIIKSILSNKDLRQNQSSRAYSERQLQTSNLEKEKQPTRPLHVQL 361 Query: 1234 LQKDTNGAPDDRTPNDFHGFH--TDKQERRPKNKERPDRGVWAPLRRXXXXXXXXXXXXX 1061 + K T+GAP++R HG H +++QERR + K+RPDRG+W Sbjct: 362 ILKGTDGAPENRI--TVHGLHVSSERQERRFRQKDRPDRGIWT------SRSNGGDESLS 413 Query: 1060 XXXXXXXXXAEGNHAETRSEITGAR-GEFKHRESGRGSHSSLDNGTYKHGARRGSAH-IK 887 EG HAE + + AR GE K S R SHSS +NG KH RRG H +K Sbjct: 414 SSASSQVDPLEGGHAELKHDTRSARSGEVKSFGSLRASHSS-ENGFNKHFGRRGPIHGVK 472 Query: 886 DSDG---SAEGKSLRR-GGSCYGSHE 821 D DG S+EGK R+ S YGS+E Sbjct: 473 DVDGYSVSSEGKHPRKPSSSAYGSNE 498 >gb|EOY26871.1| Smg-4/UPF3 family protein, putative isoform 2 [Theobroma cacao] Length = 487 Score = 345 bits (886), Expect = 5e-92 Identities = 217/519 (41%), Positives = 282/519 (54%), Gaps = 14/519 (2%) Frame = -1 Query: 2299 MKGPLDRTKVVLRHLPPTLSQSSLMEQVDSRFAGRYNWVSYRPGKTSQKLQSYSRAYIDF 2120 MK PL RTKVV+RHLPP+++QS L Q+D RF+ RYNW S+R GK+S K Q YSRAYI+F Sbjct: 1 MKEPLRRTKVVIRHLPPSVTQSFLFSQIDDRFSDRYNWFSFRLGKSSHKHQRYSRAYINF 60 Query: 2119 KRPDDVIEFAEFFDGHVFVNEKGTQFKTSVEYAPSQRVPKQWSKKDGREGTILRDPEYLE 1940 KRP+DV EFAEFFDGHVFVNEKGTQFK VEYAPSQRVPK +KKDGREGTI +DP+YLE Sbjct: 61 KRPEDVFEFAEFFDGHVFVNEKGTQFKAIVEYAPSQRVPKPGTKKDGREGTIFKDPDYLE 120 Query: 1939 FLEFLAKPVENLPSAEIQLXXXXXXXXXXXKDPPIVTPLMDYVRQKRAAKSGGRRSVSNG 1760 FL+ +AKPV+NLPSAEIQL K+ P++TPLM +VRQKRAA+SG + V+ Sbjct: 121 FLKLIAKPVDNLPSAEIQLERKEVELSGAPKETPVITPLMAFVRQKRAAESGTQGPVTRR 180 Query: 1759 KSTKRVSGAATGIXXXXXXXXXXXXXXXSAMYVLRDSAKVTSGKDRSTYVMVTKQDDQQL 1580 K ++ A+TG Y+L+DS K T KD+S + + +KQ+DQ + Sbjct: 181 KIGRKAGAASTG-----KSGSSSKRGSEKKKYILKDSVKGTHHKDKSKFFVASKQEDQPV 235 Query: 1579 ----LDKPRNDGLEEEGGSSATV-----PXXXXXXXXXXXXXXIPHISAIPSTQQNAASP 1427 +K N + G + PH+ S QQ ++SP Sbjct: 236 PSVGKEKRENGTVYGIDGPVTGITLTADSGKKKILLLKPKDQEAPHVPQGASEQQGSSSP 295 Query: 1426 VKNFISSGNVKQNQXXXXXXXXXXXILLNKD--NRXXXXXXXXXXXXXXSNLDKDKRPPR 1253 V N S KQ+Q ILL+ + NLD KRPPR Sbjct: 296 VANSPGSTAPKQSQRREAGGRLIRSILLSNEASQNQPLAGVKPQQKTQTMNLDNVKRPPR 355 Query: 1252 PPSLHLLQKDTNGAPDDRTPNDFHGFHTDKQERRPKNKERPDRGVWAPLRRXXXXXXXXX 1073 P + L G ++K E+R +NK+R DRGVWAPLR Sbjct: 356 PANTRL------------------GSGSEKHEKRIRNKDRLDRGVWAPLR-------GSD 390 Query: 1072 XXXXXXXXXXXXXAEGNHAETRSEITGARGEFKHRESGRGSHSSLDNGTYKHGARRGSAH 893 ++ A + S +G+ + SGR S +NG+ +H RR +A+ Sbjct: 391 VSQASEERFSPSMSQSAQASSNSIEGEMKGDIPNGRSGRNVPS--ENGSNRHFDRRSAAY 448 Query: 892 IKDSDG---SAEGKSLRRGGSCYGSHEKQVWVQKSSSGS 785 DG S+E KS +RG + G+HEKQ+WVQKSSSGS Sbjct: 449 NIKDDGSVISSESKSSKRGATGSGAHEKQIWVQKSSSGS 487