BLASTX nr result
ID: Catharanthus22_contig00002972
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00002972
         (750 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002266002.1| PREDICTED: uncharacterized protein LOC100258...   141   2e-31
gb|EXC36067.1| hypothetical protein L484_001367 [Morus notabilis]     138   2e-30
gb|EOY26267.1| Tetratricopeptide repeat-like superfamily protein...   137   4e-30
gb|EOY26266.1| Tetratricopeptide repeat (TPR)-like superfamily p...   137   4e-30
ref|XP_004144414.1| PREDICTED: uncharacterized protein LOC101220...   136   9e-30
ref|XP_004235499.1| PREDICTED: uncharacterized protein LOC101262...   134   3e-29
ref|XP_006468126.1| PREDICTED: uncharacterized protein LOC102618...   132   9e-29
ref|XP_006342848.1| PREDICTED: uncharacterized protein LOC102600...   131   2e-28
ref|NP_683507.1| tetratricopeptide repeat-containing protein [Ar...   129   8e-28
gb|AAC83019.1| F9K20.3 [Arabidopsis thaliana]                         129   8e-28
ref|NP_001185431.1| tetratricopeptide repeat-containing protein ...   129   8e-28
ref|NP_001185430.1| tetratricopeptide repeat-containing protein ...   129   8e-28
ref|XP_002889219.1| binding protein [Arabidopsis lyrata subsp. l...   128   2e-27
ref|XP_002310923.2| hypothetical protein POPTR_0008s00440g [Popu...   127   3e-27
ref|XP_002332853.1| predicted protein [Populus trichocarpa]           127   3e-27
gb|EOY26268.1| Tetratricopeptide repeat (TPR)-like superfamily p...   124   3e-26
ref|XP_006379387.1| hypothetical protein POPTR_0008s00440g [Popu...   124   3e-26
ref|XP_003529421.1| PREDICTED: uncharacterized protein LOC100790...   124   4e-26
ref|XP_006301680.1| hypothetical protein CARUB_v10022135mg [Caps...   123   6e-26
ref|XP_006389941.1| hypothetical protein EUTSA_v100188051mg, par...   123   7e-26

>ref|XP_002266002.1| PREDICTED: uncharacterized protein LOC100258138 [Vitis vinifera]
 gi|297735765|emb|CBI18452.3| unnamed protein product [Vitis vinifera]
          Length = 418

 Score = 141 bits (356), Expect = 2e-31
 Identities = 78/159 (49%), Positives = 108/159 (67%), Gaps = 3/159 (1%)
 Frame = +2

Query: 281 NRKPHHYYGLAIRTLRCCSNSASTPKPRGFG-SPPPHEPKPAKRT--RDDKTPVLPPRKS 451
           NRKP L + C S+S T RGFG PP + K +K T ++ K VL RKS
Sbjct: 21  NRKPP---SLLTFRIHCSSDSKPT---RGFGPQPPQRDNKMSKSTTSKEGKGGVLQQRKS 74

Query: 452 SPQRSVTTPNEAPVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDY 631
           S+ ++S + P +AP L+S+ GK N + FEERL+ V+R+AL+QK+A+++K+YGAIDY
Sbjct: 75  STSKQSGSVPTQAPGLSSRSGGKSNDAAIDLDFEERLEAVRRTALEQKKADEKKEYGAIDY 134

Query: 632 DAPIEPKSGSVGMGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           D P+E + ++G+GTKIG G+AVV+FGLVFA GDFLPSG
Sbjct: 135 DTPVESEEKTIGLGTKIGVGVAVVVFGLVFALGDFLPSG 173

>gb|EXC36067.1| hypothetical protein L484_001367 [Morus notabilis]
          Length = 398

 Score = 138 bits (347), Expect = 2e-30
 Identities = 73/143 (51%), Positives = 100/143 (69%), Gaps = 1/143 (0%)
 Frame = +2

Query: 323 LRCC-SNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEAPVLN 499
           +RC SNS S PK RGFG P ++ + + +K V+ RKS+ +RS + P +AP L
Sbjct: 16  IRCSDSNSNSNPK-RGFG-PNTNDNNKTNKGKKNKGLVIDQRKSAARRSGSEPAQAPGLR 73

Query: 500 SQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTK 679
           SQ GK + S FEERL+ +KR+AL+QK+ E+EK++GAIDYD PIE + ++G+GTK
Sbjct: 74  SQFGGKSKNSSIDVDFEERLKAIKRAALEQKKVEEEKEFGAIDYDVPIESEKKTIGLGTK 133

Query: 680 IGAGIAVVIFGLVFAFGDFLPSG 748
           IG G+AV +FGLVFA GDFLP+G
Sbjct: 134 IGVGVAVAVFGLVFALGDFLPTG 156

>gb|EOY26267.1| Tetratricopeptide repeat-like superfamily protein isoform 2 [Theobroma cacao]
          Length = 373

 Score = 137 bits (345), Expect = 4e-30
 Identities = 70/140 (50%), Positives = 100/140 (71%), Gaps = 1/140 (0%)
 Frame = +2

Query: 332 CSNSASTPKPRGFGSPPPHEP-KPAKRTRDDKTPVLPPRKSSPQRSVTTPNEAPVLNSQV 508
           CS+S + RGFGS P++ +R++K L RKS+ ++S +P +AP L++Q
Sbjct: 25  CSDSKAK---RGFGSKKPNQKANKVSASREEKGMKLQQRKSTSKQSGPSPAQAPGLSAQF 81

Query: 509 DGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTKIGA 688
           DGK NS S FEERL+ ++R+A+QQK+AE++K++G IDYDAP E ++G+GT+IG
Sbjct: 82  DGKSNSSSLDIDFEERLEAIRRAAVQQKKAEEQKEFGPIDYDAPAESDKKTIGLGTQIGV 141

Query: 689 GIAVVIFGLVFAFGDFLPSG 748
           G+AVV+FGLVFA GDFLPSG
Sbjct: 142 GVAVVVFGLVFALGDFLPSG 161

>gb|EOY26266.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1 [Theobroma cacao]
          Length = 406

 Score = 137 bits (345), Expect = 4e-30
 Identities = 70/140 (50%), Positives = 100/140 (71%), Gaps = 1/140 (0%)
 Frame = +2

Query: 332 CSNSASTPKPRGFGSPPPHEP-KPAKRTRDDKTPVLPPRKSSPQRSVTTPNEAPVLNSQV 508
           CS+S + RGFGS P++ +R++K L RKS+ ++S +P +AP L++Q
Sbjct: 25  CSDSKAK---RGFGSKKPNQKANKVSASREEKGMKLQQRKSTSKQSGPSPAQAPGLSAQF 81

Query: 509 DGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTKIGA 688
           DGK NS S FEERL+ ++R+A+QQK+AE++K++G IDYDAP E ++G+GT+IG
Sbjct: 82  DGKSNSSSLDIDFEERLEAIRRAAVQQKKAEEQKEFGPIDYDAPAESDKKTIGLGTQIGV 141

Query: 689 GIAVVIFGLVFAFGDFLPSG 748
           G+AVV+FGLVFA GDFLPSG
Sbjct: 142 GVAVVVFGLVFALGDFLPSG 161

>ref|XP_004144414.1| PREDICTED: uncharacterized protein LOC101220521 [Cucumis sativus]
          Length = 389

 Score = 136 bits (342), Expect = 9e-30
 Identities = 68/138 (49%), Positives = 98/138 (71%)
 Frame = +2

Query: 335 SNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEAPVLNSQVDG 514
           S S S P+ RGFG+ ++ A + +K V PRK P++S T P +AP ++ + DG
Sbjct: 16  SRSDSNPR-RGFGNKEDNKADKAGSSGKEKGRVYQPRKPIPKQSSTVPTQAPAVSFRNDG 74

Query: 515 KYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTKIGAGI 694
           +KS QFEERL+ VKRSAL++K+A+ +K++GAIDYDAP+E + ++G+GTK+G G+
Sbjct: 75  NSYNKSLDLQFEERLEAVKRSALEKKKADIKKEFGAIDYDAPVESEEKTIGLGTKVGIGV 134

Query: 695 AVVIFGLVFAFGDFLPSG 748
           AV++FG VFA GDFLPSG
Sbjct: 135 AVLVFGFVFALGDFLPSG 152

>ref|XP_004235499.1| PREDICTED: uncharacterized protein LOC101262497 [Solanum lycopersicum]
          Length = 444

 Score = 134 bits (337), Expect = 3e-29
 Identities = 77/144 (53%), Positives = 95/144 (65%), Gaps = 4/144 (2%)
 Frame = +2

Query: 329 CCSNSASTPKPRGFGSPPPHEPK---PAKRTRDDKTPVLPPRKSSPQRSVTTPNEAPVLN 499
           CCS +++ RGFG + K A R K V P R SS ++S N+AP LN
Sbjct: 60  CCSTDSNSS--RGFGPSDGNTNKGKNSATARRQAKGTVRPQRNSSTRQSDDLLNQAPRLN 117

Query: 500 SQVD-GKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGT 676
           S D K S + QFEERL VVKRSALQQK+ E+EK YGAIDYDAP+E +S ++G+GT
Sbjct: 118 STNDVRKSKSVGSDLQFEERLDVVKRSALQQKKTEEEKAYGAIDYDAPVESRSTTIGLGT 177

Query: 677 KIGAGIAVVIFGLVFAFGDFLPSG 748
           KIG G AV++FGL+FA GDFLPSG
Sbjct: 178 KIGVGAAVIVFGLLFALGDFLPSG 201

>ref|XP_006468126.1| PREDICTED: uncharacterized protein LOC102618377 [Citrus sinensis]
          Length = 398

 Score = 132 bits (333), Expect = 9e-29
 Identities = 75/150 (50%), Positives = 102/150 (68%), Gaps = 2/150 (1%)
 Frame = +2

Query: 305 GLAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVL--PPRKSSPQRSVTTP 478
           GL ++C S S P+ RGFG+ K K +++K V+ P RKS ++S + P
Sbjct: 12  GLLAFRIQC---SDSKPR-RGFGN------KTDKTNKEEKKGVMSQPKRKSLSKQSGSLP 61

Query: 479 NEAPVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSG 658
           +AP+L S + K N+ S+ FEERL V+RSAL+QK+AE+ K++G IDYDAPIE +
Sbjct: 62  TQAPILGSGYNSKSNNSSSDIDFEERLAAVRRSALEQKKAEEIKEFGPIDYDAPIETEKK 121

Query: 659 SVGMGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           ++G+GTKIG G+AVVIFGLVFA GDFLPSG
Sbjct: 122 TIGLGTKIGVGVAVVIFGLVFALGDFLPSG 151

>ref|XP_006342848.1| PREDICTED: uncharacterized protein LOC102600762 [Solanum tuberosum]
          Length = 451

 Score = 131 bits (330), Expect = 2e-28
 Identities = 83/185 (44%), Positives = 104/185 (56%), Gaps = 11/185 (5%)
 Frame = +2

Query: 227 NWIRKISYPPISISLLNSNRKPHHY-------YGLAIRTLRCCSNSASTPKPRGFGSPPP 385
           N+ R S+P +R HH+ YG+ L C S + RG G
Sbjct: 34  NYTRTFSFPS------GKSRSNHHFQYHTKSKYGV----LNCICCSTDSNSRRGLGPSDG 83

Query: 386 HEPKPAKRT---RDDKTPVLPPRKSSPQRSVTTPNEAPVLNSQVD-GKYNSKSNYNQFEE 553
           + K R K V P R SS +S N+AP LNS D K S + +FEE
Sbjct: 84  NTNKGKNSVTARRQAKGTVRPQRNSSTPQSDDLLNKAPRLNSTNDVRKSKSVGSDLKFEE 143

Query: 554 RLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTKIGAGIAVVIFGLVFAFGD 733
           RLQ VKRSALQQK+ E+EK YGAIDYDAP+E +S ++G+GTKIG G AV++FGL+FA GD
Sbjct: 144 RLQAVKRSALQQKKTEEEKPYGAIDYDAPVESRSTTIGLGTKIGVGAAVIVFGLLFALGD 203

Query: 734 FLPSG 748
           FLPSG
Sbjct: 204 FLPSG 208

>ref|NP_683507.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|26452264|dbj|BAC43219.1| unknown protein [Arabidopsis thaliana]
 gi|30725310|gb|AAP37677.1| At1g78915 [Arabidopsis thaliana]
 gi|332198053|gb|AEE36174.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 385

 Score = 129 bits (325), Expect = 8e-28
 Identities = 71/147 (48%), Positives = 100/147 (68%)
 Frame = +2

Query: 308 LAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEA 487
           L + +RC S S PK RGFGS +++K P L RKSS ++SV+ P +A
Sbjct: 17  LLLFRIRC---SDSNPK-RGFGSK-----------KEEKDPALQQRKSSSKQSVSVPRKA 61

Query: 488 PVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVG 667
           P LN+Q +GK + +S F+ERL+ ++RSAL+QK+ E K++G IDYDAP++ ++G
Sbjct: 62  PGLNTQFEGK-SGRSFDIDFDERLENIRRSALEQKKTEVVKEFGPIDYDAPVKSDQKTIG 120

Query: 668 MGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           +GTK+G GIAVV+FGLVFA GDFLP+G
Sbjct: 121 LGTKVGVGIAVVVFGLVFALGDFLPTG 147

>gb|AAC83019.1| F9K20.3 [Arabidopsis thaliana]
          Length = 372

 Score = 129 bits (325), Expect = 8e-28
 Identities = 71/147 (48%), Positives = 100/147 (68%)
 Frame = +2

Query: 308 LAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEA 487
           L + +RC S S PK RGFGS +++K P L RKSS ++SV+ P +A
Sbjct: 17  LLLFRIRC---SDSNPK-RGFGSK-----------KEEKDPALQQRKSSSKQSVSVPRKA 61

Query: 488 PVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVG 667
           P LN+Q +GK + +S F+ERL+ ++RSAL+QK+ E K++G IDYDAP++ ++G
Sbjct: 62  PGLNTQFEGK-SGRSFDIDFDERLENIRRSALEQKKTEVVKEFGPIDYDAPVKSDQKTIG 120

Query: 668 MGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           +GTK+G GIAVV+FGLVFA GDFLP+G
Sbjct: 121 LGTKVGVGIAVVVFGLVFALGDFLPTG 147

>ref|NP_001185431.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332198055|gb|AEE36176.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 405

 Score = 129 bits (325), Expect = 8e-28
 Identities = 71/147 (48%), Positives = 100/147 (68%)
 Frame = +2

Query: 308 LAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEA 487
           L + +RC S S PK RGFGS +++K P L RKSS ++SV+ P +A
Sbjct: 17  LLLFRIRC---SDSNPK-RGFGSK-----------KEEKDPALQQRKSSSKQSVSVPRKA 61

Query: 488 PVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVG 667
           P LN+Q +GK + +S F+ERL+ ++RSAL+QK+ E K++G IDYDAP++ ++G
Sbjct: 62  PGLNTQFEGK-SGRSFDIDFDERLENIRRSALEQKKTEVVKEFGPIDYDAPVKSDQKTIG 120

Query: 668 MGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           +GTK+G GIAVV+FGLVFA GDFLP+G
Sbjct: 121 LGTKVGVGIAVVVFGLVFALGDFLPTG 147

>ref|NP_001185430.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332198054|gb|AEE36175.1| tetratricopeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 402

 Score = 129 bits (325), Expect = 8e-28
 Identities = 71/147 (48%), Positives = 100/147 (68%)
 Frame = +2

Query: 308 LAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEA 487
           L + +RC S S PK RGFGS +++K P L RKSS ++SV+ P +A
Sbjct: 17  LLLFRIRC---SDSNPK-RGFGSK-----------KEEKDPALQQRKSSSKQSVSVPRKA 61

Query: 488 PVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVG 667
           P LN+Q +GK + +S F+ERL+ ++RSAL+QK+ E K++G IDYDAP++ ++G
Sbjct: 62  PGLNTQFEGK-SGRSFDIDFDERLENIRRSALEQKKTEVVKEFGPIDYDAPVKSDQKTIG 120

Query: 668 MGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           +GTK+G GIAVV+FGLVFA GDFLP+G
Sbjct: 121 LGTKVGVGIAVVVFGLVFALGDFLPTG 147

>ref|XP_002889219.1| binding protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335060|gb|EFH65478.1| binding protein [Arabidopsis lyrata subsp. lyrata]
          Length = 384

 Score = 128 bits (322), Expect = 2e-27
 Identities = 71/147 (48%), Positives = 98/147 (66%)
 Frame = +2

Query: 308 LAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEA 487
           L + +RC S S PK RGFG +++K P L RKSS ++SV+ P +A
Sbjct: 16  LLVFRIRC---SDSNPK-RGFGFK-----------KEEKDPALQQRKSSSKQSVSVPRKA 60

Query: 488 PVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVG 667
           P LN+Q +GK + + FEERL+ ++RSAL+QK+ E K++G IDYDAPI+ ++G
Sbjct: 61  PGLNTQFEGKSGPSFDID-FEERLENIRRSALEQKKTEVVKEFGPIDYDAPIKSDQKTIG 119

Query: 668 MGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           +GTK+G GIAVV+FGLVFA GDFLP+G
Sbjct: 120 LGTKVGVGIAVVVFGLVFALGDFLPTG 146

>ref|XP_002310923.2| hypothetical protein POPTR_0008s00440g [Populus trichocarpa]
 gi|550332069|gb|EEE88290.2| hypothetical protein POPTR_0008s00440g [Populus trichocarpa]
          Length = 411

 Score = 127 bits (320), Expect = 3e-27
 Identities = 69/144 (47%), Positives = 101/144 (70%), Gaps = 5/144 (3%)
 Frame = +2

Query: 332 CSNSASTPKPRGFGSPPPHEPKPAK----RTRDDKTPVLPPRKSSPQRS-VTTPNEAPVL 496
           CS+++S P+ RGFGS + K +R++K L RKS+ ++S + P++AP L
Sbjct: 25  CSDNSS-PR-RGFGSKSDNNTNNKKVRSSSSREEKGMALQQRKSTTKQSGASLPSQAPGL 82

Query: 497 NSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGT 676
           +S+ DGK + S FEERLQ V+RSAL+QK+ E K++G IDYD P++ ++ ++G+GT
Sbjct: 83  SSRFDGKSSRNSADTDFEERLQAVRRSALEQKKTEAIKEFGPIDYDEPVKTENKTIGLGT 142

Query: 677 KIGAGIAVVIFGLVFAFGDFLPSG 748
           KIG G+AV++FGLVFA GDFLPSG
Sbjct: 143 KIGVGVAVLVFGLVFALGDFLPSG 166

>ref|XP_002332853.1| predicted protein [Populus trichocarpa]
          Length = 212

 Score = 127 bits (320), Expect = 3e-27
 Identities = 69/144 (47%), Positives = 101/144 (70%), Gaps = 5/144 (3%)
 Frame = +2

Query: 332 CSNSASTPKPRGFGSPPPHEPKPAK----RTRDDKTPVLPPRKSSPQRS-VTTPNEAPVL 496
           CS+++S P+ RGFGS + K +R++K L RKS+ ++S + P++AP L
Sbjct: 25  CSDNSS-PR-RGFGSKSDNNTNNKKVRSSSSREEKGMALQQRKSTTKQSGASLPSQAPGL 82

Query: 497 NSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGT 676
           +S+ DGK + S FEERLQ V+RSAL+QK+ E K++G IDYD P++ ++ ++G+GT
Sbjct: 83  SSRFDGKSSRNSADTDFEERLQAVRRSALEQKKTEAIKEFGPIDYDEPVKTENKTIGLGT 142

Query: 677 KIGAGIAVVIFGLVFAFGDFLPSG 748
           KIG G+AV++FGLVFA GDFLPSG
Sbjct: 143 KIGVGVAVLVFGLVFALGDFLPSG 166

>gb|EOY26268.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 3, partial [Theobroma cacao]
          Length = 354

 Score = 124 bits (311), Expect = 3e-26
 Identities = 58/102 (56%), Positives = 81/102 (79%)
 Frame = +2

Query: 443 RKSSPQRSVTTPNEAPVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGA 622
           RKS+ ++S +P +AP L++Q DGK NS S FEERL+ ++R+A+QQK+AE++K++G
Sbjct: 8   RKSTSKQSGPSPAQAPGLSAQFDGKSNSSSLDIDFEERLEAIRRAAVQQKKAEEQKEFGP 67

Query: 623 IDYDAPIEPKSGSVGMGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           IDYDAP E ++G+GT+IG G+AVV+FGLVFA GDFLPSG
Sbjct: 68  IDYDAPAESDKKTIGLGTQIGVGVAVVVFGLVFALGDFLPSG 109

>ref|XP_006379387.1| hypothetical protein POPTR_0008s00440g [Populus trichocarpa]
 gi|118486611|gb|ABK95143.1| unknown [Populus trichocarpa]
 gi|550332068|gb|ERP57184.1| hypothetical protein POPTR_0008s00440g [Populus trichocarpa]
          Length = 405

 Score = 124 bits (311), Expect = 3e-26
 Identities = 69/143 (48%), Positives = 98/143 (68%), Gaps = 4/143 (2%)
 Frame = +2

Query: 332 CSNSASTPKPRGFGSPPPHEPKPAK----RTRDDKTPVLPPRKSSPQRSVTTPNEAPVLN 499
           CS+++S P+ RGFGS + K +R++K L RKS+ ++S EAP L+
Sbjct: 25  CSDNSS-PR-RGFGSKSDNNTNNKKVRSSSSREEKGMALQQRKSTTKQS-----EAPGLS 77

Query: 500 SQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTK 679
           S+ DGK + S FEERLQ V+RSAL+QK+ E K++G IDYD P++ ++ ++G+GTK
Sbjct: 78  SRFDGKSSRNSADTDFEERLQAVRRSALEQKKTEAIKEFGPIDYDEPVKTENKTIGLGTK 137

Query: 680 IGAGIAVVIFGLVFAFGDFLPSG 748
           IG G+AV++FGLVFA GDFLPSG
Sbjct: 138 IGVGVAVLVFGLVFALGDFLPSG 160

>ref|XP_003529421.1| PREDICTED: uncharacterized protein LOC100790462 [Glycine max]
          Length = 389

 Score = 124 bits (310), Expect = 4e-26
 Identities = 68/138 (49%), Positives = 91/138 (65%), Gaps = 1/138 (0%)
 Frame = +2

Query: 338 NSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQR-SVTTPNEAPVLNSQVDG 514
           N + + + RGFG K + DK V K S + S ++AP L+SQ+DG
Sbjct: 22  NCSDSKQGRGFGENT--NSNRIKTNKSDKGLVSQQSKGSANKQSRPLSSQAPRLSSQLDG 79

Query: 515 KYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVGMGTKIGAGI 694
           K + FEERL+ V+RSAL+QK+AE+EK++GAIDYDAPI + ++G+GTKIG G+
Sbjct: 80  KSRNDFLDVDFEERLKAVRRSALEQKKAEEEKEFGAIDYDAPIPSDNTTIGVGTKIGVGV 139

Query: 695 AVVIFGLVFAFGDFLPSG 748
           AV +FGLVFAFGDFLPSG
Sbjct: 140 AVAVFGLVFAFGDFLPSG 157

>ref|XP_006301680.1| hypothetical protein CARUB_v10022135mg [Capsella rubella]
 gi|482570390|gb|EOA34578.1| hypothetical protein CARUB_v10022135mg [Capsella rubella]
          Length = 388

 Score = 123 bits (309), Expect = 6e-26
 Identities = 71/140 (50%), Positives = 92/140 (65%), Gaps = 2/140 (1%)
 Frame = +2

Query: 335 SNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEAPVLNSQVDG 514
           S S S PK RGFGS +++K L RKSS ++SV+ P +AP LN+Q DG
Sbjct: 24  SCSDSNPK-RGFGSK-----------KEEKDSALQQRKSSSKQSVSVPRKAPGLNTQFDG 71

Query: 515 KYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGS--VGMGTKIGA 688
           K + +S FEERL+ ++RSAL+QK+ E K++G IDYDAP S +G+GTK+G
Sbjct: 72  K-SDRSFDIDFEERLETIRRSALEQKKTEVVKEFGPIDYDAPASVNSDQKKIGLGTKVGV 130

Query: 689 GIAVVIFGLVFAFGDFLPSG 748
           GIAVV+FGLVFA GDFLP G
Sbjct: 131 GIAVVVFGLVFALGDFLPMG 150

>ref|XP_006389941.1| hypothetical protein EUTSA_v100188051mg, partial [Eutrema salsugineum]
 gi|557086375|gb|ESQ27227.1| hypothetical protein EUTSA_v100188051mg, partial [Eutrema salsugineum]
          Length = 308

 Score = 123 bits (308), Expect = 7e-26
 Identities = 68/147 (46%), Positives = 95/147 (64%)
 Frame = +2

Query: 308 LAIRTLRCCSNSASTPKPRGFGSPPPHEPKPAKRTRDDKTPVLPPRKSSPQRSVTTPNEA 487
           L + +RC S S PK RGFGS +++K RK+S + SV+ P +A
Sbjct: 17  LLVFRIRC---SDSNPK-RGFGSK-----------KEEKDSAFQQRKTSSKPSVSVPRKA 61

Query: 488 PVLNSQVDGKYNSKSNYNQFEERLQVVKRSALQQKRAEQEKQYGAIDYDAPIEPKSGSVG 667
           P L SQ +GK + +S FEERL+ ++RSAL+QK+ E K++G IDYD P++ ++G
Sbjct: 62  PGLKSQFEGK-SGRSFDIDFEERLETIRRSALEQKKTEVVKEFGPIDYDTPVQSDQKTIG 120

Query: 668 MGTKIGAGIAVVIFGLVFAFGDFLPSG 748
           +GTK+G GIAVV+FGLVFA GDFLP+G
Sbjct: 121 LGTKVGVGIAVVVFGLVFALGDFLPTG 147