BLASTX nr result
ID: Astragalus22_contig00032832
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00032832 (991 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

ref|XP_013624380.1| PREDICTED: uncharacterized protein LOC106330...  247  8e-72
ref|XP_015960841.1| uncharacterized protein LOC107484813 [Arachi...  243  2e-71
ref|XP_013594438.1| PREDICTED: uncharacterized protein LOC106302...  237  3e-67
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...  236  4e-66
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis...  235  7e-66
gb|KYP69518.1| Retrovirus-related Pol polyprotein from transposo...  225  3e-64
gb|KYP64799.1| Retrovirus-related Pol polyprotein from transposo...  224  7e-64
gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) ...  227  4e-63
gb|KZV44334.1| hypothetical protein F511_18136 [Dorcoceras hygro...  214  1e-62
ref|XP_010526684.1| PREDICTED: uncharacterized protein LOC104804...  221  2e-62
ref|XP_022548561.1| uncharacterized protein LOC111201234 [Brassi...  223  3e-62
gb|KYP41111.1| Retrovirus-related Pol polyprotein from transposo...  214  4e-62
ref|XP_010526683.1| PREDICTED: uncharacterized protein LOC104804...  221  6e-62
ref|XP_018454140.1| PREDICTED: uncharacterized protein LOC108825...  222  6e-62
gb|KZV17946.1| hypothetical protein F511_10775 [Dorcoceras hygro...  222  7e-62
emb|CAA72989.1| unnamed protein product [Brassica oleracea var. ...  223  7e-62
ref|XP_009121252.1| PREDICTED: uncharacterized protein LOC103846...
221 8e-62 ref|XP_010526682.1| PREDICTED: uncharacterized protein LOC104804... 221 9e-62 dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis t... 223 1e-61 gb|PRQ38882.1| putative RNA-directed DNA polymerase [Rosa chinen... 223 1e-61 >ref|XP_013624380.1| PREDICTED: uncharacterized protein LOC106330465 [Brassica oleracea var. oleracea] Length = 803 Score = 247 bits (630), Expect = 8e-72 Identities = 121/222 (54%), Positives = 152/222 (68%), Gaps = 4/222 (1%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KS S + F + HTQ+++ IK IRSDNA E TD LQ+ G H F CP+ PQQN Sbjct: 526 KNKSSVSTVFPEFLRLIHTQYNSNIKAIRSDNAPELAFTDLLQEKGIEHYFSCPYTPQQN 585 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+L+FQ+ V L YWGEC+ A+YLIN+TPSPLL N+SPFE+L SK Sbjct: 586 SVVERKHQHILNVARALLFQSKVPLIYWGECIQTAIYLINRTPSPLLQNKSPFELLTSKI 645 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P Y ++VFGCL Y ST RNKFTP A P +F+GYP GYKG+++ D K +ISR+V Sbjct: 646 PSYDHLRVFGCLCYTSTLQKDRNKFTPRANPGVFLGYPHGYKGYRVLDTTTNKIIISRNV 705 Query: 580 IFHESIFPFKKNS-TIDLHLHDPATTVLP---NISDLPSTFD 693 +FHES FPF K+S +D + VLP S P+ FD Sbjct: 706 VFHESYFPFAKDSNNLDAENYFFDQDVLPMHVPDSSFPAFFD 747 >ref|XP_015960841.1| uncharacterized protein LOC107484813 [Arachis duranensis] Length = 672 Score = 243 bits (620), Expect = 2e-71 Identities = 134/308 (43%), Positives = 175/308 (56%), Gaps = 10/308 (3%) Frame = +1 Query: 67 IKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQNAVVERKHQH 246 IKAF+ TQFS IKC RSDNAKE TDFLQ+ G H F CP+RPQQNAVVERKHQH Sbjct: 380 IKAFYAMIKTQFSKKIKCFRSDNAKELAATDFLQEKGVLHHFSCPYRPQQNAVVERKHQH 439 Query: 247 LLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKSPDYSLVKVF 426 LLNVAR+L FQ+ V +T+ GECVS A +LIN+TPS LL +SPFE+L+ KSP+Y +K+F Sbjct: 440 LLNVARALYFQSQVPITFLGECVSTAAFLINRTPSSLLKMKSPFELLFEKSPNYKAMKIF 499 Query: 427 GCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDVIFHESIFPF 606 GCL+Y +T R KF P A +F GYP GYKG+KL+++ 
++F+IS DVIFHE PF Sbjct: 500 GCLAYATTNTSSRLKFDPRADTTVFFGYPFGYKGYKLYNLRTKQFLISMDVIFHEDTMPF 559 Query: 607 KKNS--------TIDLHLHDPA--TTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXX 756 +N D+ L +P + LPN +PS Sbjct: 560 AQNPHTQLNNDIFFDVVLPNPILDSEPLPNAPTIPSV----------------------- 596 Query: 757 XXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPA 936 IPQ + +T + L + ++ PRRS+RT + P+ Sbjct: 597 -----------------PKIPQPSTTTNTQSQILPLIENQISPSTSIQPRRSTRTKHTPS 639 Query: 937 YLKEYDCN 960 YL +Y C+ Sbjct: 640 YLHDYICH 647 >ref|XP_013594438.1| PREDICTED: uncharacterized protein LOC106302482 [Brassica oleracea var. oleracea] Length = 977 Score = 237 bits (604), Expect = 3e-67 Identities = 109/222 (49%), Positives = 154/222 (69%), Gaps = 2/222 (0%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K+KSD L F + +TQ++ +K IRSDNA E ++ NG H F C + P+QN Sbjct: 683 KRKSDVITLFPEFLQRVYTQYNVRVKAIRSDNAPELRFAKLIKTNGMIHYFSCAYTPEQN 742 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQHLLNVAR+L+FQ+ V L YW +C++ AV+LIN+ PSPLLD+++P+E+L + Sbjct: 743 SVVERKHQHLLNVARALLFQSNVPLLYWSDCITTAVFLINRIPSPLLDHRTPYEVLLKRK 802 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 PDYSL++ FGCL YVST RNKF+P A PC+F+GYP GYKG+KL D+ + ISR V Sbjct: 803 PDYSLLRSFGCLCYVSTLQKDRNKFSPRARPCLFLGYPSGYKGYKLLDLDNNSVSISRHV 862 Query: 580 IFHESIFPFKKNSTI--DLHLHDPATTVLPNISDLPSTFDYS 699 +FHES++P K +++I D H +P +DL ++ D++ Sbjct: 863 VFHESVYPLKSSTSIIPDFFSHYILPNSVPYTADLDASIDHN 904 >gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score: 11.19) [Arabidopsis thaliana] Length = 1633 Score = 236 bits (601), Expect = 4e-66 Identities = 127/307 (41%), Positives = 172/307 (56%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KS+ S++ F K TQ++ IK IRSDN KE T F+++ G HQF C + PQQN Sbjct: 595 KNKSEVSNIFPVFVKLIFTQYNAKIKAIRSDNVKELAFTKFVKEQGMIHQFSCAYTPQQN 654 Query: 220 
AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQHLLN+ARSL+FQ+ V L YW +CV A YLIN+ PSPLLDN++PFE+L K Sbjct: 655 SVVERKHQHLLNIARSLLFQSNVPLQYWSDCVLTAAYLINRLPSPLLDNKTPFELLLKKI 714 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 PDY+L+K CL Y ST + RNKF+P A PC+F+GYP GYKG+K+ D+ I+R+V Sbjct: 715 PDYTLLK--SCLCYASTNVHDRNKFSPRARPCVFLGYPSGYKGYKVLDLESHSISITRNV 772 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759 +FHE+ FPFK + + + ++LP LP+ + Sbjct: 773 VFHETKFPFKTSKFLKESVDMFPNSILP----LPAPLHF--------VESMPLDDDLRAD 820 Query: 760 XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939 +P Q+T + LD+ + ++VP R R PAY Sbjct: 821 DNNASTSNSASSASSIPPLPSTVNTQNT-DALDI-------DTNSVPIARPKRNAKAPAY 872 Query: 940 LKEYDCN 960 L EY CN Sbjct: 873 LSEYHCN 879 >emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana] emb|CAB78488.1| retrovirus-related like polyprotein [Arabidopsis thaliana] Length = 1489 Score = 235 bits (599), Expect = 7e-66 Identities = 124/310 (40%), Positives = 174/310 (56%), Gaps = 1/310 (0%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + K D S + F K TQF+ IK IRSDNA E T+ ++++G H F C + PQQN Sbjct: 672 RNKKDVSSVFPEFIKLVSTQFNAKIKAIRSDNAPELGFTEIVKEHGMLHHFSCAYTPQQN 731 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+L+FQ+ + + YW +CV+ AV+LIN+ PSPLL+N+SP+E++ +K Sbjct: 732 SVVERKHQHILNVARALLFQSNIPMQYWSDCVTTAVFLINRLPSPLLNNKSPYELILNKQ 791 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 PDYSL+K FGCL +VST R KFTP A C+F+GYP GYKG+K+ D+ +SR+V Sbjct: 792 PDYSLLKNFGCLCFVSTNAHERTKFTPRARACVFLGYPSGYKGYKVLDLESHSVTVSRNV 851 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPN-ISDLPSTFDYSXXXXXXXXXXXXXXXXXXX 756 +F E +FPFK + L + A + PN I LP+ + Sbjct: 852 VFKEHVFPFKTS-----ELLNKAVDMFPNSILPLPAPLHF-------VETMPLIDEDSLI 899 Query: 757 XXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPA 936 +P S +E D+ + 
+ VP RS RT P+ Sbjct: 900 PTTTDSRTADNHASSSSSALPSIIPPSSNTETQDI-------DSNAVPITRSKRTTRAPS 952 Query: 937 YLKEYDCNLL 966 YL EY C+L+ Sbjct: 953 YLSEYHCSLV 962 >gb|KYP69518.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 731 Score = 225 bits (574), Expect = 3e-64 Identities = 127/306 (41%), Positives = 169/306 (55%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + KSD I F Y QF T IK RSDN +E DF + G HQF C RP+QN Sbjct: 237 QNKSDCIKNIPQIFAYVENQFQTKIKSFRSDNVRELHFKDFFLEKGVLHQFLCVERPKQN 296 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKH H+LN+AR+LMFQ+ V + WG+ V V+++N+TPSP+L++ SP+EILY+K Sbjct: 297 SVVERKHLHILNIARTLMFQSNVPIKIWGDYVKTVVFIMNRTPSPILNHISPYEILYNKV 356 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P+YS + FG L Y ST L R+KF+P A +FIGYP GYKG+KLFD+ + IS+DV Sbjct: 357 PNYSDFRTFGTLCYASTLLSGRHKFSPRAIAAVFIGYPHGYKGYKLFDLTTHQTFISKDV 416 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759 F+E IFPF+ +++ D + PN++ P+TFD S Sbjct: 417 KFYEHIFPFQNSNSSDRGHGVFNDQITPNLNH-PATFDDSDPIQ---------------- 459 Query: 760 XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939 P +T Q T + L SP EP P RRS+R NPP Y Sbjct: 460 -------------------PTHTQNQFTIQQL----SP-PNEPDQAPIRRSNRAVNPPGY 495 Query: 940 LKEYDC 957 L +Y C Sbjct: 496 LSDYHC 501 >gb|KYP64799.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 714 Score = 224 bits (571), Expect = 7e-64 Identities = 132/307 (42%), Positives = 169/307 (55%), Gaps = 1/307 (0%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KSD + +I F Y TQF+ T K RSDNAKE TD + G HQF C RPQQN Sbjct: 407 KNKSDCAIVIPQFISYIETQFNKTPKTFRSDNAKELSFTDLFSKKGIIHQFSCVERPQQN 466 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKH H+LN+AR+LMFQ+ V L +WGECV AV+L+N+TPS +LD++SPFEILY K Sbjct: 467 
SVVERKHLHILNIARALMFQSNVPLKFWGECVKTAVFLMNRTPSLILDSKSPFEILYDKI 526 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P+Y +VFG L Y ST L R+KFT A +F+GYP+GYKG+KL D+ ++ ISRDV Sbjct: 527 PNYVDFRVFGSLCYASTLLSSRHKFTHRAVAAVFLGYPKGYKGYKLLDLSTKQIFISRDV 586 Query: 580 IFHESIFPFK-KNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXX 756 F E IF FK + TI DP T PNI+ + D Sbjct: 587 KFFEHIFSFKPAHQTIS----DPYIT--PNITYSHTLDDID------------------- 621 Query: 757 XXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPA 936 P T + T P ++ P+ RRS++T NPP+ Sbjct: 622 --------------DHNTIAPYTTTIPHT--PPTIMDQPSL--------RRSNKTSNPPS 657 Query: 937 YLKEYDC 957 YL +Y C Sbjct: 658 YLNDYHC 664 >gb|KZV25004.1| Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum] Length = 1404 Score = 227 bits (578), Expect = 4e-63 Identities = 110/215 (51%), Positives = 146/215 (67%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KSD + F + TQF T+K +RSDNA E DF + G TH C RPQQN Sbjct: 613 KSKSDVLSIFPDFCRMVSTQFGVTVKSVRSDNAPELGFADFFAKAGITHYHSCVERPQQN 672 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+L+FQ+ + L YW +C++ +VYLIN+TPSP+L +++PFE+L+ K Sbjct: 673 SVVERKHQHILNVARALLFQSHIPLDYWCDCINTSVYLINRTPSPILAHKTPFELLHGKL 732 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P YS +KVFGCL Y ST L R+KF+P A C+FIGYP GYKG+KL ++ + ISRDV Sbjct: 733 PSYSHLKVFGCLCYASTLLSSRHKFSPRAIRCVFIGYPPGYKGYKLLNLETNEIFISRDV 792 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPS 684 IFHE+ FP++ +T + L D V P+ PS Sbjct: 793 IFHENTFPYQ--NTSPMSLSDMTFEVSPSSQITPS 825 >gb|KZV44334.1| hypothetical protein F511_18136 [Dorcoceras hygrometricum] Length = 442 Score = 214 bits (546), Expect = 1e-62 Identities = 100/190 (52%), Positives = 133/190 (70%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + KS+ S + F + +TQF IK +RSDNA E + + G H C RPQQN Sbjct: 3 
RSKSNVSSIFPTFCQKIYTQFGAKIKAVRSDNAPELGFVNLFNKLGIIHNHSCVERPQQN 62 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+LMFQ+ + + Y +C+ +VYLIN+TPSPLL +Q+PFE+L+ K Sbjct: 63 SVVERKHQHILNVARALMFQSHLPIAYGSDCIVTSVYLINRTPSPLLSHQTPFEVLHRKR 122 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P YS +KVFGCL Y ST L R+KF+P A C+F+GYP GYKG+KL ++ + ISRDV Sbjct: 123 PAYSHLKVFGCLCYASTLLSSRSKFSPRAVKCVFLGYPPGYKGYKLINLDTNEIFISRDV 182 Query: 580 IFHESIFPFK 609 IFHE +FPF+ Sbjct: 183 IFHEHVFPFQ 192 >ref|XP_010526684.1| PREDICTED: uncharacterized protein LOC104804180 isoform X6 [Tarenaya hassleriana] Length = 789 Score = 221 bits (564), Expect = 2e-62 Identities = 121/309 (39%), Positives = 163/309 (52%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KSD F + QF+ +IKC+RSDNA E + G HQF CP+ PQQN Sbjct: 187 KSKSDVLQKFPEFVSFVENQFNASIKCVRSDNAPELGFKSLFAKKGILHQFSCPYTPQQN 246 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 ++VERKHQH+LNVAR+L+FQ+ V L +WG+C+ +VYLIN+TPSPLL N++PFE+L S Sbjct: 247 SIVERKHQHILNVARALLFQSNVPLAFWGDCILTSVYLINRTPSPLLQNKTPFELLTGCS 306 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P YS ++VFGCL YVST R+KF P A +F+GYP G KG+K+ D+ +ISR+V Sbjct: 307 PSYSHLRVFGCLCYVSTLTKDRHKFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNV 366 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759 +FHE+ FPFK L +V P + S + S Sbjct: 367 VFHETTFPFKSFPQSQPALDPFPQSVSPFFYESISPQNLS-------------------- 406 Query: 760 XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939 P + S D T++ H P+R S+T PAY Sbjct: 407 -------SSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQRQSKT---PAY 456 Query: 940 LKEYDCNLL 966 L +Y C L+ Sbjct: 457 LSDYHCYLI 465 >ref|XP_022548561.1| uncharacterized protein LOC111201234 [Brassica napus] Length = 927 Score = 223 bits (567), Expect = 3e-62 Identities = 124/315 (39%), Positives = 174/315 (55%), Gaps = 7/315 (2%) Frame = +1 
Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + K + + F TQ+ T ++ +RSDNAKE + TD + G CP PQQN Sbjct: 657 RTKDEVLRVFPEFITMVETQYKTKVRGVRSDNAKELMFTDLYRAKGIKAFHSCPETPQQN 716 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+LMFQ+ +SL YW +CV AV+LIN+ PSPLL ++SP+++L+ K Sbjct: 717 SVVERKHQHILNVARALMFQSKLSLEYWSDCVLTAVFLINRLPSPLLQDKSPYQLLHKKK 776 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 PDYS +KVFGCL YVST+ +R+KF P + PC+F+GYP G+KG+K+ D+ +SR+V Sbjct: 777 PDYSEIKVFGCLCYVSTSSKNRHKFQPRSRPCLFLGYPAGFKGYKVMDLDTNIISVSRNV 836 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNI-------SDLPSTFDYSXXXXXXXXXXXXX 738 +FHE IFPF + + DLH HD + P + SD+P++ Sbjct: 837 VFHEDIFPFTCSES-DLH-HDLYPNIDPVVVNKTHIASDVPTS----------------- 877 Query: 739 XXXXXXXXXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSR 918 + V T EP+ PT AE + S R Sbjct: 878 -------------------------VNTEIPVVVTDEPVVDSQIPTKAE------KISKR 906 Query: 919 THNPPAYLKEYDCNL 963 T PAYL++Y CN+ Sbjct: 907 TSKQPAYLEDYYCNM 921 >gb|KYP41111.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 476 Score = 214 bits (545), Expect = 4e-62 Identities = 106/220 (48%), Positives = 144/220 (65%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + KSD I F Y QF T IK RSDN +E DF + G HQF C RP+QN Sbjct: 246 QNKSDCIKNIPQIFAYVENQFQTKIKSFRSDNVRELHFKDFFLEKGVLHQFLCVERPKQN 305 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKH H+LN+AR+LMFQ+ V + WG+ V V+++N+TPSP+L++ SP+EILY+K Sbjct: 306 SVVERKHLHILNIARTLMFQSNVPIKIWGDYVKTVVFIMNRTPSPILNHISPYEILYNKV 365 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P+YS + FG L Y ST L R+KF+P A +FIGYP GYKG+KLFD+ + IS+DV Sbjct: 366 PNYSDFRTFGTLCYASTLLSGRHKFSPRAIAAVFIGYPHGYKGYKLFDLTTHQTFISKDV 425 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYS 699 F+E IFPF+ +++ D + PN++ P+TFD S Sbjct: 426 
KFYEHIFPFQNSNSSDRGHGVFNDQITPNLNH-PATFDDS 464 >ref|XP_010526683.1| PREDICTED: uncharacterized protein LOC104804180 isoform X5 [Tarenaya hassleriana] Length = 886 Score = 221 bits (564), Expect = 6e-62 Identities = 121/309 (39%), Positives = 163/309 (52%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KSD F + QF+ +IKC+RSDNA E + G HQF CP+ PQQN Sbjct: 284 KSKSDVLQKFPEFVSFVENQFNASIKCVRSDNAPELGFKSLFAKKGILHQFSCPYTPQQN 343 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 ++VERKHQH+LNVAR+L+FQ+ V L +WG+C+ +VYLIN+TPSPLL N++PFE+L S Sbjct: 344 SIVERKHQHILNVARALLFQSNVPLAFWGDCILTSVYLINRTPSPLLQNKTPFELLTGCS 403 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P YS ++VFGCL YVST R+KF P A +F+GYP G KG+K+ D+ +ISR+V Sbjct: 404 PSYSHLRVFGCLCYVSTLTKDRHKFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNV 463 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759 +FHE+ FPFK L +V P + S + S Sbjct: 464 VFHETTFPFKSFPQSQPALDPFPQSVSPFFYESISPQNLS-------------------- 503 Query: 760 XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939 P + S D T++ H P+R S+T PAY Sbjct: 504 -------SSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQRQSKT---PAY 553 Query: 940 LKEYDCNLL 966 L +Y C L+ Sbjct: 554 LSDYHCYLI 562 >ref|XP_018454140.1| PREDICTED: uncharacterized protein LOC108825334 [Raphanus sativus] Length = 980 Score = 222 bits (566), Expect = 6e-62 Identities = 108/214 (50%), Positives = 143/214 (66%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + K + + AF K T++ +K +RSDNA+E L T F Q G T CP P+QN Sbjct: 656 RTKDEVIQVFPAFVKQVETKYGVRVKSVRSDNAQELLFTKFYQAQGITAYNSCPETPEQN 715 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVARSLMFQ+ V L++WG+CV AV+LIN+TP+ LL N++PFE+L S Sbjct: 716 SVVERKHQHILNVARSLMFQSHVPLSFWGDCVLTAVFLINRTPAKLLHNKTPFEVLNGTS 775 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 PDYS +K FGCL Y ST+ 
R+KF P + CIF+GYP G KG+KL D+ K ISR+V Sbjct: 776 PDYSQLKTFGCLCYGSTSPKQRHKFLPRSRACIFLGYPPGVKGYKLMDLESNKIYISRNV 835 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLP 681 +FHE +FP KKN DLH+ + ++ LP Sbjct: 836 LFHEDLFPLKKN--YDLHVPEWVNPSSEPLATLP 867 >gb|KZV17946.1| hypothetical protein F511_10775 [Dorcoceras hygrometricum] Length = 989 Score = 222 bits (566), Expect = 7e-62 Identities = 104/194 (53%), Positives = 137/194 (70%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KS+ + F + H QF +IK +RSDNA E ++F + G C RPQQN Sbjct: 552 KSKSEVIDIFPTFCRMIHKQFGKSIKSVRSDNAPELKFSEFFKAEGIVAFHSCVERPQQN 611 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+L+FQ+ + L YW EC+ AVYLIN+TP+PLL N++PFE++++K Sbjct: 612 SVVERKHQHILNVARALLFQSGIPLVYWSECILTAVYLINRTPAPLLSNKTPFELMHNKP 671 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P YS ++VFGCL Y ST L R KF+P AT IF+GYP GYKG+KL ++ + ISRDV Sbjct: 672 PTYSHLRVFGCLCYGSTLLNQRTKFSPRATRSIFLGYPPGYKGYKLLNLDTNEVYISRDV 731 Query: 580 IFHESIFPFKKNST 621 IFHE++FPFK ST Sbjct: 732 IFHETVFPFKNKST 745 >emb|CAA72989.1| unnamed protein product [Brassica oleracea var. 
viridis] Length = 1131 Score = 223 bits (568), Expect = 7e-62 Identities = 109/221 (49%), Positives = 141/221 (63%), Gaps = 7/221 (3%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 + KSD H+ F TQ++T IK +R DNA E T+ ++ G CP +QN Sbjct: 613 QSKSDVLHIFPTFVNQIETQYNTKIKSVRRDNAPELSFTELFKEKGIVSYHSCPETLEQN 672 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +V+ERKHQHLLNVAR+LMFQ+ V L YWG+CV A +LIN+TPSPLL N+SP+E+L K+ Sbjct: 673 SVLERKHQHLLNVARALMFQSQVPLQYWGDCVLTAAFLINRTPSPLLANKSPYEVLMGKA 732 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P Y ++ FGCL Y ST+ R+KF P + C+F+GYP GYKG+KL D+ K ISR+V Sbjct: 733 PQYDQLRTFGCLCYGSTSPKQRHKFMPRSRACVFLGYPSGYKGYKLLDLESNKIYISRNV 792 Query: 580 IFHESIFPFKKNSTID---LHLHDPATTV----LPNISDLP 681 FHE IFP K+ +D LH P TV PNIS P Sbjct: 793 TFHEDIFPMAKHQKMDESSLHFFPPKVTVPSAPSPNISSSP 833 >ref|XP_009121252.1| PREDICTED: uncharacterized protein LOC103846085 [Brassica rapa] Length = 860 Score = 221 bits (562), Expect = 8e-62 Identities = 123/303 (40%), Positives = 170/303 (56%), Gaps = 7/303 (2%) Frame = +1 Query: 76 FFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQNAVVERKHQHLLN 255 F TQ+ T ++ +RSDNAKE + TD + G CP PQQN+VVERKHQH+LN Sbjct: 602 FITMVETQYKTKVRGVRSDNAKELMFTDLYRAKGIKAFHSCPETPQQNSVVERKHQHILN 661 Query: 256 VARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKSPDYSLVKVFGCL 435 VAR+LMFQ+ +SL YW +CV AV+LIN+ PSPLL ++SP+++L+ K PDYS +KVFGCL Sbjct: 662 VARALMFQSKLSLEYWSDCVLTAVFLINRLPSPLLQDKSPYQLLHKKKPDYSEIKVFGCL 721 Query: 436 SYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDVIFHESIFPFKKN 615 YVST+ +R+KF P + PC+F+GYP G+KG+K+ D+ +SR+V+FHE IFPF + Sbjct: 722 CYVSTSSKNRHKFQPRSRPCLFLGYPAGFKGYKVMDLDTNIISVSRNVVFHEDIFPFTCS 781 Query: 616 STIDLHLHDPATTVLPNI-------SDLPSTFDYSXXXXXXXXXXXXXXXXXXXXXXXXX 774 + DLH HD + P + SD+P++ Sbjct: 782 ES-DLH-HDLYPNIDPVVVNKTHIASDVPTS----------------------------- 810 Query: 775 XXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAYLKEYD 954 + V T EP+ 
PT AE + S RT PAYL++Y Sbjct: 811 -------------VNTEIPVVVTDEPVVDSQIPTKAE------KISKRTSKQPAYLEDYY 851 Query: 955 CNL 963 CN+ Sbjct: 852 CNM 854 >ref|XP_010526682.1| PREDICTED: uncharacterized protein LOC104804180 isoform X4 [Tarenaya hassleriana] Length = 940 Score = 221 bits (564), Expect = 9e-62 Identities = 121/309 (39%), Positives = 163/309 (52%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K KSD F + QF+ +IKC+RSDNA E + G HQF CP+ PQQN Sbjct: 338 KSKSDVLQKFPEFVSFVENQFNASIKCVRSDNAPELGFKSLFAKKGILHQFSCPYTPQQN 397 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 ++VERKHQH+LNVAR+L+FQ+ V L +WG+C+ +VYLIN+TPSPLL N++PFE+L S Sbjct: 398 SIVERKHQHILNVARALLFQSNVPLAFWGDCILTSVYLINRTPSPLLQNKTPFELLTGCS 457 Query: 400 PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 P YS ++VFGCL YVST R+KF P A +F+GYP G KG+K+ D+ +ISR+V Sbjct: 458 PSYSHLRVFGCLCYVSTLTKDRHKFNPRAMSAVFLGYPHGVKGYKVLDLHSNAVLISRNV 517 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759 +FHE+ FPFK L +V P + S + S Sbjct: 518 VFHETTFPFKSFPQSQPALDPFPQSVSPFFYESISPQNLS-------------------- 557 Query: 760 XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLDLL*SPTAAEPHTVPPRRSSRTHNPPAY 939 P + S D T++ H P+R S+T PAY Sbjct: 558 -------SSSALSPVSQEFPTDPISSLGSSETDSSGFVTSSSAHVTRPQRQSKT---PAY 607 Query: 940 LKEYDCNLL 966 L +Y C L+ Sbjct: 608 LSDYHCYLI 616 >dbj|BAB10503.1| retroelement pol polyprotein-like [Arabidopsis thaliana] Length = 1475 Score = 223 bits (567), Expect = 1e-61 Identities = 106/216 (49%), Positives = 141/216 (65%) Frame = +1 Query: 40 KKKSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFLLTDFLQQNGTTHQFYCPHRPQQN 219 K K+D + F K TQ+ T +K +RSDNA E Q G CP PQQN Sbjct: 681 KAKNDVLQIFPDFLKMVETQYGTLVKAVRSDNAPELRFEALYQAKGIISYHSCPETPQQN 740 Query: 220 AVVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKS 399 +VVERKHQH+LNVAR+LMF+A + L +WG+C+ +AV+LIN+ P+PLL N+SPFE+L+ K Sbjct: 741 SVVERKHQHILNVARALMFEANMPLEFWGDCILSAVFLINRLPTPLLSNKSPFELLHLKV 800 Query: 400 
PDYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDV 579 PDY+ +KVFGCL Y ST+ R+KF P A C+F+GYP GYKG+KL D+ ISR V Sbjct: 801 PDYTSLKVFGCLCYESTSPQQRHKFAPRARACVFLGYPSGYKGYKLLDLETNTIHISRHV 860 Query: 580 IFHESIFPFKKNSTIDLHLHDPATTVLPNISDLPST 687 +F+E++FPF + I + D V NI + PST Sbjct: 861 VFYETVFPFTDKTIIPRDVFDLVDPVHENIENPPST 896 >gb|PRQ38882.1| putative RNA-directed DNA polymerase [Rosa chinensis] Length = 2324 Score = 223 bits (567), Expect = 1e-61 Identities = 127/317 (40%), Positives = 174/317 (54%), Gaps = 8/317 (2%) Frame = +1 Query: 46 KSDASHLIKAFFKYGHTQFSTTIKCIRSDNAKEFL-LTDFLQQNGTTHQFYCPHRPQQNA 222 KS+ +L+K+FF + TQF+ ++ IRSDN EFL + F Q NG HQ C + PQQN Sbjct: 836 KSETQNLLKSFFAFTETQFNQKVQHIRSDNGSEFLSMRSFFQANGIIHQHSCVYTPQQNG 895 Query: 223 VVERKHQHLLNVARSLMFQATVSLTYWGECVSAAVYLINKTPSPLLDNQSPFEILYSKSP 402 VVERKH+H++ +AR+L+FQA + L +W ECV VYLIN+ P+PLL +SPFE ++ + P Sbjct: 896 VVERKHRHIITIARALLFQANLPLEFWAECVLTVVYLINRLPAPLLSGKSPFEKIFQRVP 955 Query: 403 DYSLVKVFGCLSYVSTTLPHRNKFTPCATPCIFIGYPQGYKGFKLFDILHQKFVISRDVI 582 YS ++VFGCL+Y + P + KF P A CIF+GYP G K +KL+D+ +KF SRDV+ Sbjct: 956 QYSHIRVFGCLAYATNVHP-KQKFDPRAHKCIFVGYPFGQKAYKLYDLTTKKFFTSRDVV 1014 Query: 583 FHESIFPFKKNS-TIDLHLHDPATTVLPNISDLPSTFDYSXXXXXXXXXXXXXXXXXXXX 759 FHE IFP+K++S + L HD VLPN+ +P D Sbjct: 1015 FHEDIFPYKQDSPNLSLQPHD---AVLPNV--IPEN-DIPQEPLSASRVSPIEHTLPQVD 1068 Query: 760 XXXXXXXXXXXXXXXXXXIPQNTAVQSTSEPLD-----LL*SPTAAEPHTVPP-RRSSRT 921 P + + +S PLD SP TVP RRS R Sbjct: 1069 NSLSPNVLSDHETHPNDQTPPSPSSHHSSPPLDNSSPSSPSSPPVPNEDTVPALRRSERV 1128 Query: 922 HNPPAYLKEYDCNLLYL 972 P LK+Y C+ + L Sbjct: 1129 RKPNVKLKDYVCSHVVL 1145