BLASTX nr result
ID: Rehmannia24_contig00013236
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00013236
         (923 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|EOY14361.1| Uncharacterized protein TCM_033758 [Theobroma cacao]   248   2e-63
gb|EOY27025.1| Uncharacterized protein TCM_028976 [Theobroma cacao]   231   3e-58
gb|EOY00669.1| Uncharacterized protein TCM_010591 [Theobroma cacao]   188   2e-45
emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]   188   3e-45
gb|EOY04499.1| Uncharacterized protein TCM_019740 [Theobroma cacao]   180   8e-43
gb|EOY09663.1| Uncharacterized protein isoform 2 [Theobroma cacao]    175   2e-41
gb|EOY09662.1| Uncharacterized protein isoform 1 [Theobroma cacao]    175   2e-41
ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662...   174   5e-41
emb|CAN60829.1| hypothetical protein VITISV_012059 [Vitis vinifera]   174   5e-41
gb|EOY21969.1| Integrase, catalytic region, putative [Theobroma ...   172   1e-40
ref|XP_006603149.1| PREDICTED: uncharacterized protein LOC102663...   171   5e-40
ref|XP_004515089.1| PREDICTED: uncharacterized protein LOC101500...   170   6e-40
ref|XP_004501782.1| PREDICTED: uncharacterized protein LOC101501...   170   6e-40
gb|EOX93232.1| Uncharacterized protein TCM_002073 [Theobroma cacao]   169   2e-39
ref|XP_006575768.1| PREDICTED: uncharacterized protein LOC102663...   167   5e-39
gb|ABD32334.2| Polynucleotidyl transferase, Ribonuclease H fold ...   167   5e-39
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf...   167   7e-39
ref|XP_006586460.1| PREDICTED: uncharacterized protein LOC102664...
166 1e-38 ref|XP_006586459.1| PREDICTED: uncharacterized protein LOC102664... 166 1e-38 ref|XP_004236387.1| PREDICTED: uncharacterized protein LOC101254... 165 2e-38 >gb|EOY14361.1| Uncharacterized protein TCM_033758 [Theobroma cacao] Length = 328 Score = 248 bits (634), Expect = 2e-63 Identities = 125/276 (45%), Positives = 172/276 (62%), Gaps = 1/276 (0%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G+IP PD SD L++P RCN+LIL WL+ S++ IAS++ Y+ A +VW+TLK R+SQPD Sbjct: 39 GSIPEPDVSDKLFVPCTRCNSLILAWLLESISPPIASTVFYIRKAYEVWETLKERFSQPD 98 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 RI YFT LN IWEELRNYRPLPHCSCG C ++ D Sbjct: 99 DARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHCSCGICNSACFQTYIDQ 158 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESS 541 D F+FL GLNES+ + R QIL+M P PSL+ Y +++++E QR L +P ESS Sbjct: 159 YQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDESQRNLYLHTMPIIESS 218 Query: 542 ALAVGTHPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKSKNVPGKSN-GPNH 718 A+A T K K K DV+C +C K GH+ DKC+R+IGFPP+FKF K K+ K N + Sbjct: 219 AMATMTE-GKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKFLKGKSPLKKGNVWSIN 277 Query: 719 SANCVPNQEAPAASSENTKLFSFTQKQVQKLMTLLN 826 + V ++E S+++ + ++ Q+QKLM+L+N Sbjct: 278 NVGPVTSKEECDESTKSLSSLTLSKHQIQKLMSLIN 313 >gb|EOY27025.1| Uncharacterized protein TCM_028976 [Theobroma cacao] Length = 318 Score = 231 bits (589), Expect = 3e-58 Identities = 110/228 (48%), Positives = 144/228 (63%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G+IP P +D L+ W RCNNLI++WL+NS+++ IAS+I +M S ++W+TLKL Y+QPD Sbjct: 76 GSIPKPSITDDLHPIWNRCNNLIVSWLLNSISQPIASTIFFMESVAEIWNTLKLNYAQPD 135 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 + + YF L IWEELRNYRPLPHC CG+C + D Sbjct: 136 NTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCECGKCNANCFKKFSDQ 195 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESS 541 D F+FL GLNES+ + R QI+LM+PIPSLD VY+M+L+EE 
Q+ L P ES Sbjct: 196 YQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQKNMFLQSQPFLESL 255 Query: 542 ALAVGTHPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKSK 685 A+ T+ KK K D+ C HCGK GH +KC+RII FP +FKFTK K Sbjct: 256 AMLAATNVKKKPMK-DLTCTHCGKKGHVKEKCYRIIRFPEDFKFTKGK 302 >gb|EOY00669.1| Uncharacterized protein TCM_010591 [Theobroma cacao] Length = 336 Score = 188 bits (478), Expect = 2e-45 Identities = 105/276 (38%), Positives = 153/276 (55%), Gaps = 1/276 (0%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 GTI P ++ L+ W RCN LI+TWL+ S+T +IAS+++ M+SAK++ +TLK R+SQP Sbjct: 64 GTIKKPSEANSLFEDWSRCNILIVTWLLESLTPKIASNVLDMDSAKEILETLKNRFSQPY 123 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 I YFT LN++W+EL+N+RPLP C + D Sbjct: 124 ETIICNLQFQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQCDYEGRKNNCYKKYADQ 183 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESS 541 Q D F FL GLNES+ R IL++ P S+D Y++++++ QR L P S+ Sbjct: 184 QNKDAVFCFLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKMLQRSLILQ-SPVENST 242 Query: 542 ALAVGTHPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKSK-NVPGKSNGPNH 718 V T +K T+++C HCGK GHS +K + IIGFP NFKFTK K N+ + N Sbjct: 243 MATVITEEKRK--NTNLVCSHCGKKGHSKEKYYCIIGFPENFKFTKLKRNMRKGGSSVNS 300 Query: 719 SANCVPNQEAPAASSENTKLFSFTQKQVQKLMTLLN 826 + + E + + S T+ Q+QKLMTL++ Sbjct: 301 AISGSEQDEYDETVTNSISQLSLTKAQIQKLMTLIS 336 >emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera] Length = 970 Score = 188 bits (477), Expect = 3e-45 Identities = 96/283 (33%), Positives = 162/283 (57%), Gaps = 8/283 (2%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G+IP P+ D L+ W+RCN+++++W++NSV K+IA S++Y ++A +W+ L+ R+ Q + Sbjct: 72 GSIPCPESDDLLFGTWIRCNSMVISWILNSVHKDIADSLLYFDTAVGIWNDLRDRFCQSN 131 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 RI Y+T L +W+EL+ ++PLP C+CG +++ + Sbjct: 132 GPRIFQIKKHLIALSQGSLDVSTYYTRLKILWDELKGFQPLPECACG-----TMKTWMEF 186 Query: 362 
QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARL-------SF 520 Q +Y +FLMGLNES+ TR QIL+M P+P + V++++ Q+ERQ S Sbjct: 187 QQQEYVMQFLMGLNESFVQTRSQILMMEPLPPIAKVFSLVAQDERQCSINYGLYTPPDSV 246 Query: 521 IPSSESSALAVGTHPSKKKFKTD-VICQHCGKPGHSIDKCFRIIGFPPNFKFTKSKNVPG 697 + +S +A+ K K D C HCG GH++DKC+++ G+PP +KF KSKN Sbjct: 247 AANDSNSTVAISAARLNSKPKKDRPTCSHCGILGHTVDKCYKLYGYPPGYKF-KSKNPHA 305 Query: 698 KSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQKLMTLLN 826 K+ AN ++ A+++ ++ L S + Q Q+L+ LL+ Sbjct: 306 KA-----QANQTSSRTTEASATADSPLVSLSPAQCQQLIALLS 343 >gb|EOY04499.1| Uncharacterized protein TCM_019740 [Theobroma cacao] Length = 211 Score = 180 bits (456), Expect = 8e-43 Identities = 93/213 (43%), Positives = 123/213 (57%), Gaps = 6/213 (2%) Frame = +2 Query: 125 MNSAKDVWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPL 304 MNSA D+W TLK +SQPD RI YFT LN IWEEL+NYRPL Sbjct: 1 MNSAADIWQTLKNHFSQPDDTRICNLQYSLCNITQDTRPVDSYFTKLNGIWEELKNYRPL 60 Query: 305 PHCSCGQCTCQAIRSVGDIQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLL 484 P+C CG+CT + ++ D F+FL GLNES+ + R I+++ P PSLD Y ++L Sbjct: 61 PYCECGKCTQSCFQKYIELWEKDRVFRFLNGLNESFSALRSHIIMIKPFPSLDEAYNLVL 120 Query: 485 QEERQREARLSFIPSSESSALAVGTHPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPN 664 +EE QR + P +++ +AV T SK + K +V+C HC K GH +KC+ IIGFPP+ Sbjct: 121 REESQRSILMQSQPLLDTTVVAVVTE-SKIRVKNEVVCSHCAKNGHVKEKCYCIIGFPPD 179 Query: 665 FKFTKSKN------VPGKSNGPNHSANCVPNQE 745 FKFTK K + +N N S V NQE Sbjct: 180 FKFTKGKGNFSRKAMSAVANSTNQSQ--VENQE 210 >gb|EOY09663.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 260 Score = 175 bits (444), Expect = 2e-41 Identities = 99/246 (40%), Positives = 134/246 (54%), Gaps = 12/246 (4%) Frame = +2 Query: 125 MNSAKDVWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPL 304 M+SA ++W+TLK ++QPD R+ YF L IWEELRNYRPL Sbjct: 1 MDSAAEIWNTLKQNFAQPDDTRVCNLQYTLGNVSQGARTVDVYFIELKGIWEELRNYRPL 60 Query: 305 PHCSCGQCTCQAIRSVGDIQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLL 484 PHC CG + D D F+FL GLN+S+ + R QILLM+PIP LD VY+++L Sbjct: 61 
PHCECGSYNPGCFKKYTDQFQKDMVFRFLNGLNKSFSAIRSQILLMDPIPGLDKVYSLIL 120 Query: 485 QEERQREARLSFIPSSESSALAVGTHPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPN 664 +EE QR + P ES A+ +KKK + D+IC HCGK GH+ DKC++II F + Sbjct: 121 REESQRNILVQPQPLLESFAMFTAA-DNKKKARKDIICNHCGKKGHTKDKCYKIISFLDD 179 Query: 665 FKFTKS------------KNVPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQK 808 FKFTK NV S+ S + V +E A++ +L S ++QV K Sbjct: 180 FKFTKGGRSNPRKGKNLVNNVFAVSDASTDSESQVETKEEQASAGFVCQL-SMIKQQVNK 238 Query: 809 LMTLLN 826 LM L+ Sbjct: 239 LMQFLS 244 >gb|EOY09662.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 263 Score = 175 bits (444), Expect = 2e-41 Identities = 99/246 (40%), Positives = 134/246 (54%), Gaps = 12/246 (4%) Frame = +2 Query: 125 MNSAKDVWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPL 304 M+SA ++W+TLK ++QPD R+ YF L IWEELRNYRPL Sbjct: 1 MDSAAEIWNTLKQNFAQPDDTRVCNLQYTLGNVSQGARTVDVYFIELKGIWEELRNYRPL 60 Query: 305 PHCSCGQCTCQAIRSVGDIQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLL 484 PHC CG + D D F+FL GLN+S+ + R QILLM+PIP LD VY+++L Sbjct: 61 PHCECGSYNPGCFKKYTDQFQKDMVFRFLNGLNKSFSAIRSQILLMDPIPGLDKVYSLIL 120 Query: 485 QEERQREARLSFIPSSESSALAVGTHPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPN 664 +EE QR + P ES A+ +KKK + D+IC HCGK GH+ DKC++II F + Sbjct: 121 REESQRNILVQPQPLLESFAMFTAA-DNKKKARKDIICNHCGKKGHTKDKCYKIISFLDD 179 Query: 665 FKFTKS------------KNVPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQK 808 FKFTK NV S+ S + V +E A++ +L S ++QV K Sbjct: 180 FKFTKGGRSNPRKGKNLVNNVFAVSDASTDSESQVETKEEQASAGFVCQL-SMIKQQVNK 238 Query: 809 LMTLLN 826 LM L+ Sbjct: 239 LMQFLS 244 >ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max] Length = 424 Score = 174 bits (440), Expect = 5e-41 Identities = 98/296 (33%), Positives = 154/296 (52%), Gaps = 22/296 (7%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 GT+ P SDPLY PWLRCNNL+L+WL S ++EIA S+++ + A VW +L+ R+SQ D Sbjct: 60 GTLSPPPISDPLYEPWLRCNNLVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGD 119 Query: 182 
SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCG-QCTCQAIRSVGD 358 R+ YFT L T+WEE+ N+RP+ C+C C+C A + Sbjct: 120 IFRVADIQEEVACLQQGTLDISSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRK 179 Query: 359 IQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPS--S 532 + D KFL GL + Y R QI+LM+P+P+LD + ++LQ+ERQ + S + Sbjct: 180 FKEQDKVIKFLKGLGDQYSHVRSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIEN 239 Query: 533 ESSALAVGTHPSKKKFKT--------------DVICQHCGKPGHSIDKCFRIIGFPPNFK 670 +SS PS+ + + +C HC + H+++ CF G+PP F+ Sbjct: 240 QSSVNHFSQTPSRPSNNSGCGRGRGYSSGGRGNRLCTHCNRTNHTVETCFIKHGYPPGFQ 299 Query: 671 FTKSKNVPGKSNGPNH-----SANCVPNQEAPAASSENTKLFSFTQKQVQKLMTLL 823 KS N G ++ N SA+ + A +++ ++ S Q+Q +++ LL Sbjct: 300 HRKS-NSSGNASVVNSVQDAGSAHISSSSSASTSTNGSSASLSTIQEQYTQILQLL 354 >emb|CAN60829.1| hypothetical protein VITISV_012059 [Vitis vinifera] Length = 1128 Score = 174 bits (440), Expect = 5e-41 Identities = 89/250 (35%), Positives = 141/250 (56%), Gaps = 13/250 (5%) Frame = +2 Query: 59 NNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXXXXXXXXXXXXX 238 NNL+L+WL+NS+ KEI S++Y +A D+W+ LK+RY + D R+ Sbjct: 50 NNLVLSWLMNSIAKEIRGSLLYFTNAFDIWEELKIRYLRSDGPRVFSLEKSLSSISQNSK 109 Query: 239 XXXDYFTSLNTIWEELRNYRPLPHCSCG---QCTCQAIRSVGDIQLSDYTFKFLMGLNES 409 +YF+ +W+E +Y P+P C CG +C+C ++ + D Q S+Y KFLMGL++S Sbjct: 110 SITEYFSEFKALWDEYISYHPIPSCRCGNLNRCSCNILKDLTDRQQSNYVMKFLMGLHDS 169 Query: 410 YDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESSALAV----------GT 559 Y + R Q+L +P+ S+ V+++LLQEE QR + S +S A+ V T Sbjct: 170 YSAIRSQLLPQSPLLSMSRVFSLLLQEESQRSLTNAVGISIDSQAMVVEQSSRIVSTSNT 229 Query: 560 HPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKSKNVPGKSNGPNHSANCVPN 739 +K+K K++ I HCG GH +DKCF++IG+PP +K + K ++ P + N Sbjct: 230 QFTKQKGKSNAIYSHCGYSGHLVDKCFQLIGYPPGWKEPRGKRF---NSTPTTTKNF--- 283 Query: 740 QEAPAASSEN 769 Q P A++ N Sbjct: 284 QRLPTANNTN 293 >gb|EOY21969.1| Integrase, catalytic region, putative [Theobroma cacao] Length = 242 Score = 172 bits (437), Expect = 1e-40 Identities = 81/167 (48%), Positives = 106/167 (63%) Frame = +2 Query: 
2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 GTI P +DPLY W+RCNNLI+ WL++S+T IAS+I YM+S D+W+TLK ++QPD Sbjct: 72 GTISKPQPTDPLYPSWIRCNNLIVAWLLDSITPPIASTIFYMDSVVDIWNTLKQSFAQPD 131 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 R+ YF L IWEELRNYRPLPHC CG+ + + R D Sbjct: 132 DSRVCNLQYTLGNVTQGTRSVDSYFIELKGIWEELRNYRPLPHCVCGKYSPECFRRYSDQ 191 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQR 502 D F+FL GLN+ + + R QI+LM+PIPSLD VY ++L+EE QR Sbjct: 192 YQKDMVFRFLNGLNDFFSAVRSQIILMDPIPSLDKVYNLVLREEAQR 238 >ref|XP_006603149.1| PREDICTED: uncharacterized protein LOC102663081 [Glycine max] Length = 415 Score = 171 bits (432), Expect = 5e-40 Identities = 101/313 (32%), Positives = 156/313 (49%), Gaps = 39/313 (12%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G P P +D Y W RCNN++++W+++SV+ I SI++MN A+++W+ LK RY+Q D Sbjct: 16 GNAPEPLKTDRTYGVWSRCNNMVVSWIMHSVSVAIRQSILWMNKAEEIWNDLKSRYTQGD 75 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCG-QCTCQAIRSVGD 358 +RI +YFT L IW+E+ N+RP P CSC +CTC + + Sbjct: 76 LLRISDLQQEASSMKQGTLSVTEYFTKLRIIWDEIENFRPDPRCSCTIKCTCSVLTIIAQ 135 Query: 359 IQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIP---- 526 +L D +FL GLNE Y++ R +LLM P+P++ +++ + Q+ERQ +F+P Sbjct: 136 RKLEDRAMQFLRGLNEQYNNVRSHVLLMEPMPTIPKIFSYVAQQERQLSGN-NFLPNFSL 194 Query: 527 -SSESSALAV-------------------------GTHPSKKK---FKTDVICQHCGKPG 619 S E++++ V ++ + K + C HCGK G Sbjct: 195 ESKENASINVVKITCEFCGRIGHTESVCYKKYGVPSSYEGRSKSYNTRNGKACTHCGKIG 254 Query: 620 HSIDKCFRIIGFPPNFKFTKSKNVPGKSNGPNHSANCVPNQEAPAASS-----ENTKLFS 784 H+ID CF+ FPP +KF SK V AN + E A S E+ L Sbjct: 255 HTIDVCFKKHRFPPGYKFGNSKVV----------ANNIVAVEGKATSDQMQRHESHDLVR 304 Query: 785 FTQKQVQKLMTLL 823 F+ +Q Q L L+ Sbjct: 305 FSPEQDQALFALI 317 >ref|XP_004515089.1| PREDICTED: uncharacterized protein LOC101500638 [Cicer arietinum] Length = 379 Score = 170 bits (431), Expect = 6e-40 Identities = 95/275 (34%), Positives = 148/275 (53%), 
Gaps = 11/275 (4%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 GTI P +D L + W RCN ++++W+ NS+ +IA SI++M+SA ++W L RY Q D Sbjct: 86 GTISRPKDTDRLSMAWDRCNTMVMSWIRNSLESDIAQSIMWMDSAAEIWHELNDRYHQGD 145 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQ-CTCQAIRSVGD 358 RI YFT+L +W+EL N+ PLP CSC C+C + + + Sbjct: 146 IFRISDLQEEIYGLRQGDSSITIYFTNLKKLWQELENFFPLPSCSCTPTCSCNLLPKIRE 205 Query: 359 IQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQ-----REARLSFI 523 + +DY FL GLNE Y R QI+LM P+P++ V++MLLQ+ERQ E + + Sbjct: 206 YRENDYVIHFLKGLNEQYSPVRSQIMLMEPLPTISKVFSMLLQQERQFFSHTEELKTVAV 265 Query: 524 PSSESSALAVGT-----HPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKSKN 688 S+ S G+ S + + IC HC K GH +D CF+ G+P N+ + S Sbjct: 266 VSNHSRGFGRGSSLGSGRGSGSRGRGYKICTHCNKSGHMVDVCFKKHGYPLNYPRSNS-- 323 Query: 689 VPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQ 793 G SN + ++ + + +A+S+++ L + TQ Sbjct: 324 --GASNNCSSTSPDIEDAHT-SATSDSSSLDNATQ 355 >ref|XP_004501782.1| PREDICTED: uncharacterized protein LOC101501608 [Cicer arietinum] Length = 362 Score = 170 bits (431), Expect = 6e-40 Identities = 102/304 (33%), Positives = 149/304 (49%), Gaps = 30/304 (9%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G+IP PD + + W RCNNL+L+W+ + V+ EIA+SI+++++A W LK R+SQ D Sbjct: 62 GSIPCPDAPNQMIPAWKRCNNLVLSWINHFVSHEIATSILWIDTAAAAWKDLKDRFSQGD 121 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHC-SCGQCTCQAIRSVGD 358 SVRI Y+T + +W+EL NYRP+P C S C C +++ Sbjct: 122 SVRISQLHQDLYSMHQSDLTVTAYYTKMKILWDELCNYRPIPECQSVTLCCCDVSKTLKK 181 Query: 359 IQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSES 538 + +D FL GLN++Y + R QILLM+P+PSL +++M++Q+ERQ L P ES Sbjct: 182 YRDNDCVLCFLRGLNDNYSAVRSQILLMDPLPSLTKIFSMIIQQERQ----LQTSPLPES 237 Query: 539 SALAV---------------------------GTHPSKKKFKTDV--ICQHCGKPGHSID 631 S +A G P K V C HCG+ H+ID Sbjct: 238 SVMAAQVPQQVSYQNKPSYSSSNSGRGKASYQGNQPRHSGGKVGVNRQCTHCGRTNHTID 297 Query: 632 
KCFRIIGFPPNFKFTKSKNVPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQKL 811 CF I G PP FK K N+ SS ++ + +Q+Q+Q L Sbjct: 298 TCFLIHGLPPGFKSKKVHNI-----------------TTYLTSSCDSSVLGLSQEQIQSL 340 Query: 812 MTLL 823 + LL Sbjct: 341 LALL 344 >gb|EOX93232.1| Uncharacterized protein TCM_002073 [Theobroma cacao] Length = 817 Score = 169 bits (427), Expect = 2e-39 Identities = 100/269 (37%), Positives = 142/269 (52%), Gaps = 8/269 (2%) Frame = +2 Query: 38 YIPWLRCNNLIL-----TWLIN-SVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXX 199 Y+ W R L L T IN S+ K + +Y +W+TLK Y+QPD R+ Sbjct: 50 YVAWSRSFILALSIRNKTGFINGSIPKPATTDPLY-----HMWNTLKQNYAQPDDTRLCN 104 Query: 200 XXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDIQLSDYT 379 YF L + EE+R+YRPLPHC CG+C + D D Sbjct: 105 LQYTLGNITQGTRSVDSYFIELKAVREEIRSYRPLPHCECGRCNANCFKRYIDQYHKDMV 164 Query: 380 FKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESSALAVGT 559 F+FL GLNES+ + R I+LM+PIP+LD VY +L+EE Q+ ESS + + T Sbjct: 165 FRFLNGLNESFSAIRSHIILMDPIPTLDRVYNFMLREETQKNLLFQSQSVLESSTM-LTT 223 Query: 560 HPSKKKFKTDVICQHCGKPGHSIDKCFRIIGFPPNFKFT--KSKNVPGKSNGPNHSANCV 733 SKKK K D++C HCGK GH+ +KC+R+IGFP +FKFT K+ GK+ N +A+ Sbjct: 224 TDSKKKLKKDLVCSHCGKKGHNKEKCYRLIGFPYDFKFTTRKANIKKGKTAVNNVTASNE 283 Query: 734 PNQEAPAASSENTKLFSFTQKQVQKLMTL 820 + + S+ + S +Q+ Q L+ L Sbjct: 284 ISIDEFQVDSDGKGISSNSQQGKQSLVNL 312 >ref|XP_006575768.1| PREDICTED: uncharacterized protein LOC102663845 [Glycine max] Length = 482 Score = 167 bits (423), Expect = 5e-39 Identities = 94/303 (31%), Positives = 151/303 (49%), Gaps = 29/303 (9%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 GT P P +D LY W RCNN++++W+++SV I S+++M+ A+D+W LK RYSQ D Sbjct: 57 GTAPEPLKTDRLYGAWRRCNNMVVSWIVHSVATSIRQSVLWMDKAEDIWRDLKSRYSQGD 116 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCG-QCTCQAIRSVGD 358 +RI +YFT L IW+E+ ++RP P C+C +C+C +G Sbjct: 117 LLRISDLQQEASTLKQGALSITEYFTRLRVIWDEIESFRPDPICTCNVRCSCSVSTIIGQ 176 Query: 359 
IQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSES 538 +L D +FL GLNE Y + R +LLM+PIP + +++ + Q+ERQ S + E+ Sbjct: 177 RKLEDRAMQFLRGLNEQYTNIRSHVLLMDPIPPISKIFSYVAQQERQLLGNCSPNLNFEA 236 Query: 539 SALAVGTHPS-------------------------KKKFKTD---VICQHCGKPGHSIDK 634 +++ T S + ++K++ C HCGK GH++D Sbjct: 237 KEISINTARSACEYCGRSGHTESVCYKKHGMPSSHETRYKSNGGRKTCTHCGKMGHTVDV 296 Query: 635 CFRIIGFPPNFKFTKSKNVPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQKLM 814 C+R G+PP + K G++ N QE E L F+ +Q + L+ Sbjct: 297 CYRKHGYPPGY-----KPYNGRTTVNNMVTMNDKFQEDQTQHHEAQDLVRFSPEQHKALL 351 Query: 815 TLL 823 L+ Sbjct: 352 ALI 354 >gb|ABD32334.2| Polynucleotidyl transferase, Ribonuclease H fold [Medicago truncatula] Length = 772 Score = 167 bits (423), Expect = 5e-39 Identities = 93/287 (32%), Positives = 153/287 (53%), Gaps = 13/287 (4%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G++P P D W RCNNLIL+W+INSV+ +IA +I++ A DVW L+ R+S+ D Sbjct: 63 GSVPIPPMDDLNRTAWERCNNLILSWIINSVSPQIAQTIVFHEYAIDVWIELQERFSKVD 122 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCG-QCTCQAIRSVGD 358 +R+ DYFTS+ ++WEEL ++RP+P C+C C C+++R+ D Sbjct: 123 RIRVASLRSSINNLKQGDKSVLDYFTSIKSLWEELNSHRPMPMCTCPYPCRCESMRAARD 182 Query: 359 IQLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSES 538 ++ D +FL GLN+S+ + Q+LL++P+PS++ VY+M++QEE S + S+E Sbjct: 183 FRMEDQVIQFLTGLNDSFSVVKTQVLLIDPLPSINKVYSMVIQEESNIIPPTS-LASNED 241 Query: 539 SALAVGTHPSKKKF------------KTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKS 682 S++ V ++K F C C + H+++ C+ FP K T S Sbjct: 242 SSILVNASDARKPFLRGKSSGTSQSKNNSRYCTFCRRNNHTVEYCYLKHDFPNANKPTAS 301 Query: 683 KNVPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQKLMTLL 823 N H+ + + E ++SS+ TQ+Q L++LL Sbjct: 302 SNAVTS----EHAVDSHTSSEGTSSSSQT----GLTQEQYVHLVSLL 340 >gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm, score: 11.19) [Arabidopsis thaliana] Length = 1633 Score = 167 bits (422), Expect = 7e-39 Identities = 90/272 (33%), Positives = 135/272 (49%), Gaps = 4/272 (1%) 
Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 GTI P Y W RCN+ + TWL+NSV+K+I S++++ +A+ +W + R+ Q D Sbjct: 80 GTIVKPPLDHRDYGAWSRCNDTVSTWLMNSVSKKIGQSLLFIPTAEGIWKNMLSRFKQDD 139 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 + R+ Y+T L T+WEE +NY LP C+CG+C C A + Sbjct: 140 APRVYDIEQRLSKIEQGSMDISAYYTELQTLWEEHKNYVDLPVCTCGRCECDAAVKWERL 199 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESS 541 Q + KFLMGLNESY+ TR IL++ PI +++ + ++ Q+ERQ+ R Sbjct: 200 QQRSHVTKFLMGLNESYEQTRRHILMLKPIRTIEEAFNIVTQDERQKAIR---------- 249 Query: 542 ALAVGTHPSKKKFKTD----VICQHCGKPGHSIDKCFRIIGFPPNFKFTKSKNVPGKSNG 709 P+ K D +C +CGK GH++ KC++IIG+PP +K S P Sbjct: 250 -------PTPKVDNQDQLKLPLCTNCGKVGHTVQKCYKIIGYPPGYKAATSYRQPQIQTQ 302 Query: 710 PNHSANCVPNQEAPAASSENTKLFSFTQKQVQ 805 P +P Q P L S QV+ Sbjct: 303 PRMQ---MPQQSQPRMQQPIQHLISQFNAQVR 331 >ref|XP_006586460.1| PREDICTED: uncharacterized protein LOC102664915 [Glycine max] Length = 393 Score = 166 bits (420), Expect = 1e-38 Identities = 90/297 (30%), Positives = 154/297 (51%), Gaps = 31/297 (10%) Frame = +2 Query: 26 SDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPDSVRIXXXX 205 SD Y W RCNN++++WL++SV+ I S+++M+ A+++W+ LK RY+Q D +R+ Sbjct: 64 SDRTYGAWSRCNNIVVSWLVHSVSISIRQSVLWMDRAEEIWNDLKSRYAQGDLLRVSELQ 123 Query: 206 XXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCG-QCTCQAIRSVGDIQLSDYTF 382 YFT L IW+E+ N+RP P C C +CTC + ++ + D+ Sbjct: 124 QEASSIKQGSLSVTKYFTKLRVIWDEIENFRPDPICRCTVKCTCLVLTTMAQRKREDHAM 183 Query: 383 KFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSF----IPSSESSAL- 547 +FL GLNE Y + R +LLM+PIP++ +++ + Q+ERQ S + S E S++ Sbjct: 184 QFLRGLNEQYSNIRSHVLLMDPIPTIPKIFSYVAQQERQLTGNNSISSFNLESKEGSSIN 243 Query: 548 ----------AVGTHPS---------------KKKFKTDVICQHCGKPGHSIDKCFRIIG 652 +G + S K + T IC +CGK GH+++ C++ G Sbjct: 244 AVKSVCEFCGCIGHNESICYKKNGLPPNYDGKGKGYNTRKICTYCGKLGHTVEVCYKKHG 303 Query: 653 FPPNFKFTKSKNVPGKSNGPNHSANCVPNQEAPAASSENTKLFSFTQKQVQKLMTLL 823 +PP FKF + + +N + + + P S 
E+ +L F+ +Q + L+ L+ Sbjct: 304 YPPGFKFNNGRTM---ANNVVAAEGKATDDQIP--SQESQELVRFSPEQYKALLALI 355 >ref|XP_006586459.1| PREDICTED: uncharacterized protein LOC102664411 [Glycine max] Length = 265 Score = 166 bits (419), Expect = 1e-38 Identities = 82/231 (35%), Positives = 130/231 (56%), Gaps = 9/231 (3%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G++P P +DP Y W R NN I++WL N V+K+I +SI++ N K++WD LK+R+S+ + Sbjct: 10 GSLPMPTTTDPTYAAWTRGNNAIISWLYNFVSKDIITSILFANMTKEIWDDLKIRFSRKN 69 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 +RI Y+T L ++ EEL Y+P C G ++++ D Sbjct: 70 GLRIFQLRRQLMSSQQGNYDVSTYYTKLKSVLEELSGYKPTFQCKRG-----GLQTLQDY 124 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESS 541 +Y FLMGLN+S+ RGQILL +P P + V+++++QEE QRE ++ IPS S+ Sbjct: 125 NEFEYVMSFLMGLNDSFAQIRGQILLSDPFPPVGNVFSLVIQEEAQREIIVNHIPSLNSN 184 Query: 542 ALAVGTHPSKKKF---------KTDVICQHCGKPGHSIDKCFRIIGFPPNF 667 +A + + K K C +C GH+ DKCF+++G+PPN+ Sbjct: 185 NMAFVVNSTTKNTTNGKSRNPKKERPQCAYCNMLGHTKDKCFKLVGYPPNY 235 >ref|XP_004236387.1| PREDICTED: uncharacterized protein LOC101254987 [Solanum lycopersicum] Length = 620 Score = 165 bits (418), Expect = 2e-38 Identities = 82/247 (33%), Positives = 134/247 (54%), Gaps = 12/247 (4%) Frame = +2 Query: 2 GTIPAPDFSDPLYIPWLRCNNLILTWLINSVTKEIASSIIYMNSAKDVWDTLKLRYSQPD 181 G++ P S P Y W RCN+++++WL+NS++K+IA S++Y +AKD+W L+ R+ Q + Sbjct: 79 GSLSEPAVSSPTYKAWNRCNDMVISWLLNSLSKDIAESVLYSKTAKDIWKELEDRFGQCN 138 Query: 182 SVRIXXXXXXXXXXXXXXXXXXDYFTSLNTIWEELRNYRPLPHCSCGQCTCQAIRSVGDI 361 ++ Y+T + IW+EL + HC+C C+C Sbjct: 139 GAKLFQLQKELSDLVQGNSDVAGYYTKVKRIWDELDSLDTCAHCTCA-CSCGGKNRTLKS 197 Query: 362 QLSDYTFKFLMGLNESYDSTRGQILLMNPIPSLDTVYAMLLQEERQREARLSFIPSSESS 541 +FLMGLN++Y S R IL+++P+PS++ Y++L+Q+E+QRE +F ES+ Sbjct: 198 HQDGRLIQFLMGLNDTYSSVRSNILMISPLPSVNQAYSLLIQDEKQREIH-TFQHPIESA 256 Query: 542 ALAVGTHPSKKKF------------KTDVICQHCGKPGHSIDKCFRIIGFPPNFKFTKSK 685 +A +KF K ++ C +C K H+ C+R+IGFP +FKFTK K 
Sbjct: 257 FMAARQQYGVQKFNTLEKKGNFEESKNNLFCTYCKKTRHTAQNCYRLIGFPADFKFTKGK 316 Query: 686 NVPGKSN 706 +SN Sbjct: 317 KSQTQSN 323