BLASTX nr result
ID: Rehmannia28_contig00025454
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00025454
         (1293 letters)

Database: ./nr
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699...  422  e-142
ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697...  422  e-141
ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobrom...  314  e-102
ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobrom...  282  4e-89
gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family ...  271  2e-83
ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobrom...  265  2e-82
ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955...  265  3e-80
gb|KHN46305.1| hypothetical protein glysoja_045316, partial [Gly...  256  1e-79
ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954...  267  2e-79
ref|XP_015381157.1| PREDICTED: uncharacterized protein LOC107174...  253  4e-78
ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321...  254  4e-78
emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]  268  1e-77
gb|KYP45295.1| hypothetical protein KK1_033170 [Cajanus cajan]       251  2e-77
ref|XP_015389606.1| PREDICTED: uncharacterized protein LOC107178...  251  2e-77
gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposo...  255  3e-77
ref|XP_010662801.1| PREDICTED: uncharacterized protein LOC104882...  260  1e-76
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...  254  2e-76
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...  251  4e-76
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...
253 1e-75 ref|XP_007037468.1| Integrase, catalytic region, putative [Theob... 244 1e-75 >ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699729, partial [Phoenix dactylifera] Length = 490 Score = 422 bits (1086), Expect = e-142 Identities = 199/341 (58%), Positives = 266/341 (78%), Gaps = 1/341 (0%) Frame = +1 Query: 199 SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378 +P+EDPN F+LHHTD+A TV+++PPL GSNY++W R+F+LA+SIKNKLG LDGSI TP Sbjct: 20 TPSEDPNSPFFLHHTDNAQTVIVTPPLVGSNYLSWSRSFSLAISIKNKLGFLDGSISTPE 79 Query: 379 SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558 D LYIPWLRCNNLIL+WLLNS+SKEIASN+L+I SAKE+W+KLK+RF+QPDN+RI+QL Sbjct: 80 VTDPLYIPWLRCNNLILAWLLNSISKEIASNVLFIKSAKEVWNKLKSRFAQPDNVRIYQL 139 Query: 559 QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738 +Q+LSSI Q + +VSEYFTQLNA+WEEL+NYRP+P+CSCG C C AL+ GE D++F Sbjct: 140 KQQLSSITQRSLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCICDALKGVGEDLELDHIF 199 Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYN 918 +FLMGLN++Y+ +RGQI+L+SP PSLDK FS++LQEERQR+AR I P +S + A N Sbjct: 200 QFLMGLNDTYDTVRGQIILMSPLPSLDKTFSLVLQEERQRQARAIIFPAPESSALAAVLN 259 Query: 919 SEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKHSANISSSQDVH 1098 K K +++ C HCGK GH+++KCYRLIGFPPNFKFTK K + K A S++Q + Sbjct: 260 --KSKNRAEITCYHCGKSGHTKEKCYRLIGFPPNFKFTKTKFPSVNNKSVAPHSANQVIS 317 Query: 1099 SGSNDSHGSGMI-FTQDQVQKLMALINKDGMQPVSSGTSSS 1218 S + + +Q Q+Q+L+AL+N G+ +S ++S+ Sbjct: 318 STQGKGLSAPQLSLSQTQIQQLLALVN-SGIPQMSLNSAST 357 >ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697258 [Phoenix dactylifera] Length = 514 Score = 422 bits (1086), Expect = e-141 Identities = 201/345 (58%), Positives = 265/345 (76%), Gaps = 5/345 (1%) Frame = +1 Query: 199 SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378 +P+EDPN F+LH TD+A TV+++PPL GSNY++W R+F+LA+SIKNKLG LDGSIPTP Sbjct: 20 TPSEDPNSPFFLHRTDNAQTVIVTPPLIGSNYLSWSRSFSLAISIKNKLGFLDGSIPTPE 79 Query: 379 
SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558 D LY+PWLRCNNLIL+WLLNS+SKEIASN+L+I S KE+W+KLK+RF+QPDN+RI+QL Sbjct: 80 VTDPLYVPWLRCNNLILAWLLNSISKEIASNVLFIKSTKEVWNKLKSRFAQPDNVRIYQL 139 Query: 559 QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738 +Q+LSSI QGT +VSEYFTQLNA+WEEL+NYRP+P+CSCG C C AL+ GE DY+F Sbjct: 140 KQQLSSITQGTLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCICDALKGVGENLELDYIF 199 Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYN 918 +FLM LN +++ +RGQI+L+SP PSLDK FS++LQEERQR+AR I P +S + A N Sbjct: 200 QFLMELNNTFDSVRGQIILMSPLPSLDKTFSLVLQEERQRQARAIIFPAPESSALAAVLN 259 Query: 919 SEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKHSANISSSQDVH 1098 K K + + C HCGK GH+R+KCYRLIGFPPNFKFTK K + K A+ S++Q + Sbjct: 260 KPKNK--AKITCYHCGKPGHTREKCYRLIGFPPNFKFTKTKSPSVNNKSVASHSANQVI- 316 Query: 1099 SGSNDSHGSGMI-----FTQDQVQKLMALINKDGMQPVSSGTSSS 1218 + + G G+ +Q QVQ+L AL+N G+ ++ ++SS Sbjct: 317 ---SPTQGKGLAAPQLSLSQAQVQQLFALVN-SGITQLNLNSASS 357 >ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobroma cacao] gi|508779769|gb|EOY27025.1| Uncharacterized protein TCM_028976 [Theobroma cacao] Length = 318 Score = 314 bits (805), Expect = e-102 Identities = 146/293 (49%), Positives = 195/293 (66%) Frame = +1 Query: 166 TPPVQVRTITVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKL 345 T P T +S DP +YLHHTDH +VV++P LT +NYV W R+F LALSI+NK+ Sbjct: 12 TAPNPQLTSQISQANDPPSPYYLHHTDHLGSVVVNPKLTTNNYVAWSRSFLLALSIRNKV 71 Query: 346 GLLDGSIPTPNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRF 525 G ++GSIP P+ D L+ W RCNNLI+SWLLNS+S+ IAS I ++ S EIW+ LK + Sbjct: 72 GFINGSIPKPSITDDLHPIWNRCNNLIVSWLLNSISQPIASTIFFMESVAEIWNTLKLNY 131 Query: 526 SQPDNIRIFQLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRS 705 +QPDN + LQ L S+ Q V YF +L +WEEL+NYRP+P C CG C + + Sbjct: 132 AQPDNTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCECGKCNANCFKK 191 Query: 706 YGEIQSCDYVFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPM 885 + + D VF+FL GLNES+ IR QI+L+ P PSLDKV+SM+L+EE Q+ P Sbjct: 
192 FSDQYQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQKNMFLQSQPF 251 Query: 886 MDSLSFAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKP 1044 ++SL+ N +K K + D+ C HCGK GH ++KCYR+I FP +FKFTKGKP Sbjct: 252 LESLAMLAATNVKK-KPMKDLTCTHCGKKGHVKEKCYRIIRFPEDFKFTKGKP 303 >ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobroma cacao] gi|508722464|gb|EOY14361.1| Uncharacterized protein TCM_033758 [Theobroma cacao] Length = 328 Score = 282 bits (721), Expect = 4e-89 Identities = 140/294 (47%), Positives = 196/294 (66%), Gaps = 5/294 (1%) Frame = +1 Query: 310 AFTLALSIKNKLGLLDGSIPTPNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINS 489 +F LALSI+NK +DGSIP P+ D L++P RCN+LIL+WLL S+S IAS + YI Sbjct: 23 SFLLALSIQNKSRFIDGSIPEPDVSDKLFVPCTRCNSLILAWLLESISPPIASTVFYIRK 82 Query: 490 AKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFC 669 A E+W+ LK RFSQPD+ RI LQ L +I QGT +V YFT+LN +WEEL+NYRP+P C Sbjct: 83 AYEVWETLKERFSQPDDARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHC 142 Query: 670 SCGLCTCSALRSYGEIQSCDYVFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEE 849 SCG+C + ++Y + D VF+FL GLNES+ +R QIL++ P PSL+K +++++++E Sbjct: 143 SCGICNSACFQTYIDQYQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDE 202 Query: 850 RQREARTSITPMMDSLSFAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKF 1029 QR P+++S + A K K DVVC +C K GH++DKCYRLIGFPP+FKF Sbjct: 203 SQRNLYLHTMPIIESSAMAT-MTEGKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKF 261 Query: 1030 TKGK-PRNQGQKHSAN----ISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALIN 1176 KGK P +G S N ++S ++ + S + ++ Q+QKLM+LIN Sbjct: 262 LKGKSPLKKGNVWSINNVGPVTSKEECDESTKSL--SSLTLSKHQIQKLMSLIN 313 >gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family [Cajanus cajan] Length = 437 Score = 271 bits (693), Expect = 2e-83 Identities = 153/384 (39%), Positives = 216/384 (56%), Gaps = 20/384 (5%) Frame = +1 Query: 202 PTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNS 381 P+ DP + +LHH+D + S PL NY TW RA +AL +KNK+ +DGS+P P + Sbjct: 8 PSSDPTNPLFLHHSDGPGLFLTSQPLDNKNYTTWSRAMLVALGVKNKIPFVDGSLPRPAA 67 Query: 382 
DDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQ 561 DD Y W+ NN+++SWL NSVSKEI ++IL+ N AKEIWD LK+RFS+ + RIFQL+ Sbjct: 68 DDPTYAAWIHGNNVVISWLYNSVSKEIITSILFANIAKEIWDDLKSRFSRKNGPRIFQLR 127 Query: 562 QRLSSIVQGTSTVSEYFTQLNAVWEELKNYRP-IPFCSCGLCTCSALRSYGEIQSCDYVF 738 ++L+S+ QGT VS Y+T+L ++WE+L Y+P P CTC L+ +YV Sbjct: 128 RQLTSLQQGTDDVSTYYTKLKSIWEDLSGYKPSFP------CTCGGLQHLQVYNDLEYVM 181 Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSI--TPMMDSLSFAVK 912 FLMGLN+S+ IRGQILL P P + VFS++LQEE QRE T++ TP ++S + A Sbjct: 182 SFLMGLNDSFSQIRGQILLSDPLPPIGNVFSLVLQEETQREIGTAVTHTPSINSDNMAFD 241 Query: 913 YNSEKGKQVSD---------VVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKH 1065 NS +D C +CG GH++DKCY+L+G+PPN+ F Q Sbjct: 242 VNSSTKSSAADHYKFNRRERPKCAYCGLLGHTKDKCYKLVGYPPNYNF----KNRQTPVA 297 Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKL-------MALINKDGMQPVSSGTSSSLH 1224 + + S + ++ D+ T Q Q+L M L N D P Sbjct: 298 NQVLESPEPLNQNKPDN------LTPAQCQQLINFLTNQMKLDNPDEAVP---------- 341 Query: 1225 FSNMAGIFPSPNLLSHTAT-PWVI 1293 +N+ GI + + L H T WVI Sbjct: 342 -TNVTGICMNTHFLLHNITYRWVI 364 >ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobroma cacao] gi|508708772|gb|EOY00669.1| Uncharacterized protein TCM_010591 [Theobroma cacao] Length = 336 Score = 265 bits (678), Expect = 2e-82 Identities = 145/330 (43%), Positives = 215/330 (65%), Gaps = 3/330 (0%) Frame = +1 Query: 196 VSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTP 375 +SP E+ +Y+HH+D +VVI+P L +NY++W RAF LALSI K G +DG+I P Sbjct: 10 ISPAENLLSSYYIHHSDLHGSVVINPKLAVANYMSWSRAFLLALSICKKRGFIDGTIKKP 69 Query: 376 NSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQ 555 + +SL+ W RCN LI++WLL S++ +IASN+L ++SAKEI + LK RFSQP I Sbjct: 70 SEANSLFEDWSRCNILIVTWLLESLTPKIASNVLDMDSAKEILETLKNRFSQPYETIICN 129 Query: 556 LQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYV 735 LQ +L +I+QGT +V+ YFT+LN+VW+ELKN+RP+P C + + Y + Q+ D V Sbjct: 130 LQFQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQCDYEGRKNNCYKKYADQQNKDAV 189 Query: 736 
FKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKY 915 F FL GLNES+ +R IL++ P S+D+ +S+++++ QR + +++ + A Sbjct: 190 FCFLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKMLQRS--LILQSPVENSTMATVI 247 Query: 916 NSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGK--PRNQGQKHSANIS-SS 1086 EK K +++VC HCGK GHS++K Y +IGFP NFKFTK K R G ++ IS S Sbjct: 248 TEEKRKN-TNLVCSHCGKKGHSKEKYYCIIGFPENFKFTKLKRNMRKGGSSVNSAISGSE 306 Query: 1087 QDVHSGSNDSHGSGMIFTQDQVQKLMALIN 1176 QD + + + S + T+ Q+QKLM LI+ Sbjct: 307 QDEYDETVTNSISQLSLTKAQIQKLMTLIS 336 >ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955841, partial [Erythranthe guttata] Length = 514 Score = 265 bits (678), Expect = 3e-80 Identities = 145/371 (39%), Positives = 216/371 (58%), Gaps = 22/371 (5%) Frame = +1 Query: 199 SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378 SP D +H +LH +D N +++S T NY +W RA T++L++KNK+G +DG+I P Sbjct: 7 SPLGDVSHPMFLHPSDGPNLILVSQLFTEDNYASWSRAMTISLTVKNKIGFIDGTISEPA 66 Query: 379 SDDSLYI-PWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQ 555 +D+ + W+R NN+++SW++NSVSK+I +I+Y NS+KEIWD LKTRFSQ + RIFQ Sbjct: 67 ADELVMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126 Query: 556 LQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYV 735 L++ L+++ QG+ +V+ YFT++ A+W+EL NYRP CSCG C C + +YV Sbjct: 127 LRRDLANLTQGSQSVNVYFTKVKAIWDELVNYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184 Query: 736 FKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKY 915 FLMGLNES RGQILL+ P P + KVF+ + QEERQR +S S+ F+VK Sbjct: 185 MSFLMGLNESLASTRGQILLMDPLPPISKVFAFVSQEERQRSVVSSHVESSGSV-FSVKN 243 Query: 916 NSEKG-----------KQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGK---PRNQ 1053 K K+ C HC GH+ +KCY+L G+PP++K K + P NQ Sbjct: 244 EGFKRSINNQFYNTGFKKKERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSPANQ 303 Query: 1054 GQKHSANISSSQDVHSGSNDSHGSGMI--FTQDQVQKLMALIN-----KDGMQPVSSGTS 1212 +++ S SG + H G + T Q Q+ M++ + + S+ Sbjct: 304 VSGFDSSLDSHSS-DSGVSSQHVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSAASAQPQ 362 Query: 1213 SSLHFSNMAGI 1245 SS H ++ A + Sbjct: 363 
SSAHGADTATV 373 >gb|KHN46305.1| hypothetical protein glysoja_045316, partial [Glycine soja] Length = 276 Score = 256 bits (654), Expect = 1e-79 Identities = 124/279 (44%), Positives = 177/279 (63%) Frame = +1 Query: 211 DPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDS 390 DP+H +LHH+D ++ S PL NY TW RA +A S+KNK+ +DGS+P P + D Sbjct: 1 DPSHPLFLHHSDGPGLILTSQPLDHKNYTTWSRAMMVAFSVKNKVAFIDGSLPMPTTVDP 60 Query: 391 LYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRL 570 Y W NNL++SWL NSV K+I S+IL+ N+AKEIW+ LKTRFS+ + RIFQL+++L Sbjct: 61 TYAAWTCGNNLVISWLYNSVFKDIISSILFANTAKEIWEDLKTRFSRKNGPRIFQLKRQL 120 Query: 571 SSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVFKFLM 750 S+ QG S Y+T+L +VWEEL Y+P C CG L++ + +YV FLM Sbjct: 121 MSLQQGNDDASTYYTKLKSVWEELSGYKPTFRCKCG-----GLQTLQDYIESEYVMSFLM 175 Query: 751 GLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYNSEKG 930 GLN+++ ++GQILL P P + VFS+++QEE QRE + P ++S + A K + Sbjct: 176 GLNDNFAQVQGQILLSDPLPPIGNVFSLVIQEEAQREIVVNHIPYLNSNTMAKKERPQ-- 233 Query: 931 KQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPR 1047 C HC GH++DKCY+L+G+PPN + K KP+ Sbjct: 234 -------CAHCNLLGHTKDKCYKLVGYPPN--YFKNKPQ 263 >ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata] Length = 659 Score = 267 bits (682), Expect = 2e-79 Identities = 149/392 (38%), Positives = 223/392 (56%), Gaps = 27/392 (6%) Frame = +1 Query: 199 SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378 SP +D +H +LH +D N +++S LT NY +W RA T++L++KNK+G +DG+I P Sbjct: 7 SPLDDVSHPMFLHPSDGPNLILVSQLLTEDNYASWSRAMTISLTVKNKIGFIDGTISEPP 66 Query: 379 SDDSLYI-PWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQ 555 +D+ + W+R NN+++SW++NSVSK+I +I+Y NS+KEIWD LKTRFSQ + RIFQ Sbjct: 67 ADELIMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126 Query: 556 LQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYV 735 L++ L+++ QG+ +V+ YFT++ A+W+EL NYRP CSCG C C + +YV Sbjct: 127 LRRDLANLTQGSQSVNVYFTKVKAIWDELANYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184 
Query: 736 FKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKY 915 FLMGLN+S RGQILL+ P P + KVF+ I QEERQR +S S+ F+VK Sbjct: 185 MSFLMGLNDSLASTRGQILLMDPLPPISKVFAFISQEERQRSVVSSHVDSSGSV-FSVKN 243 Query: 916 NSEKG-----------KQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK 1062 K K+ C HC GH+ +KCY+L G+PP++K K + + + Sbjct: 244 EGFKRSINNQFYNPGLKKRERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSHVNQ 303 Query: 1063 HSANISSSQDVHS-----GSNDSHGSGMIFTQDQVQKLMALINKD----------GMQPV 1197 S SS D HS S G T Q Q+ M++ + +QP Sbjct: 304 VS-GFDSSLDSHSSDAGVSSQQVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSTASIQPQ 362 Query: 1198 SSGTSSSLHFSNMAGIFPSPNLLSHTATPWVI 1293 S+ + + S + GI + S ++ W++ Sbjct: 363 SAHGADTATVSCVTGICALSGVPSLSSADWIL 394 >ref|XP_015381157.1| PREDICTED: uncharacterized protein LOC107174627 [Citrus sinensis] Length = 316 Score = 253 bits (647), Expect = 4e-78 Identities = 130/310 (41%), Positives = 190/310 (61%), Gaps = 12/310 (3%) Frame = +1 Query: 193 TVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPT 372 ++S EDP++ +LHH+DH +++S PLT NY TW RA +ALS KNK+G +DGSI Sbjct: 10 SISSHEDPSNPLFLHHSDHPGVILVSQPLTEDNYNTWSRAMIMALSAKNKIGFIDGSIKH 69 Query: 373 PNSDDSLYIP-WLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRI 549 P + W RCN+++ SWLLNS+SKEI+ +++Y A EIW LK R SQ + I Sbjct: 70 PGDASAAESQHWNRCNDMVKSWLLNSISKEISLSVIYCKLASEIWADLKERLSQVNGPYI 129 Query: 550 FQLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCD 729 FQ+++ + ++VQ T +++ Y+T+L A+W+EL IP CSCG ++++ + Q Sbjct: 130 FQVEKEIHNLVQDTMSIATYYTKLKALWDELDALCSIPTCSCG-----SMKAVIQYQQSH 184 Query: 730 YVFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAV 909 KFLMGLNESY RGQILL+ P P+++K +S++LQ+ERQ ++ T ++ A Sbjct: 185 KTMKFLMGLNESYSATRGQILLMDPLPNVNKSYSLVLQDERQHAVSSNQTIAPEATELAA 244 Query: 910 KYNSEKGKQVSDV-----------VCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQG 1056 K NS + K+ DV C+HCG GH+ DKCY + GFPP+ + KG Sbjct: 245 KMNSRERKEYKDVEKRKDGKRERPKCDHCGWVGHTVDKCYHIHGFPPDHRNRKG-----N 299 Query: 1057 QKHSANISSS 1086 K SAN +SS Sbjct: 300 SKPSANQTSS 309 
>ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321802, partial [Brassica oleracea var. oleracea] Length = 353 Score = 254 bits (650), Expect = 4e-78 Identities = 134/353 (37%), Positives = 206/353 (58%), Gaps = 10/353 (2%) Frame = +1 Query: 202 PTEDPNHHFYLHHTDHANTVVISPPL-TGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378 P + N+ +YLH++DHA V++S L TG+++ W R+ +AL+++NKLG +DG+IP P Sbjct: 11 PVDHYNNPYYLHNSDHAGLVLVSDRLETGADFHAWRRSVRMALNVRNKLGFIDGTIPKPP 70 Query: 379 SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558 +D W RCN+++ +WL+NSVSK+I ++L++++A+ IW L +RF Q D RIF++ Sbjct: 71 ADHRDSGSWSRCNDMVSTWLMNSVSKKIGQSLLFMSTAELIWKNLMSRFKQDDAPRIFEI 130 Query: 559 QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738 +Q+LS+I QG+ VS Y+T+L +WEE +NY +P C+CG C C+A S+ IQ V Sbjct: 131 EQKLSNIQQGSLDVSTYYTELVTLWEEFQNYVDLPVCTCGKCECNAAASWELIQQRSRVT 190 Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYN 918 KFLMGLNESY+ R IL++ P PS+++VF+M+ Q+ERQ+ R S+ DS+ F Sbjct: 191 KFLMGLNESYDATRRHILMLKPIPSIEEVFNMVAQDERQKIIRPSL--KTDSVVFQTSAT 248 Query: 919 SEKGKQVSDV---------VCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKHSA 1071 + VC HCG GH KC++L G+PP +F ++ Sbjct: 249 ESASPHYAAAVAYRPKQRPVCTHCGMAGHIVQKCFKLHGYPPGHRF-----------YNT 297 Query: 1072 NISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINKDGMQPVSSGTSSSLHFS 1230 N SS Q + + SN+ + + Q Q A +Q S G HF+ Sbjct: 298 NASSQQRLSAPSNNQSRGPVSQSSHQHQSTTAGNTVAQVQNASPGALDLAHFT 350 >emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera] Length = 970 Score = 268 bits (686), Expect = 1e-77 Identities = 149/387 (38%), Positives = 230/387 (59%), Gaps = 20/387 (5%) Frame = +1 Query: 193 TVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPT 372 ++S ED ++LH+ DH V++S LTG+NY TW RA +AL+ KNK+ +DGSIP Sbjct: 17 SLSSMEDSTSPYFLHNLDHPGIVLVSHHLTGANYNTWSRAMVMALTAKNKISFIDGSIPC 76 Query: 373 PNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIF 552 P SDD L+ W+RCN++++SW+LNSV K+IA ++LY ++A IW+ L+ RF Q + RIF Sbjct: 77 
PESDDLLFGTWIRCNSMVISWILNSVHKDIADSLLYFDTAVGIWNDLRDRFCQSNGPRIF 136 Query: 553 QLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDY 732 Q+++ L ++ QG+ VS Y+T+L +W+ELK ++P+P C+CG ++++ E Q +Y Sbjct: 137 QIKKHLIALSQGSLDVSTYYTRLKILWDELKGFQPLPECACG-----TMKTWMEFQQQEY 191 Query: 733 VFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLS---- 900 V +FLMGLNES+ R QIL++ P P + KVFS++ Q+ERQ + DS++ Sbjct: 192 VMQFLMGLNESFVQTRSQILMMEPLPPIAKVFSLVAQDERQCSINYGLYTPPDSVAANDS 251 Query: 901 ------FAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK 1062 A + NS+ K C HCG GH+ DKCY+L G+PP +KF K +N K Sbjct: 252 NSTVAISAARLNSKPKK--DRPTCSHCGILGHTVDKCYKLYGYPPGYKF---KSKNPHAK 306 Query: 1063 HSANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINK----------DGMQPVSSGTS 1212 AN +SS+ + S + + + Q Q+L+AL++ + QP S +S Sbjct: 307 AQANQTSSRTTEA-SATADSPLVSLSPAQCQQLIALLSSQLHDNTPATPELQQPGPSVSS 365 Query: 1213 SSLHFSNMAGIFPSPNLLSHTATPWVI 1293 S FS + FP+ S ++ WV+ Sbjct: 366 FSSIFSLSSVSFPN----SLDSSAWVL 388 >gb|KYP45295.1| hypothetical protein KK1_033170 [Cajanus cajan] Length = 286 Score = 251 bits (640), Expect = 2e-77 Identities = 129/285 (45%), Positives = 178/285 (62%), Gaps = 12/285 (4%) Frame = +1 Query: 202 PTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNS 381 PT DP + +LHH+D V+ S PL NY TW A +A S+KNK+ +DGS+P + Sbjct: 8 PTLDPTNPLFLHHSDGPGLVLTSQPLDNKNYTTWSHAMLVAFSVKNKIPFVDGSLPKLAA 67 Query: 382 DDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQ 561 + Y W+R NNL++SWL NSVSK+I ++IL+ N+AKEIWD LKT+FS+ + IFQL+ Sbjct: 68 NHPTYPAWIRGNNLVISWLYNSVSKDIITSILFANTAKEIWDDLKTKFSRKNGPHIFQLR 127 Query: 562 QRLSSIVQGTSTVSEYFTQLNAVWEELKNYRP-IPFCSCGLCTCSALRSYGEIQSCDYVF 738 ++L S+ QG VS Y+T+L ++WEEL Y+P P CTC L+ + + +YV Sbjct: 128 RQLMSLQQGIDYVSTYYTKLKSIWEELSGYKPSFP------CTCGGLQHLQDYNASEYVM 181 Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSI--TPMMDSLSFAVK 912 FLMGLN+S+ IRGQILL P P + VFS+ILQEE Q E T+I TP ++ S A Sbjct: 182 SFLMGLNDSFSQIRGQILLSYPLPPIGNVFSLILQEETQIEIGTNITHTPSVNFDSMAFL 241 Query: 913 
YNSEKGKQVSD---------VVCEHCGKGGHSRDKCYRLIGFPPN 1020 NS + D C HCG GH++D+ Y+L+G+PPN Sbjct: 242 VNSSNKSSIVDHNKTYKKEKPKCAHCGILGHTKDEFYKLVGYPPN 286 >ref|XP_015389606.1| PREDICTED: uncharacterized protein LOC107178668 [Citrus sinensis] Length = 316 Score = 251 bits (642), Expect = 2e-77 Identities = 129/309 (41%), Positives = 188/309 (60%), Gaps = 12/309 (3%) Frame = +1 Query: 196 VSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTP 375 +S EDP++ +LHH+DH +++S PLT NY TW RA +ALS KNK+G +DG I P Sbjct: 11 ISSHEDPSNPLFLHHSDHPGVILVSQPLTEDNYNTWSRAMIMALSAKNKIGFIDGFIKHP 70 Query: 376 NSDDSLYIP-WLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIF 552 + W RCN+++ SWLLNS+SKEI+ +++Y A EIW LK R SQ + IF Sbjct: 71 GDTSAAESQHWNRCNDMVKSWLLNSISKEISLSVIYCKFASEIWTDLKERLSQVNGPYIF 130 Query: 553 QLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDY 732 Q+++ + ++VQ T +++ Y+T+L A+W+EL IP CSCG ++++ + Q Sbjct: 131 QVEKEIHNLVQDTMSIATYYTKLKALWDELDALCSIPTCSCG-----SMKAVIQYQQSHK 185 Query: 733 VFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVK 912 KFLMGLNESY RGQILL+ P P+++K +S++LQ+ERQ ++ T ++ A K Sbjct: 186 TMKFLMGLNESYSATRGQILLMDPLPNVNKSYSLVLQDERQHAVSSNQTIAPEATELAAK 245 Query: 913 YNSEKGKQVSDV-----------VCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQ 1059 NS + K+ DV C+HCG GH+ DKCY + GFPP+ + KG Sbjct: 246 MNSRERKEYKDVEKRKDGKRERPKCDHCGWVGHTVDKCYHIHGFPPDHRNRKG-----NS 300 Query: 1060 KHSANISSS 1086 K SAN +SS Sbjct: 301 KPSANQTSS 309 >gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 444 Score = 255 bits (652), Expect = 3e-77 Identities = 145/373 (38%), Positives = 213/373 (57%), Gaps = 9/373 (2%) Frame = +1 Query: 202 PTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNS 381 P++D ++ +LHH+D V+ S PL NY TW RA +AL +KNKL +DG++P P S Sbjct: 8 PSQDVSNPLFLHHSDGPGLVLTSQPLDHKNYTTWSRAMQVALFVKNKLAFIDGTLPKPAS 67 Query: 382 DDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQ 561 DS ++ W NN+++SWL NSVSK+I ++IL+ ++A+EIW LKTRFS+ + RIFQL+ Sbjct: 68 
TDSTFVAWNHANNVVISWLYNSVSKDIITSILFASTAQEIWHDLKTRFSKKNGSRIFQLR 127 Query: 562 QRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVFK 741 ++L S+ QG +S Y+T+L ++WEEL Y+P CTC L+ +YV Sbjct: 128 RQLMSLHQGMDDISTYYTKLKSIWEELSGYKP-----TFQCTCGGLQQLQSFTESEYVMS 182 Query: 742 FLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQRE-ARTSITPMMDSLSFAVKYN 918 FLMGLN+S IRGQILL P PS+ VFS++LQ+E QRE A TS P+ +S + N Sbjct: 183 FLMGLNDSISQIRGQILLSDPLPSIGNVFSLVLQDEAQREIAVTSSPPVANSDNIVFTVN 242 Query: 919 SEKGKQVSDVV-------CEHCGKGGHSRDKCYRLIGFPPN-FKFTKGKPRNQGQKHSAN 1074 S + + C HC GH++D CY+L+G+PPN FK NQ S N Sbjct: 243 SSQPATSRNRFTKKERPRCAHCNILGHTKDTCYKLVGYPPNYFKNHTTNTVNQVTGSSDN 302 Query: 1075 ISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINKDGMQPVSSGTSSSLHFSNMAGIFPS 1254 + +SQ + T DQ Q+L+ + + + T+ +N+ GI + Sbjct: 303 VLTSQSSN------------LTPDQRQQLINFLTNQ----MQADTTLDAITTNVTGICMN 346 Query: 1255 PNLLSHTATPWVI 1293 L ++ T W+I Sbjct: 347 VALDNNYHT-WII 358 >ref|XP_010662801.1| PREDICTED: uncharacterized protein LOC104882222 [Vitis vinifera] Length = 693 Score = 260 bits (665), Expect = 1e-76 Identities = 143/377 (37%), Positives = 221/377 (58%), Gaps = 13/377 (3%) Frame = +1 Query: 193 TVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPT 372 ++S ED ++LH+++H V++S LTG+NY TW RA +AL+ KNK+ +DGSIP Sbjct: 17 SLSSMEDSTSPYFLHNSNHPGIVLVSHHLTGANYNTWSRAMVMALTAKNKISFIDGSIPC 76 Query: 373 PNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIF 552 P SDD L+ W+RCNN+++SW+LNSV K+I ++LY ++A IW+ L+ RF Q + RIF Sbjct: 77 PESDDLLFGTWIRCNNMVISWILNSVHKDIVDSLLYFDTAVGIWNDLRDRFRQSNGPRIF 136 Query: 553 QLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDY 732 Q+++ L ++ QG+ VS Y+T+L +W+ELK ++P+ C+CG ++++ E Q +Y Sbjct: 137 QIKKHLIALSQGSLDVSTYYTRLKILWDELKGFQPLLECACG-----TMKTWMEFQQQEY 191 Query: 733 VFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLS---- 900 V +FLMGLNES+ QIL++ P P + KVFS++ Q+ERQR + DS++ Sbjct: 192 VMQFLMGLNESFVQTHSQILMMEPLPPIAKVFSLVAQDERQRSINYGLYTPPDSVAANDS 251 Query: 901 
------FAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK 1062 A + NS+ K C H G GH+ DKCY+L G+PP +KF K +N K Sbjct: 252 NSTIAILAARLNSKPKK--DQPTCSHYGILGHTVDKCYKLYGYPPRYKF---KSKNPHAK 306 Query: 1063 HSANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINKDGMQPVSSGTSSSLHFSNMAG 1242 AN +SS+ + S + + Q Q+L+AL+ SS LH + +A Sbjct: 307 AQANQTSSRTTEA-STTADSPLASLSPAQCQQLIALL------------SSQLHDNTLAT 353 Query: 1243 ---IFPSPNLLSHTATP 1284 P P++ S + P Sbjct: 354 PDLQQPGPSVSSFSVIP 370 >gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja] Length = 484 Score = 254 bits (649), Expect = 2e-76 Identities = 143/379 (37%), Positives = 214/379 (56%), Gaps = 27/379 (7%) Frame = +1 Query: 226 FYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDSLYIPW 405 +YLH ++ V++SP LT NY TW R+ +AL KNK +DGS+P P D LY PW Sbjct: 9 YYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPPVSDPLYAPW 68 Query: 406 LRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQ 585 +RCN ++L+W+ S+S IA ++L+I++A +W L+ RFSQ D RI LQ+ L Q Sbjct: 69 IRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQ 128 Query: 586 GTSTVSEYFTQLNAVWEELKNYRPIPFCSCGL-CTCSALRSYGEIQSCDYVFKFLMGLNE 762 GT VS+YFTQL W+EL+NYRPIP C C + C+C + S + DYV +FL GLN+ Sbjct: 129 GTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLND 188 Query: 763 SYEGIRGQILLISPTPSLDKVFSMILQEERQR--EARTSITPMMDSLSFAVKYNS----- 921 + + QI++++P P +D VFS+++Q+ER+ S++ + A++ NS Sbjct: 189 RFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNF 248 Query: 922 -------EKGKQVS---DVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK--H 1065 KGK S + VC HCGK H D C+ IG+PP +K K K + + + Sbjct: 249 NGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQANN 308 Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALI--NKDGMQPVSSGTSSS---LH-- 1224 ++N S+ + GS S S FTQ+ Q ++ + +K G QP ++ ++S LH Sbjct: 309 TSNASALESTQQGS--SAQSSFQFTQEMYQGILEALQQSKVGSQPKANSVTTSPFALHSP 366 Query: 1225 FSNMAGIFPSPNLLSHTAT 1281 SN G PS +L +T Sbjct: 367 SSNPNGKNPSLWILDTAST 385 >ref|XP_014630525.1| PREDICTED: uncharacterized 
protein LOC106798459 [Glycine max] Length = 389 Score = 251 bits (640), Expect = 4e-76 Identities = 133/353 (37%), Positives = 202/353 (57%), Gaps = 22/353 (6%) Frame = +1 Query: 226 FYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDSLYIPW 405 +YLH ++ V++SP LT NY TW + +AL KNK +DGS+P P D LY PW Sbjct: 17 YYLHPNENPALVLVSPSLTAKNYHTWSHSMHIALISKNKDKFIDGSLPKPPVSDPLYAPW 76 Query: 406 LRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQ 585 +RCN ++L+W+ S+S IA ++L+I++A +W L+ RFSQ D RI LQ+ L Q Sbjct: 77 IRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQ 136 Query: 586 GTSTVSEYFTQLNAVWEELKNYRPIPFCSCGL-CTCSALRSYGEIQSCDYVFKFLMGLNE 762 GT VS+YFTQL W+EL+NYRPIP C C + C+C + S + DYV +FL GLN+ Sbjct: 137 GTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVVRFLKGLND 196 Query: 763 SYEGIRGQILLISPTPSLDKVFSMILQEERQR--EARTSITPMMDSLSFAVKYNS----- 921 + + QI++++P P +D VFS+++Q+ER+ S++ + A++ NS Sbjct: 197 RFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNF 256 Query: 922 -------EKGKQVS---DVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK--H 1065 KGK S + VC HCGK H D C+ IG+PP +K K K + + + Sbjct: 257 NGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQANN 316 Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALI--NKDGMQPVSSGTSSS 1218 ++N S+ + GS S S FTQ+ Q ++ + +K G QP ++ ++S Sbjct: 317 TSNASALESTQQGS--SAQSSFQFTQEMYQGILEALQQSKVGSQPKANSVTTS 367 >gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja] Length = 484 Score = 253 bits (645), Expect = 1e-75 Identities = 143/379 (37%), Positives = 214/379 (56%), Gaps = 27/379 (7%) Frame = +1 Query: 226 FYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDSLYIPW 405 +YLH ++ V++SP LT NY TW R+ +AL KNK +DGS+P P D LY PW Sbjct: 9 YYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPPVSDPLYAPW 68 Query: 406 LRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQ 585 +RCN ++L+W+ S+S IA ++L+I++A +W L+ RFSQ D RI LQ+ L Q Sbjct: 69 IRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQ 128 Query: 586 
GTSTVSEYFTQLNAVWEELKNYRPIPFCSCGL-CTCSALRSYGEIQSCDYVFKFLMGLNE 762 GT VS+YFTQL W+EL+NYRPIP C C + C+C + S + DYV +FL GLN+ Sbjct: 129 GTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLND 188 Query: 763 SYEGIRGQILLISPTPSLDKVFSMILQEERQR--EARTSITPMMDSLSFAVKYNS----- 921 + + QI++++P P +D VFS+++Q+ER+ S++ + A++ NS Sbjct: 189 RFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNF 248 Query: 922 -------EKGKQVS---DVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK--H 1065 KGK S + VC HCGK H D C+ IG+PP +K K K + + + Sbjct: 249 NGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQANN 308 Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALI--NKDGMQPVSSGTSSS---LH-- 1224 ++N S+ + GS S S FTQ+ Q ++ + +K G QP ++ ++S LH Sbjct: 309 TSNASALESTQQGS--SAQSSFQFTQEMYQGILEALQQSKVGSQPKANLVTTSPFALHSP 366 Query: 1225 FSNMAGIFPSPNLLSHTAT 1281 SN G PS +L +T Sbjct: 367 SSNPNGKNPSLWILDTAST 385 >ref|XP_007037468.1| Integrase, catalytic region, putative [Theobroma cacao] gi|508774713|gb|EOY21969.1| Integrase, catalytic region, putative [Theobroma cacao] Length = 242 Score = 244 bits (624), Expect = 1e-75 Identities = 109/220 (49%), Positives = 154/220 (70%) Frame = +1 Query: 199 SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378 SP DP ++LHHT+H +V+I+P LT +NYVTW R+F LALSI+NK G ++G+I P Sbjct: 19 SPIGDPQFPYFLHHTNHPGSVIINPKLTTTNYVTWSRSFLLALSIRNKKGFINGTISKPQ 78 Query: 379 SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558 D LY W+RCNNLI++WLL+S++ IAS I Y++S +IW+ LK F+QPD+ R+ L Sbjct: 79 PTDPLYPSWIRCNNLIVAWLLDSITPPIASTIFYMDSVVDIWNTLKQSFAQPDDSRVCNL 138 Query: 559 QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738 Q L ++ QGT +V YF +L +WEEL+NYRP+P C CG + R Y + D VF Sbjct: 139 QYTLGNVTQGTRSVDSYFIELKGIWEELRNYRPLPHCVCGKYSPECFRRYSDQYQKDMVF 198 Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQR 858 +FL GLN+ + +R QI+L+ P PSLDKV++++L+EE QR Sbjct: 199 RFLNGLNDFFSAVRSQIILMDPIPSLDKVYNLVLREEAQR 238