BLASTX nr result

ID: Rehmannia28_contig00025454 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia28_contig00025454
         (1293 letters)

Database: ./nr 
           84,704,028 sequences; 31,038,470,784 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699...   422   e-142
ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697...   422   e-141
ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobrom...   314   e-102
ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobrom...   282   4e-89
gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family ...   271   2e-83
ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobrom...   265   2e-82
ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955...   265   3e-80
gb|KHN46305.1| hypothetical protein glysoja_045316, partial [Gly...   256   1e-79
ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954...   267   2e-79
ref|XP_015381157.1| PREDICTED: uncharacterized protein LOC107174...   253   4e-78
ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321...   254   4e-78
emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]   268   1e-77
gb|KYP45295.1| hypothetical protein KK1_033170 [Cajanus cajan]        251   2e-77
ref|XP_015389606.1| PREDICTED: uncharacterized protein LOC107178...   251   2e-77
gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposo...   255   3e-77
ref|XP_010662801.1| PREDICTED: uncharacterized protein LOC104882...   260   1e-76
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...   254   2e-76
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...   251   4e-76
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...   253   1e-75
ref|XP_007037468.1| Integrase, catalytic region, putative [Theob...   244   1e-75

>ref|XP_008779954.1| PREDICTED: uncharacterized protein LOC103699729, partial [Phoenix
            dactylifera]
          Length = 490

 Score =  422 bits (1086), Expect = e-142
 Identities = 199/341 (58%), Positives = 266/341 (78%), Gaps = 1/341 (0%)
 Frame = +1

Query: 199  SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378
            +P+EDPN  F+LHHTD+A TV+++PPL GSNY++W R+F+LA+SIKNKLG LDGSI TP 
Sbjct: 20   TPSEDPNSPFFLHHTDNAQTVIVTPPLVGSNYLSWSRSFSLAISIKNKLGFLDGSISTPE 79

Query: 379  SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558
              D LYIPWLRCNNLIL+WLLNS+SKEIASN+L+I SAKE+W+KLK+RF+QPDN+RI+QL
Sbjct: 80   VTDPLYIPWLRCNNLILAWLLNSISKEIASNVLFIKSAKEVWNKLKSRFAQPDNVRIYQL 139

Query: 559  QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738
            +Q+LSSI Q + +VSEYFTQLNA+WEEL+NYRP+P+CSCG C C AL+  GE    D++F
Sbjct: 140  KQQLSSITQRSLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCICDALKGVGEDLELDHIF 199

Query: 739  KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYN 918
            +FLMGLN++Y+ +RGQI+L+SP PSLDK FS++LQEERQR+AR  I P  +S + A   N
Sbjct: 200  QFLMGLNDTYDTVRGQIILMSPLPSLDKTFSLVLQEERQRQARAIIFPAPESSALAAVLN 259

Query: 919  SEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKHSANISSSQDVH 1098
              K K  +++ C HCGK GH+++KCYRLIGFPPNFKFTK K  +   K  A  S++Q + 
Sbjct: 260  --KSKNRAEITCYHCGKSGHTKEKCYRLIGFPPNFKFTKTKFPSVNNKSVAPHSANQVIS 317

Query: 1099 SGSNDSHGSGMI-FTQDQVQKLMALINKDGMQPVSSGTSSS 1218
            S       +  +  +Q Q+Q+L+AL+N  G+  +S  ++S+
Sbjct: 318  STQGKGLSAPQLSLSQTQIQQLLALVN-SGIPQMSLNSAST 357


>ref|XP_008777304.1| PREDICTED: uncharacterized protein LOC103697258 [Phoenix dactylifera]
          Length = 514

 Score =  422 bits (1086), Expect = e-141
 Identities = 201/345 (58%), Positives = 265/345 (76%), Gaps = 5/345 (1%)
 Frame = +1

Query: 199  SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378
            +P+EDPN  F+LH TD+A TV+++PPL GSNY++W R+F+LA+SIKNKLG LDGSIPTP 
Sbjct: 20   TPSEDPNSPFFLHRTDNAQTVIVTPPLIGSNYLSWSRSFSLAISIKNKLGFLDGSIPTPE 79

Query: 379  SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558
              D LY+PWLRCNNLIL+WLLNS+SKEIASN+L+I S KE+W+KLK+RF+QPDN+RI+QL
Sbjct: 80   VTDPLYVPWLRCNNLILAWLLNSISKEIASNVLFIKSTKEVWNKLKSRFAQPDNVRIYQL 139

Query: 559  QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738
            +Q+LSSI QGT +VSEYFTQLNA+WEEL+NYRP+P+CSCG C C AL+  GE    DY+F
Sbjct: 140  KQQLSSITQGTLSVSEYFTQLNAIWEELRNYRPLPYCSCGHCICDALKGVGENLELDYIF 199

Query: 739  KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYN 918
            +FLM LN +++ +RGQI+L+SP PSLDK FS++LQEERQR+AR  I P  +S + A   N
Sbjct: 200  QFLMELNNTFDSVRGQIILMSPLPSLDKTFSLVLQEERQRQARAIIFPAPESSALAAVLN 259

Query: 919  SEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKHSANISSSQDVH 1098
              K K  + + C HCGK GH+R+KCYRLIGFPPNFKFTK K  +   K  A+ S++Q + 
Sbjct: 260  KPKNK--AKITCYHCGKPGHTREKCYRLIGFPPNFKFTKTKSPSVNNKSVASHSANQVI- 316

Query: 1099 SGSNDSHGSGMI-----FTQDQVQKLMALINKDGMQPVSSGTSSS 1218
               + + G G+       +Q QVQ+L AL+N  G+  ++  ++SS
Sbjct: 317  ---SPTQGKGLAAPQLSLSQAQVQQLFALVN-SGITQLNLNSASS 357


>ref|XP_007024403.1| Uncharacterized protein TCM_028976 [Theobroma cacao]
            gi|508779769|gb|EOY27025.1| Uncharacterized protein
            TCM_028976 [Theobroma cacao]
          Length = 318

 Score =  314 bits (805), Expect = e-102
 Identities = 146/293 (49%), Positives = 195/293 (66%)
 Frame = +1

Query: 166  TPPVQVRTITVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKL 345
            T P    T  +S   DP   +YLHHTDH  +VV++P LT +NYV W R+F LALSI+NK+
Sbjct: 12   TAPNPQLTSQISQANDPPSPYYLHHTDHLGSVVVNPKLTTNNYVAWSRSFLLALSIRNKV 71

Query: 346  GLLDGSIPTPNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRF 525
            G ++GSIP P+  D L+  W RCNNLI+SWLLNS+S+ IAS I ++ S  EIW+ LK  +
Sbjct: 72   GFINGSIPKPSITDDLHPIWNRCNNLIVSWLLNSISQPIASTIFFMESVAEIWNTLKLNY 131

Query: 526  SQPDNIRIFQLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRS 705
            +QPDN  +  LQ  L S+ Q    V  YF +L  +WEEL+NYRP+P C CG C  +  + 
Sbjct: 132  AQPDNTCVCNLQYTLGSVTQRVKIVYAYFIELKCIWEELRNYRPLPHCECGKCNANCFKK 191

Query: 706  YGEIQSCDYVFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPM 885
            + +    D VF+FL GLNES+  IR QI+L+ P PSLDKV+SM+L+EE Q+       P 
Sbjct: 192  FSDQYQKDMVFRFLNGLNESFSAIRSQIILMDPIPSLDKVYSMVLREESQKNMFLQSQPF 251

Query: 886  MDSLSFAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKP 1044
            ++SL+     N +K K + D+ C HCGK GH ++KCYR+I FP +FKFTKGKP
Sbjct: 252  LESLAMLAATNVKK-KPMKDLTCTHCGKKGHVKEKCYRIIRFPEDFKFTKGKP 303


>ref|XP_007017136.1| Uncharacterized protein TCM_033758 [Theobroma cacao]
            gi|508722464|gb|EOY14361.1| Uncharacterized protein
            TCM_033758 [Theobroma cacao]
          Length = 328

 Score =  282 bits (721), Expect = 4e-89
 Identities = 140/294 (47%), Positives = 196/294 (66%), Gaps = 5/294 (1%)
 Frame = +1

Query: 310  AFTLALSIKNKLGLLDGSIPTPNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINS 489
            +F LALSI+NK   +DGSIP P+  D L++P  RCN+LIL+WLL S+S  IAS + YI  
Sbjct: 23   SFLLALSIQNKSRFIDGSIPEPDVSDKLFVPCTRCNSLILAWLLESISPPIASTVFYIRK 82

Query: 490  AKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFC 669
            A E+W+ LK RFSQPD+ RI  LQ  L +I QGT +V  YFT+LN +WEEL+NYRP+P C
Sbjct: 83   AYEVWETLKERFSQPDDARICNLQFNLYNISQGTRSVDAYFTELNCIWEELRNYRPLPHC 142

Query: 670  SCGLCTCSALRSYGEIQSCDYVFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEE 849
            SCG+C  +  ++Y +    D VF+FL GLNES+  +R QIL++ P PSL+K +++++++E
Sbjct: 143  SCGICNSACFQTYIDQYQKDSVFRFLNGLNESFSALRSQILMMKPFPSLNKAYNLVIRDE 202

Query: 850  RQREARTSITPMMDSLSFAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKF 1029
             QR       P+++S + A      K K   DVVC +C K GH++DKCYRLIGFPP+FKF
Sbjct: 203  SQRNLYLHTMPIIESSAMAT-MTEGKVKSKVDVVCSYCHKKGHTKDKCYRLIGFPPDFKF 261

Query: 1030 TKGK-PRNQGQKHSAN----ISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALIN 1176
             KGK P  +G   S N    ++S ++    +     S +  ++ Q+QKLM+LIN
Sbjct: 262  LKGKSPLKKGNVWSINNVGPVTSKEECDESTKSL--SSLTLSKHQIQKLMSLIN 313


>gb|KYP31881.1| Putative transposon Ty5-1 protein YCL075W family [Cajanus cajan]
          Length = 437

 Score =  271 bits (693), Expect = 2e-83
 Identities = 153/384 (39%), Positives = 216/384 (56%), Gaps = 20/384 (5%)
 Frame = +1

Query: 202  PTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNS 381
            P+ DP +  +LHH+D     + S PL   NY TW RA  +AL +KNK+  +DGS+P P +
Sbjct: 8    PSSDPTNPLFLHHSDGPGLFLTSQPLDNKNYTTWSRAMLVALGVKNKIPFVDGSLPRPAA 67

Query: 382  DDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQ 561
            DD  Y  W+  NN+++SWL NSVSKEI ++IL+ N AKEIWD LK+RFS+ +  RIFQL+
Sbjct: 68   DDPTYAAWIHGNNVVISWLYNSVSKEIITSILFANIAKEIWDDLKSRFSRKNGPRIFQLR 127

Query: 562  QRLSSIVQGTSTVSEYFTQLNAVWEELKNYRP-IPFCSCGLCTCSALRSYGEIQSCDYVF 738
            ++L+S+ QGT  VS Y+T+L ++WE+L  Y+P  P      CTC  L+        +YV 
Sbjct: 128  RQLTSLQQGTDDVSTYYTKLKSIWEDLSGYKPSFP------CTCGGLQHLQVYNDLEYVM 181

Query: 739  KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSI--TPMMDSLSFAVK 912
             FLMGLN+S+  IRGQILL  P P +  VFS++LQEE QRE  T++  TP ++S + A  
Sbjct: 182  SFLMGLNDSFSQIRGQILLSDPLPPIGNVFSLVLQEETQREIGTAVTHTPSINSDNMAFD 241

Query: 913  YNSEKGKQVSD---------VVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKH 1065
             NS      +D           C +CG  GH++DKCY+L+G+PPN+ F       Q    
Sbjct: 242  VNSSTKSSAADHYKFNRRERPKCAYCGLLGHTKDKCYKLVGYPPNYNF----KNRQTPVA 297

Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKL-------MALINKDGMQPVSSGTSSSLH 1224
            +  + S + ++    D+       T  Q Q+L       M L N D   P          
Sbjct: 298  NQVLESPEPLNQNKPDN------LTPAQCQQLINFLTNQMKLDNPDEAVP---------- 341

Query: 1225 FSNMAGIFPSPNLLSHTAT-PWVI 1293
             +N+ GI  + + L H  T  WVI
Sbjct: 342  -TNVTGICMNTHFLLHNITYRWVI 364


>ref|XP_007044837.1| Uncharacterized protein TCM_010591 [Theobroma cacao]
            gi|508708772|gb|EOY00669.1| Uncharacterized protein
            TCM_010591 [Theobroma cacao]
          Length = 336

 Score =  265 bits (678), Expect = 2e-82
 Identities = 145/330 (43%), Positives = 215/330 (65%), Gaps = 3/330 (0%)
 Frame = +1

Query: 196  VSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTP 375
            +SP E+    +Y+HH+D   +VVI+P L  +NY++W RAF LALSI  K G +DG+I  P
Sbjct: 10   ISPAENLLSSYYIHHSDLHGSVVINPKLAVANYMSWSRAFLLALSICKKRGFIDGTIKKP 69

Query: 376  NSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQ 555
            +  +SL+  W RCN LI++WLL S++ +IASN+L ++SAKEI + LK RFSQP    I  
Sbjct: 70   SEANSLFEDWSRCNILIVTWLLESLTPKIASNVLDMDSAKEILETLKNRFSQPYETIICN 129

Query: 556  LQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYV 735
            LQ +L +I+QGT +V+ YFT+LN+VW+ELKN+RP+P C       +  + Y + Q+ D V
Sbjct: 130  LQFQLRNILQGTRSVNTYFTELNSVWQELKNFRPLPQCDYEGRKNNCYKKYADQQNKDAV 189

Query: 736  FKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKY 915
            F FL GLNES+  +R  IL++ P  S+D+ +S+++++  QR     +   +++ + A   
Sbjct: 190  FCFLNGLNESFSCLRSHILMLKPFLSIDQAYSLVIKKMLQRS--LILQSPVENSTMATVI 247

Query: 916  NSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGK--PRNQGQKHSANIS-SS 1086
              EK K  +++VC HCGK GHS++K Y +IGFP NFKFTK K   R  G   ++ IS S 
Sbjct: 248  TEEKRKN-TNLVCSHCGKKGHSKEKYYCIIGFPENFKFTKLKRNMRKGGSSVNSAISGSE 306

Query: 1087 QDVHSGSNDSHGSGMIFTQDQVQKLMALIN 1176
            QD +  +  +  S +  T+ Q+QKLM LI+
Sbjct: 307  QDEYDETVTNSISQLSLTKAQIQKLMTLIS 336


>ref|XP_012835096.1| PREDICTED: uncharacterized protein LOC105955841, partial [Erythranthe
            guttata]
          Length = 514

 Score =  265 bits (678), Expect = 3e-80
 Identities = 145/371 (39%), Positives = 216/371 (58%), Gaps = 22/371 (5%)
 Frame = +1

Query: 199  SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378
            SP  D +H  +LH +D  N +++S   T  NY +W RA T++L++KNK+G +DG+I  P 
Sbjct: 7    SPLGDVSHPMFLHPSDGPNLILVSQLFTEDNYASWSRAMTISLTVKNKIGFIDGTISEPA 66

Query: 379  SDDSLYI-PWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQ 555
            +D+ +    W+R NN+++SW++NSVSK+I  +I+Y NS+KEIWD LKTRFSQ +  RIFQ
Sbjct: 67   ADELVMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126

Query: 556  LQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYV 735
            L++ L+++ QG+ +V+ YFT++ A+W+EL NYRP   CSCG C C          + +YV
Sbjct: 127  LRRDLANLTQGSQSVNVYFTKVKAIWDELVNYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184

Query: 736  FKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKY 915
              FLMGLNES    RGQILL+ P P + KVF+ + QEERQR   +S      S+ F+VK 
Sbjct: 185  MSFLMGLNESLASTRGQILLMDPLPPISKVFAFVSQEERQRSVVSSHVESSGSV-FSVKN 243

Query: 916  NSEKG-----------KQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGK---PRNQ 1053
               K            K+     C HC   GH+ +KCY+L G+PP++K  K +   P NQ
Sbjct: 244  EGFKRSINNQFYNTGFKKKERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSPANQ 303

Query: 1054 GQKHSANISSSQDVHSGSNDSHGSGMI--FTQDQVQKLMALIN-----KDGMQPVSSGTS 1212
                 +++ S     SG +  H  G +   T  Q Q+ M++ +     +      S+   
Sbjct: 304  VSGFDSSLDSHSS-DSGVSSQHVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSAASAQPQ 362

Query: 1213 SSLHFSNMAGI 1245
            SS H ++ A +
Sbjct: 363  SSAHGADTATV 373


>gb|KHN46305.1| hypothetical protein glysoja_045316, partial [Glycine soja]
          Length = 276

 Score =  256 bits (654), Expect = 1e-79
 Identities = 124/279 (44%), Positives = 177/279 (63%)
 Frame = +1

Query: 211  DPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDS 390
            DP+H  +LHH+D    ++ S PL   NY TW RA  +A S+KNK+  +DGS+P P + D 
Sbjct: 1    DPSHPLFLHHSDGPGLILTSQPLDHKNYTTWSRAMMVAFSVKNKVAFIDGSLPMPTTVDP 60

Query: 391  LYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRL 570
             Y  W   NNL++SWL NSV K+I S+IL+ N+AKEIW+ LKTRFS+ +  RIFQL+++L
Sbjct: 61   TYAAWTCGNNLVISWLYNSVFKDIISSILFANTAKEIWEDLKTRFSRKNGPRIFQLKRQL 120

Query: 571  SSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVFKFLM 750
             S+ QG    S Y+T+L +VWEEL  Y+P   C CG      L++  +    +YV  FLM
Sbjct: 121  MSLQQGNDDASTYYTKLKSVWEELSGYKPTFRCKCG-----GLQTLQDYIESEYVMSFLM 175

Query: 751  GLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYNSEKG 930
            GLN+++  ++GQILL  P P +  VFS+++QEE QRE   +  P ++S + A K   +  
Sbjct: 176  GLNDNFAQVQGQILLSDPLPPIGNVFSLVIQEEAQREIVVNHIPYLNSNTMAKKERPQ-- 233

Query: 931  KQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPR 1047
                   C HC   GH++DKCY+L+G+PPN  + K KP+
Sbjct: 234  -------CAHCNLLGHTKDKCYKLVGYPPN--YFKNKPQ 263


>ref|XP_012833844.1| PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata]
          Length = 659

 Score =  267 bits (682), Expect = 2e-79
 Identities = 149/392 (38%), Positives = 223/392 (56%), Gaps = 27/392 (6%)
 Frame = +1

Query: 199  SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378
            SP +D +H  +LH +D  N +++S  LT  NY +W RA T++L++KNK+G +DG+I  P 
Sbjct: 7    SPLDDVSHPMFLHPSDGPNLILVSQLLTEDNYASWSRAMTISLTVKNKIGFIDGTISEPP 66

Query: 379  SDDSLYI-PWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQ 555
            +D+ +    W+R NN+++SW++NSVSK+I  +I+Y NS+KEIWD LKTRFSQ +  RIFQ
Sbjct: 67   ADELIMRNAWIRNNNIVMSWIINSVSKDIQGSIMYSNSSKEIWDDLKTRFSQTNGPRIFQ 126

Query: 556  LQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYV 735
            L++ L+++ QG+ +V+ YFT++ A+W+EL NYRP   CSCG C C          + +YV
Sbjct: 127  LRRDLANLTQGSQSVNVYFTKVKAIWDELANYRPC--CSCGKCDCGGFEKLQAHYNQEYV 184

Query: 736  FKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKY 915
              FLMGLN+S    RGQILL+ P P + KVF+ I QEERQR   +S      S+ F+VK 
Sbjct: 185  MSFLMGLNDSLASTRGQILLMDPLPPISKVFAFISQEERQRSVVSSHVDSSGSV-FSVKN 243

Query: 916  NSEKG-----------KQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK 1062
               K            K+     C HC   GH+ +KCY+L G+PP++K  K +  +   +
Sbjct: 244  EGFKRSINNQFYNPGLKKRERSFCTHCNMQGHTVEKCYKLHGYPPSYKPQKSRFSSHVNQ 303

Query: 1063 HSANISSSQDVHS-----GSNDSHGSGMIFTQDQVQKLMALINKD----------GMQPV 1197
             S    SS D HS      S    G     T  Q Q+ M++ +             +QP 
Sbjct: 304  VS-GFDSSLDSHSSDAGVSSQQVDGYLQSMTPSQCQQFMSMFSSHMAAQQQQSTASIQPQ 362

Query: 1198 SSGTSSSLHFSNMAGIFPSPNLLSHTATPWVI 1293
            S+  + +   S + GI     + S ++  W++
Sbjct: 363  SAHGADTATVSCVTGICALSGVPSLSSADWIL 394


>ref|XP_015381157.1| PREDICTED: uncharacterized protein LOC107174627 [Citrus sinensis]
          Length = 316

 Score =  253 bits (647), Expect = 4e-78
 Identities = 130/310 (41%), Positives = 190/310 (61%), Gaps = 12/310 (3%)
 Frame = +1

Query: 193  TVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPT 372
            ++S  EDP++  +LHH+DH   +++S PLT  NY TW RA  +ALS KNK+G +DGSI  
Sbjct: 10   SISSHEDPSNPLFLHHSDHPGVILVSQPLTEDNYNTWSRAMIMALSAKNKIGFIDGSIKH 69

Query: 373  PNSDDSLYIP-WLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRI 549
            P    +     W RCN+++ SWLLNS+SKEI+ +++Y   A EIW  LK R SQ +   I
Sbjct: 70   PGDASAAESQHWNRCNDMVKSWLLNSISKEISLSVIYCKLASEIWADLKERLSQVNGPYI 129

Query: 550  FQLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCD 729
            FQ+++ + ++VQ T +++ Y+T+L A+W+EL     IP CSCG     ++++  + Q   
Sbjct: 130  FQVEKEIHNLVQDTMSIATYYTKLKALWDELDALCSIPTCSCG-----SMKAVIQYQQSH 184

Query: 730  YVFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAV 909
               KFLMGLNESY   RGQILL+ P P+++K +S++LQ+ERQ    ++ T   ++   A 
Sbjct: 185  KTMKFLMGLNESYSATRGQILLMDPLPNVNKSYSLVLQDERQHAVSSNQTIAPEATELAA 244

Query: 910  KYNSEKGKQVSDV-----------VCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQG 1056
            K NS + K+  DV            C+HCG  GH+ DKCY + GFPP+ +  KG      
Sbjct: 245  KMNSRERKEYKDVEKRKDGKRERPKCDHCGWVGHTVDKCYHIHGFPPDHRNRKG-----N 299

Query: 1057 QKHSANISSS 1086
             K SAN +SS
Sbjct: 300  SKPSANQTSS 309


>ref|XP_013615493.1| PREDICTED: uncharacterized protein LOC106321802, partial [Brassica
            oleracea var. oleracea]
          Length = 353

 Score =  254 bits (650), Expect = 4e-78
 Identities = 134/353 (37%), Positives = 206/353 (58%), Gaps = 10/353 (2%)
 Frame = +1

Query: 202  PTEDPNHHFYLHHTDHANTVVISPPL-TGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378
            P +  N+ +YLH++DHA  V++S  L TG+++  W R+  +AL+++NKLG +DG+IP P 
Sbjct: 11   PVDHYNNPYYLHNSDHAGLVLVSDRLETGADFHAWRRSVRMALNVRNKLGFIDGTIPKPP 70

Query: 379  SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558
            +D      W RCN+++ +WL+NSVSK+I  ++L++++A+ IW  L +RF Q D  RIF++
Sbjct: 71   ADHRDSGSWSRCNDMVSTWLMNSVSKKIGQSLLFMSTAELIWKNLMSRFKQDDAPRIFEI 130

Query: 559  QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738
            +Q+LS+I QG+  VS Y+T+L  +WEE +NY  +P C+CG C C+A  S+  IQ    V 
Sbjct: 131  EQKLSNIQQGSLDVSTYYTELVTLWEEFQNYVDLPVCTCGKCECNAAASWELIQQRSRVT 190

Query: 739  KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVKYN 918
            KFLMGLNESY+  R  IL++ P PS+++VF+M+ Q+ERQ+  R S+    DS+ F     
Sbjct: 191  KFLMGLNESYDATRRHILMLKPIPSIEEVFNMVAQDERQKIIRPSL--KTDSVVFQTSAT 248

Query: 919  SEKGKQVSDV---------VCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQKHSA 1071
                   +           VC HCG  GH   KC++L G+PP  +F           ++ 
Sbjct: 249  ESASPHYAAAVAYRPKQRPVCTHCGMAGHIVQKCFKLHGYPPGHRF-----------YNT 297

Query: 1072 NISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINKDGMQPVSSGTSSSLHFS 1230
            N SS Q + + SN+     +  +  Q Q   A      +Q  S G     HF+
Sbjct: 298  NASSQQRLSAPSNNQSRGPVSQSSHQHQSTTAGNTVAQVQNASPGALDLAHFT 350


>emb|CAN74819.1| hypothetical protein VITISV_034590 [Vitis vinifera]
          Length = 970

 Score =  268 bits (686), Expect = 1e-77
 Identities = 149/387 (38%), Positives = 230/387 (59%), Gaps = 20/387 (5%)
 Frame = +1

Query: 193  TVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPT 372
            ++S  ED    ++LH+ DH   V++S  LTG+NY TW RA  +AL+ KNK+  +DGSIP 
Sbjct: 17   SLSSMEDSTSPYFLHNLDHPGIVLVSHHLTGANYNTWSRAMVMALTAKNKISFIDGSIPC 76

Query: 373  PNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIF 552
            P SDD L+  W+RCN++++SW+LNSV K+IA ++LY ++A  IW+ L+ RF Q +  RIF
Sbjct: 77   PESDDLLFGTWIRCNSMVISWILNSVHKDIADSLLYFDTAVGIWNDLRDRFCQSNGPRIF 136

Query: 553  QLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDY 732
            Q+++ L ++ QG+  VS Y+T+L  +W+ELK ++P+P C+CG      ++++ E Q  +Y
Sbjct: 137  QIKKHLIALSQGSLDVSTYYTRLKILWDELKGFQPLPECACG-----TMKTWMEFQQQEY 191

Query: 733  VFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLS---- 900
            V +FLMGLNES+   R QIL++ P P + KVFS++ Q+ERQ      +    DS++    
Sbjct: 192  VMQFLMGLNESFVQTRSQILMMEPLPPIAKVFSLVAQDERQCSINYGLYTPPDSVAANDS 251

Query: 901  ------FAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK 1062
                   A + NS+  K      C HCG  GH+ DKCY+L G+PP +KF   K +N   K
Sbjct: 252  NSTVAISAARLNSKPKK--DRPTCSHCGILGHTVDKCYKLYGYPPGYKF---KSKNPHAK 306

Query: 1063 HSANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINK----------DGMQPVSSGTS 1212
              AN +SS+   + S  +    +  +  Q Q+L+AL++           +  QP  S +S
Sbjct: 307  AQANQTSSRTTEA-SATADSPLVSLSPAQCQQLIALLSSQLHDNTPATPELQQPGPSVSS 365

Query: 1213 SSLHFSNMAGIFPSPNLLSHTATPWVI 1293
             S  FS  +  FP+    S  ++ WV+
Sbjct: 366  FSSIFSLSSVSFPN----SLDSSAWVL 388


>gb|KYP45295.1| hypothetical protein KK1_033170 [Cajanus cajan]
          Length = 286

 Score =  251 bits (640), Expect = 2e-77
 Identities = 129/285 (45%), Positives = 178/285 (62%), Gaps = 12/285 (4%)
 Frame = +1

Query: 202  PTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNS 381
            PT DP +  +LHH+D    V+ S PL   NY TW  A  +A S+KNK+  +DGS+P   +
Sbjct: 8    PTLDPTNPLFLHHSDGPGLVLTSQPLDNKNYTTWSHAMLVAFSVKNKIPFVDGSLPKLAA 67

Query: 382  DDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQ 561
            +   Y  W+R NNL++SWL NSVSK+I ++IL+ N+AKEIWD LKT+FS+ +   IFQL+
Sbjct: 68   NHPTYPAWIRGNNLVISWLYNSVSKDIITSILFANTAKEIWDDLKTKFSRKNGPHIFQLR 127

Query: 562  QRLSSIVQGTSTVSEYFTQLNAVWEELKNYRP-IPFCSCGLCTCSALRSYGEIQSCDYVF 738
            ++L S+ QG   VS Y+T+L ++WEEL  Y+P  P      CTC  L+   +  + +YV 
Sbjct: 128  RQLMSLQQGIDYVSTYYTKLKSIWEELSGYKPSFP------CTCGGLQHLQDYNASEYVM 181

Query: 739  KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSI--TPMMDSLSFAVK 912
             FLMGLN+S+  IRGQILL  P P +  VFS+ILQEE Q E  T+I  TP ++  S A  
Sbjct: 182  SFLMGLNDSFSQIRGQILLSYPLPPIGNVFSLILQEETQIEIGTNITHTPSVNFDSMAFL 241

Query: 913  YNSEKGKQVSD---------VVCEHCGKGGHSRDKCYRLIGFPPN 1020
             NS     + D           C HCG  GH++D+ Y+L+G+PPN
Sbjct: 242  VNSSNKSSIVDHNKTYKKEKPKCAHCGILGHTKDEFYKLVGYPPN 286


>ref|XP_015389606.1| PREDICTED: uncharacterized protein LOC107178668 [Citrus sinensis]
          Length = 316

 Score =  251 bits (642), Expect = 2e-77
 Identities = 129/309 (41%), Positives = 188/309 (60%), Gaps = 12/309 (3%)
 Frame = +1

Query: 196  VSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTP 375
            +S  EDP++  +LHH+DH   +++S PLT  NY TW RA  +ALS KNK+G +DG I  P
Sbjct: 11   ISSHEDPSNPLFLHHSDHPGVILVSQPLTEDNYNTWSRAMIMALSAKNKIGFIDGFIKHP 70

Query: 376  NSDDSLYIP-WLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIF 552
                +     W RCN+++ SWLLNS+SKEI+ +++Y   A EIW  LK R SQ +   IF
Sbjct: 71   GDTSAAESQHWNRCNDMVKSWLLNSISKEISLSVIYCKFASEIWTDLKERLSQVNGPYIF 130

Query: 553  QLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDY 732
            Q+++ + ++VQ T +++ Y+T+L A+W+EL     IP CSCG     ++++  + Q    
Sbjct: 131  QVEKEIHNLVQDTMSIATYYTKLKALWDELDALCSIPTCSCG-----SMKAVIQYQQSHK 185

Query: 733  VFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLSFAVK 912
              KFLMGLNESY   RGQILL+ P P+++K +S++LQ+ERQ    ++ T   ++   A K
Sbjct: 186  TMKFLMGLNESYSATRGQILLMDPLPNVNKSYSLVLQDERQHAVSSNQTIAPEATELAAK 245

Query: 913  YNSEKGKQVSDV-----------VCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQ 1059
             NS + K+  DV            C+HCG  GH+ DKCY + GFPP+ +  KG       
Sbjct: 246  MNSRERKEYKDVEKRKDGKRERPKCDHCGWVGHTVDKCYHIHGFPPDHRNRKG-----NS 300

Query: 1060 KHSANISSS 1086
            K SAN +SS
Sbjct: 301  KPSANQTSS 309


>gb|KYP74100.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 444

 Score =  255 bits (652), Expect = 3e-77
 Identities = 145/373 (38%), Positives = 213/373 (57%), Gaps = 9/373 (2%)
 Frame = +1

Query: 202  PTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNS 381
            P++D ++  +LHH+D    V+ S PL   NY TW RA  +AL +KNKL  +DG++P P S
Sbjct: 8    PSQDVSNPLFLHHSDGPGLVLTSQPLDHKNYTTWSRAMQVALFVKNKLAFIDGTLPKPAS 67

Query: 382  DDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQ 561
             DS ++ W   NN+++SWL NSVSK+I ++IL+ ++A+EIW  LKTRFS+ +  RIFQL+
Sbjct: 68   TDSTFVAWNHANNVVISWLYNSVSKDIITSILFASTAQEIWHDLKTRFSKKNGSRIFQLR 127

Query: 562  QRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVFK 741
            ++L S+ QG   +S Y+T+L ++WEEL  Y+P        CTC  L+        +YV  
Sbjct: 128  RQLMSLHQGMDDISTYYTKLKSIWEELSGYKP-----TFQCTCGGLQQLQSFTESEYVMS 182

Query: 742  FLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQRE-ARTSITPMMDSLSFAVKYN 918
            FLMGLN+S   IRGQILL  P PS+  VFS++LQ+E QRE A TS  P+ +S +     N
Sbjct: 183  FLMGLNDSISQIRGQILLSDPLPSIGNVFSLVLQDEAQREIAVTSSPPVANSDNIVFTVN 242

Query: 919  SEKGKQVSDVV-------CEHCGKGGHSRDKCYRLIGFPPN-FKFTKGKPRNQGQKHSAN 1074
            S +     +         C HC   GH++D CY+L+G+PPN FK       NQ    S N
Sbjct: 243  SSQPATSRNRFTKKERPRCAHCNILGHTKDTCYKLVGYPPNYFKNHTTNTVNQVTGSSDN 302

Query: 1075 ISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINKDGMQPVSSGTSSSLHFSNMAGIFPS 1254
            + +SQ  +             T DQ Q+L+  +       + + T+     +N+ GI  +
Sbjct: 303  VLTSQSSN------------LTPDQRQQLINFLTNQ----MQADTTLDAITTNVTGICMN 346

Query: 1255 PNLLSHTATPWVI 1293
              L ++  T W+I
Sbjct: 347  VALDNNYHT-WII 358


>ref|XP_010662801.1| PREDICTED: uncharacterized protein LOC104882222 [Vitis vinifera]
          Length = 693

 Score =  260 bits (665), Expect = 1e-76
 Identities = 143/377 (37%), Positives = 221/377 (58%), Gaps = 13/377 (3%)
 Frame = +1

Query: 193  TVSPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPT 372
            ++S  ED    ++LH+++H   V++S  LTG+NY TW RA  +AL+ KNK+  +DGSIP 
Sbjct: 17   SLSSMEDSTSPYFLHNSNHPGIVLVSHHLTGANYNTWSRAMVMALTAKNKISFIDGSIPC 76

Query: 373  PNSDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIF 552
            P SDD L+  W+RCNN+++SW+LNSV K+I  ++LY ++A  IW+ L+ RF Q +  RIF
Sbjct: 77   PESDDLLFGTWIRCNNMVISWILNSVHKDIVDSLLYFDTAVGIWNDLRDRFRQSNGPRIF 136

Query: 553  QLQQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDY 732
            Q+++ L ++ QG+  VS Y+T+L  +W+ELK ++P+  C+CG      ++++ E Q  +Y
Sbjct: 137  QIKKHLIALSQGSLDVSTYYTRLKILWDELKGFQPLLECACG-----TMKTWMEFQQQEY 191

Query: 733  VFKFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQREARTSITPMMDSLS---- 900
            V +FLMGLNES+     QIL++ P P + KVFS++ Q+ERQR     +    DS++    
Sbjct: 192  VMQFLMGLNESFVQTHSQILMMEPLPPIAKVFSLVAQDERQRSINYGLYTPPDSVAANDS 251

Query: 901  ------FAVKYNSEKGKQVSDVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK 1062
                   A + NS+  K      C H G  GH+ DKCY+L G+PP +KF   K +N   K
Sbjct: 252  NSTIAILAARLNSKPKK--DQPTCSHYGILGHTVDKCYKLYGYPPRYKF---KSKNPHAK 306

Query: 1063 HSANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALINKDGMQPVSSGTSSSLHFSNMAG 1242
              AN +SS+   + S  +       +  Q Q+L+AL+            SS LH + +A 
Sbjct: 307  AQANQTSSRTTEA-STTADSPLASLSPAQCQQLIALL------------SSQLHDNTLAT 353

Query: 1243 ---IFPSPNLLSHTATP 1284
                 P P++ S +  P
Sbjct: 354  PDLQQPGPSVSSFSVIP 370


>gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja]
          Length = 484

 Score =  254 bits (649), Expect = 2e-76
 Identities = 143/379 (37%), Positives = 214/379 (56%), Gaps = 27/379 (7%)
 Frame = +1

Query: 226  FYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDSLYIPW 405
            +YLH  ++   V++SP LT  NY TW R+  +AL  KNK   +DGS+P P   D LY PW
Sbjct: 9    YYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPPVSDPLYAPW 68

Query: 406  LRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQ 585
            +RCN ++L+W+  S+S  IA ++L+I++A  +W  L+ RFSQ D  RI  LQ+ L    Q
Sbjct: 69   IRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQ 128

Query: 586  GTSTVSEYFTQLNAVWEELKNYRPIPFCSCGL-CTCSALRSYGEIQSCDYVFKFLMGLNE 762
            GT  VS+YFTQL   W+EL+NYRPIP C C + C+C  + S    +  DYV +FL GLN+
Sbjct: 129  GTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLND 188

Query: 763  SYEGIRGQILLISPTPSLDKVFSMILQEERQR--EARTSITPMMDSLSFAVKYNS----- 921
             +   + QI++++P P +D VFS+++Q+ER+       S++      + A++ NS     
Sbjct: 189  RFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNF 248

Query: 922  -------EKGKQVS---DVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK--H 1065
                    KGK  S   + VC HCGK  H  D C+  IG+PP +K  K K  +   +  +
Sbjct: 249  NGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQANN 308

Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALI--NKDGMQPVSSGTSSS---LH-- 1224
            ++N S+ +    GS  S  S   FTQ+  Q ++  +  +K G QP ++  ++S   LH  
Sbjct: 309  TSNASALESTQQGS--SAQSSFQFTQEMYQGILEALQQSKVGSQPKANSVTTSPFALHSP 366

Query: 1225 FSNMAGIFPSPNLLSHTAT 1281
             SN  G  PS  +L   +T
Sbjct: 367  SSNPNGKNPSLWILDTAST 385


>ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max]
          Length = 389

 Score =  251 bits (640), Expect = 4e-76
 Identities = 133/353 (37%), Positives = 202/353 (57%), Gaps = 22/353 (6%)
 Frame = +1

Query: 226  FYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDSLYIPW 405
            +YLH  ++   V++SP LT  NY TW  +  +AL  KNK   +DGS+P P   D LY PW
Sbjct: 17   YYLHPNENPALVLVSPSLTAKNYHTWSHSMHIALISKNKDKFIDGSLPKPPVSDPLYAPW 76

Query: 406  LRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQ 585
            +RCN ++L+W+  S+S  IA ++L+I++A  +W  L+ RFSQ D  RI  LQ+ L    Q
Sbjct: 77   IRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQ 136

Query: 586  GTSTVSEYFTQLNAVWEELKNYRPIPFCSCGL-CTCSALRSYGEIQSCDYVFKFLMGLNE 762
            GT  VS+YFTQL   W+EL+NYRPIP C C + C+C  + S    +  DYV +FL GLN+
Sbjct: 137  GTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVVRFLKGLND 196

Query: 763  SYEGIRGQILLISPTPSLDKVFSMILQEERQR--EARTSITPMMDSLSFAVKYNS----- 921
             +   + QI++++P P +D VFS+++Q+ER+       S++      + A++ NS     
Sbjct: 197  RFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNF 256

Query: 922  -------EKGKQVS---DVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK--H 1065
                    KGK  S   + VC HCGK  H  D C+  IG+PP +K  K K  +   +  +
Sbjct: 257  NGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQANN 316

Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALI--NKDGMQPVSSGTSSS 1218
            ++N S+ +    GS  S  S   FTQ+  Q ++  +  +K G QP ++  ++S
Sbjct: 317  TSNASALESTQQGS--SAQSSFQFTQEMYQGILEALQQSKVGSQPKANSVTTS 367


>gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja]
          Length = 484

 Score =  253 bits (645), Expect = 1e-75
 Identities = 143/379 (37%), Positives = 214/379 (56%), Gaps = 27/379 (7%)
 Frame = +1

Query: 226  FYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPNSDDSLYIPW 405
            +YLH  ++   V++SP LT  NY TW R+  +AL  KNK   +DGS+P P   D LY PW
Sbjct: 9    YYLHPNENPALVLVSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPKPPVSDPLYAPW 68

Query: 406  LRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQLQQRLSSIVQ 585
            +RCN ++L+W+  S+S  IA ++L+I++A  +W  L+ RFSQ D  RI  LQ+ L    Q
Sbjct: 69   IRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQ 128

Query: 586  GTSTVSEYFTQLNAVWEELKNYRPIPFCSCGL-CTCSALRSYGEIQSCDYVFKFLMGLNE 762
            GT  VS+YFTQL   W+EL+NYRPIP C C + C+C  + S    +  DYV +FL GLN+
Sbjct: 129  GTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLND 188

Query: 763  SYEGIRGQILLISPTPSLDKVFSMILQEERQR--EARTSITPMMDSLSFAVKYNS----- 921
             +   + QI++++P P +D VFS+++Q+ER+       S++      + A++ NS     
Sbjct: 189  RFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNF 248

Query: 922  -------EKGKQVS---DVVCEHCGKGGHSRDKCYRLIGFPPNFKFTKGKPRNQGQK--H 1065
                    KGK  S   + VC HCGK  H  D C+  IG+PP +K  K K  +   +  +
Sbjct: 249  NGKGGYYNKGKGSSKGGNRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQANN 308

Query: 1066 SANISSSQDVHSGSNDSHGSGMIFTQDQVQKLMALI--NKDGMQPVSSGTSSS---LH-- 1224
            ++N S+ +    GS  S  S   FTQ+  Q ++  +  +K G QP ++  ++S   LH  
Sbjct: 309  TSNASALESTQQGS--SAQSSFQFTQEMYQGILEALQQSKVGSQPKANLVTTSPFALHSP 366

Query: 1225 FSNMAGIFPSPNLLSHTAT 1281
             SN  G  PS  +L   +T
Sbjct: 367  SSNPNGKNPSLWILDTAST 385


>ref|XP_007037468.1| Integrase, catalytic region, putative [Theobroma cacao]
           gi|508774713|gb|EOY21969.1| Integrase, catalytic region,
           putative [Theobroma cacao]
          Length = 242

 Score =  244 bits (624), Expect = 1e-75
 Identities = 109/220 (49%), Positives = 154/220 (70%)
 Frame = +1

Query: 199 SPTEDPNHHFYLHHTDHANTVVISPPLTGSNYVTWCRAFTLALSIKNKLGLLDGSIPTPN 378
           SP  DP   ++LHHT+H  +V+I+P LT +NYVTW R+F LALSI+NK G ++G+I  P 
Sbjct: 19  SPIGDPQFPYFLHHTNHPGSVIINPKLTTTNYVTWSRSFLLALSIRNKKGFINGTISKPQ 78

Query: 379 SDDSLYIPWLRCNNLILSWLLNSVSKEIASNILYINSAKEIWDKLKTRFSQPDNIRIFQL 558
             D LY  W+RCNNLI++WLL+S++  IAS I Y++S  +IW+ LK  F+QPD+ R+  L
Sbjct: 79  PTDPLYPSWIRCNNLIVAWLLDSITPPIASTIFYMDSVVDIWNTLKQSFAQPDDSRVCNL 138

Query: 559 QQRLSSIVQGTSTVSEYFTQLNAVWEELKNYRPIPFCSCGLCTCSALRSYGEIQSCDYVF 738
           Q  L ++ QGT +V  YF +L  +WEEL+NYRP+P C CG  +    R Y +    D VF
Sbjct: 139 QYTLGNVTQGTRSVDSYFIELKGIWEELRNYRPLPHCVCGKYSPECFRRYSDQYQKDMVF 198

Query: 739 KFLMGLNESYEGIRGQILLISPTPSLDKVFSMILQEERQR 858
           +FL GLN+ +  +R QI+L+ P PSLDKV++++L+EE QR
Sbjct: 199 RFLNGLNDFFSAVRSQIILMDPIPSLDKVYNLVLREEAQR 238


Top