BLASTX nr result

ID: Rehmannia22_contig00020173 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00020173
         (827 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006826167.1| hypothetical protein AMTR_s04947p00003620 [A...   234   4e-59
ref|XP_006494715.1| PREDICTED: uncharacterized protein LOC102612...   228   3e-57
ref|XP_004492121.1| PREDICTED: uncharacterized protein LOC101498...   194   2e-47
emb|CAN69709.1| hypothetical protein VITISV_018584 [Vitis vinifera]   193   5e-47
gb|ABD63156.1| Retrotransposon gag protein [Asparagus officinalis]    186   9e-45
emb|CAN78493.1| hypothetical protein VITISV_037041 [Vitis vinifera]   177   5e-42
ref|XP_006591683.1| PREDICTED: uncharacterized protein LOC100809...   171   3e-40
gb|AAB18645.1| unknown [Hordeum vulgare]                              171   3e-40
ref|XP_006593040.1| PREDICTED: uncharacterized protein LOC100794...   169   1e-39
gb|ACY01934.1| hypothetical protein [Beta vulgaris]                   169   1e-39
ref|XP_006603612.1| PREDICTED: uncharacterized protein LOC102666...   168   2e-39
dbj|BAB10790.1| retroelement pol polyprotein-like [Arabidopsis t...   168   2e-39
gb|AAD17354.1| contains similarity to Arabidopsis thaliana retro...   167   5e-39
emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera]   167   5e-39
ref|XP_006588029.1| PREDICTED: uncharacterized protein LOC100776...   166   9e-39
gb|EOY26223.1| Uncharacterized protein TCM_027661 [Theobroma cacao]   166   9e-39
ref|XP_006465129.1| PREDICTED: uncharacterized protein LOC102627...   166   1e-38
ref|XP_006606639.1| PREDICTED: uncharacterized protein LOC102663...   165   2e-38
ref|XP_006577427.1| PREDICTED: uncharacterized protein LOC102667...   165   2e-38
pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrot...   164   3e-38

>ref|XP_006826167.1| hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]
           gi|548830333|gb|ERM93404.1| hypothetical protein
           AMTR_s04947p00003620 [Amborella trichopoda]
          Length = 379

 Score =  234 bits (596), Expect = 4e-59
 Identities = 113/243 (46%), Positives = 162/243 (66%), Gaps = 2/243 (0%)
 Frame = +3

Query: 105 RAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQ-NQFGGSPVDDPNTHLATFL 281
           RAIR+Y  P  N+   GIVR  I A  FELKP +  M+Q   QF G P +DP+ HL +FL
Sbjct: 21  RAIREYAAPMFNELNPGIVRPEIQAPQFELKPVMFQMLQTVGQFSGMPTEDPHLHLRSFL 80

Query: 282 EICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSK 461
           E+ D+ K+ GV++E +RL+LFPFSLRD+AR WL T  P S+T+W+DL +KFL K+FPP++
Sbjct: 81  EVSDSFKIQGVSEEVLRLKLFPFSLRDRARSWLNTLPPDSVTNWNDLAEKFLRKYFPPTR 140

Query: 462 TLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILD 641
             + +SEI  FQQ++ E   +A ERFK+LLR+CPHHG     +++ FYNGLN  +R +LD
Sbjct: 141 NAKFRSEIMSFQQLEDESTSDAWERFKELLRKCPHHGIPHCIQMETFYNGLNAASRMVLD 200

Query: 642 AASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERS-APKPIAGVLELDTMSALAAQISSLT 818
           A++ G ++                N+YQW + R+   + +AGVLE+D ++AL AQ++S+T
Sbjct: 201 ASANGAILSKSYNEAFEILETIASNNYQWSNTRAPTSRKVAGVLEVDAITALTAQMASMT 260

Query: 819 KQL 827
             L
Sbjct: 261 NVL 263


>ref|XP_006494715.1| PREDICTED: uncharacterized protein LOC102612045 [Citrus sinensis]
          Length = 810

 Score =  228 bits (580), Expect = 3e-57
 Identities = 117/244 (47%), Positives = 158/244 (64%), Gaps = 2/244 (0%)
 Frame = +3

Query: 102 ERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQ-NQFGGSPVDDPNTHLATF 278
           ++AIRDY + T    + GIVR  + ANNFELKP +  M+Q   QF G P  D + HL  F
Sbjct: 47  DKAIRDYAVLTPQAIHPGIVRPDVQANNFELKPVMFQMLQTVGQFNGLPSKDLHPHLKLF 106

Query: 279 LEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPS 458
           LE+ D  K+ G + EA+RLRLF FSLRD+AR WL +  P SIT+W DL  KFL K+FPP+
Sbjct: 107 LEVSDAFKIAGASQEALRLRLFSFSLRDRARAWLNSLPPDSITTWSDLADKFLLKYFPPT 166

Query: 459 KTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTIL 638
           K  +L++EI  F Q++ E L +A ERFK+LLRRCPHHG     +++  YNGLN  TR I+
Sbjct: 167 KNAKLRNEITSFHQLEDESLCDAWERFKELLRRCPHHGIPCCIQLETLYNGLNQSTRLIV 226

Query: 639 DAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSER-SAPKPIAGVLELDTMSALAAQISSL 815
           DA++ G L+                N+YQWPS R +A +  AGV  +D ++AL+AQ++SL
Sbjct: 227 DASANGALLFKSYNEAYEILERIANNNYQWPSTRQAATRGTAGVHNVDALTALSAQVTSL 286

Query: 816 TKQL 827
           TK +
Sbjct: 287 TKMV 290


>ref|XP_004492121.1| PREDICTED: uncharacterized protein LOC101498022 [Cicer arietinum]
          Length = 544

 Score =  194 bits (494), Expect = 2e-47
 Identities = 96/201 (47%), Positives = 126/201 (62%)
 Frame = +3

Query: 213 MVQQNQFGGSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFA 392
           MVQQ Q  G+P DDPN +L+  LE CDT+KMNGVT + IRLRLFPF LRD+AR WL +  
Sbjct: 1   MVQQKQLSGTPTDDPNLYLSISLESCDTLKMNGVTYDTIRLRLFPFPLRDRARAWLHSLP 60

Query: 393 PGSITSWDDLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHG 572
             SIT+WD L Q FL ++FPPSKT QL+++I  F Q + E LYEA E FK++LR CPHHG
Sbjct: 61  SESITTWDQLKQAFLGRYFPPSKTAQLRNQITSFSQKEGESLYEAWENFKEMLRLCPHHG 120

Query: 573 YADWQRVQYFYNGLNGHTRTILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPK 752
              W  +  FYN L+  TR  +D  +GG  +                N YQW S+RS P 
Sbjct: 121 MERWLIIHTFYNELSYTTRMTVDDDAGGAFINKNIEESYALIEDMEHNHYQWSSDRS-PH 179

Query: 753 PIAGVLELDTMSALAAQISSL 815
              G+ E+D +  +A+++ +L
Sbjct: 180 NKGGMYEVDALDHIASKVDAL 200


>emb|CAN69709.1| hypothetical protein VITISV_018584 [Vitis vinifera]
          Length = 363

 Score =  193 bits (491), Expect = 5e-47
 Identities = 113/257 (43%), Positives = 146/257 (56%), Gaps = 1/257 (0%)
 Frame = +3

Query: 48  NENNGNRVVNVPPQPEAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQN 227
           NEN+G+  V        P RA++DYF+P V    S I R PI ANNFE+K  +I M+Q +
Sbjct: 142 NENHGDNGV--------PNRALKDYFVPNVG--VSSIRRPPIQANNFEIKLAIIQMIQSS 191

Query: 228 -QFGGSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSI 404
            QFGG   DDPN H+A FLEI  T K NGVTD+AIRLRLFPF L +KA+ WL +  PG+I
Sbjct: 192 IQFGGLANDDPNLHIANFLEIFYTFKHNGVTDDAIRLRLFPFPLNNKAKAWLISLPPGTI 251

Query: 405 TSWDDLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADW 584
           T+WD L                           D E LYEA ERFKDLLR+C HHG   W
Sbjct: 252 TTWDGLQ--------------------------DQESLYEARERFKDLLRKCSHHGLPMW 285

Query: 585 QRVQYFYNGLNGHTRTILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAG 764
            +VQ FYN L+ +T+T++DA S G  +                N++   ++R+A K I G
Sbjct: 286 MQVQTFYNSLHXNTQTMVDAPSXGXFINKTPEEGYQLIEVMAXNNFLKSTDRNAQKRIVG 345

Query: 765 VLELDTMSALAAQISSL 815
           V + D  + LA Q++ L
Sbjct: 346 VHDFDAFNNLATQVTIL 362


>gb|ABD63156.1| Retrotransposon gag protein [Asparagus officinalis]
          Length = 275

 Score =  186 bits (472), Expect = 9e-45
 Identities = 90/171 (52%), Positives = 118/171 (69%)
 Frame = +3

Query: 303 MNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQLKSE 482
           MNGV+D+AI+LRLFPFSLRDKAR WLQ+  PGSIT+WD L++ FLAK+FPPSKT QL+++
Sbjct: 1   MNGVSDDAIKLRLFPFSLRDKARAWLQSLPPGSITTWDQLSEAFLAKYFPPSKTAQLRNQ 60

Query: 483 IAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDAASGGTL 662
           I  F Q + E LY+A ER+KDLLR CPHHG  DW  +  FYNGL  +TR  +DAA+GG L
Sbjct: 61  ITTFTQKEGESLYDAWERYKDLLRMCPHHGLEDWLIIHTFYNGLLYNTRMTVDAAAGGAL 120

Query: 663 MXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQISSL 815
           M                N +QW  ERS PK  +G  ++D +  +A+++ +L
Sbjct: 121 MNKSVRDAKQLIEDMAQNHFQWSGERSLPKK-SGRYDVDALDHIASRVDAL 170


>emb|CAN78493.1| hypothetical protein VITISV_037041 [Vitis vinifera]
          Length = 1048

 Score =  177 bits (448), Expect = 5e-42
 Identities = 88/187 (47%), Positives = 120/187 (64%)
 Frame = +3

Query: 255 PNTHLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKF 434
           P    A   EI DT K NGVTD+AIRLRLFPFSL +KA+ WL +  PG+IT+WD L   F
Sbjct: 8   PKDAAACCFEIRDTFKHNGVTDDAIRLRLFPFSLNNKAKAWLISLPPGTITTWDGLVNAF 67

Query: 435 LAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGL 614
           LAK+FP +K+ +++++I  F Q D E LYEA ERFKDLLR+CPHHG   W + Q FYN L
Sbjct: 68  LAKYFPLAKSTKMRNDITNFLQQDQESLYEAWERFKDLLRKCPHHGLPIWMQAQMFYNSL 127

Query: 615 NGHTRTILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSAL 794
           + +T+T++DAASGG  +                N++   ++R+A K   GV ++D  + L
Sbjct: 128 HPNTQTMVDAASGGAFINKTPDEGYQLIKVMASNNFLKSTDRNAQKRTVGVHDIDVFNNL 187

Query: 795 AAQISSL 815
           A Q++ L
Sbjct: 188 ATQVAIL 194


>ref|XP_006591683.1| PREDICTED: uncharacterized protein LOC100809313 [Glycine max]
          Length = 471

 Score =  171 bits (433), Expect = 3e-40
 Identities = 90/239 (37%), Positives = 137/239 (57%)
 Frame = +3

Query: 111 IRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNTHLATFLEIC 290
           + DY  P + Q ++ I R  + A NF     LI ++Q N F G P +DP  HLAT+++IC
Sbjct: 3   LEDYSSPIIPQYFTSIARPEVQAANFSYPYSLIQLIQGNLFHGLPSEDPYAHLATYIDIC 62

Query: 291 DTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQ 470
           +T+K+ GV ++AIRL LF FSL D+A+ WL+ F   S+ +WD++ +KFL K+FP SKT +
Sbjct: 63  NTVKIAGVPEDAIRLNLFCFSLADEAKIWLRLFKGNSLWTWDEVVEKFLKKYFPESKTAE 122

Query: 471 LKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDAAS 650
            K EI+ F Q   E L EA +RF  LLR+ P HGY++  ++  F +GL   ++ ILDA++
Sbjct: 123 GKMEISLFHQFPNESLSEALDRFHGLLRKMPTHGYSESVQLNIFIDGLRPQSKQILDASA 182

Query: 651 GGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQISSLTKQL 827
            G +                 + +    +R+       +LEL T  A  AQ   L++Q+
Sbjct: 183 RGKIKLKTPEEAMELIENMAASDHAILPDRTYAPTKRSLLELTTQDATLAQNKLLSRQI 241


>gb|AAB18645.1| unknown [Hordeum vulgare]
          Length = 337

 Score =  171 bits (433), Expect = 3e-40
 Identities = 85/231 (36%), Positives = 137/231 (59%), Gaps = 5/231 (2%)
 Frame = +3

Query: 138 NQNYSGIVRQPI----NANNFELKPGLISMVQQNQFGGSPVDDPNTHLATFLEICDTIKM 305
           N N +  +  PI    NA ++E+   L+++V + QF G P +D  +HL TF+E+CD  K 
Sbjct: 12  NTNNNDFISTPIAPATNAESYEINAALLNLVMKEQFSGLPSEDVASHLNTFIELCDMQKK 71

Query: 306 NGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQLKSEI 485
             V ++ I+L+LFPFSLRD+A+ W  +    SI SWD     +++K+FPP+K + L+++I
Sbjct: 72  KDVDNDVIKLKLFPFSLRDRAKTWFSSLPKSSIDSWDKCKDAYISKYFPPAKIISLRNDI 131

Query: 486 AQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDAASGGTLM 665
             F+Q+D E + +A ER K ++R CP +G + W  +Q FY GLN  +R ILD+A+GGT M
Sbjct: 132 MNFKQLDHEHVAQAWERMKLMIRNCPANGLSLWMIIQIFYAGLNFASRNILDSATGGTFM 191

Query: 666 XXXXXXXXXXXXXXXXNSYQWPSERS-APKPIAGVLELDTMSALAAQISSL 815
                           N  QW +ERS   K +  + E++++S+   ++ +L
Sbjct: 192 EITLGEATKLLDNIMTNYSQWHTERSPTSKKVHVIEEINSLSSKMDELMNL 242


>ref|XP_006593040.1| PREDICTED: uncharacterized protein LOC100794810 [Glycine max]
          Length = 551

 Score =  169 bits (428), Expect = 1e-39
 Identities = 87/245 (35%), Positives = 139/245 (56%)
 Frame = +3

Query: 93  EAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNTHLA 272
           + P   + DY  P + Q ++ I R  + A NF     LI ++Q N F G P +DP  HLA
Sbjct: 4   DRPRMTLEDYSSPIIPQYFTSIARPEVQAANFSYPYSLIQLIQGNLFHGLPSEDPYAHLA 63

Query: 273 TFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFP 452
            +++IC+ +K+ GV + A RL LF FSL  KA+ WL++F   S+ +W+++ +KFL K+FP
Sbjct: 64  IYIDICNMVKIVGVPENATRLNLFSFSLAGKAKIWLRSFKGNSLRTWEEVVEKFLKKYFP 123

Query: 453 PSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRT 632
            SKT++ K EI+ F Q   E L EA +RF+ LLR+ P HGY++  ++  F +GL   ++ 
Sbjct: 124 ESKTVEGKLEISSFHQFLDESLSEALDRFRGLLRKTPTHGYSEPVQLNIFIDGLRPQSKQ 183

Query: 633 ILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQISS 812
           +LDA++GG +                 + +    +R+       +LEL T  A+ AQ   
Sbjct: 184 LLDASAGGKIKLKTSEEAMKLIENMAASDHAILRDRTYAPTKRSLLELTTQDAILAQKKL 243

Query: 813 LTKQL 827
           L++Q+
Sbjct: 244 LSQQI 248


>gb|ACY01934.1| hypothetical protein [Beta vulgaris]
          Length = 1717

 Score =  169 bits (428), Expect = 1e-39
 Identities = 97/241 (40%), Positives = 136/241 (56%)
 Frame = +3

Query: 105 RAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNTHLATFLE 284
           R  RDY  P+ +   +G+    I A NFE+K  +++MVQ NQF G P +DPN HL  FL+
Sbjct: 7   RTFRDYAQPSPDNVPTGVPMPTIAATNFEIKSHVLNMVQNNQFAGLPSEDPNQHLQRFLQ 66

Query: 285 ICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKT 464
            C T K  GVT + ++L LF FSLRDKA  +     P +IT+W +L + FL K++P  KT
Sbjct: 67  CCATQKQAGVTPDEMKLLLFGFSLRDKAL-FCYNKLPSTITTWPELFKIFLIKWYPYQKT 125

Query: 465 LQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDA 644
             ++  I  F Q   E LYEA ERFKDL R CPHHG   W   Q FY  ++  TR ++D 
Sbjct: 126 ADMRHAIVTFTQDPGESLYEAWERFKDLQRLCPHHGLDQWYLCQIFYTRVDADTRRVIDG 185

Query: 645 ASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQISSLTKQ 824
           ASGG  M                N  Q  + RS+ K   G  +++ +S L+ Q+++LT++
Sbjct: 186 ASGGFFMDKAIEEGYELLEKLASN--QASTIRSSLKK-GGKHDVEAISLLSGQLATLTQK 242

Query: 825 L 827
           +
Sbjct: 243 I 243


>ref|XP_006603612.1| PREDICTED: uncharacterized protein LOC102666104 [Glycine max]
          Length = 657

 Score =  168 bits (426), Expect = 2e-39
 Identities = 88/245 (35%), Positives = 137/245 (55%)
 Frame = +3

Query: 93  EAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNTHLA 272
           +A    + D+      Q ++ I R  + A N      LI ++Q N F G P +DP  HLA
Sbjct: 56  QARRVTLEDFSNTATPQFFTSIARPEVQAANISYPHSLIQLIQGNLFHGLPSEDPYAHLA 115

Query: 273 TFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFP 452
           +++EIC+T+K+ GV ++A+RL LF FSL  +A++WL +F   S+ +W+++ +KFL K+FP
Sbjct: 116 SYIEICNTVKIAGVPEDAVRLNLFSFSLAGEAKRWLHSFKGNSLRTWEEVVEKFLKKYFP 175

Query: 453 PSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRT 632
            SKT + K EI+ F Q   E L EA +RF  LLR+ P HGY++  ++  F +GL  H++ 
Sbjct: 176 ESKTAEGKMEISSFHQFPDESLSEALDRFHGLLRKTPTHGYSEPVQLNIFIDGLRPHSKQ 235

Query: 633 ILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQISS 812
           +LDA++GG +                 +      +RS       +LEL T  A  AQ   
Sbjct: 236 LLDASAGGKIKLKTPEEAMELIENMAASDQAILRDRSYVPTKRSLLELGTQDATLAQNKL 295

Query: 813 LTKQL 827
           LT+Q+
Sbjct: 296 LTRQI 300


>dbj|BAB10790.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
          Length = 1864

 Score =  168 bits (426), Expect = 2e-39
 Identities = 82/179 (45%), Positives = 114/179 (63%)
 Frame = +3

Query: 129 PTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNTHLATFLEICDTIKMN 308
           P  +   +GIV  P+  NNFE+K GLI+MVQ N+F G P++DP  HL  F  +C   K+N
Sbjct: 52  PRNHNQRNGIVPPPVQNNNFEIKSGLIAMVQSNKFHGLPMEDPLDHLDEFDRLCSLTKIN 111

Query: 309 GVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQLKSEIA 488
           GV+++  +LRLFPFSL DKA QW ++   GSITSW+D  + FLAKFF  S+T +L+++I+
Sbjct: 112 GVSEDGFKLRLFPFSLGDKAHQWEKSLLQGSITSWNDCKKAFLAKFFSNSRTARLRNDIS 171

Query: 489 QFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDAASGGTLM 665
            F Q + E   EA ERFK    +CPHHG++    +   Y G+    R +LD AS G  +
Sbjct: 172 GFTQTNNETFCEAWERFKGYQTQCPHHGFSKASLLSTLYRGVLPKIRMLLDTASNGNFL 230


>gb|AAD17354.1| contains similarity to Arabidopsis thaliana retrotransposon Athila
           hypothetical protein 1 (GB:X81801) [Arabidopsis
           thaliana] gi|7267376|emb|CAB77937.1| putative athila
           transposon protein [Arabidopsis thaliana]
          Length = 446

 Score =  167 bits (422), Expect = 5e-39
 Identities = 89/203 (43%), Positives = 120/203 (59%), Gaps = 2/203 (0%)
 Frame = +3

Query: 63  NRVVNV--PPQPEAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFG 236
           N++V+V  PP  + P R I     P  +    GIV  P+  NNFE+  GLISM+Q N+F 
Sbjct: 5   NKLVDVQDPPNVDQP-RNIGAGDAPRNHHQRQGIVPPPVQINNFEIMSGLISMIQGNKFH 63

Query: 237 GSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWD 416
           G P +DP  +L +F  +C   K+NGVT +  +LRLFPFSL DKA  W +T  P SITSWD
Sbjct: 64  GLPKEDPLDNLDSFDRLCGLTKINGVTKDMFKLRLFPFSLGDKAHHWKKTLPPDSITSWD 123

Query: 417 DLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQ 596
           D  + FLAKFF  ++T +L++EI+ F Q + E  +EA ERFK     CPHHG+     ++
Sbjct: 124 DCKKDFLAKFFSNARTARLRNEISGFTQKNNETFFEASERFKSYTTYCPHHGFKKASLLR 183

Query: 597 YFYNGLNGHTRTILDAASGGTLM 665
             Y G     R +LD  S G  +
Sbjct: 184 TLYRGALPKIRMLLDTTSNGNFL 206


>emb|CAN69639.1| hypothetical protein VITISV_040272 [Vitis vinifera]
          Length = 437

 Score =  167 bits (422), Expect = 5e-39
 Identities = 86/168 (51%), Positives = 114/168 (67%), Gaps = 1/168 (0%)
 Frame = +3

Query: 48  NENNGNRVVNVPPQPEAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQN 227
           +EN+G+  V        P RA++DY +P V      I R PI ANNFE+K  +I M++ +
Sbjct: 210 DENHGDNGV--------PNRALKDYSIPNVG--VLSIQRPPIQANNFEIKLAIIQMIRSS 259

Query: 228 -QFGGSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSI 404
            QFGG   DDPN H+A FLEICDT K NGV D+AIRLRLFPFSL +KA+ WL +  PG+I
Sbjct: 260 VQFGGLANDDPNLHIANFLEICDTFKHNGVIDDAIRLRLFPFSLNNKAKAWLISLPPGTI 319

Query: 405 TSWDDLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDL 548
           T+WD L   FL K+FPP+K+++++++I  F Q D E LYEA ER  +L
Sbjct: 320 TTWDGLVNAFLTKYFPPAKSIKMRNDITNFLQQDQESLYEAWERKLEL 367


>ref|XP_006588029.1| PREDICTED: uncharacterized protein LOC100776307 [Glycine max]
          Length = 1898

 Score =  166 bits (420), Expect = 9e-39
 Identities = 87/245 (35%), Positives = 136/245 (55%)
 Frame = +3

Query: 93   EAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNTHLA 272
            +A    + D+      Q ++ I R  + A N      LI ++Q N F G P +DP  HLA
Sbjct: 297  QARRVTLEDFSNTATPQFFTSIARPEVQAANISYPHSLIQLIQGNLFHGLPSEDPYAHLA 356

Query: 273  TFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFFP 452
            +++EIC+T+K+ GV ++A+RL LF FSL  +A++WL +F   ++ +W+++ +KFL K+FP
Sbjct: 357  SYIEICNTVKIAGVPEDAVRLNLFSFSLAGEAKRWLHSFKGNNLRTWEEVVEKFLKKYFP 416

Query: 453  PSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRT 632
             SKT + K EI+ F Q   E L EA +RF  LLR+ P HGY+   ++  F +GL  H++ 
Sbjct: 417  ESKTAEGKMEISSFHQFPDESLSEALDRFHGLLRKTPTHGYSKPVQLNIFIDGLRPHSKQ 476

Query: 633  ILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQISS 812
            +LDA++GG +                 +      +RS       +LEL T  A  AQ   
Sbjct: 477  LLDASAGGKIKLKTPEEAMELIENMAASDQAILRDRSYVPTKRSLLELGTQDATLAQNKL 536

Query: 813  LTKQL 827
            LT+Q+
Sbjct: 537  LTRQI 541


>gb|EOY26223.1| Uncharacterized protein TCM_027661 [Theobroma cacao]
          Length = 250

 Score =  166 bits (420), Expect = 9e-39
 Identities = 97/245 (39%), Positives = 130/245 (53%), Gaps = 1/245 (0%)
 Frame = +3

Query: 93  EAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQN-QFGGSPVDDPNTHL 269
           E   + + +Y +  V   +S I R  I  NNFE+K  +I M+Q + QFG SP DD N ++
Sbjct: 28  EEEAKYLLEYVVRLVQSLHSSIRRLAIQVNNFEIKLPIIQMIQTSIQFGRSPNDDLNAYI 87

Query: 270 ATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAKFF 449
             FLEICDT K NGVT++ IRLRLFPFSLRDK + WL +     I++ DDL QKFLAK F
Sbjct: 88  VNFLEICDTFKHNGVTNDVIRLRLFPFSLRDKIKSWLNSLIASFISTRDDLAQKFLAKLF 147

Query: 450 PPSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGHTR 629
           PP+KT  + + I  F Q + E LYEA ER                               
Sbjct: 148 PPTKTANMWNGITSFVQFNPESLYEAWER------------------------------- 176

Query: 630 TILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQIS 809
           T +DA + G LM                N+YQWP E+   + +A V ELD ++A  AQ++
Sbjct: 177 TTIDATTSGALMDKSIDEAYDLLKEIAFNNYQWPCEKLVLRKVASVHELDGINAFTAQVT 236

Query: 810 SLTKQ 824
            L+K+
Sbjct: 237 VLSKK 241


>ref|XP_006465129.1| PREDICTED: uncharacterized protein LOC102627778 [Citrus sinensis]
          Length = 1020

 Score =  166 bits (419), Expect = 1e-38
 Identities = 92/245 (37%), Positives = 136/245 (55%)
 Frame = +3

Query: 84  PQPEAPERAIRDYFLPTVNQNYSGIVRQPINANNFELKPGLISMVQQNQFGGSPVDDPNT 263
           P  +   R +++   P ++Q     +  P    NFELK G+I ++  + F G   +DPN 
Sbjct: 42  PMAQNNNRTLKELAAPNLDQQPL-CIENPNPQVNFELKSGMIHLL--HTFHGLVGEDPNK 98

Query: 264 HLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKARQWLQTFAPGSITSWDDLTQKFLAK 443
           HL  F  +C T+K  GV++E ++L  FPFSL D A++WL     G++T+W+++ Q FL K
Sbjct: 99  HLKEFHVVCSTMKPAGVSEEQVKLMAFPFSLADSAKEWLYYLPSGTVTTWNEMRQLFLEK 158

Query: 444 FFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDLLRRCPHHGYADWQRVQYFYNGLNGH 623
           +FP SK   ++ EI   +Q + EPLY+  ERFK L   CPHH  +D   +QYFY GL   
Sbjct: 159 YFPASKAGSIRKEICGIRQYNGEPLYDYWERFKKLCASCPHHQISDQLLIQYFYEGLLPM 218

Query: 624 TRTILDAASGGTLMXXXXXXXXXXXXXXXXNSYQWPSERSAPKPIAGVLELDTMSALAAQ 803
            R+++DAASGG L+                NS Q+ ++     P  GV E+ T S L  Q
Sbjct: 219 DRSMIDAASGGALVDKTPEAARNLIANMAANSQQFNTKNDLLPPPKGVNEVSTTS-LEKQ 277

Query: 804 ISSLT 818
           +S+LT
Sbjct: 278 VSNLT 282


>ref|XP_006606639.1| PREDICTED: uncharacterized protein LOC102663756 [Glycine max]
          Length = 507

 Score =  165 bits (417), Expect = 2e-38
 Identities = 92/278 (33%), Positives = 146/278 (52%), Gaps = 8/278 (2%)
 Frame = +3

Query: 18  QNRAMADIVDNENNGNRVVNVPPQPEAPER--------AIRDYFLPTVNQNYSGIVRQPI 173
           +N A+    + +  G+   + P  P   E          + D+      Q ++ I R  +
Sbjct: 23  RNNAVRRRREQDTEGSSHTSPPLSPHHAEMDGEPARRVTLEDFSNTATPQFFTSIARPEV 82

Query: 174 NANNFELKPGLISMVQQNQFGGSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFS 353
            A N      LI ++Q N F G P +DP  HLA+++EIC+T+K+ GV ++A+RL LF FS
Sbjct: 83  QAANISYPHSLIQLIQGNLFHGLPSEDPYAHLASYIEICNTVKIAGVPEDAVRLNLFSFS 142

Query: 354 LRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACE 533
           L  +A++WL +F   S+ +W+++ +KFL K+FP SKT + K EI+ F Q   E L EA +
Sbjct: 143 LTGEAKRWLHSFKGNSLRTWEEVVEKFLKKYFPESKTAEGKMEISSFHQFPDESLSEALD 202

Query: 534 RFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDAASGGTLMXXXXXXXXXXXXXXXX 713
           RF  LLR+   HGY++  ++  F +GL  H++ +LDA++GG +                 
Sbjct: 203 RFHGLLRKTLTHGYSEPVQLNIFIDGLRPHSKQLLDASAGGKIKLKTPEEAMELIENMAA 262

Query: 714 NSYQWPSERSAPKPIAGVLELDTMSALAAQISSLTKQL 827
           +      +RS       +LEL T  A  AQ   LT+Q+
Sbjct: 263 SDQAILRDRSYVPTKRSLLELGTQDATLAQNKLLTRQI 300


>ref|XP_006577427.1| PREDICTED: uncharacterized protein LOC102667981 [Glycine max]
          Length = 628

 Score =  165 bits (417), Expect = 2e-38
 Identities = 91/278 (32%), Positives = 146/278 (52%), Gaps = 8/278 (2%)
 Frame = +3

Query: 18  QNRAMADIVDNENNGNRVVNVPPQPEAPER--------AIRDYFLPTVNQNYSGIVRQPI 173
           +N A     + +  G+   + P  P   E          + D+      Q ++ I R  +
Sbjct: 23  RNNAARRRREQDTEGSSYTSPPLSPHHTEMDGESARRVTLEDFCNTATPQFFTSIARPEV 82

Query: 174 NANNFELKPGLISMVQQNQFGGSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFS 353
            A N      LI ++Q N F G P +DP  HLA+++EIC+T+K+ GV  +A+RL LF FS
Sbjct: 83  QAANISYPHSLIQLIQGNLFYGLPSEDPYAHLASYIEICNTVKIVGVPKDAVRLNLFSFS 142

Query: 354 LRDKARQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACE 533
           L ++A++WL +F   S+ +W+++ +KFL K+FP SKT++ K EI+ F Q   E L EA +
Sbjct: 143 LAEEAKRWLHSFKGNSLRTWEEVVEKFLKKYFPKSKTVEGKMEISSFHQFPDESLSEALD 202

Query: 534 RFKDLLRRCPHHGYADWQRVQYFYNGLNGHTRTILDAASGGTLMXXXXXXXXXXXXXXXX 713
           RF  LLR+ P HGY++  ++  F +GL   ++ +LDA +GG +                 
Sbjct: 203 RFHGLLRKTPTHGYSEPVQLNIFIDGLRPQSKQLLDAFAGGKIKLKTPEEAMELIENMVV 262

Query: 714 NSYQWPSERSAPKPIAGVLELDTMSALAAQISSLTKQL 827
           +      +R+       +LELD   A+ AQ   LT+Q+
Sbjct: 263 SDQAILRDRTYVPTKRSLLELDMQDAMLAQNKLLTRQI 300


>pir||S66306 hypothetical protein 1 - Arabidopsis thaliana retrotransposon
           Athila gi|806535|emb|CAA57397.1| unnamed protein product
           [Arabidopsis thaliana]
          Length = 935

 Score =  164 bits (416), Expect = 3e-38
 Identities = 91/219 (41%), Positives = 124/219 (56%)
 Frame = +3

Query: 9   KRAQNRAMADIVDNENNGNRVVNVPPQPEAPERAIRDYFLPTVNQNYSGIVRQPINANNF 188
           ++ +   MAD+VD +             E P       F    NQ + GIV  P+  NNF
Sbjct: 26  EQTETDTMADVVDEQ-------------EQPTNIGAGDFPHNHNQRH-GIVPPPVQNNNF 71

Query: 189 ELKPGLISMVQQNQFGGSPVDDPNTHLATFLEICDTIKMNGVTDEAIRLRLFPFSLRDKA 368
           E+K GLI+MVQ N+F G  ++DP  HL  F  +C   K+NGV+++  +LRLFPFSL DKA
Sbjct: 72  EIKSGLIAMVQGNKFHGLLMEDPLDHLDEFERLCRLTKINGVSEDGFKLRLFPFSLGDKA 131

Query: 369 RQWLQTFAPGSITSWDDLTQKFLAKFFPPSKTLQLKSEIAQFQQIDFEPLYEACERFKDL 548
             W +T   GSIT+WDD  + FLAKFF  S+T +L++EI+ F Q   E   EA ERFK  
Sbjct: 132 HLWEKTLPHGSITTWDDCKKAFLAKFFSNSRTARLRNEISGFTQKQNESFCEAWERFKGY 191

Query: 549 LRRCPHHGYADWQRVQYFYNGLNGHTRTILDAASGGTLM 665
             +CPHHG+ +   +   Y G+    R +LD AS G  +
Sbjct: 192 PTKCPHHGFKEASLLSTLYRGVLPKIRMLLDTASNGNFL 230

