BLASTX nr result

ID: Alisma22_contig00020256 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00020256
         (896 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010068882.1 PREDICTED: uncharacterized protein LOC104455862 [...   156   3e-39
XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [...   150   1e-37
XP_010484950.1 PREDICTED: uncharacterized protein LOC104763247 [...   147   1e-37
XP_019086488.1 PREDICTED: uncharacterized protein LOC109126953 [...   143   2e-37
XP_013738037.1 PREDICTED: uncharacterized protein LOC106440823 [...   145   4e-37
XP_018464433.1 PREDICTED: uncharacterized protein LOC108835711 [...   143   2e-36
XP_009127080.1 PREDICTED: uncharacterized protein LOC103851952 [...   143   4e-36
XP_010430659.1 PREDICTED: uncharacterized protein LOC104714886 [...   142   6e-36
XP_013647154.1 PREDICTED: uncharacterized protein LOC106352005 i...   142   8e-36
JAU70191.1 hypothetical protein LE_TR14648_c0_g1_i1_g.46319 [Noc...   140   3e-35
XP_009117857.1 PREDICTED: uncharacterized protein LOC103842926 [...   137   3e-34
AAK44121.1 putative retroelement pol polyprotein [Arabidopsis th...   137   7e-34
KFK30733.1 hypothetical protein AALP_AA6G020200 [Arabis alpina]       140   9e-34
CAN60203.1 hypothetical protein VITISV_036059 [Vitis vinifera]        140   1e-33
AAC67200.1 putative retroelement pol polyprotein [Arabidopsis th...   139   3e-33
XP_010462983.1 PREDICTED: uncharacterized protein LOC104743624 [...   136   4e-33
OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis tha...   139   4e-33
XP_019089249.1 PREDICTED: uncharacterized protein LOC109128031 [...   134   6e-33
XP_010451841.1 PREDICTED: uncharacterized protein LOC104734030 [...   134   6e-33
XP_010463299.1 PREDICTED: uncharacterized protein LOC104743969 [...   135   9e-33

>XP_010068882.1 PREDICTED: uncharacterized protein LOC104455862 [Eucalyptus grandis]
          Length = 994

 Score =  156 bits (395), Expect = 3e-39
 Identities = 110/333 (33%), Positives = 167/333 (50%), Gaps = 35/333 (10%)
 Frame = -3

Query: 894  SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENF--TVSMMRPPIPPYDEV 721
            ++++YLRE+K +CDQL+AIGK + +  K++ +L  LG E+ENF  T+  ++P  P YDEV
Sbjct: 143  TLANYLREYKSICDQLNAIGKPVDEITKMFGVLEGLGPEYENFRTTIYCLKPQ-PEYDEV 201

Query: 720  VSLLRDHDMR-RASSESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQF--PRSG 550
            ++ L   + R +  S +  +PNLA+  Q+                 Y SQ S F   R G
Sbjct: 202  IAQLERFESRLKTFSRNQFNPNLAYFGQRQFQVQSKDITNE----DYISQASGFISQRGG 257

Query: 549  YQSQHQYPS------------------SNYXXXXXXXXXXQDPGVLGARPE---PFFTSQ 433
            Y+    Y                    +N           +   V+    +   P+ T  
Sbjct: 258  YRGGRSYGRGRTGNNRGRGFERYQNYRTNVSSLGQGNQHLRQRNVISTTNQGFRPYATYS 317

Query: 432  GRGFN----PSIRSS-GLQFQARPP---LFCQICGKIGHDALRCWHRFNNNYQVTEIPKK 277
             R  N    PS  SS  LQ +       L CQIC + GHDALRCW++F+N+YQ  +IP  
Sbjct: 318  QRYQNVKTYPSANSSKNLQDEKETSSIRLECQICKRSGHDALRCWYQFDNSYQAEDIPAA 377

Query: 276  LTEDQVTDQAKALAAMHVGDQQTFNLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMI 97
            LT            A+H+ D      +EW+ DSGAT H++  PG+LH ++ Y G D +MI
Sbjct: 378  LT------------ALHIEDSTG---SEWYPDSGATAHISANPGMLHTLSKYQGHDTVMI 422

Query: 96   GDGSSIPITHVGSAFL-VNDKSITLERVLIEPD 1
            G+GS + +TH+G+ +L   + ++ L  VLI P+
Sbjct: 423  GNGSCLLVTHIGNTWLKTKNSTLPLNDVLIVPE 455


>XP_010064888.1 PREDICTED: uncharacterized protein LOC104452050 [Eucalyptus grandis]
          Length = 616

 Score =  150 bits (379), Expect = 1e-37
 Identities = 109/339 (32%), Positives = 155/339 (45%), Gaps = 41/339 (12%)
 Frame = -3

Query: 894  SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENF--TVSMMRPPIPPYDEV 721
            ++S YLRE+K +CD+L+AIGK + D  K++ +L  LG E+ENF  T+  ++P  P YDEV
Sbjct: 143  ALSVYLREYKYICDRLNAIGKPVDDITKLFGVLEGLGAEYENFRTTIYCLKPQ-PEYDEV 201

Query: 720  VSLLRDHDMRRAS-SESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQF--PRSG 550
            ++ L   + R  + S +  + N+A+  Q+                 Y SQ S F   R  
Sbjct: 202  IAQLERFESRMQNYSRTQFNSNMAYFGQR----HPQAQFKETTDGEYISQNSGFVAQRGN 257

Query: 549  YQSQHQYPSSNYXXXXXXXXXXQDPGV-----------LGARPEPFFTSQGRGFNPSIR- 406
            Y+    Y    +              +              R +P+  S   GF    R 
Sbjct: 258  YRGGRSYGRGRFLNNKGRGFRYPGDNLGYRSSNTSYMQENQRTKPYQNSSDSGFTSGFRP 317

Query: 405  --------------------SSGLQFQARPP---LFCQICGKIGHDALRCWHRFNNNYQV 295
                                S  +Q +       L CQIC K GHDAL CW+RF+N+YQ 
Sbjct: 318  YAGPSQRYQEVKSYPSNLTLSKSMQNEKATSSIKLECQICKKPGHDALHCWYRFDNSYQA 377

Query: 294  TEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLNEWHVDSGATNHVAQTPGILHNVTPYTG 115
             EIP              LAA+H+ D +    +EW+ D+GAT H+     ILHN + YTG
Sbjct: 378  EEIPT------------TLAAIHLKDAKG---SEWYPDTGATAHITANSSILHNSSKYTG 422

Query: 114  IDNIMIGDGSSIPITHVGSAFLVNDKS-ITLERVLIEPD 1
             D +MIGDGS + +T  G+  L   KS + L  VLI PD
Sbjct: 423  YDTVMIGDGSHLSVTCTGNTLLHTGKSLLPLNDVLIVPD 461


>XP_010484950.1 PREDICTED: uncharacterized protein LOC104763247 [Camelina sativa]
          Length = 449

 Score =  147 bits (372), Expect = 1e-37
 Identities = 104/304 (34%), Positives = 149/304 (49%), Gaps = 6/304 (1%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV---SMMRPPIPPYDE 724
           S+S Y REFK +CD L +IGK + +  K++  L+ LG E++       S +    PP ++
Sbjct: 140 SLSTYCREFKSICDSLSSIGKPVDESMKIFGFLNGLGREYDPIATVIQSSLSKLSPPTND 199

Query: 723 VVSLLRDHDMRRASSESLSS--PNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSG 550
           VVS ++  D +  S +  SS  P+LAF++ KT                 PSQ+ +  R G
Sbjct: 200 VVSEVQGFDSKLQSYDDASSVTPHLAFMTDKTNPCAPQFQ---------PSQRGRGGRFG 250

Query: 549 YQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQARPPL 370
                                         R    +T++GRGF+     S  Q Q RP  
Sbjct: 251 QN----------------------------RGRGGYTTRGRGFSQHQSVSPSQGQ-RP-- 279

Query: 369 FCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLNEW 190
            CQICG+IGH A++C++RF NNYQ TE+P            +A A++ V D       EW
Sbjct: 280 ICQICGRIGHTAIKCYNRFENNYQ-TEVP-----------TQAFASLQVSDDSG---REW 324

Query: 189 HVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDK-SITLERVL 13
           H DS AT H+  +   L  V  Y G D +M+GDG+ +PITHVGS  + + K +I L  VL
Sbjct: 325 HPDSAATAHITSSTSGLQEVKAYDGTDAVMVGDGAYLPITHVGSTTISSAKGTIPLHEVL 384

Query: 12  IEPD 1
           + PD
Sbjct: 385 VCPD 388


>XP_019086488.1 PREDICTED: uncharacterized protein LOC109126953 [Camelina sativa]
          Length = 277

 Score =  143 bits (360), Expect = 2e-37
 Identities = 99/296 (33%), Positives = 140/296 (47%), Gaps = 13/296 (4%)
 Frame = -3

Query: 888 SDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEV 721
           S Y REF+ VCD L AIGK +++  KV   L+ L  E++        S+ R P P +++V
Sbjct: 33  SVYCREFRAVCDNLSAIGKHVNESMKVVLFLNGLAREYDPIATVIQSSLSRLPAPTFNDV 92

Query: 720 VSLLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGY 547
           V  +   D +  S ES S  SPNLAF +Q+                           SGY
Sbjct: 93  VLEVSGFDSKLQSYESASDVSPNLAFQAQRGGF------------------------SGY 128

Query: 546 QSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGL-QFQA---- 382
           + +                        G R    F+++ RGF+  + +SG  Q Q+    
Sbjct: 129 RGRGSNSRGR-----------------GGRN---FSTRSRGFSQQVNNSGWNQSQSGGSN 168

Query: 381 --RPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQT 208
             RP   CQICG++GH AL+CW+RF+N YQ  ++P+ L   QV+D               
Sbjct: 169 NIRP--VCQICGRVGHVALKCWNRFDNTYQSDDVPQALAAHQVSDSCG------------ 214

Query: 207 FNLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVND 40
               EW  DSG++ HV  TP  L  VTPY G + +M+GDG+ +PITHVGS  L  +
Sbjct: 215 ---REWVTDSGSSAHVTSTPNQLSAVTPYNGPETVMVGDGAHLPITHVGSTTLTTN 267


>XP_013738037.1 PREDICTED: uncharacterized protein LOC106440823 [Brassica napus]
          Length = 410

 Score =  145 bits (367), Expect = 4e-37
 Identities = 98/297 (32%), Positives = 145/297 (48%), Gaps = 9/297 (3%)
 Frame = -3

Query: 888 SDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEV 721
           ++Y REF+ +CDQL +IG  + +  K++  L+ LG E++        S+ R P P +++V
Sbjct: 145 TEYCREFRTICDQLSSIGHPVEESMKIFNFLNGLGREYDPVCAVVQHSLSRTPAPTFNDV 204

Query: 720 VSLLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGY 547
           VS +  +D R  S +  S  SP++AF +QK                S     S +  + +
Sbjct: 205 VSEVAGYDSRLTSYDDSSAVSPHMAFQTQK----------------SEADPPSNYTTTSH 248

Query: 546 QSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQ---FQARP 376
             + +   SN                   R    ++S+GRGF+    S+G       A  
Sbjct: 249 NHRGRGSYSNRFGSN--------------RGRGGYSSRGRGFHQQSVSTGQNNHTTSATQ 294

Query: 375 PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
              CQICG++GH ALRCW+RF+ NYQ   +P+            ALAA+ V +       
Sbjct: 295 RPICQICGRMGHTALRCWNRFDTNYQNDNLPQ------------ALAALQVSETSG---Q 339

Query: 195 EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDKSITL 25
           EW+ DSGAT HV  T   L+++TPY G + IM  DG+ +PITHVGSA L     I L
Sbjct: 340 EWYPDSGATAHVTSTTAGLNSLTPYNGSETIMAADGNCLPITHVGSANLSVSSGIHL 396


>XP_018464433.1 PREDICTED: uncharacterized protein LOC108835711 [Raphanus sativus]
          Length = 398

 Score =  143 bits (361), Expect = 2e-36
 Identities = 97/294 (32%), Positives = 139/294 (47%), Gaps = 14/294 (4%)
 Frame = -3

Query: 888 SDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEV 721
           S Y REF+ VCD+L +IGK + +  K++  L+ LG E++        SM R P+P + +V
Sbjct: 148 STYCREFRAVCDKLSSIGKPVDESLKIFTFLNGLGREYDPIITVIQSSMARTPLPTFSDV 207

Query: 720 VSLLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGY 547
           +S +   D R  S E+ +  SPN+AF +Q+T                           G+
Sbjct: 208 ISEVCGFDTRLQSYETSTDVSPNMAFQTQRT---------------------------GF 240

Query: 546 QSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQA----- 382
            + +Q    N                 G R    F+++GRGF   + +SG    +     
Sbjct: 241 HNNNQRGRGNSYSW------------FGTRGRGGFSTRGRGFTQQVNNSGWNQSSGHNNN 288

Query: 381 ---RPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQ 211
              RP   CQICG+ GH AL+C+ RFN +YQ  + P+ L+         AL A H   Q 
Sbjct: 289 NNNRP--ICQICGRTGHTALKCYDRFNGSYQSDDAPQALS---------ALTASHPSGQ- 336

Query: 210 TFNLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFL 49
                EW  DSGA+ H+   P  L NV  Y G +N+M+ DGS +PITHVGS  L
Sbjct: 337 -----EWIPDSGASAHMTPNPTPLTNVVAYNGPENVMVADGSFVPITHVGSTTL 385


>XP_009127080.1 PREDICTED: uncharacterized protein LOC103851952 [Brassica rapa]
           XP_009116067.1 PREDICTED: uncharacterized protein
           LOC103841294 [Brassica rapa]
          Length = 406

 Score =  143 bits (360), Expect = 4e-36
 Identities = 101/298 (33%), Positives = 145/298 (48%), Gaps = 18/298 (6%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENF------TVSMMRPPIPP 733
           S++DYL + K +CDQL +IG  + +  K++  L  LG ++E        +V MM  P   
Sbjct: 142 SMTDYLTDLKLICDQLTSIGSPVPEKMKIFAALQGLGKDYEPLITSVEGSVDMMSNPT-- 199

Query: 732 YDEVVSLLRDHDMR--RASSESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFP 559
            ++++  L  +D R  R ++ + +SP+LAF  +++                         
Sbjct: 200 LEDLIPRLHSYDSRIQRYNAPTEASPHLAFNVERSNY----------------------- 236

Query: 558 RSGYQSQHQYPSSNYXXXXXXXXXXQDPGVLGA-RPEPFFTSQGRGFNPSIRSSGLQFQA 382
           ++GY +      SN                 GA R    F+++GRGF+  + SSG Q  +
Sbjct: 237 QTGYYNSRGRGQSNRR--------------FGANRGRGSFSTRGRGFHQQLSSSGSQSGS 282

Query: 381 ---------RPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAM 229
                    RP   CQICG+ GH ALRCWHRFNN YQ         ED+ T    ALAAM
Sbjct: 283 SVSSVSSDERPS--CQICGRYGHSALRCWHRFNNTYQ--------EEDKPT----ALAAM 328

Query: 228 HVGDQQTFNLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSA 55
            + D       EW  DSGAT+H+  +P  L    PY G D ++IGDG+ +PITHVGSA
Sbjct: 329 RITDVSDHGGAEWFADSGATSHITNSPNHLAYTQPYRGNDAVLIGDGNFLPITHVGSA 386


>XP_010430659.1 PREDICTED: uncharacterized protein LOC104714886 [Camelina sativa]
          Length = 384

 Score =  142 bits (357), Expect = 6e-36
 Identities = 98/305 (32%), Positives = 154/305 (50%), Gaps = 7/305 (2%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYD 727
           S+S Y REFK +CD L +IGK I +  K++  L+ LG E++  T     S+ + P P ++
Sbjct: 111 SLSTYCREFKSICDSLSSIGKPIDESMKIFGFLNGLGREYDPITTVIQSSLSKLPPPTFN 170

Query: 726 EVVSLLRDHDMRRASSESLSS--PNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRS 553
           +V++ ++  D +    E  +S  P++AF+++KT                 P+Q+ +    
Sbjct: 171 DVIADVQGFDSKLQLYEDTNSVTPHMAFMTEKTNPCAPQYD---------PNQRGRGRNR 221

Query: 552 GYQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQARPP 373
           G                                   +TS+GRGF     +S LQ Q RP 
Sbjct: 222 GRGG--------------------------------YTSRGRGFPQHQTTSQLQGQ-RP- 247

Query: 372 LFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLNE 193
             CQICG++GH A++C++RF+NNYQ +E+            A+A + + V D++     E
Sbjct: 248 -VCQICGRVGHTAIKCYNRFDNNYQ-SEV-----------SAQAFSTLCVSDER-----E 289

Query: 192 WHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDK-SITLERV 16
           WH DSGAT HV  +   L +   Y G D +M+GDG+ +PITH+GSA +   K +I L  V
Sbjct: 290 WHPDSGATAHVTTSTSGLQDAKVYEGNDAVMVGDGAYLPITHIGSATISTPKGNIPLNEV 349

Query: 15  LIEPD 1
           L+ P+
Sbjct: 350 LVCPE 354


>XP_013647154.1 PREDICTED: uncharacterized protein LOC106352005 isoform X1
           [Brassica napus] XP_013647160.1 PREDICTED:
           uncharacterized protein LOC106352005 isoform X2
           [Brassica napus]
          Length = 399

 Score =  142 bits (357), Expect = 8e-36
 Identities = 92/292 (31%), Positives = 136/292 (46%), Gaps = 11/292 (3%)
 Frame = -3

Query: 882 YLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEVVS 715
           Y REF+ +CD+L +IGK + +  K++   + LG E++        SM R P P  ++V+S
Sbjct: 147 YCREFRAICDKLSSIGKPVEESMKIFSFTNGLGREYDPIITVIQSSMTRVPAPTLNDVIS 206

Query: 714 LLRDHDMRRASSESLSS--PNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGYQS 541
            +   D R  S E+ S   PN+AF +Q+                S         R G+ +
Sbjct: 207 EVSGFDTRLQSYEATSDVLPNMAFQTQRGFYNNYNRGRGNNYSRS---------RGGFSN 257

Query: 540 QHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSG-----LQFQARP 376
           +  Y S                          F+++GRGF   + +SG          RP
Sbjct: 258 RGGYSSRGG-----------------------FSTRGRGFTQQVSNSGGNNNNNNNNTRP 294

Query: 375 PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
              CQICG+ GH ALRCW+RF+N+YQ  ++P  L+   V+D +                 
Sbjct: 295 --VCQICGRTGHSALRCWNRFDNSYQSDDLPHALSAFPVSDLSG---------------R 337

Query: 195 EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVND 40
           EW  DSGAT H+  +   + N TPY G ++IM+ DG+ +PITHVGS  L  D
Sbjct: 338 EWIADSGATAHMTSSTSHMQNATPYNGPEHIMVADGNFLPITHVGSTTLTTD 389


>JAU70191.1 hypothetical protein LE_TR14648_c0_g1_i1_g.46319 [Noccaea
           caerulescens]
          Length = 395

 Score =  140 bits (353), Expect = 3e-35
 Identities = 96/290 (33%), Positives = 142/290 (48%), Gaps = 15/290 (5%)
 Frame = -3

Query: 882 YLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEVVS 715
           Y REF+ +CDQL AIGK + +  K++  L+ L  EF+  +     S+ R P P +++VVS
Sbjct: 147 YCREFRTICDQLSAIGKPVEESMKIFTFLNGLSREFDPISTVIQSSLSRFPPPTFNDVVS 206

Query: 714 LLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGYQS 541
            +     +  S E+    +P  AF  QK+               S+P Q+ +    G+ S
Sbjct: 207 EISGFHTQLQSYETPEEVTPFTAFQVQKSNY-------------SHPGQRGR----GHSS 249

Query: 540 QHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGF---------NPSIRSSGLQF 388
                                    G+R    F+++GRGF         N S+ S G Q 
Sbjct: 250 SR----------------------FGSRGRGGFSTRGRGFSQQVNPAGWNQSLSSDGNQ- 286

Query: 387 QARPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQT 208
             RP   CQICG++GH AL+CW+ F++ YQ  ++PK            ALAA+H+ D   
Sbjct: 287 NNRP--MCQICGRMGHTALKCWNMFDHAYQSDDVPK------------ALAALHISDDSG 332

Query: 207 FNLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGS 58
               EW+ DSGAT H+  +   L N TPY G D +++G+G+ +PITHVGS
Sbjct: 333 M---EWYPDSGATAHITASASSLQNPTPYHGSDMVLVGNGNQLPITHVGS 379


>XP_009117857.1 PREDICTED: uncharacterized protein LOC103842926 [Brassica rapa]
          Length = 390

 Score =  137 bits (346), Expect = 3e-34
 Identities = 93/294 (31%), Positives = 138/294 (46%), Gaps = 11/294 (3%)
 Frame = -3

Query: 891 VSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDE 724
           +++YL+E K VC QL +IG  + +  KV+  L  LG ++E        SM   P P +++
Sbjct: 145 MAEYLQEIKSVCSQLSSIGSPVPERMKVFAALHGLGRDYEPIKTTIESSMDADPTPTFED 204

Query: 723 VVSLLRDHDMRRAS--SESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSG 550
           V+  L   D R  S  ++   SP+LAF SQ+                     + Q  RS 
Sbjct: 205 VIPRLTSFDDRLQSYITQPDVSPHLAFYSQRG--------------------RGQNSRSR 244

Query: 549 YQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQA---- 382
            + Q                          R    +++QGRGF+  + S    F +    
Sbjct: 245 GRGQ-------------------------GRGRGSYSTQGRGFHQHVSSPSGSFTSSASE 279

Query: 381 -RPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTF 205
            RP   CQICGK+GH+ALRCWHRF+N+YQ+ ++P  LT  ++TD                
Sbjct: 280 NRP--LCQICGKLGHNALRCWHRFDNSYQLDDLPAALTALRITDVTG------------- 324

Query: 204 NLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVN 43
             +EW  DSGA++HV  +P  L     Y G D++M+G+G  +PITH GS  L +
Sbjct: 325 --HEWFPDSGASSHVTNSPHHLQQAQVYNGSDSVMVGNGEFLPITHTGSTSLAS 376


>AAK44121.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
           AAL34266.1 putative retroelement pol polyprotein
           [Arabidopsis thaliana] BAD44515.1 putative retroelement
           pol polyprotein [Arabidopsis thaliana] BAD44526.1
           putative retroelement pol polyprotein [Arabidopsis
           thaliana] BAF02210.1 putative retroelement pol
           polyprotein [Arabidopsis thaliana]
          Length = 405

 Score =  137 bits (344), Expect = 7e-34
 Identities = 94/294 (31%), Positives = 136/294 (46%), Gaps = 10/294 (3%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYD 727
           S+ +YL++ K +CDQL ++G  +++  K++  L+ LG E+E        SM   P P  +
Sbjct: 149 SMDEYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPIKTTIENSMDALPGPSLE 208

Query: 726 EVVSLLRDHDMRRAS--SESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRS 553
           +V+  L  +D R      E+  SP++AF                       +       S
Sbjct: 209 DVIPKLTGYDDRLQGYLEETAVSPHVAFNI---------------------TTSDDSNAS 247

Query: 552 GYQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRS----SGLQFQ 385
           GY + +                         R    F+++GRGF+  I S    SG Q  
Sbjct: 248 GYFNAYNRGKGK-----------------SNRGRNSFSTRGRGFHQQISSTNSSSGSQ-S 289

Query: 384 ARPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTF 205
               + CQICGK+GH AL+CWHRFNN+YQ  E+P            +ALAAM + D    
Sbjct: 290 GGTSVVCQICGKMGHPALKCWHRFNNSYQYEELP------------RALAAMRITDITDQ 337

Query: 204 NLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVN 43
           + NEW  DS AT HV  +P  L    PY G D +M+ DG+ +PITH GS  L +
Sbjct: 338 HGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLAS 391


>KFK30733.1 hypothetical protein AALP_AA6G020200 [Arabis alpina]
          Length = 3091

 Score =  140 bits (354), Expect = 9e-34
 Identities = 96/300 (32%), Positives = 140/300 (46%), Gaps = 13/300 (4%)
 Frame = -3

Query: 894  SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTVSMMRPP----IPPYD 727
            ++++YL E K++CDQL  IG  +++  KV+  L+ LG E+E    ++        +P ++
Sbjct: 649  TMAEYLSEIKKICDQLKFIGSPVTETMKVFTALNGLGQEYEPIKTTIEGAMDSFLVPVFE 708

Query: 726  EVVSLLRDHDMRRAS-SESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSG 550
            +VV  L   D R  S ++   SP+LAF + +                S  S      RSG
Sbjct: 709  DVVPKLTAFDHRLQSYTQGYISPHLAFFANQ----------------SDASTNRGRGRSG 752

Query: 549  YQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIR--------SSGL 394
            Y                             R    FT++GRGF+  I         SSG 
Sbjct: 753  YN----------------------------RGRGTFTTKGRGFHQQITQAPHSGFSSSGS 784

Query: 393  QFQARPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQ 214
                     CQICGK GH A+RCWHRFNN+YQ  ++PK            ALAAM + + 
Sbjct: 785  ASGGNNKTVCQICGKSGHPAIRCWHRFNNSYQDEDMPK------------ALAAMRITEV 832

Query: 213  QTFNLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDKS 34
               +  EW  D+GA+ H+  +P  L     YTG D++M+GDG+ +PITH GSA L +  +
Sbjct: 833  PDASGLEWFPDTGASAHITSSPLHLQQACSYTGSDSVMVGDGNFLPITHTGSASLASSSA 892


>CAN60203.1 hypothetical protein VITISV_036059 [Vitis vinifera]
          Length = 1412

 Score =  140 bits (353), Expect = 1e-33
 Identities = 100/307 (32%), Positives = 156/307 (50%), Gaps = 10/307 (3%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTVSMMRPPIPPYDEVVS 715
           ++ +++R FK +CD L AIGK + D +KV+ LL+SLG ++E FT +M++PP P Y E+VS
Sbjct: 141 TIGEHIRTFKSLCDSLAAIGKPVPDKEKVFCLLTSLGPQYETFTTTMLKPPRPSYSELVS 200

Query: 714 LLRDHDMRR-----ASSESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPR-- 556
            L+  D RR      ++ + ++P +AF                     Y  QQ ++P+  
Sbjct: 201 QLQSLDQRRNWFSNHANAAHATPQMAF---------------------YGQQQKRYPQFS 239

Query: 555 SGYQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNP--SIRSSGLQFQA 382
           +GYQ   Q  +S            Q+ G L +   P  ++Q R   P    R + ++   
Sbjct: 240 TGYQGNKQKFTSTGRGFQAQQSKDQNRGYLSS---PTSSTQQRRPPPPGERRMTPVERDL 296

Query: 381 RPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFN 202
                CQ CG +GH A  CW           +PK+ T  Q  D  +ALAA+ +    T  
Sbjct: 297 YREEKCQYCGMVGHIAKICWW----------VPKRPT--QQDDIPQALAALTL--DNTIA 342

Query: 201 LNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVG-SAFLVNDKSITL 25
             EW  D+GA+NH+    G+L N+  Y+G D+++IGDGSS+PI  +G S+    +K + L
Sbjct: 343 ETEWTSDTGASNHMTGKQGMLTNIRNYSGSDSVLIGDGSSLPILGIGDSSIKQRNKVLPL 402

Query: 24  ERVLIEP 4
             VL+ P
Sbjct: 403 HDVLLVP 409


>AAC67200.1 putative retroelement pol polyprotein [Arabidopsis thaliana]
          Length = 1402

 Score =  139 bits (350), Expect = 3e-33
 Identities = 98/308 (31%), Positives = 143/308 (46%), Gaps = 11/308 (3%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYD 727
           S+ +YL++ K +CDQL ++G  +++  K++  L+ LG E+E        SM   P P  +
Sbjct: 139 SMDEYLKDLKTICDQLASVGSPVTEKMKIFAALNGLGREYEPIKTTIENSMDALPGPSLE 198

Query: 726 EVVSLLRDHDMRRAS--SESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRS 553
           +V+  L  +D R      E+  SP++AF                       +       S
Sbjct: 199 DVIPKLTGYDDRLQGYLEETAVSPHVAFNI---------------------TTSDDSNAS 237

Query: 552 GYQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRS----SGLQFQ 385
           GY + +                         R    F+++GRGF+  I S    SG Q  
Sbjct: 238 GYFNAYNRGKGK-----------------SNRGRNSFSTRGRGFHQQISSTNSSSGSQ-S 279

Query: 384 ARPPLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTF 205
               + CQICGK+GH AL+CWHRFNN+YQ  E+P            +ALAAM + D    
Sbjct: 280 GGTSVVCQICGKMGHPALKCWHRFNNSYQYEELP------------RALAAMRITDITDQ 327

Query: 204 NLNEWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDK-SIT 28
           + NEW  DS AT HV  +P  L    PY G D +M+ DG+ +PITH GS  L +   ++ 
Sbjct: 328 HGNEWLPDSAATAHVTNSPRSLQQSQPYHGSDAVMVADGNFLPITHTGSTNLASSSGNVP 387

Query: 27  LERVLIEP 4
           L  VL+ P
Sbjct: 388 LTDVLVCP 395


>XP_010462983.1 PREDICTED: uncharacterized protein LOC104743624 [Camelina sativa]
          Length = 473

 Score =  136 bits (342), Expect = 4e-33
 Identities = 96/305 (31%), Positives = 141/305 (46%), Gaps = 12/305 (3%)
 Frame = -3

Query: 882 YLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEVVS 715
           Y REF  VCD L +IGK + ++ K++  L+ L  E++        SM R P P +++VV 
Sbjct: 143 YCREFGAVCDSLSSIGKPVDENMKIFTFLNGLSREYDPIATVLQSSMSRTPSPTFNDVVL 202

Query: 714 LLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGYQS 541
            +   D +  S E+    SP++AF +Q+                  P Q+    R G+  
Sbjct: 203 EISGFDSKLQSYETPPEVSPHIAFQTQRGGCHG-------------PGQRG---RGGHHV 246

Query: 540 QHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQ-----ARP 376
           +  Y                             +++GRGF+  + SSG         A P
Sbjct: 247 RGGY-----------------------------STRGRGFSQQVNSSGWNQNQSGNSANP 277

Query: 375 PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
              CQICG+ GH AL+CW+RF+ +YQ  ++P+ L   QV+D +                 
Sbjct: 278 RPVCQICGRTGHVALKCWNRFDASYQSDDVPQALATLQVSDSSG---------------R 322

Query: 195 EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDK-SITLER 19
           EW  DSGAT H+  T   L +VTPY G +N+++ DG+  PITHVGS  L     SI L  
Sbjct: 323 EWLTDSGATAHITPTTDSLQSVTPYNGAENVIVADGTHQPITHVGSTTLATTSGSIPLCD 382

Query: 18  VLIEP 4
           VL+ P
Sbjct: 383 VLVCP 387


>OAP02304.1 hypothetical protein AXX17_AT3G39340 [Arabidopsis thaliana]
          Length = 2099

 Score =  139 bits (349), Expect = 4e-33
 Identities = 101/306 (33%), Positives = 144/306 (47%), Gaps = 9/306 (2%)
 Frame = -3

Query: 891  VSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTVSMMRP----PIPPYDE 724
            + +YLR+ K +C+QL +IG  + +  K++ +L  LG E+E   V++       P P  +E
Sbjct: 260  MDEYLRDIKSICEQLASIGSPVPEKMKIFAVLKGLGREYEPIKVNIEGMIDMYPGPTLEE 319

Query: 723  VVSLLRDHDMRRASSE--SLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSG 550
            V S L+    R AS       SP+LAF +                  S   + +Q+ + G
Sbjct: 320  VSSRLKSFSDRLASYNVGMEVSPHLAFYAN----------------YSGKGKGNQYGKPG 363

Query: 549  YQSQHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSS--GLQFQARP 376
                +Q  S NY                        +++GRGF   I SS  G       
Sbjct: 364  ---GNQGKSGNY------------------------STKGRGFPQQISSSTSGSYNNTEN 396

Query: 375  PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
             + CQICGK GH AL+CWHRFNN+YQ  E+P             AL AM + D    N N
Sbjct: 397  RVVCQICGKPGHPALKCWHRFNNSYQYEELP------------AALTAMRITDVTDHNGN 444

Query: 195  EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFLVNDKSI-TLER 19
            +W  DSGAT HV  +   L    PY G D++M+G+G  +PITH GS  L +   I +L+ 
Sbjct: 445  KWVGDSGATAHVTNSTHNLQQSQPYGGSDSVMVGNGDFLPITHTGSTTLPSSSGILSLKD 504

Query: 18   VLIEPD 1
            VL+ P+
Sbjct: 505  VLVCPN 510


>XP_019089249.1 PREDICTED: uncharacterized protein LOC109128031 [Camelina sativa]
          Length = 374

 Score =  134 bits (336), Expect = 6e-33
 Identities = 95/285 (33%), Positives = 131/285 (45%), Gaps = 7/285 (2%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTVS----MMRPPIPPYD 727
           S+ +YL+E K+VC+QL +IG  +S+  K++  L  LG E+E    S    M   P    D
Sbjct: 143 SMDEYLKEIKRVCEQLASIGSPVSEQMKIFAALKGLGREYEPMKTSVEGSMDAQPSLTLD 202

Query: 726 EVVSLLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRS 553
            V+S L  +  R AS  S S  SP++AF                          +    S
Sbjct: 203 AVISRLTSYSDRLASYHSGSEVSPHMAF-------------------------NTITASS 237

Query: 552 GYQSQHQ-YPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQARP 376
           GY S ++   +S Y                  R    FT++GRGF+  I           
Sbjct: 238 GYYSSNRGRGNSRYGNN---------------RGRNNFTTKGRGFHQQISQEP---GGTN 279

Query: 375 PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
            + CQICGK GH A +CWHRF+N+YQ   +P+             LAA+ V D    N N
Sbjct: 280 KVICQICGKPGHPASKCWHRFDNSYQFDNVPQ------------VLAALRVTDVTDHNGN 327

Query: 195 EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVG 61
           EW +D GAT HV  +P  L     Y G +++M+GDGS +PITH G
Sbjct: 328 EWVLDFGATTHVTNSPHHLQQAQVYEGNESVMMGDGSFLPITHTG 372


>XP_010451841.1 PREDICTED: uncharacterized protein LOC104734030 [Camelina sativa]
          Length = 374

 Score =  134 bits (336), Expect = 6e-33
 Identities = 95/285 (33%), Positives = 131/285 (45%), Gaps = 7/285 (2%)
 Frame = -3

Query: 894 SVSDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTVS----MMRPPIPPYD 727
           S+ +YL+E K+VC+QL +IG  +S+  K++  L  LG E+E    S    M   P    D
Sbjct: 143 SMDEYLKEIKRVCEQLASIGSPVSEQMKIFAALKGLGREYEPMKTSVEGSMDAQPSLTLD 202

Query: 726 EVVSLLRDHDMRRASSESLS--SPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRS 553
            V+S L  +  R AS  S S  SP++AF                          +    S
Sbjct: 203 AVISRLTSYSDRLASYHSGSEVSPHMAF-------------------------NTITASS 237

Query: 552 GYQSQHQ-YPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQARP 376
           GY S ++   +S Y                  R    FT++GRGF+  I           
Sbjct: 238 GYYSSNRGRGNSRYGNN---------------RGRNNFTTKGRGFHQQISQEP---GGTN 279

Query: 375 PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
            + CQICGK GH A +CWHRF+N+YQ   +P+             LAA+ V D    N N
Sbjct: 280 KVICQICGKPGHPASKCWHRFDNSYQFDNVPQ------------VLAALRVTDVTDHNGN 327

Query: 195 EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVG 61
           EW +D GAT HV  +P  L     Y G +++M+GDGS +PITH G
Sbjct: 328 EWVLDFGATTHVTNSPHHLQQAQVYEGNESVMMGDGSFLPITHTG 372


>XP_010463299.1 PREDICTED: uncharacterized protein LOC104743969 [Camelina sativa]
          Length = 464

 Score =  135 bits (339), Expect = 9e-33
 Identities = 101/305 (33%), Positives = 138/305 (45%), Gaps = 10/305 (3%)
 Frame = -3

Query: 888 SDYLREFKQVCDQLHAIGKSISDDDKVYYLLSSLGTEFENFTV----SMMRPPIPPYDEV 721
           S Y+RE+  VCDQL +IGK + +  K+   L  LG E++   V    SM R P P + EV
Sbjct: 145 SAYVREYTAVCDQLSSIGKPVEESIKICGFLLGLGREYDPIAVIIQNSMSRIPTPTFTEV 204

Query: 720 VSLLRDHDMRRASSESLSSPNLAFLSQKTXXXXXXXXXXXXXXNSYPSQQSQFPRSGYQS 541
           +  +   D R  S +                                + Q  F     Q+
Sbjct: 205 LYEIEGFDTRLQSYDD----------------------------DAVATQMVFHTPAAQT 236

Query: 540 QHQYPSSNYXXXXXXXXXXQDPGVLGARPEPFFTSQGRGFNPSIRSSGLQFQA-----RP 376
           Q  + +S               G    R    ++S GRGF     SSG   Q+     RP
Sbjct: 237 QEVFQASQTNNYRGRGGYNNKSG--SNRGRGGYSSCGRGFYQQAVSSGSNQQSGATNTRP 294

Query: 375 PLFCQICGKIGHDALRCWHRFNNNYQVTEIPKKLTEDQVTDQAKALAAMHVGDQQTFNLN 196
              CQIC K+GH A +CW+RF+NNYQ   +            A+ LAA+ V D       
Sbjct: 295 T--CQICCKVGHMAAKCWNRFDNNYQGENL------------AQVLAALQVSDSSG---R 337

Query: 195 EWHVDSGATNHVAQTPGILHNVTPYTGIDNIMIGDGSSIPITHVGSAFL-VNDKSITLER 19
           +W  DSGAT+HV  T   L + TPY G D+IM+ DG+ +P+THVGS  L V+  S+TL  
Sbjct: 338 DWIPDSGATSHVTTTEAALQHATPYQGTDSIMVADGNYLPLTHVGSTSLPVSTGSLTLND 397

Query: 18  VLIEP 4
           VL+ P
Sbjct: 398 VLVCP 402


Top