BLASTX nr result

ID: Alisma22_contig00038559 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Alisma22_contig00038559
         (358 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AAK51235.1 polyprotein [Arabidopsis thaliana]                          96   8e-21
XP_017179487.1 PREDICTED: uncharacterized protein LOC108169827 [...    94   7e-20
OMO78631.1 Integrase, catalytic core [Corchorus capsularis]            92   2e-19
OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]               92   2e-19
AAD43604.1 T3P18.3 [Arabidopsis thaliana]                              91   6e-19
CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]               91   6e-19
AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease ho...    90   2e-18
XP_013694541.1 PREDICTED: uncharacterized protein LOC106398551 [...    89   2e-18
XP_015386587.1 PREDICTED: uncharacterized protein LOC107177375 [...    89   4e-18
OMO62605.1 Integrase, catalytic core [Corchorus capsularis]            88   7e-18
AAF69172.1 F27F5.11 [Arabidopsis thaliana]                             87   1e-17
XP_020114563.1 uncharacterized protein LOC109728471 isoform X15 ...    87   1e-17
KZV57610.1 hypothetical protein F511_03070 [Dorcoceras hygrometr...    87   1e-17
XP_020114543.1 uncharacterized protein LOC109728471 isoform X12 ...    87   1e-17
XP_008671960.1 PREDICTED: uncharacterized protein LOC103649473 [...    87   1e-17
XP_020114487.1 uncharacterized protein LOC109728471 isoform X3 [...    87   1e-17
XP_020114480.1 uncharacterized protein LOC109728471 isoform X2 [...    87   1e-17
XP_020114471.1 uncharacterized protein LOC109728471 isoform X1 [...    87   1e-17
CAN74381.1 hypothetical protein VITISV_007944 [Vitis vinifera]         87   1e-17
JAU84243.1 Retrovirus-related Pol polyprotein from transposon TN...    84   2e-17

>AAK51235.1 polyprotein [Arabidopsis thaliana]
          Length = 1453

 Score = 96.3 bits (238), Expect = 8e-21
 Identities = 50/119 (42%), Positives = 65/119 (54%), Gaps = 2/119 (1%)
 Frame = +3

Query: 6    LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
            LR FG+ C+PCL+    HK  PRSL CVFL Y+  YKGYRCL+  T  V+ISR+VIF+E 
Sbjct: 682  LRVFGTACYPCLRPLGEHKFEPRSLQCVFLGYNSQYKGYRCLYPPTGRVYISRHVIFDEE 741

Query: 186  VFPFKDSIPSHVAMGELACISDWPEPVHNHDQT--PPLSSSVGEGCLPPSGSLSHTMED 356
             FPFK      V   E + +S W   +   DQ+  P       E    P     +T++D
Sbjct: 742  TFPFKQKYQFLVPQYESSLLSAWQSSIPQADQSLIPQAEEGKIESLAKPPSIQKNTIQD 800


>XP_017179487.1 PREDICTED: uncharacterized protein LOC108169827 [Malus domestica]
          Length = 738

 Score = 93.6 bits (231), Expect = 7e-20
 Identities = 56/115 (48%), Positives = 67/115 (58%), Gaps = 1/115 (0%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LRTFG  CFP L D   +KL PRSL CVFL YSD YKGYRCLH ST  V++SR+V FNE 
Sbjct: 498 LRTFGCACFPYLGDYATYKLQPRSLSCVFLGYSDQYKGYRCLHPSTGRVYLSRHVKFNEH 557

Query: 186 VFPFKDSIPSHVA-MGELACISDWPEPVHNHDQTPPLSSSVGEGCLPPSGSLSHT 347
            FPF  S+ S  A   +L  +   P P     Q PP+        +P +  L+HT
Sbjct: 558 DFPFHSSMASPSAPESDLVFVPIHPVPTPLLHQ-PPVP-------MPQNQDLTHT 604


>OMO78631.1 Integrase, catalytic core [Corchorus capsularis]
          Length = 577

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 42/67 (62%), Positives = 51/67 (76%)
 Frame = +3

Query: 3   CLRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNE 182
           CLR FGS CFP L+    +KL PRSLPC+FL YS+ +KGYRCLH  +  V+ISR+V F+E
Sbjct: 337 CLRVFGSKCFPFLRSHSKNKLEPRSLPCIFLGYSELHKGYRCLHPPSGRVYISRHVTFDE 396

Query: 183 TVFPFKD 203
            VFPFKD
Sbjct: 397 KVFPFKD 403


>OMO61427.1 Zinc finger, CCCH-type [Corchorus capsularis]
          Length = 1996

 Score = 92.4 bits (228), Expect = 2e-19
 Identities = 42/67 (62%), Positives = 51/67 (76%)
 Frame = +3

Query: 3   CLRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNE 182
           CLR FGS CFP L+    +KL PRSLPC+FL YS+ +KGYRCLH  +  V+ISR+V F+E
Sbjct: 655 CLRVFGSKCFPFLRSHSKNKLEPRSLPCIFLGYSELHKGYRCLHPPSGRVYISRHVTFDE 714

Query: 183 TVFPFKD 203
            VFPFKD
Sbjct: 715 KVFPFKD 721


>AAD43604.1 T3P18.3 [Arabidopsis thaliana]
          Length = 1309

 Score = 90.9 bits (224), Expect = 6e-19
 Identities = 45/95 (47%), Positives = 58/95 (61%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FG+ C+PCL+    +K  PRSL CVFL Y + YKGYRCL+  T  V+ISR+VIF+E 
Sbjct: 524 LRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEA 583

Query: 186 VFPFKDSIPSHVAMGELACISDWPEPVHNHDQTPP 290
            FPFK+   S V   +   +  W     + D TPP
Sbjct: 584 QFPFKEKYHSLVPKYQTTLLQAW----QHTDLTPP 614


>CAC37623.1 copia-like polyprotein [Arabidopsis thaliana]
          Length = 1466

 Score = 90.9 bits (224), Expect = 6e-19
 Identities = 45/95 (47%), Positives = 58/95 (61%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FG+ C+PCL+    +K  PRSL CVFL Y + YKGYRCL+  T  V+ISR+VIF+E 
Sbjct: 681 LRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEA 740

Query: 186 VFPFKDSIPSHVAMGELACISDWPEPVHNHDQTPP 290
            FPFK+   S V   +   +  W     + D TPP
Sbjct: 741 QFPFKEKYHSLVPKYQTTLLQAW----QHTDLTPP 771


>AAD21687.1 Strong similarity to gi|3600044 T12H20.12 protease homolog from
           Arabidopsis thaliana BAC gb|AF080119 and is a member of
           the reverse transcriptase family PF|00078 [Arabidopsis
           thaliana]
          Length = 1415

 Score = 89.7 bits (221), Expect = 2e-18
 Identities = 42/83 (50%), Positives = 51/83 (61%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FGS C+PCL+    +K  PRSL CVFL Y+  YKGYRC +  T  V+ISRNVIFNE+
Sbjct: 679 LRVFGSACYPCLRPLAQNKFDPRSLQCVFLGYNSQYKGYRCFYPPTGKVYISRNVIFNES 738

Query: 186 VFPFKDSIPSHVAMGELACISDW 254
             PFK+   S V       +  W
Sbjct: 739 ELPFKEKYQSLVPQYSTPLLQAW 761


>XP_013694541.1 PREDICTED: uncharacterized protein LOC106398551 [Brassica napus]
          Length = 663

 Score = 89.4 bits (220), Expect = 2e-18
 Identities = 42/85 (49%), Positives = 55/85 (64%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FGS C+P L+    HK  PRSL CVFL YS  YKGYRCL+  T  V+I+++VIF+ET
Sbjct: 386 LRVFGSACYPYLRPVAEHKFEPRSLLCVFLGYSSQYKGYRCLYPPTGKVYITQHVIFDET 445

Query: 186 VFPFKDSIPSHVAMGELACISDWPE 260
           +FPFK+   S V     + +  W +
Sbjct: 446 LFPFKEQYKSLVPQYATSLLRAWQQ 470


>XP_015386587.1 PREDICTED: uncharacterized protein LOC107177375 [Citrus sinensis]
          Length = 1013

 Score = 88.6 bits (218), Expect = 4e-18
 Identities = 42/86 (48%), Positives = 55/86 (63%), Gaps = 3/86 (3%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FGS CFPCL+    +K  P+SLPCVFL YS+ YKGYRC H  T  V++SR+V+F+E 
Sbjct: 464 LRVFGSRCFPCLRRITQNKFDPKSLPCVFLGYSEFYKGYRCFHPPTGKVYLSRDVVFDEK 523

Query: 186 VFPFKDSIPSHVAMGE---LACISDW 254
            FPF      + + GE   L   ++W
Sbjct: 524 TFPFMKPGILYSSCGEHTNLTSFNEW 549


>OMO62605.1 Integrase, catalytic core [Corchorus capsularis]
          Length = 734

 Score = 87.8 bits (216), Expect = 7e-18
 Identities = 41/65 (63%), Positives = 49/65 (75%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FGSLCFP L+D    KL P+SLPCVFL YS  YKGYRC   +T  V+ISR+V+F+E 
Sbjct: 593 LRVFGSLCFPYLRDPSKTKLDPKSLPCVFLGYSHQYKGYRCFCPTTNKVYISRHVVFDED 652

Query: 186 VFPFK 200
           VFPF+
Sbjct: 653 VFPFQ 657


>AAF69172.1 F27F5.11 [Arabidopsis thaliana]
          Length = 1313

 Score = 87.4 bits (215), Expect = 1e-17
 Identities = 41/67 (61%), Positives = 49/67 (73%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FG  C+P L+D  ++K  PRSL CVFL Y+D YKGYRC   ST  V+ISR+VIF+ET
Sbjct: 627 LRVFGCACYPTLRDYASNKFDPRSLKCVFLGYNDKYKGYRCFLPSTGRVYISRHVIFDET 686

Query: 186 VFPFKDS 206
           VFPF  S
Sbjct: 687 VFPFAQS 693


>XP_020114563.1 uncharacterized protein LOC109728471 isoform X15 [Ananas comosus]
          Length = 890

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 40/64 (62%), Positives = 49/64 (76%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           +R FG LC+P +QD   HKLSPRSLPC+FL  SD +KG+RCL+ ST  V ISR+V F ET
Sbjct: 188 IRVFGCLCYPDVQDIADHKLSPRSLPCIFLGLSDKHKGFRCLYPSTGKVFISRHVTFVET 247

Query: 186 VFPF 197
           VFP+
Sbjct: 248 VFPY 251


>KZV57610.1 hypothetical protein F511_03070 [Dorcoceras hygrometricum]
          Length = 1011

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 49/112 (43%), Positives = 65/112 (58%), Gaps = 5/112 (4%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FGS CFP   DT  HK  P+++PC+F+ YSD +KGY+C    +Q + ISR+V+F+E 
Sbjct: 637 LRVFGSRCFPYTWDTRKHKFDPKTIPCIFVGYSDRHKGYKCFFPPSQKIIISRHVVFDEK 696

Query: 186 VFPFKDSIPSHVAMGELACISDWPEPVHNHDQTPPLS-----SSVGEGCLPP 326
            FPFK+    H A  E  C+SD    V +   T P+S     S   E C PP
Sbjct: 697 QFPFKN---QHCA--EKICLSDHWMSVFDSWTTFPISDNDSLSQSVEACDPP 743


>XP_020114543.1 uncharacterized protein LOC109728471 isoform X12 [Ananas comosus]
          Length = 1013

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 40/64 (62%), Positives = 49/64 (76%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           +R FG LC+P +QD   HKLSPRSLPC+FL  SD +KG+RCL+ ST  V ISR+V F ET
Sbjct: 311 IRVFGCLCYPDVQDIADHKLSPRSLPCIFLGLSDKHKGFRCLYPSTGKVFISRHVTFVET 370

Query: 186 VFPF 197
           VFP+
Sbjct: 371 VFPY 374


>XP_008671960.1 PREDICTED: uncharacterized protein LOC103649473 [Zea mays]
          Length = 1477

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 49/113 (43%), Positives = 66/113 (58%), Gaps = 6/113 (5%)
 Frame = +3

Query: 6    LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
            LR FG  C+P +  T AHKL+PRS  CVFL YS  +KGYRCL  ST  + ISR+V+F+E+
Sbjct: 863  LRVFGCKCYPNISATAAHKLAPRSTMCVFLGYSSEHKGYRCLDISTNRIIISRHVVFDES 922

Query: 186  VFPFKDSIPSHVAMGELACISDWPEPVHNHDQTPPL----SSSVGE--GCLPP 326
             FPF ++ PS      L  +  + E V      PP+    SS+ G+    +PP
Sbjct: 923  SFPFAETPPSSALPTNLDFLDLFSEQV----PVPPIGALPSSTAGQRTSAIPP 971


>XP_020114487.1 uncharacterized protein LOC109728471 isoform X3 [Ananas comosus]
          Length = 1483

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 40/64 (62%), Positives = 49/64 (76%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           +R FG LC+P +QD   HKLSPRSLPC+FL  SD +KG+RCL+ ST  V ISR+V F ET
Sbjct: 781 IRVFGCLCYPDVQDIADHKLSPRSLPCIFLGLSDKHKGFRCLYPSTGKVFISRHVTFVET 840

Query: 186 VFPF 197
           VFP+
Sbjct: 841 VFPY 844


>XP_020114480.1 uncharacterized protein LOC109728471 isoform X2 [Ananas comosus]
          Length = 1526

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 40/64 (62%), Positives = 49/64 (76%)
 Frame = +3

Query: 6    LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
            +R FG LC+P +QD   HKLSPRSLPC+FL  SD +KG+RCL+ ST  V ISR+V F ET
Sbjct: 824  IRVFGCLCYPDVQDIADHKLSPRSLPCIFLGLSDKHKGFRCLYPSTGKVFISRHVTFVET 883

Query: 186  VFPF 197
            VFP+
Sbjct: 884  VFPY 887


>XP_020114471.1 uncharacterized protein LOC109728471 isoform X1 [Ananas comosus]
          Length = 1551

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 40/64 (62%), Positives = 49/64 (76%)
 Frame = +3

Query: 6    LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
            +R FG LC+P +QD   HKLSPRSLPC+FL  SD +KG+RCL+ ST  V ISR+V F ET
Sbjct: 849  IRVFGCLCYPDVQDIADHKLSPRSLPCIFLGLSDKHKGFRCLYPSTGKVFISRHVTFVET 908

Query: 186  VFPF 197
            VFP+
Sbjct: 909  VFPY 912


>CAN74381.1 hypothetical protein VITISV_007944 [Vitis vinifera]
          Length = 1884

 Score = 87.0 bits (214), Expect = 1e-17
 Identities = 39/68 (57%), Positives = 51/68 (75%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LR FG  CFP L+D   +K SP++ PCVF+ YS  +KGYRCLH ST+ V+ISR+VIFNE 
Sbjct: 629 LRIFGCQCFPYLRDYGKNKFSPKTYPCVFIGYSSLHKGYRCLHPSTKRVYISRHVIFNEN 688

Query: 186 VFPFKDSI 209
            FP+ +S+
Sbjct: 689 CFPYDNSL 696


>JAU84243.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Noccaea caerulescens]
          Length = 244

 Score = 84.3 bits (207), Expect = 2e-17
 Identities = 47/120 (39%), Positives = 63/120 (52%), Gaps = 19/120 (15%)
 Frame = +3

Query: 6   LRTFGSLCFPCLQDTVAHKLSPRSLPCVFLVYSDTYKGYRCLHYSTQAVHISRNVIFNET 185
           LRTFG  C+P L+   A K  PRSL CVFL Y+  YKGYRCL+ +T  V++SR+V+F+E 
Sbjct: 110 LRTFGCACYPTLRAYAATKFDPRSLKCVFLGYTAKYKGYRCLYPATGRVYLSRHVLFDEE 169

Query: 186 VFPFKDSIPSHVAMGELACISDW-------------------PEPVHNHDQTPPLSSSVG 308
           VFPF +   S          S W                   P P+ + +  PPLS++ G
Sbjct: 170 VFPFHNMHTSQHQSQGTNLYSSWLKSFTTDPITPTPPNTAPTPTPLFSAEDFPPLSATPG 229

