BLASTX nr result

ID: Dioscorea21_contig00000943

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00000943
         (1444 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002284662.1| PREDICTED: protein TIC 62, chloroplastic iso...   427   e-117
emb|CAN75677.1| hypothetical protein VITISV_033052 [Vitis vinifera]   416   e-113
ref|XP_003541088.1| PREDICTED: uncharacterized protein LOC100779...   412   e-113
ref|XP_002313068.1| predicted protein [Populus trichocarpa] gi|2...   404   e-111
ref|XP_003539023.1| PREDICTED: uncharacterized protein LOC100802...   407   e-111

>ref|XP_002284662.1| PREDICTED: protein TIC 62, chloroplastic isoform 1 [Vitis vinifera]
            gi|297741413|emb|CBI32544.3| unnamed protein product
            [Vitis vinifera]
          Length = 529

 Score =  427 bits (1099), Expect = e-117
 Identities = 247/431 (57%), Positives = 286/431 (66%), Gaps = 45/431 (10%)
 Frame = -1

Query: 1339 IEKSVFFSGCSLGIPANKRCTDARKSSASLDFRVRAVGSVGSSQKTANVNEPDKVTTDKD 1160
            IEK  F  G  L +P+++RC D+RK +  L+FR +A G+   S  T    +      D+D
Sbjct: 24   IEKP-FLCGQVLRLPSSRRCPDSRKLTV-LEFRAQATGTTKFSFSTIGAIQDKADLKDED 81

Query: 1159 LVYVAGATGKVGSRTVRELLKLGFRVRAGVRSAQRAQTLVESVRQMKLD---DSSGSKPV 989
            L +VAGATG+VGSRTVRELLKLGFRVRAGVR+AQ+A+ L++SV+QMKLD    S G++PV
Sbjct: 82   LAFVAGATGRVGSRTVRELLKLGFRVRAGVRTAQKAEALIQSVKQMKLDVESASEGTQPV 141

Query: 988  EKLEIVECDLENQXXXXXXXXXXXXXICCIGASEKEVFDITGPYRIDYKATENLINAATV 809
            EKLEIVECDLE +             ICCIGASEKEVFDITGPYRIDY AT+NLI+AATV
Sbjct: 142  EKLEIVECDLEKRDQIGPALGNASVVICCIGASEKEVFDITGPYRIDYMATKNLIDAATV 201

Query: 808  AKVDHFILVTSLGTNKIGFPAAILNLFWGVLIWKRKAEEALIASGLPYTIVRPGGMERPT 629
            AKV+HFIL+TSLGTNK+GFPAAILNLFWGVLIWKRKAEEAL ASGLPYTIVRPGGMERPT
Sbjct: 202  AKVNHFILLTSLGTNKVGFPAAILNLFWGVLIWKRKAEEALFASGLPYTIVRPGGMERPT 261

Query: 628  DAYKETHNLTLSPEDTLFGGQVSNLQVAELMACMAKNTRLSYCKVVEAIAETTAPLTPME 449
            DAYKETHN+TLS EDTLFGGQVSNLQVAEL+A MAKN   SYCKVVE IAETTAPLTP  
Sbjct: 262  DAYKETHNITLSQEDTLFGGQVSNLQVAELIAFMAKNRGSSYCKVVEVIAETTAPLTPFG 321

Query: 448  ELLAKIPSKRXXXXXXXXXXXXXEDVV----------------VQSEASKPRPLSPYSMY 317
            ELLAKIPS+R                V                 Q +A+   PLSPY +Y
Sbjct: 322  ELLAKIPSQRVDVSPKESDAADGPAPVPVVSGPPPSTPIEKGPPQGKATAMSPLSPYIVY 381

Query: 316  DDLKXXXXXXXXXXXSLH--------------------------LQEKEQAKFTPQKQQP 215
            +DLK           S                            +  KE  +   +K +P
Sbjct: 382  EDLKPPTSPTPTPSTSSSTARAPDVDGIPAEPKSIPSVLEPLSTVLAKEAIQEEAKKTRP 441

Query: 214  LSPYTVYEDLK 182
            LSPY VY+DLK
Sbjct: 442  LSPYIVYDDLK 452


>emb|CAN75677.1| hypothetical protein VITISV_033052 [Vitis vinifera]
          Length = 535

 Score =  416 bits (1068), Expect = e-113
 Identities = 247/452 (54%), Positives = 286/452 (63%), Gaps = 66/452 (14%)
 Frame = -1

Query: 1339 IEKSVFFSGCSLGIPANKRCTDARKSSASLDFRVRAVGSVGSSQKTANVNEPDKVTTDKD 1160
            IEK  F  G  L +P+++RC D+RK +  L+FR +A G+   S  T    +      D+D
Sbjct: 24   IEKP-FLCGQVLRLPSSRRCPDSRKLTV-LEFRAQATGTTKFSFSTIGAIQDKADLKDED 81

Query: 1159 LVYVAGATGKVGSRTVRELLKLGFRVRAGVRSAQRAQTLVESVRQMKLD---DSSGSKPV 989
            L +VAGATG+VGSRTVRELLKLGFRVRAGVR+AQ+A+ L++SV+QMKLD    S G++PV
Sbjct: 82   LAFVAGATGRVGSRTVRELLKLGFRVRAGVRTAQKAEALIQSVKQMKLDVESASEGTQPV 141

Query: 988  EKLEIVECDLENQXXXXXXXXXXXXXICCIGASEKEVFDITGPYRIDYKATENLINAATV 809
            EKLEIVECDLE +             ICCIGASEKEVFDITGPYRIDY AT+NLI+AATV
Sbjct: 142  EKLEIVECDLEKRDQIGPALGNASVVICCIGASEKEVFDITGPYRIDYMATKNLIDAATV 201

Query: 808  AKVDHFILVTSLGTNKIGFPAAILNLFWGVLIWKRKAEEALIASGLPYTIVRPGGMERPT 629
            AKV+HFIL+TSLGTNK+GFPAAILNLFWGVLIWKRKAEEAL ASGLPYTIVRPGGMERPT
Sbjct: 202  AKVNHFILLTSLGTNKVGFPAAILNLFWGVLIWKRKAEEALFASGLPYTIVRPGGMERPT 261

Query: 628  DAYKETHNLTLSPEDTLFGGQVSNL---------------------QVAELMACMAKNTR 512
            DAYKETHN+TLS EDTLFGGQVSNL                     QVAEL+A MAKN  
Sbjct: 262  DAYKETHNITLSQEDTLFGGQVSNLQMQTWLKKLNATPKVTFVNKFQVAELIAFMAKNRG 321

Query: 511  LSYCKVVEAIAETTAPLTPMEELLAKIPSKRXXXXXXXXXXXXXEDVV------------ 368
             SYCKVVE IAETTAPLTP  ELLAKIPS+R                V            
Sbjct: 322  SSYCKVVEVIAETTAPLTPFGELLAKIPSQRVDVSPKESDAADGPXPVPVVSGPPPSTPI 381

Query: 367  ----VQSEASKPRPLSPYSMYDDLKXXXXXXXXXXXSLH--------------------- 263
                 Q +A+   PLSPY +Y+DLK           S                       
Sbjct: 382  EKGPPQGKATAMSPLSPYIVYEDLKPPTSPTPTPSTSSSTARAPDVDGIPAEPKSIPSVL 441

Query: 262  -----LQEKEQAKFTPQKQQPLSPYTVYEDLK 182
                 +  KE  +   +K +PLSPY VY+DLK
Sbjct: 442  EPXSTVLAKEAIQEEAKKTRPLSPYIVYDDLK 473


>ref|XP_003541088.1| PREDICTED: uncharacterized protein LOC100779056 [Glycine max]
          Length = 528

 Score =  412 bits (1060), Expect = e-113
 Identities = 246/445 (55%), Positives = 283/445 (63%), Gaps = 42/445 (9%)
 Frame = -1

Query: 1390 ALHSSPIPIHGISGRNSIEKSVFFSGCSLGIPANKRCTDARKSSASLDFRVRAVGSVGSS 1211
            +L ++ IP   +S R + +K    S  +L       CT        L  R +A GS  SS
Sbjct: 8    SLTATTIP-SSLSRRGATDKPSATSHVNLSHFMRYPCTTRSTKQKILCTRAQASGSTKSS 66

Query: 1210 QKTANVNEPDKVTTDKDLVYVAGATGKVGSRTVRELLKLGFRVRAGVRSAQRAQTLVESV 1031
              +A        + D +LV+VAGATG+VGSRTVREL+KLGFRVRAGVRSAQRA  LV+SV
Sbjct: 67   TGSAEGISEKTDSKDDNLVFVAGATGRVGSRTVRELIKLGFRVRAGVRSAQRAGALVQSV 126

Query: 1030 RQMKLDDSSGS-KPVEKLEIVECDLENQXXXXXXXXXXXXXICCIGASEKEVFDITGPYR 854
             Q+KLD +SG  + VEKLEIVECDLE               IC IGASEKEVFDITGP+R
Sbjct: 127  EQLKLDGASGGGQAVEKLEIVECDLEKPETIGSALGDASTVICSIGASEKEVFDITGPFR 186

Query: 853  IDYKATENLINAATVAKVDHFILVTSLGTNKIGFPAAILNLFWGVLIWKRKAEEALIASG 674
            IDY+AT+NLI+AATVAKV+HFILVTSLGTNKIGFPAAILNLFWGVL+WKRKAEEAL+ASG
Sbjct: 187  IDYQATKNLIDAATVAKVNHFILVTSLGTNKIGFPAAILNLFWGVLVWKRKAEEALLASG 246

Query: 673  LPYTIVRPGGMERPTDAYKETHNLTLSPEDTLFGGQVSNLQVAELMACMAKNTRLSYCKV 494
            LPYTIVRPGGMERPTDA+KETHN+TLS EDTLFGG VSNLQ+AEL+A MAKN  LSYCK+
Sbjct: 247  LPYTIVRPGGMERPTDAFKETHNITLSTEDTLFGGLVSNLQIAELLAVMAKNRDLSYCKI 306

Query: 493  VEAIAETTAPLTPMEELLAKIPSKRXXXXXXXXXXXXXED--------------VVVQSE 356
            VEAIAETTAPLTPMEELLAKIPS+R                             V  Q E
Sbjct: 307  VEAIAETTAPLTPMEELLAKIPSQRPYISSPKKPDIAAVSVPDPPANVVTVEPKVATQQE 366

Query: 355  ASKPR-----PLSPYSMYDDLK------XXXXXXXXXXXSLHLQEKEQAKFTPQK----- 224
             ++P+     PLSPY +YDDLK                       K  A  TP       
Sbjct: 367  TAQPKPVAKQPLSPYIVYDDLKPPSSPSPSQPGGGKPTKISETVPKPSASDTPSSVPGVD 426

Query: 223  -----------QQPLSPYTVYEDLK 182
                       ++PLSPY  Y DLK
Sbjct: 427  GISQTTSSSKVEKPLSPYVAYPDLK 451


>ref|XP_002313068.1| predicted protein [Populus trichocarpa] gi|222849476|gb|EEE87023.1|
            predicted protein [Populus trichocarpa]
          Length = 564

 Score =  404 bits (1037), Expect(2) = e-111
 Identities = 224/384 (58%), Positives = 258/384 (67%), Gaps = 40/384 (10%)
 Frame = -1

Query: 1213 SQKTANVNEPDKVTTDKDLVYVAGATGKVGSRTVRELLKLGFRVRAGVRSAQRAQTLVES 1034
            +Q +      +  T D++L +VAGATGKVGSR VRELLKLGFRVRAGVRSAQ+A+ L +S
Sbjct: 55   AQASVEAISKEMETKDENLAFVAGATGKVGSRAVRELLKLGFRVRAGVRSAQKAEALAQS 114

Query: 1033 VRQMKLDDSSGSKPVEKLEIVECDLENQXXXXXXXXXXXXXICCIGASEKEVFDITGPYR 854
            V++MKLD   GS+PVE+LE VECDLE               +CCIGASEKEVFD+TGP R
Sbjct: 115  VKEMKLD-VEGSQPVERLETVECDLEKPNQIGPALGNASVVLCCIGASEKEVFDVTGPCR 173

Query: 853  IDYKATENLINAATVAKVDHFILVTSLGTNKIGFPAAILNLFWGVLIWKRKAEEALIASG 674
            IDY+AT+NL++AATVAKVDHFI+V+SLGTNK GFPAAILNLFWGVLIWKRKAEEALIASG
Sbjct: 174  IDYRATKNLVDAATVAKVDHFIMVSSLGTNKFGFPAAILNLFWGVLIWKRKAEEALIASG 233

Query: 673  LPYTIVRPGGMERPTDAYKETHNLTLSPEDTLFGGQVSNLQVAELMACMAKNTRLSYCKV 494
            +PYTIVRPGGMERPTDAYKETHNLT+S EDTLFGGQVSNLQVAE MA MAKN  LSYCKV
Sbjct: 234  VPYTIVRPGGMERPTDAYKETHNLTVSEEDTLFGGQVSNLQVAEFMAFMAKNRGLSYCKV 293

Query: 493  VEAIAETTAPLTPMEELLAKIPSKR--XXXXXXXXXXXXXEDVVVQSEASKP-------- 344
            VE IAETTAPLTPM+ELLAKIPS+R                  +V+ EA  P        
Sbjct: 294  VEVIAETTAPLTPMDELLAKIPSQRVEPKKSDAAELPKSVPPKIVEPEAPSPPSQREPAQ 353

Query: 343  ------RPLSPYSMYDDLKXXXXXXXXXXXSLH------------------------LQE 254
                  RPLSPY+ Y+DLK                                      + E
Sbjct: 354  AKAVVTRPLSPYTAYEDLKPPTSPIPTQPSGKKENVNSVEAVSMLDTPDPSPASASGIAE 413

Query: 253  KEQAKFTPQKQQPLSPYTVYEDLK 182
             + A    +  +PLSPY  Y+DLK
Sbjct: 414  TKPAPVETKTARPLSPYVAYDDLK 437



 Score = 28.1 bits (61), Expect(2) = e-111
 Identities = 10/15 (66%), Positives = 12/15 (80%)
 Frame = -3

Query: 83  YADMKPPTSPMPSLP 39
           Y D+KPPTSP P+ P
Sbjct: 433 YDDLKPPTSPSPTAP 447


>ref|XP_003539023.1| PREDICTED: uncharacterized protein LOC100802919 [Glycine max]
          Length = 529

 Score =  407 bits (1045), Expect = e-111
 Identities = 241/448 (53%), Positives = 283/448 (63%), Gaps = 45/448 (10%)
 Frame = -1

Query: 1390 ALHSSPIPIHGISGRNSIEKSVFFSGCSLGIPANKRCTDARKSSASLDFRVRAVGSVGSS 1211
            +L ++ IP   +S R + +K    S  +L       CT   K   +   R +A GS  S 
Sbjct: 8    SLTATTIPTSSLSRRAATDKPSATSHVNLSHFTRYPCTTKHKIRCT---RAQASGSTKSC 64

Query: 1210 QKTANVNEPDKVTTDKDLVYVAGATGKVGSRTVRELLKLGFRVRAGVRSAQRAQTLVESV 1031
              TA        + D +LV+VAGATG+VGSRTVREL+KLGFRVRAGVRSAQRA  LV+SV
Sbjct: 65   TGTAEGISEKTDSKDDNLVFVAGATGRVGSRTVRELIKLGFRVRAGVRSAQRAGALVQSV 124

Query: 1030 RQMKLDDSSGS-KPVEKLEIVECDLENQXXXXXXXXXXXXXICCIGASEKEVFDITGPYR 854
             Q+KLD ++G  + VEKLEIVECDLE               IC IGASEKEVFDITGP+R
Sbjct: 125  EQLKLDGANGGVQAVEKLEIVECDLEKPETIGSALGNASTVICSIGASEKEVFDITGPFR 184

Query: 853  IDYKATENLINAATVAKVDHFILVTSLGTNKIGFPAAILNLFWGVLIWKRKAEEALIASG 674
            IDY AT+NLI+AATV KV+HFILVTSLGTNKIGFPAAILNLFWGVL+WKRKAEEAL+ASG
Sbjct: 185  IDYLATKNLIDAATVTKVNHFILVTSLGTNKIGFPAAILNLFWGVLVWKRKAEEALLASG 244

Query: 673  LPYTIVRPGGMERPTDAYKETHNLTLSPEDTLFGGQVSNLQVAELMACMAKNTRLSYCKV 494
            LPYTIVRPGGMERPTDA+KETHN+TLS EDTLFGG VSNLQ+AEL+A MAKN  LSYCK+
Sbjct: 245  LPYTIVRPGGMERPTDAFKETHNITLSTEDTLFGGLVSNLQIAELLAVMAKNRDLSYCKI 304

Query: 493  VEAIAETTAPLTPMEELLAKIPSKRXXXXXXXXXXXXXEDVV-----------------V 365
            VEAIAETT+PLTPME LLA+IPS+R               VV                  
Sbjct: 305  VEAIAETTSPLTPMEGLLARIPSQRPYISSPKVIQKPDIAVVSIPDPPANVVAKEPKVAT 364

Query: 364  QSEASKPR-----PLSPYSMYDDLK----------------------XXXXXXXXXXXSL 266
            Q E ++P+     PLSPY +YDDLK                                  L
Sbjct: 365  QQETAQPKPVANQPLSPYIVYDDLKPPSSPSPSQPGGGKQTKISETVPQPSASDTPSSVL 424

Query: 265  HLQEKEQAKFTPQKQQPLSPYTVYEDLK 182
             +    Q   + + ++PLSPY VY DLK
Sbjct: 425  GVDGDSQTTSSSKVEKPLSPYVVYPDLK 452

