Gene Id: HORVU.MOREX.r2.1HG0041620
Gene ID in V1: HORVU1Hr1G050560
Gene ID in V3: HORVU.MOREX.r3.1HG0053170
exon number: 1
Chromosome number: chr1H
Start Position: 355760678
End Position: 355762732
Gene length: 2054 bp
Strand positive: +
Protein Id: HORVU.MOREX.r2.1HG0041620.1
Protein length: 684 aa
Molecular weight: 73.90273 kDa
Theoretical pI: 6.06
Total number of negatively charged residues(Asp + Glu): 75
Total number of positively charged residues (Arg + Lys): 66
Instability index (II): 57.45
Aliphatic index: 67.19
Grand average of hydropathicity (GRAVY): -0.519
GO: GO:0046983
Protein ID | Ortholog | gene symbol |
---|---|---|
HORVU.MOREX.r2.1HG0041620.1 | AT1G32640.1 | MYC2 |
HORVU.MOREX.r2.1HG0041620.1 | Os10t0575000-01 | OsbHLH009 |
Protein sequence:
>HORVU.MOREX.r2.1HG0041620.1
MNLWTDDNASMMEAFMASADMPAFPWGAAATPPPPAAVPAFNQDTLQQRLQAIIEGSRETWTYAIFWQSSTDAGASLLGWGDGYYKGCDDADKRRQQPTPASAAEQEHRKRVLRELNSLIAGGGAAAPDEAVEEEVTDTEWFFLVSMTQSFPNGMGLPGQALFAGQPIWIATGLASAPCERARQAYTFGLRTMVCIPLGTGVLELGATEVIFQTTDSLGRIRSLFSLNGGGGGSGSWPPVAPPPQEAETDPSVLWLADAPAGDMKESPPSVEISVSKPPPSQPPQIHHFENGSTSTLTENPSLSVHAQQPLPQQQAAAAAQRQNQLQLQHQHNQGPFRRELNFSDFASNPSVTVTPPFFKPESGEILNFGADSTSRRNPSPAPPAATASLTTAPGSLFSQHTATVTAPSNDAKNNPKRSMEATSRASNTNHHQTATANEGMLSFSSAPTTRPSTGTGAPAKSESDHSDLEASVREVESSRVVPPPEEKRPRKRGRKPANGREEPLNHVEAERQRREKLNQRFYALRAVVPNVSKMDKASLLGDAISYINELRGKMTALESDKETLHSQIEALKKERDARPAAPSSSGMHDNGARCHAVEIEAKILGLEAMIRVQCHKRNHPAAKLMTALRELDLDVYHASVSVVKDIMIQQVAVKMATRVYSQEQLNAALYGRLAEPGAAMQIR
CDS sequence:
>HORVU.MOREX.r2.1HG0041620
ATGAACCTGTGGACGGACGACAACGCCTCCATGATGGAGGCCTTCATGGCCTCCGCCGACATGCCGGCGTTCCCCTGGGGCGCGGCGGCCACCCCGCCGCCGCCGGCCGCCGTGCCGGCCTTCAACCAGGACACGCTCCAGCAGCGCCTGCAGGCCATCATCGAGGGCTCCAGGGAGACCTGGACCTACGCCATCTTCTGGCAGTCCTCCACCGACGCCGGCGCCTCGCTCCTCGGCTGGGGCGACGGCTATTACAAGGGCTGCGACGACGCCGACAAGCGCCGCCAGCAGCCCACCCCGGCCTCCGCCGCCGAGCAGGAGCACCGCAAGCGCGTCCTCAGGGAGCTCAACTCGCTCATAGCCGGGGGCGGCGCCGCCGCGCCCGACGAGGCCGTCGAGGAGGAGGTCACGGACACCGAGTGGTTCTTCCTCGTCTCCATGACCCAGTCCTTCCCCAACGGGATGGGCTTGCCGGGCCAGGCTCTCTTCGCCGGCCAGCCTATCTGGATCGCCACCGGGCTCGCCAGCGCGCCCTGCGAGCGGGCCAGGCAGGCCTACACCTTCGGCCTCCGCACCATGGTCTGCATCCCCCTCGGCACCGGCGTGCTCGAGCTCGGCGCCACCGAGGTCATCTTCCAGACCACCGATAGCTTGGGGCGGATCCGCTCGCTCTTCAGCCTCAACGGCGGAGGAGGGGGCTCTGGATCCTGGCCGCCCGTGGCGCCGCCGCCCCAGGAGGCGGAGACGGATCCGTCCGTGCTCTGGCTCGCCGACGCGCCGGCCGGGGACATGAAGGAGTCGCCGCCGTCCGTCGAGATCTCCGTCTCCAAGCCGCCGCCGTCACAGCCGCCGCAGATCCATCACTTCGAGAACGGGAGCACCAGCACGCTCACGGAGAACCCCAGCCTCTCCGTGCACGCGCAGCAGCCTCTGCCGCAGCAGCAGGCGGCGGCGGCGGCGCAGAGGCAGAACCAGCTCCAGCTCCAGCACCAGCACAACCAGGGTCCTTTCCGCCGGGAGCTCAATTTCTCAGATTTCGCGTCCAACCCATCCGTCACGGTGACCCCGCCTTTCTTCAAGCCTGAGTCTGGTGAGATCCTAAACTTTGGCGCTGACAGCACCAGCCGGAGGAACCCTTCGCCGGCGCCCCCCGCCGCGACGGCCAGCCTCACCACCGCGCCGGGGAGCCTCTTCTCCCAGCACACGGCGACGGTGACGGCCCCATCAAACGACGCCAAGAACAACCCGAAGCGGTCCATGGAGGCCACCTCCCGCGCGAGCAACACCAACCACCACCAGACCGCGACAGCCAACGAGGGGATGCTGTCCTTCTCGTCCGCGCCGACGACGCGGCCGTCCACCGGCACGGGCGCACCAGCCAAGTCGGAGTCCGACCATTCCGACCTGGAGGCGTCGGTCCGCGAGGTGGAGAGCAGCCGCGTGGTGCCTCCGCCGGAGGAGAAGCGGCCGCGCAAGCGCGGGCGCAAGCCGGCGAACGGGCGCGAGGAGCCACTGAACCACGTGGAGGCGGAGCGGCAGCGGCGGGAGAAGCTGAACCAGCGGTTCTACGCGCTCCGCGCCGTGGTGCCCAACGTGTCCAAGATGGACAAGGCCTCACTGCTGGGCGACGCCATCTCCTACATCAACGAGCTCCGCGGTAAGATGACGGCGCTGGAGTCGGACAAGGAGACGCTCCACTCCCAAATTGAGGCGCTGAAGAAGGAGCGCGACGCCCGGCCGGCCGCGCCGTCGTCGTCGGGAATGCACGACAACGGGGCGCGGTGCCACGCGGTGGAGATCGAGGCCAAGATCCTGGGGCTGGAGGCGATGATCCGCGTGCAGTGCCACAAGCGCAACCACCCGGCGGCGAAGCTGATGACGGCGCTGCGGGAGCTGGACCTGGACGTGTACCACGCCAGCGTCTCGGTGGTGAAGGACATCATGATCCAGCAGGTGGCGGTGAAGATGGCCACCCGGGTCTACTCCCAGGAACAGCTCAACGCGGCGCTCTACGGCCGCCTCGCCGAGCCGGGCGCCGCGATGCAAATCCGGTAA