Gene Id: HORVU.MOREX.r2.5HG0446090
Gene ID in V1: HORVU5Hr1G123770
Gene ID in V3: HORVU.MOREX.r3.5HG0536140
exon number: 2
Chromosome number: chr5H
Start Position: 593926506
End Position: 593933047
Gene length: 6541 bp
Strand positive: +
Protein Id: HORVU.MOREX.r2.5HG0446090.1
Protein length: 510 aa
Molecular weight: 56.58769 kDa
Theoretical pI: 5
Total number of negatively charged residues(Asp + Glu): 62
Total number of positively charged residues (Arg + Lys): 43
Instability index (II): 59.75
Aliphatic index: 70.96
Grand average of hydropathicity (GRAVY): -0.572
GO: GO:0003677,GO:0003700,GO:0005634,GO:0006355,GO:0043565
Protein ID | Ortholog | gene symbol |
---|---|---|
HORVU.MOREX.r2.5HG0446090.1 | AT5G16820.1 | HSFA1B |
HORVU.MOREX.r2.5HG0446090.1 | Os03t0854500-01 | OsHsfA1 |
Protein sequence:
>HORVU.MOREX.r2.5HG0446090.1
MEGGVALASSVTTAVAPPGQGAGAPPPFLMKTYDMVDDPATDAVVSWGPASNSFIVWNTPEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVDPDKWEFANEGFLRGQKHLLKTINRRKPLHANNQVQVQQQQHQQQHQQQPQLQNAPIPSCVEVGKFGMEEEIEMLKRDKNVLMQELVRLRQQQQTTDHQLQTLGKRLHGMEQRQQQMMSFLAKAMQSPGFLAQFVQQNENSKRRIVAANKKRRLPKQDDGLNPESALLDGQIIKYQPMINEAAKAMLRKILQQDTSPHRFESMGNSDNLLLENCMPSAQTFDSSSSTRNSAVTLAEVPGNSGMPYMPTSSGLSAICSSSSPPEMQCPPVLDSNSSTQLPNMSAVPSVPKAMTPGLSDISIPGFPDLHDLITEDAINIPVENYAMPGPECIFPLPEGSDDSVPMDPIDTDEIDDTQKLPGIIDSFWEQFLCASPLSVDNDEVDSGLLDTREAQEENGWTRTENLANLTEQMGLLSSNHRG
CDS sequence:
>HORVU.MOREX.r2.5HG0446090
ATGGAGGGCGGGGTCGCTCTGGCGTCGTCGGTGACGACGGCGGTGGCGCCTCCGGGGCAGGGGGCGGGGGCGCCGCCGCCGTTCCTGATGAAGACGTACGACATGGTGGACGACCCGGCGACGGACGCGGTGGTGTCGTGGGGGCCGGCCAGCAACAGCTTCATCGTCTGGAACACGCCCGAGTTCGCCAGGGACCTCCTGCCCAAGTACTTCAAGCACAACAACTTCTCATCCTTCGTCAGGCAGCTCAACACATACGGATTCAGAAAAGTTGATCCAGACAAGTGGGAATTTGCAAATGAGGGTTTTTTGAGAGGACAGAAACATCTCCTGAAGACCATCAACAGAAGGAAACCATTGCATGCAAACAACCAAGTGCAAGTGCAACAGCAGCAGCACCAGCAGCAGCATCAGCAGCAACCCCAGTTGCAGAATGCGCCAATACCTTCTTGTGTAGAGGTGGGGAAGTTTGGGATGGAGGAAGAGATTGAGATGCTGAAAAGGGATAAAAATGTTCTGATGCAGGAGCTTGTCAGGCTGAGACAGCAACAGCAGACCACTGACCATCAGCTGCAGACTTTGGGCAAGCGTCTTCATGGAATGGAGCAACGGCAGCAGCAAATGATGTCTTTCCTGGCTAAAGCAATGCAGAGTCCTGGCTTCCTAGCACAGTTTGTACAGCAGAATGAAAACAGCAAACGAAGAATAGTAGCTGCCAACAAGAAAAGGAGGCTACCTAAGCAAGATGATGGCCTGAACCCTGAAAGCGCATTGTTGGATGGCCAGATAATCAAGTATCAGCCTATGATCAATGAAGCAGCCAAAGCAATGCTGAGGAAGATCCTACAGCAGGATACCTCCCCACACAGATTTGAATCCATGGGCAATTCAGATAATCTGTTACTGGAGAACTGTATGCCAAGTGCTCAAACTTTTGACAGCTCTTCATCAACCCGGAATTCTGCAGTCACCCTTGCAGAGGTCCCAGGCAACTCGGGCATGCCATATATGCCCACGAGCTCTGGACTTTCAGCAATTTGTTCATCGTCAAGCCCCCCTGAGATGCAGTGTCCTCCAGTTCTGGACAGCAACTCATCCACACAACTTCCCAACATGAGTGCTGTGCCTTCTGTGCCAAAGGCTATGACGCCAGGTCTAAGTGATATCAGTATTCCAGGATTCCCAGATCTGCATGATCTCATAACGGAGGACGCAATTAATATTCCTGTAGAGAACTATGCAATGCCTGGTCCTGAGTGTATATTCCCCTTACCTGAAGGCAGTGATGACTCTGTTCCCATGGACCCCATCGACACTGATGAGATCGATGACACTCAGAAGCTTCCGGGCATCATTGATTCCTTCTGGGAGCAGTTCCTTTGTGCCAGCCCTCTATCTGTTGACAACGACGAGGTCGATTCAGGTCTGCTGGACACACGGGAAGCGCAAGAGGAGAATGGATGGACCAGGACCGAGAACTTGGCGAATCTCACGGAACAAATGGGCCTGCTGTCGTCGAACCATAGAGGGTGA