Gene ID: HORVU.MOREX.r2.4HG0341540
Gene ID in V1: HORVU4Hr1G082110
Gene ID in V3: HORVU.MOREX.r3.4HG0410020
Exon number: 6
Chromosome number: chr4H
Start Position: 601680928
End Position: 601685052
Gene length: 4124 bp
Strand: +
Protein ID: HORVU.MOREX.r2.4HG0341540.1
Protein length: 917 aa
Molecular weight: 101.9008 kDa
Theoretical pI: 5.56
Total number of negatively charged residues (Asp + Glu): 126
Total number of positively charged residues (Arg + Lys): 103
Instability index (II): 54.04
Aliphatic index: 72.53
Grand average of hydropathicity (GRAVY): -0.439
GO: GO:0003677,GO:0005634,GO:0006351,GO:0006355
| Protein ID | Ortholog | Gene symbol |
|---|---|---|
| HORVU.MOREX.r2.4HG0341540.1 | AT2G17150.1 | NLP1 |
| HORVU.MOREX.r2.4HG0341540.1 | Os03t0131100-01 | - |
Protein sequence:
>HORVU.MOREX.r2.4HG0341540.1
MEHPATRSDDGGMRGADMDELDLMEEFLLASPGPDFPEFLHPGGAASPSPFSPLFDLGSTITTAPAGGGDGDDDGSRRAWLIQPQEAPVSVKERLGRALQGIASRSRGAAGELLVQVWVPTRIGDRQVLTTCGQPFWVGRRSDRLESYRTVSVKYQFSADEAACAELGLPGRVFVGRVPEWTPDVRLFTDNEYPRVRYAQHFDIRGSVAMPVFERRTGACLGVVELVMTTQKINYNAEIDNICNALKEVDLRGSDVSSDPRAQVVDASYRAIVPELAHVLRAVCETHKLPLAQTWIPCVCQAKRASRHSDEKYKYCVSTVDEACYVRDPAMNGFHQACSEHHLFRGEGVVGTALGTNEPCFSPDITAYSKVQYPLSHYAKLFGLRAAVAIRLRSVKTGSMDLILEFFLPNNCITSEEQGAMLTSLSNTIQQASCTLRVVGVKELANDGSPETSSPTPPEVCDKPTEILDELSSGLNIPARTTSVDASEEVSSWIASLVDVQNNGAQGETDCGLPFGFRKQEDEGFSVTAGWPTSPVLEPEDKSFFPGFKKQEEYEVKGSPFSSDRSLSNSDKAIEKRRTKIEKTVSLQELRKHFAGSLKEAAKNLGVCPTTLKRICRHHGIDRWPSRKIKKVGHSLKKLQMVIDSVHGAEGTVRLSSLYENFTKTTWSERELQGDLSCPASEQKVQLEPSVPDRQCESRFSPHTSGSNSLSPTYSQSSNSSLGCSSDPKPQQQQSSAPQLAVKQEFFMEENQSSTQMKAASHDELQLFTEEKPVTLYRSQSHMLFSEHKPVENMSSMQEAKPDSLKIKAMYGEERCIFRLQPSWGFEKLKEEIAKRFGISQEIYHLKYLDDESEWVLLTCDADLLECIDVYKASSAKTAFMRFVIEEIMFRAFTHQFERHEDASFTVLERLLACMST
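The composition-based values listed in this record (charged residue counts, aliphatic index, GRAVY) can in principle be recomputed from the protein sequence above. The sketch below assumes they follow the usual ProtParam-style definitions, which the record itself does not state: the aliphatic index as 100 × (A + 2.9·V + 3.9·(I + L)) / length, and GRAVY as the mean Kyte-Doolittle hydropathy. The function names are illustrative only.

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def charged_counts(seq):
    """Return (Asp + Glu, Arg + Lys) residue counts."""
    counts = Counter(seq)
    return counts["D"] + counts["E"], counts["R"] + counts["K"]


def aliphatic_index(seq):
    """Aliphatic index, assuming the weights 2.9 (Val) and 3.9 (Ile + Leu)."""
    counts = Counter(seq)
    weighted = counts["A"] + 2.9 * counts["V"] + 3.9 * (counts["I"] + counts["L"])
    return 100.0 * weighted / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy; raises KeyError on non-standard residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

Passing the sequence under the FASTA header (without the `>` line) to these functions gives values to compare against the Asp + Glu, Arg + Lys, aliphatic index, and GRAVY fields.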
CDS sequence:
>HORVU.MOREX.r2.4HG0341540
ATGGAGCATCCCGCGACAAGGAGCGACGATGGCGGCATGCGTGGTGCCGACATGGACGAGCTGGATCTCATGGAGGAGTTCCTGCTGGCGTCGCCCGGGCCCGACTTCCCCGAGTTCCTGCACCCGGGCGGCGCGGCGTCGCCGAGCCCCTTCTCCCCCCTCTTCGACCTCGGCAGCACCATCACCACCGCCCCGGCCGGCGGCGGCGACGGGGACGACGACGGGTCGCGCCGCGCGTGGCTGATCCAGCCGCAGGAAGCACCGGTCTCGGTGAAGGAGCGGCTGGGGCGGGCGCTGCAGGGCATCGCGTCGCGGTCGCGCGGGGCGGCGGGCGAGCTGCTGGTCCAGGTCTGGGTGCCCACGCGCATCGGCGACCGCCAGGTGCTCACCACCTGCGGCCAGCCCTTCTGGGTCGGCCGCCGCAGCGACCGCCTCGAGAGCTACCGCACCGTGTCCGTCAAGTACCAGTTCTCCGCCGACGAGGCCGCCTGCGCCGAGCTGGGCCTCCCCGGCCGCGTCTTCGTCGGCCGCGTCCCCGAGTGGACGCCCGACGTGCGCCTCTTCACCGACAACGAGTACCCGCGCGTCCGATACGCGCAGCACTTCGACATCCGCGGCAGCGTCGCCATGCCGGTCTTCGAGCGCCGCACCGGGGCCTGCCTCGGCGTCGTCGAGCTCGTCATGACCACCCAGAAGATCAACTACAACGCCGAGATCGACAACATCTGCAATGCTCTCAAGGAGGTTGATCTCAGAGGTTCCGATGTTTCGAGCGATCCTCGCGCACAGGTGGTCGATGCTTCCTACCGAGCAATTGTGCCAGAGCTAGCGCATGTTCTCAGAGCTGTTTGTGAGACCCATAAGTTGCCACTGGCCCAGACATGGATACCCTGCGTCTGCCAGGCCAAAAGGGCGAGCCGCCACTCTGACGAAAAATACAAGTATTGCGTCTCCACCGTGGACGAGGCGTGTTATGTCCGCGATCCCGCCATGAATGGCTTTCACCAGGCTTGCTCTGAGCATCATCTGTTCAGAGGCGAGGGTGTTGTCGGCACGGCGCTCGGGACAAATGAGCCGTGTTTCTCTCCGGACATAACTGCCTACAGCAAGGTCCAATACCCCCTCTCACATTATGCAAAACTTTTCGGCTTAAGGGCCGCAGTGGCAATTCGGCTGCGAAGTGTCAAGACTGGAAGTATGGATCTTATCTTGGAATTTTTCTTGCCAAACAACTGCATAACAAGTGAAGAGCAAGGGGCCATGCTTACTTCTTTGTCCAATACCATACAACAAGCCTCATGTACACTGCGAGTTGTCGGTGTGAAAGAACTGGCGAACGATGGATCGCCTGAAACTAGCTCGCCGACCCCACCAGAAGTTTGTGACAAGCCAACTGAAATCTTGGATGAGCTTTCTAGTGGCCTTAATATTCCTGCAAGGACAACATCAGTGGATGCTTCTGAGGAGGTATCTTCATGGATTGCAAGCCTTGTGGATGTTCAGAATAATGGGGCGCAGGGAGAAACAGATTGTGGCCTGCCATTTGGATTCAGGAAACAAGAGGATGAAGGGTTCAGTGTAACAGCTGGCTGGCCAACTTCACCGGTGCTAGAACCTGAAGATAAAAGTTTCTTTCCAGGGTTTAAGAAGCAAGAAGAATATGAGGTCAAGGGTTCACCTTTTTCCAGCGATCGAAGCCTCTCAAACTCAGACAAAGCAATAGAGAAGCGGCGAACTAAAATTGAGAAAACTGTGAGCCTGCAAGAACTTCGGAAGCATTTCGCTGGTAGTTTGAAAGAAGCTGCAAAGAATTTAGGAGTGTGCCCTACTACATTGAAGAGGATTTGCAGACATCATGGAATTGATCGTTGGCCATCAAGAAAGATCAAGAAAGTTGGGCACTCTCTAAAGAAATTGCAAATGGTCATTGATTCGGTACATGGAGCTGAAGGAACTGTTCGGCTGAGTTCACTCTATGAAAACTTCACCAAGACTACATG
GTCAGAAAGAGAATTGCAGGGAGATCTGAGTTGTCCAGCATCAGAGCAAAAGGTTCAGCTGGAGCCTTCAGTTCCTGATCGACAGTGCGAGAGCAGGTTCAGTCCGCATACCTCTGGCTCAAATTCCCTCTCCCCCACCTACAGCCAGAGCTCAAATTCTAGCCTGGGCTGTTCCAGCGATCCAAAGCCTCAGCAACAGCAAAGCAGCGCTCCTCAGCTTGCAGTGAAGCAGGAGTTTTTCATGGAGGAGAATCAGAGTTCGACACAGATGAAAGCTGCGAGCCATGACGAACTGCAGTTATTTACTGAAGAAAAACCTGTCACCCTGTATAGATCTCAGAGCCACATGCTATTCAGTGAACACAAACCAGTGGAAAACATGTCAAGCATGCAAGAAGCCAAGCCTGATTCTCTCAAGATAAAAGCCATGTATGGCGAGGAAAGATGCATATTCCGGCTTCAGCCGAGTTGGGGCTTTGAAAAGCTAAAAGAAGAAATCGCAAAGCGGTTCGGCATTTCTCAGGAAATTTATCACCTCAAGTACTTGGATGACGAGTCGGAGTGGGTTCTTCTAACATGTGATGCAGACCTGCTGGAGTGCATCGATGTATACAAGGCATCAAGTGCTAAAACAGCATTCATGCGCTTTGTAATTGAGGAAATTATGTTTCGTGCCTTCACGCATCAGTTTGAGCGGCACGAAGATGCATCCTTTACGGTGCTGGAACGCCTTCTTGCTTGTATGAGTACATGA
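The CDS and the protein length can be checked against each other: a complete CDS for a 917 aa protein should contain (917 + 1) × 3 = 2754 nt, counting the stop codon. The sketch below is a minimal consistency check under that assumption. It joins wrapped FASTA lines first, since the CDS in this record spans more than one line, and it does not look for internal stop codons. The helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def read_fasta_sequence(text):
    """Join the sequence lines of a single-record FASTA string, skipping the header."""
    lines = [line.strip() for line in text.splitlines()]
    return "".join(line for line in lines if line and not line.startswith(">")).upper()


def cds_matches_protein(cds, protein_length):
    """Check frame, start codon, terminal stop codon, and codon count."""
    if len(cds) % 3 != 0:
        return False
    if not cds.startswith("ATG"):
        return False
    if cds[-3:] not in STOP_CODONS:
        return False
    # The final codon is the stop, so it does not encode a residue.
    return len(cds) // 3 - 1 == protein_length
```

For this record, the call would be `cds_matches_protein(read_fasta_sequence(cds_text), 917)`, where `cds_text` holds the FASTA block above.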