{"id":"P02745","source":"Swiss-Prot","name":"Complement C1q subcomponent subunit A","specific_function":"C1q associates with the proenzymes C1r and C1s to yield C1, the first component of the serum complement system. The collagen-like regions of C1q interact with the Ca(2+)-dependent C1r(2)C1s(2) proenzyme complex, and efficient activation of C1 takes place on interaction of the globular heads of C1q with the Fc regions of IgG or IgM antibody present in immune complexes.","gene_name":"C1QA","locus":"1p36.12","cellular_location":"Secreted","signal_regions":"1-22","theoretical_pi":9.45,"molecular_weight":26016.47,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0016678|Complement C1q subcomponent subunit A\nMEGPRGWLVLCVLAISLASMVTEDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIR\nTGIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAI\nRRNPPMGGNVVIFDTVITNQEEPYQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSR\nGQVRRSLGFCDTTNKGLFQVVSGGMVLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFL\nIFPSA","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0016679|Complement C1q subcomponent subunit A (C1QA)\nATGGAGGGTCCCCGGGGATGGCTGGTGCTCTGTGTGCTGGCCATATCGCTGGCCTCTATG\nGTGACCGAGGACTTGTGCCGAGCACCAGACGGGAAGAAAGGGGAGGCAGGAAGACCTGGC\nAGACGGGGGCGGCCAGGCCTCAAGGGGGAGCAAGGGGAGCCGGGGGCCCCTGGCATCCGG\nACAGGCATCCAAGGCCTTAAAGGAGACCAGGGGGAACCTGGGCCCTCTGGAAACCCCGGC\nAAGGTGGGCTACCCAGGGCCCAGCGGCCCCCTCGGAGCCCGTGGCATCCCGGGAATTAAA\nGGCACCAAGGGCAGCCCAGGAAACATCAAGGACCAGCCGAGGCCAGCCTTCTCCGCCATT\nCGGCGGAACCCCCCAATGGGGGGCAACGTGGTCATCTTCGACACGGTCATCACCAACCAG\nGAAGAACCGTACCAGAACCACTCCGGCCGATTCGTCTGCACTGTACCCGGCTACTACTAC\nTTCACCTTCCAGGTGCTGTCCCAGTGGGAAATCTGCCTGTCCATCGTCTCCTCCTCAAGG\nGGCCAGGTCCGACGCTCCCTGGGCTTCTGTGACACCACCAACAAGGGGCTCTTCCAGGTG\nGTGTCAGGGGGCATGGTGCTTCAGCTGCAGCAGGGTGACCAGGTCTGGGTTGAAAAAGAC\nCCCAAAAAGGGTCACATTTACCAGGGCTCTGAGGCCGACAGCGTCTTCAGCGGCTTCCTC\nATCTTCCCATCTGCCTGA","gene_format":"FASTA","parent_id":"BE0002094"} {"id":"P02746","source":"Swiss-Prot","name":"Complement C1q subcomponent 
subunit B","specific_function":"C1q associates with the proenzymes C1r and C1s to yield C1, the first component of the serum complement system. The collagen-like regions of C1q interact with the Ca(2+)-dependent C1r(2)C1s(2) proenzyme complex, and efficient activation of C1 takes place on interaction of the globular heads of C1q with the Fc regions of IgG or IgM antibody present in immune complexes.","gene_name":"C1QB","locus":"1p36.12","cellular_location":"Secreted","signal_regions":"1-27","theoretical_pi":8.87,"molecular_weight":26721.62,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0016676|Complement C1q subcomponent subunit B\nMMMKIPWGSIPVLMLLLLLGLIDISQAQLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKG\nEKGLPGLAGDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQ\nKIAFSATRTINVPLRRDQTIRFDHVITNMNNNYEPRSGKFTCKVPGLYYFTYHASSRGNL\nCVNLMRGRERAQKVVTFCDYAYNTFQVTTGGMVLKLEQGENVFLQATDKNSLLGMEGANS\nIFSGFLLFPDMEA","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0016677|Complement C1q subcomponent subunit B (C1QB)\nATGATGATGAAGATCCCATGGGGCAGCATCCCAGTACTGATGTTGCTCCTGCTCCTGGGC\nCTAATCGATATCTCCCAGGCCCAGCTCAGCTGCACCGGGCCCCCAGCCATCCCTGGCATC\nCCGGGTATCCCTGGGACACCTGGCCCCGATGGCCAACCTGGGACCCCAGGGATAAAAGGA\nGAGAAAGGGCTTCCAGGGCTGGCTGGAGACCATGGTGAGTTCGGAGAGAAGGGAGACCCA\nGGGATTCCTGGGAATCCAGGAAAAGTCGGCCCCAAGGGCCCCATGGGCCCTAAAGGTGGC\nCCAGGGGCCCCTGGAGCCCCAGGCCCCAAAGGTGAATCGGGAGACTACAAGGCCACCCAG\nAAAATCGCCTTCTCTGCCACAAGAACCATCAACGTCCCCCTGCGCCGGGACCAGACCATC\nCGCTTCGACCACGTGATCACCAACATGAACAACAATTATGAGCCCCGCAGTGGCAAGTTC\nACCTGCAAGGTGCCCGGTCTCTACTACTTCACCTACCACGCCAGCTCTCGAGGGAACCTG\nTGCGTGAACCTCATGCGTGGCCGGGAGCGTGCACAGAAGGTGGTCACCTTCTGTGACTAT\nGCCTACAACACCTTCCAGGTCACCACCGGTGGCATGGTCCTCAAGCTGGAGCAGGGGGAG\nAACGTCTTCCTGCAGGCCACCGACAAGAACTCACTACTGGGCATGGAGGGTGCCAACAGC\nATCTTTTCCGGGTTCCTGCTCTTTCCAGATATGGAGGCCTGA","gene_format":"FASTA","parent_id":"BE0002095"} {"id":"P02747","source":"Swiss-Prot","name":"Complement C1q subcomponent subunit C","specific_function":"C1q 
associates with the proenzymes C1r and C1s to yield C1, the first component of the serum complement system. The collagen-like regions of C1q interact with the Ca(2+)-dependent C1r(2)C1s(2) proenzyme complex, and efficient activation of C1 takes place on interaction of the globular heads of C1q with the Fc regions of IgG or IgM antibody present in immune complexes.","gene_name":"C1QC","locus":"1p36.11","cellular_location":"Secreted","signal_regions":"1-28","theoretical_pi":8.58,"molecular_weight":25773.56,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0011520|Complement C1q subcomponent subunit C\nMDVGPSSLPHLGLKLLLLLLLLPLRGQANTGCYGIPGMPGLPGAPGKDGYDGLPGPKGEP\nGIPAIPGIRGPKGQKGEPGLPGHPGKNGPMGPPGMPGVPGPMGIPGEPGEEGRYKQKFQS\nVFTVTRQTHQPPAPNSLIRFNAVLTNPQGDYDTSTGKFTCKVPGLYYFVYHASHTANLCV\nLLYRSGVKVVTFCGHTSKTNQVNSGGVLLRLQVGEEVWLAVNDYYDMVGIQGSDSVFSGF\nLLFPD","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0011521|Complement C1q subcomponent subunit C (C1QC)\nATGGACGTGGGGCCCAGCTCCCTGCCCCACCTTGGGCTGAAGCTGCTGCTGCTCCTGCTG\nCTGCTGCCCCTCAGGGGCCAAGCCAACACAGGCTGCTACGGGATCCCAGGGATGCCCGGC\nCTGCCCGGGGCACCAGGGAAGGATGGGTACGACGGACTGCCGGGGCCCAAGGGGGAGCCA\nGGAATCCCAGCCATTCCCGGGATCCGAGGACCCAAAGGGCAGAAGGGAGAACCCGGCTTA\nCCCGGCCATCCTGGGAAAAATGGCCCCATGGGACCCCCTGGGATGCCAGGGGTGCCCGGC\nCCCATGGGCATCCCTGGAGAGCCAGGTGAGGAGGGCAGATACAAGCAGAAATTCCAGTCA\nGTGTTCACGGTCACTCGGCAGACCCACCAGCCCCCTGCACCCAACAGCCTGATCAGATTC\nAACGCGGTCCTCACCAACCCGCAGGGAGATTATGACACGAGCACTGGCAAGTTCACCTGC\nAAAGTCCCCGGCCTCTACTACTTTGTCTACCACGCGTCGCATACAGCCAACCTGTGCGTG\nCTGCTGTACCGCAGCGGCGTCAAAGTGGTCACCTTCTGTGGCCACACGTCCAAAACCAAT\nCAGGTCAACTCGGGCGGTGTGCTGCTGAGGTTGCAGGTGGGCGAGGAGGTGTGGCTGGCT\nGTCAATGACTACTACGACATGGTGGGCATCCAGGGCTCTGACAGCGTCTTCTCCGGCTTC\nCTGCTCTTCCCCGACTAG","gene_format":"FASTA","parent_id":"BE0002096"} {"id":"P08637","source":"Swiss-Prot","name":"Low affinity immunoglobulin gamma Fc region receptor III-A","specific_function":"Receptor for the Fc region of IgG. 
Binds complexed or aggregated IgG and also monomeric IgG. Mediates antibody-dependent cellular cytotoxicity (ADCC) and other antibody-dependent responses, such as phagocytosis.","gene_name":"FCGR3A","locus":"1q23","cellular_location":"Cell membrane","transmembrane_regions":"209-229","signal_regions":"1-16","theoretical_pi":8.21,"molecular_weight":29088.895,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0004112|Low affinity immunoglobulin gamma Fc region receptor III-A\nMWQLLLPTALLLLVSAGMRTEDLPKAVVFLEPQWYRVLEKDSVTLKCQGAYSPEDNSTQW\nFHNESLISSQASSYFIDAATVDDSGEYRCQTNLSTLSDPVQLEVHIGWLLLQAPRWVFKE\nEDPIHLRCHSWKNTALHKVTYLQNGKGRKYFHHNSDFYIPKATLKDSGSYFCRGLFGSKN\nVSSETVNITITQGLAVSTISSFFPPGYQVSFCLVMVLLFAVDTGLYFSVKTNIRSSTRDW\nKDHKFKWRKDPQDK","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0011522|Low affinity immunoglobulin gamma Fc region receptor III-A (FCGR3A)\nATGGGTGGAGGGGCTGGGGAAAGGCTGTTTACTTCCTCCTGTCTAGTCGGTTTGGTCCCT\nTTAGGGCTCCGGATATCTTTGGTGACTTGTCCACTCCAGTGTGGCATCATGTGGCAGCTG\nCTCCTCCCAACTGCTCTGCTACTTCTAGTTTCAGCTGGCATGCGGACTGAAGATCTCCCA\nAAGGCTGTGGTGTTCCTGGAGCCTCAATGGTACAGGGTGCTCGAGAAGGACAGTGTGACT\nCTGAAGTGCCAGGGAGCCTACTCCCCTGAGGACAATTCCACACAGTGGTTTCACAATGAG\nAGCCTCATCTCAAGCCAGGCCTCGAGCTACTTCATTGACGCTGCCACAGTCGACGACAGT\nGGAGAGTACAGGTGCCAGACAAACCTCTCCACCCTCAGTGACCCGGTGCAGCTAGAAGTC\nCATATCGGCTGGCTGTTGCTCCAGGCCCCTCGGTGGGTGTTCAAGGAGGAAGACCCTATT\nCACCTGAGGTGTCACAGCTGGAAGAACACTGCTCTGCATAAGGTCACATATTTACAGAAT\nGGCAAAGGCAGGAAGTATTTTCATCATAATTCTGACTTCTACATTCCAAAAGCCACACTC\nAAAGACAGCGGCTCCTACTTCTGCAGGGGGCTTTTTGGGAGTAAAAATGTGTCTTCAGAG\nACTGTGAACATCACCATCACTCAAGGTTTGGCAGTGTCAACCATCTCATCATTCTTTCCA\nCCTGGGTACCAAGTCTCTTTCTGCTTGGTGATGGTACTCCTTTTTGCAGTGGACACAGGA\nCTATATTTCTCTGTGAAGACAAACATTCGAAGCTCAACAAGAGACTGGAAGGACCATAAA\nTTTAAATGGAGAAAGGACCCTCAAGACAAATGA","gene_format":"FASTA","parent_id":"BE0002097"} {"id":"P12314","source":"Swiss-Prot","name":"High affinity immunoglobulin gamma Fc receptor I","general_function":"Receptor signaling protein 
activity","specific_function":"High affinity receptor for the Fc region of immunoglobulins gamma. Functions in both innate and adaptive immune responses.","gene_name":"FCGR1A","locus":"1q21.2-q21.3","cellular_location":"Cell membrane","transmembrane_regions":"293-313","signal_regions":"1-15","theoretical_pi":8.08,"molecular_weight":42631.525,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001416|High affinity immunoglobulin gamma Fc receptor I\nMWFLTTLLLWVPVDGQVDTTKAVITLQPPWVSVFQEETVTLHCEVLHLPGSSSTQWFLNG\nTATQTSTPSYRITSASVNDSGEYRCQRGLSGRSDPIQLEIHRGWLLLQVSSRVFTEGEPL\nALRCHAWKDKLVYNVLYYRNGKAFKFFHWNSNLTILKTNISHNGTYHCSGMGKHRYTSAG\nISVTVKELFPAPVLNASVTSPLLEGNLVTLSCETKLLLQRPGLQLYFSFYMGSKTLRGRN\nTSSEYQILTARREDSGLYWCEAATEDGNVLKRSPELELQVLGLQLPTPVWFHVLFYLAVG\nIMFLVNTVLWVTIRKELKRKKKWDLEISLDSGHEKKVISSLQEDRHLEEELKCQEQKEEQ\nLQEGVHRKEPQGAT","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0016183|High affinity immunoglobulin gamma Fc receptor I (FCGR1A)\nATGTGGTTCTTGACAACTCTGCTCCTTTGGGTTCCAGTTGATGGGCAAGTGGACACCACA\nAAGGCAGTGATCACTTTGCAGCCTCCATGGGTCAGCGTGTTCCAAGAGGAAACCGTAACC\nTTGCACTGTGAGGTGCTCCATCTGCCTGGGAGCAGCTCTACACAGTGGTTTCTCAATGGC\nACAGCCACTCAGACCTCGACCCCCAGCTACAGAATCACCTCTGCCAGTGTCAATGACAGT\nGGTGAATACAGGTGCCAGAGAGGTCTCTCAGGGCGAAGTGACCCCATACAGCTGGAAATC\nCACAGAGGCTGGCTACTACTGCAGGTCTCCAGCAGAGTCTTCACGGAAGGAGAACCTCTG\nGCCTTGAGGTGTCATGCGTGGAAGGATAAGCTGGTGTACAATGTGCTTTACTATCGAAAT\nGGCAAAGCCTTTAAGTTTTTCCACTGGAATTCTAACCTCACCATTCTGAAAACCAACATA\nAGTCACAATGGCACCTACCATTGCTCAGGCATGGGAAAGCATCGCTACACATCAGCAGGA\nATATCTGTCACTGTGAAAGAGCTATTTCCAGCTCCAGTGCTGAATGCATCTGTGACATCC\nCCACTCCTGGAGGGGAATCTGGTCACCCTGAGCTGTGAAACAAAGTTGCTCTTGCAGAGG\nCCTGGTTTGCAGCTTTACTTCTCCTTCTACATGGGCAGCAAGACCCTGCGAGGCAGGAAC\nACATCCTCTGAATACCAAATACTAACTGCTAGAAGAGAAGACTCTGGGTTATACTGGTGC\nGAGGCTGCCACAGAGGATGGAAATGTCCTTAAGCGCAGCCCTGAGTTGGAGCTTCAAGTG\nCTTGGCCTCCAGTTACCAACTCCTGTCTGGTTTCATGTCCTTTTCTATCTGGCAGTGGGA\nATAATGTTTTTAGTGAACACTGTTCTCTGGGTGACAATACGTAAAGAACTGAAAAGAAAG\nAAAAAGTGGGAT
TTAGAAATCTCTTTGGATTCTGGTCATGAGAAGAAGGTAATTTCCAGC\nCTTCAAGAAGACAGACATTTAGAAGAAGAGCTGAAATGTCAGGAACAAAAAGAAGAACAG\nCTGCAGGAAGGGGTGCACCGGAAGGAGCCCCAGGGGGCCACGTAG","gene_format":"FASTA","parent_id":"BE0000710"} {"id":"P12318","source":"Swiss-Prot","name":"Low affinity immunoglobulin gamma Fc region receptor II-a","specific_function":"Binds to the Fc region of immunoglobulins gamma. Low affinity receptor. By binding to IgG it initiates cellular responses against pathogens and soluble antigens. Promotes phagocytosis of opsonized antigens.","gene_name":"FCGR2A","locus":"1q23","cellular_location":"Cell membrane","transmembrane_regions":"218-240","signal_regions":"1-33","theoretical_pi":6.78,"molecular_weight":35000.42,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0037156|Low affinity immunoglobulin gamma Fc region receptor II-a\nMTMETQMSQNVCPRNLWLLQPLTVLLLLASADSQAAAPPKAVLKLEPPWINVLQEDSVTL\nTCQGARSPESDSIQWFHNGNLIPTHTQPSYRFKANNNDSGEYTCQTGQTSLSDPVHLTVL\nSEWLVLQTPHLEFQEGETIMLRCHSWKDKPLVKVTFFQNGKSQKFSHLDPTFSIPQANHS\nHSGDYHCTGNIGYTLFSSKPVTITVQVPSMGSSSPMGIIVAVVIATAVAAIVAAVVALIY\nCRKKRISANSTDPVKAAQFEPPGRQMIAIRKRQLEETNNDYETADGGYMTLNPRAPTDDD\nKNIYLTLPPNDHVNSNN","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0019233|Low affinity immunoglobulin gamma Fc region receptor II-a 
(FCGR2A)\nATGACTATGGAGACCCAAATGTCTCAGAATGTATGTCCCAGAAACCTGTGGCTGCTTCAA\nCCATTGACAGTTTTGCTGCTGCTGGCTTCTGCAGACAGTCAAGCTGCAGCTCCCCCAAAG\nGCTGTGCTGAAACTTGAGCCCCCGTGGATCAACGTGCTCCAGGAGGACTCTGTGACTCTG\nACATGCCAGGGGGCTCGCAGCCCTGAGAGCGACTCCATTCAGTGGTTCCACAATGGGAAT\nCTCATTCCCACCCACACGCAGCCCAGCTACAGGTTCAAGGCCAACAACAATGACAGCGGG\nGAGTACACGTGCCAGACTGGCCAGACCAGCCTCAGCGACCCTGTGCATCTGACTGTGCTT\nTCCGAATGGCTGGTGCTCCAGACCCCTCACCTGGAGTTCCAGGAGGGAGAAACCATCATG\nCTGAGGTGCCACAGCTGGAAGGACAAGCCTCTGGTCAAGGTCACATTCTTCCAGAATGGA\nAAATCCCAGAAATTCTCCCATTTGGATCCCACCTTCTCCATCCCACAAGCAAACCACAGT\nCACAGTGGTGATTACCACTGCACAGGAAACATAGGCTACACGCTGTTCTCATCCAAGCCT\nGTGACCATCACTGTCCAAGTGCCCAGCATGGGCAGCTCTTCACCAATGGGGATCATTGTG\nGCTGTGGTCATTGCGACTGCTGTAGCAGCCATTGTTGCTGCTGTAGTGGCCTTGATCTAC\nTGCAGGAAAAAGCGGATTTCAGCCAATTCCACTGATCCTGTGAAGGCTGCCCAATTTGAG\nCCACCTGGACGTCAAATGATTGCCATCAGAAAGAGACAACTTGAAGAAACCAACAATGAC\nTATGAAACAGCTGACGGCGGCTACATGACTCTGAACCCCAGGGCACCTACTGACGATGAT\nAAAAACATCTACCTGACTCTTCCTCCCAACGACCATGTCAACAGTAATAACTAA","gene_format":"FASTA","parent_id":"BE0002098"} {"id":"P01375","source":"Swiss-Prot","name":"Tumor necrosis factor","general_function":"Tumor necrosis factor receptor binding","specific_function":"Cytokine that binds to TNFRSF1A/TNFR1 and TNFRSF1B/TNFBR. It is mainly secreted by macrophages and can induce cell death of certain tumor cell lines. It is potent pyrogen causing fever by direct action or by stimulation of interleukin-1 secretion and is implicated in the induction of cachexia, Under certain conditions it can stimulate cell proliferation and induce cell differentiation. Impairs regulatory T-cells (Treg) function in individuals with rheumatoid arthritis via FOXP3 dephosphorylation. Upregulates the expression of protein phosphatase 1 (PP1), which dephosphorylates the key 'Ser-418' residue of FOXP3, thereby inactivating FOXP3 and rendering Treg cells functionally defective (PubMed:23396208). 
Key mediator of cell death in the anticancer action of BCG-stimulated neutrophils in combination with DIABLO/SMAC mimetic in the RT4v6 bladder cancer cell line (PubMed:22517918).The TNF intracellular domain (ICD) form induces IL12 production in dendritic cells.","gene_name":"TNF","locus":"6p21.3","cellular_location":"Cell membrane","transmembrane_regions":"36-56","theoretical_pi":6.92,"molecular_weight":25644.15,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001404|Tumor necrosis factor\nMSTESMIRDVELAEEALPKKTGGPQGSRRCLFLSLFSFLIVAGATTLFCLLHFGVIGPQR\nEEFPRDLSLISPLAQAVRSSSRTPSDKPVAHVVANPQAEGQLQWLNRRANALLANGVELR\nDNQLVVPSEGLYLIYSQVLFKGQGCPSTHVLLTHTISRIAVSYQTKVNLLSAIKSPCQRE\nTPEGAEAKPWYEPIYLGGVFQLEKGDRLSAEINRPDYLDFAESGQVYFGIIAL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021837|Tumor necrosis factor (TNF)\nATGAGCACTGAAAGCATGATCCGGGACGTGGAGCTGGCCGAGGAGGCGCTCCCCAAGAAG\nACAGGGGGGCCCCAGGGCTCCAGGCGGTGCTTGTTCCTCAGCCTCTTCTCCTTCCTGATC\nGTGGCAGGCGCCACCACGCTCTTCTGCCTGCTGCACTTTGGAGTGATCGGCCCCCAGAGG\nGAAGAGTTCCCCAGGGACCTCTCTCTAATCAGCCCTCTGGCCCAGGCAGTCAGATCATCT\nTCTCGAACCCCGAGTGACAAGCCTGTAGCCCATGTTGTAGCAAACCCTCAAGCTGAGGGG\nCAGCTCCAGTGGCTGAACCGCCGGGCCAATGCCCTCCTGGCCAATGGCGTGGAGCTGAGA\nGATAACCAGCTGGTGGTGCCATCAGAGGGCCTGTACCTCATCTACTCCCAGGTCCTCTTC\nAAGGGCCAAGGCTGCCCCTCCACCCATGTGCTCCTCACCCACACCATCAGCCGCATCGCC\nGTCTCCTACCAGACCAAGGTCAACCTCCTCTCTGCCATCAAGAGCCCCTGCCAGAGGGAG\nACCCCAGAGGGGGCTGAGGCCAAGCCCTGGTATGAGCCCATCTATCTGGGAGGGGTCTTC\nCAGCTGGAGAAGGGTGACCGACTCAGCGCTGAGATCAATCGGCCCGACTATCTCGACTTT\nGCCGAGTCTGGGCAGGTCTACTTTGGGATCATTGCCCTGTGA","gene_format":"FASTA","parent_id":"BE0000704"} {"id":"P31994","source":"Swiss-Prot","name":"Low affinity immunoglobulin gamma Fc region receptor II-b","specific_function":"Receptor for the Fc region of complexed or aggregated immunoglobulins gamma. Low affinity receptor. 
Involved in a variety of effector and regulatory functions such as phagocytosis of immune complexes and modulation of antibody production by B-cells. Binding to this receptor results in down-modulation of previous state of cell activation triggered via antigen receptors on B-cells (BCR), T-cells (TCR) or via another Fc receptor. Isoform IIB1 fails to mediate endocytosis or phagocytosis. Isoform IIB2 does not trigger phagocytosis.","gene_name":"FCGR2B","locus":"1q23","cellular_location":"Cell membrane","transmembrane_regions":"218-240","signal_regions":"1-42","theoretical_pi":6.12,"molecular_weight":34043.355,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0004116|Low affinity immunoglobulin gamma Fc region receptor II-b\nMGILSFLPVLATESDWADCKSPQPWGHMLLWTAVLFLAPVAGTPAAPPKAVLKLEPQWIN\nVLQEDSVTLTCRGTHSPESDSIQWFHNGNLIPTHTQPSYRFKANNNDSGEYTCQTGQTSL\nSDPVHLTVLSEWLVLQTPHLEFQEGETIVLRCHSWKDKPLVKVTFFQNGKSKKFSRSDPN\nFSIPQANHSHSGDYHCTGNIGYTLYSSKPVTITVQAPSSSPMGIIVAVVTGIAVAAIVAA\nVVALIYCRKKRISALPGYPECREMGETLPEKPANPTNPDEADKVGAENTITYSLLMHPDA\nLEEPDDQNRI","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0019237|Low affinity immunoglobulin gamma Fc region receptor II-b 
(FCGR2B)\nATGGGAATCCTGTCATTCTTACCTGTCCTTGCCACTGAGAGTGACTGGGCTGACTGCAAG\nTCCCCCCAGCCTTGGGGTCATATGCTTCTGTGGACAGCTGTGCTATTCCTGGCTCCTGTT\nGCTGGGACACCTGCTCCCCCAAAGGCTGTGCTGAAACTCGAGCCCCAGTGGATCAACGTG\nCTCCAGGAGGACTCTGTGACTCTGACATGCCGGGGGACTCACAGCCCTGAGAGCGACTCC\nATTCAGTGGTTCCACAATGGGAATCTCATTCCCACCCACACGCAGCCCAGCTACAGGTTC\nAAGGCCAACAACAATGACAGCGGGGAGTACACGTGCCAGACTGGCCAGACCAGCCTCAGC\nGACCCTGTGCATCTGACTGTGCTTTCTGAGTGGCTGGTGCTCCAGACCCCTCACCTGGAG\nTTCCAGGAGGGAGAAACCATCGTGCTGAGGTGCCACAGCTGGAAGGACAAGCCTCTGGTC\nAAGGTCACATTCTTCCAGAATGGAAAATCCAAGAAATTTTCCCGTTCGGATCCCAACTTC\nTCCATCCCACAAGCAAACCACAGTCACAGTGGTGATTACCACTGCACAGGAAACATAGGC\nTACACGCTGTACTCATCCAAGCCTGTGACCATCACTGTCCAAGCTCCCAGCTCTTCACCG\nATGGGGATCATTGTGGCTGTGGTCACTGGGATTGCTGTAGCGGCCATTGTTGCTGCTGTA\nGTGGCCTTGATCTACTGCAGGAAAAAGCGGATTTCAGCCAATCCCACTAATCCTGATGAG\nGCTGACAAAGTTGGGGCTGAGAACACAATCACCTATTCACTTCTCATGCACCCGGATGCT\nCTGGAAGAGCCTGATGACCAGAACCGTATTTAG","gene_format":"FASTA","parent_id":"BE0002099"} {"id":"P31995","source":"Swiss-Prot","name":"Low affinity immunoglobulin gamma Fc region receptor II-c","general_function":"Transmembrane signaling receptor activity","specific_function":"Receptor for the Fc region of complexed immunoglobulins gamma. Low affinity receptor. 
Involved in a variety of effector and regulatory functions such as phagocytosis of immune complexes and modulation of antibody production by B-cells.","gene_name":"FCGR2C","locus":"1q23.3","cellular_location":"Cytoplasm","transmembrane_regions":"224-246","signal_regions":"1-42","theoretical_pi":6.9,"molecular_weight":35577.96,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0004118|Low affinity immunoglobulin gamma Fc region receptor II-c\nMGILSFLPVLATESDWADCKSPQPWGHMLLWTAVLFLAPVAGTPAAPPKAVLKLEPQWIN\nVLQEDSVTLTCRGTHSPESDSIQWFHNGNLIPTHTQPSYRFKANNNDSGEYTCQTGQTSL\nSDPVHLTVLSEWLVLQTPHLEFQEGETIVLRCHSWKDKPLVKVTFFQNGKSKKFSRSDPN\nFSIPQANHSHSGDYHCTGNIGYTLYSSKPVTITVQAPSSSPMGIIVAVVTGIAVAAIVAA\nVVALIYCRKKRISANSTDPVKAAQFEPPGRQMIAIRKRQPEETNNDYETADGGYMTLNPR\nAPTDDDKNIYLTLPPNDHVNSNN","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0011523|Low affinity immunoglobulin gamma Fc region receptor II-c (FCGR2C)\nATGGGAATCCTGTCATTCTTACCTGTCCTTGCCACTGAGAGTGACTGGGCTGACTGCAAG\nTCCCCCCAGCCTTGGGGTCATATGCTTCTGTGGACAGCTGTGCTATTCCTGGCTCCTGTT\nGCTGGGACACCTGCAGCTCCCCCAAAGGCTGTGCTGAAACTCGAGCCCCAGTGGATCAAC\nGTGCTCCAAGAGGACTCTGTGACTCTGACATGCCGGGGGACTCACAGCCCTGAGAGCGAC\nTCCATTCCGTGGTTCCACAATGGGAATCTCATTCCCACCCACACGCAGCCCAGCTACAGG\nTTCAAGGCCAACAACAATGACAGCGGGGAGTACACGTGCCAGACTGGCCAGACCAGCCTC\nAGCGACCCTGTGCATCTGACTGTGCTTTCTGAGTGGCTGGTGCTCCAGACCCCTCACCTG\nGAGTTCCAGGAGGGAGAAACCATCGTGCTGAGGTGCCACAGCTGGAAGGACAAGCCTCTG\nGTCAAGGTCACATTCTTCCAGAATGGAAAATCCAAGAAATTTTCCCGTTCGGATCCCAAC\nTTCTCCATCCCACAAGCAAACCACAGTCACAGTGGTGATTACCACTGCACAGGAAACATA\nGGCTACACGCTGTACTCATCCAAGCCTGTGACCATCACTGTCCAAGCTCCCAGCTCTTCA\nCCGATGGGGATCATTGTGGCTGTGGTCACTGGGATTGCTGTAGCGGCCATTGTTGCTGCT\nGTAGTGGCCTTGATCTACTGCAGGAAAAAGCGGATTTCAGCCAATTCCACTGATCCTGTG\nAAGGCTGCCCAATTTGAGCCACCTGGACGTCAAATGATTGCCATCAGAAAGAGACAACCT\nGAAGAAACCAACAATGACTATGAAACAGCTGACGGCGGCTACATGACTCTGAACCCCAGG\nGCACCTACTGACGATGATAAAAACATCTACCTGACTCTTCCTCCCAACGACCATGTCAAC\nAGTAATAACTAA","gene_format":"FASTA","parent_id":"BE0002100"} 
{"id":"P17181","source":"Swiss-Prot","name":"Interferon alpha/beta receptor 1","general_function":"Type i interferon receptor activity","specific_function":"Associates with IFNAR2 to form the type I interferon receptor. Receptor for interferons alpha and beta. Binding to type I IFNs triggers tyrosine phosphorylation of a number of proteins including JAKs, TYK2, STAT proteins and IFNR alpha- and beta-subunits themselves. Can also transduce IFNB signals without the help of IFNAR2, and not activating the Jak-STAT pathway.","gene_name":"IFNAR1","cellular_location":"Membrane","transmembrane_regions":"437-457","signal_regions":"1-27","theoretical_pi":5.56,"molecular_weight":63524.81,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001318|Interferon alpha/beta receptor 1\nMMVVLLGATTLVLVAVAPWVLSAAAGGKNLKSPQKVEVDIIDDNFILRWNRSDESVGNVT\nFSFDYQKTGMDNWIKLSGCQNITSTKCNFSSLKLNVYEEIKLRIRAEKENTSSWYEVDSF\nTPFRKAQIGPPEVHLEAEDKAIVIHISPGTKDSVMWALDGLSFTYSLVIWKNSSGVEERI\nENIYSRHKIYKLSPETTYCLKVKAALLTSWKIGVYSPVHCIKTTVENELPPPENIEVSVQ\nNQNYVLKWDYTYANMTFQVQWLHAFLKRNPGNHLYKWKQIPDCENVKTTQCVFPQNVFQK\nGIYLLRVQASDGNNTSFWSEEIKFDTEIQAFLLPPVFNIRSLSDSFHIYIGAPKQSGNTP\nVIQDYPLIYEIIFWENTSNAERKIIEKKTDVTVPNLKPLTVYCVKARAHTMDEKLNKSSV\nFSDAVCEKTKPGNTSKIWLIVGICIALFALPFVIYAAKVFLRCINYVFFPSLKPSSSIDE\nYFSEQPLKNLLLSTSEEQIEKCFIIENISTIATVEETNQTDEDHKKYSSQTSQDSGNYSN\nEDESESKTSEELQQDFV","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0018987|Interferon alpha/beta receptor 1 
(IFNAR1)\nATGATGGTCGTCCTCCTGGGCGCGACGACCCTAGTGCTCGTCGCCGTGGCGCCATGGGTG\nTTGTCCGCAGCCGCAGGTGGAAAAAATCTAAAATCTCCTCAAAAAGTAGAGGTCGACATC\nATAGATGACAACTTTATCCTGAGGTGGAACAGGAGCGATGAGTCTGTCGGGAATGTGACT\nTTTTCATTCGATTATCAAAAAACTGGGATGGATAATTGGATAAAATTGTCTGGGTGTCAG\nAATATTACTAGTACCAAATGCAACTTTTCTTCACTCAAGCTGAATGTTTATGAAGAAATT\nAAATTGCGTATAAGAGCAGAAAAAGAAAACACTTCTTCATGGTATGAGGTTGACTCATTT\nACACCATTTCGCAAAGCTCAGATTGGTCCTCCAGAAGTACATTTAGAAGCTGAAGATAAG\nGCAATAGTGATACACATCTCTCCTGGAACAAAAGATAGTGTTATGTGGGCTTTGGATGGT\nTTAAGCTTTACATATAGCTTAGTTATCTGGAAAAACTCTTCAGGTGTAGAAGAAAGGATT\nGAAAATATTTATTCCAGACATAAAATTTATAAACTCTCACCAGAGACTACTTATTGTCTA\nAAAGTTAAAGCAGCACTACTTACGTCATGGAAAATTGGTGTCTATAGTCCAGTACATTGT\nATAAAGACCACAGTTGAAAATGAACTACCTCCACCAGAAAATATAGAAGTCAGTGTCCAA\nAATCAGAACTATGTTCTTAAATGGGATTATACATATGCAAACATGACCTTTCAAGTTCAG\nTGGCTCCACGCCTTTTTAAAAAGGAATCCTGGAAACCATTTGTATAAATGGAAACAAATA\nCCTGACTGTGAAAATGTCAAAACTACCCAGTGTGTCTTTCCTCAAAACGTTTTCCAAAAA\nGGAATTTACCTTCTCCGCGTACAAGCATCTGATGGAAATAACACATCTTTTTGGTCTGAA\nGAGATAAAGTTTGATACTGAAATACAAGCTTTCCTACTTCCTCCAGTCTTTAACATTAGA\nTCCCTTAGTGATTCATTCCATATCTATATCGGTGCTCCAAAACAGTCTGGAAACACGCCT\nGTGATCCAGGATTATCCACTGATTTATGAAATTATTTTTTGGGAAAACACTTCAAATGCT\nGAGAGAAAAATTATCGAGAAAAAAACTGATGTTACAGTTCCTAATTTGAAACCACTGACT\nGTATATTGTGTGAAAGCCAGAGCACACACCATGGATGAAAAGCTGAATAAAAGCAGTGTT\nTTTAGTGACGCTGTATGTGAGAAAACAAAACCAGGAAATACCTCTAAAATTTGGCTTATA\nGTTGGAATTTGTATTGCATTATTTGCTCTCCCGTTTGTCATTTATGCTGCGAAAGTCTTC\nTTGAGATGCATCAATTATGTCTTCTTTCCATCACTTAAACCTTCTTCCAGTATAGATGAG\nTATTTCTCTGAACAGCCATTGAAGAATCTTCTGCTTTCAACTTCTGAGGAACAAATCGAA\nAAATGTTTCATAATTGAAAATATAAGCACAATTGCTACAGTAGAAGAAACTAATCAAACT\nGATGAAGATCATAAAAAATACAGTTCCCAAACTAGCCAAGATTCAGGAAATTATTCTAAT\nGAAGATGAAAGCGAAAGTAAAACAAGTGAAGAACTACAGCAGGACTTTGTATGA","gene_format":"FASTA","parent_id":"BE0000661"} {"id":"P07204","source":"Swiss-Prot","name":"Thrombomodulin","general_function":"Transmembrane signaling receptor activity","specific_function":"Thrombomodulin is a specific endothelial cell receptor that forms a 
1:1 stoichiometric complex with thrombin. This complex is responsible for the conversion of protein C to the activated protein C (protein Ca). Once evolved, protein Ca scissions the activated cofactors of the coagulation mechanism, factor Va and factor VIIIa, and thereby reduces the amount of thrombin generated.","gene_name":"THBD","locus":"20p12-cen","cellular_location":"Membrane","transmembrane_regions":"516-539","signal_regions":"1-18","theoretical_pi":4.54,"molecular_weight":60328.72,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0016328|Thrombomodulin\nMLGVLVLGALALAGLGFPAPAEPQPGGSQCVEHDCFALYPGPATFLNASQICDGLRGHLM\nTVRSSVAADVISLLLNGDGGVGRRRLWIGLQLPPGCGDPKRLGPLRGFQWVTGDNNTSYS\nRWARLDLNGAPLCGPLCVAVSAAEATVPSEPIWEEQQCEVKADGFLCEFHFPATCRPLAV\nEPGAAAAAVSITYGTPFAARGADFQALPVGSSAAVAPLGLQLMCTAPPGAVQGHWAREAP\nGAWDCSVENGGCEHACNAIPGAPRCQCPAGAALQADGRSCTASATQSCNDLCEHFCVPNP\nDQPGSYSCMCETGYRLAADQHRCEDVDDCILEPSPCPQRCVNTQGGFECHCYPNYDLVDG\nECVEPVDPCFRANCEYQCQPLNQTSYLCVCAEGFAPIPHEPHRCQMFCNQTACPADCDPN\nTQASCECPEGYILDDGFICTDIDECENGGFCSGVCHNLPGTFECICGPDSALARHIGTDC\nDSGKVDGGDSGSGEPPPSPTPGSTLTPPAVGLVHSGLLIGISIASLCLVVALLALLCHLR\nKKQGAARAKMEYKCAAPSKEVVLQHVRTERTPQRL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0016329|Thrombomodulin 
(THBD)\nATGCTTGGGGTCCTGGTCCTTGGCGCGCTGGCCCTGGCCGGCCTGGGGTTCCCCGCACCC\nGCAGAGCCGCAGCCGGGTGGCAGCCAGTGCGTCGAGCACGACTGCTTCGCGCTCTACCCG\nGGCCCCGCGACCTTCCTCAATGCCAGTCAGATCTGCGACGGACTGCGGGGCCACCTAATG\nACAGTGCGCTCCTCGGTGGCTGCCGATGTCATTTCCTTGCTACTGAACGGCGACGGCGGC\nGTTGGCCGCCGGCGCCTCTGGATCGGCCTGCAGCTGCCACCCGGCTGCGGCGACCCCAAG\nCGCCTCGGGCCCCTGCGCGGCTTCCAGTGGGTTACGGGAGACAACAACACCAGCTATAGC\nAGGTGGGCACGGCTCGACCTCAATGGGGCTCCCCTCTGCGGCCCGTTGTGCGTCGCTGTC\nTCCGCTGCTGAGGCCACTGTGCCCAGCGAGCCGATCTGGGAGGAGCAGCAGTGCGAAGTG\nAAGGCCGATGGCTTCCTCTGCGAGTTCCACTTCCCAGCCACCTGCAGGCCACTGGCTGTG\nGAGCCCGGCGCCGCGGCTGCCGCCGTCTCGATCACCTACGGCACCCCGTTCGCGGCCCGC\nGGAGCGGACTTCCAGGCGCTGCCGGTGGGCAGCTCCGCCGCGGTGGCTCCCCTCGGCTTA\nCAGCTAATGTGCACCGCGCCGCCCGGAGCGGTCCAGGGGCACTGGGCCAGGGAGGCGCCG\nGGCGCTTGGGACTGCAGCGTGGAGAACGGCGGCTGCGAGCACGCGTGCAATGCGATCCCT\nGGGGCTCCCCGCTGCCAGTGCCCAGCCGGCGCCGCCCTGCAGGCAGACGGGCGCTCCTGC\nACCGCATCCGCGACGCAGTCCTGCAACGACCTCTGCGAGCACTTCTGCGTTCCCAACCCC\nGACCAGCCGGGCTCCTACTCGTGCATGTGCGAGACCGGCTACCGGCTGGCGGCCGACCAA\nCACCGGTGCGAGGACGTGGATGACTGCATACTGGAGCCCAGTCCGTGTCCGCAGCGCTGT\nGTCAACACACAGGGTGGCTTCGAGTGCCACTGCTACCCTAACTACGACCTGGTGGACGGC\nGAGTGTGTGGAGCCCGTGGACCCGTGCTTCAGAGCCAACTGCGAGTACCAGTGCCAGCCC\nCTGAACCAAACTAGCTACCTCTGCGTCTGCGCCGAGGGCTTCGCGCCCATTCCCCACGAG\nCCGCACAGGTGCCAGATGTTTTGCAACCAGACTGCCTGTCCAGCCGACTGCGACCCCAAC\nACCCAGGCTAGCTGTGAGTGCCCTGAAGGCTACATCCTGGACGACGGTTTCATCTGCACG\nGACATCGACGAGTGCGAAAACGGCGGCTTCTGCTCCGGGGTGTGCCACAACCTCCCCGGT\nACCTTCGAGTGCATCTGCGGGCCCGACTCGGCCCTTGCCCGCCACATTGGCACCGACTGT\nGACTCCGGCAAGGTGGACGGTGGCGACAGCGGCTCTGGCGAGCCCCCGCCCAGCCCGACG\nCCCGGCTCCACCTTGACTCCTCCGGCCGTGGGGCTCGTGCATTCGGGCTTGCTCATAGGC\nATCTCCATCGCGAGCCTGTGCCTGGTGGTGGCGCTTTTGGCGCTCCTCTGCCACCTGCGC\nAAGAAGCAGGGCGCCGCCAGGGCCAAGATGGAGTACAAGTGCGCGGCCCCTTCCAAGGAG\nGTAGTGCTGCAGCACGTGCGGACCGAGCGGACGCCGCAGAGACTCTGA","gene_format":"FASTA","parent_id":"BE0001151"} {"id":"P15692","source":"Swiss-Prot","name":"Vascular endothelial growth factor A","general_function":"Vascular endothelial growth factor receptor 
binding","specific_function":"Growth factor active in angiogenesis, vasculogenesis and endothelial cell growth. Induces endothelial cell proliferation, promotes cell migration, inhibits apoptosis and induces permeabilization of blood vessels. Binds to the FLT1/VEGFR1 and KDR/VEGFR2 receptors, heparan sulfate and heparin. NRP1/Neuropilin-1 binds isoforms VEGF-165 and VEGF-145. Isoform VEGF165B binds to KDR but does not activate downstream signaling pathways, does not activate angiogenesis and inhibits tumor growth.","gene_name":"VEGFA","locus":"6p12","cellular_location":"Secreted","signal_regions":"1-26","theoretical_pi":9.08,"molecular_weight":27042.205,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0000325|Vascular endothelial growth factor A\nMNFLLSWVHWSLALLLYLHHAKWSQAAPMAEGGGQNHHEVVKFMDVYQRSYCHPIETLVD\nIFQEYPDEIEYIFKPSCVPLMRCGGCCNDEGLECVPTEESNITMQIMRIKPHQGQHIGEM\nSFLQHNKCECRPKKDRARQEKKSVRGKGKGQKRKRKKSRYKSWSVYVGARCCLMPWSLPG\nPHPCGPCSERRKHLFVQDPQTCKCSCKNTDSRCKARQLELNERTCRCDKPRR","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021644|Vascular endothelial growth factor A 
(VEGFA)\nCTGACGGACAGACAGACAGACACCGCCCCCAGCCCCAGCTACCACCTCCTCCCCGGCCGG\nCGGCGGACAGTGGACGCGGCGGCGAGCCGCGGGCAGGGGCCGGAGCCCGCGCCCGGAGGC\nGGGGTGGAGGGGGTCGGGGCTCGCGGCGTCGCACTGAAACTTTTCGTCCAACTTCTGGGC\nTGTTCTCGCTTCGGAGGAGCCGTGGTCCGCGCGGGGGAAGCCGAGCCGAGCGGAGCCGCG\nAGAAGTGCTAGCTCGGGCCGGGAGGAGCCGCAGCCGGAGGAGGGGGAGGAGGAAGAAGAG\nAAGGAAGAGGAGAGGGGGCCGCAGTGGCGACTCGGCGCTCGGAAGCCGGGCTCATGGACG\nGGTGAGGCGGCGGTGTGCGCAGACAGTGCTCCAGCCGCGCGCGCTCCCCAGGCCCTGGCC\nCGGGCCTCGGGCCGGGGAGGAAGAGTAGCTCGCCGAGGCGCCGAGGAGAGCGGGCCGCCC\nCACAGCCCGAGCCGGAGAGGGAGCGCGAGCCGCGCCGGCCCCGGTCGGGCCTCCGAAACC\nATGAACTTTCTGCTGTCTTGGGTGCATTGGAGCCTTGCCTTGCTGCTCTACCTCCACCAT\nGCCAAGTGGTCCCAGGCTGCACCCATGGCAGAAGGAGGAGGGCAGAATCATCACGAAGTG\nGTGAAGTTCATGGATGTCTATCAGCGCAGCTACTGCCATCCAATCGAGACCCTGGTGGAC\nATCTTCCAGGAGTACCCTGATGAGATCGAGTACATCTTCAAGCCATCCTGTGTGCCCCTG\nATGCGATGCGGGGGCTGCTGCAATGACGAGGGCCTGGAGTGTGTGCCCACTGAGGAGTCC\nAACATCACCATGCAGATTATGCGGATCAAACCTCACCAAGGCCAGCACATAGGAGAGATG\nAGCTTCCTACAGCACAACAAATGTGAATGCAGACCAAAGAAAGATAGAGCAAGACAAGAA\nAAAAAATCAGTTCGAGGAAAGGGAAAGGGGCAAAAACGAAAGCGCAAGAAATCCCGGTAT\nAAGTCCTGGAGCGTGTACGTTGGTGCCCGCTGCTGTCTAATGCCCTGGAGCCTCCCTGGC\nCCCCATCCCTGTGGGCCTTGCTCAGAGCGGAGAAAGCATTTGTTTGTACAAGATCCGCAG\nACGTGTAAATGTTCCTGCAAAAACACAGACTCGCGTTGCAAGGCGAGGCAGCTTGAGTTA\nAACGAACGTACTTGCAGATGTGACAAGCCGAGGCGGTGA","gene_format":"FASTA","parent_id":"BE0000163"} {"id":"P35228","source":"Swiss-Prot","name":"Nitric oxide synthase, inducible","general_function":"Tetrahydrobiopterin binding","specific_function":"Produces nitric oxide (NO) which is a messenger molecule with diverse functions throughout the body. In macrophages, NO mediates tumoricidal and bactericidal actions. Also has nitrosylase activity and mediates cysteine S-nitrosylation of cytoplasmic target proteins such COX2. 
As component of the iNOS-S100A8/9 transnitrosylase complex involved in the selective inflammatory stimulus-dependent S-nitrosylation of GAPDH on 'Cys-247' implicated in regulation of the GAIT complex activity and probably multiple targets including ANXA5, EZR, MSN and VIM.","gene_name":"NOS2","locus":"17q11.2-q12","theoretical_pi":8.01,"molecular_weight":131116.3,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0000010|Nitric oxide synthase, inducible\nMACPWKFLFKTKFHQYAMNGEKDINNNVEKAPCATSSPVTQDDLQYHNLSKQQNESPQPL\nVETGKKSPESLVKLDATPLSSPRHVRIKNWGSGMTFQDTLHHKAKGILTCRSKSCLGSIM\nTPKSLTRGPRDKPTPPDELLPQAIEFVNQYYGSFKEAKIEEHLARVEAVTKEIETTGTYQ\nLTGDELIFATKQAWRNAPRCIGRIQWSNLQVFDARSCSTAREMFEHICRHVRYSTNNGNI\nRSAITVFPQRSDGKHDFRVWNAQLIRYAGYQMPDGSIRGDPANVEFTQLCIDLGWKPKYG\nRFDVVPLVLQANGRDPELFEIPPDLVLEVAMEHPKYEWFRELELKWYALPAVANMLLEVG\nGLEFPGCPFNGWYMGTEIGVRDFCDVQRYNILEEVGRRMGLETHKLASLWKDQAVVEINI\nAVLHSFQKQNVTIMDHHSAAESFMKYMQNEYRSRGGCPADWIWLVPPMSGSITPVFHQEM\nLNYVLSPFYYYQVEAWKTHVWQDEKRRPKRREIPLKVLVKAVLFACMLMRKTMASRVRVT\nILFATETGKSEALAWDLGALFSCAFNPKVVCMDKYRLSCLEEERLLLVVTSTFGNGDCPG\nNGEKLKKSLFMLKELNNKFRYAVFGLGSSMYPRFCAFAHDIDQKLSHLGASQLTPMGEGD\nELSGQEDAFRSWAVQTFKAACETFDVRGKQHIQIPKLYTSNVTWDPHHYRLVQDSQPLDL\nSKALSSMHAKNVFTMRLKSRQNLQSPTSSRATILVELSCEDGQGLNYLPGEHLGVCPGNQ\nPALVQGILERVVDGPTPHQTVRLEALDESGSYWVSDKRLPPCSLSQALTYFLDITTPPTQ\nLLLQKLAQVATEEPERQRLEALCQPSEYSKWKFTNSPTFLEVLEEFPSLRVSAGFLLSQL\nPILKPRFYSISSSRDHTPTEIHLTVAVVTYHTRDGQGPLHHGVCSTWLNSLKPQDPVPCF\nVRNASGFHLPEDPSHPCILIGPGTGIAPFRSFWQQRLHDSQHKGVRGGRMTLVFGCRRPD\nEDHIYQEEMLEMAQKGVLHAVHTAYSRLPGKPKVYVQDILRQQLASEVLRVLHKEPGHLY\nVCGDVRMARDVAHTLKQLVAAKLKLNEEQVEDYFFQLKSQKRYHEDIFGAVFPYEAKKDR\nVAVQPSSLEMSAL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0020444|Nitric oxide synthase, inducible 
(NOS2)\nATGGCCTGTCCTTGGAAATTTCTGTTCAAGACCAAATTCCACCAGTATGCAATGAATGGG\nGAAAAAGACATCAACAACAATGTGGAGAAAGCCCCCTGTGCCACCTCCAGTCCAGTGACA\nCAGGATGACCTTCAGTATCACAACCTCAGCAAGCAGCAGAATGAGTCCCCGCAGCCCCTC\nGTGGAGACGGGAAAGAAGTCTCCAGAATCTCTGGTCAAGCTGGATGCAACCCCATTGTCC\nTCCCCACGGCATGTGAGGATCAAAAACTGGGGCAGCGGGATGACTTTCCAAGACACACTT\nCACCATAAGGCCAAAGGGATTTTAACTTGCAGGTCCAAATCTTGCCTGGGGTCCATTATG\nACTCCCAAAAGTTTGACCAGAGGACCCAGGGACAAGCCTACCCCTCCAGATGAGCTTCTA\nCCTCAAGCTATCGAATTTGTCAACCAATATTACGGCTCCTTCAAAGAGGCAAAAATAGAG\nGAACATCTGGCCAGGGTGGAAGCGGTAACAAAGGAGATAGAAACAACAGGAACCTACCAA\nCTGACGGGAGATGAGCTCATCTTCGCCACCAAGCAGGCCTGGCGCAATGCCCCACGCTGC\nATTGGGAGGATCCAGTGGTCCAACCTGCAGGTCTTCGATGCCCGCAGCTGTTCCACTGCC\nCGGGAAATGTTTGAACACATCTGCAGACACGTGCGTTACTCCACCAACAATGGCAACATC\nAGGTCGGCCATCACCGTGTTCCCCCAGCGGAGTGATGGCAAGCACGACTTCCGGGTGTGG\nAATGCTCAGCTCATCCGCTATGCTGGCTACCAGATGCCAGATGGCAGCATCAGAGGGGAC\nCCTGCCAACGTGGAATTCACTCAGCTGTGCATCGACCTGGGCTGGAAGCCCAAGTACGGC\nCGCTTCGATGTGGTCCCCCTGGTCCTGCAGGCCAATGGCCGTGACCCTGAGCTCTTCGAA\nATCCCACCTGACCTTGTGCTTGAGGTGGCCATGGAACATCCCAAATACGAGTGGTTTCGG\nGAACTGGAGCTAAAGTGGTACGCCCTGCCTGCAGTGGCCAACATGCTGCTTGAGGTGGGC\nGGCCTGGAGTTCCCAGGGTGCCCCTTCAATGGCTGGTACATGGGCACAGAGATCGGAGTC\nCGGGACTTCTGTGACGTCCAGCGCTACAACATCCTGGAGGAAGTGGGCAGGAGAATGGGC\nCTGGAAACGCACAAGCTGGCCTCGCTCTGGAAAGACCAGGCTGTCGTTGAGATCAACATT\nGCTGTGCTCCATAGTTTCCAGAAGCAGAATGTGACCATCATGGACCACCACTCGGCTGCA\nGAATCCTTCATGAAGTACATGCAGAATGAATACCGGTCCCGTGGGGGCTGCCCGGCAGAC\nTGGATTTGGCTGGTCCCTCCCATGTCTGGGAGCATCACCCCCGTGTTTCACCAGGAGATG\nCTGAACTACGTCCTGTCCCCTTTCTACTACTATCAGGTAGAGGCCTGGAAAACCCATGTC\nTGGCAGGACGAGAAGCGGAGACCCAAGAGAAGAGAGATTCCATTGAAAGTCTTGGTCAAA\nGCTGTGCTCTTTGCCTGTATGCTGATGCGCAAGACAATGGCGTCCCGAGTCAGAGTCACC\nATCCTCTTTGCGACAGAGACAGGAAAATCAGAGGCGCTGGCCTGGGACCTGGGGGCCTTA\nTTCAGCTGTGCCTTCAACCCCAAGGTTGTCTGCATGGATAAGTACAGGCTGAGCTGCCTG\nGAGGAGGAACGGCTGCTGTTGGTGGTGACCAGTACGTTTGGCAATGGAGACTGCCCTGGC\nAATGGAGAGAAACTGAAGAAATCGCTCTTCATGCTGAAAGAGCTCAACAACAAATTCAGG\nTACGCTGTGTTTGGCCTCGGCTCCAGCATGTACCCTCGGTTCTGCGCCTTTGCTCATGAC\nATTGATCA
GAAGCTGTCCCACCTGGGGGCCTCTCAGCTCACCCCGATGGGAGAAGGGGAT\nGAGCTCAGTGGGCAGGAGGACGCCTTCCGCAGCTGGGCCGTGCAAACCTTCAAGGCAGCC\nTGTGAGACGTTTGATGTCCGAGGCAAACAGCACATTCAGATCCCCAAGCTCTACACCTCC\nAATGTGACCTGGGACCCGCACCACTACAGGCTCGTGCAGGACTCACAGCCTTTGGACCTC\nAGCAAAGCCCTCAGCAGCATGCATGCCAAGAACGTGTTCACCATGAGGCTCAAATCTCGG\nCAGAATCTACAAAGTCCGACATCCAGCCGTGCCACCATCCTGGTGGAACTCTCCTGTGAG\nGATGGCCAAGGCCTGAACTACCTGCCGGGGGAGCACCTTGGGGTTTGCCCAGGCAACCAG\nCCGGCCCTGGTCCAAGGTATCCTGGAGCGAGTGGTGGATGGCCCCACACCCCACCAGACA\nGTGCGCCTGGAGGCCCTGGATGAGAGTGGCAGCTACTGGGTCAGTGACAAGAGGCTGCCC\nCCCTGCTCACTCAGCCAGGCCCTCACCTACTTCCTGGACATCACCACACCCCCAACCCAG\nCTGCTGCTCCAAAAGCTGGCCCAGGTGGCCACAGAAGAGCCTGAGAGACAGAGGCTGGAG\nGCCCTGTGCCAGCCCTCAGAGTACAGCAAGTGGAAGTTCACCAACAGCCCCACATTCCTG\nGAGGTGCTAGAGGAGTTCCCGTCCCTGCGGGTGTCTGCTGGCTTCCTGCTTTCCCAGCTC\nCCCATTCTGAAGCCCAGGTTCTACTCCATCAGCTCCTCCCGGGATCACACGCCCACAGAG\nATCCACCTGACTGTGGCCGTGGTCACCTACCACACCCGAGATGGCCAGGGTCCCCTGCAC\nCACGGCGTCTGCAGCACATGGCTCAACAGCCTGAAGCCCCAAGACCCAGTGCCCTGCTTT\nGTGCGGAATGCCAGCGGCTTCCACCTCCCCGAGGATCCCTCCCATCCTTGCATCCTCATC\nGGGCCTGGCACAGGCATCGCGCCCTTCCGCAGTTTCTGGCAGCAACGGCTCCATGACTCC\nCAGCACAAGGGAGTGCGGGGAGGCCGCATGACCTTGGTGTTTGGGTGCCGCCGCCCAGAT\nGAGGACCACATCTACCAGGAGGAGATGCTGGAGATGGCCCAGAAGGGGGTGCTGCATGCG\nGTGCACACAGCCTATTCCCGCCTGCCTGGCAAGCCCAAGGTCTATGTTCAGGACATCCTG\nCGGCAGCAGCTGGCCAGCGAGGTGCTCCGTGTGCTCCACAAGGAGCCAGGCCACCTCTAT\nGTTTGCGGGGATGTGCGCATGGCCCGGGACGTGGCCCACACCCTGAAGCAGCTGGTGGCT\nGCCAAGCTGAAATTGAATGAGGAGCAGGTCGAGGACTATTTCTTTCAGCTCAAGAGCCAG\nAAGCGCTATCACGAAGATATCTTTGGTGCTGTATTTCCTTACGAGGCGAAGAAGGACAGG\nGTGGCGGTGCAGCCCAGCAGCCTGGAGATGTCAGCGCTCTGA","gene_format":"FASTA","parent_id":"BE0000005"} {"id":"Q07869","source":"Swiss-Prot","name":"Peroxisome proliferator-activated receptor alpha","general_function":"Zinc ion binding","specific_function":"Ligand-activated transcription factor. Key regulator of lipid metabolism. Activated by the endogenous ligand 1-palmitoyl-2-oleoyl-sn-glycerol-3-phosphocholine (16:0/18:1-GPC). 
Activated by oleylethanolamide, a naturally occurring lipid that regulates satiety. Receptor for peroxisome proliferators such as hypolipidemic drugs and fatty acids. Regulates the peroxisomal beta-oxidation pathway of fatty acids. Functions as transcription activator for the ACOX1 and P450 genes. Transactivation activity requires heterodimerization with RXRA and is antagonized by NR2C2. May be required for the propagation of clock information to metabolic pathways regulated by PER2.","gene_name":"PPARA","locus":"22q12-q13.1|22q13.31","cellular_location":"Nucleus","theoretical_pi":6.2,"molecular_weight":52224.595,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0000142|Peroxisome proliferator-activated receptor alpha\nMVDTESPLCPLSPLEAGDLESPLSEEFLQEMGNIQEISQSIGEDSSGSFGFTEYQYLGSC\nPGSDGSVITDTLSPASSPSSVTYPVVPGSVDESPSGALNIECRICGDKASGYHYGVHACE\nGCKGFFRRTIRLKLVYDKCDRSCKIQKKNRNKCQYCRFHKCLSVGMSHNAIRFGRMPRSE\nKAKLKAEILTCEHDIEDSETADLKSLAKRIYEAYLKNFNMNKVKARVILSGKASNNPPFV\nIHDMETLCMAEKTLVAKLVANGIQNKEAEVRIFHCCQCTSVETVTELTEFAKAIPGFANL\nDLNDQVTLLKYGVYEAIFAMLSSVMNKDGMLVAYGNGFITREFLKSLRKPFCDIMEPKFD\nFAMKFNALELDDSDISLFVAAIICCGDRPGLLNVGHIEKMQEGIVHVLRLHLQSNHPDDI\nFLFPKLLQKMADLRQLVTEHAQLVQIIKKTESDAALHPLLQEIYRDMY","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021925|Peroxisome proliferator-activated receptor alpha 
(PPARA)\nATGGTGGACACGGAAAGCCCACTCTGCCCCCTCTCCCCACTCGAGGCCGGCGATCTAGAG\nAGCCCGTTATCTGAAGAGTTCCTGCAAGAAATGGGAAACATCCAAGAGATTTCGCAATCC\nATCGGCGAGGATAGTTCTGGAAGCTTTGGCTTTACGGAATACCAGTATTTAGGAAGCTGT\nCCTGGCTCAGATGGCTCGGTCATCACGGACACGCTTTCACCAGCTTCGAGCCCCTCCTCG\nGTGACTTATCCTGTGGTCCCCGGCAGCGTGGACGAGTCTCCCAGTGGAGCATTGAACATC\nGAATGTAGAATCTGCGGGGACAAGGCCTCAGGCTATCATTACGGAGTCCACGCGTGTGAA\nGGCTGCAAGGGCTTCTTTCGGCGAACGATTCGACTCAAGCTGGTGTATGACAAGTGCGAC\nCGCAGCTGCAAGATCCAGAAAAAGAACAGAAACAAATGCCAGTATTGTCGATTTCACAAG\nTGCCTTTCTGTCGGGATGTCACACAACGCGATTCGTTTTGGACGAATGCCAAGATCTGAG\nAAAGCAAAACTGAAAGCAGAAATTCTTACCTGTGAACATGACATAGAAGATTCTGAAACT\nGCAGATCTCAAATCTCTGGCCAAGAGAATCTACGAGGCCTACTTGAAGAACTTCAACATG\nAACAAGGTCAAAGCCCGGGTCATCCTCTCAGGAAAGGCCAGTAACAATCCACCTTTTGTC\nATACATGATATGGAGACACTGTGTATGGCTGAGAAGACGCTGGTGGCCAAGCTGGTGGCC\nAATGGCATCCAGAACAAGGAGGCGGAGGTCCGCATCTTTCACTGCTGCCAGTGCACGTCA\nGTGGAGACCGTCACGGAGCTCACGGAATTCGCCAAGGCCATCCCAGGCTTCGCAAACTTG\nGACCTGAACGATCAAGTGACATTGCTAAAATACGGAGTTTATGAGGCCATATTCGCCATG\nCTGTCTTCTGTGATGAACAAAGACGGGATGCTGGTAGCGTATGGAAATGGGTTTATAACT\nCGTGAATTCCTAAAAAGCCTAAGGAAACCGTTCTGTGATATCATGGAACCCAAGTTTGAT\nTTTGCCATGAAGTTCAATGCACTGGAACTGGATGACAGTGATATCTCCCTTTTTGTGGCT\nGCTATCATTTGCTGTGGAGATCGTCCTGGCCTTCTAAACGTAGGACACATTGAAAAAATG\nCAGGAGGGTATTGTACATGTGCTCAGACTCCACCTGCAGAGCAACCACCCGGACGATATC\nTTTCTCTTCCCAAAACTTCTTCAAAAAATGGCAGACCTCCGGCAGCTGGTGACGGAGCAT\nGCGCAGCTGGTGCAGATCATCAAGAAGACGGAGTCGGATGCTGCGCTGCACCCGCTACTG\nCAGGAGATCTACAGGGACATGTACTGA","gene_format":"FASTA","parent_id":"BE0000071"} {"id":"P37231","source":"Swiss-Prot","name":"Peroxisome proliferator-activated receptor gamma","general_function":"Zinc ion binding","specific_function":"Nuclear receptor that binds peroxisome proliferators such as hypolipidemic drugs and fatty acids. Once activated by a ligand, the nuclear receptor binds to DNA specific PPAR response elements (PPRE) and modulates the transcription of its target genes, such as acyl-CoA oxidase. 
It therefore controls the peroxisomal beta-oxidation pathway of fatty acids. Key regulator of adipocyte differentiation and glucose homeostasis. ARF6 acts as a key regulator of the tissue-specific adipocyte P2 (aP2) enhancer. Acts as a critical regulator of gut homeostasis by suppressing NF-kappa-B-mediated proinflammatory responses. Plays a role in the regulation of cardiovascular circadian rhythms by regulating the transcription of ARNTL/BMAL1 in the blood vessels (By similarity).","gene_name":"PPARG","locus":"3p25","cellular_location":"Nucleus","theoretical_pi":5.77,"molecular_weight":57619.58,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0000429|Peroxisome proliferator-activated receptor gamma\nMGETLGDSPIDPESDSFTDTLSANISQEMTMVDTEMPFWPTNFGISSVDLSVMEDHSHSF\nDIKPFTTVDFSSISTPHYEDIPFTRTDPVVADYKYDLKLQEYQSAIKVEPASPPYYSEKT\nQLYNKPHEEPSNSLMAIECRVCGDKASGFHYGVHACEGCKGFFRRTIRLKLIYDRCDLNC\nRIHKKSRNKCQYCRFQKCLAVGMSHNAIRFGRMPQAEKEKLLAEISSDIDQLNPESADLR\nALAKHLYDSYIKSFPLTKAKARAILTGKTTDKSPFVIYDMNSLMMGEDKIKFKHITPLQE\nQSKEVAIRIFQGCQFRSVEAVQEITEYAKSIPGFVNLDLNDQVTLLKYGVHEIIYTMLAS\nLMNKDGVLISEGQGFMTREFLKSLRKPFGDFMEPKFEFAVKFNALELDDSDLAIFIAVII\nLSGDRPGLLNVKPIEDIQDNLLQALELQLKLNHPESSQLFAKLLQKMTDLRQIVTEHVQL\nLQVIKKTETDMSLHPLLQEIYKDLY","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021642|Peroxisome proliferator-activated receptor gamma 
(PPARG)\nATGACCATGGTTGACACAGAGATGCCATTCTGGCCCACCAACTTTGGGATCAGCTCCGTG\nGATCTCTCCGTAATGGAAGACCACTCCCACTCCTTTGATATCAAGCCCTTCACTACTGTT\nGACTTCTCCAGCATTTCTACTCCACATTACGAAGACATTCCATTCACAAGAACAGATCCA\nGTGGTTGCAGATTACAAGTATGACCTGAAACTTCAAGAGTACCAAAGTGCAATCAAAGTG\nGAGCCTGCATCTCCACCTTATTATTCTGAGAAGACTCAGCTCTACAATAAGCCTCATGAA\nGAGCCTTCCAACTCCCTCATGGCAATTGAATGTCGTGTCTGTGGAGATAAAGCTTCTGGA\nTTTCACTATGGAGTTCATGCTTGTGAAGGATGCAAGGGTTTCTTCCGGAGAACAATCAGA\nTTGAAGCTTATCTATGACAGATGTGATCTTAACTGTCGGATCCACAAAAAAAGTAGAAAT\nAAATGTCAGTACTGTCGGTTTCAGAAATGCCTTGCAGTGGGGATGTCTCATAATGCCATC\nAGGTTTGGGCGGATGCCACAGGCCGAGAAGGAGAAGCTGTTGGCGGAGATCTCCAGTGAT\nATCGACCAGCTGAATCCAGAGTCCGCTGACCTCCGGGCCCTGGCAAAACATTTGTATGAC\nTCATACATAAAGTCCTTCCCGCTGACCAAAGCAAAGGCGAGGGCGATCTTGACAGGAAAG\nACAACAGACAAATCACCATTCGTTATCTATGACATGAATTCCTTAATGATGGGAGAAGAT\nAAAATCAAGTTCAAACACATCACCCCCCTGCAGGAGCAGAGCAAAGAGGTGGCCATCCGC\nATCTTTCAGGGCTGCCAGTTTCGCTCCGTGGAGGCTGTGCAGGAGATCACAGAGTATGCC\nAAAAGCATTCCTGGTTTTGTAAATCTTGACTTGAACGACCAAGTAACTCTCCTCAAATAT\nGGAGTCCACGAGATCATTTACACAATGCTGGCCTCCTTGATGAATAAAGATGGGGTTCTC\nATATCCGAGGGCCAAGGCTTCATGACAAGGGAGTTTCTAAAGAGCCTGCGAAAGCCTTTT\nGGTGACTTTATGGAGCCCAAGTTTGAGTTTGCTGTGAAGTTCAATGCACTGGAATTAGAT\nGACAGCGACTTGGCAATATTTATTGCTGTCATTATTCTCAGTGGAGACCGCCCAGGTTTG\nCTGAATGTGAAGCCCATTGAAGACATTCAAGACAACCTGCTACAAGCCCTGGAGCTCCAG\nCTGAAGCTGAACCACCCTGAGTCCTCACAGCTGTTTGCCAAGCTGCTCCAGAAAATGACA\nGACCTCAGACAGATTGTCACGGAACACGTGCAGCTACTGCAGGTGATCAAGAAGACGGAG\nACAGACATGAGTCTTCACCCGCTCCTGCAGGAGATCTACAAGGACTTGTACTAG","gene_format":"FASTA","parent_id":"BE0000215"} {"id":"P09210","source":"Swiss-Prot","name":"Glutathione S-transferase A2","general_function":"Glutathione transferase activity","specific_function":"Conjugation of reduced glutathione to a wide number of exogenous and endogenous hydrophobic 
electrophiles.","gene_name":"GSTA2","locus":"6p12.1","cellular_location":"Cytoplasm","theoretical_pi":9.07,"molecular_weight":25663.675,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0011580|Glutathione S-transferase A2\nMAEKPKLHYSNIRGRMESIRWLLAAAGVEFEEKFIKSAEDLDKLRNDGYLMFQQVPMVEI\nDGMKLVQTRAILNYIASKYNLYGKDIKEKALIDMYIEGIADLGEMILLLPFSQPEEQDAK\nLALIQEKTKNRYFPAFEKVLKSHGQDYLVGNKLSRADIHLVELLYYVEELDSSLISSFPL\nLKALKTRISNLPTVKKFLQPGSPRKPPMDEKSLEESRKIFRF","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0011581|Glutathione S-transferase A2 (GSTA2)\nATGGCAGAGAAGCCCAAGCTCCACTACTCCAATATACGGGGCAGAATGGAGTCCATCCGG\nTGGCTCCTGGCTGCAGCTGGAGTAGAGTTTGAAGAGAAATTTATAAAATCTGCAGAAGAT\nTTGGACAAGTTAAGAAATGATGGATATTTGATGTTCCAGCAAGTGCCAATGGTTGAGATT\nGATGGGATGAAGCTGGTGCAGACCAGAGCCATTCTCAACTACATTGCCAGCAAATACAAC\nCTCTATGGGAAAGACATAAAGGAGAAAGCCCTGATTGATATGTATATAGAAGGTATAGCA\nGATTTGGGTGAAATGATCCTTCTTCTGCCCTTTAGTCAACCTGAGGAACAAGATGCCAAG\nCTTGCCTTGATCCAAGAGAAAACAAAAAATCGCTACTTCCCTGCCTTTGAAAAAGTCTTA\nAAGAGCCACGGACAAGACTACCTTGTTGGCAACAAGCTGAGCCGGGCTGACATTCACCTG\nGTGGAACTTCTCTACTACGTGGAAGAGCTTGACTCTAGCCTTATTTCCAGCTTCCCTCTG\nCTGAAGGCCCTGAAAACCAGAATCAGTAACCTGCCCACAGTGAAGAAGTTTCTACAGCCT\nGGCAGCCCAAGGAAGCCTCCCATGGATGAGAAATCTTTAGAAGAATCAAGGAAGATTTTC\nAGGTTTTAA","gene_format":"FASTA","parent_id":"BE0002189"} {"id":"P09488","source":"Swiss-Prot","name":"Glutathione S-transferase Mu 1","general_function":"Protein homodimerization activity","specific_function":"Conjugation of reduced glutathione to a wide number of exogenous and endogenous hydrophobic electrophiles.","gene_name":"GSTM1","locus":"1p13.3","cellular_location":"Cytoplasm","theoretical_pi":6.67,"molecular_weight":25711.555,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0037080|Glutathione S-transferase Mu 
1\nMPMILGYWDIRGLAHAIRLLLEYTDSSYEEKKYTMGDAPDYDRSQWLNEKFKLGLDFPNL\nPYLIDGAHKITQSNAILCYIARKHNLCGETEEEKIRVDILENQTMDNHMQLGMICYNPEF\nEKLKPKYLEELPEKLKLYSEFLGKRPWFAGNKITFVDFLVYDVLDLHRIFEPKCLDAFPN\nLKDFISRFEGLEKISAYMKSSRFLPRPVFSKMAVWGNK","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0020484|Glutathione S-transferase Mu 1 (GSTM1)\nATGCCCATGATACTGGGGTACTGGGACATCCGCGGGCTGGCCCACGCCATCCGCCTGCTC\nCTGGAATACACAGACTCAAGCTATGAGGAAAAGAAGTACACGATGGGGGACGCTCCTGAT\nTATGACAGAAGCCAGTGGCTGAATGAAAAATTCAAGCTGGGCCTGGACTTTCCCAATCTG\nCCCTACTTGATTGATGGGGCTCACAAGATCACCCAGAGCAACGCCATCTTGTGCTACATT\nGCCCGCAAGCACAACCTGTGTGGGGAGACAGAAGAGGAGAAGATTCGTGTGGACATTTTG\nGAGAACCAGACCATGGACAACCATATGCAGCTGGGCATGATCTGCTACAATCCAGAATTT\nGAGAAACTGAAGCCAAAGTACTTGGAGGAACTCCCTGAAAAGCTAAAGCTCTACTCAGAG\nTTTCTGGGGAAGCGGCCATGGTTTGCAGGAAACAAGATCACTTTTGTAGATTTTCTCGTC\nTATGATGTCCTTGACCTCCACCGTATATTTGAGCCCAAGTGCTTGGACGCCTTCCCAAAT\nCTGAAGGACTTCATCTCCCGCTTTGAGGGCTTGGAGAAGATCTCTGCCTACATGAAGTCC\nAGCCGCTTCCTCCCAAGACCTGTGTTCTCAAAGATGGCTGTCTGGGGCAACAAGTAG","gene_format":"FASTA","parent_id":"BE0000807"} {"id":"P23219","source":"Swiss-Prot","name":"Prostaglandin G/H synthase 1","general_function":"Prostaglandin-endoperoxide synthase activity","specific_function":"Converts arachidonate to prostaglandin H2 (PGH2), a committed step in prostanoid synthesis. Involved in the constitutive production of prostanoids in particular in the stomach and platelets. In gastric epithelial cells, it is a key step in the generation of prostaglandins, such as prostaglandin E2 (PGE2), which plays an important role in cytoprotection. 
In platelets, it is involved in the generation of thromboxane A2 (TXA2), which promotes platelet activation and aggregation, vasoconstriction and proliferation of vascular smooth muscle cells.","gene_name":"PTGS1","locus":"9q32-q33.3","cellular_location":"Microsome membrane","signal_regions":"1-23","theoretical_pi":7.39,"molecular_weight":68685.82,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0036935|Prostaglandin G/H synthase 1\nMSRSLLLWFLLFLLLLPPLPVLLADPGAPTPVNPCCYYPCQHQGICVRFGLDRYQCDCTR\nTGYSGPNCTIPGLWTWLRNSLRPSPSFTHFLLTHGRWFWEFVNATFIREMLMRLVLTVRS\nNLIPSPPTYNSAHDYISWESFSNVSYYTRILPSVPKDCPTPMGTKGKKQLPDAQLLARRF\nLLRRKFIPDPQGTNLMFAFFAQHFTHQFFKTSGKMGPGFTKALGHGVDLGHIYGDNLERQ\nYQLRLFKDGKLKYQVLDGEMYPPSVEEAPVLMHYPRGIPPQSQMAVGQEVFGLLPGLMLY\nATLWLREHNRVCDLLKAEHPTWGDEQLFQTTRLILIGETIKIVIEEYVQQLSGYFLQLKF\nDPELLFGVQFQYRNRIAMEFNHLYHWHPLMPDSFKVGSQEYSYEQFLFNTSMLVDYGVEA\nLVDAFSRQIAGRIGGGRNMDHHILHVAVDVIRESREMRLQPFNEYRKRFGMKPYTSFQEL\nVGEKEMAAELEELYGDIDALEFYPGLLLEKCHPNSIFGESMIEIGAPFSLKGLLGNPICS\nPEYWKPSTFGGEVGFNIVKTATLKKLVCLNTKTCPYVSFRVPDASQDDGPAVERPSTEL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021254|Prostaglandin G/H synthase 1 
(PTGS1)\nATGAGCCGGAGTCTCTTGCTCTGGTTCTTGCTGTTCCTGCTCCTGCTCCCGCCGCTCCCC\nGTCCTGCTCGCGGACCCAGGGGCGCCCACGCCAGTGAATCCCTGTTGTTACTATCCATGC\nCAGCACCAGGGCATCTGTGTCCGCTTCGGCCTTGACCGCTACCAGTGTGACTGCACCCGC\nACGGGCTATTCCGGCCCCAACTGCACCATCCCTGGCCTGTGGACCTGGCTCCGGAATTCA\nCTGCGGCCCAGCCCCTCTTTCACCCACTTCCTGCTCACTCACGGGCGCTGGTTCTGGGAG\nTTTGTCAATGCCACCTTCATCCGAGAGATGCTCATGCGCCTGGTACTCACAGTGCGCTCC\nAACCTTATCCCCAGTCCCCCCACCTACAACTCAGCACATGACTACATCAGCTGGGAGTCT\nTTCTCCAACGTGAGCTATTACACTCGTATTCTGCCCTCTGTGCCTAAAGATTGCCCCACA\nCCCATGGGAACCAAAGGGAAGAAGCAGTTGCCAGATGCCCAGCTCCTGGCCCGCCGCTTC\nCTGCTCAGGAGGAAGTTCATACCTGACCCCCAAGGCACCAACCTCATGTTTGCCTTCTTT\nGCACAACACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACC\nAAGGCCTTGGGCCATGGGGTAGACCTCGGCCACATTTATGGAGACAATCTGGAGCGTCAG\nTATCAACTGCGGCTCTTTAAGGATGGGAAACTCAAGTACCAGGTGCTGGATGGAGAAATG\nTACCCGCCCTCGGTAGAAGAGGCGCCTGTGTTGATGCACTACCCCCGAGGCATCCCGCCC\nCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTAT\nGCCACGCTCTGGCTACGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCC\nACCTGGGGCGATGAGCAGCTTTTCCAGACGACCCGCCTCATCCTCATAGGGGAGACCATC\nAAGATTGTCATCGAGGAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTT\nGACCCAGAGCTGCTGTTCGGTGTCCAGTTCCAATACCGCAACCGCATTGCCATGGAGTTC\nAACCATCTCTACCACTGGCACCCCCTCATGCCTGACTCCTTCAAGGTGGGCTCCCAGGAG\nTACAGCTACGAGCAGTTCTTGTTCAACACCTCCATGTTGGTGGACTATGGGGTTGAGGCC\nCTGGTGGATGCCTTCTCTCGCCAGATTGCTGGCCGGATCGGTGGGGGCAGGAACATGGAC\nCACCACATCCTGCATGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAGATGCGGCTGCAG\nCCCTTCAATGAGTACCGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTC\nGTAGGAGAGAAGGAGATGGCAGCAGAGTTGGAGGAATTGTATGGAGACATTGATGCGTTG\nGAGTTCTACCCTGGACTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGT\nATGATAGAGATTGGGGCTCCCTTTTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCT\nCCGGAGTACTGGAAGCCGAGCACATTTGGCGGCGAGGTGGGCTTTAACATTGTCAAGACG\nGCCACACTGAAGAAGCTGGTCTGCCTCAACACCAAGACCTGTCCCTACGTTTCCTTCCGT\nGTGCCGGATGCCAGTCAGGATGATGGGCCTGCTGTGGAGCGACCATCCACAGAGCTCTGA\n","gene_format":"FASTA","parent_id":"BE0000017"} {"id":"P35354","source":"Swiss-Prot","name":"Prostaglandin G/H synthase 
2","general_function":"Prostaglandin-endoperoxide synthase activity","specific_function":"Converts arachidonate to prostaglandin H2 (PGH2), a committed step in prostanoid synthesis. Constitutively expressed in some tissues in physiological conditions, such as the endothelium, kidney and brain, and in pathological conditions, such as in cancer. PTGS2 is responsible for production of inflammatory prostaglandins. Up-regulation of PTGS2 is also associated with increased cell adhesion, phenotypic changes, resistance to apoptosis and tumor angiogenesis. In cancer cells, PTGS2 is a key step in the production of prostaglandin E2 (PGE2), which plays important roles in modulating motility, proliferation and resistance to apoptosis.","gene_name":"PTGS2","locus":"1q25.2-q25.3","cellular_location":"Microsome membrane","signal_regions":"1-17","theoretical_pi":7.41,"molecular_weight":68995.625,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0021832|Prostaglandin G/H synthase 2\nMLARALLLCAVLALSHTANPCCSHPCQNRGVCMSVGFDQYKCDCTRTGFYGENCSTPEFL\nTRIKLFLKPTPNTVHYILTHFKGFWNVVNNIPFLRNAIMSYVLTSRSHLIDSPPTYNADY\nGYKSWEAFSNLSYYTRALPPVPDDCPTPLGVKGKKQLPDSNEIVEKLLLRRKFIPDPQGS\nNMMFAFFAQHFTHQFFKTDHKRGPAFTNGLGHGVDLNHIYGETLARQRKLRLFKDGKMKY\nQIIDGEMYPPTVKDTQAEMIYPPQVPEHLRFAVGQEVFGLVPGLMMYATIWLREHNRVCD\nVLKQEHPEWGDEQLFQTSRLILIGETIKIVIEDYVQHLSGYHFKLKFDPELLFNKQFQYQ\nNRIAAEFNTLYHWHPLLPDTFQIHDQKYNYQQFIYNNSILLEHGITQFVESFTRQIAGRV\nAGGRNVPPAVQKVSQASIDQSRQMKYQSFNEYRKRFMLKPYESFEELTGEKEMSAELEAL\nYGDIDAVELYPALLVEKPRPDAIFGETMVEVGAPFSLKGLMGNVICSPAYWKPSTFGGEV\nGFQIINTASIQSLICNNVKGCPFTSFSVPDPELIKTVTINASSSRSGLDDINPTVLLKER\nSTEL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021833|Prostaglandin G/H synthase 2 
(PTGS2)\nATGCTCGCCCGCGCCCTGCTGCTGTGCGCGGTCCTGGCGCTCAGCCATACAGCAAATCCT\nTGCTGTTCCCACCCATGTCAAAACCGAGGTGTATGTATGAGTGTGGGATTTGACCAGTAT\nAAGTGCGATTGTACCCGGACAGGATTCTATGGAGAAAACTGCTCAACACCGGAATTTTTG\nACAAGAATAAAATTATTTCTGAAACCCACTCCAAACACAGTGCACTACATACTTACCCAC\nTTCAAGGGATTTTGGAACGTTGTGAATAACATTCCCTTCCTTCGAAATGCAATTATGAGT\nTATGTGTTGACATCCAGATCACATTTGATTGACAGTCCACCAACTTACAATGCTGACTAT\nGGCTACAAAAGCTGGGAAGCCTTCTCTAACCTCTCCTATTATACTAGAGCCCTTCCTCCT\nGTGCCTGATGATTGCCCGACTCCCTTGGGTGTCAAAGGTAAAAAGCAGCTTCCTGATTCA\nAATGAGATTGTGGAAAAATTGCTTCTAAGAAGAAAGTTCATCCCTGATCCCCAGGGCTCA\nAACATGATGTTTGCATTCTTTGCCCAGCACTTCACGCATCAGTTTTTCAAGACAGATCAT\nAAGCGAGGGCCAGCTTTCACCAACGGGCTGGGCCATGGGGTGGACTTAAATCATATTTAC\nGGTGAAACTCTGGCTAGACAGCGTAAACTGCGCCTTTTCAAGGATGGAAAAATGAAATAT\nCAGATAATTGATGGAGAGATGTATCCTCCCACAGTCAAAGATACTCAGGCAGAGATGATC\nTACCCTCCTCAAGTCCCTGAGCATCTACGGTTTGCTGTGGGGCAGGAGGTCTTTGGTCTG\nGTGCCTGGTCTGATGATGTATGCCACAATCTGGCTGCGGGAACACAACAGAGTATGCGAT\nGTGCTTAAACAGGAGCATCCTGAATGGGGTGATGAGCAGTTGTTCCAGACAAGCAGGCTA\nATACTGATAGGAGAGACTATTAAGATTGTGATTGAAGATTATGTGCAACACTTGAGTGGC\nTATCACTTCAAACTGAAATTTGACCCAGAACTACTTTTCAACAAACAATTCCAGTACCAA\nAATCGTATTGCTGCTGAATTTAACACCCTCTATCACTGGCATCCCCTTCTGCCTGACACC\nTTTCAAATTCATGACCAGAAATACAACTATCAACAGTTTATCTACAACAACTCTATATTG\nCTGGAACATGGAATTACCCAGTTTGTTGAATCATTCACCAGGCAAATTGCTGGCAGGGTT\nGCTGGTGGTAGGAATGTTCCACCCGCAGTACAGAAAGTATCACAGGCTTCCATTGACCAG\nAGCAGGCAGATGAAATACCAGTCTTTTAATGAGTACCGCAAACGCTTTATGCTGAAGCCC\nTATGAATCATTTGAAGAACTTACAGGAGAAAAGGAAATGTCTGCAGAGTTGGAAGCACTC\nTATGGTGACATCGATGCTGTGGAGCTGTATCCTGCCCTTCTGGTAGAAAAGCCTCGGCCA\nGATGCCATCTTTGGTGAAACCATGGTAGAAGTTGGAGCACCATTCTCCTTGAAAGGACTT\nATGGGTAATGTTATATGTTCTCCTGCCTACTGGAAGCCAAGCACTTTTGGTGGAGAAGTG\nGGTTTTCAAATCATCAACACTGCCTCAATTCAGTCTCTCATCTGCAATAACGTGAAGGGC\nTGTCCCTTTACTTCATTCAGTGTTCCAGATCCAGAGCTCATTAAAACAGTCACCATCAAT\nGCAAGTTCTTCCCGCTCCGGACTAGATGATATCAATCCCACAGTACTACTAAAAGAACGT\nTCGACTGAACTGTAG","gene_format":"FASTA","parent_id":"BE0000262"} {"id":"Q9UM07","source":"Swiss-Prot","name":"Protein-arginine 
deiminase type-4","general_function":"Protein-arginine deiminase activity","specific_function":"Catalyzes the citrullination/deimination of arginine residues of proteins such as histones, thereby playing a key role in histone code and regulation of stem cell maintenance. Citrullinates histone H1 at 'Arg-54' (to form H1R54ci), histone H3 at 'Arg-2', 'Arg-8', 'Arg-17' and/or 'Arg-26' (to form H3R2ci, H3R8ci, H3R17ci, H3R26ci, respectively) and histone H4 at 'Arg-3' (to form H4R3ci). Acts as a key regulator of stem cell maintenance by mediating citrullination of histone H1: citrullination of 'Arg-54' of histone H1 (H1R54ci) results in H1 displacement from chromatin and global chromatin decondensation, thereby promoting pluripotency and stem cell maintenance. Promotes profound chromatin decondensation during the innate immune response to infection in neutrophils by mediating formation of H1R54ci. Citrullination of histone H3 prevents their methylation by CARM1 and HRMT1L2/PRMT1 and represses transcription. 
Citrullinates EP300/P300 at 'Arg-2142', which favors its interaction with NCOA2/GRIP1.","gene_name":"PADI4","locus":"1p36.13","cellular_location":"Cytoplasm","theoretical_pi":6.56,"molecular_weight":74078.65,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0016309|Protein-arginine deiminase type-4\nMAQGTLIRVTPEQPTHAVCVLGTLTQLDICSSAPEDCTSFSINASPGVVVDIAHGPPAKK\nKSTGSSTWPLDPGVEVTLTMKVASGSTGDQKVQISYYGPKTPPVKALLYLTGVEISLCAD\nITRTGKVKPTRAVKDQRTWTWGPCGQGAILLVNCDRDNLESSAMDCEDDEVLDSEDLQDM\nSLMTLSTKTPKDFFTNHTLVLHVARSEMDKVRVFQATRGKLSSKCSVVLGPKWPSHYLMV\nPGGKHNMDFYVEALAFPDTDFPGLITLTISLLDTSNLELPEAVVFQDSVVFRVAPWIMTP\nNTQPPQEVYACSIFENEDFLKSVTTLAMKAKCKLTICPEEENMDDQWMQDEMEIGYIQAP\nHKTLPVVFDSPRNRGLKEFPIKRVMGPDFGYVTRGPQTGGISGLDSFGNLEVSPPVTVRG\nKEYPLGRILFGDSCYPSNDSRQMHQALQDFLSAQQVQAPVKLYSDWLSVGHVDEFLSFVP\nAPDRKGFRLLLASPRSCYKLFQEQQNEGHGEALLFEGIKKKKQQKIKNILSNKTLREHNS\nFVERCIDWNRELLKRELGLAESDIIDIPQLFKLKEFSKAEAFFPNMVNMLVLGKHLGIPK\nPFGPVINGRCCLEEKVCSLLEPLGLQCTFINDFFTYHIRHGEVHCGTNVRRKPFSFKWWN\nMVP","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0016310|Protein-arginine deiminase type-4 
(PADI4)\nATGGCCCAGGGGACATTGATCCGTGTGACCCCAGAGCAGCCCACCCATGCCGTGTGTGTG\nCTGGGCACCTTGACTCAGCTTGACATCTGCAGCTCTGCCCCTGAGGACTGCACGTCCTTC\nAGCATCAACGCCTCCCCAGGGGTGGTCGTGGATATTGCCCACGGCCCTCCAGCCAAGAAG\nAAATCCACAGGTTCCTCCACATGGCCCCTGGACCCTGGGGTAGAGGTGACCCTGACGATG\nAAAGTGGCCAGTGGTAGCACAGGCGACCAGAAGGTTCAGATTTCATACTACGGACCCAAG\nACTCCACCAGTCAAAGCTCTACTCTACCTCACCGGGGTGGAAATCTCCTTGTGCGCAGAC\nATCACCCGCACCGGCAAAGTGAAGCCAACCAGAGCTGTGAAAGATCAGAGGACCTGGACC\nTGGGGCCCTTGTGGACAGGGTGCCATCCTGCTGGTGAACTGTGACAGAGACAATCTCGAA\nTCTTCTGCCATGGACTGCGAGGATGATGAAGTGCTTGACAGCGAAGACCTGCAGGACATG\nTCGCTGATGACCCTGAGCACGAAGACCCCCAAGGACTTCTTCACAAACCATACACTGGTG\nCTCCACGTGGCCAGGTCTGAGATGGACAAAGTGAGGGTGTTTCAGGCCACACGGGGCAAA\nCTGTCCTCCAAGTGCAGCGTAGTCTTGGGTCCCAAGTGGCCCTCTCACTACCTGATGGTC\nCCCGGTGGAAAGCACAACATGGACTTCTACGTGGAGGCCCTCGCTTTCCCGGACACCGAC\nTTCCCGGGGCTCATTACCCTCACCATCTCCCTGCTGGACACGTCCAACCTGGAGCTCCCC\nGAGGCTGTGGTGTTCCAAGACAGCGTGGTCTTCCGCGTGGCGCCCTGGATCATGACCCCC\nAACACCCAGCCCCCGCAGGAGGTGTACGCGTGCAGTATTTTTGAAAATGAGGACTTCCTG\nAAGTCAGTGACTACTCTGGCCATGAAAGCCAAGTGCAAGCTGACCATCTGCCCTGAGGAG\nGAGAACATGGATGACCAGTGGATGCAGGATGAAATGGAGATCGGCTACATCCAAGCCCCA\nCACAAAACGCTGCCCGTGGTCTTCGACTCTCCAAGGAACAGAGGCCTGAAGGAGTTTCCC\nATCAAACGCGTGATGGGTCCAGATTTTGGCTATGTAACTCGAGGGCCCCAAACAGGGGGT\nATCAGTGGACTGGACTCCTTTGGGAACCTGGAAGTGAGCCCCCCAGTCACAGTCAGGGGC\nAAGGAATACCCGCTGGGCAGGATTCTCTTCGGGGACAGCTGTTATCCCAGCAATGACAGC\nCGGCAGATGCACCAGGCCCTGCAGGACTTCCTCAGTGCCCAGCAGGTGCAGGCCCCTGTG\nAAGCTCTATTCTGACTGGCTGTCCGTGGGCCACGTGGACGAGTTCCTGAGCTTTGTGCCA\nGCACCCGACAGGAAGGGCTTCCGGCTGCTCCTGGCCAGCCCCAGGTCCTGCTACAAACTG\nTTCCAGGAGCAGCAGAATGAGGGCCACGGGGAGGCCCTGCTGTTCGAAGGGATCAAGAAA\nAAAAAACAGCAGAAAATAAAGAACATTCTGTCAAACAAGACATTGAGAGAACATAATTCA\nTTTGTGGAGAGATGCATCGACTGGAACCGCGAGCTGCTGAAGCGGGAGCTGGGCCTGGCC\nGAGAGTGACATCATTGACATCCCGCAGCTCTTCAAGCTCAAAGAGTTCTCTAAGGCGGAA\nGCTTTTTTCCCCAACATGGTGAACATGCTGGTGCTAGGGAAGCACCTGGGCATCCCCAAG\nCCCTTCGGGCCCGTCATCAACGGCCGCTGCTGCCTGGAGGAGAAGGTGTGTTCCCTGCTG\nGAGCCACTGGGCCTCCAGTGCACCTTCATCAACGACTTCTTCACCTACCACATCAGGCAT\nGGGGAGG
TGCACTGCGGCACCAACGTGCGCAGAAAGCCCTTCTCCTTCAAGTGGTGGAAC\nATGGTGCCCTGA","gene_format":"FASTA","parent_id":"BE0001092"} {"id":"O75469","source":"Swiss-Prot","name":"Nuclear receptor subfamily 1 group I member 2","general_function":"Zinc ion binding","specific_function":"Nuclear receptor that binds and is activated by variety of endogenous and xenobiotic compounds. Transcription factor that activates the transcription of multiple genes involved in the metabolism and secretion of potentially harmful xenobiotics, drugs and endogenous compounds. Activated by the antibiotic rifampicin and various plant metabolites, such as hyperforin, guggulipid, colupulone, and isoflavones. Response to specific ligands is species-specific. Activated by naturally occurring steroids, such as pregnenolone and progesterone. Binds to a response element in the promoters of the CYP3A4 and ABCB1/MDR1 genes.","gene_name":"NR1I2","locus":"3q12-q13.3","cellular_location":"Nucleus","theoretical_pi":8.44,"molecular_weight":49761.245,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001903|Nuclear receptor subfamily 1 group I member 2\nMEVRPKESWNHADFVHCEDTESVPGKPSVNADEEVGGPQICRVCGDKATGYHFNVMTCEG\nCKGFFRRAMKRNARLRCPFRKGACEITRKTRRQCQACRLRKCLESGMKKEMIMSDEAVEE\nRRALIKRKKSERTGTQPLGVQGLTEEQRMMIRELMDAQMKTFDTTFSHFKNFRLPGVLSS\nGCELPESLQAPSREEAAKWSQVRKDLCSLKVSLQLRGEDGSVWNYKPPADSGGKEIFSLL\nPHMADMSTYMFKGIISFAKVISYFRDLPIEDQISLLKGAAFELCQLRFNTVFNAETGTWE\nCGRLSYCLEDTAGGFQQLLLEPMLKFHYMLKKLQLHEEEYVLMQAISLFSPDRPGVLQHR\nVVDQLQEQFAITLKSYIECNRPQPAHRFLFLKIMAMLTELRSINAQHTQRLLRIQDIHPF\nATPLMQELFGITGS","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010672|Nuclear receptor subfamily 1 group I member 2 
(NR1I2)\nCTGGAGGTGAGACCCAAAGAAAGCTGGAACCATGCTGACTTTGTACACTGTGAGGACACA\nGAGTCTGTTCCTGGAAAGCCCAGTGTCAACGCAGATGAGGAAGTCGGAGGTCCCCAAATC\nTGCCGTGTATGTGGGGACAAGGCCACTGGCTATCACTTCAATGTCATGACATGTGAAGGA\nTGCAAGGGCTTTTTCAGGAGGGCCATGAAACGCAACGCCCGGCTGAGGTGCCCCTTCCGG\nAAGGGCGCCTGCGAGATCACCCGGAAGACCCGGCGACAGTGCCAGGCCTGCCGCCTGCGC\nAAGTGCCTGGAGAGCGGCATGAAGAAGGAGATGATCATGTCCGACGAGGCCGTGGAGGAG\nAGGCGGGCCTTGATCAAGCGGAAGAAAAGTGAACGGACAGGGACTCAGCCACTGGGAGTG\nCAGGGGCTGACAGAGGAGCAGCGGATGATGATCAGGGAGCTGATGGACGCTCAGATGAAA\nACCTTTGACACTACCTTCTCCCATTTCAAGAATTTCCGGCTGCCAGGGGTGCTTAGCAGT\nGGCTGCGAGTTGCCAGAGTCTCTGCAGGCCCCATCGAGGGAAGAAGCTGCCAAGTGGAGC\nCAGGTCCGGAAAGATCTGTGCTCTTTGAAGGTCTCTCTGCAGCTGCGGGGGGAGGATGGC\nAGTGTCTGGAACTACAAACCCCCAGCCGACAGTGGCGGGAAAGAGATCTTCTCCCTGCTG\nCCCCACATGGCTGACATGTCAACCTACATGTTCAAAGGCATCATCAGCTTTGCCAAAGTC\nATCTCCTACTTCAGGGACTTGCCCATCGAGGACCAGATCTCCCTGCTGAAGGGGGCCGCT\nTTCGAGCTGTGTCAACTGAGATTCAACACAGTGTTCAACGCGGAGACTGGAACCTGGGAG\nTGTGGCCGGCTGTCCTACTGCTTGGAAGACACTGCAGGTGGCTTCCAGCAACTTCTACTG\nGAGCCCATGCTGAAATTCCACTACATGCTGAAGAAGCTGCAGCTGCATGAGGAGGAGTAT\nGTGCTGATGCAGGCCATCTCCCTCTTCTCCCCAGACCGCCCAGGTGTGCTGCAGCACCGC\nGTGGTGGACCAGCTGCAGGAGCAATTCGCCATTACTCTGAAGTCCTACATTGAATGCAAT\nCGGCCCCAGCCTGCTCATAGGTTCTTGTTCCTGAAGATCATGGCTATGCTCACCGAGCTC\nCGCAGCATCAATGCTCAGCACACCCAGCGGCTGCTGCGCATCCAGGACATACACCCCTTT\nGCTACGCCCCTCATGCAGGAGTTGTTCGGCATCACAGGTAGCTGA","gene_format":"FASTA","parent_id":"BE0000956"} {"id":"P13569","source":"Swiss-Prot","name":"Cystic fibrosis transmembrane conductance regulator","general_function":"Pdz domain binding","specific_function":"Involved in the transport of chloride ions. May regulate bicarbonate secretion and salvage in epithelial cells by regulating the SLC4A7 transporter. Can inhibit the chloride channel activity of ANO1. 
Plays a role in the chloride and bicarbonate homeostasis during sperm epididymal maturation and capacitation.","gene_name":"CFTR","locus":"7q31.2","cellular_location":"Early endosome membrane","transmembrane_regions":"81-103\n118-138\n195-215\n221-241\n308-328\n331-350\n860-880\n912-932\n991-1011\n1014-1034\n1103-1123\n1129-1149","theoretical_pi":9.02,"molecular_weight":168139.895,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0002377|Cystic fibrosis transmembrane conductance regulator\nMQRSPLEKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLEREWDRE\nLASKKNPKLINALRRCFFWRFMFYGIFLYLGEVTKAVQPLLLGRIIASYDPDNKEERSIA\nIYLGIGLCLLFIVRTLLLHPAIFGLHHIGMQMRIAMFSLIYKKTLKLSSRVLDKISIGQL\nVSLLSNNLNKFDEGLALAHFVWIAPLQVALLMGLIWELLQASAFCGLGFLIVLALFQAGL\nGRMMMKYRDQRAGKISERLVITSEMIENIQSVKAYCWEEAMEKMIENLRQTELKLTRKAA\nYVRYFNSSAFFFSGFFVVFLSVLPYALIKGIILRKIFTTISFCIVLRMAVTRQFPWAVQT\nWYDSLGAINKIQDFLQKQEYKTLEYNLTTTEVVMENVTAFWEEGFGELFEKAKQNNNNRK\nTSNGDDSLFFSNFSLLGTPVLKDINFKIERGQLLAVAGSTGAGKTSLLMVIMGELEPSEG\nKIKHSGRISFCSQFSWIMPGTIKENIIFGVSYDEYRYRSVIKACQLEEDISKFAEKDNIV\nLGEGGITLSGGQRARISLARAVYKDADLYLLDSPFGYLDVLTEKEIFESCVCKLMANKTR\nILVTSKMEHLKKADKILILHEGSSYFYGTFSELQNLQPDFSSKLMGCDSFDQFSAERRNS\nILTETLHRFSLEGDAPVSWTETKKQSFKQTGEFGEKRKNSILNPINSIRKFSIVQKTPLQ\nMNGIEEDSDEPLERRLSLVPDSEQGEAILPRISVISTGPTLQARRRQSVLNLMTHSVNQG\nQNIHRKTTASTRKVSLAPQANLTELDIYSRRLSQETGLEISEEINEEDLKECFFDDMESI\nPAVTTWNTYLRYITVHKSLIFVLIWCLVIFLAEVAASLVVLWLLGNTPLQDKGNSTHSRN\nNSYAVIITSTSSYYVFYIYVGVADTLLAMGFFRGLPLVHTLITVSKILHHKMLHSVLQAP\nMSTLNTLKAGGILNRFSKDIAILDDLLPLTIFDFIQLLLIVIGAIAVVAVLQPYIFVATV\nPVIVAFIMLRAYFLQTSQQLKQLESEGRSPIFTHLVTSLKGLWTLRAFGRQPYFETLFHK\nALNLHTANWFLYLSTLRWFQMRIEMIFVIFFIAVTFISILTTGEGEGRVGIILTLAMNIM\nSTLQWAVNSSIDVDSLMRSVSRVFKFIDMPTEGKPTKSTKPYKNGQLSKVMIIENSHVKK\nDDIWPSGGQMTVKDLTAKYTEGGNAILENISFSISPGQRVGLLGRTGSGKSTLLSAFLRL\nLNTEGEIQIDGVSWDSITLQQWRKAFGVIPQKVFIFSGTFRKNLDPYEQWSDQEIWKVAD\nEVGLRSVIEQFPGKLDFVLVDGGCVLSHGHKQLMCLARSVLSKAKILLLDEPSAHLDPVT\nYQIIRRTLKQAFADCTVILCEHRIEAMLECQQFLVIEENKVRQY
DSIQKLLNERSLFRQA\nISPSDRVKLFPHRNSSKCKSKPQIAALKEETEEEVQDTRL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010831|Cystic fibrosis transmembrane conductance regulator (CFTR)\nATGCAGAGGTCGCCTCTGGAAAAGGCCAGCGTTGTCTCCAAACTTTTTTTCAGCTGGACC\nAGACCAATTTTGAGGAAAGGATACAGACAGCGCCTGGAATTGTCAGACATATACCAAATC\nCCTTCTGTTGATTCTGCTGACAATCTATCTGAAAAATTGGAAAGAGAATGGGATAGAGAG\nCTGGCTTCAAAGAAAAATCCTAAACTCATTAATGCCCTTCGGCGATGTTTTTTCTGGAGA\nTTTATGTTCTATGGAATCTTTTTATATTTAGGGGAAGTCACCAAAGCAGTACAGCCTCTC\nTTACTGGGAAGAATCATAGCTTCCTATGACCCGGATAACAAGGAGGAACGCTCTATCGCG\nATTTATCTAGGCATAGGCTTATGCCTTCTCTTTATTGTGAGGACACTGCTCCTACACCCA\nGCCATTTTTGGCCTTCATCACATTGGAATGCAGATGAGAATAGCTATGTTTAGTTTGATT\nTATAAGAAGACTTTAAAGCTGTCAAGCCGTGTTCTAGATAAAATAAGTATTGGACAACTT\nGTTAGTCTCCTTTCCAACAACCTGAACAAATTTGATGAAGGACTTGCATTGGCACATTTC\nGTGTGGATCGCTCCTTTGCAAGTGGCACTCCTCATGGGGCTAATCTGGGAGTTGTTACAG\nGCGTCTGCCTTCTGTGGACTTGGTTTCCTGATAGTCCTTGCCCTTTTTCAGGCTGGGCTA\nGGGAGAATGATGATGAAGTACAGAGATCAGAGAGCTGGGAAGATCAGTGAAAGACTTGTG\nATTACCTCAGAAATGATTGAAAATATCCAATCTGTTAAGGCATACTGCTGGGAAGAAGCA\nATGGAAAAAATGATTGAAAACTTAAGACAAACAGAACTGAAACTGACTCGGAAGGCAGCC\nTATGTGAGATACTTCAATAGCTCAGCCTTCTTCTTCTCAGGGTTCTTTGTGGTGTTTTTA\nTCTGTGCTTCCCTATGCACTAATCAAAGGAATCATCCTCCGGAAAATATTCACCACCATC\nTCATTCTGCATTGTTCTGCGCATGGCGGTCACTCGGCAATTTCCCTGGGCTGTACAAACA\nTGGTATGACTCTCTTGGAGCAATAAACAAAATACAGGATTTCTTACAAAAGCAAGAATAT\nAAGACATTGGAATATAACTTAACGACTACAGAAGTAGTGATGGAGAATGTAACAGCCTTC\nTGGGAGGAGGGATTTGGGGAATTATTTGAGAAAGCAAAACAAAACAATAACAATAGAAAA\nACTTCTAATGGTGATGACAGCCTCTTCTTCAGTAATTTCTCACTTCTTGGTACTCCTGTC\nCTGAAAGATATTAATTTCAAGATAGAAAGAGGACAGTTGTTGGCGGTTGCTGGATCCACT\nGGAGCAGGCAAGACTTCACTTCTAATGGTGATTATGGGAGAACTGGAGCCTTCAGAGGGT\nAAAATTAAGCACAGTGGAAGAATTTCATTCTGTTCTCAGTTTTCCTGGATTATGCCTGGC\nACCATTAAAGAAAATATCATCTTTGGTGTTTCCTATGATGAATATAGATACAGAAGCGTC\nATCAAAGCATGCCAACTAGAAGAGGACATCTCCAAGTTTGCAGAGAAAGACAATATAGTT\nCTTGGAGAAGGTGGAATCACACTGAGTGGAGGTCAACGAGCAAGAATTTCTTTAGCAAGA\nGCAGTATACAAAGATGCTGATTTGTATTTATTAGACTCTCCTTTTGGATACCTAGATGTT\nTTAACAGAAAAAGAAATATT
TGAAAGCTGTGTCTGTAAACTGATGGCTAACAAAACTAGG\nATTTTGGTCACTTCTAAAATGGAACATTTAAAGAAAGCTGACAAAATATTAATTTTGCAT\nGAAGGTAGCAGCTATTTTTATGGGACATTTTCAGAACTCCAAAATCTACAGCCAGACTTT\nAGCTCAAAACTCATGGGATGTGATTCTTTCGACCAATTTAGTGCAGAAAGAAGAAATTCA\nATCCTAACTGAGACCTTACACCGTTTCTCATTAGAAGGAGATGCTCCTGTCTCCTGGACA\nGAAACAAAAAAACAATCTTTTAAACAGACTGGAGAGTTTGGGGAAAAAAGGAAGAATTCT\nATTCTCAATCCAATCAACTCTATACGAAAATTTTCCATTGTGCAAAAGACTCCCTTACAA\nATGAATGGCATCGAAGAGGATTCTGATGAGCCTTTAGAGAGAAGGCTGTCCTTAGTACCA\nGATTCTGAGCAGGGAGAGGCGATACTGCCTCGCATCAGCGTGATCAGCACTGGCCCCACG\nCTTCAGGCACGAAGGAGGCAGTCTGTCCTGAACCTGATGACACACTCAGTTAACCAAGGT\nCAGAACATTCACCGAAAGACAACAGCATCCACACGAAAAGTGTCACTGGCCCCTCAGGCA\nAACTTGACTGAACTGGATATATATTCAAGAAGGTTATCTCAAGAAACTGGCTTGGAAATA\nAGTGAAGAAATTAACGAAGAAGACTTAAAGGAGTGCTTTTTTGATGATATGGAGAGCATA\nCCAGCAGTGACTACATGGAACACATACCTTCGATATATTACTGTCCACAAGAGCTTAATT\nTTTGTGCTAATTTGGTGCTTAGTAATTTTTCTGGCAGAGGTGGCTGCTTCTTTGGTTGTG\nCTGTGGCTCCTTGGAAACACTCCTCTTCAAGACAAAGGGAATAGTACTCATAGTAGAAAT\nAACAGCTATGCAGTGATTATCACCAGCACCAGTTCGTATTATGTGTTTTACATTTACGTG\nGGAGTAGCCGACACTTTGCTTGCTATGGGATTCTTCAGAGGTCTACCACTGGTGCATACT\nCTAATCACAGTGTCGAAAATTTTACACCACAAAATGTTACATTCTGTTCTTCAAGCACCT\nATGTCAACCCTCAACACGTTGAAAGCAGGTGGGATTCTTAATAGATTCTCCAAAGATATA\nGCAATTTTGGATGACCTTCTGCCTCTTACCATATTTGACTTCATCCAGTTGTTATTAATT\nGTGATTGGAGCTATAGCAGTTGTCGCAGTTTTACAACCCTACATCTTTGTTGCAACAGTG\nCCAGTGATAGTGGCTTTTATTATGTTGAGAGCATATTTCCTCCAAACCTCACAGCAACTC\nAAACAACTGGAATCTGAAGGCAGGAGTCCAATTTTCACTCATCTTGTTACAAGCTTAAAA\nGGACTATGGACACTTCGTGCCTTCGGACGGCAGCCTTACTTTGAAACTCTGTTCCACAAA\nGCTCTGAATTTACATACTGCCAACTGGTTCTTGTACCTGTCAACACTGCGCTGGTTCCAA\nATGAGAATAGAAATGATTTTTGTCATCTTCTTCATTGCTGTTACCTTCATTTCCATTTTA\nACAACAGGAGAAGGAGAAGGAAGAGTTGGTATTATCCTGACTTTAGCCATGAATATCATG\nAGTACATTGCAGTGGGCTGTAAACTCCAGCATAGATGTGGATAGCTTGATGCGATCTGTG\nAGCCGAGTCTTTAAGTTCATTGACATGCCAACAGAAGGTAAACCTACCAAGTCAACCAAA\nCCATACAAGAATGGCCAACTCTCGAAAGTTATGATTATTGAGAATTCACACGTGAAGAAA\nGATGACATCTGGCCCTCAGGGGGCCAAATGACTGTCAAAGATCTCACAGCAAAATACACA\nGAAGGTGGAAATGCCATATTAGAGAACATTTCCTTC
TCAATAAGTCCTGGCCAGAGGGTG\nGGCCTCTTGGGAAGAACTGGATCAGGGAAGAGTACTTTGTTATCAGCTTTTTTGAGACTA\nCTGAACACTGAAGGAGAAATCCAGATCGATGGTGTGTCTTGGGATTCAATAACTTTGCAA\nCAGTGGAGGAAAGCCTTTGGAGTGATACCACAGAAAGTATTTATTTTTTCTGGAACATTT\nAGAAAAAACTTGGATCCCTATGAACAGTGGAGTGATCAAGAAATATGGAAAGTTGCAGAT\nGAGGTTGGGCTCAGATCTGTGATAGAACAGTTTCCTGGGAAGCTTGACTTTGTCCTTGTG\nGATGGGGGCTGTGTCCTAAGCCATGGCCACAAGCAGTTGATGTGCTTGGCTAGATCTGTT\nCTCAGTAAGGCGAAGATCTTGCTGCTTGATGAACCCAGTGCTCATTTGGATCCAGTAACA\nTACCAAATAATTAGAAGAACTCTAAAACAAGCATTTGCTGATTGCACAGTAATTCTCTGT\nGAACACAGGATAGAAGCAATGCTGGAATGCCAACAATTTTTGGTCATAGAAGAGAACAAA\nGTGCGGCAGTACGATTCCATCCAGAAACTGCTGAACGAGAGGAGCCTCTTCCGGCAAGCC\nATCAGCCCCTCCGACAGGGTGAAGCTCTTTCCCCACCGGAACTCAAGCAAGTGCAAGTCT\nAAGCCCCAGATTGCTGCTCTGAAAGAGGAGACAGAAGAAGAGGTGCAAGATACAAGGCTT\nTAG","gene_format":"FASTA","parent_id":"BE0001195"} {"id":"P04150","source":"Swiss-Prot","name":"Glucocorticoid receptor","general_function":"Zinc ion binding","specific_function":"Receptor for glucocorticoids (GC). Has a dual mode of action: as a transcription factor that binds to glucocorticoid response elements (GRE), both for nuclear and mitochondrial DNA, and as a modulator of other transcription factors. Affects inflammatory responses, cellular proliferation and differentiation in target tissues. Could act as a coactivator for STAT5-dependent transcription upon growth hormone (GH) stimulation and could reveal an essential role of hepatic GR in the control of body growth. Involved in chromatin remodeling. 
May play a negative role in adipogenesis through the regulation of lipolytic and antilipogenic genes expression.","gene_name":"NR3C1","locus":"5q31.3","cellular_location":"Cytoplasm","theoretical_pi":6.31,"molecular_weight":85658.57,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001584|Glucocorticoid receptor\nMDSKESLTPGREENPSSVLAQERGDVMDFYKTLRGGATVKVSASSPSLAVASQSDSKQRR\nLLVDFPKGSVSNAQQPDLSKAVSLSMGLYMGETETKVMGNDLGFPQQGQISLSSGETDLK\nLLEESIANLNRSTSVPENPKSSASTAVSAAPTEKEFPKTHSDVSSEQQHLKGQTGTNGGN\nVKLYTTDQSTFDILQDLEFSSGSPGKETNESPWRSDLLIDENCLLSPLAGEDDSFLLEGN\nSNEDCKPLILPDTKPKIKDNGDLVLSSPSNVTLPQVKTEKEDFIELCTPGVIKQEKLGTV\nYCQASFPGANIIGNKMSAISVHGVSTSGGQMYHYDMNTASLSQQQDQKPIFNVIPPIPVG\nSENWNRCQGSGDDNLTSLGTLNFPGRTVFSNGYSSPSMRPDVSSPPSSSSTATTGPPPKL\nCLVCSDEASGCHYGVLTCGSCKVFFKRAVEGQHNYLCAGRNDCIIDKIRRKNCPACRYRK\nCLQAGMNLEARKTKKKIKGIQQATTGVSQETSENPGNKTIVPATLPQLTPTLVSLLEVIE\nPEVLYAGYDSSVPDSTWRIMTTLNMLGGRQVIAAVKWAKAIPGFRNLHLDDQMTLLQYSW\nMFLMAFALGWRSYRQSSANLLCFAPDLIINEQRMTLPCMYDQCKHMLYVSSELHRLQVSY\nEEYLCMKTLLLLSSVPKDGLKSQELFDEIRMTYIKELGKAIVKREGNSSQNWQRFYQLTK\nLLDSMHEVVENLLNYCFQTFLDKTMSIEFPEMLAEIITNQIPKYSNGNIKKLLFHQK","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0020486|Glucocorticoid receptor 
(NR3C1)\nATGGACTCCAAAGAATCATTAACTCCTGGTAGAGAAGAAAACCCCAGCAGTGTGCTTGCT\nCAGGAGAGGGGAGATGTGATGGACTTCTATAAAACCCTAAGAGGAGGAGCTACTGTGAAG\nGTTTCTGCGTCTTCACCCTCACTGGCTGTCGCTTCTCAATCAGACTCCAAGCAGCGAAGA\nCTTTTGGTTGATTTTCCAAAAGGCTCAGTAAGCAATGCGCAGCAGCCAGATCTGTCCAAA\nGCAGTTTCACTCTCAATGGGACTGTATATGGGAGAGACAGAAACAAAAGTGATGGGAAAT\nGACCTGGGATTCCCACAGCAGGGCCAAATCAGCCTTTCCTCGGGGGAAACAGACTTAAAG\nCTTTTGGAAGAAAGCATTGCAAACCTCAATAGGTCGACCAGTGTTCCAGAGAACCCCAAG\nAGTTCAGCATCCACTGCTGTGTCTGCTGCCCCCACAGAGAAGGAGTTTCCAAAAACTCAC\nTCTGATGTATCTTCAGAACAGCAACATTTGAAGGGCCAGACTGGCACCAACGGTGGCAAT\nGTGAAATTGTATACCACAGACCAAAGCACCTTTGACATTTTGCAGGATTTGGAGTTTTCT\nTCTGGGTCCCCAGGTAAAGAGACGAATGAGAGTCCTTGGAGATCAGACCTGTTGATAGAT\nGAAAACTGTTTGCTTTCTCCTCTGGCGGGAGAAGACGATTCATTCCTTTTGGAAGGAAAC\nTCGAATGAGGACTGCAAGCCTCTCATTTTACCGGACACTAAACCCAAAATTAAGGATAAT\nGGAGATCTGGTTTTGTCAAGCCCCAGTAATGTAACACTGCCCCAAGTGAAAACAGAAAAA\nGAAGATTTCATCGAACTCTGCACCCCTGGGGTAATTAAGCAAGAGAAACTGGGCACAGTT\nTACTGTCAGGCAAGCTTTCCTGGAGCAAATATAATTGGTAATAAAATGTCTGCCATTTCT\nGTTCATGGTGTGAGTACCTCTGGAGGACAGATGTACCACTATGACATGAATACAGCATCC\nCTTTCTCAACAGCAGGATCAGAAGCCTATTTTTAATGTCATTCCACCAATTCCCGTTGGT\nTCCGAAAATTGGAATAGGTGCCAAGGATCTGGAGATGACAACTTGACTTCTCTGGGGACT\nCTGAACTTCCCTGGTCGAACAGTTTTTTCTAATGGCTATTCAAGCCCCAGCATGAGACCA\nGATGTAAGCTCTCCTCCATCCAGCTCCTCAACAGCAACAACAGGACCACCTCCCAAACTC\nTGCCTGGTGTGCTCTGATGAAGCTTCAGGATGTCATTATGGAGTCTTAACTTGTGGAAGC\nTGTAAAGTTTTCTTCAAAAGAGCAGTGGAAGGACAGCACAATTACCTATGTGCTGGAAGG\nAATGATTGCATCATCGATAAAATTCGAAGAAAAAACTGCCCAGCATGCCGCTATCGAAAA\nTGTCTTCAGGCTGGAATGAACCTGGAAGCTCGAAAAACAAAGAAAAAAATAAAAGGAATT\nCAGCAGGCCACTACAGGAGTCTCACAAGAAACCTCTGAAAATCCTGGTAACAAAACAATA\nGTTCCTGCAACGTTACCACAACTCACCCCTACCCTGGTGTCACTGTTGGAGGTTATTGAA\nCCTGAAGTGTTATATGCAGGATATGATAGCTCTGTTCCAGACTCAACTTGGAGGATCATG\nACTACGCTCAACATGTTAGGAGGGCGGCAAGTGATTGCAGCAGTGAAATGGGCAAAGGCA\nATACCAGGTTTCAGGAACTTACACCTGGATGACCAAATGACCCTACTGCAGTACTCCTGG\nATGTTTCTTATGGCATTTGCTCTGGGGTGGAGATCATATAGACAATCAAGTGCAAACCTG\nCTGTGTTTTGCTCCTGATCTGATTATTAATGAGCAGAGAATGACTCTACCCTGCATGTAC\nGACCAAT
GTAAACACATGCTGTATGTTTCCTCTGAGTTACACAGGCTTCAGGTATCTTAT\nGAAGAGTATCTCTGTATGAAAACCTTACTGCTTCTCTCTTCAGTTCCTAAGGACGGTCTG\nAAGAGCCAAGAGCTATTTGATGAAATTAGAATGACCTACATCAAAGAGCTAGGAAAAGCC\nATTGTCAAGAGGGAAGGAAACTCCAGCCAGAACTGGCAGCGGTTTTATCAACTGACAAAA\nCTCTTGGATTCTATGCATGAAGTGGTTGAAAATCTCCTTAACTATTGCTTCCAAACATTT\nTTGGATAAGACCATGAGTATTGAATTCCCCGAGATGTTAGCTGAAATCATCACCAATCAG\nATACCAAAATATTCAAATGGAAATATCAAAAAACTTCTGTTTCATCAAAAGTGA","gene_format":"FASTA","parent_id":"BE0000794"} {"id":"P35372","source":"Swiss-Prot","name":"Mu-type opioid receptor","general_function":"Voltage-gated calcium channel activity","specific_function":"Receptor for endogenous opioids such as beta-endorphin and endomorphin. Receptor for natural and synthetic opioids including morphine, heroin, DAMGO, fentanyl, etorphine, buprenorphin and methadone. Agonist binding to the receptor induces coupling to an inactive GDP-bound heterotrimeric G-protein complex and subsequent exchange of GDP for GTP in the G-protein alpha subunit leading to dissociation of the G-protein complex with the free GTP-bound G-protein alpha and the G-protein beta-gamma dimer activating downstream cellular effectors. The agonist- and cell type-specific activity is predominantly coupled to pertussis toxin-sensitive G(i) and G(o) G alpha proteins, GNAI1, GNAI2, GNAI3 and GNAO1 isoforms Alpha-1 and Alpha-2, and to a lesser extend to pertussis toxin-insensitive G alpha proteins GNAZ and GNA15. They mediate an array of downstream cellular responses, including inhibition of adenylate cyclase activity and both N-type and L-type calcium channels, activation of inward rectifying potassium channels, mitogen-activated protein kinase (MAPK), phospholipase C (PLC), phosphoinositide/protein kinase (PKC), phosphoinositide 3-kinase (PI3K) and regulation of NF-kappa-B. Also couples to adenylate cyclase stimulatory G alpha proteins. 
The selective temporal coupling to G-proteins and subsequent signaling can be regulated by RGSZ proteins, such as RGS9, RGS17 and RGS4. Phosphorylation by members of the GPRK subfamily of Ser/Thr protein kinases and association with beta-arrestins is involved in short-term receptor desensitization. Beta-arrestins associate with the GPRK-phosphorylated receptor and uncouple it from the G-protein thus terminating signal transduction. The phosphorylated receptor is internalized through endocytosis via clathrin-coated pits which involves beta-arrestins. The activation of the ERK pathway occurs either in a G-protein-dependent or a beta-arrestin-dependent manner and is regulated by agonist-specific receptor phosphorylation. Acts as a class A G-protein coupled receptor (GPCR) which dissociates from beta-arrestin at or near the plasma membrane and undergoes rapid recycling. Receptor down-regulation pathways are varying with the agonist and occur dependent or independent of G-protein coupling. Endogenous ligands induce rapid desensitization, endocytosis and recycling whereas morphine induces only low desensitization and endocytosis. Heterooligomerization with other GPCRs can modulate agonist binding, signaling and trafficking properties. Involved in neurogenesis. Isoform 12 couples to GNAS and is proposed to be involved in excitatory effects. 
Isoform 16 and isoform 17 do not bind agonists but may act through oligomerization with binding-competent OPRM1 isoforms and reduce their ligand binding activity.","gene_name":"OPRM1","locus":"6q24-q25","cellular_location":"Cell membrane","transmembrane_regions":"69-93\n107-131\n143-165\n186-207\n231-255\n284-307\n315-338","theoretical_pi":8.29,"molecular_weight":44778.855,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001536|Mu-type opioid receptor\nMDSSAAPTNASNCTDALAYSSCSPAPSPGSWVNLSHLDGNLSDPCGPNRTDLGGRDSLCP\nPTGSPSMITAITIMALYSIVCVVGLFGNFLVMYVIVRYTKMKTATNIYIFNLALADALAT\nSTLPFQSVNYLMGTWPFGTILCKIVISIDYYNMFTSIFTLCTMSVDRYIAVCHPVKALDF\nRTPRNAKIINVCNWILSSAIGLPVMFMATTKYRQGSIDCTLTFSHPTWYWENLLKICVFI\nFAFIMPVLIITVCYGLMILRLKSVRMLSGSKEKDRNLRRITRMVLVVVAVFIVCWTPIHI\nYVIIKALVTIPETTFQTVSWHFCIALGYTNSCLNPVLYAFLDENFKRCFREFCIPTSSNI\nEQQNSTRIRQNTRDHPSTANTVDRTNHQLENLEAETAPLP","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021649|Mu-type opioid receptor (OPRM1)\nATGGACAGCAGCGCTGCCCCCACGAACGCCAGCAATTGCACTGATGCCTTGGCGTACTCA\nAGTTGCTCCCCAGCACCCAGCCCCGGTTCCTGGGTCAACTTGTCCCACTTAGATGGCAAC\nCTGTCCGACCCATGCGGTCCGAACCGCACCGACCTGGGCGGGAGAGACAGCCTGTGCCCT\nCCGACCGGCAGTCCCTCCATGATCACGGCCATCACGATCATGGCCCTCTACTCCATCGTG\nTGCGTGGTGGGGCTCTTCGGAAACTTCCTGGTCATGTATGTGATTGTCAGATACACCAAG\nATGAAGACTGCCACCAACATCTACATTTTCAACCTTGCTCTGGCAGATGCCTTAGCCACC\nAGTACCCTGCCCTTCCAGAGTGTGAATTACCTAATGGGAACATGGCCATTTGGAACCATC\nCTTTGCAAGATAGTGATCTCCATAGATTACTATAACATGTTCACCAGCATATTCACCCTC\nTGCACCATGAGTGTTGATCGATACATTGCAGTCTGCCACCCTGTCAAGGCCTTAGATTTC\nCGTACTCCCCGAAATGCCAAAATTATCAATGTCTGCAACTGGATCCTCTCTTCAGCCATT\nGGTCTTCCTGTAATGTTCATGGCTACAACAAAATACAGGCAAGGTTCCATAGATTGTACA\nCTAACATTCTCTCATCCAACCTGGTACTGGGAAAACCTGCTGAAGATCTGTGTTTTCATC\nTTCGCCTTCATTATGCCAGTGCTCATCATTACCGTGTGCTATGGACTGATGATCTTGCGC\nCTCAAGAGTGTCCGCATGCTCTCTGGCTCCAAAGAAAAGGACAGGAATCTTCGAAGGATC\nACCAGGATGGTGCTGGTGGTGGTGGCTGTGTTCATCGTCTGCTGGACTCCCATTCACATT\nTACGTCATCATTAAAGCCTTGGTTACAATCCCAGAAACTACGTTCCAGACTGTTTCTTGG\nCACTT
CTGCATTGCTCTAGGTTACACAAACAGCTGCCTCAACCCAGTCCTTTATGCATTT\nCTGGATGAAAACTTCAAACGATGCTTCAGAGAGTTCTGTATCCCAACCTCTTCCAACATT\nGAGCAACAAAACTCCACTCGAATTCGTCAGAACACTAGAGACCACCCCTCCACGGCCAAT\nACAGTGGATAGAACTAATCATCAGCTAGAAAATCTGGAAGCAGAAACTGCTCCGTTGCCC\nTAA","gene_format":"FASTA","parent_id":"BE0000770"} {"id":"P41143","source":"Swiss-Prot","name":"Delta-type opioid receptor","general_function":"Opioid receptor activity","specific_function":"G-protein coupled receptor that functions as receptor for endogenous enkephalins and for a subset of other opioids. Ligand binding causes a conformation change that triggers signaling via guanine nucleotide-binding proteins (G proteins) and modulates the activity of down-stream effectors, such as adenylate cyclase. Signaling leads to the inhibition of adenylate cyclase activity. Inhibits neurotransmitter release by reducing calcium ion currents and increasing potassium ion conductance. Plays a role in the perception of pain and in opiate-mediated analgesia. Plays a role in developing analgesic tolerance to morphine.","gene_name":"OPRD1","locus":"1p36.1-p34.3","cellular_location":"Cell membrane","transmembrane_regions":"48-75\n86-110\n123-144\n164-186\n207-238\n262-284\n300-321","theoretical_pi":9.17,"molecular_weight":40368.235,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0010290|Delta-type opioid receptor\nMEPAPSAGAELQPPLFANASDAYPSACPSAGANASGPPGARSASSLALAIAITALYSAVC\nAVGLLGNVLVMFGIVRYTKMKTATNIYIFNLALADALATSTLPFQSAKYLMETWPFGELL\nCKAVLSIDYYNMFTSIFTLTMMSVDRYIAVCHPVKALDFRTPAKAKLINICIWVLASGVG\nVPIMVMAVTRPRDGAVVCMLQFPSPSWYWDTVTKICVFLFAFVVPILIITVCYGLMLLRL\nRSVRLLSGSKEKDRSLRRITRMVLVVVGAFVVCWAPIHIFVIVWTLVDIDRRDPLVVAAL\nHLCIALGYANSSLNPVLYAFLDENFKRCFRQLCRKPCGRPDPSSFSRAREATARERVTAC\nTPSDGPGGGAAA","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010291|Delta-type opioid receptor 
(OPRD1)\nATGGAACCGGCCCCCTCCGCCGGCGCCGAGCTGCAGCCCCCGCTCTTCGCCAACGCCTCG\nGACGCCTACCCTAGCGCCTGCCCCAGCGCTGGCGCCAATGCGTCGGGGCCGCCAGGCGCG\nCGGAGCGCCTCGTCCCTCGCCCTGGCAATCGCCATCACCGCGCTCTACTCGGCCGTGTGC\nGCCGTGGGGCTGCTGGGCAACGTGCTTGTCATGTTCGGCATCGTCCGGTACACTAAGATG\nAAGACGGCCACCAACATCTACATCTTCAACCTGGCCTTAGCCGATGCGCTGGCCACCAGC\nACGCTGCCTTTCCAGAGTGCCAAGTACCTGATGGAGACGTGGCCCTTCGGCGAGCTGCTC\nTGCAAGGCTGTGCTCTCCATCGACTACTACAATATGTTCACCAGCATCTTCACGCTCACC\nATGATGAGTGTTGACCGCTACATCGCTGTCTGCCACCCTGTCAAGGCCCTGGACTTCCGC\nACGCCTGCCAAGGCCAAGCTGATCAACATCTGTATCTGGGTCCTGGCCTCAGGCGTTGGC\nGTGCCCATCATGGTCATGGCTGTGACCCGTCCCCGGGACGGGGCAGTGGTGTGCATGCTC\nCAGTTCCCCAGCCCCAGCTGGTACTGGGACACGGTGACCAAGATCTGCGTGTTCCTCTTC\nGCCTTCGTGGTGCCCATCCTCATCATCACCGTGTGCTATGGCCTCATGCTGCTGCGCCTG\nCGCAGTGTGCGCCTGCTGTCGGGCTCCAAGGAGAAGGACCGCAGCCTGCGGCGCATCACG\nCGCATGGTGCTGGTGGTTGTGGGCGCCTTCGTGGTGTGTTGGGCGCCCATCCACATCTTC\nGTCATCGTCTGGACGCTGGTGGACATCGACCGGCGCGACCCGCTGGTGGTGGCTGCGCTG\nCACCTGTGCATCGCGCTGGGCTACGCCAATAGCAGCCTCAACCCCGTGCTCTACGCTTTC\nCTCGACGAGAACTTCAAGCGCTGCTTCCGCCAGCTCTGCCGCAAGCCCTGCGGCCGCCCA\nGACCCCAGCAGCTTCAGCCGCGCCCGCGAAGCCACGGCCCGCGAGCGTGTCACCGCCTGC\nACCCCGTCCGATGGTCCCGGCGGTGGCGCTGCCGCCTGA","gene_format":"FASTA","parent_id":"BE0000420"} {"id":"O90777","source":"TrEMBL","name":"HIV-1 protease","general_function":"Aspartic-type endopeptidase activity","gene_name":"HIV-1 protease","cellular_location":"Cytoplasmic","theoretical_pi":8.77,"molecular_weight":10724.61,"organism":"Human Immunodeficiency Virus","amino_acid_sequence":">lcl|BSEQ0010463|HIV-1 protease\nPQVTLWQRPIVTIKIGGQLKEALLDTGADDTVLEEMSLPGKWKPKMIGGIGGFIKVRQYD\nQVSIEICGHKAIGTVLIGPTPVNIIGRNLLTQLGCTLNF","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0001323|297 
bp\nCCTCAGGTCACTCTTTGGCAACGACCCATAGTCACAATAAAGATAGGGGGGCAACTAAAG\nGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATTAGAAGAAATGAGTTTGCCAGGA\nAAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTATCAAAGTAAGACAGTATGAT\nCAGGTATCCATAGAAATCTGCGGACATAAAGCTATAGGTACAGTATTAATAGGACCTACA\nCCTGTCAACATAATTGGAAGGAATCTGTTGACTCAGCTTGGCTGCACTTTAAATTTT","gene_format":"FASTA","parent_id":"BE0000664"} {"id":"Q72874","source":"TrEMBL","name":"Pol polyprotein","general_function":"Aspartic-type endopeptidase activity","gene_name":"pol","molecular_weight":10778.7,"organism":"Human immunodeficiency virus 1","organism_ncbi_taxonomy_id":11676,"amino_acid_sequence":">lcl|BSEQ0020806|Pol polyprotein\nPQVTLWQRPLVTIKIGGQLKEALLDTGADDTVLEEMSLPGRWKPKMIGGIGGFIKVRQYD\nQILIEICGHKAIGTVLVGPTPVNIIGRNLLTQIGCTLNF","amino_acid_format":"FASTA","gene_format":"FASTA","parent_id":"BE0004854"} {"id":"P04083","source":"Swiss-Prot","name":"Annexin A1","general_function":"Structural molecule activity","specific_function":"Plays important roles in the innate immune response as effector of glucocorticoid-mediated responses and regulator of the inflammatory process. Has anti-inflammatory activity (PubMed:8425544). Plays a role in glucocorticoid-mediated down-regulation of the early phase of the inflammatory response (By similarity). Promotes resolution of inflammation and wound healing (PubMed:25664854). Functions at least in part by activating the formyl peptide receptors and downstream signaling cascades (PubMed:15187149, PubMed:25664854). Promotes chemotaxis of granulocytes and monocytes via activation of the formyl peptide receptors (PubMed:15187149). Contributes to the adaptive immune response by enhancing signaling cascades that are triggered by T-cell activation, regulates differentiation and proliferation of activated T-cells (PubMed:17008549). Promotes the differentiation of T-cells into Th1 cells and negatively regulates differentiation into Th2 cells (PubMed:17008549). Has no effect on unstimulated T cells (PubMed:17008549). 
Promotes rearrangement of the actin cytoskeleton, cell polarization and cell migration (PubMed:15187149). Negatively regulates hormone exocytosis via activation of the formyl peptide receptors and reorganization of the actin cytoskeleton (PubMed:19625660). Has high affinity for Ca(2+) and can bind up to eight Ca(2+) ions (By similarity). Displays Ca(2+)-dependent binding to phospholipid membranes (PubMed:2532504, PubMed:8557678). Plays a role in the formation of phagocytic cups and phagosomes. Plays a role in phagocytosis by mediating the Ca(2+)-dependent interaction between phagosomes and the actin cytoskeleton (By similarity).","gene_name":"ANXA1","locus":"9q12-q21.2|9q12-q21.2","cellular_location":"Nucleus","theoretical_pi":7.04,"molecular_weight":38713.855,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0010292|Annexin A1\nMAMVSEFLKQAWFIENEEQEYVQTVKSSKGGPGSAVSPYPTFNPSSDVAALHKAIMVKGV\nDEATIIDILTKRNNAQRQQIKAAYLQETGKPLDETLKKALTGHLEEVVLALLKTPAQFDA\nDELRAAMKGLGTDEDTLIEILASRTNKEIRDINRVYREELKRDLAKDITSDTSGDFRNAL\nLSLAKGDRSEDFGVNEDLADSDARALYEAGERRKGTDVNVFNTILTTRSYPQLRRVFQKY\nTKYSKHDMNKVLDLELKGDIEKCLTAIVKCATSKPAFFAEKLHQAMKGVGTRHKALIRIM\nVSRSEIDMNDIKAFYQKMYGISLCQAILDETKGDYEKILVALCGGN","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010293|Annexin A1 
(ANXA1)\nATGGCAATGGTATCAGAATTCCTCAAGCAGGCCTGGTTTATTGAAAATGAAGAGCAGGAA\nTATGTTCAAACTGTGAAGTCATCCAAAGGTGGTCCCGGATCAGCGGTGAGCCCCTATCCT\nACCTTCAATCCATCCTCGGATGTCGCTGCCTTGCATAAGGCCATAATGGTTAAAGGTGTG\nGATGAAGCAACCATCATTGACATTCTAACTAAGCGAAACAATGCACAGCGTCAACAGATC\nAAAGCAGCATATCTCCAGGAAACAGGAAAGCCCCTGGATGAAACACTGAAGAAAGCCCTT\nACAGGTCACCTTGAGGAGGTTGTTTTAGCTCTGCTAAAAACTCCAGCGCAATTTGATGCT\nGATGAACTTCGTGCTGCCATGAAGGGCCTTGGAACTGATGAAGATACTCTAATTGAGATT\nTTGGCATCAAGAACTAACAAAGAAATCAGAGACATTAACAGGGTCTACAGAGAGGAACTG\nAAGAGAGATCTGGCCAAAGACATAACCTCAGACACATCTGGAGATTTTCGGAACGCTTTG\nCTTTCTCTTGCTAAGGGTGACCGATCTGAGGACTTTGGTGTGAATGAAGACTTGGCTGAT\nTCAGATGCCAGGGCCTTGTATGAAGCAGGAGAAAGGAGAAAGGGGACAGACGTAAACGTG\nTTCAATACCATCCTTACCACCAGAAGCTATCCACAACTTCGCAGAGTGTTTCAGAAATAC\nACCAAGTACAGTAAGCATGACATGAACAAAGTTCTGGACCTGGAGTTGAAAGGTGACATT\nGAGAAATGCCTCACAGCTATCGTGAAGTGCGCCACAAGCAAACCAGCTTTCTTTGCAGAG\nAAGCTTCATCAAGCCATGAAAGGTGTTGGAACTCGCCATAAGGCATTGATCAGGATTATG\nGTTTCCCGTTCTGAAATTGACATGAATGATATCAAAGCATTCTATCAGAAGATGTATGGT\nATCTCCCTTTGCCAAGCCATCCTGGATGAAACCAAAGGAGATTATGAGAAAATCCTGGTG\nGCTCTTTGTGGAGGAAACTAA","gene_format":"FASTA","parent_id":"BE0000422"} {"id":"P07437","source":"Swiss-Prot","name":"Tubulin beta chain","general_function":"Ubiquitin protein ligase binding","specific_function":"Tubulin is the major constituent of microtubules. 
It binds two moles of GTP, one at an exchangeable site on the beta chain and one at a non-exchangeable site on the alpha chain.","gene_name":"TUBB","locus":"6p25","cellular_location":"Cytoplasm","theoretical_pi":4.52,"molecular_weight":49670.515,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0009963|Tubulin beta chain\nMREIVHIQAGQCGNQIGAKFWEVISDEHGIDPTGTYHGDSDLQLDRISVYYNEATGGKYV\nPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDVV\nRKEAESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVVPSPKVSDTVV\nEPYNATLSVHQLVENTDETYCIDNEALYDICFRTLKLTTPTYGDLNHLVSATMSGVTTCL\nRFPGQLNADLRKLAVNMVPFPRLHFFMPGFAPLTSRGSQQYRALTVPELTQQVFDAKNMM\nAACDPRHGRYLTVAAVFRGRMSMKEVDEQMLNVQNKNSSYFVEWIPNNVKTAVCDIPPRG\nLKMAVTFIGNSTAIQELFKRISEQFTAMFRRKAFLHWYTGEGMDEMEFTEAESNMNDLVS\nEYQQYQDATAEEEEDFGEEAEEEA","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0009964|Tubulin beta chain (TUBB)\nATGGGAGATGTGGGGCCGAGGTGGGCGAATTGGGATCCCTCCAGGTCAGGGGTTCGAGAC\nCATCCTGGGCAACAAAGCGAGACCCTCCCCCATGCCACGTTTCTACAAAAAATAAAATTC\nTGGGAGGTGATCAGTGATGAACATGGCATCGACCCCACCGGCACCTACCACGGGGACAGC\nGACCTGCAGCTGGACCGCATCTCTGTGTACTACAATGAAGCCACAGGTGGCAAATATGTT\nCCTCGTGCCATCCTGGTGGATCTAGAACCTGGGACCATGGACTCTGTTCGCTCAGGTCCT\nTTTGGCCAGATCTTTAGACCAGACAACTTTGTATTTGGTCAGTCTGGGGCAGGTAACAAC\nTGGGCCAAAGGCCACTACACAGAGGGCGCCGAGCTGGTTGATTCTGTCCTGGATGTGGTA\nCGGAAGGAGGCAGAGAGCTGTGACTGCCTGCAGGGCTTCCAGCTGACCCACTCACTGGGC\nGGGGGCACAGGCTCTGGAATGGGCACTCTCCTTATCAGCAAGATCCGAGAAGAATACCCT\nGATCGCATCATGAATACCTTCAGTGTGGTGCCTTCACCCAAAGTGTCTGACACCGTGGTC\nGAGCCCTACAATGCCACCCTCTCCGTCCATCAGTTGGTAGAGAATACTGATGAGACCTAT\nTGCATTGACAACGAGGCCCTCTATGATATCTGCTTCCGCACTCTGAAGCTGACCACACCA\nACCTACGGGGATCTGAACCACCTTGTCTCAGCCACCATGAGTGGTGTCACCACCTGCCTC\nCGTTTCCCTGGCCAGCTCAATGCTGACCTCCGCAAGTTGGCAGTCAACATGGTCCCCTTC\nCCACGTCTCCATTTCTTTATGCCTGGCTTTGCCCCTCTCACCAGCCGTGGAAGCCAGCAG\nTATCGAGCTCTCACAGTGCCGGAACTCACCCAGCAGGTCTTCGATGCCAAGAACATGATG\nGCTGCCTGTGACCCCCGCCACGGCCGATACCTCACCGTGGCTGCTGTCTTCCGTGGTCGG\nATGTCCATGAAGGAGGTCGATGAGCAGATGCTTAACGT
GCAGAACAAGAACAGCAGCTAC\nTTTGTGGAATGGATCCCCAACAATGTCAAGACAGCCGTCTGTGACATCCCACCTCGTGGC\nCTCAAGATGGCAGTCACCTTCATTGGCAATAGCACAGCCATCCAGGAGCTCTTCAAGCGC\nATCTCGGAGCAGTTCACTGCCATGTTCCGCCGGAAGGCCTTCCTCCACTGGTACACAGGC\nGAGGGCATGGACGAGATGGAGTTCACCGAGGCTGAGAGCAACATGAACGACCTCGTCTCT\nGAGTATCAGCAGTACCAGGATGCCACCGCAGAAGAGGAGGAGGATTTCGGTGAGGAGGCC\nGAAGAGGAGGCCTAA","gene_format":"FASTA","parent_id":"BE0000143"} {"id":"Q9NR96","source":"Swiss-Prot","name":"Toll-like receptor 9","general_function":"Transmembrane signaling receptor activity","specific_function":"Key component of innate and adaptive immunity. TLRs (Toll-like receptors) control host immune response against pathogens through recognition of molecular patterns specific to microorganisms. TLR9 is a nucleotide-sensing TLR which is activated by unmethylated cytidine-phosphate-guanosine (CpG) dinucleotides. Acts via MYD88 and TRAF6, leading to NF-kappa-B activation, cytokine secretion and the inflammatory response. Controls lymphocyte response to Helicobacter infection.","gene_name":"TLR9","locus":"3p21.3","cellular_location":"Endoplasmic reticulum membrane","transmembrane_regions":"819-839","signal_regions":"1-25","theoretical_pi":8.23,"molecular_weight":115858.665,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0002329|Toll-like receptor 
9\nMGFCRSALHPLSLLVQAIMLAMTLALGTLPAFLPCELQPHGLVNCNWLFLKSVPHFSMAA\nPRGNVTSLSLSSNRIHHLHDSDFAHLPSLRHLNLKWNCPPVGLSPMHFPCHMTIEPSTFL\nAVPTLEELNLSYNNIMTVPALPKSLISLSLSHTNILMLDSASLAGLHALRFLFMDGNCYY\nKNPCRQALEVAPGALLGLGNLTHLSLKYNNLTVVPRNLPSSLEYLLLSYNRIVKLAPEDL\nANLTALRVLDVGGNCRRCDHAPNPCMECPRHFPQLHPDTFSHLSRLEGLVLKDSSLSWLN\nASWFRGLGNLRVLDLSENFLYKCITKTKAFQGLTQLRKLNLSFNYQKRVSFAHLSLAPSF\nGSLVALKELDMHGIFFRSLDETTLRPLARLPMLQTLRLQMNFINQAQLGIFRAFPGLRYV\nDLSDNRISGASELTATMGEADGGEKVWLQPGDLAPAPVDTPSSEDFRPNCSTLNFTLDLS\nRNNLVTVQPEMFAQLSHLQCLRLSHNCISQAVNGSQFLPLTGLQVLDLSHNKLDLYHEHS\nFTELPRLEALDLSYNSQPFGMQGVGHNFSFVAHLRTLRHLSLAHNNIHSQVSQQLCSTSL\nRALDFSGNALGHMWAEGDLYLHFFQGLSGLIWLDLSQNRLHTLLPQTLRNLPKSLQVLRL\nRDNYLAFFKWWSLHFLPKLEVLDLAGNQLKALTNGSLPAGTRLRRLDVSCNSISFVAPGF\nFSKAKELRELNLSANALKTVDHSWFGPLASALQILDVSANPLHCACGAAFMDFLLEVQAA\nVPGLPSRVKCGSPGQLQGLSIFAQDLRLCLDEALSWDCFALSLLAVALGLGVPMLHHLCG\nWDLWYCFHLCLAWLPWRGRQSGRDEDALPYDAFVVFDKTQSAVADWVYNELRGQLEECRG\nRWALRLCLEERDWLPGKTLFENLWASVYGSRKTLFVLAHTDRVSGLLRASFLLAQQRLLE\nDRKDVVVLVILSPDGRRSRYVRLRQRLCRQSVLLWPHQPSGQRSFWAQLGMALTRDNHHF\nYNRNFCQGPTAE","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0019078|Toll-like receptor 9 
(TLR9)\nATGGGTTTCTGCCGCAGCGCCCTGCACCCGCTGTCTCTCCTGGTGCAGGCCATCATGCTG\nGCCATGACCCTGGCCCTGGGTACCTTGCCTGCCTTCCTACCCTGTGAGCTCCAGCCCCAC\nGGCCTGGTGAACTGCAACTGGCTGTTCCTGAAGTCTGTGCCCCACTTCTCCATGGCAGCA\nCCCCGTGGCAATGTCACCAGCCTTTCCTTGTCCTCCAACCGCATCCACCACCTCCATGAT\nTCTGACTTTGCCCACCTGCCCAGCCTGCGGCATCTCAACCTCAAGTGGAACTGCCCGCCG\nGTTGGCCTCAGCCCCATGCACTTCCCCTGCCACATGACCATCGAGCCCAGCACCTTCTTG\nGCTGTGCCCACCCTGGAAGAGCTAAACCTGAGCTACAACAACATCATGACTGTGCCTGCG\nCTGCCCAAATCCCTCATATCCCTGTCCCTCAGCCATACCAACATCCTGATGCTAGACTCT\nGCCAGCCTCGCCGGCCTGCATGCCCTGCGCTTCCTATTCATGGACGGCAACTGTTATTAC\nAAGAACCCCTGCAGGCAGGCACTGGAGGTGGCCCCGGGTGCCCTCCTTGGCCTGGGCAAC\nCTCACCCACCTGTCACTCAAGTACAACAACCTCACTGTGGTGCCCCGCAACCTGCCTTCC\nAGCCTGGAGTATCTGCTGTTGTCCTACAACCGCATCGTCAAACTGGCGCCTGAGGACCTG\nGCCAATCTGACCGCCCTGCGTGTGCTCGATGTGGGCGGAAATTGCCGCCGCTGCGACCAC\nGCTCCCAACCCCTGCATGGAGTGCCCTCGTCACTTCCCCCAGCTACATCCCGATACCTTC\nAGCCACCTGAGCCGTCTTGAAGGCCTGGTGTTGAAGGACAGTTCTCTCTCCTGGCTGAAT\nGCCAGTTGGTTCCGTGGGCTGGGAAACCTCCGAGTGCTGGACCTGAGTGAGAACTTCCTC\nTACAAATGCATCACTAAAACCAAGGCCTTCCAGGGCCTAACACAGCTGCGCAAGCTTAAC\nCTGTCCTTCAATTACCAAAAGAGGGTGTCCTTTGCCCACCTGTCTCTGGCCCCTTCCTTC\nGGGAGCCTGGTCGCCCTGAAGGAGCTGGACATGCACGGCATCTTCTTCCGCTCACTCGAT\nGAGACCACGCTCCGGCCACTGGCCCGCCTGCCCATGCTCCAGACTCTGCGTCTGCAGATG\nAACTTCATCAACCAGGCCCAGCTCGGCATCTTCAGGGCCTTCCCTGGCCTGCGCTACGTG\nGACCTGTCGGACAACCGCATCAGCGGAGCTTCGGAGCTGACAGCCACCATGGGGGAGGCA\nGATGGAGGGGAGAAGGTCTGGCTGCAGCCTGGGGACCTTGCTCCGGCCCCAGTGGACACT\nCCCAGCTCTGAAGACTTCAGGCCCAACTGCAGCACCCTCAACTTCACCTTGGATCTGTCA\nCGGAACAACCTGGTGACCGTGCAGCCGGAGATGTTTGCCCAGCTCTCGCACCTGCAGTGC\nCTGCGCCTGAGCCACAACTGCATCTCGCAGGCAGTCAATGGCTCCCAGTTCCTGCCGCTG\nACCGGTCTGCAGGTGCTAGACCTGTCCCACAATAAGCTGGACCTCTACCACGAGCACTCA\nTTCACGGAGCTACCGCGACTGGAGGCCCTGGACCTCAGCTACAACAGCCAGCCCTTTGGC\nATGCAGGGCGTGGGCCACAACTTCAGCTTCGTGGCTCACCTGCGCACCCTGCGCCACCTC\nAGCCTGGCCCACAACAACATCCACAGCCAAGTGTCCCAGCAGCTCTGCAGTACGTCGCTG\nCGGGCCCTGGACTTCAGCGGCAATGCACTGGGCCATATGTGGGCCGAGGGAGACCTCTAT\nCTGCACTTCTTCCAAGGCCTGAGCGGTTTGATCTGGCTGGACTTGTCCCAGAACCGCCTG\nCACACCCT
CCTGCCCCAAACCCTGCGCAACCTCCCCAAGAGCCTACAGGTGCTGCGTCTC\nCGTGACAATTACCTGGCCTTCTTTAAGTGGTGGAGCCTCCACTTCCTGCCCAAACTGGAA\nGTCCTCGACCTGGCAGGAAACCAGCTGAAGGCCCTGACCAATGGCAGCCTGCCTGCTGGC\nACCCGGCTCCGGAGGCTGGATGTCAGCTGCAACAGCATCAGCTTCGTGGCCCCCGGCTTC\nTTTTCCAAGGCCAAGGAGCTGCGAGAGCTCAACCTTAGCGCCAACGCCCTCAAGACAGTG\nGACCACTCCTGGTTTGGGCCCCTGGCGAGTGCCCTGCAAATACTAGATGTAAGCGCCAAC\nCCTCTGCACTGCGCCTGTGGGGCGGCCTTTATGGACTTCCTGCTGGAGGTGCAGGCTGCC\nGTGCCCGGTCTGCCCAGCCGGGTGAAGTGTGGCAGTCCGGGCCAGCTCCAGGGCCTCAGC\nATCTTTGCACAGGACCTGCGCCTCTGCCTGGATGAGGCCCTCTCCTGGGACTGTTTCGCC\nCTCTCGCTGCTGGCTGTGGCTCTGGGCCTGGGTGTGCCCATGCTGCATCACCTCTGTGGC\nTGGGACCTCTGGTACTGCTTCCACCTGTGCCTGGCCTGGCTTCCCTGGCGGGGGCGGCAA\nAGTGGGCGAGATGAGGATGCCCTGCCCTACGATGCCTTCGTGGTCTTCGACAAAACGCAG\nAGCGCAGTGGCAGACTGGGTGTACAACGAGCTTCGGGGGCAGCTGGAGGAGTGCCGTGGG\nCGCTGGGCACTCCGCCTGTGCCTGGAGGAACGCGACTGGCTGCCTGGCAAAACCCTCTTT\nGAGAACCTGTGGGCCTCGGTCTATGGCAGCCGCAAGACGCTGTTTGTGCTGGCCCACACG\nGACCGGGTCAGTGGTCTCTTGCGCGCCAGCTTCCTGCTGGCCCAGCAGCGCCTGCTGGAG\nGACCGCAAGGACGTCGTGGTGCTGGTGATCCTGAGCCCTGACGGCCGCCGCTCCCGCTAC\nGTGCGGCTGCGCCAGCGCCTCTGCCGCCAGAGTGTCCTCCTCTGGCCCCACCAGCCCAGT\nGGTCAGCGCAGCTTCTGGGCCCAGCTGGGCATGGCCCTGACCAGGGACAACCACCACTTC\nTATAACCGGAACTTCTGCCAGGGACCCACGGCCGAATAG","gene_format":"FASTA","parent_id":"BE0001171"} {"id":"Q8MU52","source":"Swiss-Prot","name":"Glutathione S-transferase","general_function":"Glutathione transferase activity","specific_function":"Conjugation of reduced glutathione to a wide number of exogenous and endogenous hydrophobic electrophiles. 
May also function as a storage protein or ligandin for parasitotoxic ferriprotoporphyrin IX (hemin).","gene_name":"GST","theoretical_pi":6.28,"molecular_weight":24788.9,"organism":"Plasmodium falciparum","organism_ncbi_taxonomy_id":5833,"amino_acid_sequence":">lcl|BSEQ0019314|Glutathione S-transferase\nMGDNIVLYYFDARGKAELIRLIFAYLGIEYTDKRFGVNGDAFVEFKNFKKEKDTPFEQVP\nILQIGDLILAQSQAIVRYLSKKYNICGESELNEFYADMIFCGVQDIHYKFNNTNLFKQNE\nTTFLNEDLPKWSGYFEKLLKKNHTNNNNDKYYFVGNNLTYADLAVFNLYDDIETKYPSSL\nKNFPLLKAHNEFISNLPNIKNYITNRKESVY","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0005215|642 bp\nATGAAGCTTGGAGATAATATAGTGTTATATTATTTTGATGCAAGGGGTAAAGCTGAATTG\nATAAGATTAATTTTTGCCTACCTTGGAATAGAATATACAGATAAAAGATTTGGAGTAAAC\nGGTGATGCTTTTGTTGAATTTAAAAATTTTAAAAAAGAAAAGGATACTCCTTTTGAGCAA\nGTACCCATATTACAAATTGGAGATTTGATATTAGCTCAAAGCCAAGCTATAGTTCGTTAT\nTTATCAAAAAAATATAATATATGTGGTGAAAGTGAATTGAATGAATTTTATGCAGATATG\nATATTTTGTGGTGTTCAGGATATTCATTATAAATTTAATAATACCAATTTATTTAAACAA\nAATGAAACAACTTTTTTAAATGAGGATTTACCAAAATGGTCAGGTTATTTTGAGAAACTT\nTTAAAAAAAAATCATACAAATAATAATAATGATAAATATTATTTTGTAGGTAATAATTTA\nACATATGCTGACCTAGCTGTTTTTAATTTATATGATGATATTGAAACAAAATATCCAAGT\nAGCTTGAAAAATTTCCCTTTATTAAAAGCTCATAATGAATTTATAAGTAACTTGCCTAAT\nATCAAAAATTATATAACTAATAGAAAAGAAAGTGTATACTAA","gene_format":"FASTA","parent_id":"BE0002659"} {"id":"P09429","source":"Swiss-Prot","name":"High mobility group protein B1","general_function":"Transcription factor binding","specific_function":"Multifunctional redox sensitive protein with various roles in different cellular compartments. In the nucleus is one of the major chromatin-associated non-histone proteins and acts as a DNA chaperone involved in replication, transcription, chromatin remodeling, V(D)J recombination, DNA repair and genome stability. Proposed to be an universal biosensor for nucleic acids. Promotes host inflammatory response to sterile and infectious signals and is involved in the coordination and integration of innate and adaptive immune responses. 
In the cytoplasm functions as sensor and/or chaperone for immunogenic nucleic acids implicating the activation of TLR9-mediated immune responses, and mediates autophagy. Acts as danger associated molecular pattern (DAMP) molecule that amplifies immune responses during tissue injury. Released to the extracellular environment can bind DNA, nucleosomes, IL-1 beta, CXCL12, AGER isoform 2/sRAGE, lipopolysaccharide (LPS) and lipoteichoic acid (LTA), and activates cells through engagement of multiple surface receptors. In the extracellular compartment fully reduced HMGB1 (released by necrosis) acts as a chemokine, disulfide HMGB1 (actively secreted) as a cytokine, and sulfonyl HMGB1 (released from apoptotic cells) promotes immunological tolerance (PubMed:23519706, PubMed:23446148, PubMed:23994764, PubMed:25048472). Has proangiogenic activity (By similarity). May be involved in platelet activation (By similarity). Binds to phosphatidylserine and phosphatidylethanolamine (By similarity). Bound to RAGE mediates signaling for neuronal outgrowth (By similarity). May play a role in accumulation of expanded polyglutamine (polyQ) proteins such as huntingtin (HTT) or TBP (PubMed:23303669, PubMed:25549101). Nuclear functions are attributed to fully reduced HMGB1. Associates with chromatin and binds DNA with a preference to non-canonical DNA structures such as single-stranded DNA, DNA-containing cruciforms or bent structures, supercoiled DNA and Z-DNA. Can bend DNA and enhance DNA flexibility by looping thus providing a mechanism to promote activities on various gene promoters by enhancing transcription factor binding and/or bringing distant regulatory sequences into close proximity (PubMed:20123072). May have an enhancing role in nucleotide excision repair (NER) (By similarity). However, effects in NER using in vitro systems have been reported conflictingly (PubMed:19446504, PubMed:19360789). 
May be involved in mismatch repair (MMR) and base excision repair (BER) pathways (PubMed:15014079, PubMed:16143102, PubMed:17803946). May be involved in double strand break repair such as non-homologous end joining (NHEJ) (By similarity). Involved in V(D)J recombination by acting as a cofactor of the RAG complex: acts by stimulating cleavage and RAG protein binding at the 23 bp spacer of conserved recombination signal sequences (RSS) (By similarity). In vitro can displace histone H1 from highly bent DNA (By similarity). Can restructure the canonical nucleosome leading to relaxation of structural constraints for transcription factor-binding (By similarity). Enhances binding of sterol regulatory element-binding proteins (SREBPs) such as SREBF1 to their cognate DNA sequences and increases their transcriptional activities (By similarity). Facilitates binding of TP53 to DNA (PubMed:23063560). Proposed to be involved in mitochondrial quality control and autophagy in a transcription-dependent fashion implicating HSPB1; however, this function has been questioned (By similarity). Can modulate the activity of the telomerase complex and may be involved in telomere maintenance (By similarity). In the cytoplasm proposed to dissociate the BECN1:BCL2 complex via competitive interaction with BECN1 leading to autophagy activation (PubMed:20819940). Involved in oxidative stress-mediated autophagy (PubMed:21395369). Can protect BECN1 and ATG5 from calpain-mediated cleavage and thus proposed to control their proautophagic and proapoptotic functions and to regulate the extent and severity of inflammation-associated cellular injury (By similarity). In myeloid cells has a protective role against endotoxemia and bacterial infection by promoting autophagy (By similarity). 
Involved in endosomal translocation and activation of TLR9 in response to CpG-DNA in macrophages (By similarity). In the extracellular compartment (following either active secretion or passive release) involved in regulation of the inflammatory response. Fully reduced HMGB1 (which subsequently gets oxidized after release) in association with CXCL12 mediates the recruitment of inflammatory cells during the initial phase of tissue injury; the CXCL12:HMGB1 complex triggers CXCR4 homodimerization (PubMed:22370717). Induces the migration of monocyte-derived immature dendritic cells and seems to regulate adhesive and migratory functions of neutrophils implicating AGER/RAGE and ITGAM (By similarity). Can bind to various types of DNA and RNA including microbial unmethylated CpG-DNA to enhance the innate immune response to nucleic acids. Proposed to act in promiscuous DNA/RNA sensing which cooperates with subsequent discriminative sensing by specific pattern recognition receptors (By similarity). Promotes extracellular DNA-induced AIM2 inflammasome activation implicating AGER/RAGE (PubMed:24971542). Disulfide HMGB1 binds to transmembrane receptors, such as AGER/RAGE, TLR2, TLR4 and probably TREM1, thus activating their signal transduction pathways. Mediates the release of cytokines/chemokines such as TNF, IL-1, IL-6, IL-8, CCL2, CCL3, CCL4 and CXCL10 (PubMed:12765338, PubMed:18354232, PubMed:19264983, PubMed:20547845, PubMed:24474694). Promotes secretion of interferon-gamma by macrophage-stimulated natural killer (NK) cells in concert with other cytokines like IL-2 or IL-12 (PubMed:15607795). TLR4 is proposed to be the primary receptor promoting macrophage activation and signaling through TLR4 seems to implicate LY96/MD-2 (PubMed:20547845). In bacterial LPS- or LTA-mediated inflammatory responses binds to the endotoxins and transfers them to CD14 for signaling to the respective TLR4:LY96 and TLR2 complexes (PubMed:18354232, PubMed:21660935, PubMed:25660311). 
Contributes to tumor proliferation by association with AGER/RAGE (By similarity). Can bind to IL1-beta and signals through the IL1R1:IL1RAP receptor complex (PubMed:18250463). Binding to class A CpG activates cytokine production in plasmacytoid dendritic cells implicating TLR9, MYD88 and AGER/RAGE and can activate autoreactive B cells. Via HMGB1-containing chromatin immune complexes may also promote B cell responses to endogenous TLR9 ligands through a B-cell receptor (BCR)-dependent and AGER/RAGE-independent mechanism (By similarity). Inhibits phagocytosis of apoptotic cells by macrophages; the function is dependent on poly-ADP-ribosylation and involves binding to phosphatidylserine on the cell surface of apoptotic cells (By similarity). In adaptive immunity may be involved in enhancing immunity through activation of effector T cells and suppression of regulatory T (TReg) cells (PubMed:15944249, PubMed:22473704). In contrast, without implicating effector or regulatory T cells, required for tumor infiltration and activation of T cells expressing the lymphotoxin LTA:LTB heterotrimer thus promoting tumor malignant progression (By similarity). Also reported to limit proliferation of T cells (By similarity). Released HMGB1:nucleosome complexes formed during apoptosis can signal through TLR2 to induce cytokine production (PubMed:19064698). Involved in induction of immunological tolerance by apoptotic cells; its pro-inflammatory activities when released by apoptotic cells are neutralized by reactive oxygen species (ROS)-dependent oxidation specifically on Cys-106 (PubMed:18631454). 
During macrophage activation by activated lymphocyte-derived self apoptotic DNA (ALD-DNA) promotes recruitment of ALD-DNA to endosomes (By similarity).","gene_name":"HMGB1","cellular_location":"Nucleus","theoretical_pi":5.42,"molecular_weight":24893.58,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0021855|High mobility group protein B1\nMGKGDPKKPRGKMSSYAFFVQTCREEHKKKHPDASVNFSEFSKKCSERWKTMSAKEKGKF\nEDMAKADKARYEREMKTYIPPKGETKKKFKDPNAPKRPPSAFFLFCSEYRPKIKGEHPGL\nSIGDVAKKLGEMWNNTAADDKQPYEKKAAKLKEKYEKDIAAYRAKGKPDAAKKGVVKAEK\nSKKKKEEEEDEEDEEDEEEEEDEEDEDEEEDDDDE","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021856|High mobility group protein B1 (HMGB1)\nATGGGCAAAGGAGATCCTAAGAAGCCGAGAGGCAAAATGTCATCATATGCATTTTTTGTG\nCAAACTTGTCGGGAGGAGCATAAGAAGAAGCACCCAGATGCTTCAGTCAACTTCTCAGAG\nTTTTCTAAGAAGTGCTCAGAGAGGTGGAAGACCATGTCTGCTAAAGAGAAAGGAAAATTT\nGAAGATATGGCAAAAGCGGACAAGGCCCGTTATGAAAGAGAAATGAAAACCTATATCCCT\nCCCAAAGGGGAGACAAAAAAGAAGTTCAAGGATCCCAATGCACCCAAGAGGCCTCCTTCG\nGCCTTCTTCCTCTTCTGCTCTGAGTATCGCCCAAAAATCAAAGGAGAACATCCTGGCCTG\nTCCATTGGTGATGTTGCGAAGAAACTGGGAGAGATGTGGAATAACACTGCTGCAGATGAC\nAAGCAGCCTTATGAAAAGAAGGCTGCGAAGCTGAAGGAAAAATACGAAAAGGATATTGCT\nGCATATCGAGCTAAAGGAAAGCCTGATGCAGCAAAAAAGGGAGTTGTCAAGGCTGAAAAA\nAGCAAGAAAAAGAAGGAAGAGGAGGAAGATGAGGAAGATGAAGAGGATGAGGAGGAGGAG\nGAAGATGAAGAAGATGAAGATGAAGAAGAAGATGATGATGATGAATAA","gene_format":"FASTA","parent_id":"BE0003428"} {"id":"Q9BYF1","source":"Swiss-Prot","name":"Angiotensin-converting enzyme 2","general_function":"Zinc ion binding","specific_function":"Carboxypeptidase which converts angiotensin I to angiotensin 1-9, a peptide of unknown function, and angiotensin II to angiotensin 1-7, a vasodilator. Also able to hydrolyze apelin-13 and dynorphin-13 with high efficiency. 
May be an important regulator of heart function.(Microbial infection) Acts as a receptor for SARS coronavirus/SARS-CoV and human coronavirus NL63/HCoV-NL63.","gene_name":"ACE2","locus":"Xp22","cellular_location":"Secreted","transmembrane_regions":"741-761","signal_regions":"1-17","theoretical_pi":5.23,"molecular_weight":92462.4,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0001955|Angiotensin-converting enzyme 2\nMSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQ\nNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTIL\nNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLY\nEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHL\nHAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQ\nAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILM\nCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS\nIGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEM\nKREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLH\nKCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNK\nNSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSVAYAMRQYFLKVKN\nQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRLNDN\nSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKKKNKARSGENP\nYASIDISKGENNPGFQNTDDVQTSF","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010689|Angiotensin-converting enzyme 2 
(ACE2)\nATGTCAAGCTCTTCCTGGCTCCTTCTCAGCCTTGTTGCTGTAACTGCTGCTCAGTCCACC\nATTGAGGAACAGGCCAAGACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTC\nTATCAAAGTTCACTTGCTTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAA\nAACATGAATAATGCTGGGGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCC\nCAAATGTATCCACTACAAGAAATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTT\nCAGCAAAATGGGTCTTCAGTGCTCTCAGAAGACAAGAGCAAACGGTTGAACACAATTCTA\nAATACAATGAGCACCATCTACAGTACTGGAAAAGTTTGTAACCCAGATAATCCACAAGAA\nTGCTTATTACTTGAACCAGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAG\nAGGCTCTGGGCTTGGGAAAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATAT\nGAAGAGTATGTGGTCTTGAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGG\nGATTATTGGAGAGGAGACTATGAAGTAAATGGGGTAGATGGCTATGACTACAGCCGCGGC\nCAGTTGATTGAAGATGTGGAACATACCTTTGAAGAGATTAAACCATTATATGAACATCTT\nCATGCCTATGTGAGGGCAAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGA\nTGCCTCCCTGCTCATTTGCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCT\nTTGACAGTTCCCTTTGGACAGAAACCAAACATAGATGTTACTGATGCAATGGTGGACCAG\nGCCTGGGATGCACAGAGAATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTT\nCCTAATATGACTCAAGGATTCTGGGAAAATTCCATGCTAACGGACCCAGGAAATGTTCAG\nAAAGCAGTCTGCCATCCCACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATG\nTGCACAAAGGTGACAATGGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAG\nTATGATATGGCATATGCTGCACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTC\nCATGAAGCTGTTGGGGAAATCATGTCACTTTCTGCAGCCACACCTAAGCATTTAAAATCC\nATTGGTCTTCTGTCACCCGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTC\nAAACAAGCACTCACGATTGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGG\nTGGATGGTCTTTAAAGGGGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATG\nAAGCGAGAGATAGTTGGGGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCC\nGCATCTCTGTTCCATGTTTCTAATGATTACTCATTCATTCGATATTACACAAGGACCCTT\nTACCAATTCCAGTTTCAAGAAGCACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCAC\nAAATGTGACATCTCAAACTCTACAGAAGCTGGACAGAAACTGTTCAATATGCTGAGGCTT\nGGAAAATCAGAACCCTGGACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAAT\nGTAAGGCCACTGCTCAACTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAG\nAATTCTTTTGTGGGATGGAGTACCGACTGGAGTCCATATGCAGACCAAAGCATCAAAGTG\nAGGATAAGCCTAAAATCAGCTCTTGGAGATAAAGCATATGAATGGAACGACAATGAAATG\nTACCTGTT
CCGATCATCTGTTGCATATGCTATGAGGCAGTACTTTTTAAAAGTAAAAAAT\nCAGATGATTCTTTTTGGGGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCC\nTTTAATTTCTTTGTCACTGCACCTAAAAATGTGTCTGATATCATTCCTAGAACTGAAGTT\nGAAAAGGCCATCAGGATGTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAAC\nAGCCTAGAGTTTCTGGGGATACAGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCC\nATATGGCTGATTGTTTTTGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTG\nATCTTCACTGGGATCAGAGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAATCCT\nTATGCCTCCATCGATATTAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGAT\nGTTCAGACCTCCTTTTAG","gene_format":"FASTA","parent_id":"BE0000982"} {"id":"P10415","source":"Swiss-Prot","name":"Apoptosis regulator Bcl-2","general_function":"Ubiquitin protein ligase binding","specific_function":"Suppresses apoptosis in a variety of cell systems including factor-dependent lymphohematopoietic and neural cells. Regulates cell death by controlling the mitochondrial membrane permeability. Appears to function in a feedback loop system with caspases. Inhibits caspase activity either by preventing the release of cytochrome c from the mitochondria and/or by binding to the apoptosis-activating factor (APAF-1). 
May attenuate inflammation by impairing NLRP1-inflammasome activation, hence CASP1 activation and IL1B release (PubMed:17418785).","gene_name":"BCL2","locus":"18q21.33|18q21.3","cellular_location":"Mitochondrion outer membrane","transmembrane_regions":"212-233","theoretical_pi":7.32,"molecular_weight":26265.66,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0000491|Apoptosis regulator Bcl-2\nMAHAGRTGYDNREIVMKYIHYKLSQRGYEWDAGDVGAAPPGAAPAPGIFSSQPGHTPHPA\nASRDPVARTSPLQTPAAPGAAAGPALSPVPPVVHLTLRQAGDDFSRRYRRDFAEMSSQLH\nLTPFTARGRFATVVEELFRDGVNWGRIVAFFEFGGVMCVESVNREMSPLVDNIALWMTEY\nLNRHLHTWIQDNGGWDAFVELYGPSMRPLFDFSWLSLKTLLSLALVGACITLGAYLGHK","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021924|Apoptosis regulator Bcl-2 (BCL2)\nATGGCGCACGCTGGGAGAACAGGGTACGATAACCGGGAGATAGTGATGAAGTACATCCAT\nTATAAGCTGTCGCAGAGGGGCTACGAGTGGGATGCGGGAGATGTGGGCGCCGCGCCCCCG\nGGGGCCGCCCCCGCACCGGGCATCTTCTCCTCCCAGCCCGGGCACACGCCCCATCCAGCC\nGCATCCCGGGACCCGGTCGCCAGGACCTCGCCGCTGCAGACCCCGGCTGCCCCCGGCGCC\nGCCGCGGGGCCTGCGCTCAGCCCGGTGCCACCTGTGGTCCACCTGACCCTCCGCCAGGCC\nGGCGACGACTTCTCCCGCCGCTACCGCCGCGACTTCGCCGAGATGTCCAGCCAGCTGCAC\nCTGACGCCCTTCACCGCGCGGGGACGCTTTGCCACGGTGGTGGAGGAGCTCTTCAGGGAC\nGGGGTGAACTGGGGGAGGATTGTGGCCTTCTTTGAGTTCGGTGGGGTCATGTGTGTGGAG\nAGCGTCAACCGGGAGATGTCGCCCCTGGTGGACAACATCGCCCTGTGGATGACTGAGTAC\nCTGAACCGGCACCTGCACACCTGGATCCAGGATAACGGAGGCTGGGATGCCTTTGTGGAA\nCTGTACGGCCCCAGCATGCGGCCTCTGTTTGATTTCTCCTGGCTGTCTCTGAAGACTCTG\nCTCAGTTTGGCCCTGGTGGGAGCTTGCATCACCCTGGGTGCCTATCTGGGCCACAAGTGA\n","gene_format":"FASTA","parent_id":"BE0000246"} {"id":"P12104","source":"Swiss-Prot","name":"Fatty acid-binding protein, intestinal","general_function":"Transporter activity","specific_function":"FABP are thought to play a role in the intracellular transport of long-chain fatty acids and their acyl-CoA esters. FABP2 is probably involved in triglyceride-rich lipoprotein synthesis. 
Binds saturated long-chain fatty acids with a high affinity, but binds with a lower affinity to unsaturated long-chain fatty acids. FABP2 may also help maintain energy homeostasis by functioning as a lipid sensor.","gene_name":"FABP2","locus":"4q28-q31","cellular_location":"Cytoplasm","theoretical_pi":7.52,"molecular_weight":15207.165,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0012494|Fatty acid-binding protein, intestinal\nMAFDSTWKVDRSENYDKFMEKMGVNIVKRKLAAHDNLKLTITQEGNKFTVKESSAFRNIE\nVVFELGVTFNYNLADGTELRGTWSLEGNKLIGKFKRTDNGNELNTVREIIGDELVQTYVY\nEGVEAKRIFKKD","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0012495|Fatty acid-binding protein, intestinal (FABP2)\nATGGCGTTTGACAGCACTTGGAAGGTAGACCGGAGTGAAAACTATGACAAGTTCATGGAA\nAAAATGGGTGTTAATATAGTGAAAAGGAAGCTTGCAGCTCATGACAATTTGAAGCTGACA\nATTACACAAGAAGGAAATAAATTCACAGTCAAAGAATCAAGCACTTTTCGAAACATTGAA\nGTTGTTTTTGAACTTGGTGTCACCTTTAATTACAATCTAGCAGACGGAACTGAACTCAGG\nGGGACCTGGAGCCTTGAGGGAAATAAACTTATTGGAAAATTCAAACGGACAGACAATGGA\nAACGAACTGAATACTGTCCGAGAAATTATAGGTGATGAACTAGTCCAGACTTATGTATAT\nGAAGGAGTAGAAGCCAAAAGGATCTTTAAAAAGGATTGA","gene_format":"FASTA","parent_id":"BE0003591"} {"id":"P07359","source":"Swiss-Prot","name":"Platelet glycoprotein Ib alpha chain","general_function":"Thrombin receptor activity","specific_function":"GP-Ib, a surface membrane protein of platelets, participates in the formation of platelet plugs by binding to the A1 domain of vWF, which is already bound to the subendothelium.","gene_name":"GP1BA","locus":"17pter-p12","cellular_location":"Membrane","transmembrane_regions":"532-552","signal_regions":"1-16","theoretical_pi":6.67,"molecular_weight":71539.265,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0010716|Platelet glycoprotein Ib alpha 
chain\nMPLLLLLLLLPSPLHPHPICEVSKVASHLEVNCDKRNLTALPPDLPKDTTILHLSENLLY\nTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLPLLGQTLPALTV\nLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLANNNLTELP\nAGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFLHGNPWLCNCEILYFRRWLQDNA\nENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCPTLGDEGDTDLYDYYPEEDT\nEGDKVRATRTVVKFPTKAHTTPWGLFYSWSTASLDSQMPSSLHPTQESTKEQTTFPPRWT\nPNFTLHMESITFSKTPKSTTEPTPSPTTSEPVPEPAPNMTTLEPTPSPTTPEPTSEPAPS\nPTTPEPTSEPAPSPTTPEPTSEPAPSPTTPEPTPIPTIATSPTILVSATSLITPKSTFLT\nTTKPVSLLESTKKTIPELDQPPKLRGVLQGHLESSRNDPFLHPDFCCLLPLGFYVLGLFW\nLLFASVVLILLLSWVGHVKPQALDSGQGAALTTATQTTHLELQRGRQVTVPRAWLLFLRG\nSLPTFRSSLFLWVRPNGRVGPLVAGRRPSALSQGRGQDLLSTVSIRYSGHSL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010717|Platelet glycoprotein Ib alpha chain (GP1BA)\nATGCCTCTCCTCCTCTTGCTGCTCCTGCTGCCAAGCCCCTTACACCCCCACCCCATCTGT\nGAGGTCTCCAAAGTGGCCAGCCACCTAGAAGTGAACTGTGACAAGAGGAATCTGACAGCG\nCTGCCTCCAGACCTGCCGAAAGACACAACCATCCTCCACCTGAGTGAGAACCTCCTGTAC\nACCTTCTCCCTGGCAACCCTGATGCCTTACACTCGCCTCACTCAGCTGAACCTAGATAGG\nTGCGAGCTCACCAAGCTCCAGGTCGATGGGACGCTGCCAGTGCTGGGGACCCTGGATCTA\nTCCCACAATCAGCTGCAAAGCCTGCCCTTGCTAGGGCAGACACTGCCTGCTCTCACCGTC\nCTGGACGTCTCCTTCAACCGGCTGACCTCGCTGCCTCTTGGTGCCCTGCGTGGTCTTGGC\nGAACTCCAAGAGCTCTACCTGAAAGGCAATGAGCTGAAGACCCTGCCCCCAGGGCTCCTG\nACGCCCACACCCAAGCTGGAGAAGCTCAGTCTGGCTAACAACAACTTGACTGAGCTCCCC\nGCTGGGCTCCTGAATGGGCTGGAGAATCTCGACACCCTTCTCCTCCAAGAGAACTCGCTG\nTATACAATACCAAAGGGCTTTTTTGGGTCCCACCTCCTGCCTTTTGCTTTTCTCCACGGG\nAACCCCTGGTTATGCAACTGTGAGATCCTCTATTTTCGTCGCTGGCTGCAGGACAATGCT\nGAAAATGTCTACGTATGGAAGCAAGGTGTGGACGTCAAGGCCATGACCTCTAACGTGGCC\nAGTGTGCAGTGTGACAATTCAGACAAGTTTCCCGTCTACAAATACCCAGGAAAGGGGTGC\nCCCACCCTTGGTGATGAAGGTGACACAGACCTATATGATTACTACCCAGAAGAGGACACT\nGAGGGCGATAAGGTGCGTGCCACAAGGACTGTGGTCAAGTTCCCCACCAAAGCCCATACA\nACCCCCTGGGGTCTATTCTACTCATGGTCCACTGCTTCTCTAGACAGCCAAATGCCCTCC\nTCCTTGCATCCAACACAAGAATCCACTAAGGAGCAGACCACATTCCCACCTAGATGGACC\nCCAAATTTCACACTTCACATGGAATCCATCACATTCTCCAAAACTCCAAAATCCACTACT\nGAACCAACCCCAAGCCCGACCACCTCAGAGCCC
GTCCCGGAGCCCGCCCCAAACATGACC\nACCCTGGAGCCCACTCCAAGCCCGACCACCCCAGAGCCCACCTCAGAGCCCGCCCCCAGC\nCCGACCACCCCGGAGCCCACCTCAGAGCCCGCCCCCAGCCCGACCACCCCAGAGCCCACC\nTCAGAGCCCGCCCCCAGCCCGACCACCCCGGAGCCCACCCCAATCCCGACCATCGCCACA\nAGCCCGACCATCCTGGTGTCTGCCACAAGCCTGATCACTCCAAAAAGCACATTTTTAACT\nACCACAAAACCCGTATCACTCTTAGAATCCACCAAAAAAACCATCCCTGAACTTGATCAG\nCCACCAAAGCTCCGTGGGGTGCTCCAAGGGCATTTGGAGAGCTCCAGAAATGACCCTTTT\nCTCCACCCCGACTTTTGCTGCCTCCTCCCCCTGGGCTTCTATGTCTTGGGTCTCTTCTGG\nCTGCTCTTTGCCTCTGTGGTCCTCATCCTGCTGCTGAGCTGGGTTGGGCATGTGAAACCA\nCAGGCCCTGGACTCTGGCCAAGGTGCTGCTCTGACCACAGCCACACAAACCACACACCTG\nGAGCTGCAGAGGGGACGGCAAGTGACAGTGCCCCGGGCCTGGCTGCTCTTCCTTCGAGGT\nTCGCTTCCCACTTTCCGCTCCAGCCTCTTCCTGTGGGTACGGCCTAATGGCCGTGTGGGG\nCCTCTAGTGGCAGGAAGGAGGCCCTCAGCTCTGAGTCAGGGTCGTGGTCAGGACCTGCTG\nAGCACAGTGAGCATTAGGTACTCTGGCCACAGCCTCTGA","gene_format":"FASTA","parent_id":"BE0001025"} {"id":"P31151","source":"Swiss-Prot","name":"Protein S100-A7","general_function":"Zinc ion binding","gene_name":"S100A7","locus":"1q21","cellular_location":"Cytoplasm","theoretical_pi":6.78,"molecular_weight":11470.87,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0012623|Protein S100-A7\nMSNTQAERSIIGMIDMFHKYTRRDDKIEKPSLLTMMKENFPNFLSACDKKGTNYLADVFE\nKKDKNEDKKIDFSEFLSLLGDIATDYHKQSHGAAPCSGGSQ","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0012624|Protein S100-A7 (S100A7)\nATGAGCAACACTCAAGCTGAGAGGTCCATAATAGGCATGATCGACATGTTTCACAAATAC\nACCAGACGTGATGACAAGATTGAGAAGCCAAGCCTGCTGACGATGATGAAGGAGAACTTC\nCCCAACTTCCTTAGTGCCTGTGACAAAAAGGGCACAAATTACCTCGCCGATGTCTTTGAG\nAAAAAGGACAAGAATGAGGATAAGAAGATTGATTTTTCTGAGTTTCTGTCCTTGCTGGGA\nGACATAGCCACAGACTACCACAAGCAGAGCCATGGAGCAGCGCCCTGTTCCGGGGGCAGC\nCAGTGA","gene_format":"FASTA","parent_id":"BE0003756"} {"id":"Q13547","source":"Swiss-Prot","name":"Histone deacetylase 1","general_function":"Transcription regulatory region sequence-specific dna binding","specific_function":"Responsible for the deacetylation of lysine residues on the N-terminal part of 
the core histones (H2A, H2B, H3 and H4). Histone deacetylation gives a tag for epigenetic repression and plays an important role in transcriptional regulation, cell cycle progression and developmental events. Histone deacetylases act via the formation of large multiprotein complexes. Deacetylates SP proteins, SP1 and SP3, and regulates their function. Component of the BRG1-RB1-HDAC1 complex, which negatively regulates the CREST-mediated transcription in resting neurons. Upon calcium stimulation, HDAC1 is released from the complex and CREBBP is recruited, which facilitates transcriptional activation. Deacetylates TSHZ3 and regulates its transcriptional repressor activity. Deacetylates 'Lys-310' in RELA and thereby inhibits the transcriptional activity of NF-kappa-B. Deacetylates NR1D2 and abrogates the effect of KAT5-mediated relieving of NR1D2 transcription repression activity. Component of a RCOR/GFI/KDM1A/HDAC complex that suppresses, via histone deacetylase (HDAC) recruitment, a number of genes implicated in multilineage blood cell development. Involved in CIART-mediated transcriptional repression of the circadian transcriptional activator: CLOCK-ARNTL/BMAL1 heterodimer. 
Required for the transcriptional repression of circadian target genes, such as PER1, mediated by the large PER complex or CRY1 through histone deacetylation.","gene_name":"HDAC1","locus":"1p34","cellular_location":"Nucleus","theoretical_pi":5.16,"molecular_weight":55102.615,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0011697|Histone deacetylase 1\nMAQTQGTRRKVCYYYDGDVGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKAN\nAEEMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVAS\nAVKLNKQQTDIAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRVLYIDIDIHHG\nDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNYPLRDGIDDESYEAI\nFKPVMSKVMEMFQPSAVVLQCGSDSLSGDRLGCFNLTIKGHAKCVEFVKSFNLPMLMLGG\nGGYTIRNVARCWTYETAVALDTEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTNEYLE\nKIKQRLFENLRMLPHAPGVQMQAIPEDAIPEESGDEDEDDPDKRISICSSDKRIACEEEF\nSDSEEEGEGGRKNSSNFKKAKRVKTEDEKEKDPEEKKEVTEEEKTKEEKPEAKGVKEEVK\nLA","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0011698|Histone deacetylase 1 (HDAC1)\nATGGCGCAGACGCAGGGCACCCGGAGGAAAGTCTGTTACTACTACGACGGGGATGTTGGA\nAATTACTATTATGGACAAGGCCACCCAATGAAGCCTCACCGAATCCGCATGACTCATAAT\nTTGCTGCTCAACTATGGTCTCTACCGAAAAATGGAAATCTATCGCCCTCACAAAGCCAAT\nGCTGAGGAGATGACCAAGTACCACAGCGATGACTACATTAAATTCTTGCGCTCCATCCGT\nCCAGATAACATGTCGGAGTACAGCAAGCAGATGCAGAGATTCAACGTTGGTGAGGACTGT\nCCAGTATTCGATGGCCTGTTTGAGTTCTGTCAGTTGTCTACTGGTGGTTCTGTGGCAAGT\nGCTGTGAAACTTAATAAGCAGCAGACGGACATCGCTGTGAATTGGGCTGGGGGCCTGCAC\nCATGCAAAGAAGTCCGAGGCATCTGGCTTCTGTTACGTCAATGATATCGTCTTGGCCATC\nCTGGAACTGCTAAAGTATCACCAGAGGGTGCTGTACATTGACATTGATATTCACCATGGT\nGACGGCGTGGAAGAGGCCTTCTACACCACGGACCGGGTCATGACTGTGTCCTTTCATAAG\nTATGGAGAGTACTTCCCAGGAACTGGGGACCTACGGGATATCGGGGCTGGCAAAGGCAAG\nTATTATGCTGTTAACTACCCGCTCCGAGACGGGATTGATGACGAGTCCTATGAGGCCATT\nTTCAAGCCGGTCATGTCCAAAGTAATGGAGATGTTCCAGCCTAGTGCGGTGGTCTTACAG\nTGTGGCTCAGACTCCCTATCTGGGGATCGGTTAGGTTGCTTCAATCTAACTATCAAAGGA\nCACGCCAAGTGTGTGGAATTTGTCAAGAGCTTTAACCTGCCTATGCTGATGCTGGGAGGC\nGGTGGTTACACCATTCGTAACGTTGCCCGGTGCTGGACATATGAGACAGCTGTGGCCCTG\nGATACGGAGATCCCTAATGAGCTT
CCATACAATGACTACTTTGAATACTTTGGACCAGAT\nTTCAAGCTCCACATCAGTCCTTCCAATATGACTAACCAGAACACGAATGAGTACCTGGAG\nAAGATCAAACAGCGACTGTTTGAGAACCTTAGAATGCTGCCGCACGCACCTGGGGTCCAA\nATGCAGGCGATTCCTGAGGACGCCATCCCTGAGGAGAGTGGCGATGAGGACGAAGACGAC\nCCTGACAAGCGCATCTCGATCTGCTCCTCTGACAAACGAATTGCCTGTGAGGAAGAGTTC\nTCCGATTCTGAAGAGGAGGGAGAGGGGGGCCGCAAGAACTCTTCCAACTTCAAAAAAGCC\nAAGAGAGTCAAAACAGAGGATGAAAAAGAGAAAGACCCAGAGGAGAAGAAAGAAGTCACC\nGAAGAGGAGAAAACCAAGGAGGAGAAGCCAGAAGCCAAAGGGGTCAAGGAGGAGGTCAAG\nTTGGCCTGA","gene_format":"FASTA","parent_id":"BE0002365"} {"id":"P51843","source":"Swiss-Prot","name":"Nuclear receptor subfamily 0 group B member 1","general_function":"Transcription factor binding","specific_function":"Orphan nuclear receptor. Component of a cascade required for the development of the hypothalamic-pituitary-adrenal-gonadal axis. Acts as a coregulatory protein that inhibits the transcriptional activity of other nuclear receptors through heterodimeric interactions. May also have a role in the development of the embryo and in the maintenance of embryonic stem cell pluripotency (By similarity).","gene_name":"NR0B1","locus":"Xp21.3-p21.2","cellular_location":"Nucleus","theoretical_pi":8.12,"molecular_weight":51717.185,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0000148|Nuclear receptor subfamily 0 group B member 1\nMAGENHQWQGSILYNMLMSAKQTRAAPEAPETRLVDQCWGCSCGDEPGVGREGLLGGRNV\nALLYRCCFCGKDHPRQGSILYSMLTSAKQTYAAPKAPEATLGPCWGCSCGSDPGVGRAGL\nPGGRPVALLYRCCFCGEDHPRQGSILYSLLTSSKQTHVAPAAPEARPGGAWWDRSYFAQR\nPGGKEALPGGRATALLYRCCFCGEDHPQQGSTLYCVPTSTNQAQAAPEERPRAPWWDTSS\nGALRPVALKSPQVVCEAASAGLLKTLRFVKYLPCFQVLPLDQQLVLVRNCWASLLMLELA\nQDRLQFETVEVSEPSMLQKILTTRRRETGGNEPLPVPTLQHHLAPPAEARKVPSASQVQA\nIKCFLSKCWSLNISTKEYAYLKGTVLFNPDVPGLQCVKYIQGLQWGTQQILSEHTRMTHQ\nGPHDRFIELNSTLFLLRFINANVIAELFFRPIIGTVSMDDMMLEMLCTKI","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0010151|Nuclear receptor subfamily 0 group B member 1 
(NR0B1)\nATGGCGGGCGAGAACCACCAGTGGCAGGGCAGCATCCTCTACAACATGCTTATGAGCGCG\nAAGCAAACGCGCGCGGCTCCTGAGGCTCCAGAGACGCGGCTGGTGGATCAGTGCTGGGGC\nTGTTCGTGCGGCGATGAGCCCGGGGTGGGCAGAGAGGGGCTGCTGGGCGGGCGGAACGTG\nGCGCTCCTGTACCGCTGCTGCTTTTGCGGTAAAGACCACCCACGGCAGGGCAGCATCCTC\nTACAGCATGCTGACGAGCGCAAAGCAAACGTACGCGGCACCGAAGGCGCCCGAGGCGACG\nCTGGGTCCGTGCTGGGGCTGTTCGTGCGGCTCTGATCCCGGGGTGGGCAGAGCGGGGCTT\nCCGGGTGGGCGGCCCGTGGCACTCCTGTACCGCTGCTGCTTTTGTGGTGAAGACCACCCG\nCGGCAGGGCAGCATCCTCTACAGCTTGCTCACTAGCTCAAAGCAAACGCACGTGGCTCCG\nGCAGCGCCCGAGGCACGGCCAGGGGGCGCGTGGTGGGACCGCTCCTACTTCGCGCAGAGG\nCCAGGGGGTAAAGAGGCGCTACCAGGCGGGCGGGCCACGGCGCTTCTGTACCGCTGCTGC\nTTTTGCGGTGAAGACCACCCGCAGCAGGGCAGCACCCTCTACTGCGTGCCCACGAGCACA\nAATCAAGCGCAGGCGGCTCCGGAGGAGCGGCCGAGGGCCCCCTGGTGGGACACCTCCTCT\nGGTGCGCTGCGGCCGGTGGCGCTCAAGAGTCCACAGGTGGTCTGCGAGGCAGCCTCAGCG\nGGCCTGTTGAAGACGCTGCGCTTCGTCAAGTACTTGCCCTGCTTCCAGGTGCTGCCCCTG\nGACCAGCAGCTGGTGCTGGTGCGCAACTGCTGGGCGTCCCTGCTCATGCTTGAGCTGGCC\nCAGGACCGCTTGCAGTTCGAGACTGTGGAAGTCTCGGAGCCCAGCATGCTGCAGAAGATC\nCTCACCACCAGGCGGCGGGAGACCGGGGGCAACGAGCCACTGCCCGTGCCCACGCTGCAG\nCACCATTTGGCACCGCCGGCGGAGGCCAGGAAGGTGCCCTCCGCCTCCCAGGTCCAAGCC\nATCAAGTGCTTTCTTTCCAAATGCTGGAGTCTGAACATCAGTACCAAGGAGTACGCCTAC\nCTCAAGGGGACCGTGCTCTTTAACCCGGACGTGCCGGGCCTGCAGTGCGTGAAGTACATT\nCAGGGACTCCAGTGGGGAACTCAGCAAATACTCAGTGAACACACCAGGATGACGCACCAA\nGGGCCCCATGACAGATTCATCGAACTTAATAGTACCCTTTTCCTGCTGAGATTCATCAAT\nGCCAATGTCATTGCTGAACTGTTCTTCAGGCCCATCATCGGCACAGTCAGCATGGATGAT\nATGATGCTGGAAATGCTCTGTACAAAGATATAA","gene_format":"FASTA","parent_id":"BE0000074"} {"id":"P51681","source":"Swiss-Prot","name":"C-C chemokine receptor type 5","general_function":"Virus receptor activity","specific_function":"Receptor for a number of inflammatory CC-chemokines including MIP-1-alpha, MIP-1-beta and RANTES and subsequently transduces a signal by increasing the intracellular calcium ion level. May play a role in the control of granulocytic lineage proliferation or differentiation. 
Acts as a coreceptor (CD4 being the primary receptor) for HIV-1 R5 isolates.(Microbial infection) Acts as a receptor for human immunodeficiency virus-1/HIV-1.","gene_name":"CCR5","locus":"3p21.31","cellular_location":"Cell membrane","transmembrane_regions":"31-58\n69-89\n103-124\n142-166\n199-218\n236-260\n278-301","theoretical_pi":9.18,"molecular_weight":40523.645,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0016246|C-C chemokine receptor type 5\nMDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR\nLKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMCQLLTGLYFIGFFSGIFFII\nLLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTCSS\nHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI\nMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQVTETLGMTHCCINPIIYAFV\nGEKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPERASSVYTRSTGEQEISVGL","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0016247|C-C chemokine receptor type 5 (CCR5)\nATGGATTATCAAGTGTCAAGTCCAATCTATGACATCAATTATTATACATCGGAGCCCTGC\nCAAAAAATCAATGTGAAGCAAATCGCAGCCCGCCTCCTGCCTCCGCTCTACTCACTGGTG\nTTCATCTTTGGTTTTGTGGGCAACATGCTGGTCATCCTCATCCTGATAAACTGCAAAAGG\nCTGAAGAGCATGACTGACATCTACCTGCTCAACCTGGCCATCTCTGACCTGTTTTTCCTT\nCTTACTGTCCCCTTCTGGGCTCACTATGCTGCCGCCCAGTGGGACTTTGGAAATACAATG\nTGTCAACTCTTGACAGGGCTCTATTTTATAGGCTTCTTCTCTGGAATCTTCTTCATCATC\nCTCCTGACAATCGATAGGTACCTGGCTGTCGTCCATGCTGTGTTTGCTTTAAAAGCCAGG\nACGGTCACCTTTGGGGTGGTGACAAGTGTGATCACTTGGGTGGTGGCTGTGTTTGCGTCT\nCTCCCAGGAATCATCTTTACCAGATCTCAAAAAGAAGGTCTTCATTACACCTGCAGCTCT\nCATTTTCCATACAGTCAGTATCAATTCTGGAAGAATTTCCAGACATTAAAGATAGTCATC\nTTGGGGCTGGTCCTGCCGCTGCTTGTCATGGTCATCTGCTACTCGGGAATCCTAAAAACT\nCTGCTTCGGTGTCGAAATGAGAAGAAGAGGCACAGGGCTGTGAGGCTTATCTTCACCATC\nATGATTGTTTATTTTCTCTTCTGGGCTCCCTACAACATTGTCCTTCTCCTGAACACCTTC\nCAGGAATTCTTTGGCCTGAATAATTGCAGTAGCTCTAACAGGTTGGACCAAGCTATGCAG\nGTGACAGAGACTCTTGGGATGACGCACTGCTGCATCAACCCCATCATCTATGCCTTTGTC\nGGGGAGAAGTTCAGAAACTACCTCTTAGTCTTCTTCCAAAAGCACATTGCCAAACGCTTC\nTGCAAATGCTGTTCTATTTTCCAGCAAGAGGCTCCCGAGCGAGCAAGCTCAG
TTTACACC\nCGATCCACTGGGGAGCAGGAAATATCTGTGGGCTTGTGA","gene_format":"FASTA","parent_id":"BE0000911"} {"id":"Q16602","source":"Swiss-Prot","name":"Calcitonin gene-related peptide type 1 receptor","general_function":"Receptor for calcitonin-gene-related peptide (CGRP) together with RAMP1 and receptor for adrenomedullin together with RAMP3 (By similarity). Receptor for adrenomedullin together with RAMP2. The activity of this receptor is mediated by G proteins which activate adenylyl cyclase.","specific_function":"Adrenomedullin receptor activity","gene_name":"CALCRL","locus":"2q32.1","cellular_location":"Cell membrane","transmembrane_regions":"147-166\n174-193\n214-236\n254-273\n290-313\n337-354\n367-388","signal_regions":"1-22","molecular_weight":52928.98,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0049732|Calcitonin gene-related peptide type 1 receptor\nMEKKCTLNFLVLLPFFMILVTAELEESPEDSIQLGVTRNKIMTAQYECYQKIMQDPIQQA\nEGVYCNRTWDGWLCWNDVAAGTESMQLCPDYFQDFDPSEKVTKICDQDGNWFRHPASNRT\nWTNYTQCNVNTHEKVKTALNLFYLTIIGHGLSIASLLISLGIFFYFKSLSCQRITLHKNL\nFFSFVCNSVVTIIHLTAVANNQALVATNPVSCKVSQFIHLYLMGCNYFWMLCEGIYLHTL\nIVVAVFAEKQHLMWYYFLGWGFPLIPACIHAIARSLYYNDNCWISSDTHLLYIIHGPICA\nALLVNLFFLLNIVRVLITKLKVTHQAESNLYMKAVRATLILVPLLGIEFVLIPWRPEGKI\nAEEVYDYIMHILMHFQGLLVSTIFCFFNGEVQAILRRNWNQYKIQFGNSFSNSEALRSAS\nYTVSTISDGPGYSHDCPSEHLNGKSIHDIENVLLKPENLYN","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0049733|Calcitonin gene-related peptide type 1 receptor 
(CALCRL)\nATGGAGAAAAAGTGTACCCTGTATTTTCTGGTTCTCTTGCCTTTTTTTATGATTCTTGTT\nACAGCAGAATTAGAAGAGAGTCCTGAGGACTCAATTCAGTTGGGAGTTACTAGAAATAAA\nATCATGACAGCTCAATATGAATGTTACCAAAAGATTATGCAAGACCCCATTCAACAAGCA\nGAAGGCGTTTACTGCAACAGAACCTGGGATGGATGGCTCTGCTGGAACGATGTTGCAGCA\nGGAACTGAATCAATGCAGCTCTGCCCTGATTACTTTCAGGACTTTGATCCATCAGAAAAA\nGTTACAAAGATCTGTGACCAAGATGGAAACTGGTTTAGACATCCAGCAAGCAACAGAACA\nTGGACAAATTATACCCAGTGTAATGTTAACACCCACGAGAAAGTGAAGACTGCACTAAAT\nTTGTTTTACCTGACCATAATTGGACACGGATTGTCTATTGCATCACTGCTTATCTCGCTT\nGGCATATTCTTTTATTTCAAGAGCCTAAGTTGCCAAAGGATTACCTTACACAAAAATCTG\nTTCTTCTCATTTGTTTGTAACTCTGTTGTAACAATCATTCACCTCACTGCAGTGGCCAAC\nAACCAGGCCTTAGTAGCCACAAATCCTGTTAGTTGCAAAGTGTCCCAGTTCATTCATCTT\nTACCTGATGGGCTGTAATTACTTTTGGATGCTCTGTGAAGGCATTTACCTACACACACTC\nATTGTGGTGGCCGTGTTTGCAGAGAAGCAACATTTAATGTGGTATTATTTTCTTGGCTGG\nGGATTTCCACTGATTCCTGCTTGTATACATGCCATTGCTAGAAGCTTATATTACAATGAC\nAATTGCTGGATCAGTTCTGATACCCATCTCCTCTACATTATCCATGGCCCAATTTGTGCT\nGCTTTACTGGTGAATCTTTTTTTCTTGTTAAATATTGTACGCGTTCTCATCACCAAGTTA\nAAAGTTACACACCAAGCGGAATCCAATCTGTACATGAAAGCTGTGAGAGCTACTCTTATC\nTTGGTGCCATTGCTTGGCATTGAATTTGTGCTGATTCCATGGCGACCTGAAGGAAAGATT\nGCAGAGGAGGTATATGACTACATCATGCACATCCTTATGCACTTCCAGGGTCTTTTGGTC\nTCTACCATTTTCTGCTTCTTTAATGGAGAGGTTCAAGCAATTCTGAGAAGAAACTGGAAT\nCAATACAAAATCCAATTTGGAAACAGCTTTTCCAACTCAGAAGCTCTTCGTAGTGCGTCT\nTACACAGTGTCAACAATCAGTGATGGTCCAGGTTATAGTCATGACTGTCCTAGTGAACAC\nTTAAATGGAAAAAGCATCCATGATATTGAAAATGTTCTCTTAAAACCAGAAAATTTATAT\nAATTGA","gene_format":"FASTA","parent_id":"BE0009009"} {"id":"P08887","source":"Swiss-Prot","name":"Interleukin-6 receptor subunit alpha","general_function":"Protein homodimerization activity","specific_function":"Part of the receptor for interleukin 6. Binds to IL6 with low affinity, but does not transduce a signal. Signal activation necessitate an association with IL6ST. 
Activation may lead to the regulation of the immune response, acute-phase reactions and hematopoiesis.Low concentration of a soluble form of IL6 receptor acts as an agonist of IL6 activity.","gene_name":"IL6R","locus":"1q21","cellular_location":"Basolateral cell membrane","transmembrane_regions":"366-386","signal_regions":"1-19","theoretical_pi":8.3,"molecular_weight":51547.015,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0005598|Interleukin-6 receptor subunit alpha\nMLAVGCALLAALLAAPGAALAPRRCPAQEVARGVLTSLPGDSVTLTCPGVEPEDNATVHW\nVLRKPAAGSHPSRWAGMGRRLLLRSVQLHDSGNYSCYRAGRPAGTVHLLVDVPPEEPQLS\nCFRKSPLSNVVCEWGPRSTPSLTTKAVLLVRKFQNSPAEDFQEPCQYSQESQKFSCQLAV\nPEGDSSFYIVSMCVASSVGSKFSKTQTFQGCGILQPDPPANITVTAVARNPRWLSVTWQD\nPHSWNSSFYRLRFELRYRAERSKTFTTWMVKDLQHHCVIHDAWSGLRHVVQLRAQEEFGQ\nGEWSEWSPEAMGTPWTESRSPPAENEVSTPMQALTTNKDDDNILFRDSANATSLPVQDSS\nSVPLPTFLVAGGSLAFGTLLCIAIVLRFKKTWKLRALKEGKTSMHPPYSLGQLVPERPRP\nTPVLVPLISPPVSPSSLGSDNTSSHNRPDARDPRSPYDISNTDYFFPR","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0019340|Interleukin-6 receptor subunit alpha 
(IL6R)\nATGCTGGCCGTCGGCTGCGCGCTGCTGGCTGCCCTGCTGGCCGCGCCGGGAGCGGCGCTG\nGCCCCAAGGCGCTGCCCTGCGCAGGAGGTGGCGAGAGGCGTGCTGACCAGTCTGCCAGGA\nGACAGCGTGACTCTGACCTGCCCGGGGGTAGAGCCGGAAGACAATGCCACTGTTCACTGG\nGTGCTCAGGAAGCCGGCTGCAGGCTCCCACCCCAGCAGATGGGCTGGCATGGGAAGGAGG\nCTGCTGCTGAGGTCGGTGCAGCTCCACGACTCTGGAAACTATTCATGCTACCGGGCCGGC\nCGCCCAGCTGGGACTGTGCACTTGCTGGTGGATGTTCCCCCCGAGGAGCCCCAGCTCTCC\nTGCTTCCGGAAGAGCCCCCTCAGCAATGTTGTTTGTGAGTGGGGTCCTCGGAGCACCCCA\nTCCCTGACGACAAAGGCTGTGCTCTTGGTGAGGAAGTTTCAGAACAGTCCGGCCGAAGAC\nTTCCAGGAGCCGTGCCAGTATTCCCAGGAGTCCCAGAAGTTCTCCTGCCAGTTAGCAGTC\nCCGGAGGGAGACAGCTCTTTCTACATAGTGTCCATGTGCGTCGCCAGTAGTGTCGGGAGC\nAAGTTCAGCAAAACTCAAACCTTTCAGGGTTGTGGAATCTTGCAGCCTGATCCGCCTGCC\nAACATCACAGTCACTGCCGTGGCCAGAAACCCCCGCTGGCTCAGTGTCACCTGGCAAGAC\nCCCCACTCCTGGAACTCATCTTTCTACAGACTACGGTTTGAGCTCAGATATCGGGCTGAA\nCGGTCAAAGACATTCACAACATGGATGGTCAAGGACCTCCAGCATCACTGTGTCATCCAC\nGACGCCTGGAGCGGCCTGAGGCACGTGGTGCAGCTTCGTGCCCAGGAGGAGTTCGGGCAA\nGGCGAGTGGAGCGAGTGGAGCCCGGAGGCCATGGGCACGCCTTGGACAGAATCCAGGAGT\nCCTCCAGCTGAGAACGAGGTGTCCACCCCCATGCAGGCACTTACTACTAATAAAGACGAT\nGATAATATTCTCTTCAGAGATTCTGCAAATGCGACAAGCCTCCCAGTGCAAGATTCTTCT\nTCAGTACCACTGCCCACATTCCTGGTTGCTGGAGGGAGCCTGGCCTTCGGAACGCTCCTC\nTGCATTGCCATTGTTCTGAGGTTCAAGAAGACGTGGAAGCTGCGGGCTCTGAAGGAAGGC\nAAGACAAGCATGCATCCGCCGTACTCTTTGGGGCAGCTGGTCCCGGAGAGGCCTCGACCC\nACCCCAGTGCTTGTTCCTCTCATCTCCCCACCGGTGTCCCCCAGCAGCCTGGGGTCTGAC\nAATACCTCGAGCCACAACCGACCAGATGCCAGGGACCCACGGAGCCCTTATGACATCAGC\nAATACAGACTACTTCTTCCCCAGATAG","gene_format":"FASTA","parent_id":"BE0002859"} {"id":"P0C6X7","source":"Swiss-Prot","name":"Replicase polyprotein 1ab","general_function":"Zinc ion binding","specific_function":"Replicase polyprotein 1ab: Multifunctional protein involved in the transcription and replication of viral RNAs. Contains the proteinases responsible for the cleavages of the polyprotein.Host translation inhibitor nsp1: Inhibits host translation by interacting with the 40S ribosomal subunit. 
The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. Viral mRNAs are not susceptible to nsp1-mediated endonucleolytic RNA cleavage thanks to the presence of a 5'-end leader sequence and are therefore protected from degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion from host immune response. Non-structural protein 2: May play a role in the modulation of host cell survival signaling pathway by interacting with host PHB and PHB2. Indeed, these two proteins play a role in maintaining the functional integrity of the mitochondria and protecting cells from various stresses. Papain-like proteinase: Responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PL-PRO possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. Participates together with nsp4 in the assembly of virally-induced cytoplasmic double-membrane vesicles necessary for viral replication. Antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF3. Also prevents host NF-kappa-B signaling. Non-structural protein 4: Participates in the assembly of virally-induced cytoplasmic double-membrane vesicles necessary for viral replication. Proteinase 3CL-PRO: Cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Also able to bind an ADP-ribose-1''-phosphate (ADRP). Non-structural protein 6: Plays a role in the initial induction of autophagosomes from host endoplasmic reticulum. 
Later, limits the expansion of these phagosomes that are no longer able to deliver viral components to lysosomes. Non-structural protein 7: Forms a hexadecamer with nsp8 (8 subunits of each) that may participate in viral replication by acting as a primase. Alternatively, may synthesize substantially longer products than oligonucleotide primers. Non-structural protein 8: Forms a hexadecamer with nsp7 (8 subunits of each) that may participate in viral replication by acting as a primase. Alternatively, may synthesize substantially longer products than oligonucleotide primers. Non-structural protein 9: May participate in viral replication by acting as a ssRNA-binding protein. Non-structural protein 10: Plays a pivotal role in viral transcription by stimulating both nsp14 3'-5' exoribonuclease and nsp16 2'-O-methyltransferase activities. Therefore plays an essential role in viral mRNA cap methylation. RNA-directed RNA polymerase: Responsible for replication and transcription of the viral RNA genome. Helicase: Multi-functional protein with a zinc-binding domain in N-terminus displaying RNA and DNA duplex-unwinding activities with 5' to 3' polarity. Activity of helicase is dependent on magnesium. Guanine-N7 methyltransferase: Enzyme possessing two different activities: an exoribonuclease activity acting on both ssRNA and dsRNA in a 3' to 5' direction and an N7-guanine methyltransferase activity. Uridylate-specific endoribonuclease: Mn(2+)-dependent, uridylate-specific enzyme, which leaves 2'-3'-cyclic phosphates 5' to the cleaved bond. 2'-O-methyltransferase: Methyltransferase that mediates mRNA cap 2'-O-ribose methylation to the 5'-cap structure of viral mRNAs. N7-methyl guanosine cap is a prerequisite for binding of nsp16. 
Therefore plays an essential role in viral mRNAs cap methylation which is essential to evade immune system.","gene_name":"rep","cellular_location":"Host membrane","transmembrane_regions":"2092-2112\n2203-2223\n2304-2324\n2326-2346\n2351-2371\n2755-2775\n2830-2850\n2879-2899\n2992-3012\n3022-3042\n3054-3074\n3077-3097\n3105-3125\n3142-3162\n3564-3584\n3586-3606\n3612-3632\n3658-3678\n3707-3727\n3728-3748\n3756-3776","molecular_weight":790241.63,"organism":"SARS-CoV","organism_ncbi_taxonomy_id":227859,"amino_acid_sequence":">lcl|BSEQ0009333|Replicase polyprotein 1ab\nMESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEAREHLKNGTCGLVELEKGV\nLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGITLGVLVPHVGETPIAYRN\nVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQNWNTKHGSGALRELTRELNGG\nAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQLDYIESKRGVYCCRDHEHEIAW\nFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPLNSKVKVIQPRVEKKKTEGFMGRI\nRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFLKATCEHCGTENLVIEGPTTCGYL\nPTNAVVKMPCPACQDPEIGPEHSVADYHNHSNIETRLRKGGRTRCFGGCVFAYVGCYNKR\nAYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRERVNINIVGDFHLNEEVAIILASF\nSASTSAFIDTIKSLDYKSFKTIVESCGNYKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQA\nAGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAY\nVTGGLVQQTSQWLSNLLGTTVEKLRPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIV\nKGQIQVASDNIKDCVKCFIDVVNKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQC\nIRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAIVG\nTPVCVNGLMLLEIKDKEQYCALSPGLLATNNVFRLKGGAPIKGVTFGEDTVWEVQGYKNV\nRITFELDERVDKVLNEKCSVYTVESGTEVTEFACVVAEAVVKTLQPVSDLLTNMGIDLDE\nWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEEIDETCEHEYGTEDDYQGL\nPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEPTPEEPVNQFTGYLKLTDNVAIKC\nVDIVKEAQSANPMVIVNAANIHLKHGGGVAGALNKATNGAMQKESDDYIKLNGPLTVGGS\nCLLSGHNLAKKCLHVVGPNLNAGEDIQLLKAAYENFNSQDILLAPLLSAGIFGAKPLQSL\nQVCVQTVRTQVYIAVNDKALYEQVVMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQK\nPVDVKPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAP\nYMVGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKT\nALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAHAEETRKLMPICMDVRAIMATIQR\nKY
KGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLEEAAR\nCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSGQRTELG\nVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLLSLREVKTIKVFTTVDNTNLHTQL\nVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTLRSEAFEYYHTLDESFL\nGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLALQQLEVKFNAPALQEAYYRAR\nAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLESAKRVLNVVCKHCGQKTTTLT\nGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQESSFVMMSAPPAEYKLQQGTFLC\nANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTDVFYKETSYTTTIKPVSYK\nLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPLPNASFDNFKLTCSNTKFADDLNQ\nMTGFTKPASRELSVTFFPDLNGDVVAIDYRHYSASFKKGAKLLHKPIVWHINQATTKTTF\nKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNLACESQQPTSEEVVENPTIQKEVIE\nCDVKTTEVVGNVILKPSDEGVKVTQELGHEDLMAAYVENTSITIKKPNELSLALGLKTIA\nTHGIAAINSVPWSKILAYVKPFLGQAAITTSNCAKRLAQRVFNNYMPYVFTLLFQLCTFT\nKSTNSRIRASLPTTIAKNSVKSVAKLCLDAGINYVKSPKFSKLFTIAMWLLLLSICLGSL\nICVTAAFGVLLSNFGAPSYCNGVRELYLNSSNVTTMDFCEGSFPCSICLSGLDSLDSYPA\nLETIQVTISSYKLDLTILGLAAEWVLAYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSW\nLMWFIISIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVEC\nTTIVNGMKRSFYVYANGGRGFCKTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINP\nTDQSSYIVDSVAVKNGALHLYFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVF\nDGKSKCDESASKSASVYYSQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFS\nVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDL\nEVTGDSCNNFMLTYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSE\nQLRKQIRSAAKKNNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVLA\nALVCYIVMPVHTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGG\nSYKNDKSCPVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLI\nEYSDFATSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQ\nFPNTYLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVFCGVDA\nMNLIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHVVAANAL\nLFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIVPFWITAI\nYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLLPLTQ\nYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSITSAVLQ\nSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIR\nKSNHSFLVQAGNVQLRVI
GHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNG\nSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGK\nFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE\nPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQC\nSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFFVYENAFLPFTLGIMAIAA\nCAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWLELADTSLSGYRLKDCV\nMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKVYYGNALDQAISMWALVISVTSN\nYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCIMLVYCFLGYCCCCYFGLFCLLNRY\nFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDAFKLNIKLLGIGGKPCIKVATVQSKMS\nDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQG\nAVDINRLCEEMLDNRATLQAIASEFSSLPSYAAYATAQEAYEQAVANGDSEVVLKKLKKS\nLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDND\nALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVD\nADSKIVQLSEINMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTAC\nTDDNALAYYNNSKGGRFVLALLSDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGP\nKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDPAKAYKDY\nLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFC\nDLKGKYVQIPTTCANDPVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASTFLNRV\nCGVSAARLTPCGTGTSTDVVYRAFDIYNEKVAGFAKFLKTNCCRFQEKDEEGNLLDSYFV\nVKRHTMSNYQHEETIYNLVKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMADLVYAL\nRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTVQ\nFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPIVDSYYSLLMPILTLTRAL\nAAESHMDADLAKPLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCILHCA\nNFNVLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKEL\nLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKE\nGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINAN\nQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAK\nNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVET\nPHLMGWDYPKCDRAMPNMLRIMASLVLARKHNTCCNLSHRFYRLANECAQVLSEMVMCGG\nSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECL\nYRNRDVDHEFVDEFYAYLRKHFSMMILSDDAVVCYNSNYAAQGLVASIKNFKAVLYYQNN\nVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTD\nGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYL
QYIRKLHDELTGHMLDMYSVMLTNDN\nTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSH\nKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVG\nSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLS\nDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKL\nNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYST\nLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARAR\nVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYI\nGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDN\nKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQN\nAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKIGILCIMSDR\nDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGL\nCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATR\nDAVGTNLPLQLGFSTGVNLVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLP\nWNVVRIKIVQMLSDTLKGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFS\nTSSDTYACWNHSVGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTR\nCLAVHECFVKRVDWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAI\nKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPANAIV\nCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQ\nVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYN\nLWNTFTRLQSLENVAYNVVNKGHFDGHAGEAPVSIINNAVYTKVDGIDVEIFENKTTLPV\nNVAFELWAKRNIKPVPEIKILNNLGVDIAANTVIWDYKREAPAHVSTIGVCTMTDIAKKP\nTESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIGES\nVKTQFNYFKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKLEGYA\nFEHIVYGDFSHGQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDAQTGSSKCV\nCSVIDLLLDDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYPKLQASQAW\nQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLNTLTLAVPYNM\nRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWD\nLIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLALGGSIAVKITEHSWNADLYKLMG\nHFSWWTAFVTNVNASSSEAFLIGANYLGKPKEQIDGYTMHANYIFWRNTNPIQLSSYSLF\nDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRLIIRENNRVVVSSDILVNN","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0021701|Replicase polyprotein 1ab 
(rep)\nATGGAGAGCCTTGTTCTTGGTGTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTC\nCTTCAGGTTAGAGACGTGCTAGTGCGTGGCTTCGGGGACTCTGTGGAAGAGGCCCTATCG\nGAGGCACGTGAACACCTCAAAAATGGCACTTGTGGTCTAGTAGAGCTGGAAAAAGGCGTA\nCTGCCCCAGCTTGAACAGCCCTATGTGTTCATTAAACGTTCTGATGCCTTAAGCACCAAT\nCACGGCCACAAGGTCGTTGAGCTGGTTGCAGAAATGGACGGCATTCAGTACGGTCGTAGC\nGGTATAACACTGGGAGTACTCGTGCCACATGTGGGCGAAACCCCAATTGCATACCGCAAT\nGTTCTTCTTCGTAAGAACGGTAATAAGGGAGCCGGTGGTCATAGCTATGGCATCGATCTA\nAAGTCTTATGACTTAGGTGACGAGCTTGGCACTGATCCCATTGAAGATTATGAACAAAAC\nTGGAACACTAAGCATGGCAGTGGTGCACTCCGTGAACTCACTCGTGAGCTCAATGGAGGT\nGCAGTCACTCGCTATGTCGACAACAATTTCTGTGGCCCAGATGGGTACCCTCTTGATTGC\nATCAAAGATTTTCTCGCACGCGCGGGCAAGTCAATGTGCACTCTTTCCGAACAACTTGAT\nTACATCGAGTCGAAGAGAGGTGTCTACTGCTGCCGTGACCATGAGCATGAAATTGCCTGG\nTTCACTGAGCGCTCTGATAAGAGCTACGAGCACCAGACACCCTTCGAAATTAAGAGTGCC\nAAGAAATTTGACACTTTCAAAGGGGAATGCCCAAAGTTTGTGTTTCCTCTTAACTCAAAA\nGTCAAAGTCATTCAACCACGTGTTGAAAAGAAAAAGACTGAGGGTTTCATGGGGCGTATA\nCGCTCTGTGTACCCTGTTGCATCTCCACAGGAGTGTAACAATATGCACTTGTCTACCTTG\nATGAAATGTAATCATTGCGATGAAGTTTCATGGCAGACGTGCGACTTTCTGAAAGCCACT\nTGTGAACATTGTGGCACTGAAAATTTAGTTATTGAAGGACCTACTACATGTGGGTACCTA\nCCTACTAATGCTGTAGTGAAAATGCCATGTCCTGCCTGTCAAGACCCAGAGATTGGACCT\nGAGCATAGTGTTGCAGATTATCACAACCACTCAAACATTGAAACTCGACTCCGCAAGGGA\nGGTAGGACTAGATGTTTTGGAGGCTGTGTGTTTGCCTATGTTGGCTGCTATAATAAGCGT\nGCCTACTGGGTTCCTCGTGCTAGTGCTGATATTGGCTCAGGCCATACTGGCATTACTGGT\nGACAATGTGGAGACCTTGAATGAGGATCTCCTTGAGATACTGAGTCGTGAACGTGTTAAC\nATTAACATTGTTGGCGATTTTCATTTGAATGAAGAGGTTGCCATCATTTTGGCATCTTTC\nTCTGCTTCTACAAGTGCCTTTATTGACACTATAAAGAGTCTTGATTACAAGTCTTTCAAA\nACCATTGTTGAGTCCTGCGGTAACTATAAAGTTACCAAGGGAAAGCCCGTAAAAGGTGCT\nTGGAACATTGGACAACAGAGATCAGTTTTAACACCACTGTGTGGTTTTCCCTCACAGGCT\nGCTGGTGTTATCAGATCAATTTTTGCGCGCACACTTGATGCAGCAAACCACTCAATTCCT\nGATTTGCAAAGAGCAGCTGTCACCATACTTGATGGTATTTCTGAACAGTCATTACGTCTT\nGTCGACGCCATGGTTTATACTTCAGACCTGCTCACCAACAGTGTCATTATTATGGCATAT\nGTAACTGGTGGTCTTGTACAACAGACTTCTCAGTGGTTGTCTAATCTTTTGGGCACTACT\nGTTGAAAAACTCAGGCCTATCTTTGAATGGATTGAGGCGAAACTTAGTGCAGGAGTTGAA\nTTTCTCAAG
GATGCTTGGGAGATTCTCAAATTTCTCATTACAGGTGTTTTTGACATCGTC\nAAGGGTCAAATACAGGTTGCTTCAGATAACATCAAGGATTGTGTAAAATGCTTCATTGAT\nGTTGTTAACAAGGCACTCGAAATGTGCATTGATCAAGTCACTATCGCTGGCGCAAAGTTG\nCGATCACTCAACTTAGGTGAAGTCTTCATCGCTCAAAGCAAGGGACTTTACCGTCAGTGT\nATACGTGGCAAGGAGCAGCTGCAACTACTCATGCCTCTTAAGGCACCAAAAGAAGTAACC\nTTTCTTGAAGGTGATTCACATGACACAGTACTTACCTCTGAGGAGGTTGTTCTCAAGAAC\nGGTGAACTCGAAGCACTCGAGACGCCCGTTGATAGCTTCACAAATGGAGCTATCGTTGGC\nACACCAGTCTGTGTAAATGGCCTCATGCTCTTAGAGATTAAGGACAAAGAACAATACTGC\nGCATTGTCTCCTGGTTTACTGGCTACAAACAATGTCTTTCGCTTAAAAGGGGGTGCACCA\nATTAAAGGTGTAACCTTTGGAGAAGATACTGTTTGGGAAGTTCAAGGTTACAAGAATGTG\nAGAATCACATTTGAGCTTGATGAACGTGTTGACAAAGTGCTTAATGAAAAGTGCTCTGTC\nTACACTGTTGAATCCGGTACCGAAGTTACTGAGTTTGCATGTGTTGTAGCAGAGGCTGTT\nGTGAAGACTTTACAACCAGTTTCTGATCTCCTTACCAACATGGGTATTGATCTTGATGAG\nTGGAGTGTAGCTACATTCTACTTATTTGATGATGCTGGTGAAGAAAACTTTTCATCACGT\nATGTATTGTTCCTTTTACCCTCCAGATGAGGAAGAAGAGGACGATGCAGAGTGTGAGGAA\nGAAGAAATTGATGAAACCTGTGAACATGAGTACGGTACAGAGGATGATTATCAAGGTCTC\nCCTCTGGAATTTGGTGCCTCAGCTGAAACAGTTCGAGTTGAGGAAGAAGAAGAGGAAGAC\nTGGCTGGATGATACTACTGAGCAATCAGAGATTGAGCCAGAACCAGAACCTACACCTGAA\nGAACCAGTTAATCAGTTTACTGGTTATTTAAAACTTACTGACAATGTTGCCATTAAATGT\nGTTGACATCGTTAAGGAGGCACAAAGTGCTAATCCTATGGTGATTGTAAATGCTGCTAAC\nATACACCTGAAACATGGTGGTGGTGTAGCAGGTGCACTCAACAAGGCAACCAATGGTGCC\nATGCAAAAGGAGAGTGATGATTACATTAAGCTAAATGGCCCTCTTACAGTAGGAGGGTCT\nTGTTTGCTTTCTGGACATAATCTTGCTAAGAAGTGTCTGCATGTTGTTGGACCTAACCTA\nAATGCAGGTGAGGACATCCAGCTTCTTAAGGCAGCATATGAAAATTTCAATTCACAGGAC\nATCTTACTTGCACCATTGTTGTCAGCAGGCATATTTGGTGCTAAACCACTTCAGTCTTTA\nCAAGTGTGCGTGCAGACGGTTCGTACACAGGTTTATATTGCAGTCAATGACAAAGCTCTT\nTATGAGCAGGTTGTCATGGATTATCTTGATAACCTGAAGCCTAGAGTGGAAGCACCTAAA\nCAAGAGGAGCCACCAAACACAGAAGATTCCAAAACTGAGGAGAAATCTGTCGTACAGAAG\nCCTGTCGATGTGAAGCCAAAAATTAAGGCCTGCATTGATGAGGTTACCACAACACTGGAA\nGAAACTAAGTTTCTTACCAATAAGTTACTCTTGTTTGCTGATATCAATGGTAAGCTTTAC\nCATGATTCTCAGAACATGCTTAGAGGTGAAGATATGTCTTTCCTTGAGAAGGATGCACCT\nTACATGGTAGGTGATGTTATCACTAGTGGTGATATCACTTGTGTTGTAATACCCTCCAAA\nAAGGCTGGTGGCACTACTGAGATGC
TCTCAAGAGCTTTGAAGAAAGTGCCAGTTGATGAG\nTATATAACCACGTACCCTGGACAAGGATGTGCTGGTTATACACTTGAGGAAGCTAAGACT\nGCTCTTAAGAAATGCAAATCTGCATTTTATGTACTACCTTCAGAAGCACCTAATGCTAAG\nGAAGAGATTCTAGGAACTGTATCCTGGAATTTGAGAGAAATGCTTGCTCATGCTGAAGAG\nACAAGAAAATTAATGCCTATATGCATGGATGTTAGAGCCATAATGGCAACCATCCAACGT\nAAGTATAAAGGAATTAAAATTCAAGAGGGCATCGTTGACTATGGTGTCCGATTCTTCTTT\nTATACTAGTAAAGAGCCTGTAGCTTCTATTATTACGAAGCTGAACTCTCTAAATGAGCCG\nCTTGTCACAATGCCAATTGGTTATGTGACACATGGTTTTAATCTTGAAGAGGCTGCGCGC\nTGTATGCGTTCTCTTAAAGCTCCTGCCGTAGTGTCAGTATCATCACCAGATGCTGTTACT\nACATATAATGGATACCTCACTTCGTCATCAAAGACATCTGAGGAGCACTTTGTAGAAACA\nGTTTCTTTGGCTGGCTCTTACAGAGATTGGTCCTATTCAGGACAGCGTACAGAGTTAGGT\nGTTGAATTTCTTAAGCGTGGTGACAAAATTGTGTACCACACTCTGGAGAGCCCCGTCGAG\nTTTCATCTTGACGGTGAGGTTCTTTCACTTGACAAACTAAAGAGTCTCTTATCCCTGCGG\nGAGGTTAAGACTATAAAAGTGTTCACAACTGTGGACAACACTAATCTCCACACACAGCTT\nGTGGATATGTCTATGACATATGGACAGCAGTTTGGTCCAACATACTTGGATGGTGCTGAT\nGTTACAAAAATTAAACCTCATGTAAATCATGAGGGTAAGACTTTCTTTGTACTACCTAGT\nGATGACACACTACGTAGTGAAGCTTTCGAGTACTACCATACTCTTGATGAGAGTTTTCTT\nGGTAGGTACATGTCTGCTTTAAACCACACAAAGAAATGGAAATTTCCTCAAGTTGGTGGT\nTTAACTTCAATTAAATGGGCTGATAACAATTGTTATTTGTCTAGTGTTTTATTAGCACTT\nCAACAGCTTGAAGTCAAATTCAATGCACCAGCACTTCAAGAGGCTTATTATAGAGCCCGT\nGCTGGTGATGCTGCTAACTTTTGTGCACTCATACTCGCTTACAGTAATAAAACTGTTGGC\nGAGCTTGGTGATGTCAGAGAAACTATGACCCATCTTCTACAGCATGCTAATTTGGAATCT\nGCAAAGCGAGTTCTTAATGTGGTGTGTAAACATTGTGGTCAGAAAACTACTACCTTAACG\nGGTGTAGAAGCTGTGATGTATATGGGTACTCTATCTTATGATAATCTTAAGACAGGTGTT\nTCCATTCCATGTGTGTGTGGTCGTGATGCTACACAATATCTAGTACAACAAGAGTCTTCT\nTTTGTTATGATGTCTGCACCACCTGCTGAGTATAAATTACAGCAAGGTACATTCTTATGT\nGCGAATGAGTACACTGGTAACTATCAGTGTGGTCATTACACTCATATAACTGCTAAGGAG\nACCCTCTATCGTATTGACGGAGCTCACCTTACAAAGATGTCAGAGTACAAAGGACCAGTG\nACTGATGTTTTCTACAAGGAAACATCTTACACTACAACCATCAAGCCTGTGTCGTATAAA\nCTCGATGGAGTTACTTACACAGAGATTGAACCAAAATTGGATGGGTATTATAAAAAGGAT\nAATGCTTACTATACAGAGCAGCCTATAGACCTTGTACCAACTCAACCATTACCAAATGCG\nAGTTTTGATAATTTCAAACTCACATGTTCTAACACAAAATTTGCTGATGATTTAAATCAA\nATGACAGGCTTCACAAAGCCAGCTTCACGAGAGCTATCTGT
CACATTCTTCCCAGACTTG\nAATGGCGATGTAGTGGCTATTGACTATAGACACTATTCAGCGAGTTTCAAGAAAGGTGCT\nAAATTACTGCATAAGCCAATTGTTTGGCACATTAACCAGGCTACAACCAAGACAACGTTC\nAAACCAAACACTTGGTGTTTACGTTGTCTTTGGAGTACAAAGCCAGTAGATACTTCAAAT\nTCATTTGAAGTTCTGGCAGTAGAAGACACACAAGGAATGGACAATCTTGCTTGTGAAAGT\nCAACAACCCACCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGAAGGAAGTCATAGAG\nTGTGACGTGAAAACTACCGAAGTTGTAGGCAATGTCATACTTAAACCATCAGATGAAGGT\nGTTAAAGTAACACAAGAGTTAGGTCATGAGGATCTTATGGCTGCTTATGTGGAAAACACA\nAGCATTACCATTAAGAAACCTAATGAGCTTTCACTAGCCTTAGGTTTAAAAACAATTGCC\nACTCATGGTATTGCTGCAATTAATAGTGTTCCTTGGAGTAAAATTTTGGCTTATGTCAAA\nCCATTCTTAGGACAAGCAGCAATTACAACATCAAATTGCGCTAAGAGATTAGCACAACGT\nGTGTTTAACAATTATATGCCTTATGTGTTTACATTATTGTTCCAATTGTGTACTTTTACT\nAAAAGTACCAATTCTAGAATTAGAGCTTCACTACCTACAACTATTGCTAAAAATAGTGTT\nAAGAGTGTTGCTAAATTATGTTTGGATGCCGGCATTAATTATGTGAAGTCACCCAAATTT\nTCTAAATTGTTCACAATCGCTATGTGGCTATTGTTGTTAAGTATTTGCTTAGGTTCTCTA\nATCTGTGTAACTGCTGCTTTTGGTGTACTCTTATCTAATTTTGGTGCTCCTTCTTATTGT\nAATGGCGTTAGAGAATTGTATCTTAATTCGTCTAACGTTACTACTATGGATTTCTGTGAA\nGGTTCTTTTCCTTGCAGCATTTGTTTAAGTGGATTAGACTCCCTTGATTCTTATCCAGCT\nCTTGAAACCATTCAGGTGACGATTTCATCGTACAAGCTAGACTTGACAATTTTAGGTCTG\nGCCGCTGAGTGGGTTTTGGCATATATGTTGTTCACAAAATTCTTTTATTTATTAGGTCTT\nTCAGCTATAATGCAGGTGTTCTTTGGCTATTTTGCTAGTCATTTCATCAGCAATTCTTGG\nCTCATGTGGTTTATCATTAGTATTGTACAAATGGCACCCGTTTCTGCAATGGTTAGGATG\nTACATCTTCTTTGCTTCTTTCTACTACATATGGAAGAGCTATGTTCATATCATGGATGGT\nTGCACCTCTTCGACTTGCATGATGTGCTATAAGCGCAATCGTGCCACACGCGTTGAGTGT\nACAACTATTGTTAATGGCATGAAGAGATCTTTCTATGTCTATGCAAATGGAGGCCGTGGC\nTTCTGCAAGACTCACAATTGGAATTGTCTCAATTGTGACACATTTTGCACTGGTAGTACA\nTTCATTAGTGATGAAGTTGCTCGTGATTTGTCACTCCAGTTTAAAAGACCAATCAACCCT\nACTGACCAGTCATCGTATATTGTTGATAGTGTTGCTGTGAAAAATGGCGCGCTTCACCTC\nTACTTTGACAAGGCTGGTCAAAAGACCTATGAGAGACATCCGCTCTCCCATTTTGTCAAT\nTTAGACAATTTGAGAGCTAACAACACTAAAGGTTCACTGCCTATTAATGTCATAGTTTTT\nGATGGCAAGTCCAAATGCGACGAGTCTGCTTCTAAGTCTGCTTCTGTGTACTACAGTCAG\nCTGATGTGCCAACCTATTCTGTTGCTTGACCAAGCTCTTGTATCAGACGTTGGAGATAGT\nACTGAAGTTTCCGTTAAGATGTTTGATGCTTATGTCGACACCTTTTCAGCAACTTTT
AGT\nGTTCCTATGGAAAAACTTAAGGCACTTGTTGCTACAGCTCACAGCGAGTTAGCAAAGGGT\nGTAGCTTTAGATGGTGTCCTTTCTACATTCGTGTCAGCTGCCCGACAAGGTGTTGTTGAT\nACCGATGTTGACACAAAGGATGTTATTGAATGTCTCAAACTTTCACATCACTCTGACTTA\nGAAGTGACAGGTGACAGTTGTAACAATTTCATGCTCACCTATAATAAGGTTGAAAACATG\nACGCCCAGAGATCTTGGCGCATGTATTGACTGTAATGCAAGGCATATCAATGCCCAAGTA\nGCAAAAAGTCACAATGTTTCACTCATCTGGAATGTAAAAGACTACATGTCTTTATCTGAA\nCAGCTGCGTAAACAAATTCGTAGTGCTGCCAAGAAGAACAACATACCTTTTAGACTAACT\nTGTGCTACAACTAGACAGGTTGTCAATGTCATAACTACTAAAATCTCACTCAAGGGTGGT\nAAGATTGTTAGTACTTGTTTTAAACTTATGCTTAAGGCCACATTATTGTGCGTTCTTGCT\nGCATTGGTTTGTTATATCGTTATGCCAGTACATACATTGTCAATCCATGATGGTTACACA\nAATGAAATCATTGGTTACAAAGCCATTCAGGATGGTGTCACTCGTGACATCATTTCTACT\nGATGATTGTTTTGCAAATAAACATGCTGGTTTTGACGCATGGTTTAGCCAGCGTGGTGGT\nTCATACAAAAATGACAAAAGCTGCCCTGTAGTAGCTGCTATCATTACAAGAGAGATTGGT\nTTCATAGTGCCTGGCTTACCGGGTACTGTGCTGAGAGCAATCAATGGTGACTTCTTGCAT\nTTTCTACCTCGTGTTTTTAGTGCTGTTGGCAACATTTGCTACACACCTTCCAAACTCATT\nGAGTATAGTGATTTTGCTACCTCTGCTTGCGTTCTTGCTGCTGAGTGTACAATTTTTAAG\nGATGCTATGGGCAAACCTGTGCCATATTGTTATGACACTAATTTGCTAGAGGGTTCTATT\nTCTTATAGTGAGCTTCGTCCAGACACTCGTTATGTGCTTATGGATGGTTCCATCATACAG\nTTTCCTAACACTTACCTGGAGGGTTCTGTTAGAGTAGTAACAACTTTTGATGCTGAGTAC\nTGTAGACATGGTACATGCGAAAGGTCAGAAGTAGGTATTTGCCTATCTACCAGTGGTAGA\nTGGGTTCTTAATAATGAGCATTACAGAGCTCTATCAGGAGTTTTCTGTGGTGTTGATGCG\nATGAATCTCATAGCTAACATCTTTACTCCTCTTGTGCAACCTGTGGGTGCTTTAGATGTG\nTCTGCTTCAGTAGTGGCTGGTGGTATTATTGCCATATTGGTGACTTGTGCTGCCTACTAC\nTTTATGAAATTCAGACGTGTTTTTGGTGAGTACAACCATGTTGTTGCTGCTAATGCACTT\nTTGTTTTTGATGTCTTTCACTATACTCTGTCTGGTACCAGCTTACAGCTTTCTGCCGGGA\nGTCTACTCAGTCTTTTACTTGTACTTGACATTCTATTTCACCAATGATGTTTCATTCTTG\nGCTCACCTTCAATGGTTTGCCATGTTTTCTCCTATTGTGCCTTTTTGGATAACAGCAATC\nTATGTATTCTGTATTTCTCTGAAGCACTGCCATTGGTTCTTTAACAACTATCTTAGGAAA\nAGAGTCATGTTTAATGGAGTTACATTTAGTACCTTCGAGGAGGCTGCTTTGTGTACCTTT\nTTGCTCAACAAGGAAATGTACCTAAAATTGCGTAGCGAGACACTGTTGCCACTTACACAG\nTATAACAGGTATCTTGCTCTATATAACAAGTACAAGTATTTCAGTGGAGCCTTAGATACT\nACCAGCTATCGTGAAGCAGCTTGCTGCCACTTAGCAAAGGCTCTAAATGACTTTAGCAAC\nTCAGGTGCTGA
TGTTCTCTACCAACCACCACAGACATCAATCACTTCTGCTGTTCTGCAG\nAGTGGTTTTAGGAAAATGGCATTCCCGTCAGGCAAAGTTGAAGGGTGCATGGTACAAGTA\nACCTGTGGAACTACAACTCTTAATGGATTGTGGTTGGATGACACAGTATACTGTCCAAGA\nCATGTCATTTGCACAGCAGAAGACATGCTTAATCCTAACTATGAAGATCTGCTCATTCGC\nAAATCCAACCATAGCTTTCTTGTTCAGGCTGGCAATGTTCAACTTCGTGTTATTGGCCAT\nTCTATGCAAAATTGTCTGCTTAGGCTTAAAGTTGATACTTCTAACCCTAAGACACCCAAG\nTATAAATTTGTCCGTATCCAACCTGGTCAAACATTTTCAGTTCTAGCATGCTACAATGGT\nTCACCATCTGGTGTTTATCAGTGTGCCATGAGACCTAATCATACCATTAAAGGTTCTTTC\nCTTAATGGATCATGTGGTAGTGTTGGTTTTAACATTGATTATGATTGCGTGTCTTTCTGC\nTATATGCATCATATGGAGCTTCCAACAGGAGTACACGCTGGTACTGACTTAGAAGGTAAA\nTTCTATGGTCCATTTGTTGACAGACAAACTGCACAGGCTGCAGGTACAGACACAACCATA\nACATTAAATGTTTTGGCATGGCTGTATGCTGCTGTTATCAATGGTGATAGGTGGTTTCTT\nAATAGATTCACCACTACTTTGAATGACTTTAACCTTGTGGCAATGAAGTACAACTATGAA\nCCTTTGACACAAGATCATGTTGACATATTGGGACCTCTTTCTGCTCAAACAGGAATTGCC\nGTCTTAGATATGTGTGCTGCTTTGAAAGAGCTGCTGCAGAATGGTATGAATGGTCGTACT\nATCCTTGGTAGCACTATTTTAGAAGATGAGTTTACACCATTTGATGTTGTTAGACAATGC\nTCTGGTGTTACCTTCCAAGGTAAGTTCAAGAAAATTGTTAAGGGCACTCATCATTGGATG\nCTTTTAACTTTCTTGACATCACTATTGATTCTTGTTCAAAGTACACAGTGGTCACTGTTT\nTTCTTTGTTTACGAGAATGCTTTCTTGCCATTTACTCTTGGTATTATGGCAATTGCTGCA\nTGTGCTATGCTGCTTGTTAAGCATAAGCACGCATTCTTGTGCTTGTTTCTGTTACCTTCT\nCTTGCAACAGTTGCTTACTTTAATATGGTCTACATGCCTGCTAGCTGGGTGATGCGTATC\nATGACATGGCTTGAATTGGCTGACACTAGCTTGTCTGGTTATAGGCTTAAGGATTGTGTT\nATGTATGCTTCAGCTTTAGTTTTGCTTATTCTCATGACAGCTCGCACTGTTTATGATGAT\nGCTGCTAGACGTGTTTGGACACTGATGAATGTCATTACACTTGTTTACAAAGTCTACTAT\nGGTAATGCTTTAGATCAAGCTATTTCCATGTGGGCCTTAGTTATTTCTGTAACCTCTAAC\nTATTCTGGTGTCGTTACGACTATCATGTTTTTAGCTAGAGCTATAGTGTTTGTGTGTGTT\nGAGTATTACCCATTGTTATTTATTACTGGCAACACCTTACAGTGTATCATGCTTGTTTAT\nTGTTTCTTAGGCTATTGTTGCTGCTGCTACTTTGGCCTTTTCTGTTTACTCAACCGTTAC\nTTCAGGCTTACTCTTGGTGTTTATGACTACTTGGTCTCTACACAAGAATTTAGGTATATG\nAACTCCCAGGGGCTTTTGCCTCCTAAGAGTAGTATTGATGCTTTCAAGCTTAACATTAAG\nTTGTTGGGTATTGGAGGTAAACCATGTATCAAGGTTGCTACTGTACAGTCTAAAATGTCT\nGACGTAAAGTGCACATCTGTGGTACTGCTCTCGGTTCTTCAACAACTTAGAGTAGAGTCA\nTCTTCTAAATTGTGGGCACAATGTGTA
CAACTCCACAATGATATTCTTCTTGCAAAAGAC\nACAACTGAAGCTTTCGAGAAGATGGTTTCTCTTTTGTCTGTTTTGCTATCCATGCAGGGT\nGCTGTAGACATTAATAGGTTGTGCGAGGAAATGCTCGATAACCGTGCTACTCTTCAGGCT\nATTGCTTCAGAATTTAGTTCTTTACCATCATATGCCGCTTATGCCACTGCCCAGGAGGCC\nTATGAGCAGGCTGTAGCTAATGGTGATTCTGAAGTCGTTCTCAAAAAGTTAAAGAAATCT\nTTGAATGTGGCTAAATCTGAGTTTGACCGTGATGCTGCCATGCAACGCAAGTTGGAAAAG\nATGGCAGATCAGGCTATGACCCAAATGTACAAACAGGCAAGATCTGAGGACAAGAGGGCA\nAAAGTAACTAGTGCTATGCAAACAATGCTCTTCACTATGCTTAGGAAGCTTGATAATGAT\nGCACTTAACAACATTATCAACAATGCGCGTGATGGTTGTGTTCCACTCAACATCATACCA\nTTGACTACAGCAGCCAAACTCATGGTTGTTGTCCCTGATTATGGTACCTACAAGAACACT\nTGTGATGGTAACACCTTTACATATGCATCTGCACTCTGGGAAATCCAGCAAGTTGTTGAT\nGCGGATAGCAAGATTGTTCAACTTAGTGAAATTAACATGGACAATTCACCAAATTTGGCT\nTGGCCTCTTATTGTTACAGCTCTAAGAGCCAACTCAGCTGTTAAACTACAGAATAATGAA\nCTGAGTCCAGTAGCACTACGACAGATGTCCTGTGCGGCTGGTACCACACAAACAGCTTGT\nACTGATGACAATGCACTTGCCTACTATAACAATTCGAAGGGAGGTAGGTTTGTGCTGGCA\nTTACTATCAGACCACCAAGATCTCAAATGGGCTAGATTCCCTAAGAGTGATGGTACAGGT\nACAATTTACACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCAAAAGGGCCT\nAAAGTGAAATACTTGTACTTCATCAAAGGCTTAAACAACCTAAATAGAGGTATGGTGCTG\nGGCAGTTTAGCTGCTACAGTACGTCTTCAGGCTGGAAATGCTACAGAAGTACCTGCCAAT\nTCAACTGTGCTTTCCTTCTGTGCTTTTGCAGTAGACCCTGCTAAAGCATATAAGGATTAC\nCTAGCAAGTGGAGGACAACCAATCACCAACTGTGTGAAGATGTTGTGTACACACACTGGT\nACAGGACAGGCAATTACTGTAACACCAGAAGCTAACATGGACCAAGAGTCCTTTGGTGGT\nGCTTCATGTTGTCTGTATTGTAGATGCCACATTGACCATCCAAATCCTAAAGGATTCTGT\nGACTTGAAAGGTAAGTACGTCCAAATACCTACCACTTGTGCTAATGACCCAGTGGGTTTT\nACACTTAGAAACACAGTCTGTACCGTCTGCGGAATGTGGAAAGGTTATGGCTGTAGTTGT\nGACCAACTCCGCGAACCCTTGATGCAGTCTGCGGATGCATCAACGTTTTTAAACCGGGTT\nTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTC\nTACAGGGCTTTTGATATTTACAACGAAAAAGTTGCTGGTTTTGCAAAGTTCCTAAAAACT\nAATTGCTGTCGCTTCCAGGAGAAGGATGAGGAAGGCAATTTATTAGACTCTTACTTTGTA\nGTTAAGAGGCATACTATGTCTAACTACCAACATGAAGAGACTATTTATAACTTGGTTAAA\nGATTGTCCAGCGGTTGCTGTCCATGACTTTTTCAAGTTTAGAGTAGATGGTGACATGGTA\nCCACATATATCACGTCAGCGTCTAACTAAATACACAATGGCTGATTTAGTCTATGCTCTA\nCGTCATTTTGATGAGGGTAATTGTGATACATTAAAAGAAATAC
TCGTCACATACAATTGC\nTGTGATGATGATTATTTCAATAAGAAGGATTGGTATGACTTCGTAGAGAATCCTGACATC\nTTACGCGTATATGCTAACTTAGGTGAGCGTGTACGCCAATCATTATTAAAGACTGTACAA\nTTCTGCGATGCTATGCGTGATGCAGGCATTGTAGGCGTACTGACATTAGATAATCAGGAT\nCTTAATGGGAACTGGTACGATTTCGGTGATTTCGTACAAGTAGCACCAGGCTGCGGAGTT\nCCTATTGTGGATTCATATTACTCATTGCTGATGCCCATCCTCACTTTGACTAGGGCATTG\nGCTGCTGAGTCCCATATGGATGCTGATCTCGCAAAACCACTTATTAAGTGGGATTTGCTG\nAAATATGATTTTACGGAAGAGAGACTTTGTCTCTTCGACCGTTATTTTAAATATTGGGAC\nCAGACATACCATCCCAATTGTATTAACTGTTTGGATGATAGGTGTATCCTTCATTGTGCA\nAACTTTAATGTGTTATTTTCTACTGTGTTTCCACCTACAAGTTTTGGACCACTAGTAAGA\nAAAATATTTGTAGATGGTGTTCCTTTTGTTGTTTCAACTGGATACCATTTTCGTGAGTTA\nGGAGTCGTACATAATCAGGATGTAAACTTACATAGCTCGCGTCTCAGTTTCAAGGAACTT\nTTAGTGTATGCTGCTGATCCAGCTATGCATGCAGCTTCTGGCAATTTATTGCTAGATAAA\nCGCACTACATGCTTTTCAGTAGCTGCACTAACAAACAATGTTGCTTTTCAAACTGTCAAA\nCCCGGTAATTTTAATAAAGACTTTTATGACTTTGCTGTGTCTAAAGGTTTCTTTAAGGAA\nGGAAGTTCTGTTGAACTAAAACACTTCTTCTTTGCTCAGGATGGCAACGCTGCTATCAGT\nGATTATGACTATTATCGTTATAATCTGCCAACAATGTGTGATATCAGACAACTCCTATTC\nGTAGTTGAAGTTGTTGATAAATACTTTGATTGTTACGATGGTGGCTGTATTAATGCCAAC\nCAAGTAATCGTTAACAATCTGGATAAATCAGCTGGTTTCCCATTTAATAAATGGGGTAAG\nGCTAGACTTTATTATGACTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCGTATACT\nAAGCGTAATGTCATCCCTACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAG\nAATAGAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTATGACAAATAGACAGTTT\nCATCAGAAATTATTGAAGTCAATAGCCGCCACTAGAGGAGCTACTGTGGTAATTGGAACA\nAGCAAGTTTTACGGTGGCTGGCATAATATGTTAAAAACTGTTTACAGTGATGTAGAAACT\nCCACACCTTATGGGTTGGGATTATCCAAAATGTGACAGAGCCATGCCTAACATGCTTAGG\nATAATGGCCTCTCTTGTTCTTGCTCGCAAACATAACACTTGCTGTAACTTATCACACCGT\nTTCTACAGGTTAGCTAACGAGTGTGCGCAAGTATTAAGTGAGATGGTCATGTGTGGCGGC\nTCACTATATGTTAAACCAGGTGGAACATCATCCGGTGATGCTACAACTGCTTATGCTAAT\nAGTGTCTTTAACATTTGTCAAGCTGTTACAGCCAATGTAAATGCACTTCTTTCAACTGAT\nGGTAATAAGATAGCTGACAAGTATGTCCGCAATCTACAACACAGGCTCTATGAGTGTCTC\nTATAGAAATAGGGATGTTGATCATGAATTCGTGGATGAGTTTTACGCTTACCTGCGTAAA\nCATTTCTCCATGATGATTCTTTCTGATGATGCCGTTGTGTGCTATAACAGTAACTATGCG\nGCTCAAGGTTTAGTAGCTAGCATTAAGAACTTTAAGGCAGTTCTTTATTATCAAAATAA
T\nGTGTTCATGTCTGAGGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCACGAA\nTTTTGCTCACAGCATACAATGCTAGTTAAACAAGGAGATGATTACGTGTACCTGCCTTAC\nCCAGATCCATCAAGAATATTAGGCGCAGGCTGTTTTGTCGATGATATTGTCAAAACAGAT\nGGTACACTTATGATTGAAAGGTTCGTGTCACTGGCTATTGATGCTTACCCACTTACAAAA\nCATCCTAATCAGGAGTATGCTGATGTCTTTCACTTGTATTTACAATACATTAGAAAGTTA\nCATGATGAGCTTACTGGCCACATGTTGGACATGTATTCCGTAATGCTAACTAATGATAAC\nACCTCACGGTACTGGGAACCTGAGTTTTATGAGGCTATGTACACACCACATACAGTCTTG\nCAGGCTGTAGGTGCTTGTGTATTGTGCAATTCACAGACTTCACTTCGTTGCGGTGCCTGT\nATTAGGAGACCATTCCTATGTTGCAAGTGCTGCTATGACCATGTCATTTCAACATCACAC\nAAATTAGTGTTGTCTGTTAATCCCTATGTTTGCAATGCCCCAGGTTGTGATGTCACTGAT\nGTGACACAACTGTATCTAGGAGGTATGAGCTATTATTGCAAGTCACATAAGCCTCCCATT\nAGTTTTCCATTATGTGCTAATGGTCAGGTTTTTGGTTTATACAAAAACACATGTGTAGGC\nAGTGACAATGTCACTGACTTCAATGCGATAGCAACATGTGATTGGACTAATGCTGGCGAT\nTACATACTTGCCAACACTTGTACTGAGAGACTCAAGCTTTTCGCAGCAGAAACGCTCAAA\nGCCACTGAGGAAACATTTAAGCTGTCATATGGTATTGCCACTGTACGCGAAGTACTCTCT\nGACAGAGAATTGCATCTTTCATGGGAGGTTGGAAAACCTAGACCACCATTGAACAGAAAC\nTATGTCTTTACTGGTTACCGTGTAACTAAAAATAGTAAAGTACAGATTGGAGAGTACACC\nTTTGAAAAAGGTGACTATGGTGATGCTGTTGTGTACAGAGGTACTACGACATACAAGTTG\nAATGTTGGTGATTACTTTGTGTTGACATCTCACACTGTAATGCCACTTAGTGCACCTACT\nCTAGTGCCACAAGAGCACTATGTGAGAATTACTGGCTTGTACCCAACACTCAACATCTCA\nGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTCGGCATGCAAAAGTACTCTACA\nCTCCAAGGACCACCTGGTACTGGTAAGAGTCATTTTGCCATCGGACTTGCTCTCTATTAC\nCCATCTGCTCGCATAGTGTATACGGCATGCTCTCATGCAGCTGTTGATGCCCTATGTGAA\nAAGGCATTAAAATATTTGCCCATAGATAAATGTAGTAGAATCATACCTGCGCGTGCGCGC\nGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACACTAGAACAGTATGTTTTCTGCACT\nGTAAATGCATTGCCAGAAACAACTGCTGACATTGTAGTCTTTGATGAAATCTCTATGGCT\nACTAATTATGACTTGAGTGTTGTCAATGCTAGACTTCGTGCAAAACACTACGTCTATATT\nGGCGATCCTGCTCAATTACCAGCCCCCCGCACATTGCTGACTAAAGGCACACTAGAACCA\nGAATATTTTAATTCAGTGTGCAGACTTATGAAAACAATAGGTCCAGACATGTTCCTTGGA\nACTTGTCGCCGTTGTCCTGCTGAAATTGTTGACACTGTGAGTGCTTTAGTTTATGACAAT\nAAGCTAAAAGCACACAAGGATAAGTCAGCTCAATGCTTCAAAATGTTCTACAAAGGTGTT\nATTACACATGATGTTTCATCTGCAATCAACAGACCTCAAATAGGCGTTGTAAGAGAATTT\nCTTACACGCAATC
CTGCTTGGAGAAAAGCTGTTTTTATCTCACCTTATAATTCACAGAAC\nGCTGTAGCTTCAAAAATCTTAGGATTGCCTACGCAGACTGTTGATTCATCACAGGGTTCT\nGAATATGACTATGTCATATTCACACAAACTACTGAAACAGCACACTCTTGTAATGTCAAC\nCGCTTCAATGTGGCTATCACAAGGGCAAAAATTGGCATTTTGTGCATAATGTCTGATAGA\nGATCTTTATGACAAACTGCAATTTACAAGTCTAGAAATACCACGTCGCAATGTGGCTACA\nTTACAAGCAGAAAATGTAACTGGACTTTTTAAGGACTGTAGTAAGATCATTACTGGTCTT\nCATCCTACACAGGCACCTACACACCTCAGCGTTGATATAAAGTTCAAGACTGAAGGATTA\nTGTGTTGACATACCAGGCATACCAAAGGACATGACCTACCGTAGACTCATCTCTATGATG\nGGTTTCAAAATGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAAGAA\nGCTATTCGTCACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTGTCATGCAACTAGA\nGATGCTGTGGGTACTAACCTACCTCTCCAGCTAGGATTTTCTACAGGTGTTAACTTAGTA\nGCTGTACCGACTGGTTATGTTGACACTGAAAATAACACAGAATTCACCAGAGTTAATGCA\nAAACCTCCACCAGGTGACCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCC\nTGGAATGTAGTGCGTATTAAGATAGTACAAATGCTCAGTGATACACTGAAAGGATTGTCA\nGACAGAGTCGTGTTCGTCCTTTGGGCGCATGGCTTTGAGCTTACATCAATGAAGTACTTT\nGTCAAGATTGGACCTGAAAGAACGTGTTGTCTGTGTGACAAACGTGCAACTTGCTTTTCT\nACTTCATCAGATACTTATGCCTGCTGGAATCATTCTGTGGGTTTTGACTATGTCTATAAC\nCCATTTATGATTGATGTTCAGCAGTGGGGCTTTACGGGTAACCTTCAGAGTAACCATGAC\nCAACATTGCCAGGTACATGGAAATGCACATGTGGCTAGTTGTGATGCTATCATGACTAGA\nTGTTTAGCAGTCCATGAGTGCTTTGTTAAGCGCGTTGATTGGTCTGTTGAATACCCTATT\nATAGGAGATGAACTGAGGGTTAATTCTGCTTGCAGAAAAGTACAACACATGGTTGTGAAG\nTCTGCATTGCTTGCTGATAAGTTTCCAGTTCTTCATGACATTGGAAATCCAAAGGCTATC\nAAGTGTGTGCCTCAGGCTGAAGTAGAATGGAAGTTCTACGATGCTCAGCCATGTAGTGAC\nAAAGCTTACAAAATAGAGGAACTCTTCTATTCTTATGCTACACATCACGATAAATTCACT\nGATGGTGTTTGTTTGTTTTGGAATTGTAACGTTGATCGTTACCCAGCCAATGCAATTGTG\nTGTAGGTTTGACACAAGAGTCTTGTCAAACTTGAACTTACCAGGCTGTGATGGTGGTAGT\nTTGTATGTGAATAAGCATGCATTCCACACTCCAGCTTTCGATAAAAGTGCATTTACTAAT\nTTAAAGCAATTGCCTTTCTTTTACTATTCTGATAGTCCTTGTGAGTCTCATGGCAAACAA\nGTAGTGTCGGATATTGATTATGTTCCACTCAAATCTGCTACGTGTATTACACGATGCAAT\nTTAGGTGGTGCTGTTTGCAGACACCATGCAAATGAGTACCGACAGTACTTGGATGCATAT\nAATATGATGATTTCTGCTGGATTTAGCCTATGGATTTACAAACAATTTGATACTTATAAC\nCTGTGGAATACATTTACCAGGTTACAGAGTTTAGAAAATGTGGCTTATAATGTTGTTAAT\nAAAGGACACTTTGATGGACACGCCGGCGA
AGCACCTGTTTCCATCATTAATAATGCTGTT\nTACACAAAGGTAGATGGTATTGATGTGGAGATCTTTGAAAATAAGACAACACTTCCTGTT\nAATGTTGCATTTGAGCTTTGGGCTAAGCGTAACATTAAACCAGTGCCAGAGATTAAGATA\nCTCAATAATTTGGGTGTTGATATCGCTGCTAATACTGTAATCTGGGACTACAAAAGAGAA\nGCCCCAGCACATGTATCTACAATAGGTGTCTGCACAATGACTGACATTGCCAAGAAACCT\nACTGAGAGTGCTTGTTCTTCACTTACTGTCTTGTTTGATGGTAGAGTGGAAGGACAGGTA\nGACCTTTTTAGAAACGCCCGTAATGGTGTTTTAATAACAGAAGGTTCAGTCAAAGGTCTA\nACACCTTCAAAGGGACCAGCACAAGCTAGCGTCAATGGAGTCACATTAATTGGAGAATCA\nGTAAAAACACAGTTTAACTACTTTAAGAAAGTAGACGGCATTATTCAACAGTTGCCTGAA\nACCTACTTTACTCAGAGCAGAGACTTAGAGGATTTTAAGCCCAGATCACAAATGGAAACT\nGACTTTCTCGAGCTCGCTATGGATGAATTCATACAGCGATATAAGCTCGAGGGCTATGCC\nTTCGAACACATCGTTTATGGAGATTTCAGTCATGGACAACTTGGCGGTCTTCATTTAATG\nATAGGCTTAGCCAAGCGCTCACAAGATTCACCACTTAAATTAGAGGATTTTATCCCTATG\nGACAGCACAGTGAAAAATTACTTCATAACAGATGCGCAAACAGGTTCATCAAAATGTGTG\nTGTTCTGTGATTGATCTTTTACTTGATGACTTTGTCGAGATAATAAAGTCACAAGATTTG\nTCAGTGATTTCAAAAGTGGTCAAGGTTACAATTGACTATGCTGAAATTTCATTCATGCTT\nTGGTGTAAGGATGGACATGTTGAAACCTTCTACCCAAAACTACAAGCAAGTCAAGCGTGG\nCAACCAGGTGTTGCGATGCCTAACTTGTACAAGATGCAAAGAATGCTTCTTGAAAAGTGT\nGACCTTCAGAATTATGGTGAAAATGCTGTTATACCAAAAGGAATAATGATGAATGTCGCA\nAAGTATACTCAACTGTGTCAATACTTAAATACACTTACTTTAGCTGTACCCTACAACATG\nAGAGTTATTCACTTTGGTGCTGGCTCTGATAAAGGAGTTGCACCAGGTACAGCTGTGCTC\nAGACAATGGTTGCCAACTGGCACACTACTTGTCGATTCAGATCTTAATGACTTCGTCTCC\nGACGCAGATTCTACTTTAATTGGAGACTGTGCAACAGTACATACGGCTAATAAATGGGAC\nCTTATTATTAGCGATATGTATGACCCTAGGACCAAACATGTGACAAAAGAGAATGACTCT\nAAAGAAGGGTTTTTCACTTATCTGTGTGGATTTATAAAGCAAAAACTAGCCCTGGGTGGT\nTCTATAGCTGTAAAGATAACAGAGCATTCTTGGAATGCTGACCTTTACAAGCTTATGGGC\nCATTTCTCATGGTGGACAGCTTTTGTTACAAATGTAAATGCATCATCATCGGAAGCATTT\nTTAATTGGGGCTAACTATCTTGGCAAGCCGAAGGAACAAATTGATGGCTATACCATGCAT\nGCTAACTACATTTTCTGGAGGAACACAAATCCTATCCAGTTGTCTTCCTATTCACTCTTT\nGACATGAGCAAATTTCCTCTTAAATTAAGAGGAACTGCTGTAATGTCTCTTAAGGAGAAT\nCAAATCAATGATATGATTTATTCTCTTCTGGAAAAAGGTAGGCTTATCATTAGAGAAAAC\nAACAGAGTTGTGGTTTCAAGTGATATTCTTGTTAACAACTAA","gene_format":"FASTA","parent_id":"BE0003801"} 
{"id":"Q9H228","source":"Swiss-Prot","name":"Sphingosine 1-phosphate receptor 5","general_function":"Sphingosine-1-phosphate receptor activity","specific_function":"Receptor for the lysosphingolipid sphingosine 1-phosphate (S1P). S1P is a bioactive lysophospholipid that elicits diverse physiological effect on most types of cells and tissues. Is coupled to both the G(i/0)alpha and G(12) subclass of heteromeric G-proteins (By similarity). May play a regulatory role in the transformation of radial glial cells into astrocytes and may affect proliferative activity of these cells.","gene_name":"S1PR5","locus":"19p13.2","cellular_location":"Cell membrane","transmembrane_regions":"41-61\n71-91\n112-132\n152-172\n193-213\n253-273\n288-308","theoretical_pi":9.98,"molecular_weight":41774.515,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0004776|Sphingosine 1-phosphate receptor 5\nMESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAVVCLAVCAFIVLENLAVLLV\nLGRHPRFHAPMFLLLGSLTLSDLLAGAAYAANILLSGPLTLKLSPALWFAREGGVFVALT\nASVLSLLAIALERSLTMARRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD\nACSTVLPLYAKAYVLFCVLAFVGILAAICALYARIYCQVRANARRLPARPGTAGTTSTRA\nRRKPRSLALLRTLSVVLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANSL\nLNPIIYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSF\nSGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0011741|Sphingosine 1-phosphate receptor 5 
(S1PR5)\nATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAAC\nTACACCGGCAAGCTCCGCGGTGCGCGCTACCAGCCGGGTGCCGGCCTGCGCGCCGACGCC\nGTGGTGTGCCTGGCGGTGTGCGCCTTCATCGTGCTAGAGAATCTAGCCGTGTTGTTGGTG\nCTCGGACGCCACCCGCGCTTCCACGCTCCCATGTTCCTGCTCCTGGGCAGCCTCACGTTG\nTCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACATCCTACTGTCGGGGCCGCTCACG\nCTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCGTGGCACTCACT\nGCGTCCGTGCTGAGCCTCCTGGCCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGG\nGGGCCCGCGCCCGTCTCCAGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGGGGC\nGTGTCGCTGCTCCTCGGGCTCCTGCCAGCGCTGGGCTGGAATTGCCTGGGTCGCCTGGAC\nGCTTGCTCCACTGTCTTGCCGCTCTACGCCAAGGCCTACGTGCTCTTCTGCGTGCTCGCC\nTTCGTGGGCATCCTGGCCGCTATCTGTGCACTCTACGCGCGCATCTACTGCCAGGTACGC\nGCCAACGCGCGGCGCCTGCCGGCACGGCCCGGGACTGCGGGGACCACCTCGACCCGGGCG\nCGTCGCAAGCCGCGCTCGCTGGCCTTGCTGCGCACGCTCAGCGTGGTGCTCCTGGCCTTT\nGTGGCATGTTGGGGCCCCCTCTTCCTGCTGCTGTTGCTCGACGTGGCGTGCCCGGCGCGC\nACCTGTCCTGTACTCCTGCAGGCCGATCCCTTCCTGGGACTGGCCATGGCCAACTCACTT\nCTGAACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGCTCCTGCGCCTG\nGTCTGCTGCGGACGCCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGC\nGCGGCTGAGGCTTCCGGGGGCCTGCGCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTC\nAGCGGCTCGGAGCGCTCATCGCCCCAGCGCGACGGGCTGGACACCAGCGGCTCCACAGGC\nAGCCCCGGTGCACCCACAGCCGCCCGGACTCTGGTATCAGAACCGGCTGCAGACTGA","gene_format":"FASTA","parent_id":"BE0002432"} {"id":"P21453","source":"Swiss-Prot","name":"Sphingosine 1-phosphate receptor 1","general_function":"Sphingosine-1-phosphate receptor activity","specific_function":"G-protein coupled receptor for the bioactive lysosphingolipid sphingosine 1-phosphate (S1P) that seems to be coupled to the G(i) subclass of heteromeric G proteins. Signaling leads to the activation of RAC1, SRC, PTK2/FAK1 and MAP kinases. Plays an important role in cell migration, probably via its role in the reorganization of the actin cytoskeleton and the formation of lamellipodia in response to stimuli that increase the activity of the sphingosine kinase SPHK1. Required for normal chemotaxis toward sphingosine 1-phosphate. 
Required for normal embryonic heart development and normal cardiac morphogenesis. Plays an important role in the regulation of sprouting angiogenesis and vascular maturation. Inhibits sprouting angiogenesis to prevent excessive sprouting during blood vessel development. Required for normal egress of mature T-cells from the thymus into the blood stream and into peripheral lymphoid organs. Plays a role in the migration of osteoclast precursor cells, the regulation of bone mineralization and bone homeostasis (By similarity). Plays a role in responses to oxidized 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine by pulmonary endothelial cells and in the protection against ventilator-induced lung injury.","gene_name":"S1PR1","cellular_location":"Cell membrane","transmembrane_regions":"47-68\n83-104\n117-138\n161-182\n197-224\n258-278\n290-310","molecular_weight":42810.195,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0048956|Sphingosine 1-phosphate receptor 1\nMGPTSVPLVKAHRSSVSDYVNYDIIVRHYNYTGKLNISADKENSIKLTSVVFILICCFII\nLENIFVLLTIWKTKKFHRPMYYFIGNLALSDLLAGVAYTANLLLSGATTYKLTPAQWFLR\nEGSMFVALSASVFSLLAIAIERYITMLKMKLHNGSNNFRLFLLISACWVISLILGGLPIM\nGWNCISALSSCSTVLPLYHKHYILFCTTVFTLLLLSIVILYCRIYSLVRTRSRRLTFRKN\nISKASRSSEKSLALLKTVIIVLSVFIACWAPLFILLLLDVGCKVKTCDILFRAEYFLVLA\nVLNSGTNPIIYTLTNKEMRRAFIRIMSCCKCPSGDSAGKFKRPIIAGMEFSRSKSDNSSH\nPQKDEGDNPETIMSSGNVNSSS","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0048957|Sphingosine 1-phosphate receptor 1 
(S1PR1)\nATGGGGCCCACCAGCGTCCCGCTGGTCAAGGCCCACCGCAGCTCGGTCTCTGACTACGTC\nAACTATGATATCATCGTCCGGCATTACAACTACACGGGAAAGCTGAATATCAGCGCGGAC\nAAGGAGAACAGCATTAAACTGACCTCGGTGGTGTTCATTCTCATCTGCTGCTTTATCATC\nCTGGAGAACATCTTTGTCTTGCTGACCATTTGGAAAACCAAGAAATTCCACCGACCCATG\nTACTATTTTATTGGCAATCTGGCCCTCTCAGACCTGTTGGCAGGAGTAGCCTACACAGCT\nAACCTGCTCTTGTCTGGGGCCACCACCTACAAGCTCACTCCCGCCCAGTGGTTTCTGCGG\nGAAGGGAGTATGTTTGTGGCCCTGTCAGCCTCCGTGTTCAGTCTCCTCGCCATCGCCATT\nGAGCGCTATATCACAATGCTGAAAATGAAACTCCACAACGGGAGCAATAACTTCCGCCTC\nTTCCTGCTAATCAGCGCCTGCTGGGTCATCTCCCTCATCCTGGGTGGCCTGCCTATCATG\nGGCTGGAACTGCATCAGTGCGCTGTCCAGCTGCTCCACCGTGCTGCCGCTCTACCACAAG\nCACTATATCCTCTTCTGCACCACGGTCTTCACTCTGCTTCTGCTCTCCATCGTCATTCTG\nTACTGCAGAATCTACTCCTTGGTCAGGACTCGGAGCCGCCGCCTGACGTTCCGCAAGAAC\nATTTCCAAGGCCAGCCGCAGCTCTGAGAAGTCGCTGGCGCTGCTCAAGACCGTAATTATC\nGTCCTGAGCGTCTTCATCGCCTGCTGGGCACCGCTCTTCATCCTGCTCCTGCTGGATGTG\nGGCTGCAAGGTGAAGACCTGTGACATCCTCTTCAGAGCGGAGTACTTCCTGGTGTTAGCT\nGTGCTCAACTCCGGCACCAACCCCATCATTTACACTCTGACCAACAAGGAGATGCGTCGG\nGCCTTCATCCGGATCATGTCCTGCTGCAAGTGCCCGAGCGGAGACTCTGCTGGCAAATTC\nAAGCGACCCATCATCGCCGGCATGGAATTCAGCCGCAGCAAATCGGACAATTCCTCCCAC\nCCCCAGAAAGACGAAGGGGACAACCCAGAGACCATTATGTCTTCTGGAAACGTCAACTCT\nTCTTCCTAG","gene_format":"FASTA","parent_id":"BE0008652"} {"id":"O95977","source":"Swiss-Prot","name":"Sphingosine 1-phosphate receptor 4","general_function":"Receptor for the lysosphingolipid sphingosine 1-phosphate (S1P). S1P is a bioactive lysophospholipid that elicits diverse physiological effect on most types of cells and tissues. 
May be involved in cell migration processes that are specific for lymphocytes.","specific_function":"G protein-coupled receptor activity","gene_name":"S1PR4","locus":"19p13.3","cellular_location":"Cell membrane","transmembrane_regions":"51-71\n85-105\n118-138\n162-182\n207-227\n253-273\n289-309","molecular_weight":41622.525,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0052329|Sphingosine 1-phosphate receptor 4\nMNATGTPVAPESCQQLAAGGHSRLIVLHYNHSGRLAGRGGPEDGGLGALRGLSVAASCLV\nVLENLLVLAAITSHMRSRRWVYYCLVNITLSDLLTGAAYLANVLLSGARTFRLAPAQWFL\nREGLLFTALAASTFSLLFTAGERFATMVRPVAESGATKTSRVYGFIGLCWLLAALLGMLP\nLLGWNCLCAFDRCSSLLPLYSKRYILFCLVIFAGVLATIMGLYGAIFRLVQASGQKAPRP\nAARRKARRLLKTVLMILLAFLVCWGPLFGLLLADVFGSNLWAQEYLRGMDWILALAVLNS\nAVNPIIYSFRSREVCRAVLSFLCCGCLRLGMRGPGDCLARAVEAHSGASTTDSSLRPRDS\nFRGSRSLSFRMREPLSSISSVRSI","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0052330|Sphingosine 1-phosphate receptor 4 (S1PR4)\nATGAACGCCACGGGGACCCCGGTGGCCCCCGAGTCCTGCCAACAGCTGGCGGCCGGCGGG\nCACAGCCGGCTCATTGTTCTGCACTACAACCACTCGGGCCGGCTGGCCGGGCGCGGGGGG\nCCGGAGGATGGCGGCCTGGGGGCCCTGCGGGGGCTGTCGGTGGCCGCCAGCTGCCTGGTG\nGTGCTGGAGAACTTGCTGGTGCTGGCGGCCATCACCAGCCACATGCGGTCGCGACGCTGG\nGTCTACTATTGCCTGGTGAACATCACGCTGAGTGACCTGCTCACGGGCGCGGCCTACCTG\nGCCAACGTGCTGCTGTCGGGGGCCCGCACCTTCCGTCTGGCGCCCGCCCAGTGGTTCCTA\nCGGGAGGGCCTGCTCTTCACCGCCCTGGCCGCCTCCACCTTCAGCCTGCTCTTCACTGCA\nGGGGAGCGCTTTGCCACCATGGTGCGGCCGGTGGCCGAGAGCGGGGCCACCAAGACCAGC\nCGCGTCTACGGCTTCATCGGCCTCTGCTGGCTGCTGGCCGCGCTGCTGGGGATGCTGCCT\nTTGCTGGGCTGGAACTGCCTGTGCGCCTTTGACCGCTGCTCCAGCCTTCTGCCCCTCTAC\nTCCAAGCGCTACATCCTCTTCTGCCTGGTGATCTTCGCCGGCGTCCTGGCCACCATCATG\nGGCCTCTATGGGGCCATCTTCCGCCTGGTGCAGGCCAGCGGGCAGAAGGCCCCACGCCCA\nGCGGCCCGCCGCAAGGCCCGCCGCCTGCTGAAGACGGTGCTGATGATCCTGCTGGCCTTC\nCTGGTGTGCTGGGGCCCACTCTTCGGGCTGCTGCTGGCCGACGTCTTTGGCTCCAACCTC\nTGGGCCCAGGAGTACCTGCGGGGCATGGACTGGATCCTGGCCCTGGCCGTCCTCAACTCG\nGCGGTCAACCCCATCATCTACTCCTTCCGCAGCAGGGAGGTGTGCAGAGCCGTGCTCAGC\nTTCCTCTGCTGCGGGTGTCTCCGGCTGGGCATGCGAGGGCCCGGGGACT
GCCTGGCCCGG\nGCCGTCGAGGCTCACTCCGGAGCTTCCACCACCGACAGCTCTCTGAGGCCAAGGGACAGC\nTTTCGCGGCTCCCGCTCGCTCAGCTTTCGGATGCGGGAGCCCCTGTCCAGCATCTCCAGC\nGTGCGGAGCATCTGA","gene_format":"FASTA","parent_id":"BE0009882"} {"id":"Q99500","source":"Swiss-Prot","name":"Sphingosine 1-phosphate receptor 3","general_function":"Receptor for the lysosphingolipid sphingosine 1-phosphate (S1P). S1P is a bioactive lysophospholipid that elicits diverse physiological effect on most types of cells and tissues. When expressed in rat HTC4 hepatoma cells, is capable of mediating S1P-induced cell proliferation and suppression of apoptosis.","specific_function":"G protein-coupled receptor activity","gene_name":"S1PR3","locus":"9q22.1","cellular_location":"Cell membrane","transmembrane_regions":"41-65\n73-101\n116-134\n154-179\n196-216\n244-265\n282-302","molecular_weight":42249.805,"organism":"Humans","organism_ncbi_taxonomy_id":9606,"amino_acid_sequence":">lcl|BSEQ0052327|Sphingosine 1-phosphate receptor 3\nMATALPPRLQPVRGNETLREHYQYVGKLAGRLKEASEGSTLTTVLFLVICSFIVLENLMV\nLIAIWKNNKFHNRMYFFIGNLALCDLLAGIAYKVNILMSGKKTFSLSPTVWFLREGSMFV\nALGASTCSLLAIAIERHLTMIKMRPYDANKRHRVFLLIGMCWLIAFTLGALPILGWNCLH\nNLPDCSTILPLYSKKYIAFCISIFTAILVTIVILYARIYFLVKSSSRKVANHNNSERSMA\nLLRTVVIVVSVFIACWSPLFILFLIDVACRVQACPILFKAQWFIVLAVLNSAMNPVIYTL\nASKEMRRAFFRLVCNCLVRGRGARASPIQPALDPSRSKSSSSNNSSHSPKVKEDLPHTAP\nSSCIMDKNAALQNGIFCN","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0052328|Sphingosine 1-phosphate receptor 3 
(S1PR3)\nATGGCAACTGCCCTCCCGCCGCGTCTCCAGCCGGTGCGGGGGAACGAGACCCTGCGGGAG\nCATTACCAGTACGTGGGGAAGTTGGCGGGCAGGCTGAAGGAGGCCTCCGAGGGCAGCACG\nCTCACCACCGTGCTCTTCTTGGTCATCTGCAGCTTCATCGTCTTGGAGAACCTGATGGTT\nTTGATTGCCATCTGGAAAAACAATAAATTTCACAACCGCATGTACTTTTTCATTGGCAAC\nCTGGCTCTCTGCGACCTGCTGGCCGGCATCGCTTACAAGGTCAACATTCTGATGTCTGGC\nAAGAAGACGTTCAGCCTGTCTCCCACGGTCTGGTTCCTCAGGGAGGGCAGTATGTTCGTG\nGCCCTTGGGGCGTCCACCTGCAGCTTACTGGCCATCGCCATCGAGCGGCACTTGACAATG\nATCAAAATGAGGCCTTACGACGCCAACAAGAGGCACCGCGTCTTCCTCCTGATCGGGATG\nTGCTGGCTCATTGCCTTCACGCTGGGCGCCCTGCCCATTCTGGGCTGGAACTGCCTGCAC\nAATCTCCCTGACTGCTCTACCATCCTGCCCCTCTACTCCAAGAAGTACATTGCCTTCTGC\nATCAGCATCTTCACGGCCATCCTGGTGACCATCGTGATCCTCTACGCACGCATCTACTTC\nCTGGTGAAGTCCAGCAGCCGTAAGGTGGCCAACCACAACAACTCGGAGCGGTCCATGGCA\nCTGCTGCGGACCGTGGTGATTGTGGTGAGCGTGTTCATCGCCTGCTGGTCCCCACTCTTC\nATCCTCTTCCTCATTGATGTGGCCTGCAGGGTGCAGGCGTGCCCCATCCTCTTCAAGGCT\nCAGTGGTTCATCGTGTTGGCTGTGCTCAACTCCGCCATGAACCCGGTCATCTACACGCTG\nGCCAGCAAGGAGATGCGGCGGGCCTTCTTCCGTCTGGTCTGCAACTGCCTGGTCAGGGGA\nCGGGGGGCCCGCGCCTCACCCATCCAGCCTGCGCTCGACCCAAGCAGAAGTAAATCAAGC\nAGCAGCAACAATAGCAGCCACTCTCCGAAGGTCAAGGAAGACCTGCCCCACACAGCCCCC\nTCATCCTGCATCATGGACAAGAACGCAGCACTTCAGAATGGGATCTTCTGCAACTGA","gene_format":"FASTA","parent_id":"BE0009881"} {"id":"Q5L478","source":"TrEMBL","name":"Nonstructural protein 5A","general_function":"Zinc ion binding","gene_name":"NS5A","molecular_weight":48598.34,"organism":"Hepatitis C Virus","organism_ncbi_taxonomy_id":11103,"amino_acid_sequence":">lcl|BSEQ0022270|Nonstructural protein 
5A\nSGSWLRDVWDWICTVLTDFKTWLQSKLLPRIPGVPFLSCQRGYKGVWRGDGIMHTTCPCG\nAQITGHVKNGSMRIVGPKTCSNTWHGTFPINTYTTGPCTPSPAPNYSRALWRVAAEEYVE\nVTRVGDFHYVTGMTTDNVKCPCQVPAPEFFTEVDGVRLHRYAPACKPLLREEVTFMVGLN\nQYLVGSQLPCEPEPDVTVLTSMLTDPSHITAETAGRRLARGSPPSLASSSASQLSAPSLK\nATCTTRHDSPDADLIEANLLWRQEMGGNITRVESENKVVILDSFDPLQAEEDEREVSVPA\nEILRKSRKFPRAMPIWARPDYNPPLLESWKDPDYVPPVVHGCPLPPAKAPPIPPPRRKRT\nVVLTESTVSSALAELATKTFGSSGSSAVDSGTATAPPDQLPGDGDSGSDVESYSSMPPLE\nGEPGDPDLSDGSWSTMSEEASEDVVCC","amino_acid_format":"FASTA","gene_format":"FASTA","parent_id":"BE0008651"} {"id":"Q05318","source":"Swiss-Prot","name":"RNA-directed RNA polymerase L","general_function":"RNA-directed RNA polymerase that catalyzes the transcription of viral mRNAs, their capping and polyadenylation. The template is composed of the viral RNA tightly encapsidated by the nucleoprotein (N). The viral polymerase binds to the genomic RNA at the 3' leader promoter, and subsequently transcribes all viral mRNAs with a decreasing efficiency. The first gene is the most transcribed, and the last the least transcribed. The viral phosphoprotein acts as a processivity factor. Capping is concomitant with initiation of mRNA transcription. Indeed, a GDP polyribonucleotidyl transferase (PRNTase) adds the cap structure when the nascent RNA chain length has reached a few nucleotides. Ribose 2'-O methylation of viral mRNA cap precedes and facilitates subsequent guanine-N-7 methylation, both activities being carried out by the viral polymerase. Polyadenylation of mRNAs occurs by a stuttering mechanism at a slippery stop site present at the end of viral genes. 
After finishing transcription of a mRNA, the polymerase can resume transcription of the downstream gene.","specific_function":"Atp binding","gene_name":"L","cellular_location":"Virion","molecular_weight":252722.045,"organism":"Zaire ebolavirus (strain Mayinga-76)","organism_ncbi_taxonomy_id":128952,"amino_acid_sequence":">lcl|BSEQ0052294|RNA-directed RNA polymerase L\nMATQHTQYPDARLSSPIVLDQCDLVTRACGLYSSYSLNPQLRNCKLPKHIYRLKYDVTVT\nKFLSDVPVATLPIDFIVPVLLKALSGNGFCPVEPRCQQFLDEIIKYTMQDALFLKYYLKN\nVGAQEDCVDEHFQEKILSSIQGNEFLHQMFFWYDLAILTRRGRLNRGNSRSTWFVHDDLI\nDILGYGDYVFWKIPISMLPLNTQGIPHAAMDWYQASVFKEAVQGHTHIVSVSTADVLIMC\nKDLITCRFNTTLISKIAEIEDPVCSDYPNFKIVSMLYQSGDYLLSILGSDGYKIIKFLEP\nLCLAKIQLCSKYTERKGRFLTQMHLAVNHTLEEITEMRALKPSQAQKIREFHRTLIRLEM\nTPQQLCELFSIQKHWGHPVLHSETAIQKVKKHATVLKALRPIVIFETYCVFKYSIAKHYF\nDSQGSWYSVTSDRNLTPGLNSYIKRNQFPPLPMIKELLWEFYHLDHPPLFSTKIISDLSI\nFIKDRATAVERTCWDAVFEPNVLGYNPPHKFSTKRVPEQFLEQENFSIENVLSYAQKLEY\nLLPQYRNFSFSLKEKELNVGRTFGKLPYPTRNVQTLCEALLADGLAKAFPSNMMVVTERE\nQKESLLHQASWHHTSDDFGEHATVRGSSFVTDLEKYNLAFRYEFTAPFIEYCNRCYGVKN\nVFNWMHYTIPQCYMHVSDYYNPPHNLTLENRDNPPEGPSSYRGHMGGIEGLQQKLWTSIS\nCAQISLVEIKTGFKLRSAVMGDNQCITVLSVFPLETDADEQEQSAEDNAARVAASLAKVT\nSACGIFLKPDETFVHSGFIYFGKKQYLNGVQLPQSLKTATRMAPLSDAIFDDLQGTLASI\nGTAFERSISETRHIFPCRITAAFHTFFSVRILQYHHLGFNKGFDLGQLTLGKPLDFGTIS\nLALAVPQVLGGLSFLNPEKCFYRNLGDPVTSGLFQLKTYLRMIEMDDLFLPLIAKNPGNC\nTAIDFVLNPSGLNVPGSQDLTSFLRQIVRRTITLSAKNKLINTLFHASADFEDEMVCKWL\nLSSTPVMSRFAADIFSRTPSGKRLQILGYLEGTRTLLASKIINNNTETPVLDRLRKITLQ\nRWSLWFSYLDHCDNILAEALTQITCTVDLAQILREYSWAHILEGRPLIGATLPCMIEQFK\nVFWLKPYEQCPQCSNAKQPGGKPFVSVAVKKHIVSAWPNASRISWTIGDGIPYIGSRTED\nKIGQPAIKPKCPSAALREAIELASRLTWVTQGSSNSDLLIKPFLEARVNLSVQEILQMTP\nSHYSGNIVHRYNDQYSPHSFMANRMSNSATRLIVSTNTLGEFSGGGQSARDSNIIFQNVI\nNYAVALFDIKFRNTEATDIQYNRAHLHLTKCCTREVPAQYLTYTSTLDLDLTRYRENELI\nYDSNPLKGGLNCNISFDNPFFQGKRLNIIEDDLIRLPHLSGWELAKTIMQSIISDSNNSS\nTDPISSGETRSFTTHFLTYPKIGLLYSFGAFVSYYLGNTILRTKKLTLDNFLYYLTTQIH\nNLPHRSLRILKPTFKHASVMSRLMSIDPHFSIYIGGAAGDRGLSDAARLFLRTSISSFLT\nFVKEWIINRGTIVPLWI
VYPLEGQNPTPVNNFLYQIVELLVHDSSRQQAFKTTISDHVHP\nHDNLVYTCKSTASNFFHASLAYWRSRHRNSNRKYLARDSSTGSSTNNSDGHIERSQEQTT\nRDPHDGTERNLVLQMSHEIKRTTIPQENTHQGPSFQSFLSDSACGTANPKLNFDRSRHNV\nKFQDHNSASKREGHQIISHRLVLPFFTLSQGTRQLTSSNESQTQDEISKYLRQLRSVIDT\nTVYCRFTGIVSSMHYKLDEVLWEIESFKSAVTLAEGEGAGALLLIQKYQVKTLFFNTLAT\nESSIESEIVSGMTTPRMLLPVMSKFHNDQIEIILNNSASQITDITNPTWFKDQRARLPKQ\nVEVITMDAETTENINRSKLYEAVYKLILHHIDPSVLKAVVLKVFLSDTEGMLWLNDNLAP\nFFATGYLIKPITSSARSSEWYLCLTNFLSTTRKMPHQNHLSCKQVILTALQLQIQRSPYW\nLSHLTQYADCELHLSYIRLGFPSLEKVLYHRYNLVDSKRGPLVSITQHLAHLRAEIRELT\nNDYNQQRQSRTQTYHFIRTAKGRITKLVNDYLKFFLIVQALKHNGTWQAEFKKLPELISV\nCNRFYHIRDCNCEERFLVQTLYLHRMQDSEVKLIERLTGLLSLFPDGLYRFD","amino_acid_format":"FASTA","gene_sequence":">lcl|BSEQ0052295|RNA-directed RNA polymerase L (L)\nATGGCTACACAACATACCCAATACCCAGACGCTAGGTTATCATCACCAATTGTATTGGAC\nCAATGTGACCTAGTCACTAGAGCTTGCGGGTTATATTCATCATACTCCCTTAATCCGCAA\nCTACGCAACTGTAAACTCCCGAAACATATCTACCGTTTGAAATACGATGTAACTGTTACC\nAAGTTCTTGAGTGATGTACCAGTGGCGACATTGCCCATAGATTTCATAGTCCCAGTTCTT\nCTCAAGGCACTGTCAGGCAATGGATTCTGTCCTGTTGAGCCGCGGTGCCAACAGTTCTTA\nGATGAAATCATTAAGTACACAATGCAAGATGCTCTCTTCTTGAAATATTATCTCAAAAAT\nGTGGGTGCTCAAGAAGACTGTGTTGATGAACACTTTCAAGAGAAAATCTTATCTTCAATT\nCAGGGCAATGAATTTTTACATCAAATGTTTTTCTGGTATGATCTGGCTATTTTAACTCGA\nAGGGGTAGATTAAATCGAGGAAACTCTAGATCAACATGGTTTGTTCATGATGATTTAATA\nGACATCTTAGGCTATGGGGACTATGTTTTTTGGAAGATCCCAATTTCAATGTTACCACTG\nAACACACAAGGAATCCCCCATGCTGCTATGGACTGGTATCAGGCATCAGTATTCAAAGAA\nGCGGTTCAAGGGCATACACACATTGTTTCTGTTTCTACTGCCGACGTCTTGATAATGTGC\nAAAGATTTAATTACATGTCGATTCAACACAACTCTAATCTCAAAAATAGCAGAGATTGAG\nGATCCAGTTTGTTCTGATTATCCCAATTTTAAGATTGTGTCTATGCTTTACCAGAGCGGA\nGATTACTTACTCTCCATATTAGGGTCTGATGGGTATAAAATTATTAAGTTCCTCGAACCA\nTTGTGCTTGGCCAAAATTCAATTATGCTCAAAGTACACTGAGAGGAAGGGCCGATTCTTA\nACACAAATGCATTTAGCTGTAAATCACACCCTAGAAGAAATTACAGAAATGCGTGCACTA\nAAGCCTTCACAGGCTCAAAAGATCCGTGAATTCCATAGAACATTGATAAGGCTGGAGATG\nACGCCACAACAACTTTGTGAGCTATTTTCCATTCAAAAACACTGGGGGCATCCTGTGCTA\nCATAGTGAAACAGCAATCCAAAAAGTTAAAAAACATGCTACGGTGCTAAAAGCATTACGC\nCCTATA
GTGATTTTCGAGACATACTGTGTTTTTAAATATAGTATTGCCAAACATTATTTT\nGATAGTCAAGGATCTTGGTACAGTGTTACTTCAGATAGGAATCTAACACCGGGTCTTAAT\nTCTTATATCAAAAGAAATCAATTCCCTCCGTTGCCAATGATTAAAGAACTACTATGGGAA\nTTTTACCACCTTGACCACCCTCCACTTTTCTCAACCAAAATTATTAGTGACTTAAGTATT\nTTTATAAAAGACAGAGCTACCGCAGTAGAAAGGACATGCTGGGATGCAGTATTCGAGCCT\nAATGTTCTAGGATATAATCCACCTCACAAATTTAGTACTAAACGTGTACCGGAACAATTT\nTTAGAGCAAGAAAACTTTTCTATTGAGAATGTTCTTTCCTACGCACAAAAACTCGAGTAT\nCTACTACCACAATATCGGAACTTTTCTTTCTCATTGAAAGAGAAAGAGTTGAATGTAGGT\nAGAACCTTCGGAAAATTGCCTTATCCGACTCGCAATGTTCAAACACTTTGTGAAGCTCTG\nTTAGCTGATGGTCTTGCTAAAGCATTTCCTAGCAATATGATGGTAGTTACGGAACGTGAG\nCAAAAAGAAAGCTTATTGCATCAAGCATCATGGCACCACACAAGTGATGATTTTGGTGAA\nCATGCCACAGTTAGAGGGAGTAGCTTTGTAACTGATTTAGAGAAATACAATCTTGCATTT\nAGATATGAGTTTACAGCACCTTTTATAGAATATTGCAACCGTTGCTATGGTGTTAAGAAT\nGTTTTTAATTGGATGCATTATACAATCCCACAGTGTTATATGCATGTCAGTGATTATTAT\nAATCCACCACATAACCTCACACTGGAGAATCGAGACAACCCCCCCGAAGGGCCTAGTTCA\nTACAGGGGTCATATGGGAGGGATTGAAGGACTGCAACAAAAACTCTGGACAAGTATTTCA\nTGTGCTCAAATTTCTTTAGTTGAAATTAAGACTGGTTTTAAGTTACGCTCAGCTGTGATG\nGGTGACAATCAGTGCATTACTGTTTTATCAGTCTTCCCCTTAGAGACTGACGCAGACGAG\nCAGGAACAGAGCGCCGAAGACAATGCAGCGAGGGTGGCCGCCAGCCTAGCAAAAGTTACA\nAGTGCCTGTGGAATCTTTTTAAAACCTGATGAAACATTTGTACATTCAGGTTTTATCTAT\nTTTGGAAAAAAACAATATTTGAATGGGGTCCAATTGCCTCAGTCCCTTAAAACGGCTACA\nAGAATGGCACCATTGTCTGATGCAATTTTTGATGATCTTCAAGGGACCCTGGCTAGTATA\nGGCACTGCTTTTGAGCGATCCATCTCTGAGACACGACATATCTTTCCTTGCAGGATAACC\nGCAGCTTTCCATACGTTTTTTTCGGTGAGAATCTTGCAATATCATCATCTCGGGTTCAAT\nAAAGGTTTTGACCTTGGACAGTTAACACTCGGCAAACCTCTGGATTTCGGAACAATATCA\nTTGGCACTAGCGGTACCGCAGGTGCTTGGAGGGTTATCCTTCTTGAATCCTGAGAAATGT\nTTCTACCGGAATCTAGGAGATCCAGTTACCTCAGGCTTATTCCAGTTAAAAACTTATCTC\nCGAATGATTGAGATGGATGATTTATTCTTACCTTTAATTGCGAAGAACCCTGGGAACTGC\nACTGCCATTGACTTTGTGCTAAATCCTAGCGGATTAAATGTCCCTGGGTCGCAAGACTTA\nACTTCATTTCTGCGCCAGATTGTACGCAGGACCATCACCCTAAGTGCGAAAAACAAACTT\nATTAATACCTTATTTCATGCGTCAGCTGACTTCGAAGACGAAATGGTTTGTAAATGGCTA\nTTATCATCAACTCCTGTTATGAGTCGTTTTGCGGCCGATATCTTTTCACGCACGCCGAGC\nGGGAAGCGATTGCAAATTCTAG
GATACCTGGAAGGAACACGCACATTATTAGCCTCTAAG\nATCATCAACAATAATACAGAGACACCGGTTTTGGACAGACTGAGGAAAATAACATTGCAA\nAGGTGGAGCCTATGGTTTAGTTATCTTGATCATTGTGATAATATCCTGGCGGAGGCTTTA\nACCCAAATAACTTGCACAGTTGATTTAGCACAGATTCTGAGGGAATATTCATGGGCTCAT\nATTTTAGAGGGAAGACCTCTTATTGGAGCCACACTCCCATGTATGATTGAGCAATTCAAA\nGTGTTTTGGCTGAAACCCTACGAACAATGTCCGCAGTGTTCAAATGCAAAGCAACCAGGT\nGGGAAACCATTCGTGTCAGTGGCAGTCAAGAAACATATTGTTAGTGCATGGCCGAACGCA\nTCCCGAATAAGCTGGACTATCGGGGATGGAATCCCATACATTGGATCAAGGACAGAAGAT\nAAGATAGGACAACCTGCTATTAAACCAAAATGTCCTTCCGCAGCCTTAAGAGAGGCCATT\nGAATTGGCGTCCCGTTTAACATGGGTAACTCAAGGCAGTTCGAACAGTGACTTGCTAATA\nAAACCATTTTTGGAAGCACGAGTAAATTTAAGTGTTCAAGAAATACTTCAAATGACCCCT\nTCACATTACTCAGGAAATATTGTTCACAGGTACAACGATCAATACAGTCCTCATTCTTTC\nATGGCCAATCGTATGAGTAATTCAGCAACGCGATTGATTGTTTCTACAAACACTTTAGGT\nGAGTTTTCAGGAGGTGGCCAGTCTGCACGCGACAGCAATATTATTTTCCAGAATGTTATA\nAATTATGCAGTTGCACTGTTCGATATTAAATTTAGAAACACTGAGGCTACAGATATCCAA\nTATAATCGTGCTCACCTTCATCTAACTAAGTGTTGCACCCGGGAAGTACCAGCTCAGTAT\nTTAACATACACATCTACATTGGATTTAGATTTAACAAGATACCGAGAAAACGAATTGATT\nTATGACAGTAATCCTCTAAAAGGAGGACTCAATTGCAATATCTCATTCGATAATCCATTT\nTTCCAAGGTAAACGGCTGAACATTATAGAAGATGATCTTATTCGACTGCCTCACTTATCT\nGGATGGGAGCTAGCCAAGACCATCATGCAATCAATTATTTCAGATAGCAACAATTCATCT\nACAGACCCAATTAGCAGTGGAGAAACAAGATCATTCACTACCCATTTCTTAACTTATCCC\nAAGATAGGACTTCTGTACAGTTTTGGGGCCTTTGTAAGTTATTATCTTGGCAATACAATT\nCTTCGGACTAAGAAATTAACACTTGACAATTTTTTATATTACTTAACTACTCAAATTCAT\nAATCTACCACATCGCTCATTGCGAATACTTAAGCCAACATTCAAACATGCAAGCGTTATG\nTCACGGTTAATGAGTATTGATCCTCATTTTTCTATTTACATAGGCGGTGCTGCAGGTGAC\nAGAGGACTCTCAGATGCGGCCAGGTTATTTTTGAGAACGTCCATTTCATCTTTTCTTACA\nTTTGTAAAAGAATGGATAATTAATCGCGGAACAATTGTCCCTTTATGGATAGTATATCCG\nCTAGAGGGTCAAAACCCAACACCTGTGAATAATTTTCTCTATCAGATCGTAGAACTGCTG\nGTGCATGATTCATCAAGACAACAGGCTTTTAAAACTACCATAAGTGATCATGTACATCCT\nCACGACAATCTTGTTTACACATGTAAGAGTACAGCCAGCAATTTCTTCCATGCATCATTG\nGCGTACTGGAGGAGCAGACACAGAAACAGCAACCGAAAATACTTGGCAAGAGACTCTTCA\nACTGGATCAAGCACAAACAACAGTGATGGTCATATTGAGAGAAGTCAAGAACAAACCACC\nAGAGATCCACATGATGGCACTGAACGGAATCTAGTCCT
ACAAATGAGCCATGAAATAAAA\nAGAACGACAATTCCACAAGAAAACACGCACCAGGGTCCGTCGTTCCAGTCCTTTCTAAGT\nGACTCTGCTTGTGGTACAGCAAATCCAAAACTAAATTTCGATCGATCGAGACACAATGTG\nAAATTTCAGGATCATAACTCGGCATCCAAGAGGGAAGGTCATCAAATAATCTCACACCGT\nCTAGTCCTACCTTTCTTTACATTATCTCAAGGGACACGCCAATTAACGTCATCCAATGAG\nTCACAAACCCAAGACGAGATATCAAAGTACTTACGGCAATTGAGATCCGTCATTGATACC\nACAGTTTATTGTAGATTTACCGGTATAGTCTCGTCCATGCATTACAAACTTGATGAGGTC\nCTTTGGGAAATAGAGAGTTTCAAGTCGGCTGTGACGCTAGCAGAGGGAGAAGGTGCTGGT\nGCCTTACTATTGATTCAGAAATACCAAGTTAAGACCTTATTTTTCAACACGCTAGCTACT\nGAGTCCAGTATAGAGTCAGAAATAGTATCAGGAATGACTACTCCTAGGATGCTTCTACCT\nGTTATGTCAAAATTCCATAATGACCAAATTGAGATTATTCTTAACAACTCAGCAAGCCAA\nATAACAGACATAACAAATCCTACTTGGTTTAAAGACCAAAGAGCAAGGCTACCTAAGCAA\nGTCGAGGTTATAACCATGGATGCAGAGACAACAGAGAATATAAACAGATCGAAATTGTAC\nGAAGCTGTATATAAATTGATCTTACACCATATTGATCCTAGCGTATTGAAAGCAGTGGTC\nCTTAAAGTCTTTCTAAGTGATACTGAGGGTATGTTATGGCTAAATGATAATTTAGCCCCG\nTTTTTTGCCACTGGTTATTTAATTAAGCCAATAACGTCAAGTGCTAGATCTAGTGAGTGG\nTATCTTTGTCTGACGAACTTCTTATCAACTACACGTAAGATGCCACACCAAAACCATCTC\nAGTTGTAAACAGGTAATACTTACGGCATTGCAACTGCAAATTCAACGAAGCCCATACTGG\nCTAAGTCATTTAACTCAGTATGCTGACTGTGAGTTACATTTAAGTTATATCCGCCTTGGT\nTTTCCATCATTAGAGAAAGTACTATACCACAGGTATAACCTCGTCGATTCAAAAAGAGGT\nCCACTAGTCTCTATCACTCAGCACTTAGCACATCTTAGAGCAGAGATTCGAGAATTAACT\nAATGATTATAATCAACAGCGACAAAGTCGGACTCAAACATATCACTTTATTCGTACTGCA\nAAAGGACGAATCACAAAACTAGTCAATGATTATTTAAAATTCTTTCTTATTGTGCAAGCA\nTTAAAACATAATGGGACATGGCAAGCTGAGTTTAAGAAATTACCAGAGTTGATTAGTGTG\nTGCAATAGGTTCTACCATATTAGAGATTGCAATTGTGAAGAACGTTTCTTAGTTCAAACC\nTTATATTTACATAGAATGCAGGATTCTGAAGTTAAGCTTATCGAAAGGCTGACAGGGCTT\nCTGAGTTTATTTCCGGATGGTCTCTACAGGTTTGATTGA","gene_format":"FASTA","parent_id":"BE0009862"} {"id":"Q809M3","source":"Swiss-Prot","name":"RNA-directed RNA polymerase catalytic subunit","general_function":"RNA-dependent RNA polymerase which is responsible for replication and transcription of virus RNA segments. The transcription of viral mRNAs occurs by a unique mechanism called cap-snatching. 
5' methylated caps of cellular mRNAs are cleaved after 10-13 nucleotides by PA. In turn, these short capped RNAs are used as primers by PB1 for transcription of viral mRNAs. During virus replication, PB1 initiates RNA synthesis and copy vRNA into complementary RNA (cRNA) which in turn serves as a template for the production of more vRNAs.","specific_function":"Nucleotide binding","gene_name":"PB1","cellular_location":"Host nucleus","molecular_weight":86420.48,"organism":"Influenza A virus (strain A/Silky Chicken/Hong Kong/SF189/2001 H5N1 genotype A)","organism_ncbi_taxonomy_id":196430,"amino_acid_sequence":">lcl|BSEQ0052300|RNA-directed RNA polymerase catalytic subunit\nMDVNPTLLFLKVPAQNAISTTFPYTGDPPYSHGTGTGYTMDTVNRTHQYSEKGKWTTNTE\nTGAPQLNPIDGPLPEDNEPSGYAQTDCVLEAMAFLEESHPGIFENSCLETMEIVQQTRVD\nKLTQGRQTYDWTLNRNQPAATALANTIEIFRSNGLTANESGRLIDFLKDVMESMDKEEME\nITTHFQRKRRVRDNMTKKMVTQRTIGKKKQRLNKKSYLIRALTLNTMTKDAERGKLKRRA\nIATPGMQIRGFVYFVETLARSICEKLEQSGLPVGGNEKKAKLANVVRKMMTNSQDTELSF\nTITGDNTKWNENQNPRMFLAMITYITRNQPEWFRNVLSIAPIMFSNKMARLGKGYMFESK\nSMKLRTQIPAEMLANIDLKYFNESTRKKIEKIRPLLIDGTASLSPGMMMGMFNMLSTVLG\nVSILNLGQKRYTKTTYWWDGLQSSDDFALIVNAPNHEGIQAGVDRFYRTCKLVGINMSKK\nKSYINRTGTFEFTSFFYRYGFVANFSMELPSFGVSGINESADMSIGVTVIKNNMINNDLG\nPATAQMALQLFIKDYRYTYRCHRGDTQIQTRRSFELKKLWEQTRSKAGLLVSDGGPNLYN\nIRNLHIPEVCLKWELMDEDYQGRLCNPLNPFVSHKEIESVNNAVVMPAHGPAKSVEYDAV\nATTHSWIPKRNRSILNTSQRGILEDEQMYQKCCNLFEKFFPSSSYRRPVGISSMVEAMVS\nRARIDARIDFESGRIKKEEFAEIMKICSTIEELRRQK","amino_acid_format":"FASTA","gene_format":"FASTA","parent_id":"BE0009866"}