Bioinformatics. Группа авторов

Читать онлайн книгу.

Bioinformatics - Группа авторов


Скачать книгу
that differ from the primary assembly because of allelic sequence or incorrect sequence, as determined by the Genome Reference Consortium. The Region in detail shows a zoomed-in view of the region outlined by the red box in the top section of the page. Genes are indicated by rectangles, colored as described in the gene legend below the graphic. The gene identifiers, along with the direction of transcription, are shown below the rectangles. The bottom section shows a zoomed-in view of the region surrounded by the red box in the Region in detail. The blue bar represents the genomic contig in this region. In the Genes track, genes above the bar are transcribed from left to right; those below the contig are transcribed from right to left. A few of the PAH transcripts, which are transcribed from right to left, are visible in this view. Gold transcripts are merged HAVANA/Ensembl transcripts; red are Ensembl protein-coding transcripts; blue transcripts are non-protein-coding processed transcripts. The pop-up display, activated when clicking on a particular transcript, shows the details for the first transcript in the Genes track, PAH-215.

      Box 4.4 Ensembl Stable IDs

      Ensembl assigns accession numbers to many data types in its database. Each identifier begins with the organism prefix; for human, the prefix is ENS; for mouse, it is ENSMUS; and for anole lizard, it is ENSACA. Next comes an abbreviation for the feature type: G for gene, T for transcript, P for protein, R for regulatory, and so forth. This is followed by a series of digits, and an optional version. The version number increments when there is a change in the underlying data. The gene version changes when the underlying transcripts are updated, and the transcript and protein versions increment when the sequence changes.

      For example, the human PAH gene has the following identifiers:

       ENSG00000171759.9: the identifier of the human PAH gene

       ENST00000553106.5: the identifier of one transcript of the human PAH gene, transcript PAH-215

       ENSP00000448059.1: the identifier of the protein translation of transcript PAH-215, ENST00000553106.5

       ENSR00000056420: the identifier of a promoter of several PAH transcripts

Snapshot depicts zooming in on the bottom section of the Location tab. (a) Highlight a region of interest, the final exon of PAH transcript PAH-203, by clicking the mouse and then scrolling to the left or right. In order to highlight the region, the Drag or Select toggle in the blue bar at the top of the section must first be set to Select. (b) To zoom in to the highlighted region, select Jump to region. It may take a few iterations to create the view in this figure. At the bottom of the window is a track labeled All phenotype-associated with short variants. Скачать книгу