我们肯定都遇到许多利用坐标去处理范围信息的需求,比如要定位基因组的某个位置,这个位置可能代表了gene model 、genetic variants(包括了SNPs、inser‐tions/deletions)、transposable elements 、binding sites;又或者想看看染色体某个区域的GC含量、统计overlap、计算coverage、提取序列等。这些都属于Range Data的处理范围。本次着重看原理部分。
forward strand, this means reading left-to-right, and for the reverse strand it means right-to-left
A gene can live on a DNA strand in one of two orientations. The gene is said to have a coding strand (also known as its sense strand), and a template strand (also known as its antisense strand).
mRNA sequence always corresponds to the 5-3 coding sequence of a gene.
mRNA matches the coding sequence of the gene, not the template sequence(看图https://en./wiki/File:Simple_transcription_elongation1.svg) 转录时基因以负链为模板链,从负链的3‘向5’转录(合成的转录本是5‘=》3’,同时与正链/编码链上对应位置的序列一致)