差别

这里会显示出您选择的修订版和当前版本之间的差别。

--- stat:rnaseq [2023/06/26 00:14] – [流程] inkit
+++ stat:rnaseq [2024/10/21 13:14] (当前版本) – [rMATS] inkit
@@ 行 1: / 行 1: @@
 ====== RNA-Seq ======
 ====教程====
+  * 向SRA提交数据 [[stat:rnaseq:srasubmit]]
   * 知乎搜索 https://www.zhihu.com/search?q=rnaseq%20fastq%20star&type=content
   * RNA-Seq数据标准化方法 https://zhuanlan.zhihu.com/p/37196518
@@ 行 6: / 行 7: @@
   * RNA-seq：转录组数据分析处理(上) https://zhuanlan.zhihu.com/p/61847802
   * Common File Formats Used by the ENCODE Consortium [[https://www.encodeproject.org/help/file-formats/]]
+  * FAA RNA Seq Compare [[https://www.faa.gov/sites/faa.gov/files/GEN_21001B_diffExp_TechReport.pdf]]
-====流程====
+=====流程=====
+[[https://bioconductor.org/packages/release/BiocViews.html#___GeneExpressionWorkflow]]\\
 Raw Data: Fastq (.gz) ->
   - QC: FastQC
@@ 行 17: / 行 20: @@
   - Differential
+====1 QC ====
 ===FastQC===
 <code>
@@ 行 23: / 行 27: @@
 fastqc --noextract RawData/I409/I409_1.fq.gz -o results/1_initial_qc/
 </code>
+====2 Alignment====
 ===基因组注释数据===
@@ 行 31: / 行 38: @@
 >>GTF [[https://ftp.ensembl.org/pub/release-109/gtf/rattus_norvegicus/]]
 >>GFF [[https://ftp.ensembl.org/pub/release-109/gff3/rattus_norvegicus/]]
+===Alignment Indexing===
+[[https://registry.opendata.aws/jhu-indexes/]]
@@ 行 36: / 行 46: @@
   * [[https://github.com/pachterlab/kallisto]]
   * STAR
+  * HISAT2
+  * Salmon [[https://combine-lab.github.io/salmon/getting_started/]]
+===HISAT2===
+[[https://daehwankimlab.github.io/hisat2/download/]]
+ *  [[https://notebook.community/ssjunnebo/pathogen-informatics-training/Notebooks/RNA-Seq/genome-mapping]]
+<code>
+hisat2-build -p 32 fasta/Rattus_norvegicus.mRatBN7.2.dna.toplevel.fa hisat_index
+hisat2 -x hisat_index/hisat_index -1 I409/I409_1.fq.gz -2 I409/I409_2.fq.gz -S I409.sam -p 32
+</code>
 ===STAR===
@@ 行 51: / 行 73: @@
 STAR --genomeDir star_index --readFilesIn rna4/RawData/I409/I409_1.fq --outSAMtype BAM SortedByCoordinate --quantMode GeneCounts --runThreadN 14
+</code>
+=== kallisto  ===
+<code>
+# bat
+kallisto bus [arguments] FASTQ-files
+kallisto quant -i rattus_index_ki/transcriptome.idx -o reads.kallisto_quant -t 64 --fusion --pseudobam --genomebam --gtf gtf\Rattus_norvegicus.mRatBN7.2.109.gtf rna4\RawData\I409\I409_1.fq.gz rna4\RawData\I409\I409_2.fq.gz
+</code>
+====3 Count ====
+>可以Reads的工具[[https://hbctraining.github.io/Intro-to-rnaseq-hpc-O2/lessons/05_counting_reads.html]]
+>>1 [[https://subread.sourceforge.net/]]
+>>>featureCounts [[https://rnnh.github.io/bioinfo-notebook/docs/featureCounts.html]]
+<code>
+featureCounts -p -M -O -T 32 -a gtf/Rattus_norvegicus.mRatBN7.2.109.gtf -o output.txt data.sam [bam]
+</code>
+===== Splicing =====
+====rMATS====
+===剪接事件===
+  * SE（Skipped Exon，外显子跳跃）
+  * MXE（Mutually Exclusive Exons，相互排斥的外显子）
+  * A5SS（Alternative 5' Splice Site，可变 5' 剪接位点）
+  * A3SS（Alternative 3' Splice Site，可变 3' 剪接位点）
+  * RI（Retained Intron，内含子保留）
+SE (Skipped Exon), MXE (Mutually Exclusive Exons), A5SS (Alternative 5' Splice Site), A3SS (Alternative 3' Splice Site), RI (Retained Intron)
+===计数方法===
+  * JC（Junction Counts）：仅使用跨越剪接点的 reads 进行定量分析。提供更为精准的剪接定量，但可能会漏掉一些低覆盖的剪接事件。
+  * JCEC（Junction Counts and Exon Counts）：使用跨越剪接点和覆盖整个外显子的 reads 进行定量分析。能检测更多事件，但有时可能会引入额外的噪音。
+如果需要更精准的剪接事件识别，建议使用 JC。如果希望尽可能多地检测到所有的剪接事件，可以考虑 JCEC。
+===txt===
+<code>
+ID：剪接事件的唯一标识符。
+GeneID：发生剪接事件的基因的标识符（基因名称或基因 ID）。
+chr：发生剪接事件的染色体位置。
+strand：基因的链信息（正链或负链）。
+longExonStart_0base 和 longExonEnd：选择的较长外显子的起始和终止位置。
+shortES 和 shortEE：选择的较短外显子的起始和终止位置。
+flankingES 和 flankingEE：两侧剪接外显子的位置。
+ID：剪接事件的 ID 编号。
+IncFormLen 和 SkipFormLen：包含和跳过该外显子的转录本的长度。
+ICJ 和 SCJ（Inclusion Junction Counts / Skipping Junction Counts）：代表包含和跳过该剪接事件的 reads 数目。
+IncLevel1 / IncLevel2：代表在两组样本中该剪接事件的包含水平（Ψ值）。
+IncLevelDiff：两组样本间的剪接差异值（ΔΨ，Inclusion Level Difference）。
+PValue 和 FDR：用于判断剪接事件是否显著差异的 P 值和 FDR（假发现率）。
 </code>

Inkit wiki

用户工具

站点工具

差别

页面工具