当前位置:网站首页>Answer the reader's question (7): About doublet
Answer the reader's question (7): About doublet
2022-07-18 09:29:00 【Top bioinformation】
Several questions have been released on gongzong number , Next, I will give my thoughts in turn , For reference only . If you have different opinions , Welcome to email me to discuss :[email protected]
problem
*How do I get there? doublet? When selecting a large category for dimension reduction clustering of small categories , Some people will still check various categories marker Expression of , Is this step necessary ?
*
1
The most effective way is to start from the upstream , Control the number of cells and optimize the process before sequencing .
2
However, those who analyze the data can only look at it from the perspective of analysis . Usually I will 「 First use software to predict 」, Please refer to a post I wrote before : Single cell analysis record (4): doublet testing .
Here comes a question , Go to doublet At which step :
Run away cellranger Get the matrix , Go to doublet, basic QC( Gene number 、UMI Count and so on )... Run away cellranger Get the matrix , basic QC( Gene number 、UMI Count and so on ), Go to doublet...
There should be little difference between the two . I'm usually the first , Considering the basic QC The threshold may be changed back and forth several times , So basic QC This step should be relatively backward , Save running doublet This step
I usually use multiple software ,「 Two 」 The software also reports doublet Of cellular barcode, I will pick out and remove .
3
besides , I will also use some classic marker Come looking for doublet.( This method can also be used to annotate cells )
celltype_marker=c(
"Epcam",# epithelial cells epithelial
"Pecam1","Cdh5",# Endothelial cells endothelial
"Pdgfra","Col1a1","Col3a1",# Fibroblasts fibroblasts
"Fcgr1","Cd163","Aif1","Cd68",# Myeloid cells myeloid
"Ms4a1",#B cells
"Cd3g","Cd3e",#T cells
"Ncr1",#NK cells
"Ptprc"# Immune cells
)
VlnPlot(allseu,features = celltype_marker,pt.size = 0,ncol = 2)
Like this , If a group cluster It expresses what is unlikely to appear at the same time gene, It could be doublet. Of course, we should rely on some 「 Experience 」, For example, fibrous marker There may be 「 Certain extent 」 The expression of ,CD4( One gene ) There may be 「 Certain extent 」 The expression of , It's possible . but T Cell overexpression EPCAM, I don't believe it .
4
There is another experience : use marker Identification of doublet When , Identified doublet( This is the premise. ), They may gather together to form a small group , It may also look like a bridge connecting two subgroups .
5
in addition , All of the above are for general categories clustering Description of , Do small classes reclustering When , I generally don't think about going between classes doublet, First, it is difficult to distinguish , The second is between subclasses doublet Formation probability ratio class doublet The probability of formation is much lower .
6
There's another situation ( Relatively rare ), Between samples doublet. At this time , utilize SNP It can be distinguished accurately sample1, sample2, as well as doublet(sample1+sample2).
There are already some tools available from scRNA-seq Get a small amount of data SNP Information , And achieve the above purpose , such as Souporcell( original text https://www.nature.com/articles/s41592-020-0820-1), The accuracy is OK .
But in most cases, one sample is tested , This idea is not feasible .
Back to the first 2 A little question
*When selecting a large category for dimension reduction clustering of small categories , Some people will still check various categories marker Expression of , Is this step necessary ?
*
It is necessary to . Doing it reclustering At this stage , Inspection category marker The expression of , I often see other kinds of cells ( Or it is composed of this kind of cells and other large types of cells doublet) Enter in disorder . Friends who have not done this step before , You can try , There will be surprises ( scared ).
At this time, the most rigorous way is to put other large types of cells back where they should go , Single cell clustering annotation is a “ Opening and closing ” The process of , It is also the most time-consuming and cumbersome process .
If the number is small , Directly remove , It doesn't matter much .
If it is composed of this kind of cells and other large types of cells doublet, Directly remove .
边栏推荐
- Tengshi's new luxury medium and large MPV is exposed, safe and comfortable
- 7-Redis架构设计到使用场景-缓存穿透、缓存雪崩、缓存预热、缓存降级
- Codeforces Round #804 B Almost Ternary Matrix
- 要想不踩SaaS那些坑,得先了解“SaaS架构”
- Role based security access control strategy for virtual machine in cloud environment
- Code forces round # 802 B. Palindromic Numbers
- Codeforces Round #802 B. Palindromic Numbers
- Codeforces Global Round 21 D. Permutation Graph
- Easy gene encode histone chip SEQ and transcription factor chip SEQ data standard and processing flow
- Codeforces Round #805 A - G
猜你喜欢

Leetcode-14-longest public prefix

Comprehensive explanation of e-commerce crawler API

CSDN博客专家专属荣耀奖励来了
求助,更新到思源 v2.0.27,dark+ 主题自适应宽度问题

Codeforces Round #803 (Div. 2) C. 3SUM Closure

安装CUDA失败的情况nsight visual studio edition失败

WTK6900H语音识别单芯片实现智能语音识别蓝牙耳机方案设计

SQL也能玩转AI ?没错!MLOps Meetup V3 回顾|OpenMLBD+SQLFlow+Byzer
![[ArcGIS creates a small picture frame of islands and nine segment lines in the South China Sea]](/img/5e/5d81afb511d22214a2545121f22300.png)
[ArcGIS creates a small picture frame of islands and nine segment lines in the South China Sea]

一键生成VR全景图展示
随机推荐
sql语句
华泰证券开户安全吗,是正规证券公司吗?
手机炒股开户哪家券商公司好?哪个更安全
Codeforces Round #803 (Div. 2) C. 3SUM Closure
Help brand insight -- Analysis of consumers' emotional behavior
成长一夏 挑战赛|跟大佬学习 / 创作,得CSDN大礼包、专属荣誉证书、纪念T恤和勋章!
请问政务网服务器的mysql数据已经与dataworks开通网络策略,但是测试还是说数据库测试连通性
A few minutes before work, I thoroughly understood json stringify()
5-Redis架构设计到使用场景-存储原理-数据类型底层结构
Codeforces Global Round 21 A. NIT orz!
【空间&单细胞组学】第2期:联合单细胞和bulk转录组鉴定了结直肠癌中两种上皮肿瘤细胞状态,并完善了CMS分型
量子保密通信浅析
STM32F030使用定时器中断卡死的问题
Win11提示Outlook搜索错误怎么办?Win11提示Outlook搜索错误
求助,更新到思源 v2.0.27,dark+ 主题自适应宽度问题
Analysis of quantum secure communication
Ubuntu 22.04 LTS 是目前最安全的版本的七大原因
一种指纹生物特征的隐私保护和认证方法
idea 新建项目时克隆仓库代码
Codeforces Global Round 21 D. Permutation Graph