美国斯托尔斯医学研究所Julia Zeitlinger、斯坦福大学Anshul Kundaje等研究人员合作揭示碱基分辨率水平的转录因子结合规则。这一研究成果于2021年2月18日在线发表在国际学术期刊《自然—遗传学》上。
研究人员引入了一个深度学习模型BPNet,该模型使用DNA序列预测多能性转录因子(TF)的碱基分辨率染色质免疫沉淀(ChIP)-nexus结合图谱。研究人员开发了解释工具来学习预测性基序表示形式,并为协作式TF结合交互鉴定了语法规则。令人惊讶的是,Nanog优先结合螺旋周期,并且TF通常以定向方式协同作用,研究人员使用簇状规则间隔的短回文重复序列(CRISPR)诱导的点突变进行了验证。
这一模型代表了一种强大的通用方法,可揭示基因组数据中顺式调控序列的基序和语法。
据介绍,TF结合基序的排列(语法)是顺式调节代码的重要组成部分,但仍然难以捉摸。
附:英文原文
Title: Base-resolution models of transcription-factor binding reveal soft motif syntax
Author: iga Avsec, Melanie Weilert, Avanti Shrikumar, Sabrina Krueger, Amr Alexandari, Khyati Dalal, Robin Fropf, Charles McAnany, Julien Gagneur, Anshul Kundaje, Julia Zeitlinger
Issue&Volume: 2021-02-18
Abstract: The arrangement (syntax) of transcription factor (TF) binding motifs is an important part of the cis-regulatory code, yet remains elusive. We introduce a deep learning model, BPNet, that uses DNA sequence to predict base-resolution chromatin immunoprecipitation (ChIP)–nexus binding profiles of pluripotency TFs. We develop interpretation tools to learn predictive motif representations and identify soft syntax rules for cooperative TF binding interactions. Strikingly, Nanog preferentially binds with helical periodicity, and TFs often cooperate in a directional manner, which we validate using clustered regularly interspaced short palindromic repeat (CRISPR)-induced point mutations. Our model represents a powerful general approach to uncover the motifs and syntax of cis-regulatory sequences in genomics data.
DOI: 10.1038/s41588-021-00782-6
Source: https://www.nature.com/articles/s41588-021-00782-6
	Nature Genetics:《自然—遗传学》,创刊于1992年。隶属于施普林格·自然出版集团,最新IF:25.455
	官方网址:https://www.nature.com/ng/
	投稿链接:https://mts-ng.nature.com/cgi-bin/main.plex
