TopAffy: predicting transcription factors DNA-binding specificities using a general topological method

dc.contributor.advisorPe�a-Castillo, Lourdes
dc.contributor.authorZier-Vogel, Ryan
dc.date.issued2021-09
dc.description.abstractTranscription factors (TFs) recognize and bind to specific DNA sequences. Knowing the binding specificity of TFs is crucial to understand gene regulation and how genetic differences in the DNA sequence of TF binding sites affect TF DNA binding activity. However, the transcription factor binding preferences of only 1% of all eukaryotic TFs are known. Computational prediction of TF binding preferences is an affordable and efficient way to increase the number of known binding preferences. Most bioinformatic tools for predicting the binding preferences of TFs require as input the binding preferences of related TFs. However, there are TF families for which very little experimental data is available. In this work, we present TopAffy, a new approach for predicting TF 8-mer binding profiles. TopAffy constructs a stochastic topological representation of DNA-binding domain sequences and learns a numerical representation of the binding preferences of neighbouring amino acid pairs. TopAffy's main contribution is to construct a family-independent model which can be used to predict the 8-mer binding profile for TF families for which no experimental data is yet available. TopAffy's predictive performance is comparable to the performance of state-of-the-art family-specific approaches. Our results demonstrate that it is possible to learn a general model of binding specificities suitable for predicting binding preferences for a number of TF families.
dc.description.noteIncludes bibliographical references (pages 65-75).
dc.format.extentix, 75 pages : some color illustrations, color maps
dc.format.mediumText
dc.identifier.doihttps://doi.org/10.48336/CKGH-MG44
dc.identifier.urihttps://hdl.handle.net/20.500.14783/14743
dc.language.isoen
dc.publisherMemorial University of Newfoundland
dc.rights.licenseThe author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.
dc.subjectDNA-binding specificities
dc.subject.lcshDNA
dc.subject.lcshTranscription factors
dc.subject.lcshDNA-binding proteins
dc.subject.lcshComputer science
dc.subject.lcshGenetic regulation
dc.subject.lcshStochastic processes.
dc.titleTopAffy: predicting transcription factors DNA-binding specificities using a general topological method
dc.typeDoctoral thesis
mem.campusSt. John's Campus
mem.convocationDate2021-10
mem.departmentComputer Science
mem.divisionsCompSci
mem.facultyFaculty of Science
mem.fullTextStatuspublic
mem.institutionMemorial University of Newfoundland
mem.isPublishedunpub
mem.thesisAuthorizedNameZier-Vogel, Ryan
thesis.degree.disciplineComputer Science
thesis.degree.grantorMemorial University of Newfoundland
thesis.degree.leveldoctoral
thesis.degree.namePh. D.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis.pdf
Size:
3.53 MB
Format:
Adobe Portable Document Format

Collections