Applications of Machine Learning Techniques in Genetic Circuit Design

Jiajie Zhu, Qi Zhang, Babak Forouraghi, Xiao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Construction of mathematical models to investigate genetic circuit design is a powerful technique in synthetic biology with real-world applications in biomanufacturing and biosensing. The challenge of building such models is to accurately discover flow of information in simple as well as complex biological systems. However, building synthetic biological models is often a time-consuming process with relatively low prediction accuracy for highly complex genetic circuits. The primary goal of this study was to investigate the utility of various machine learning (ML) techniques to accurately construct mathematical models for predicting gene expressions in genetic circuit designs. Specifically, classification and regressions models were built using Random Forrest (RF), Support Vector Machines (SVM), and Artificial Neural Networks (ANN). The obtained accuracy of the regression model using RF and ANN yielded R2 scores of 0.97 and 0.95, respectively, compared to the best score of 0.63 obtained in an earlier study. Furthermore, a classifier model was built using the green fluorescent protein (GFP) measurements obtained from the experiments conducted in this work. Biologists use GFP as an indicator of gene expression, enabling easy measurement of its protein level in the living cells. The measured GFP values were predicted with 100% accuracy by both RF and ANN classifier models while identifying various synthetic gene circuit patterns. The paper also highlights importance of the relevant data preparation techniques to ensure high accuracy is obtained by the utilized ML models.

Original languageEnglish (US)
Title of host publication2021 13th International Conference on Machine Learning and Computing, ICMLC 2021
PublisherAssociation for Computing Machinery
Pages1-7
Number of pages7
ISBN (Electronic)9781450389310
DOIs
StatePublished - Feb 26 2021
Event2021 13th International Conference on Machine Learning and Computing, ICMLC 2021 - Virtual, Online, China
Duration: Feb 26 2021Mar 1 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2021 13th International Conference on Machine Learning and Computing, ICMLC 2021
Country/TerritoryChina
CityVirtual, Online
Period2/26/213/1/21

Keywords

  • Synthetic gene circuit design
  • gene expression
  • machine Learning

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Applications of Machine Learning Techniques in Genetic Circuit Design'. Together they form a unique fingerprint.

Cite this