posted on 2018-02-08, 00:00authored byTung T Hoang
With the exponential growth of databases of DNA sequences in the past decades, it has become ineffective to analyze biological data through only the traditional experimental methods. As a result, computational methods that can combine mathematics, statistics, and computer science have been employed to study biological data, giving birth to a new exciting field of bioinformatics. This thesis focuses on two of the most important research directions in the field: determining the evolutionary relationship between DNA sequences, and predicting protein coding regions based on DNA sequences.