The Data Science track develops strong mathematical, statistical, and computational and programming skills through the general master's core and programming requirements, in addition to providing fundamental data science education through general and focused electives requirement from courses in data sciences and related areas.
This program tackles different fields from a mathematical perspective: Big Data, Machine Learning, Statistical Modelling, Optimisation, Stochastic Process, Advanced Algorithm and a set of computer programming skills.
Finalist prize in Mathematical Competition in Modelling. An international competition organised by the US.
Stochastic signal transformation. Using stochastic model and Python to simulate how signals are transmitted among nodes, optimising the capacity of wireless networks.
Prediction of Missing Links in Citation Network. Construction of features (TF-IDF Similarity, Nnumber, etc.) based on graph and text model; Evaluation of performance of various models (Multilayer Neural Network, Random Trees Classifier, etc.)
Handwritten Digits Recognition with Kernel Method. Implementation in Python of kernel algorithms (Kernel Ridge Regression, SVM and kernel PCA, etc.) only with NumPy and SciPy package.
Implementation of SQL Query with MapReduce. Implementation of basic SQL query (join, sort, etc) ̇ with MapReduce in Hadoop File System in Java.
Building a pricing tool for an international medical insurance product with expatriate contracts. Machine learning approaches are used to precisely estimate the undertaken risk head by head.
Manages workload across the whole APJ region including Southeast Asia, India, Australia and New Zealand.
Managed the routes and daily schedules of 35+ Field Engineers in Optimized Regions (Singapore, New Zealand and Australia).
Provides guidance to other team members on process alignment and adherence and acts as a workflow manager within assigned team. Disseminates process changes and updates training materials as applicable.
Perform data virtualisation by multi-dimensional histogram and clustering plot in order to understand the behaviour(distribution) on raw datasets.
Implement a various numbers of machine learning algorithms on the extracted data set for testing purpose, including SVM, CLUSEQ, HMMs, Time Series and k-NNs.
Adapt new mathematical algorithm to comply with the special data structure(series of dates and sequences) for optimising the insurance fraud detection methods.