Data Preparation and User Labelling for Time Series Classification

In this article I want to share practical experience with applying classification algorithms to segment a large data set of industrial time series. The data set contains approximately 100 sensors sampled in parallel at equidistant time steps, so we have a good source for our machine learning experiment. We are using the vectors representing the system …


kMeans for time series segmentation

In this post, we look at using k-Means to segment a multivariate time series. We treat the vector of the scalar values of all variables sampled at the same time as the system vector. The clustering is executed in the feature space spanned by the variables, i.e. each variable provides one dimension …
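As a rough illustration of the idea in this teaser, here is a minimal sketch and not the implementation used in the post. It assumes a tiny synthetic data set with two hypothetical sensors (`temperature` and `pressure`), builds one system vector per time step, clusters the vectors with a plain standard-library version of Lloyd's k-Means algorithm, and collapses consecutive equal labels into segments. The names `kmeans` and `segments` are made up for this example; in practice a library implementation would normally replace the hand-written loop.

```python
import random


def kmeans(vectors, k, n_iter=50, seed=0):
    """Plain Lloyd's algorithm on a list of equal-length system vectors."""
    rng = random.Random(seed)
    # Assumption: initial centroids are k distinct time steps picked at random.
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = None
    for _ in range(n_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        new_labels = [
            min(
                range(k),
                key=lambda c: sum((x - m) ** 2 for x, m in zip(v, centroids[c])),
            )
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids are too
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old centroid if a cluster ran empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def segments(labels):
    """Collapse runs of equal labels into (start, end, label) tuples."""
    runs = []
    start = 0
    for i in range(1, len(labels) + 1):
        if i == len(labels) or labels[i] != labels[start]:
            runs.append((start, i - 1, labels[start]))
            start = i
    return runs


# Hypothetical sensors, sampled in parallel at equidistant time steps.
sensors = {
    "temperature": [0.00, 0.01, 0.02, 0.03, 0.04] + [5.0] * 5
    + [0.00, 0.01, 0.02, 0.03, 0.04],
    "pressure": [1.0] * 5 + [5.00, 5.01, 5.02, 5.03, 5.04] + [1.0] * 5,
}

# One system vector per time step; each sensor contributes one dimension.
system_vectors = [list(values) for values in zip(*sensors.values())]

labels, centroids = kmeans(system_vectors, k=2)
for start, end, label in segments(labels):
    print(f"time steps {start}-{end}: cluster {label}")
```

On this synthetic input the first and last five time steps should end up in the same cluster and the middle five in the other one, which gives three segments. Note that k-Means itself ignores time order; the segmentation only appears when the labels are read back along the time axis. With real sensors on different scales, the variables would typically need to be normalized first so that no single dimension dominates the distance.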


autonomous analytics

Most of the time when I do data science, I am not working on the actual model building; instead, I work on tools and improvements that reduce my work to a minimum. I am trying to optimize my own workflow. Engineers and developers do that all the time: building tools to automate …
