
Why Machine Learning Now? 5 Elements That Lay the Foundation of Machine Learning

Machine learning was undoubtedly one of the keywords of 2016 and will continue to be one in 2017 and beyond.

Here are 3 familiar cases that anybody can relate to:

  • Case 1: search engines such as Google
  • Case 2: social media feeds such as Facebook
  • Case 3: text suggestions on mobile keyboards

You can find thousands of links that define machine learning, but knowing the definition alone is not very helpful. What I found more valuable is knowing what enabled machine learning (in other words, what machines learn from) and how it is used in different industries, now and in the future.

Throughout this year, I aim to understand how machine learning works and how it will affect different industries.

First, let us understand the fundamentals that enabled machine learning. In other words, why now?

To summarize, 4 elements are essential:

Cost to Store Data: I credit this as the No. 1 element for machine learning. Over the past decade, the cost of manufacturing hardware has continued to fall, while frameworks such as Hadoop allow distributed processing of large data sets across clusters of computers using simple programming models.

An IDC report suggests that from 2010 to 2015 the cost per unit of data storage fell from US$9 to US$0.2, roughly a 45-fold drop. Cheap storage means more things pile up (which is also why a big house is always full of things we don't need), and this provides the grounding for digital data and everything that follows.

Digital Data: we are creating data every second of our lives, faster than ever before:
  • By 2020, 1.7 megabytes of data will be created every second
  • By 2020, 1.4 trillion gigabytes of data will exist on earth
At the same time, the sources where data is created are becoming more centralised:
  • 31.25M messages are sent on Facebook every minute
  • 350M photos are uploaded to Facebook per day, adding to its current database of 250B photos
  • 40,000 search queries are made on Google alone every second, which equals about 1.2T per year
  • 300 hours of video are uploaded to YouTube every minute
Large volumes of centralised data carrying rich information (location, demographics, etc.) open enormous possibilities for finding patterns, identifying correlations, connecting the dots and predicting what will happen.
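
As a quick sanity check on the Google figure in the list above, here is a small back-of-the-envelope calculation. It assumes the 40,000 figure is a per-second rate, which is the reading that matches the yearly total of about 1.2T; the constant names are mine and only for illustration.

```python
# Back-of-the-envelope check of the Google search figure.
# Assumption: 40,000 is a per-second rate (the yearly total only works out that way).
QUERIES_PER_SECOND = 40_000
SECONDS_PER_YEAR = 60 * 60 * 24 * 365  # ignoring leap years

queries_per_year = QUERIES_PER_SECOND * SECONDS_PER_YEAR

# Express the result in trillions so it can be compared with the "1.2T" in the list.
print(f"{queries_per_year / 1e12:.2f} trillion queries per year")
```

The result lands a little above 1.2 trillion, so the yearly number in the list is a rounded-down version of the same rate.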

IBM teamed up with researchers from Johns Hopkins University to predict outbreaks of dengue fever and malaria. They look at how changes in rainfall, temperature and even soil acidity can dramatically affect the populations of wild animals and insects that carry the diseases.

Deep Learning Algorithm: nevertheless, data is just one part of what learning requires. Since we were kids, we have known that there must be a way to learn, or the information is nonsense. And we know that first we teach, and then the kid learns by itself.

The ways machines learn are different algorithms suited to different use cases. There will likely never be a "one-size-fits-all" algorithm (let's put religion aside for now). But we can look at the most popular algorithms on the market and how they can be used.
Ironically, there are many "most popular algorithms" lists out there (in my references you can find 4 different ones), but as long as you know the problem you are trying to solve, the answer should not be far away.

Oracle offers a good framework for determining what you are trying to solve. It groups the algorithms into 7 categories, and I have put each name I found into one of the buckets:

  • Classification: logistic regression, naïve Bayes, SVM, decision trees, nearest neighbours, etc.
  • Regression: multiple regression, SVM, linear regression, PLS
  • Attribute importance: MDL, non-negative matrix factorization
  • Anomaly detection: one-class SVM
  • Clustering: k-means, orthogonal partitioning
  • Association: Apriori
  • Feature extraction: NNMF, dimensionality reduction, singular value decomposition, random forest
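
To make one of these buckets concrete, here is a minimal sketch of k-means (the clustering bucket) in plain Python. It is not Oracle's implementation; the function name `kmeans_1d`, the sample points and the starting centres are all made up for illustration, and it only handles one-dimensional data.

```python
def kmeans_1d(points, centers, iterations=10):
    """Toy k-means for 1-D data (illustrative only).

    Repeats two steps: assign each point to its nearest centre,
    then move each centre to the mean of the points assigned to it.
    """
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: group points by their nearest centre.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each centre to the mean of its cluster.
        # An empty cluster keeps its previous centre.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


# Hypothetical data: two obvious groups, one around 1.5 and one around 10.
points = [1.0, 1.5, 2.0, 9.0, 10.0, 11.0]
centers, clusters = kmeans_1d(points, centers=[0.0, 5.0])
print(centers)   # final centre of each group
print(clusters)  # which points ended up in which group
```

Nobody tells the algorithm which group a point belongs to; it finds the two groups from the data alone, which is what separates clustering from classification.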

The focus of algorithm research, as always, has been on coming up with models that learn faster, apply to more problems and cost less. "Machine Learning: Trends, Perspectives, and Prospects" ends with the expectation of seeing more algorithms that emerge from contrasting current approaches with the types of learning that happen in nature.

To put this in layman's terms: we don't know much about how we learn. We will continue to figure it out and then use algorithms to copy that learning to machines, so they can do specific tasks that we learned to do using our brains.

Computing Power: a GPU is a specialized electronic circuit designed to rapidly manipulate and alter memory to accelerate the creation of images in a frame buffer intended for output to a display. It enables parallel computing, which means we can process large amounts of fragmented data in a very short time. With this, we can quickly put our algorithms, which embody our findings, to the test and get feedback to refine our models.
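
The "split the data, work on the pieces at the same time, combine the results" idea behind parallel computing can be sketched in a few lines of Python. This is only an illustration of the pattern: it uses CPU threads from the standard library rather than a GPU, the function names are hypothetical, and in standard Python threads will not actually speed up this kind of number crunching.

```python
from concurrent.futures import ThreadPoolExecutor


def sum_of_squares(chunk):
    """Work done independently on one piece of the data."""
    return sum(x * x for x in chunk)


def parallel_sum_of_squares(data, workers=4):
    """Split the data into chunks, process the chunks concurrently, combine the results."""
    # Ceiling division so that every element lands in some chunk.
    size = max(1, (len(data) + workers - 1) // workers)
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_sums = list(pool.map(sum_of_squares, chunks))
    return sum(partial_sums)


data = list(range(1_000))
print(parallel_sum_of_squares(data))
```

A GPU applies the same pattern with thousands of small cores instead of four workers, which is why it suits the repetitive arithmetic that training a model requires.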

These are the 4 most credited factors that machine learning is based on. I would add another factor that I see as equally critical to the phenomenon:

The Drive of Evolution: our desire to live better, do less and know more. I was going to stop at the 4th point. You see, cost goes down, more product (data) is created and more tests can be done (computing), so we can learn faster and better (algorithm). It is a complete storyline.

But why do we do this? If there were no why, I would be confident in saying we could just as well stop tomorrow. Such things happen everywhere: when the hype goes away, so does everything else. When the task becomes too hard, we divert our attention to something else.

I would say this is much less the case for machine learning because of the 5th reason, which I will put as: "there are always a few who are interested in exploring naively and will never stop; there are more who desire to find those few and push their findings to market so they gain money, fame or fulfillment in life; and there are many more who work without knowing why so they can enjoy a better life."

If I can put this into one sentence, I would say it is the diversity of human values, of what we need to fulfill our lives, that drives machine learning.

It is hard to say who will learn more throughout this trip, the machine or us, but for sure, whatever we have done, we can improve.

The opportunities in machine learning apply to machines as well as to society. It is still in its early stage and will stay there for a while. This year, let's look at what has been done and what we think will happen in the near future.


