Saturday, 11 June 2016 19:21

Machine Learning is dead - Long live machine learning!

You may be thinking that this title makes no sense at all. ML, AI, ANN and deep learning have made it into the everyday lexicon, and here I am proclaiming that ML is dead. Well, here is what I mean…

The open sourcing of entire ML frameworks marks the end of a phase of rapid tool development, and thus the death of ML as we have known it so far. The next phase will be marked by the ubiquitous integration of these tools into software applications. And that is how ML will live forever: it will seamlessly and inextricably integrate into our lives.

There has been a rapid democratization of data and tools in the past year. New tools and techniques are discovered, converted to code, and released to the public via APIs very quickly, which shortens product lifecycles in the ML and AI space. One way to see this trend is to observe how quickly ML algorithms and AI systems have been open sourced since 2015.

In the first couple of rounds of this open source movement, companies released just the algorithms (Google’s TensorFlow) or the architectures (Facebook). Then came datasets (Yahoo) and APIs (Microsoft’s Project Oxford). Now Google has set a new trend by releasing a fully trained AI (Parsey McParseface and SyntaxNet). Also from earlier this month, we have Ambry by LinkedIn and DSSTNE (pronounced Destiny) by Amazon.

The most exciting thing today is that even without a rigorous theoretical background in ML, one can still apply pre-implemented advanced algorithms such as Conditional Random Fields (CRFs) or Hidden Markov Models (HMMs) to data analysis projects. One doesn’t need to know the implementation details of CRFs or HMMs before using them.

In addition to the democratization of tools, we are also seeing ML cloud companies constantly wooing users to their platforms. Many of them give you instantaneous access to the software, the algorithms and the hardware architecture you need. Once you adopt a platform, you are likely to stick with it as you grow more sophisticated. Here is a great comparative review of the top 6 ML clouds.

What all this essentially means is that it is perfectly alright if one doesn’t know about the Sequential Minimal Optimization algorithm, developed by John Platt in 1998 to solve the quadratic programming problem that occurs when you train a support vector machine (SVM). It is enough to know that (1) an SVM is a supervised learning model that fits a linear hyperplane to classify high-dimensional data, (2) an SVM can transform nonlinear data into a linearly separable form using kernels, and (3) an SVM allows for errors using the soft margin technique. Knowing these three things allows one to implement an SVM from scratch. But we don’t need to re-do that implementation anymore.
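
As a rough illustration of what using a pre-built implementation can look like, here is a minimal sketch that assumes scikit-learn is installed. The article does not name a specific library, so the library choice, the synthetic dataset and the parameter values are all assumptions made for this example. It touches the three points above: a linear hyperplane, a kernel, and the soft margin, which is set through `C`.

```python
from sklearn.datasets import make_circles
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Synthetic data (an assumption for this sketch): two concentric rings,
# which no straight line in the input space can separate.
X, y = make_circles(n_samples=400, factor=0.3, noise=0.1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# (1) A linear hyperplane fitted directly on the raw features.
linear_svm = SVC(kernel="linear", C=1.0).fit(X_train, y_train)

# (2) The same model with an RBF kernel, which can separate the rings.
# (3) C controls the soft margin: a smaller C tolerates more training errors.
rbf_svm = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X_train, y_train)

linear_acc = linear_svm.score(X_test, y_test)
rbf_acc = rbf_svm.score(X_test, y_test)
print(f"linear kernel accuracy: {linear_acc:.2f}")
print(f"RBF kernel accuracy:    {rbf_acc:.2f}")
```

Nothing in this sketch touches the underlying quadratic programming problem. The library's solver handles it, so the choices left to the practitioner are the kernel and the value of `C`.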

What we really need today is not necessarily ML theoreticians, but people who can apply ML very well in their respective domains. We may not need someone who can derive a new result, say by extending the Vapnik–Chervonenkis dimension. Similarly, re-implementing singular value, eigenvalue or even Cholesky decompositions one more time may not be the most innovative endeavor today. However, we definitely need people who can come up with creative applications that will eventually make ML and AI more mainstream in our lives, almost to the point where ML becomes a part of human culture itself.

A preview of this future can be found in the Google I/O 2016 keynote.

This doesn’t mean that research in ML has stopped. It just means that the time to market for new tools and techniques, from the moment they are published in the academic community to the moment the practicing ML community lays hands on them, has shortened tremendously.

Long live Machine Learning!

source: http://www.datasciencecentral.com/profiles/blogs/machine-learning-is-dead-long-live-machine-learning
