Just like any truly transformative tech, AI takes long-term planning, tons of competence and picking the parts that make most sense for your current business goals. So obviously AI fails for most organizations because they lack all of these things and the fact that most never operate in scale or tackle problems difficult enough to need AI.
Maybe one solution is finding the AI equivalent of Microsoft Office, ie a group of cheap tools that are so powerful, flexible and integrated that they can be used by individual employees and teams across all industries and business needs.
I agree that AI is very difficult and requires hard work by competent operators to be successful but I think there’s a deeper problem - most business data is irrelevant noise. It won’t matter how talented your DS team is if there is no signal in your data relevant to business operations for the ML model to identify.
Completely agree. Most business data is noise and most of the signals are already discovered as simple rules and heuristics. On the other hand, if you have a strong signal in your data, even a simple algorithm like linear/logistic regression will be able to help. What I’ll call “signal hunting” is probably the best use of DS resources and also the hardest thing to do.
I’ve done my share of experiments with ML/AI and where I’ve seen the most interesting value has been NLP applications (such as categorizing customer comments or assigning categories to products based in description) and finding “factors that influence behavior x” which then can be turned into either a model or a few simple rules.
I'm convinced this is one of the main sources of inefficiency in modern management, whether using AI or not. Managers are incentivized to be action prone so as to demonstrate their value continuously, and they are also expected to put in processes. Often these processes are on way too tight of a cadence and what ends up happening is that managers spend their entire time evaluating and chasing noise.
Azure ML Studio is a good tool for your non-data-science developer to play around with ML tools. I picked it up at my company and it was fairly easy to make models and predictions with.
Of course, once you actually start to get good at it you want to switch back to using code, but it's a good way to start.
How do we know that AI is truly transformative tech? The null hypothesis is that most businesses could achieve equivalent ROI by using older, simpler data analysis techniques like linear regression.
Because most large tech companies are driven by it right now and you likely interact with a system influenced by ML every time you use your phone, email, maps, translation, social media etc etc. The catch it's only transformative if you're dealing with large enough problems. You're right that more tiny companies should probably stick to classic analysis techniques.
I would argue that linear regression is under the umbrella of ML. And probably the most important ML technique for existing businesses. A lot of the "low hanging fruit" for existing business cases is not necessarily discovering how to use regression analysis on existing data (which is probably already happening at a lot of places) but how to operationalize and continually update linear regression models so you can make them a mission-critical part of the infrastructure.
Linear regression has been around since before computers. It's a real stretch to put that under the umbrella of AI. Nor is AI required to continually update linear regression models; a "for" loop will suffice.
That loop has to run over high quality, relevant data. The regression model needs to be retrained if the production data drifts from the original training dataset. The output of the regression model needs to be wired into a downstream application or decision support tool. That downstream application needs to know how to deal with error bars. A data scientist / statistician needs to re-architect the whole thing if the data or business requirement changes substantially.
As usual, the hardest problems are outside of the code.
I agree with you here Data and measurement is the single most important part of the process.
From my experience working in an industrial plant which has been involved in several machine learning trials a lot of the time there are attempts made to use complex modeling techniques to make up for a lack of measurements.
Something I question is whether the outcome would have been better if the money which was invested into hiring AI consultants was spent on better plant instrumentation.
Industrial Instruments are not cheap something like a PGNAA analyser (https://en.wikipedia.org/wiki/Prompt_gamma_neutron_activatio...) is an expensive capital purchase and I suspect some people have unrealistic expectations that AI and machine learning can replace things like this.
I think there is some middle ground where AI complements better sensors (maybe instrument companies should be pushing this). I've yet to see any of the data experts push back and say something like "actually you need to measure this better first before we can model it."
Most ML algorithms boil down to linear regression with some loss function. Even deep learning networks are essentially just linear regressions stacked on top of each other, with lower layers being trained on features predicted by higher layers.
I think if neural networks or SVMs are AI, then linear regression is as well. Neural Turing machines and other recent developments I think are closer to the layperson's idea of "AI," though.
>Most ML algorithms boil down to linear regression
I think (particularly with DL) it would probably be more accurate to claim it boils down to nonlinear logistic regression rather than linear regression. To your point, both are relatively old techniques
Nonlinear regression is still linear regression with transformed features. Kernel regression, for example, just uses features generated from the data using a supplied kernel function/covariance function. DL just allows the features to be trained.
You could replace "AI" there with any other newly developed tech paradigms, like "Big Data" several years ago. Your null hypothesis is probably correct for most companies, but there will always be some companies for whom the new tech is beneficial. Most companies do not need big data, and are fine running a single postgres instance for the entirety of their lifetime, just like most companies probably don't need AI.
This is true, but I think the real benefit of the “Big Data” paradigm (scare quotes included) was the spreading the idea that you should actually measure things, and then make a decision. You’d think this was obvious, but apparently it wasn’t. (Think back to the book Lean Startup, whose premise is, “A/B test hypotheses.”) Similarly, the “AI Revolution”, could be boiled down to, “run a regression.”
First - there is much more data now than 5 or 10 years ago; it is generated by every process and is easy to store.
Second - there is a greater art and capability to aggregate and manipulate data. It's simply faster, but also there is a lot of supporting technology in the form of workflows and tooling.
Third - there are more algorithms now; these are often derived from AI research (DNN, RNN, Bayesian things..)
The first two definitely mean that linear regression can generate much more value than 10 years ago.
The third one is a product of the frustration with linear regression and many other "traditional" algorithms. In many domains (speech, images, text processing) the community smashed its head on the wall for 30 years before the computational resources and algorithmic tricks that came out in 2010->now came on stream. You just can't do much with TFIDF or similar with text - I tried very very hard; on the other hand using a transformer is like bloody magic.
Doesn't the first part of your comment contradict the second? If it does require a lot of long-term planning and competence, maybe the last thing you want to do is to distribute it to indivudal employees.
I suspect that it's hierarchical in nature, in the end. That is, you have the aspects of "using AI" that are big, complex, enterprise-wide and do require lots of long-term planning and a strategic focus. But there are also the "low hanging fruit" where you can utilize the "MS Office of AI" to achieve smaller, more tactical goals.
I feel like I use a ton of tools everyday that would have been considered "AI" 10 years ago, like content-aware fill in Photoshop, translation and correcting software, etc.
Maybe one solution is finding the AI equivalent of Microsoft Office, ie a group of cheap tools that are so powerful, flexible and integrated that they can be used by individual employees and teams across all industries and business needs.