AI Adoption within the Enterprise 2021 – O’Reilly
In the course of the first weeks of February, we requested recipients of our Information and AI Newsletters to take part in a survey on AI adoption within the enterprise. We had been excited by answering two questions. First, we wished to know how using AI grew previously 12 months. We had been additionally within the follow of AI: how builders work, what strategies and instruments they use, what their issues are, and what improvement practices are in place.
Probably the most putting result’s the sheer variety of respondents. In our 2020 survey, which reached the identical viewers, we had 1,239 responses. This 12 months, we had a complete of 5,154. After eliminating 1,580 respondents who didn’t full the survey, we’re left with 3,574 responses—virtually thrice as many as final 12 months. It’s attainable that pandemic-induced boredom led extra individuals to reply, however we doubt it. Whether or not they’re placing merchandise into manufacturing or simply kicking the tires, extra persons are utilizing AI than ever earlier than.
Govt Abstract
- We had virtually thrice as many responses as final 12 months, with comparable efforts at promotion. Extra persons are working with AI.
- Prior to now, firm tradition has been probably the most vital barrier to AI adoption. Whereas it’s nonetheless a problem, tradition has dropped to fourth place.
- This 12 months, probably the most vital barrier to AI adoption is the dearth of expert individuals and the issue of hiring. That scarcity has been predicted for a number of years; we’re lastly seeing it.
- The second-most vital barrier was the supply of high quality information. That realization is an indication that the sphere is rising up.
- The share of respondents reporting “mature” practices has been roughly the identical for the previous couple of years. That isn’t shocking, given the rise within the variety of respondents: we suspect many organizations are simply starting their AI tasks.
- The retail business sector has the very best proportion of mature practices; schooling has the bottom. However schooling additionally had the very best proportion of respondents who had been “contemplating” AI.
- Comparatively few respondents are utilizing model management for information and fashions. Instruments for versioning information and fashions are nonetheless immature, however they’re crucial for making AI outcomes reproducible and dependable.
Respondents
Of the three,574 respondents who accomplished this 12 months’s survey, 3,099 had been working with AI indirectly: contemplating it, evaluating it, or placing merchandise into manufacturing. Of those respondents, it’s not a shock that the most important quantity are primarily based in america (39%) and that roughly half had been from North America (47%). India had the second-most respondents (7%), whereas Asia (together with India) had 16% of the full. Australia and New Zealand accounted for 3% of the full, giving the Asia-Pacific (APAC) area 19%. A bit over 1 / 4 (26%) of respondents had been from Europe, led by Germany (4%). 7% of the respondents had been from South America, and a couple of% had been from Africa. Aside from Antarctica, there have been no continents with zero respondents, and a complete of 111 nations had been represented. These outcomes that curiosity and use of AI is worldwide and rising.
This 12 months’s outcomes match final 12 months’s information effectively. However it’s equally necessary to note what the information doesn’t say. Solely 0.2% of the respondents stated they had been from China. That clearly doesn’t replicate actuality; China is a pacesetter in AI and doubtless has extra AI builders than some other nation, together with the US. Likewise, 1% of the respondents had been from Russia. Purely as a guess, we suspect that the variety of AI builders in Russia is barely smaller than the quantity within the US. These anomalies say far more about who the survey reached (subscribers to O’Reilly’s newsletters) than they are saying concerning the precise variety of AI builders in Russia and China.
The respondents represented a various vary of industries. Not surprisingly, computer systems, electronics, and expertise topped the charts, with 17% of the respondents. Monetary providers (15%), healthcare (9%), and schooling (8%) are the industries making the next-most vital use of AI. We see comparatively little use of AI within the pharmaceutical and chemical industries (2%), although we anticipate that to alter sharply given the function of AI in creating the COVID-19 vaccine. Likewise, we see few respondents from the automotive business (2%), although we all know that AI is vital to new merchandise comparable to autonomous autos.
3% of the respondents had been from the vitality business, and one other 1% from public utilities (which incorporates a part of the vitality sector). That’s a decent quantity by itself, however we now have to ask: Will AI play a task in rebuilding our frail and outdated vitality infrastructure, as occasions of the previous couple of years—not simply the Texas freeze or the California fires—have demonstrated? We anticipate that it’s going to, although it’s honest to ask whether or not AI programs skilled on normative information will probably be sturdy within the face of “black swan” occasions. What is going to an AI system do when confronted with a uncommon state of affairs, one which isn’t well-represented in its coaching information? That, in spite of everything, is the issue dealing with the builders of autonomous autos. Driving a automotive safely is simple when the opposite visitors and pedestrians all play by the foundations. It’s solely tough when one thing surprising occurs. The identical is true of {the electrical} grid.
We additionally anticipate AI to reshape agriculture (1% of respondents). As with vitality, AI-driven modifications received’t come rapidly. Nonetheless, we’ve seen a gentle stream of AI tasks in agriculture, with targets starting from detecting crop illness to killing moths with small drones.
Lastly, 8% of respondents stated that their business was “Different,” and 14% had been grouped into “All Others.” “All Others” combines 12 industries that the survey listed as attainable responses (together with automotive, pharmaceutical and chemical, and agriculture) however that didn’t have sufficient responses to indicate within the chart. “Different” is the wild card, comprising industries we didn’t record as choices. “Different” seems within the fourth place, simply behind healthcare. Sadly, we don’t know which industries are represented by that class—nevertheless it reveals that the unfold of AI has certainly turn into broad!
Maturity
Roughly one quarter of the respondents described their use of AI as “mature” (26%), that means that that they had revenue-bearing AI merchandise in manufacturing. That is virtually precisely consistent with the outcomes from 2020, the place 25% of the respondents reported that that they had merchandise in manufacturing (“Mature” wasn’t a attainable response within the 2020 survey.)
This 12 months, 35% of our respondents had been “evaluating” AI (trials and proof-of-concept tasks), additionally roughly the identical as final 12 months (33%). 13% of the respondents weren’t making use of AI or contemplating utilizing it; that is down from final 12 months’s quantity (15%), however once more, it’s not considerably totally different.
What can we make of the respondents who’re “contemplating” AI however haven’t but began any tasks (26%)? That’s not an choice final 12 months’s respondents had. We suspect that final 12 months respondents who had been contemplating AI stated they had been both “evaluating” or “not utilizing” it.
Wanting on the issues respondents confronted in AI adoption gives one other method to gauge the general maturity of AI as a discipline. Final 12 months, the foremost bottleneck holding again adoption was firm tradition (22%), adopted by the issue of figuring out applicable use instances (20%). This 12 months, cultural issues are in fourth place (14%) and discovering applicable use instances is in third (17%). That’s a really vital change, significantly for company tradition. Firms have accepted AI to a a lot larger diploma, though discovering applicable issues to resolve nonetheless stays a problem.
The largest issues on this 12 months’s survey are lack of expert individuals and issue in hiring (19%) and information high quality (18%). It’s no shock that the demand for AI experience has exceeded the availability, nevertheless it’s necessary to comprehend that it’s now turn into the largest bar to wider adoption. The largest expertise gaps had been ML modelers and information scientists (52%), understanding enterprise use instances (49%), and information engineering (42%). The necessity for individuals managing and sustaining computing infrastructure was comparatively low (24%), hinting that firms are fixing their infrastructure necessities within the cloud.
It’s gratifying to notice that organizations beginning to notice the significance of knowledge high quality (18%). We’ve recognized about “rubbish in, rubbish out” for a very long time; that goes double for AI. Dangerous information yields dangerous outcomes at scale.
Hyperparameter tuning (2%) wasn’t thought of an issue. It’s on the backside of the record—the place, we hope, it belongs. Which will replicate the success of automated instruments for constructing fashions (AutoML, though as we’ll see later, most respondents aren’t utilizing them). It’s extra regarding that workflow reproducibility (3%) is in second-to-last place. This is smart, on condition that we don’t see heavy utilization of instruments for mannequin and information versioning. We’ll take a look at this later, however having the ability to reproduce experimental outcomes is crucial to any science, and it’s a well known drawback in AI.
Maturity by Continent
When wanting on the geographic distribution of respondents with mature practices, we discovered virtually no distinction between North America (27%), Asia (27%), and Europe (28%). In distinction, in our 2018 report, Asia was behind in mature practices, although it had a markedly greater variety of respondents within the “early adopter” or “exploring” levels. Asia has clearly caught up. There’s no vital distinction between these three continents in our 2021 information.
We discovered a smaller proportion of respondents with mature practices and the next proportion of respondents who had been “contemplating” AI in South America (20%), Oceania (Australia and New Zealand, 18%), and Africa (17%). Don’t underestimate AI’s future affect on any of those continents.
Lastly, the proportion of respondents “evaluating” AI was virtually the identical on every continent, various solely from 31% (South America) to 36% (Oceania).
Maturity by Trade
Whereas AI maturity doesn’t rely strongly on geography, we see a special image if we take a look at maturity by business.
Wanting on the prime eight industries, monetary providers (38%), telecommunications (37%), and retail (40%) had the best proportion of respondents reporting mature practices. And whereas it had by far the best variety of respondents, computer systems, electronics, and expertise was in fourth place, with 35% of respondents reporting mature practices. Schooling (10%) and authorities (16%) had been the laggards. Healthcare and life sciences, at 28%, had been within the center, as had been manufacturing (25%), protection (26%), and media (29%).
Alternatively, if we take a look at industries which are contemplating AI, we discover that schooling is the chief (48%). Respondents working in authorities and manufacturing appear to be considerably additional alongside, with 49% and 47% evaluating AI, that means that they’ve pilot or proof-of-concept tasks in progress.
This will simply be a trick of the numbers: each group provides as much as 100%, so if there are fewer “mature” practices in a single group, the proportion of “evaluating” and “contemplating” practices must be greater. However there’s additionally an actual sign: respondents in these industries could not take into account their practices “mature,” however every of those business sectors had over 100 respondents, and schooling had virtually 250. Manufacturing must automate many processes (from meeting to inspection and extra); authorities has been as challenged as any business by the worldwide pandemic, and has all the time wanted methods to “do extra with much less”; and schooling has been experimenting with expertise for quite a few years now. There’s a actual need to do extra with AI in these fields. It’s value declaring that academic and governmental functions of AI steadily increase moral questions—and probably the most necessary points for the following few years will probably be seeing how these organizations reply to moral issues.
The Follow of AI
Now that we’ve mentioned the place mature practices are discovered, each geographically and by business, let’s see what a mature follow appears like. What do these organizations have in frequent? How are they totally different from organizations which are evaluating or contemplating AI?
Strategies
First, 82% of the respondents are utilizing supervised studying, and 67% are utilizing deep studying. Deep studying is a set of algorithms which are frequent to virtually all AI approaches, so this overlap isn’t shocking. (Contributors might present a number of solutions.) 58% claimed to be utilizing unsupervised studying.
After unsupervised studying, there was a major drop-off. Human-in-the-loop, information graphs, reinforcement studying, simulation, and planning and reasoning all noticed utilization beneath 40%. Surprisingly, pure language processing wasn’t within the image in any respect. (A really small variety of respondents wrote in “pure language processing” as a response, however they had been solely a small proportion of the full.) That is vital and positively value watching over the following few months. In the previous couple of years, there have been many breakthroughs in NLP and NLU (pure language understanding): everybody within the business has examine GPT-3, and lots of distributors are betting closely on utilizing AI to automate customer support name facilities and comparable functions. This survey means that these functions nonetheless haven’t moved into follow.
We requested the same query to respondents who had been contemplating or evaluating using AI (60% of the full). Whereas the odds had been decrease, the applied sciences appeared in the identical order, with only a few variations. This means that respondents who’re nonetheless evaluating AI are experimenting with fewer applied sciences than respondents with mature practices. That means (moderately sufficient) that respondents are selecting to “begin easy” and restrict the strategies that they experiment with.
Information
We additionally requested what sorts of knowledge our “mature” respondents are utilizing. Most (83%) are utilizing structured information (logfiles, time collection information, geospatial information). 71% are utilizing textual content information—that isn’t per the variety of respondents who reported utilizing NLP, until “textual content” is getting used generically to incorporate any information that may be represented as textual content (e.g., type information). 52% of the respondents reported utilizing photos and video. That appears low relative to the quantity of analysis we examine AI and pc imaginative and prescient. Maybe it’s not shocking although: there’s no motive for enterprise use instances to be in sync with tutorial analysis. We’d anticipate most enterprise functions to contain structured information, type information, or textual content information of some variety. Comparatively few respondents (23%) are working with audio, which stays very difficult.
Once more, we requested the same query to respondents who had been evaluating or contemplating AI, and once more, we obtained comparable outcomes, although the proportion of respondents for any given reply was considerably smaller (4–5%).
Threat
After we requested respondents with mature practices what dangers they checked for, 71% stated “surprising outcomes or predictions.” Interpretability, mannequin degradation over time, privateness, and equity additionally ranked excessive (over 50%), although it’s disappointing that solely 52% of the respondents chosen this selection. Safety can also be a priority, at 42%. AI raises necessary new safety points, together with the opportunity of poisoned information sources and reverse engineering fashions to extract personal info.
It’s laborious to interpret these outcomes with out understanding precisely what functions are being developed. Privateness, safety, equity, and security are necessary issues for each utility of AI, nevertheless it’s additionally necessary to comprehend that not all functions are the identical. A farming utility that detects crop illness doesn’t have the identical type of dangers as an utility that’s approving or denying loans. Security is a a lot greater concern for autonomous autos than for personalised purchasing bots. Nonetheless, do we actually imagine that these dangers don’t should be addressed for almost half of all tasks?
Instruments
Respondents with mature practices clearly had their favourite instruments: scikit-learn, TensorFlow, PyTorch, and Keras every scored over 45%, with scikit-learn and TensorFlow the leaders (each with 65%). A second group of instruments, together with Amazon’s SageMaker (25%), Microsoft’s Azure ML Studio (21%), and Google’s Cloud ML Engine (18%), clustered round 20%, together with Spark NLP and spaCy.
When requested which instruments they deliberate to include over the approaching 12 months, roughly half of the respondents answered mannequin monitoring (57%) and mannequin visualization (49%). Fashions turn into stale for a lot of causes, not the least of which is modifications in human habits, modifications for which the mannequin itself could also be accountable. The flexibility to watch a mannequin’s efficiency and detect when it has turn into “stale” will probably be more and more necessary as companies develop extra reliant on AI and in flip demand that AI tasks reveal their worth.
Responses from those that had been evaluating or contemplating AI had been comparable, however with some fascinating variations: scikit-learn moved from first place to 3rd (48%). The second group was led by merchandise from cloud distributors that incorporate AutoML: Microsoft Azure ML Studio (29%), Google Cloud ML Engine (25%), and Amazon SageMaker (23%). These merchandise had been considerably extra standard than they had been amongst “mature” customers. The distinction isn’t enormous, however it’s putting. Susceptible to over-overinterpreting, customers who’re newer to AI are extra inclined to make use of vendor-specific packages, extra inclined to make use of AutoML in certainly one of its incarnations, and considerably extra inclined to go together with Microsoft or Google quite than Amazon. It’s additionally attainable that scikit-learn has much less model recognition amongst those that are comparatively new to AI in comparison with packages from organizations like Google or Fb.
When requested particularly about AutoML merchandise, 51% of “mature” respondents stated they weren’t utilizing AutoML in any respect. 22% use Amazon SageMaker; 16% use Microsoft Azure AutoML; 14% use Google Cloud AutoML; and different instruments had been all below 10%. Amongst customers who’re evaluating or contemplating AI, solely 40% stated they weren’t utilizing AutoML in any respect—and the Google, Microsoft, and Amazon packages had been all however tied (27–28%). AutoML isn’t but a giant a part of the image, nevertheless it seems to be gaining traction amongst customers who’re nonetheless contemplating or experimenting with AI. And it’s attainable that we’ll see elevated use of AutoML instruments amongst mature customers, of whom 45% indicated that they’d be incorporating instruments for automated mannequin search and hyperparameter tuning (in a phrase, AutoML) within the coming but.
Deployment and Monitoring
An AI undertaking means nothing if it may’t be deployed; even tasks which are solely supposed for inside use want some type of deployment. Our survey confirmed that AI deployment remains to be largely unknown territory, dominated by homegrown advert hoc processes. The three most vital instruments for deploying AI all had roughly 20% adoption: MLflow (22%), TensorFlow Prolonged, a.ok.a. TFX (20%), and Kubeflow (18%). Three merchandise from smaller startups—Domino, Seldon, and Cortex—had roughly 4% adoption. However probably the most frequent reply to this query was “not one of the above” (46%). Since this query was solely requested of respondents with “mature” AI practices (i.e., respondents who’ve AI merchandise in manufacturing), we will solely assume that they’ve constructed their very own instruments and pipelines for deployment and monitoring. Given the various kinds that an AI undertaking can take, and that AI deployment remains to be one thing of a darkish artwork, it isn’t shocking that AI builders and operations groups are solely beginning to undertake third-party instruments for deployment.
Versioning
Supply management has lengthy been a typical follow in software program improvement. There are a lot of well-known instruments used to construct supply code repositories.
We’re assured that AI tasks use supply code repositories comparable to Git or GitHub; that’s a typical follow for all software program builders. Nonetheless, AI brings with it a special set of issues. In AI programs, the coaching information is as necessary as, if no more necessary than, the supply code. So is the mannequin constructed from the coaching information: the mannequin displays the coaching information and hyperparameters, along with the supply code itself, and could also be the results of a whole bunch of experiments.
Our survey reveals that AI builders are solely beginning to use instruments for information and mannequin versioning. For information versioning, 35% of the respondents are utilizing homegrown instruments, whereas 46% responded “not one of the above,” which we take to imply they’re utilizing nothing greater than a database. 9% are utilizing DVC, 8% are utilizing instruments from Weights & Biases, and 5% are utilizing Pachyderm.
Instruments for mannequin and experiment monitoring had been used extra steadily, though the outcomes are basically the identical. 29% are utilizing homegrown instruments, whereas 34% stated “not one of the above.” The main instruments had been MLflow (27%) and Kubeflow (18%), with Weights & Biases at 8%.
Respondents who’re contemplating or evaluating AI are even much less possible to make use of information versioning instruments: 59% stated “not one of the above,” whereas solely 26% are utilizing homegrown instruments. Weights & Biases was the most well-liked third-party answer (12%). When requested about mannequin and experiment monitoring, 44% stated “not one of the above,” whereas 21% are utilizing homegrown instruments. It’s fascinating, although, that on this group, MLflow (25%) and Kubeflow (21%) ranked above homegrown instruments.
Though the instruments out there for versioning fashions and information are nonetheless rudimentary, it’s disturbing that so many practices, together with those who have AI merchandise in manufacturing, aren’t utilizing them. You’ll be able to’t reproduce outcomes when you can’t reproduce the information and the fashions that generated the outcomes. We’ve stated {that a} quarter of respondents thought of their AI follow mature—nevertheless it’s unclear what maturity means if it doesn’t embrace reproducibility.
The Backside Line
Prior to now two years, the viewers for AI has grown, nevertheless it hasn’t modified a lot: Roughly the identical proportion of respondents take into account themselves to be a part of a “mature” follow; the identical industries are represented, and at roughly the identical ranges; and the geographical distribution of our respondents has modified little.
We don’t know whether or not to be gratified or discouraged that solely 50% of the respondents listed privateness or ethics as a danger they had been involved about. With out information from prior years, it’s laborious to inform whether or not that is an enchancment or a step backward. However it’s tough to imagine that there are such a lot of AI functions for which privateness, ethics, and safety aren’t vital dangers.
Device utilization didn’t current any large surprises: the sphere is dominated by scikit-learn, TensorFlow, PyTorch, and Keras, although there’s a wholesome ecosystem of open supply, commercially licensed, and cloud native instruments. AutoML has but to make large inroads, however respondents representing much less mature practices appear to be leaning towards automated instruments and are much less possible to make use of scikit-learn.
The variety of respondents who aren’t addressing information or mannequin versioning was an unwelcome shock. These practices must be foundational: central to creating AI merchandise which have verifiable, repeatable outcomes. Whereas we acknowledge that versioning instruments applicable to AI functions are nonetheless of their early levels, the variety of individuals who checked “not one of the above” was revealing—significantly since “the above” included homegrown instruments. You’ll be able to’t have reproducible outcomes when you don’t have reproducible information and fashions. Interval.
Prior to now 12 months, AI within the enterprise has grown; the sheer variety of respondents will inform you that. However has it matured? Many new groups are coming into the sphere, whereas the proportion of respondents who’ve deployed functions has remained roughly fixed. In lots of respects, this means success: 25% of a much bigger quantity is greater than 25% of a smaller quantity. However is utility deployment the suitable metric for maturity? Enterprise AI received’t actually have matured till improvement and operations teams can have interaction in practices like steady deployment, till outcomes are repeatable (no less than in a statistical sense), and till ethics, security, privateness, and safety are main quite than secondary issues. Mature AI? Sure, enterprise AI has been maturing. However it’s time to set the bar for maturity greater.