In April, the number of job openings hit a record 9.3 million, according to data from the Labor Department. With the pandemic fading away, there has been a scramble to hire new employees and this has become a major challenge for companies.
So then can AI (Artificial Intelligence) help out? Well, it definitely can. The irony is that many companies are using the technology—and don’t even realize it! The reason is that AI is built into the top online job sites.
“For example, when you type in a search for a job title, say with the phrase ‘job manager,’ the LinkedIn engine will not only look for the title itself, but also people with relevant skills like time management, team coordination, risk assessment and so on,” said Sakshi Jain, who is the Engineering Manager on LinkedIn’s Responsible AI team. “This means that a recruiter gets more results than just the people who already have the exact title or role.”
One of the key powers of AI is that it can detect complex patterns in huge data sets. In a way, this can simulate the capabilities of a recruiter. And this is definitely important when it comes to finding passive job candidates.
“In a study we recently published, 74% of talent leaders told us they’ve increased outreach to passive candidates in the past year,” said Hari Kolam, who is the CEO of Findem. “AI greatly speeds up the passive recruiting process–one where it can take upward of ten hours to fill a single role. It can index and surface information on people from hundreds of sources as passive candidates typically aren’t on job or career sites, and many people tend to only include piecemeal information on their LinkedIn and other profiles.”
AI can also help with personalization. This can be a good way to create a good first impression with candidates.
“Currently, there are many hiring workflows that are incredibly inefficient, including the scheduling of interviews and follow-up emails for candidates,” said Vivek Ravisankar, who is the CEO of HackerRank. “With the use of automated scheduling and email follow-ups, AI can help free up valuable time and solve the major pain point of extensive back-and-forth coordination with candidates and interviewers.”
Yet AI is not without its risks. After all, there is inherent bias in datasets and this can result in outcomes that are unfair and discriminatory.
“AI-driven HR software that is using years old data on previous hires to determine ideal candidates for job openings is a perfect example of where algorithms can go wrong,” said Ingrid Burton, who is the Chief Marketing Officer at Quantcast. “This is especially true in roles that have historically been dominated by men, such as software engineers, which would risk the hiring algorithm to arbitrarily exclude most women and minorities from advancing during the hiring process.”
To guard against this, there must be good governance as well as explainability of the AI models. There also needs to be people in the loop for critical parts of the process.
“It is never acceptable to set up an AI process and simply leave it to run,” said Ian Cook, who is the Vice President of People Analytics at Visier. “While there is no need to inspect every transaction or process run by the AI, there is a need to constantly review the performance of the AI steps to ensure that the outputs are in line with expectations. Validation, updating and retraining are constant requirements of running any AI process.”
C3.ai, which is a top provider of enterprise AI software and services, is a newly public company. It pulled off its deal in December and issued 15.1 million shares at $42 each. On the first day of trading, the shares spiked 120%.
And this would not be the end of the gains. Within a couple months, the stock price would hit an all-time high of $183.
But as the markets started to cool off, so did the shares of C3.ai. Consider that the stock price is now at $64.
Despite this, the future does look bright for the company. “The total addressable market is huge,” said Tom Siebel, who is the CEO and founder of C3.ai. “It’s a third of a trillion dollars.”
Keep in mind that Siebel is a veteran of the enterprise software world. In the early 1980s, he worked as an executive at Oracle and helped make the company the dominant player in relational databases. Then in 1993, he started Siebel Systems and pioneered the CRM (Customer Relationship Management) category.
As for C3.ai, he launched this company in 2009. Siebel was early in recognizing that AI would be a megatrend.
But he also crafted a solid approach to building the platform. “We were novel in using a model-driven architecture to enable organizations to rapidly design, develop, provision and operate enterprise AI applications at scale,” said Siebel. “We spent about a billion dollars inventing this in the last decade and it is our secret sauce.”
This was in contrast to using traditional techniques, such as with structured programming, that involve a mishmash of open source and proprietary solutions. However, this usually means too much complexity to effectively scale.
Even some of the world’s top companies have suffered major blunders and failures with AI. IBM’s Watson, for example, has fallen well short of expectations. Then there is GE, which has spent billions on AI and has seen little return.
The Future Of Enterprise AI
The C3.ai platform can handle applications for global enterprises as well as small businesses. And to get a sense of its power, it currently manages over 4.8 million concurrent production AI models and processes more than 1.5 billion AI predictions per day.
Now, another key factor for the success of C3.ai is that the company takes a partnership approach with customers. This is essential for AI since it is important to leverage vertical-specific data and insights.
A case study for this is Shell. “The company is reinventing itself as the fifth largest in the world,” said Siebel. “They want to get to a zero net carbon footprint by 2050, which is no mean trick, right? This is about applying AI to the entire value chain, about delivering cleaner, safer, lower cost, more reliable energy. This is maybe a $4 billion a year economic benefit.”
Granted, the temptation for companies is to build their own systems. But this is really the wrong approach. “Believe it or not, people tried to build their own relational database systems during the 1980s and I don’t think anybody succeeded,” said Siebel. “We are seeing it again with AI. Companies will try it once, twice, three times. Then they’ll wind up firing the CIO and buy the technology from a professional.”
Ultimately, Siebel thinks that AI will be similar to CRM or ERP. In other words, it will be a technology that’s a necessity for a large number of businesses. “Companies that do not adopt AI will no longer exist,” said Siebel.
Even though AI continues to thrive and grow, there remain challenges to use the technology. Just some include finding data scientists, determining the right problems to focus on, getting quality data and scaling the models.
No doubt, these problems are even worse for non-tech companies. They generally do not have the expertise or sufficient resources to make AI a success.
“Research shows non-tech companies in particular have struggled to take their AI programs beyond the proof of concept and pilot phases–with just 21% of retail, 17% of automotive, 6% of manufacturing, and 3% of energy companies successfully scaling their AI use cases,” said Jerry Kurtz, who is the Executive Vice President of Insights and Data at Capgemini North America.
But despite all this, there are still a myriad of companies that are beating the odds. And they are becoming much more competitive. “There are many opportunities for non-tech companies to leverage AI to improve efficiency and provide a better customer and employee experience,” said Margaret Lee, who is the Senior Vice President and General Manager of Digital Service Operations Management at BMC.
So what are some of the non-tech companies that have been able to move-the-needle with their AI efforts? Here’s a look at two and the lessons learned.
John Deere: Keep in mind that the company has a long history of innovation, going back to the invention of the steel plow in 1837. The result is that John Deere is a world leader, with a market capitalization of $120 billion and annual sales of over $35 billion.
The company’s AI efforts began with machine vision because of its advantages with GPS connectivity in its equipment and rich datasets. For example, during the spring, the peak data ingestion was 425MB per second or about 50 million sensor measurements per second.
“Our projects have resulted in products in the market, or soon to be in the market, that reduce chemical inputs but sensing weeds from non-weeds and selectively apply chemical to the weeds only,” said Jahmy Hindman, who is the Chief Technology Officer of John Deere. “In addition, this work has led to vision-based automated control systems in combine harvesters that optimize the processing settings of the ‘factories on wheels’ to minimize the grain lost during harvest.”
A key lesson for the company has been the importance of keeping customer needs and wants in mind. “We work tirelessly to help our customers advance their business and feed the world,” said Hindman.
Levi Strauss: An early project for this company came in response to the Covid-19 lockdowns in Europe. Levi Stratus looked for ways to manage the inventory pile-up. To this end, the company gathered unique datasets on dynamic price elasticities and then applied AI to it.
It was a small project but was able to scale quickly. What started as a test in 11 stores in Germany in May 2020 grew to 17 countries in Europe by October. The system was also used for the 11/11 Singles’ Day in China.
“I have three pieces of advice from this experience,” said Louis DiCesari, who is the Global Head of Data, analytics, and AI at Levi Strauss. “First, choose real, commercial problems that are aligned to your company’s strategic priorities, and chunk them into actionable steps. Second, don’t get too hung up on having perfect data or technology, or using the latest algorithms. Instead, embrace agile, deliver minimum viable products, continuously measure the impact, and continue to iterate and add new features. And, of course, set a vision, communicate the progress toward that vision throughout the organization, and invite feedback.”
DiCesari attributes AI to the company’s ability to accelerate innovation and move faster than ever before. “In 2021, we aim to deliver more value and support all countries and functions of the enterprise, infuse data and AI throughout the business, enable new ways of working and continue streamlining processes, while also digitizing assets,” he said.
The edge is an end point where data is generated through some type of interface, device or sensor. Keep in mind that the technology is nothing new. But in light of the rapid innovations in a myriad of categories, the edge has become a major growth business.
“The edge brings the intelligence as close as possible to the data source and the point of action,” said Teresa Tung, who is the Managing Director at Accenture Labs. “This is important because while centralized cloud computing makes it easier and cheaper to process data at scale, there are times when it doesn’t make sense to send data off to the cloud for processing.”
This is definitely critical for AI. The fact is that consumers and businesses want super-fast performance with their applications.
“Currently AI training produces vast volumes of data that are almost exclusively implemented and stored in the cloud,” said Flavio Bonomi, who is the board advisor to Lynx Software. “But by placing compute at the edge, this allows for looking at patterns locally. We believe this can evolve the training models to become simpler and more effective.”
The edge may even allow for improved privacy with AI models. “Having federated learning means that no end-user data is centralized or communicated between nodes,” said Sean Leach, who is the Chief Product Architect at Fastly.
What Can Be Done At The Edge
The most notable use case for the edge and AI is the self-driving car. The complexities are mind boggling, which is why the development of this technology has taken so long.
But of course, there are many other use cases that span a myriad of industries. Just look at manufacturing. “In monitoring manufacturing processes where seconds or minutes could mean millions of dollars in losses, for example, machine learning models embedded in sensors and devices where the data is being collected enables operators to preemptively mitigate serious production issues and optimize performance,” said Santiago Giraldo, who is the Senior Product Marketing Manager of Machine Learning at Cloudera.
Here are some other examples:
Chris Bergey, the Senior Vice President and General Manager of Infrastructure Line of Business at Arm: “AI and the edge can explore the impacts of urbanization and climate change with software-defined sensor networks, pinpoint the origins of power outages in smart grids with data provenance, or enhance public safety initiatives through data streaming.”
Adam Burns, the Vice President of IoT and the Director of Edge Inference Products at Intel: “CORaiL, which was a project with Accenture and the Sulubaaï Environmental Foundation, can analyze coral reef resiliency using smart cameras and video analytics powered by Intel Movidius VPUs, Intel FPGAs and CPUs, and the OpenVINO toolkit.”
Jason Shepherd, the Vice President of Ecosystems at ZEDEDA: “TinyML will enable AI in more appliances, connected products, healthcare wearables, etc., for fixed functions triggered locally by simple voice and gesture commands, common sounds (a baby crying, water running, a gunshot), location and orientation, environmental conditions, vital signs, and so on.”
Michael Berthold, the CEO and cofounder at KNIME: “In the future, we will also see models that update themselves and potentially recruit new data points on purpose for retraining.”
Ari Weil, who is the Global Vice President of Product and Industry Marketing at Akamai: “Consider medical devices like pacemakers or heart rate monitors in hospitals. If they signal distress or some condition that requires immediate attention, AI processing on or near the device will mean the difference between life and death.”
But successfully bringing AI to the edge will face challenges and likely take years to get to critical mass. “The edge has relatively lower resource capabilities in comparison to data centers, and edge deployments will require lightweight solutions focused on security and supporting low latency applications,” said Brons Larson, who is a PhD and the AI Strategy Lead at Dell Technologies.
There will also need to be heavy investments in infrastructure and the retooling of existing technologies. “For NetApp, this is a large opportunity but one that we have to re-invent our storage to support,” said Ross Ackerman, who is the Head Of Customer Experience and Active IQ Data Science at NetApp. “A lot of the typical ONTAP value prop is lost at the edge because clones and snapshots have less value. The data at the edge is mostly ephemeral, needing only a short time to be used in making a recommendation.”
Then there are the cybersecurity risks. In fact, they could become more dangerous then typical threats because of the impact on the physical world.
“As the edge is being used with applications and workflows, there is not always consistent security in place to provide centralized visibility,” said Derek Manky, who is the Chief of Security Insights and Global Threat Alliances at Fortinet’s FortiGuard Labs. “Centralized visibility and unified controls are sometimes being sacrificed in favor of performance and agility.”
Given the issues with the edge and AI, there needs to be a focus on building quality systems but also rethinking conventional approaches. Here are some recommendations:
Prasad Alluri, the Vice President of Corporate Strategy at Micron: “The increase in AI also means that its increasingly important that edge computing is near 5G base stations. So soon, in every base station, every tower might have compute and storage nodes in it.”
Debu Chatterjee, the Senior Director of AI Platform Engineering at ServiceNow: “There will need to be newer chips with tensor capabilities seen in GPUs or their alternative, or specialized with specific inference models burnt into FPGAs. A hardware/software combo will be required to provide a zero-trust security model at the edge.”
Abhinav Joshi, the Global Product Marketing Leader at OpenShift Kubernetes Platform at Red Hat: “Many of these challenges can be successfully addressed at the start by approaching the project with a focus on an end-to-end solution architecture built on the foundation of containers, Kubernetes, and DevOps best practices.”
Although, when it comes to AI and the edge, the best strategy is probably to start with the low-hanging fruit. This should help avoid failed projects.
“Enterprises should begin by applying AI to smaller, non-mission critical applications,” said Bob Friday, who is the Chief Technology Officer at Mist Systems, which is a Juniper Networks company. “By paying close attention to details such as finding the right edge location and operational cloud stack, it can make operations easier to manage.”
But regardless of the approach, the future does look promising for the edge. And AI efforts really need to consider the potential use cases to get its full value.
With the equity markets surging and interest rates at historically low levels, the environment is ideal for IPOs. But for Snowflake, which has recently filed for an offering, it would likely do well in just about any market environment. This tech startup is growing like a weed and the market opportunity is enormous.
Founded in 2012, Snowflake pioneered the category for cloud-native data warehouses. The founders actually spent two years developing the software.
And yes, the timing proved to be spot-on. The market was ripe for disruption as traditional data warehouses have a myriad of disadvantages. Just some include: the inability to handle unstructured data and huge workloads, high costs, complex interfaces, problems with consistency and integrity of data, and issues with data sharing.
Note that the founders—Thierry Cruanes, Benoit Dageville, and Marcin Zukowski—were veterans of the traditional data warehouse market. They had worked at companies like Oracle, IBM and Google (by the way, the name “Snowflake” was chosen because the founders like to ski!) In other words, the founders had a strong understanding of the weaknesses of legacy systems—but also had the creativity to build a much better alternative.
“My company uses Snowflake and also competes with it in some cases,” said Sam Underwood, who is the VP of Business Strategy with Futurety. “Snowflake is growing rapidly, and justifiably so, because it’s filling a gaping hole in the market—namely, a huge need to unify data sources to form a single source of truth across an organization. There are many, many tools that already do this—Google BigQuery among others—however, Snowflake has combined the technical effectiveness with the UI simplicity to really excel among both technical users and high-level decision makers who may not want or need as much granular detail.”
So how fast is Snowflake growing? During the first six months of this year, revenues spiked from $104 million to $242 million on a year-over-year basis. While there continues to be significant net losses, the company has still been able to greatly improve gross margins.
Consider that a key technology decision for Snowflake was to separate compute from storage. “This offers great performance to customers without the high cost, so they get the best of both worlds,” said Venkat Venkataramani, who is the co-founder and CEO of Rockset. “This was phenomenally compelling at the time and years ahead of even the likes of Amazon with Redshift and Google.”
But of course, Snowflake is more than just about whiz-bang technology. The company has also assembled an experienced executive team, led by CEO Frank Slootman. Prior to joining, he was at the helm of ServiceNow, which he took from $100 million in revenues to $1.4 billion. The current market cap of the company is $93 billion.
True, Snowflake does have customer concentration, with Capital One accounting for roughly 11% of overall revenues. But then again, this does show the strategic importance of the technology.
“This IPO underscores a significant change in thinking about the increasing importance of the database market,” said Raj Verma, who is the Co-CEO of MemSQL. “Data has never been more important than it is right now. In the last 25 years, only one company in this sector other than Snowflake went public. And I’m sure we’ll see a couple more companies go out in the new few years as well. There was an iron grip on the database market for more than two decades, with IBM, Oracle and SAP HANA. Now we are seeing a changing of the guard, which gives customers the option of deciding what is best for their business. I can tell you that the technology of yesterday will not solve the data challenges of tomorrow, and this IPO brings newer technology solutions to the forefront.”
While quantum computing is still in the early phases, there have already been many innovations and breakthroughs. Companies like IBM, Microsoft, Google and Honeywell have been investing aggressively in the technology.
So then what is quantum computing? Well, it is similar to traditional computing, which relies on bits—that is, the 0’s and 1’s to encode information. But quantum computing as its own version of this: the quantum bit or qubit. This is where the information can have multiple states at the same time. And the reason for this is the impact of the effects of quantum mechanics, like superposition and entanglement. Yes, this is all about the spooky world of Schrodinger’s cat, which is both alive and dead at the same time!
“Quantum computing is a new kind of computing, using the same physical rules that atoms follow in order to manipulate information,” said Dr. Jay Gambetta, who is an IBM Fellow and vice president of IBM Quantum. “At this fundamental level, quantum computers execute quantum circuits—like a computer’s logical circuits, but now using the physical phenomena of superposition, entanglement, and interference to implement mathematical calculations out of the reach of even our most advanced supercomputers.”
One of the fertile areas for quantum computing is AI (Artificial Intelligence), which relies on processing huge amounts of complex datasets. There is also a need to evolve algorithms to allow for better learning, reasoning and understanding.
Then what are some of the things we may see with quantum computing and AI? Let’s take a look:
Christopher Savoie, who is the CEO and founder of Zapata Computing:
Generative models are those models that don’t just limit themselves to answering a question, but that actually generate output such as an image, music, video, etc. As an example, imagine you have a lot of pictures of the side of a face, but not a lot of pictures of the front of a face. If you want security detection capabilities to be able to recognize dual facial recognition on the front side of a face, you can actually use these generative models very accurately to create more samples of frontal views of a face. Inserting quantum processing units into the classical framework has the potential to boost the quality of the images generated. And how does this help us with classical machine learning? Well, traditional machine learning algorithms are as good as the data you feed them. If you try to train a classical face detection model with a small dataset of faces, this model won’t be very good. However, you can use quantum-enhanced generative models to enlarge this dataset with more images (both in terms of quantity and variety), which can significantly improve the detection model. This isn’t limited to generating faces, you can also generate fake molecules, cancer cells, or MRI scans, which are very similar to the real thing. This allows us to train better machine learning models, which can then apply to real data and real-world problems.
For the first time, a Natural Language Processing (NLP) algorithm is “meaning aware” and has been executed on a quantum computer. When we refer to meaning aware we mean that computers can actually understand whole sentences and not just individual words and that the awareness can be expanded to whole phrases and ultimately real time speech without requiring stochastic guesswork that is the state of the art today and which is computationally so expensive. Full scale implementation is dependent on quantum computers becoming much larger than is currently the case. This development of research in NLP is a prime example of the fact that as realistic quantum computers become available, more use cases will also become apparent. Of course, this has been proven to be the case in the past 30 years on classical computers as a precedent.
Dr. Itamar Sivan, who is the CEO and co-founder of Quantum Machines:
Roughly speaking, AI and ML are good ways to ask a computer to provide an answer to a problem based on some past experience. It might be challenging to tell a computer what a cat is, for instance. Still, if you show a neural network enough images of cats and tell it they are cats, then the computer will be able to correctly identify other cats that it did not see before. It appears that some of the most prominent and widely used AI and ML algorithms can be sped-up significantly if run on quantum computers. For some algorithms we are even anticipate exponential speed-ups, which clearly does not mean performing a task faster, but rather turning a previously impossible task and making it possible, or even easy. While the potential is undoubtedly immense, this still remains to be proven and realized with hardware.
One of the areas being looked at currently is in the area of artificial intelligence within financial trading. Quantum physics is probabilistic, meaning the outcomes constitute a predicted distribution. In certain classes of problems, where outcomes are governed by unintuitive and surprising relationships among the different input factors, quantum computers have the potential to better predict that distribution thereby leading to a more correct answer. Dr. Hayes states: “The basic idea is that there are problems that require an AI to generate new data that it hasn’t seen before in order to make a decision. Solving this problem may require coming up with an underlying model for the probability distribution in question that it could use in new situations.”
Daniel Newman, who is the Principal Analyst and Founding Partner at Futurum Research:
As it pertains to AI/ML, I think what I’m most encouraged by is the potential for classical and quantum to work together leveraging the elastic nature of the cloud and the powerful, specific problem-solving capabilities of quantum computing. I get the sense that a lot of people are looking at quantum versus classical computing, but in reality, it will be the two working together harmoniously to solve challenging and complex problems. Both have strengths and the development now is for quantum computing to function as part of the solution. Over time, both computing formats will continue to advance, but the ability to accelerate workloads on traditional GPUs and ASICs while also leveraging the power of quantum computing is a recipe for faster, more robust results, which is what the market should be eager to see as quantum computing becomes more widely accessible.
For me, I see a few applications for quantum computing in the immediate future that will gain popularity, but of course there will be many more. Financial Services and Healthcare are immediate applications where Quantum Computing can take advantage of speed and specificity to help tackle complexities. Fraud detection and drug compound identifications have been touted as some of the most exciting use cases. Given the current state of cybercrime and the attention to healthcare in the wake of the pandemic, this makes a lot of sense.
According to research from IDC, the global spending on AI (Artificial Intelligence) is expected to hit $97.9 billion by 2023, up from $37.5 billion in 2019. This represents a compound annual growth rate of 28.4%.
No doubt, this is an enormous opportunity for startups. “These days, almost every company needs to leverage AI in order to thrive and build a meaningful future,” said Saar Yoskovitz, who is the CEO of Augury. “This is true for younger startups, and it is true for the largest companies, even in the most traditional and nascent industries like manufacturing and insurance. In a sense, AI has become another layer in the tech stack, like databases, and not a business model.”
All this is true. But then again, finding the right product-market fit is extremely challenging. Consider that AI models generally require a lot of customization. In fact, there is often much variation even for companies in the same industry.
And yes, there must be a high-quality dataset. But of course, this is far from easy for a startup to develop.
“Data creates an interesting chicken-and-egg problem,” said Yoskovitz. “Without a customer, you don’t have data, which means you cannot train your algorithms. Without algorithms, you are not able to provide value to your customers and compete in the market. Therefore you will not have customers to provide data.”
What to do? You can look at forming partnerships, such as with companies that may not have strong AI capabilities. Or another approach is to create a free app that collects data.
“What we have found is that to build a successful AI startup, the key is sourcing and building proprietary data,” said Saniya, who is the CEO and co-founder of Pilota. “This is what makes your business defensible and attractive to investors. Almost every investor has asked us where our data comes from and what would stop others who can access this data from replicating what we do. So when creating a business centered around AI, it is extremely important to make sure that your data is proprietary and you are not just building a business on analyzing public datasets.”
But even with a good dataset, this is still not enough. For example, does the AI model make a real difference? Or are the results mostly minor improvements?
“Investors are interested in startups that are building tailored AI solutions for previously unsolvable problems,” said Jay Srinivasan, who is the CEO and co-founder of atSpoke. “So focus on areas where there are many inefficiencies and repetitive human processes, such as call centers and back-office paperwork processing. Investors want successful AI solutions that address specific workflows and problems, such as a legal document review.”
Then there is the issue of adoption. Let’s face it, customers are still leery of the powers of AI. It does not help that the models are often complex and opaque. There are also the nagging problems with bias.
“Know your customer,” said Vasiliy Buharin, who is the Associate Director of Product Innovation at Activ Surgical. “You may have the algorithm to solve the worst LA traffic or shorten airplane boarding time 10-fold. But if the solution requires people to behave like preprogrammed automatons, it will never be adopted, and your company will fail. Your customer has a certain way of doing things. Your product must fit this workflow.”
Given all the challenges, a startup will likely undergo multiple pivots. This is why you need a top-notch team that works well together.
“If you start with a problem and leave yourself open to many different solutions, you will learn from the market and find the best solution to build your business around,” said Sean Byrnes, who is the CEO of Outlier. “Spending six months exploring and selecting the right problem can save you six years of wasted effort trying to build a business around a flawed idea.”
MLOps is a relatively new concept in the AI (Artificial Intelligence) world and stands for “machine learning operations.” Its about how to best manage data scientists and operations people to allow for the effective development, deployment and monitoring of models.
“MLOps is the natural progression of DevOps in the context of AI,” said Samir Tout, who is a Professor of Cybersecurity at the Eastern Michigan University’s School of Information Security & Applied Computing (SISAC). “While it leverages DevOps’ focus on security, compliance, and management of IT resources, MLOps’ real emphasis is on the consistent and smooth development of models and their scalability.”
The origins of MLOps goes back to 2015 from a paper entitled “Hidden Technical Debt in Machine Learning Systems.” And since then, the growth has been particularly strong. Consider that the market for MLOps solutions is expected to reach $4 billion by 2025.
“Putting ML models in production, operating models, and scaling use cases has been challenging for companies due to technology sprawl and siloing,” said Santiago Giraldo, who is the Senior Product Marketing Manager and Data Engineer at Cloudera. “In fact, 87% of projects don’t get past the experiment phase and therefore, never make it into production.”
Then how can MLOps help? Well, the handling of data is a big part of it.
“Some key best practices are having a reproducible pipeline for data preparation and training, having a centralized experiment tracking system with well-defined metrics, and implementing a model management solution that makes it easy to compare alternative models across various metrics and roll back to an old model if there is a problem in production,” said Matei Zaharia, who is the chief technologist at Databricks. “These tools make it easy for ML teams to understand the performance of new models and catch and repair errors in production.”
Something else to consider is that AI models are subject to change. This has certainly been apparent with the COVID-19 pandemic. The result is that many AI models have essentially gone haywire because of the lack of relevant datasets.
“People often think a given model can be deployed and continue operating forever, but this is not accurate,” said Randy LeBlanc, who is the VP of Customer Success at RapidMiner. “Like a machine, models must be continuously monitored and maintained over time to see how they’re performing and shifting with new data–ensuring that they’re delivering real, ongoing business impact. MLOps also allows for faster intervention when models degrade, meaning greater data security and accuracy, and allows businesses to develop and deploy models at a faster rate. For example, if you discovered an algorithm that will save you a million dollars per month, every month this model isn’t in production or deployment costs you $1 million.”
MLOps also requires rigorous tracking that is based on tangible metrics. If not, a project can easily go off the rails. “When monitoring models, you want to have standard performance KPIs as well as those that are specific to the business problem,” said Sarah Gates, who is an Analytics Strategist at SAS. “This should be through a central location regardless of where the model is deployed or what language it was written in. That tracking should be automated–so you immediately know and are alerted—when performance degrades. Performance monitoring should be multifaceted, so you are looking at your models from different perspectives.”
While MLOps tools can be a huge help, there still needs to be discipline within the organization. Success is more than just about technology.
“Monitoring/testing of models requires a clear understanding of the data biases,” said Michael Berthold, who is the CEO and co-founder of KNIME. “Scientific research on event, model change, and drift detection has most of the answers, but they are generally ignored in real life. You need to test on independent data, use challenger models and have frequent recalibration. Most data science toolboxes today totally ignore this aspect and have a very limited view on ‘end-to-end’ data science.”
Python is one of the world’s most popular computer languages, with over 8 million developers (this is according to research from SlashData). The creator of Python is Guido van Rossum, a computer scientist and academic. Back in the late 1980s, he saw an opportunity to create a better language and also realized that the open source model would be ideal for bolstering innovation and adoption (by the way, the name for the language came from his favorite comedy, the Monty Python’s Flying Circus).
“Python is a high-level programming language, easy for beginners and advanced users to get started with,” said Jory Schwach, who is the CEO of Andium.com. “It’s forgiving in its usage, allowing coders to skip learning the nuances that are necessary in other, more structured languages like Java. Python was designed to be opinionated about how software should be created, so there’s often just a single appropriate way to write a piece of code, leaving developers with fewer design decisions to deliberate over.”
“Python has become the most popular language of choice for learning programming in school and university,” said Ben Finkel, who is a CBT Nuggets Trainer. “This is true not just in computer science departments, but also in other areas as programming has become more prevalent. Statistics, economics, physics, even traditionally non-technical fields such as sociology have all started introducing programming and data analysis into their curriculum.”
No doubt, a major catalyst for the growth of the language has been AI (Artificial Intelligence) and ML (Machine Learning), which rely on handling huge amounts of data and the use of sophisticated algorithms.
“Because Python is easy to use and fast to iterate with, it was picked up early on by academics doing research in the ML/AI field,” said Mark Story, who is a principal developer at Sentry. “As a result, many libraries were created to build workflows in Python, including projects like TensorFlow and OpenAI.”
Although, Python has proven to be effective for a myriad of other areas, such as building websites and creating scripts for DevOps. Yet it is with AI/ML where the language has really shined.
“Analytics libraries such as NumPy, Pandas, SciPy, and several others have created an efficient way to build and test data models for use in analytics,” said Matt Ratliff, who is a Senior Data Science Mentor at NextUp Solutions. “In previous years, data scientists were confined to using proprietary platforms and C, and custom-building machine learning algorithms. But with Python libraries, data solutions can be built much faster and with more reliability. SciKit-Learn, for example, has built-in algorithms for classification, regression, clustering, and support for dimensionality reduction. Using Jupyter Notebooks, data scientists can add snippets of Python code to display calculations and visualizations, which can then be shared among colleagues and industry professionals.”
Granted, Python is certainly not perfect. No language is.
“Due to its interpreted nature, Python does not have the most efficient runtime performance,” said Story. “A Python program will consume more memory than a similar program built in a compiled language like C++ would. Python is not well suited for mobile, or desktop application development as well.”
But despite all this, there are many more pros than cons–and Python is likely going to continue to grow.
“Python is an excellent choice for most people to learn the basics of code, in the same way that everyone learns how to read and write,” said Tom Hatch, who is the CTO of SaltStack. “But the real beauty of Python is that it is also a language that can scale to large and complex software projects.”