Three data science trends we’ll see more of in 2021

08/06/2021

By: Sid Bhatia, regional director – Middle East, Dataiku

In 2020, data science, machine learning, and AI emerged as critical organizational assets for handling large-scale change with less friction. As we steamroll through 2021, here are some of the data science trends to look out for to ensure your organization is taking a holistic approach (think agile, responsible, and collaborative) to its data initiatives:

MLOps Will Become Even More Critical

In 2021, organizations will take their MLOps foundations and go a step further to implement detailed processes and requirements around drift monitoring using MLOps. Input drift is based on the principle that a model is only going to predict accurately if the data it was trained on is an accurate reflection of the real world. If a comparison of recent requests to a deployed model against the training data shows distinct differences, there is a high likelihood that the model performance is compromised.

In 2020, the significant drift observed was a result of the global health crisis. As a result, the new year is bound to include organizations using MLOps to put more structure in place around drift monitoring so that models can be more agile and accurate. And organizations won’t stop there. Aside from using MLOps for the short-term to address model drift during events during a crisis, teams will also likely look to implement MLOps practices for the long term in an effort to more effectively scale their machine learning efforts.

Teams Will Need to Infuse Agility Amidst a Post-Pandemic Environment

According to Gartner, the theme of resilient delivery “isn’t about ‘bouncing back’ — it’s about having the ability to nimbly adapt or pivot in a dynamic business or IT environment. The theme’s underlying assumption is that volatility exists, so it’s vital to have the skills, capabilities, techniques, operational processes and systems to constantly adapt to changing patterns.”

In 2021, the use of AI for sustained resilience will be underscored, particularly with regard to empowering every team and employee to work with data to improve their business output. These challenges we observed in 2020 will remain in 2021 for teams that don’t have a collaborative data science platform:

Access to systems: Whether accessing the various data sources or the computational capabilities, doing so in a remote setting can be challenging.
Collaboration within teams: Without the physical in-office proximity, individuals can become siloed in the execution of their data projects.
Collaboration across teams: Data projects require buy-in and validation from business teams and also require data engineering and other teams to help with operationalization.
Reuse over time: Capitalizing on past projects is key to maintaining productivity and reducing duplicate work. The lack of in-person discussions can limit this ability.

Organizations Will Go From “What Is Responsible AI?” to “How Can We Implement Responsible AI?”

Up until now, a lot of the conversations around the topic of Responsible AI have been “We haven’t thought about this yet” or “How can we think about it and acknowledge the harms and impacts that AI can have on the world?” Teams might be determining how Responsible AI differs across job functions (data scientist vs. an analyst, for example), agreeing on and establishing a framework for their organization’s ethical rules, and putting checklists into place for Responsible AI across the AI pipeline.

In 2021, we believe we’ll see more organizations put this research and work into practice. There’s no longer a need to convince people that this is the way to go, as they’ve already gotten there. Now, it’s going to be a matter of bringing organizations the expertise to implement the ethical use of AI across their existing and future use cases.

Embracing these AI trends will not only accelerate organizations’ post-COVID recovery, but the adoption of enterprise-wide AI as well.