As we reflect on the insights shared during the ‘Snowday’ event on November 1st and 2nd, a cascade of exciting revelations about the future of Snowflake awaits us.

This article dives into the noteworthy data science features unveiled during the event, shedding light on the innovation and advancements that took center stage.

Join us on this journey as we unravel the highlights from Snowdays, with a specific focus on the latest in data science features, such as:

Snowflake Cortex Highlights

An overview of GenAI and LLM capabilities in Snowflake

Language Model (LLM) capabilities (some models are public while others are still in private preview)

LLMs are a new type of AI model that can generate text, translate languages, write different kinds of creative content, and answer your questions in an informative way. Snowflake’s new LLM capabilities make it one of the first cloud data platforms to offer users access to this powerful new technology.

This differentiates Snowflake from other cloud data platforms because it gives Snowflake users a significant advantage in terms of developing AI applications. With Snowflake Cortex, users can now easily build AI applications that can perform a wide range of tasks, such as generating reports, identifying fraud, and predicting customer behavior.

Here are some specific use cases for Snowflake’s LLM capabilities:

  • Sentiment Detection: Analyze customer feedbacks, social media interactions, or product reviews to understand overall sentiment and identify areas for improvement. (Further elaboration is provided in SQL commands for sentiment analysis and summarization (in private preview).)

  • Text Summarization: Generate concise summaries of lengthy documents, emails, or reports to quickly grasp key information and save time. (Further elaboration is provided in SQL commands for sentiment analysis and summarization (in private preview).)

  • Translation: Require translation for the values in a specific column during your transformations? There’s no need to reach out to an external API; you can perform the translation seamlessly within Snowflake.

Advanced search features (in private preview)

Snowflake’s new advanced search features make it easier than ever for users to find the data they need, regardless of where it is stored. This is a major advantage over other cloud data platforms, which typically have more limited search capabilities.

Specifically, Snowflake’s new search features allow users to:

  • Search across databases, Iceberg tables, data inside Snowflake, Native Applications, and the Snowflake Marketplace.

  • Use natural language queries to search for data.

  • Filter search results by a variety of criteria, such as data type, file size, and creation date.

These advanced search features can help users to save time and be more productive when working with data. For example, a data analyst could use the new search features to quickly find all of the data that is relevant to a specific analysis project. Or, a business user could use the new search features to find all of the data that is related to a specific customer or product.

Not to mention that the use of the different snowflake filters provided by these search features (column naming, tagging, metadata specification) induces an increase in data consistency.

In conclusion, it’s worth noting that this addresses a key aspect of data catalogs. Given the inherent challenges in implementing a data catalog, leveraging this Snowflake integrated feature becomes particularly compelling.

SQL commands for sentiment analysis and summarization (in private preview)

Snowflake’s new SQL commands for sentiment analysis and summarization make it easy for users to extract insights from their data without having to learn any new tools or languages. This is a major advantage over other cloud data platforms, which typically require users to use separate tools for these tasks.

Specifically, Snowflake’s new SQL commands allow users to:

  • Analyze the sentiment of text data, such as customer reviews, social media posts, and survey responses.

  • Summarize text data to extract the most important information.

These new SQL commands can help users to save time and be more productive when working with data. For example, a business user could use the new SQL commands to quickly analyze the sentiment of customer reviews or to summarize a long and complex report.

Snowflake Copilot (in private preview)

It is an LLM-powered assistant to generate and refine SQL with natural language. In addition, this text-to-code functionality will be coming soon programmatically via a general purpose function, Text2SQL, with Snowflake Cortex.

Document Data Extraction with Document AI (in private preview)

Snowflake Cortex’s Document AI, powered by LLM allows efficient extraction of data from various document types (pdf, word, txt, screenshots), saving time and resources in manual data processing.

Overall, Snowflake’s new features are very impressive and have the potential to revolutionize the way that people use cloud data platforms.

Snowflake is clearly differentiating itself from the competition by offering innovative features that make it easier for users to develop and deploy AI applications, find data, manage ML features, and share ML models. Snowflake is also making it easier for users to extract insights from their data without having to learn any new tools or languages.

Feature store (in private preview)

A feature store is a central repository for managing and sharing machine learning features. Snowflake’s new feature store makes it easier for users to develop and deploy ML models.

This differentiates Snowflake from other cloud data platforms because it provides users with a single place to manage all of their ML features. This can help users to save time and improve the efficiency of their ML development process.

Here are some specific use cases for Snowflake’s feature store:

  • A data scientist could use the feature store to manage and share the features that they are using to develop a machine learning model. For more information on this subject, you can head here.

  • A machine learning engineer could use the feature store to deploy a machine learning model to production and ensure that it has access to the features that it needs. For more information on this subject, you can head here.

  • A business user could use the feature store to explore and understand the features that are being used to power their machine learning applications. For more information ont this subject, you can head here.

Model sharing

Snowflake’s ability to share ML models is also a major differentiator. This will make it easier for people to collaborate on ML projects and democratize access to ML.

Specifically, Snowflake allows users to share ML models with others, both within and outside of their organization. This can help to accelerate the development and adoption of ML applications.

Here are some specific use cases for Snowflake’s model sharing capability:

  • A data scientist could share a machine learning model with a business user so that they can use it to explore their data and generate insights.

  • A machine learning engineer could share a machine learning model with a DevOps engineer so that they can deploy it to production.

  • A software vendor could share a machine learning model with their customers so that they can use it to improve their products and services.

Snowflake Notebooks (in private preview)

Snowflake Notebooks, a cloud-based environment specifically designed for code development and execution, share similarities with notebooks that have been available for several years on other cloud data platforms. Databricks was the pioneer in this field, and BigQuery followed suit with its announcement on August 31. These Notebooks are thoughtfully designed to optimize performance within the Snowflake ecosystem.

Snowflake Notebooks offer a number of advantages over traditional IDEs, such as:

  • Collaboration: Snowflake Notebooks are easy to share with others, which makes it easy to collaborate on projects.

  • Interactivity: Snowflake Notebooks are interactive, so you can experiment with your code and see the results immediately.

  • Performance: Snowflake Notebooks are highly performant, so you can run complex queries and analyses quickly.

Snowflake Notebooks are currently in beta, but they are expected to be released to the general public in the near future.

What differentiates Snowflake as a Machine Learning development environment from others is the fact that it consists of Python APIs accessible through the Snowpark ML library, and SQL interfaces for defining, managing, and retrieving features, along with managed infrastructure for feature metadata management and continuous feature processing. This eliminates the need for specialized AI expertise. Moreover, the Feature Store and the Model sharing features empower Snowflake to catch up more effectively with their competitors, specifically Databricks, which obviously had the upper hand in the last couple of years due its integration with ML Flow.

Conclusion

Snowflake’s unveiling of new features, including Snowflake Cortex, Copilot, Document AI and notebooks, affirms the company’s commitment to innovation. While the data science announcements align with the current GenAI trends, it is imperative to approach them with caution. It is noteworthy that certain features, though announced long ago, are still pending universal availability across all environments. Finally, let’s approach these innovations with open minds and a cautious optimism, ready to embrace the possibilities while staying grounded in reality.

Medium Blog by Artefact.

This article was initially published on Medium.com.
Follow us on our Medium Blog !

Artefact Newsletter

Interested in Data Consulting | Data & Digital Marketing | Digital Commerce ?
Read our monthly newsletter to get actionable advice, insights, business cases, from all our data experts around the world!