Search

Using Embeddings to Train and Build Privacy-Preserving Machine Learning Models and Applications

Contribution type: article

Title: Using Embeddings to Train and Build Privacy-Preserving Machine Learning Models and Applications

Authors:

Dalmo Cirne, Workday, Inc., USA
Pierce Buckner-Wolfson, Wesleyan University, USA

Keywords: Embeddings, Machine Learning, Privacy

Abstract:

Protecting data privacy is a critical responsibility for application developers. However, without data, it becomes impossible to build entire categories of products. This paper proposes an innovative method for training Machine Learning (ML) models using only embeddings (derived data). Embeddings represent the original data as multidimensional vectors and, as such, can be plotted and clustered in hyperspace, enabling effective solutions for problems such as anomaly detection, search, sentiment analysis, recommendation, and graph prediction, without requiring access to the raw data. Using only derived representations of the data and their clustering patterns, this approach preserves privacy while allowing responsible application development. Many machine learning algorithms such as Neural Networks, Gradient Boosting, and K-Nearest Neighbors are well-suited tools, providing cost-effective and computationally efficient alternatives to Large Language Models (LLMs). The approach proposed here balances data-driven innovation that is compliant with strict privacy requirements while unlocking the space for the development of powerful applications.

Publication Date: November 16, 2025

Presented during:

Dates: November 16, 2025 to November 20, 2025

Location: Nice / Saint-Laurent-du-Var, France

Venue:

Novotel Nice Airport Cap 3000

40 Avenue de Verdun
06700 SAINT LAURENT DU VAR
France

Hotel website

Copyright (c) DTR Society, 2025

Contact Us

Proposals

The submission system is currently being prepared. We invite you to subscribe to the newsletter so you will be the first to know when it goes online.