Hey Nagisa,
In Sparkflows, the "Filter Unique" node performs exactly the operation you're looking for. Add the node after your input dataset and specify the column(s) that determine uniqueness. It produces two output dataframes: the lower edge carries the unique records, while the upper edge carries the duplicate records that were found.
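If it helps to see the logic the node applies, here is a minimal plain-Python sketch of the same split (the function name and sample data are hypothetical; inside Sparkflows this runs on Spark dataframes, not Python lists):

```python
def split_unique(rows, key_cols):
    """Split rows into (uniques, duplicates) based on key_cols.

    The first row seen for each key combination is kept as unique;
    any later row with the same key values is routed to duplicates,
    mirroring the two output edges of the Filter Unique node.
    """
    seen = set()
    uniques, duplicates = [], []
    for row in rows:
        key = tuple(row[c] for c in key_cols)
        if key in seen:
            duplicates.append(row)
        else:
            seen.add(key)
            uniques.append(row)
    return uniques, duplicates

# Hypothetical sample data: uniqueness determined by the "email" column
rows = [
    {"id": 1, "email": "a@x.com"},
    {"id": 2, "email": "b@x.com"},
    {"id": 3, "email": "a@x.com"},  # same email as id 1 -> duplicate
]
uniq, dup = split_unique(rows, ["email"])
```

Here `uniq` holds the rows with ids 1 and 2, and `dup` holds the row with id 3, just as the lower and upper edges of the node would.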