Forum

Ignite Discussions : Ask Questions, Find Answers, Share Expertise about Sparkflows

To test this feature, visit your live site.

Nagisa

Jun 15, 2023

I have an employee department dataset containing salary information.

in Data Preparation

I want to get a salary based ranking within each department and location. How to achieve this in Sparkflows?

1 comment

Comments (1)

Namjoo

Jun 15, 2023

Hey Nagisa, In Sparkflows, we can use the ‘Multi Windows Ranking’ processor to get ranking within a partition. First it would create a partition by department and location. Then it would rank based on salary.

To use the ‘Multi Windows Ranking’ Processor:

- Select ‘Rank’ in ‘Windows Function’. It would output a rank value.

- Enter the columns used for partitioning the dataset in ‘PartitionBy’. In this case it would be ‘department’,’location’.

- Enter the columns used for sorting the dataset in ‘Order By’. In this case it would be ‘Salary’.

- Enter ‘Output column’ to list the output in the outgoing DataFrame. It would contain rank value within a partition.

For more information read the Sparkflows Documentation here:

https://docs.sparkflows.io/en/latest/user-guide/data-preparation/others.html?highlight=multi%20window%20ranking#multi-windows-ranking

Forum

I have an employee department dataset containing salary information.

© 2023 Sparkflows, Inc. All rights reserved.