Dataprep: How to clean duplicate rows for a key when other columns have multiple values

Hi there,

I have a dataset with duplicate rows for my key value. These are users who have signed up under different names. I need one row per key (email).

Normally I would aggregate the data and group by the key, but I can’t seem to do that for string columns.

My next idea is to create a simple counter that numbers the rows within each key, and then filter to keep only the first row per key (counter = 1). Any suggestions on how to do this?

I know it’s not ideal, and we need to clean our source data, but I just need unique rows for now.

Thanks!

Update: just discovered the ROWNUMBER() function, haha.
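
For anyone who wants to check the counter idea outside Dataprep, here is a minimal pandas sketch of the same approach: number the rows within each key, then keep the rows where the counter is 1. This is not Dataprep syntax, and the sample data and column names (email, name) are made up for illustration.

```python
import pandas as pd

# Hypothetical sample data; the column names are assumptions for illustration.
df = pd.DataFrame({
    "email": ["a@example.com", "a@example.com", "b@example.com"],
    "name": ["Ann", "Annie", "Bob"],
})

# Counter that restarts at 1 for each email, following the current row order.
df["counter"] = df.groupby("email").cumcount() + 1

# Keep only the first row per email, then drop the helper column.
unique_rows = (
    df[df["counter"] == 1]
    .drop(columns="counter")
    .reset_index(drop=True)
)

print(unique_rows)
```

Which row counts as "first" depends on the row order, so sort beforehand if a particular name should win. In pandas, df.drop_duplicates(subset="email", keep="first") gives the same rows without the helper column.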