How can I delete a duplicate entry automatically?

karthikk_vijay · September 1, 2021, 12:15pm

Hey everyone, nice to be a part of the fib fam

So, we import lots of data through CSV, and sometimes there might be duplicates from different files that get imported into the same type.

Is there a way that I can set up an automation which checks for duplicate name fields and then deletes one of them?

Thanks in advance

Chr1sG · September 1, 2021, 4:47pm

hey there, and glad to have you in the fib fam

When you have duplicates, does it matter which one gets deleted?
Is it enough to assume that they are duplicates if the name is the same?
Does the name have to match exactly, or does name=Name=NAME for example?

karthikk_vijay · September 1, 2021, 5:57pm

I think for this specific issue at hand :

it doesn’t matter which one gets deleted
Yes, lets just say the names can be used to identify the duplicates
Name has to match exactly

I think the best thing would be if while importing fibery told me which ones are duplicate and then I could choose what the best course of action would be from there after reviewing. But if that’s not possible then setting up an automation that deletes these duplicates might be best.

Thanks!

Chr1sG · September 1, 2021, 6:31pm

It’s actually quite difficult to do automatically, but there is a workaround way of highlighting duplicates:

Create a many-to-many auto-relation to the type itself, where the criteria for a match is Name = Name:
e.g. for the Task type:

Then create a formula that counts the number of tasks (or whatever) with the same name:

When looking at a view of the entities

adding a filter will make it easy to see the duplicates:

I think adding a duplicate check on CSV import would be a nicer solution, but until that’s available, this was the best I could come up with

karthikk_vijay · September 2, 2021, 3:09am

That’s some great logic work there @Chr1sG thanks for this, I think this could work for the time being

There’s still one problem though, that filter is going to list all the duplicates including one original copy that I would like to leave behind. So I guess I would still have to manually go through and delete them carefully. If there was a way to leave behind one of them and filter out all extra copies in a view, then it would be much easier to batch select and delete things.

Chr1sG · September 2, 2021, 6:45am

Well, here’s another neat little trick that will allow you to select only one of the duplicate items:

Create a formula field as follows:

This will be true for all but the oldest of the duplicates:

You can now filter on this flag and all but one of the entities in each group of matches will be shown:

These can now be safely deleted

Matt_Blais · July 6, 2022, 3:55pm

Related: Check this post for (among other things) a script that does entity de-duplication using the GraphQL API:

Topic		Replies	Views
Form Improvement: Duplicate Check Ideas & Features	8	615	May 8, 2025
New template - Deduplication 👯‍♂️ News & Announcements	4	717	September 19, 2022
Script Request: Mark duplicates in relational DB Get Help	6	356	August 18, 2022
Import CSV and Update Existing Entities (where matches exist) Get Help	4	122	August 14, 2025
[✔️ DONE] Duplicate a record Ideas & Features	12	2586	May 13, 2022

How can I delete a duplicate entry automatically?

Related topics