MS Access SQL If Row(n) = Row(n-1)

Question

I am developing an access DB where in one of the table inputs, duplicates regularly appear in (what I would eventually like to be) the primary key due to the use of some old software for reasons as can be seen below:

ID |   Fluid      |   Designer   |   Team        |
--------------------------------------------------
A  |   Water      |   John       |   Piping      |
A  |   Water      |   James      |   Piping      |
B  |   Steam      |   Sam        |   Piping      |
B  |   Steam      |   Sam        |   Modeling    |
C  |   Hydrogen   |   Joe        |   Piping      |
D  |   Steam      |   Joe        |   Piping      |

A duplicates query with reference to the ID field yields:

ID |   Fluid      |   Designer   |   Team        |
--------------------------------------------------
A  |   Water      |   John       |   Piping      |
A  |   Water      |   James      |   Piping      |
B  |   Steam      |   Sam        |   Piping      |
B  |   Steam      |   Sam        |   Modeling    |

Deleting these duplicates automatically is not desired as it is important to know, say, that the software model changed designer or was transferred to another team. As such, these are manually reviewed.

However, this table can contain as much as 30 fields and finding a single non-duplicate can be quite an eyesore. As such, what my desired output would be:

ID |   Fluid      |   Designer   |   Team        |
--------------------------------------------------
A  |              |   John       |               |
A  |              |   James      |               |
B  |              |              |   Piping      |
B  |              |              |   Modeling    |

I have very little experience with SQL but have spent some time using visual basic so my guess would be some criteria like:

if(record(n)=record(n-1), display("")

However from my basic understanding of access and SQL, I know that relating visual basic is by far, an apples to oranges comparison.

I'm running MS Access 2007-2010.

Edit: Looking into normalization now (apologies, I'm a beginner). More specifics though on my DB issue. Also updated the final table above.

How I believe my DB should work:

Import data from software (daily). This yields table 1.
Use a duplicate query to find any duplicates in the ID column. This is table 2.
Use a query to generate a table which will display the duplicated IDs along with the respective column that is not duplicated.

The errors in the software derive from the fact that it amends new data onto preexisting data but only removes exact duplicates. If say, a different user works on the model, then instead of updating the current ID, it adds a new row with all details the same except a different user.

You should normalize your database. This is redundancy, which is not good for you. — Lajos Arpad, Jul 26 '18 at 11:03
I'd suggest [normalising](https://en.wikipedia.org/wiki/Database_normalization) your data first. I originally thought _ID_ was the identifier for _Fluid_ but _Steam_ has an _ID_ of _B_ & _D_. I'd split _Fluid_, _Designer_ and _Team_ into their own tables so there's no duplication of data. — Darren Bartrup-Cook, Jul 26 '18 at 11:07
Wouldn't _Steam_ have _Sam_ and _Joe_ as designers in your final table or is the duplication just based on the _ID_? When you say _30 fields_ do you mean (hate to say this when talking about databases) _30 columns_? — Darren Bartrup-Cook, Jul 26 '18 at 11:23
@DarrenBartrup-Cook Yes, I do mean 30 columns. Updated my post to be (hopefully) more helpful. Also I made a mistake in the last table which I updated. Thanks for the replies so far and I'm looking into normalization. — OriginalDisplayName, Jul 26 '18 at 11:32

score 0 · Accepted Answer · answered Jul 26 '18 at 19:38

Conditional Formatting

It sounds like what you are looking for is an easy way to visually spot the (few) differences in a long row of columns in similar records. I had a very similar project that I did a few years back that produced a report to view changes a user made to their profile. The way I approached it was to use Conditional Formatting to highlight the fields that were different.

In your comparison report, I would also group by the ID column with a group header to make an easy visual separation between the groups.

For the comparison formula in your report, you can compare the Min() and Max() values for the field in the group. If the Min and Max are the same, the fields match across the group. If not, you can highlight the field using conditional formatting.

I suppose if you wanted to get really fancy, you could expand this expression to use a running sum to only highlight the change in the second record.

Hiding Duplicate Values

Another approach would be to hide the duplicate values for subsequent records in the group. This would involve the same grouping concept, but in the report you would set the Hide Duplicates property on the fields to True. Taking this approach, you would see a complete record at the top of the list, but then the fields of subsequent records would only display when the value changed.

Fantastic, this really helps. Thank you so much. – OriginalDisplayName Jul 27 '18 at 11:19 — OriginalDisplayName, Jul 27 '18 at 11:19

MS Access SQL If Row(n) = Row(n-1)

1 Answers1

Conditional Formatting

Hiding Duplicate Values