Deduplicate options mean?

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • bs27975
    Junior Member
    • May 2014
    • 29

    #1

    Deduplicate options mean?

    In sync settings settings are deduplication buttons (utilities). Be it Android, PC, or Google.

    The option names do not make clear each's meaning, could you clarify please?

    I see:
    - Keep First Retrieved
    [This seems meaningless - there seems no way to know which is which, to know to choose it or not.]
    - Keep Oldest Modified
    [This would seem to mean earliest modified, which is probably not what I would want - I would want the last / latest modified one, I suspect.]
    - Keep Oldest Created
    [This would not seem to be what I would want - it implies I created another one afterwards, which probably has more current information. But the latest modified would likely be more current.]

    Could you clarify these options please?

    If "Keep Oldest Modified" means "Most Recently Modified", perhaps you could have development rephrase for next version.
  • Thomas
    DejaOffice Team Member
    • Dec 2010
    • 3008

    #2
    bs27975,

    - Keep First Retrieved - Means when we compare the two records and the first record we get in our read of the data is the one we keep. This varies depending on the database\destination. Sometimes the list we are handed is in no particular order. For DPC (which we can actually control) the order is Create Date.
    - Keep Oldest Modified - Means when we compare the two records, we keep the one with the oldest modified date. It means just that, the record with the newer modified date would removed.
    - Keep Oldest Created - Means when we compare the two records, we keep the one with oldest created date regardless of the modified date.
    - Lead QA

    Comment

    • bs27975
      Junior Member
      • May 2014
      • 29

      #3
      Curious.


      > - Keep First Retrieved - Means when we compare the two records and the first record we get in our read of the data is the one we keep. This varies depending on the database\destination. Sometimes the list we are handed is in no particular order. For DPC (which we can actually control) the order is Create Date.

      So, essentially a meaningless option. Seems a crap shoot as to which one will be taken, the earlies, or the latest. I would have thought the records would be retrieved in some particularly requested order. Even id order would mean in order of creation.

      > - Keep Oldest Modified - Means when we compare the two records, we keep the one with the oldest modified date. It means just that, the record with the newer modified date would removed.

      This seems intuitively backwards to what one would want - yet an option for that latter seems to be unavailable.

      > - Keep Oldest Created - Means when we compare the two records, we keep the one with oldest created date regardless of the modified date.

      Same comment.

      What am I missing - it would seem intuitive that the latest modified date would likely be the one a user most recently corrected, and would have made corrections in expectations of continuing to use that version. Ergo it would be expected that that would be the one one would want taken.

      Am I overlooking something?


      Comment

      • Thomas
        DejaOffice Team Member
        • Dec 2010
        • 3008

        #4
        That is why we give the option, we have seen records where an old import was the issue so the Oldest created was the duplicated and where the newest modified was the duplicate. We can not assume to know all user's data, hence the choice.
        - Lead QA

        Comment

        • bs27975
          Junior Member
          • May 2014
          • 29

          #5
          Perhaps a latest modified option makes sense. It does seem intuitive. Especially as the use would have no way of knowing which order things might be retrieved in, to turn that on or off.

          Comment

          Working...