pullpush-actual Well, you don't really need to judge which is the "correct" version, you can simply mix them, and create a "correct" one. What I mean is that, since the only useful values for a comment is the author
and body
, and, they are also the only values which could change in different versions, (for the same comment, no matter which version, they always have the same id
, created_utc
etc), so you can just extract the correct author
and body
, then "create" a comment with the rest info of any version.
For the author
, it only has 2 possible values, normal or [deleted], so if any version has a normal value, take it.
For the body
, there are more choices, normal, [deleted], [removed], [ Removed by Reddit ], but all 3 last values are useless, no matter if the comments is really removed, or it's just someone who has literally typed "[removed]", they are worthless anyway, so you can also simply pick a normal value from any version.