How to check a file for duplicate lines
In this edition of “eDiscovery-related Python Tricks,” we’ll cover some fundamental techniques and operations that you’ll likely find yourself using repeatedly. Suppose you’ve been given the task of merging load files from several productions together. You’re fairly sure that merging several files together has left the load file with duplicative lines, but the file is large and this would be difficult to determine manually. While this example may seem a little contrived, it will provide a simple setup for laying foundation that will likely be re-used when we get to more interesting examples...Read more