Supporting the Big Bots

Supporting the Big Bots

Just the facts please

  • Sub to Top 3 accounting firm
  • 250GBs of NSFs, PSTs, eDocs
  • 50 custodians
  • Hundreds of search terms
  • New search software
  • English and Spanish searches
  • 3,000,000 docs pre search
  • Dozens of search reports
  • 900,000 docs post search
  • Less than 2 weeks to complete
  • Oh, it’s hard work. We just make it look easy.

Challenge:

Visitor: Knock, knock.

Logik: Who’s there? 

Visitor: One of top 3 accounting firms in the world who’s desperately in need of eDiscovery assistance. 

Logik: Uh, you’re joking, right?

Top 3 Firm: No, seriously, we really need your help. 

Logik (opening the door wide with a grin): Well then, come on in and stay awhile.

That’s just about how things happened, only they used a phone. Our now very new client was in a serious eDiscovery jam. Their customer presented them with a mind-boggling and “need it now now now” eDiscovery request: process 250GBs of Lotus Notes .NSFs files, Microsoft Outlook .PSTs files and various MS Office documents (eDocs) from 50 custodians. Then search each of the 50 custodians with unique combinations of English and Spanish keywords, file type categories and date ranges. Next, tag any hit results with the category hit plus the keywords hit and load it all into kCura’s Relativity™ software, with a unique Bates number schema per custodian. One more thing… they need it all ready for review in three weeks.

Even for a multi-billion dollar accounting company with an excellent reputation for eDiscovery services, this was a project that needed special attention. They looked to Logik because they knew our capabilities and were comfortable with our team managing a very complex project. Ok, we said. No problem.

So, what’s the problem:


What we did:

The very first thing we did on this project was to convince our client that a universal Bates numbering schema for all 50 custodians would be 1,000 times more efficient than a unique number per custodian. Since the custodian information is captured in a database field, this wasn’t a big change request, but it was a change that saved our client from significant delays in deliveries. 

A project this complex needs a level of attention that only a few people possess. Lucky for us, we are all a little OCD when it comes to eDiscovery. We created detailed spreadsheets to track every single custodian, the keywords/categories hit and missed, the total documents and pages each category yielded. And to make sure we did this right, we also developed a new search tagging software to super-charge the tagging process.

This new search software enabled us to perform the complicated searches for all 50 custodians simultaneously. This took less than 30 minutes over a population of 3,000,000 documents. Putting things into perspective, before this project started the same process would have taken us at least two days to complete. A few dozen or so search reports later, the results were finalized. We exported the results along with all the search metadata for delivery into Relativity in less than two days.

The results:

More cases

Case Studies

Did you know?

  • That most near-dupe technologies can not group foreign language documents together?

  • That lots of useful and searchable content is missed by search engines because they do NOT perform OCR on the documents?

  • That a journalist at the New York Times OCRd 4 terabytes of TIFF images in under 24 hours with the use of Amazon’s EC2 cloud services?

  • That attorneys can be sanctioned for improperly handling eDiscovery processing?  Search for Bray & Gillespie.

  • That earlier versions of LexisNexis’ Concordance can display Unicode text if properly tweaked?

  • That Microsoft XLS files can contain hidden spreadsheets?

  • That by reading through all of these “Did You Knows” qualifies you as an eDiscovery ninja?

  • That Adobe Photoshop files contain multiple layers of information, most of which are hidden from view and cannot be seen without the use of Photoshop?

  • That Microsoft Outlook MSG files retain their attachments after processing, thus increasing the size of data you need to store on disk?

  • That removing near duplicate documents without first reviewing them could risk missing important information?

  • That Lotus Notes has a soft delete option that activates when you open a NSF and it will automatically delete emails marked with soft delete?

  • That rich text and html emails can contain white-on-white text?

  • That a thorough data map can help you to implement your data retention policy, and can equip you for your “meet and confer” conference?

  • That a standard DVD-R single layer can only hold ~4.7GB of information and takes ~30minutes to fully burn, whereas copying 4.7GB of files to a hard drive will only take 5 minutes?

  • That MS Excel 2007 supports over 1 million rows of data?