FAQs

Security, backups, policies, etc.

What is your chain of custody policy? 

Each piece of media we process (e.g. hard drive, DVD, CD, email attachment) is assigned a unique identification number and logged in our project management software. We keep track of everything about the media: date received, who logged it, serial numbers, media format, color, labels, size (in MB), your chain of custody numbers, and when we send the data back to you.

Is my data backed up in case of an emergency?

Data is backed up once in the morning and once at night. The data is also written to multiple disks instantly, ensuring redundancy.

How do I know my data is segregated from other clients’ data?

Our enterprise storage area network allows us to create a unique volume for each individual project.  All data for a given project is stored within this volume and periodically backed up.

How long do you keep my data active?

Depending on the pricing plan you have, we keep the data for all projects active for two months after the last accessed date. After the second month of inactivity, we notify you that we are planning to back up your data to external hard drives. We give each customer the option of backing up to off-network storage and storing it off-site, destroying the data, or keeping the data active on-network. Each option has an associated fee.

What procedures do you use to ensure my data is secured during shipment?

All deliveries made on a hard drive arrive in a custom-molded Pelican case. Pelican cases are waterproof and shockproof, which makes them well suited to carrying sensitive information. Each case can hold up to three hard drives and cables. The cases come locked and labeled. You will receive an email with the unlock code along with the labeling information prior to delivery. Watch how Logik-tuff they are.

How often do you invoice?

Most invoices are generated and sent via email within 24 hours after delivery.  However, depending on the contract or project, we may invoice at the end of the week or month.

Your business cards are plastic. Isn’t it more environmentally friendly to use paper?

Thanks! We love our cards; they are unique. And here’s why we make others feel green (with envy) about them: the cards are printed on recycled plastic and we always order in small quantities. Most paper-based business cards are ordered in bulk (500, 1,000, or 2,000) when only 200 might be needed. The combination of recycled plastic and small quantities is actually more environmentally friendly than the traditional paper card.

Are you 24/7?

As a sign of our addiction to quality service, we work when you need us to work. However, that doesn’t mean we have zombies working 24/7 waiting for something to happen. For every project, you’ll work closely with an account manager and a project manager to make sure your team’s needs are always met (and exceeded). Additionally, our entire operations team is available to help with requests. You can contact your project managers at any time by phone or email.

Where is my evidence stored at Logik?

All project evidence is logged and labeled as soon as it comes into our possession. Evidence information, including physical details and the dates and times of arrival and return, is meticulously maintained in our Salesforce application. Physical media is stored in a locked cabinet within our server space. Access is available only by biometric lock and PIN code.

Can I FTP my data to you?

If you have an Internet connection, you can FTP data to us.  Our high-speed Internet connection can handle pretty big uploads and downloads.  Give us a call or shoot us an email and we will set you up with an account.

 

Features, functionality, capacity, etc.

What languages can you detect?

  • Japanese (all encodings)
  • Korean
  • Chinese (simplified and traditional)
  • Thai
  • Danish
  • Dutch
  • French
  • German
  • Italian
  • Spanish
  • Norwegian
  • Portuguese
  • Swedish
  • Czech
  • Hungarian
  • Polish
  • Romanian
  • Russian
  • Arabic
  • Greek
  • Hebrew
  • Turkish
  • English

How often do you update your Gridlogik™ platform with new features?

We generally have a monthly release cycle for new features, but more advanced features tend to take a little longer and are released when complete.  Our team of engineers will work with any client in need of a faster turnaround.

Can you support foreign language data?

We support all languages, but we currently only auto-detect 23 languages. Our technology detects the encoding of each document before further processing. This allows us to treat the language appropriately.

I have a PDF portfolio document. What can you do with this?

PDF portfolio documents can contain hidden metadata, attachments and individual emails. We parse through the portfolio and extract each file individually, along with its attachments and hidden metadata. This process effectively breaks apart the bloated PDF file into smaller chunks, making it easier to read and search.

Is my data encrypted in transit to and from offices?

Yes, if you want it to be. We use TrueCrypt to encrypt your data during shipment. The media is further protected by our secure, air-tight and shock-resistant Pelican cases.

How do you handle Lotus Notes natively?

The right way. We know that Lotus Notes is a complex enterprise application that many companies use exclusively for email communication, scheduling and contact management. Lotus allows users to create custom databases for storing content of various types. Because of the complexity and variety of Lotus Notes data, the best way to process this information is with the native application.

Our software automates this interaction and allows us to extract more data and cleaner content than other solutions. Don’t let someone convert your Lotus Notes data to another format before processing.

What is your daily capacity?

In terms of raw processing speed, we can fully index between 100 and 200 GB of native files per 24 hours. If we convert your data to images, we can create 1,000,000 TIFFs per 24-hour period. These metrics are for processing time only and do not include project consulting, setup time or export. They are estimates, not a guarantee.
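
As a rough illustration of how the throughput figures above translate into a processing window, the sketch below estimates elapsed time for a data set. The function names and the example sizes are hypothetical; only the rates come from the answer above, and like them the result is an estimate rather than a guarantee.

```python
def indexing_days(size_gb, gb_per_day_low=100, gb_per_day_high=200):
    """Return (best_case, worst_case) days to index size_gb of native files."""
    return size_gb / gb_per_day_high, size_gb / gb_per_day_low


def tiffing_days(page_count, tiffs_per_day=1_000_000):
    """Return the days needed to image page_count pages."""
    return page_count / tiffs_per_day


# Hypothetical project: 500 GB of native files, 2,500,000 pages to image.
best, worst = indexing_days(500)
print(f"500 GB: {best:.1f} to {worst:.1f} days of indexing")
print(f"2,500,000 pages: {tiffing_days(2_500_000):.1f} days of imaging")
```

Consulting, setup and export time would be added on top of these figures.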

How many projects can you run concurrently?

We often work on more than a dozen cases at a time.  Our ability to be successful with multiple simultaneous projects comes down to how well we perform in three key areas.

  • Flexible technology: Our architecture is scalable. This is one of the many benefits of virtualization. If new engagements stretch our existing capacity, we simply create more server clones on the fly.

  • The best staff: Logik hires and trains the brightest multi-talented people. Our team consists of industry veterans, consultants and technologists with distinguished backgrounds. We recognize the importance of cross-training our employees, meaning Todd in Sales can help monitor projects, and Adam from R&D knows about our pricing structures.

  • Smart management: Our project and account managers are used to tight deadlines and handling eDiscovery requests of all shapes and sizes. Managing this many simultaneous projects requires exceptional communication and organization skills. We are strong at both.

Do you keep document families together?

Attachments are extracted during indexing and treated independently, though their relation to parent emails is always maintained. Similarly, embedded objects are extracted and treated as separate files while maintaining the connection to the parent document.

These are extracted recursively to any depth level.  We offer clients a variety of options for viewing these relationships.  The most common is to provide BegAttach and EndAttach fields. 

We also provide AttachRange, ParentID, and ChildIDs fields for many of our clients.  In addition to the standard fields used for identifying relationships, we provide a True/False field that signals when an embedded object is present.
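
To make the relationship fields more concrete, here is a minimal sketch of how they might be populated for one family. It assumes a common load-file convention in which BegAttach is the first document ID in the family and EndAttach is the last. The function name, the DocID field, the ID format and the ";" separator are all hypothetical, and the sketch flattens the family to one level rather than tracking nested parents.

```python
def family_fields(parent_id, child_ids):
    """Build relationship fields for one document family.

    Assumes IDs are listed in production order with the parent first
    (an illustrative convention, not necessarily the one in any export).
    """
    members = [parent_id] + list(child_ids)
    beg, end = members[0], members[-1]
    rows = []
    for doc_id in members:
        is_parent = doc_id == parent_id
        rows.append({
            "DocID": doc_id,
            "BegAttach": beg,
            "EndAttach": end,
            "AttachRange": f"{beg}-{end}",
            "ParentID": "" if is_parent else parent_id,
            "ChildIDs": ";".join(child_ids) if is_parent else "",
        })
    return rows


# Hypothetical email with two attachments.
for row in family_fields("DOC0001", ["DOC0002", "DOC0003"]):
    print(row)
```

Because every member carries the same BegAttach and EndAttach values, sorting or filtering on either field keeps the family together in a review tool.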

Can you detect more than one language within a document?

Yes, our foreign language detection algorithms are able to capture document encoding and to identify multiple languages, even in the same document. 

We supply fielded information containing the document languages and a relative weighting metric based on detection certainty.  We’ll also provide a field populated with all languages detected across an entire family of documents.

What email applications have you processed?

You name it, we’ve done it. Microsoft Outlook, Outlook Express, Eudora, Thunderbird, Lotus Notes, Bloomberg, Netscape, Mbox, and multiple other text-based email archives. It’s all just metadata and text to us.

What file types can’t you support?

The only files we cannot make searchable are those that contain no text that can be extracted or OCR’d (pictures, some 3D modeling files). We also advise caution when dealing with relational databases.

Often these file types do not lend themselves well to data processing, and especially not to tiffing. Much of the content of these file types is determined by the context created by the structure and format of the database. Additionally, there will always be certain file types that do not contain printable content. We provide slip sheets for these documents that list the file name and, if desired, the MD5 hash value.

What load files can you export to?

You name it, we’ve done it. Concordance, Summation, Iconect, Ringtail (we are certified), Ipro, CaseLogistix, Opticon, Introspect, Catalyst, and Kcura, just to name a few. We have worked with Attenex, Clearwell, Stratify and others.

How does your email threading work?

Our email threading technology enables your review teams to quickly and easily sort all of their email by conversation thread. You can sort by custodian, or within a custodian’s PST or just by conversation thread.

Sorting by the conversation thread alone is usually not a good idea if multiple PSTs are involved, so we provide two other fields, EMAIL_SORT1 and EMAIL_SORT2. Using a combined field sort, you can sort conversations within folders within PSTs, and by custodian if that information is provided in the file path. Using email thread analysis is a good idea. When you find an interesting document, you can easily review the entire conversation in order and in context. Alternatively, use threading to quickly identify an entire conversation and the related group of documents as non-responsive if off topic.
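
A combined field sort of this kind can be sketched as below. What each sort field actually holds is not spelled out here, so the example assumes EMAIL_SORT1 carries the PST and a second field (written EMAIL_SORT2) carries the folder. The THREAD and SENT field names, the DocID values and the sample records are likewise hypothetical.

```python
from operator import itemgetter

# Hypothetical records: one conversation (T1) spans two PSTs.
records = [
    {"DocID": "D2", "EMAIL_SORT1": "smith.pst", "EMAIL_SORT2": "Inbox",
     "THREAD": "T1", "SENT": "2009-03-01"},
    {"DocID": "D4", "EMAIL_SORT1": "jones.pst", "EMAIL_SORT2": "Inbox",
     "THREAD": "T2", "SENT": "2009-03-02"},
    {"DocID": "D3", "EMAIL_SORT1": "jones.pst", "EMAIL_SORT2": "Inbox",
     "THREAD": "T1", "SENT": "2009-03-03"},
    {"DocID": "D1", "EMAIL_SORT1": "jones.pst", "EMAIL_SORT2": "Inbox",
     "THREAD": "T1", "SENT": "2009-03-01"},
]

# Sort by PST, then folder, then conversation, then sent date.
ordered = sorted(
    records,
    key=itemgetter("EMAIL_SORT1", "EMAIL_SORT2", "THREAD", "SENT"),
)

print([r["DocID"] for r in ordered])
```

Sorting on THREAD alone would interleave the two PSTs; placing the sort fields first keeps each conversation grouped inside its own PST and folder.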

How does Simlogik™ work?

Simlogik is a technology that can detect and group similar or near-duplicative files. Unlike de-duplication technologies that detect exact duplicates, Simlogik detects slight variations in documents.

Simlogik is Unicode friendly. Unlike other near-duplicate technologies, Simlogik is not limited to just ASCII encoded files.  This means Simlogik can detect and group similar Chinese-to-Chinese, Japanese-to-Japanese, and even Arabic-to-Arabic documents together.

Note that this does not mean Simlogik can group a Chinese document with a similarly worded Arabic document.

I need more details on Simlogik™

We extract and normalize document text from native files using GridLogik.  Then, our algorithm assigns a doc-by-doc match score based on overlapping regions of the extracted text.  Extracted text allows for similarities to be detected across file types (e.g., email body text that is pasted into a Word document).  Files which have match scores above a certain threshold are considered ‘connected’. 

Connected groups of documents are assigned an id number which allows Simlogik near-dupes to clump together when sorted in a document review tool like Concordance. You can also use the id numbers to make real-time queries if using an SQL-based document review tool (e.g. “find all documents that are 75% or more similar to THIS doc”). We calculate a doc-level group number and a parent-level group number which is cascaded down to children. The parent-level number allows children to remain grouped with near-dupe parents when the results are sorted.
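
The general idea of scoring text overlap and then grouping connected documents can be illustrated with a generic technique: word shingles compared by Jaccard overlap, with a union-find structure to assign group ids. This is not the Simlogik algorithm, which is not described in that detail here. The function names, the shingle size and the 0.5 threshold are assumptions chosen for the example.

```python
from itertools import combinations


def shingles(text, size=3):
    """Return the set of overlapping word n-grams in lowercased text."""
    words = text.lower().split()
    if len(words) < size:
        return {tuple(words)}
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def match_score(a, b):
    """Jaccard overlap of two shingle sets, from 0.0 to 1.0."""
    sa, sb = shingles(a), shingles(b)
    return len(sa & sb) / len(sa | sb)


def group_ids(docs, threshold=0.5):
    """Give the same group id to documents connected by a high score."""
    parent = {doc_id: doc_id for doc_id in docs}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in combinations(docs, 2):
        if match_score(docs[a], docs[b]) >= threshold:
            parent[find(a)] = find(b)
    return {doc_id: find(doc_id) for doc_id in docs}


# Hypothetical extracted text for three documents.
docs = {
    "A": "please review the attached draft contract before friday",
    "B": "please review the attached draft contract before monday",
    "C": "lunch menu for the cafeteria this week",
}
print(group_ids(docs))
```

Here A and B differ by one word and land in the same group, while C stays on its own. Comparing every pair is fine for a sketch but would not scale to large collections without an indexing step.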

How fast can you endorse 100,000 pages?

An endorsing project usually has several variables, but a conservative estimate is that we can brand 100,000 pages in about 20 minutes. This includes stamping a Bates number and a single, consistent endorsement designation. Multiple designations and/or handling redactions would add time to the project.

What is your de-duplication process for emails and loose files?

  • Emails: For email, we create an MD5 hash value by stringing together the following fields: From + To + BCC + CC + Subject + Sent Date + Received Date + BodyText (minus header) + Attachment Names = [MD5 hash]. If one character differs in any of these fields, the MD5 hash value will be different. So, if an email is exactly the same as another email except that one has a BCC recipient, each email will be kept, because the hash values will be different.

  • Loose files: For loose files (eDocs), we create an MD5 hash value by running the entire file through the MD5 algorithm. Only the content of the file is analyzed for the MD5 hash, not the file name. So, two files with different names can still be exact duplicates of one another, because the file name doesn’t affect the MD5 hash.
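
The two hashing rules above can be sketched with Python's standard hashlib module. The dictionary keys, the plain concatenation with no delimiter and the UTF-8 encoding are assumptions made for the example; the field order follows the formula given for emails.

```python
import hashlib

# Field order follows the email formula; the key names are illustrative.
EMAIL_FIELDS = ("from", "to", "bcc", "cc", "subject", "sent_date",
                "received_date", "body_text", "attachment_names")


def email_hash(email):
    """MD5 of the de-duplication fields strung together in a fixed order."""
    joined = "".join(email.get(field, "") for field in EMAIL_FIELDS)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


def file_hash(content):
    """MD5 of a loose file's bytes; the file name is never an input."""
    return hashlib.md5(content).hexdigest()


# Hypothetical emails that differ only in the BCC field.
base = {"from": "a@example.com", "to": "b@example.com", "subject": "Q3",
        "sent_date": "2009-03-01T09:00", "body_text": "See attached."}
with_bcc = dict(base, bcc="c@example.com")

print(email_hash(base) == email_hash(with_bcc))             # False: both kept
print(file_hash(b"same bytes") == file_hash(b"same bytes"))  # True: duplicate
```

A production version would likely normalize whitespace and address formatting before hashing, since any single-character difference changes the result.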

What if I want to re-populate dupes?

So let it be written, so let it be done.  Since Gridlogik tracks every single byte of every file it comes in contact with, re-populating duplicate documents is very easy to do.  We can also provide a detailed de-dupe audit log that details what was de-duped and why. 

This is very helpful for horizontal de-duplication, because the log will show that a different person’s email may not be included, because it was de-duped.

About Us

Did you know?

  • That rich text and html emails can contain white-on-white text?

  • That running a front-end file-type filter using the visible document extension will likely miss many documents that match the criteria in content, but don’t have the correct extension (e.g. myxlsdoc.xlerd)?

  • That the European Union’s Directive on Data Protection mandates that any non-EU recipient of EU-based personal data must provide the required levels of privacy protection? Logik is Safe Harbor Certified.

  • That Japanese documents can come in 1 of 3 different character sets?

  • That 1 gigabyte of information is actually 1,024 megabytes, not 1,000 megabytes?

  • That Microsoft Exchange (.edb) databases can be easily opened by a variety of software products?

  • That producing in native format isn’t all that it’s cracked up to be, and sometimes producing in tiff with metadata can be faster and easier?

  • That Google Gmail emails can be downloaded to Microsoft Outlook using a POP3 or IMAP connection?

  • That transferring sensitive data via a device (like a hard drive) in a cardboard box (like a bankers box) is highly susceptible to promoting disk failure?

  • That Microsoft Outlook MSG files retain their attachments after processing, thus increasing the size of data you need to store on disk?

  • That transporting your sensitive evidence in an unsafe container, like a cardboard box, is ok until that box is dropped on the floor or lands in a puddle?

  • That Microsoft PPT files can contain a hidden master slide that may have many more slides than the actual PPT itself?

  • That a thorough data map can help you to implement your data retention policy, and can equip you for your “meet and confer” conference?

  • That not all OCR software is created equal and that many don’t work very well?

  • That search terms generally miss over 50% of would-be relevant content according to TREC?
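
The point in the list above about extension-based filters can be illustrated by identifying a file from its leading bytes instead of its name. This is a minimal sketch with only three well-known signatures; the function name and labels are hypothetical, and real file identification covers far more formats.

```python
# Leading-byte signatures for a few common formats.
SIGNATURES = {
    b"%PDF": "pdf",
    b"PK\x03\x04": "zip container (e.g. docx, xlsx)",
    b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1": "OLE2 compound file (e.g. xls, doc, ppt)",
}


def sniff_type(header):
    """Guess a file type from its first bytes, ignoring the extension."""
    for magic, label in SIGNATURES.items():
        if header.startswith(magic):
            return label
    return "unknown"


# A legacy spreadsheet saved as myxlsdoc.xlerd still starts with the
# OLE2 signature, so a content-based check would still catch it.
header = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1" + b"\x00" * 8
print(sniff_type(header))
```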

Like Red Wine?

Enter to win a case (Twelve 750ml bottles) of Logik Redaction, our very own red Zinfandel, bottled and ready to drink by early 2010.
