Use optical character recognition to read images
As of January 31, 2017, Gmail data loss prevention (DLP) is available only with G Suite Enterprise. Customers who are licensed with G Suite Business on March 31, 2017 can continue to use Gmail DLP until January 31, 2020, provided they continuously renew their G Suite Business license during that period.
Optical character recognition (OCR) is a technology that extracts text from images. It scans GIF, JPG, PNG, and TIFF images. If you turn it on, the extracted text is then subject to any content compliance or objectionable content rules you set up for Gmail messages.
For example, say you configured your content compliance setting so that messages with credit card numbers are moved to quarantine. A customer sends you a message with a PNG image attachment of an invoice that contains a credit card number. If you turn on OCR, Gmail converts the image attachment to text, detects the credit card number, and moves the message to quarantine.
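The flow in this example can be sketched in a few lines of Python. This is only an illustration of the idea, not how Gmail implements content compliance: the function names, the digit pattern, and the use of a Luhn checksum to filter out random digit runs are all assumptions made for the sketch, and the input string stands in for whatever text OCR extracts from the image.

```python
import re

# Assumed pattern: 13 to 19 digits, optionally separated by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def contains_card_number(extracted_text: str) -> bool:
    """Look for a digit run in the extracted text that passes the Luhn check."""
    for match in CARD_PATTERN.finditer(extracted_text):
        digits = re.sub(r"[ -]", "", match.group())
        if luhn_valid(digits):
            return True
    return False


def route_message(extracted_text: str) -> str:
    """Hypothetical rule: quarantine when a card number is found, else deliver."""
    return "quarantine" if contains_card_number(extracted_text) else "deliver"


# Text as it might come out of OCR for an invoice image (made-up sample,
# using a well-known test card number).
print(route_message("Invoice 1042. Card: 4111 1111 1111 1111"))
```

With OCR turned off, the rule would never see this text, because the card number exists only as pixels in the PNG attachment.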
Note: OCR doesn’t scan images embedded in attached files, such as Adobe® PDF or Microsoft® Word® documents. OCR also isn’t 100% accurate, so it can miss content that would otherwise trigger a compliance setting action.
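To estimate which attachments in a message OCR would look at, a rough check by file type can help. The sketch below is an assumption-heavy illustration: it uses the filename extension as a stand-in for the image format (Gmail's actual detection method isn't described here), and the helper names are hypothetical.

```python
from pathlib import PurePath

# Formats named in this article. The ".jpeg" and ".tif" spellings are an
# assumption added for illustration; the article lists GIF, JPG, PNG, and TIFF.
SCANNED_EXTENSIONS = {".gif", ".jpg", ".jpeg", ".png", ".tif", ".tiff"}


def is_scanned_by_ocr(filename: str) -> bool:
    """Return True if the attachment is a standalone image in a listed format."""
    return PurePath(filename).suffix.lower() in SCANNED_EXTENSIONS


def partition_attachments(filenames):
    """Split attachment names into (scanned, skipped) lists, keeping order."""
    scanned = [name for name in filenames if is_scanned_by_ocr(name)]
    skipped = [name for name in filenames if not is_scanned_by_ocr(name)]
    return scanned, skipped


scanned, skipped = partition_attachments(
    ["invoice.PNG", "contract.pdf", "scan.tiff", "notes.docx"]
)
print("Scanned by OCR:", scanned)
print("Not scanned:", skipped)
```

Images inside the PDF and Word files land in the skipped group, so any sensitive content they contain would need to be caught by other means.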
Turn OCR on or off
- Sign in to your Google Admin console.
  Sign in using your administrator account (does not end in @gmail.com).
- From the Admin console dashboard, go to Apps > G Suite > Gmail > Advanced settings.
  Tip: To see Advanced settings, scroll to the bottom of the Gmail page.
- (Optional) On the left, select an organization.
- Scroll to the Optical Character Recognition (OCR) setting in the Compliance section.
- Check or uncheck the Enable OCR for email attachments box to turn OCR on or off.