Tag Archive: optical character recognition



Surya R Praveen Google Drive for iOS

Of all the exciting things Google announced at the I/O developer conference this week, one of the sleeper hits was Drive for iOS. “Sleeper” might be too strong a word — attach “for iOS” to anything and it’s going to get attention — but it’s hard for any announcement to compete for headlines against Project Glass, theNexus 7 with Jelly Bean, andskydivers. Part of what caused Drive to turn so many heads during the Day 2 keynote (aside from the iOS association) was its image recognition capabilities.

Drive uses optical character recognition (OCR) to perform certain tricks. This is nothing new for Google — back in 2009 Google Docs could take an image file or a PDF and convert it into a document. It could also change the orientation of one of those files using its ability to recognize characters on the page. These features were brought over to Drive, but so was basic image recognition. It might seem totally sci-fi, but if you upload a picture of a pyramid to Drive, you can search for it and the system will identify it based on what Google knows about pyramids. This feature might seem new, but Google has used image recognition in both Google Goggles and (in a tweaked form) Search by Image.

Surya R Praveen Google Drive for iOSIt’s not too shocking that Google would bring this feature over to the iOS and Android versions of Drive, but it is extra handy on a platform where you don’t have advanced search tools to sift through your 5GB of data. The bulk of the computing is taking place on the server side so it’s not like your iPad needs to be able to recognize that pyramid, you just need to have an internet connection so that Google can lend some of its search magic to the files you’ve placed on Drive.

I conducted an informal test of Drive’s image recognition capabilities using some resources from around the web, including a couples of documents and then some randomly chosen, generic images. I made sure they were free of any data that might help the search tools identify them (EXIF, file names, etc.) and then uploaded them to Drive. According to Google’s demo I should be able to identify each by a basic description of each of them. Here’s how it went…

  • 2 cat images – Correctly identified 2 out of 2 with the term “cat”
  • dog image – 1 out of 1 with the term “dog”
  • 4 pyramid images – 4 out of 4 with the term “pyramid”; 1 out of 4 with “Giza”
  • Samsung Galaxy Tab image – Did not identify as a “tablet”; did identify as “Samsung”
  • Dell laptop image – Did not identify as “laptop”; did identify as “Dell”
  • hard drive image – Did not identify as “hard drive”; did identify as “drive” and “disk”
  • hamburger image – Identified as “hamburger” but not as more vague terms like “food” or “dinner”
  • JPG of an invoice – Identified as an “invoice” and by the company names used on it
  • 1099 tax form, PDF – Identified by “taxes”, “tax form”, “1099″ and “1099 misc”
  • Failed to identify – Images I took of a refrigerator, graphics card, smartphone, and light bulb

With my limited data set the effectiveness of Google’s image recognition was mixed. It work very well with text in images, well enough with images that are in the Google Images database, and then it totally stumbled with my personal images. I gave the system sufficient time process the data (the better part of an hour) but it just couldn’t figure some out.

Surya R Praveen Drive

Google Drive: “wtf is that?”

Maybe the image recognition isn’t particularly new for Google but the impressive bit here is that the company was able to cram it into a mobile app, apply it to my personal data set in the cloud, and then have the app communicate with the cloud in a quick manner. Sure, most of the work is done immediately after the file is uploaded, but the recognition abilities and cataloging is nothing short of what we’d expect from Google.

Aside from the OCR abilities and the convenience of using my Google account, Drive for iOS is basically an adequate Dropbox competitor. It’s a smooth experience and with snappy performance on my iPad 3, plus I have the ability to choose exactly which files I want to store offline and which I don’t. The app makes it easy to see who I’ve shared a file with, which is great for people who in groups. Advanced (and nearly hidden) tools include the abilities to share with people in your contacts list and to rename files.

In addition to being an impressive trick, the OCR gives Drive a notable advantage over Dropbox. That service is able to search by filename and it has some cool features, like the ability to email a link, but the search tool can’t even dive into the text of a PDF. And, as we know, Google is only getting smarter while other services struggle to compete against its user base, computing power, and massive Knowledge Graph.

Read on for more of ExtremeTech’s Google I/O 2012 coverage

Source


Surya R Praveen Google Drive

Today’s debut of Google Drive is case-in-point why you never show your hand first. Yesterday Microsoft detailed plans for SkyDrive, aimed at giving Redmond a leg up in the competitive cloud storage space. Today’s Google Drive announcement blows SkyDrive out of the water.

Even though Google Drive offers two less gigabytes of space (5GB versus 7GB for SkyDrive), the tie-ins with other Google services are the difference. For example, Google takes Google Docs and incorporates it into Google Drive. Like the standalone product, users are able to collaborate on content, which can then be commented on by anyone who its shared with.

Google Drive also offers drag-and-drop integration for both the Windows and OS X, and mobile access through individual apps for iOS (once the app is available) and Android, just like SkyDrive. Fair enough — but where Drive really shines is in search.

Google makes everything that you upload into Drive searchable. The content of scanned documents is searchable through Optical Character Recognition (OCR) technology, and the company says image recognition technology will do the same for your images. That last part may be a little too much for some, but it is still more than Microsoft offers.

Surya R Praveen The only place Microsoft bests Google Drive is on extra storage. An extra 20GB of space (27GB total) on SkyDrive costs $10 yearly, with Google that will set you back about $30, billed $2.49 monthly. 100GB of space on Google Drive is $4.99 monthly or about $60 per year, with SkyDrive 100GB extra runs $50 per year.

Not to be outdone, Google offers additional capacity upgrades of 200GB to 16TB, ranging in price from $9.99 to $799.99 per month. That’s a lot more space than most of us will ever need, but the option is there.

Bottom line? Google Drive just ate SkyDrive’s lunch. Why did Microsoft let this happen in the first place, though? The company panicked we’re told.

Sources close to Microsoft tell ExtremeTech that the SkyDrive update was originally intended to launch next Monday (April 30th), but a decision was made to move the launch up one week based on the Drive rumors. There was a desire to get out first inside the company, it seems, although I am not sure they expected Google to launch the very next day.

Microsoft now finds itself in a predicament. Only a day old, the new SkyDrive already looks dated and nothing more than your average cloud storage service. On the other hand, Google comes out offering a set of compelling features above and beyond storage. Could Microsoft more tightly wound Bing into SkyDrive? It certainly could have. Microsoft does offer web versions of its Office platform to SkyDrive users, but they don’t seem as fleshed out and full-featured as Google Docs.

I have never been a fan of kneejerk reactions to the market as it leads to poor business decisions, which may have been the case here. Microsoft may have been best leaving to its original April 30 launch date to one-up Google rather than have its announcement overshadowed by its rival. Google wins again, and Microsoft is left scrambling to compete.

Source