Millions of Historic Images Added to Flickr by Internet Archive





Earlier this year the Internet Archive began extracting more than 14 million images from its public domain ebooks and uploading them to its Flickr account. The historic images are now easy to search and download, which previously meant downloading each ebook, finding the images yourself and then exporting them.

The text of the ebooks was already easy to search thanks to the Optical Character Recognition (OCR) software used when they were added to the archive, but OCR only reads text, so the images themselves could not be searched. Now, with the help of Flickr, those looking for historic text and images can get the best of both worlds.

“The software also copied the caption for each image and the text from the paragraphs immediately preceding and following it in the book,” said Kalev Leetaru, the Internet Archive’s Communications Technology Scholar, when speaking with the BBC.
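To make that description concrete, here is a minimal sketch of how a caption and the neighbouring paragraphs might be gathered for each image. It is not the Internet Archive's actual code: the `Block` and `ImageRecord` types, the `describe_images` function and the assumption that a book page arrives as an ordered list of paragraph, image and caption blocks are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Block:
    """One piece of a scanned page (hypothetical layout model)."""
    kind: str       # "paragraph", "image" or "caption"
    text: str = ""  # OCR text; empty for images


@dataclass
class ImageRecord:
    """Searchable text collected for a single image."""
    caption: str
    before: str
    after: str


def nearest_paragraph(blocks: List[Block], start: int, step: int) -> str:
    """Walk from `start` in direction `step` (-1 or +1) to the closest paragraph."""
    i = start + step
    while 0 <= i < len(blocks):
        if blocks[i].kind == "paragraph":
            return blocks[i].text
        i += step
    return ""  # no paragraph on that side of the image


def describe_images(blocks: List[Block]) -> List[ImageRecord]:
    """Pair every image with its caption and its surrounding paragraphs."""
    records = []
    for i, block in enumerate(blocks):
        if block.kind != "image":
            continue
        # Assumption: a caption, when present, directly follows its image.
        caption = ""
        if i + 1 < len(blocks) and blocks[i + 1].kind == "caption":
            caption = blocks[i + 1].text
        records.append(
            ImageRecord(
                caption=caption,
                before=nearest_paragraph(blocks, i, -1),
                after=nearest_paragraph(blocks, i, +1),
            )
        )
    return records
```

Text gathered this way could then be attached to each upload as a description or as tags, which is also why imperfect OCR text would lead to imprecise tags.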

The software used isn’t perfect, so some of the tags on the images will admittedly be imprecise. Still, such a vast library of easily searchable content is great for learning, and the team hopes that libraries around the world will one day follow suit and digitize their own books and images.

Thank you Ars Technica for providing us with this information.

Image courtesy of Ars Technica.

 
