X

Business

Home Business Social Media

Facebook advances computer vision using hashtagged pictures

At F8, Facebook explains how it's more efficiently and more effectively training computer vision models with the help of photos already labeled with hashtags.

Written by Stephanie Condon, Senior Writer May 2, 2018 at 11:06 a.m. PT

Video: Despite controversy, Facebook still the go-to platform for lifestyle brands

Featured

I tested Lenovo's dual-screen laptop and it improved my productivity in profound ways
Samsung Galaxy Book 4 Ultra review: Should Windows users consider anything else?
Nothing's new $99 earbuds are the most stylish ones I've tested (and almost perfect)
The most charming projector I've tested has now replaced my TV for movie nights

Hashtagging pictures of your #pitbull on Instagram is accomplishing more than just connecting you to other dog lovers.

Facebook announced Wednesday that it's been using publicly available, hashtagged photos to train computer vision models -- and it's achieved breakthrough results.

Computer vision models typically rely almost entirely on hand-curated, human-labeled data sets. This makes it the biggest limiting factor in computer vision, Facebook CTO Mike Schroepfer said on Day 2 of the F8 developer conference in San Jose, Calif.

To address this, Facebook has instead trained models with a set of 3.5 billion publicly available images and 17,000 hashtags. By using 1,500 user-supplied hashtags as labels for a one billion-image version of the data set, Facebook achieved a record score of 85.4 percent accuracy on ImageNet, a common bench-marking tool.

In a blog post, Facebook explains that it trained a large-scale hashtag prediction model to sort through hashtags that aren't useful for tagging images, like #tbt.

In the future, Facebook plans to open source the embeddings of these models for the research community.

Read also: Facebook gives developers a tool for spotting phishing attempts

Schroepfer called AI "the foundation of everything we do" at Facebook, running through the company's various AI research efforts.

In addition to working on computer vision, Facebook is working on natural language processing. It's open sourcing Translate, a PyTorch language library, for fast machine translations. Schroepfer also mentioned Facebook's early work on Multilingual Unsupervised and Supervised Word Embeddings (MUSE), which should help increase the number of languages available for translation on Facebook.

Meanwhile, Facebook AI Research (FAIR) is working on advancing reinforcement learning. In partnership with researchers at Georgia Tech, FAIR developed a collection of virtual agents that use vision, dialog, and reasoning to physically navigate environments and answer questions. For instance, Schroepfer explained, the agent could answer a question like, "In which room is the light on?" Because the agents are trained in 3D virtual environments rather than the real world with robots, Facebook can train them several times faster. The research team is open sourcing their EmbodiedQA and House3D projects.

Facebook also announced PyTorch 1.0, the latest version of the open source framework PyTorch, as well as the expansion of the Open Neural Network Exchange (ONNX) format, which enables engineers to easily move AI models between frameworks.

Adjust these Facebook privacy settings to protect your personal data

More Facebook F8

Editorial standards

Show Comments

Related

OpenAI

OpenAI makes GPT-4 Turbo with Vision available to developers to unlock new AI apps

Zoom Workplace

Zoom gets its first major overhaul in 10 years, powered by generative AI

close up programmer man hand typing on keyboard laptop for register data system or access password at dark operation room , cyber security concept

All eyes on cyberdefense as elections enter the generative AI era