'Block reasoning' technique improves computer vision

By Christopher Jablonski | September 9, 2010, 10:40pm PDT

Computer scientists at Carnegie Mellon University have devised a method that enables computers to better understand an image by making assumptions about the physical constraints of the scene.

Credit: Carnegie Mellon University

The researchers say that, much as a child assembles toy blocks into something resembling a building, a computer could analyze an outdoor scene by using virtual blocks to build a three-dimensional approximation of the image, based on parameters of volume and mass.

“When people look at a photo, they understand that the scene is geometrically constrained,” said Abhinav Gupta of CMU’s Robotics Institute. “We know that buildings aren’t infinitely thin, that most towers do not lean, and that heavy objects require support. It might not be possible to know the three-dimensional size and shape of all the objects in the photo, but we can narrow the possibilities.”

According to a university release, the new method first breaks the image down into segments that correspond to objects in the scene. Once the ground and sky are identified, the other segments are assigned potential geometric shapes. The shapes are categorized as light or heavy, depending on appearance; a surface that appears to be a brick wall, for instance, would be classified as heavy. The computer then tries to reconstruct the image using the virtual blocks. If a heavy block appears unsupported, the computer must substitute an appropriately shaped block, or assume that the original block was obscured in the image.
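The support rule described above can be illustrated with a toy sketch. This is not the Carnegie Mellon implementation: the `Block` class, its fields, and the flat 2-D coordinates are all assumptions made for illustration. A block counts as supported if it rests on the ground (height 0) or sits directly on top of another block that it overlaps horizontally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical 2-D stand-in for a virtual block (not the authors' data model)."""
    name: str
    x0: float      # left edge
    x1: float      # right edge
    bottom: float  # height of the lower face; the ground is at 0
    top: float     # height of the upper face
    density: str   # "light" or "heavy", as in the article


def overlaps_horizontally(a, b):
    """True when the two blocks share some horizontal extent."""
    return min(a.x1, b.x1) - max(a.x0, b.x0) > 0


def is_supported(block, others, tol=1e-6):
    """A block is supported if it touches the ground or rests on another block."""
    if abs(block.bottom) <= tol:
        return True
    return any(
        other is not block
        and abs(other.top - block.bottom) <= tol
        and overlaps_horizontally(block, other)
        for other in others
    )


def unsupported_heavy_blocks(blocks):
    """Names of heavy blocks that would have to be replaced or explained by occlusion."""
    return [b.name for b in blocks
            if b.density == "heavy" and not is_supported(b, blocks)]


# Made-up scene: the balcony floats in mid-air, so it is flagged.
scene = [
    Block("wall", 0, 4, 0, 3, "heavy"),
    Block("roof", 0, 4, 3, 4, "heavy"),
    Block("balcony", 5, 6, 2, 3, "heavy"),
    Block("tree_crown", 7, 9, 2, 4, "light"),
]
print(unsupported_heavy_blocks(scene))  # ['balcony']
```

In this sketch a flagged block would be the cue either to try a different block shape or to assume that its support is hidden from view; how the real system makes that choice is not described in the article.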

Understanding outdoor scenes remains one of the great challenges of computer vision and artificial intelligence. One approach has been to identify features of a scene, such as buildings, roads and cars, but this provides no understanding of the scene's geometry, such as the location of walkable surfaces. Another approach, pioneered in part by the same Carnegie Mellon team, has been to map the planar surfaces of an image to create a rough 3-D depiction of the scene, similar to a pop-up book. But that approach can lead to depictions that are highly unlikely and sometimes physically impossible.

The “qualitative volumetric” approach is so new, according to Gupta, that no established datasets or evaluation methodologies exist for it. “In estimating the layout of surfaces, other than sky and ground, the method is better than 70 percent accurate, and its performance is almost as good when comparing its segmentation to ground truth.”
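The article does not say how the accuracy figure was computed. One plausible reading, offered here only as an assumption, is the fraction of labeled regions whose predicted surface orientation matches the ground truth, with sky and ground excluded. The function name, the label strings, and the sample data below are all hypothetical.

```python
def layout_accuracy(predicted, truth, ignore=("sky", "ground")):
    """Fraction of correctly labeled entries, skipping ground-truth sky and ground."""
    if len(predicted) != len(truth):
        raise ValueError("predicted and truth must have the same length")
    scored = [(p, t) for p, t in zip(predicted, truth) if t not in ignore]
    if not scored:
        raise ValueError("nothing left to score after ignoring sky and ground")
    correct = sum(p == t for p, t in scored)
    return correct / len(scored)


# Made-up labels for seven regions; one of the five scored regions is wrong.
truth = ["sky", "left", "left", "front", "front", "right", "ground"]
predicted = ["sky", "left", "front", "front", "front", "right", "ground"]
print(layout_accuracy(predicted, truth))  # 0.8
```

The numbers here are illustrative only and say nothing about the reported results.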

Gupta presented the research, which he conducted with Alexei Efros and Robotics Professor Martial Hebert, at the European Conference on Computer Vision, Sept. 5-11 in Crete, Greece.

Related reading:

Researchers rethink approaches to computer vision


Christopher Jablonski is a freelance technology writer.


