An enormous gap exists between human abilities and machine performance when it comes to understanding the visual world from images and videos. Humans are still way out in front.
“People are the best vision systems we have,” said Devi Parikh assistant professor in the Bradley Department of Electrical and Computer Engineering at Virginia Tech. “If we can figure out a way for people to effectively teach machines, machines will be much more intelligent than they are today.”
In her research, Parikh is proposing to use visual abstractions or cartoons to teach machines. She works from the idea that concepts that are difficult to describe textually may be easier to illustrate. By having thousands of online crowd workers manipulate clipart images to mimic photographs, she seeks to teach a computer to understand the visual world like humans do. Read more.