Letting neural networks be weird

  • About Me
  • Contact me
  • FAQ
  • Merchandise
  • Random
  • Archive
  • RSS

The Visual Chatbot

image

There is a delightful algorithm called Visual Chatbot that will answer questions about any image it sees. It’s a demo by a team of academic researchers that goes along with a recent machine learning research paper (and a challenge for anyone who’d like to improve on it), and its performance is pretty state-of-the-art, meant to demonstrate image recognition, language comprehension, and spatial awareness.

However, there are a couple of interesting things to note about this algorithm.

  1. It was trained on a large but very specific set of images.
  2. It is not prepared for images that aren’t like the images it saw in training.
  3. When confused, it tends not to admit it.

Now, Visual Chatbot was indeed trained on a huge variety of images. It can answer fairly involved questions about a lot of different things, and that’s impressive. The problem is that humans are very weird, and there are still many things it’s never seen. (This turns out to be a major challenge for self-driving cars.) And given Visual Chatbot’s tendency to react to confusion by digging itself a deeper hole, this can lead to some pretty surreal interactions.

image

Another thing about Visual Chatbot is that most of the images it’s been trained on have something in them - a bird, a person, an animal. It may have never seen an image of just rocks, or a plain stick lying on dirt. So even if there isn’t an animal there, it will be convinced there is. This means this bot always thinks it’s on the best safari ever. (For the record, it thought the stick lying on dirt was “a bird is standing on a rock in the snow”)

image

If you ask it enough questions, could you get an idea of how it made its mistakes?

image

An algorithm that can explain itself is really useful. Algorithms make mistakes all the time, or accidentally learn the wrong thing. This particular algorithm didn’t have trouble with hallucinating sheep like some other algorithms I tested. But it did have similar problems with goats in trees, and now I finally got to ask why.

image

[Goat image: Fred Dunn]

Upon further questioning, however, it also decided that dogs also have horns, and birds do not fly. Actually, it turns out that a lot depends on how you ask the question. The answer to “do bunnies fly?” is “no”, but the answer to “can bunnies fly?” is “yes”, so either the algorithm is answering a lot of these questions at random, or bunnies *can* fly but choose not to. (The construction “Do <blank> have <blank>?” seems to almost always result in a “yes”, so I can report that yes, bunnies do have spaceships and lightsabers.)

image

So I wouldn’t necessarily believe Visual Chatbot’s answer to my question about the zoo rocks thing. In fact, it seems to have learned to give explanations that are total lies - if it doesn’t know the color of something, it’ll answer “it’s a black and white photo so i can’t tell” without realizing that this excuse only works on an actual black and white photo.

image

It’s too bad this is so tricky. Since algorithms can often be biased, it would be great if we could ask them  “Why did you show me that ad?” or “Why did you decline my application?”. But getting a sensible answer from them may not be all that straightforward, especially if they pretend they know more than they actually do.

Bonus material: Visual Chatbot explains the plot of The Last Jedi. Enter your email and be edified (contains spoilers - sort of.)

image
    • #neural networks
    • #image recognition
    • #visual chatbot
    • #chatbot
    • #best safari
  • 1 week ago
  • 5025
  • Permalink
Share

Short URL

TwitterFacebookPinterestGoogle+

5025 Notes/ Hide

  1. kc-79 reblogged this from alexis-royce
  2. goshujinsamadesu reblogged this from goatsandgangsters
  3. tintenfischie liked this
  4. catamiteamongthepigeons liked this
  5. asseenelsewhere liked this
  6. metronomeblue liked this
  7. 28bestkessel reblogged this from goatsandgangsters
  8. goatsandgangsters reblogged this from glumshoe
  9. -rhiannon- liked this
  10. ehh-what reblogged this from writing-ace
  11. mildly-human reblogged this from writing-ace
  12. mildly-human liked this
  13. writing-ace reblogged this from nolivingunderstarlight
  14. iamalexia61 liked this
  15. frittering-sandcastle reblogged this from proximally
  16. proximally reblogged this from the-blue-phantom
  17. biobeetleholmcross reblogged this from ceescedasticity
  18. soap-in-space reblogged this from glumshoe
  19. smallphoenix13 reblogged this from lewisandquark
  20. talkingbirdguy reblogged this from yourunicornmind
  21. talkingbirdguy liked this
  22. drewskiskiski reblogged this from lewisandquark
  23. memeing-chicken reblogged this from peanutworm
  24. aa-gray liked this
  25. ripley-rainy-day liked this
  26. jekel45 liked this
  27. wormantheherm reblogged this from glumshoe
  28. wormantheherm liked this
  29. wolffyluna liked this
  30. risingape reblogged this from glumshoe
  31. lick-my-b00bs liked this
  32. ivy-red liked this
  33. lupismaris liked this
  34. undeadandinbed liked this
  35. drakanekurashiki liked this
  36. nine-luzgar reblogged this from raininginadelaide
  37. whyarebagels reblogged this from fearsquared
  38. starbuckspizza-blog liked this
  39. quantumducky reblogged this from fearsquared
  40. quantumducky liked this
  41. afriendlyirin liked this
  42. kagamika reblogged this from rabbiteclair
  43. fearsquared reblogged this from umblrgumblr
  44. homesung liked this
  45. obligateanthrovore reblogged this from lewisandquark
  46. thewonderfulking reblogged this from wetwareproblem
  47. ev-in-spookyland reblogged this from lewisandquark
  48. andonehyena liked this
  49. foredoomedrhythm reblogged this from child-of-crows
  50. nolivingunderstarlight reblogged this from lewisandquark
  51. Show more notesLoading...
← Previous • Next →

Portrait/Logo

About

I train neural networks, a type of machine learning algorithm, to write unintentional humor as they struggle to imitate human datasets. Well, I intend the humor. The neural networks are just doing their best to understand what's going on. Currently located on the occupied land of the Arapahoe Nation.
https://wandering.shop/@janellecshane

Twitter

loading tweets…

  • RSS
  • Random
  • Archive
  • Mobile
Effector Theme — Tumblr themes by Pixel Union