
Limits to Deep Reasoning in Vision
by Bernard Murphy on 08-01-2016 at 7:00 am

If you are a regular reader, you’ll know I like to explore the boundaries of technology. Readers I respect sometimes interpret this as a laughable attempt to oppose the inevitable march of progress, but that is not my purpose. In understanding the limits of a particular technology, it is possible to envision what properties a successor technology should have. And that, to me, seems more interesting than assuming all further progress in that direction will be no more than fine-tuning.

Take deep learning and vision. Recent progress in this direction has been quite astounding; in one example, systems have bested humans at identifying dog breeds. These systems are now used in cars for driver assistance and safety applications – detecting lane markings, collision hazards, even traffic signs. Increasingly, Google and Facebook use image recognition to search for and tag people, animals and objects in images. It seems we’ve almost conquered automated image recognition at better-than-human levels. But have we really, and if so, is that good enough?

While progress in deep reasoning has been impressive, there have also been some fairly spectacular fails. Microsoft was forced to retire a chatbot after it developed racist and other unpleasant tendencies. Google had to remove the “gorilla” tag from its Photos app after complaints that it was identifying dark-skinned people as gorillas. And Google released open-source software that identifies surrealist collages of faces in what we would consider perfectly ordinary images (in fairness, Google was pushing the software to see what would happen).

You could argue that this is just the normal progression for technology. Perhaps once the bugs are worked out, these problems will be rare. But I am skeptical that the solutions as they stand just need better training. Our own fallibility in image recognition should be a hint. It’s common to see faces and other images in complex, irregular patterns if we stare at them for a while. This phenomenon is called pareidolia, a bias of the brain to see patterns, particularly faces, in random images. I can’t imagine why deep reasoning should be immune to this problem; after all, we modeled the method on human reasoning, so it would be surprising if it did not also inherit the weaknesses of that approach. In fact, the Google software that produced the surrealist images is known to have this bias.
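To make that bias concrete, here is a minimal sketch of the gradient-ascent trick behind that kind of software: instead of adjusting the net to fit an image, you adjust the image itself until a chosen layer’s activations grow stronger, which exaggerates whatever patterns the net is already primed to see. This is an illustration only, not Google’s actual code (which was built on Caffe); it assumes PyTorch, a pretrained VGG16, and arbitrary choices of layer index, step count and step size.

```python
import torch
import torchvision.models as models

# Pretrained feature extractor; the specific network is an assumption.
model = models.vgg16(weights=models.VGG16_Weights.DEFAULT).features.eval()

def dream(image, layer_index=20, steps=20, lr=0.05):
    """Exaggerate whatever patterns one layer already 'sees' in `image`.

    image: a (1, 3, H, W) float tensor, roughly ImageNet-normalized.
    """
    image = image.clone().requires_grad_(True)
    for _ in range(steps):
        activation = image
        for i, layer in enumerate(model):
            activation = layer(activation)
            if i == layer_index:
                break
        # Gradient ascent on the input: maximizing the activation norm
        # amplifies the layer's favorite patterns -- machine pareidolia.
        loss = activation.norm()
        loss.backward()
        with torch.no_grad():
            image += lr * image.grad / (image.grad.abs().mean() + 1e-8)
            image.grad.zero_()
    return image.detach()
```

Run on a photo of clouds or foliage, the loop surfaces eyes, faces and animal shapes that were never there – the same over-eager pattern matching described above.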

How good the recognition has to be may depend on the application, but clearly there is room for improvement, and for some applications the bar is going to be very high. More training might help, up to a point. So might more hidden layers, though apparently the value of adding layers drops off sharply beyond a relatively small number. Ultimately we have to acknowledge that the only straightforward way to fix deep reasoning problems today is to try harder, which is not an encouraging place to start when you want to find breakthrough solutions.

Or perhaps we could go back to how we think. Most of us don’t instantly convert what we think we see into action; we consider multiple factors and pass our conclusions through multiple filters. This is so apparent that we all know people who seem to lack these safeguards; we consider them socially challenged (or worse). Now think of a cascade of neural nets, each trained in a different way. Deep learning pipelines for particle detection at the Large Hadron Collider (LHC) already work this way, combining different approaches – neural nets and boosted decision trees – to weed out false positives. That alone might be a good start, with a first-order goal of defaulting to “I don’t know” when recognition is ambiguous.
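As a sketch of that first-order goal (the predict_proba interface, the ensemble members and the confidence threshold are all illustrative assumptions, not anything from the LHC pipelines): poll several independently trained classifiers – some could be nets, some decision-tree ensembles – and abstain whenever they disagree or their collective confidence is low.

```python
import numpy as np

def cautious_predict(classifiers, x, min_confidence=0.9):
    """Return a class label only when the ensemble is unambiguous.

    classifiers: independently trained models, each assumed to expose a
    predict_proba(x) method returning a (n_classes,) probability vector.
    """
    probs = np.stack([c.predict_proba(x) for c in classifiers])
    votes = probs.argmax(axis=1)          # each model's top choice
    mean_conf = probs.max(axis=1).mean()  # average top-class probability
    # Disagreement among the differently trained models, or low average
    # confidence, counts as ambiguity: default to "I don't know".
    if len(set(votes)) > 1 or mean_conf < min_confidence:
        return "I don't know"
    return int(votes[0])
```

The design choice here is that abstention is cheap and a confident wrong answer is expensive – the opposite of how most benchmark-driven classifiers are scored.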

Training more nets and combining methods would be more expensive, and the outcome may initially be more ambiguous than we might like. But maybe that’s an inescapable reality of improved recognition. Perhaps we should think of what we have today as hind-brain recognition – good for quick (fight-or-flight) reactions but, like the hind-brain, not up to the ultra-high-fidelity recognition for which we will need better tools.

I’m sure that, however this evolves, the field will continue to be called deep learning, but that’s just a label. For one insight into the limitations of existing architectures and newer methods, see HERE. You can see the Google surrealist art HERE.
