
The Higgs Boson and Machine Learning
by Bernard Murphy on 08-11-2016 at 7:00 am

Technology in and around the LHC can sometimes be a useful exemplar for how technologies may evolve in the more mundane world of IoT devices, clouds and intelligent systems. I recently wrote about how LHC teams manage Big Data; here I want to look at how they use machine learning to study and reduce that data.

The reason high-energy physics needs this kind of help is to manage the signal-to-noise problem. Of the O(10¹²) events per hour, only ~300 produce Higgs bosons. Real-time pre-filtering reduces this torrent to O(10⁶) events per hour, but that is still a very high noise level for a 300-event signal. Despite this, the existence of the Higgs has been confirmed with a significance of 5σ, but the physics doesn’t end there. Now we want to study the properties of the particle (there are actually multiple types), but the signal-to-noise problems appeared so daunting that CERN launched a challenge in 2014 soliciting machine-learning methods to further reduce candidate interactions.
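To see why classification help is needed, a back-of-envelope estimate using the naive s/√b approximation (a rough rule of thumb, not the LHC's actual statistical treatment) puts those rates in perspective:

```python
# Naive signal-significance arithmetic for the rates quoted above.
# s/sqrt(b) is a crude approximation, valid only for large background counts.
import math

after_trigger   = 1e6   # events/hour surviving real-time pre-filtering
signal_per_hour = 300   # events/hour that actually produce a Higgs boson

naive_sigma = signal_per_hour / math.sqrt(after_trigger)
print(f"one hour of unselected triggered data: ~{naive_sigma:.2f} sigma")  # ~0.30

# Significance grows like sqrt(running time), so without better selection
# a 5-sigma observation would need roughly:
hours = (5.0 / naive_sigma) ** 2
print(f"hours needed for 5 sigma: ~{hours:.0f}")  # ~278
```

Hundreds of hours of beam time just to approach 5σ on an unselected sample is what makes aggressive, trustworthy event selection so valuable.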

The tricky part here is that you don’t want to rush to publish your solution to quantum gravity or dark matter, only to find a systematic error in the machine-learning-based data analysis. So the standards for accuracy and freedom from bias and systematic error are very high, suggesting that the LHC may also be beating a path for the rest of us in machine learning.

The CERN machine-learning challenge required no understanding of high-energy physics. The winning method, provided by Gabor Melis, used an ensemble of neural nets. There’s a lot of detail to the method, but one topic is especially interesting: the careful methods and intensive effort put into avoiding over-fitting the data (which shows up as false positives). I recently commented on a potential weakness in neural-net methods: if you train to see X, you will have a bias to see X, even in random data. So how do you minimize that bias?
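One standard defense is out-of-fold validation: never judge a model on events it trained on. The sketch below is generic scikit-learn with synthetic features, not Melis's actual ensemble; its labels are deliberately pure noise, so an honest out-of-fold score should collapse to chance (AUC ≈ 0.5), which is exactly the check that exposes a model trained to "see X" in random data.

```python
# A sketch of out-of-fold validation against over-fitting; generic
# scikit-learn, not Gabor Melis's neural-net ensemble. Features and labels
# are synthetic, and the labels here carry no real signal at all.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 20))            # stand-in event features
y = (rng.random(2000) < 0.3).astype(int)   # random labels: pure noise

clf = GradientBoostingClassifier(n_estimators=100, max_depth=3)

# Every event is scored by a model that never saw it in training; a model
# that memorizes noise can look perfect in-sample but falls to AUC ~ 0.5 here.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(clf, X, y, cv=cv, scoring="roc_auc")
print(f"out-of-fold AUC: {np.round(scores, 3)} (mean {scores.mean():.3f})")
```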

The method used both to generate training data and to test the significance of “discoveries” in that data is Monte Carlo simulation, a technique that has been in use for many decades in high-energy physics (my own starting point many years ago). The simulation models not only event dynamics but also detector efficiency. Out of this come high-dimensional representations of each event, which form the training input for each challenge participant’s method. Since the data is simulated, it is easy to inject events from interactions of special interest at any desired rate, to test classification metrics.
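A toy version of that workflow might look like the following. The exponential background shape, the resonance placed at 125 GeV, and the resolution numbers are all invented for illustration; CERN's actual simulation chain is vastly more detailed.

```python
# A toy Monte Carlo event generator: background follows a smoothly falling
# invariant-mass spectrum, signal is a narrow resonance, and a crude
# Gaussian smearing stands in for detector resolution. All numbers invented.
import numpy as np

rng = np.random.default_rng(42)

def generate_events(n, signal_fraction=0.003):
    is_signal = rng.random(n) < signal_fraction
    # Background: exponentially falling mass spectrum above threshold.
    mass = rng.exponential(scale=60.0, size=n) + 80.0
    # Signal: narrow resonance injected at the chosen rate, here near 125 GeV.
    mass[is_signal] = rng.normal(loc=125.0, scale=1.5, size=is_signal.sum())
    # Detector response: finite resolution smears every measurement.
    return mass + rng.normal(scale=2.0, size=n), is_signal.astype(int)

# Truth labels come for free in simulation, so selection efficiency and
# purity can be measured exactly on the generated sample.
m, label = generate_events(100_000)
window = (m > 120) & (m < 130)
print(f"events in window: {window.sum()}, purity: {label[window].mean():.3f}")
```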

Deep neural nets and boosted-tree algorithms dominated the successful entries. The challenge was also important in enabling cross-validation and comparison between techniques. To ensure objectivity between entries, CERN defined statistical likelihood measures and used them to grade each competitor’s solution. The competition, together with these measures, is a large part of how CERN gained confidence that bias in the algorithms had been minimized. But they also commented that the statistical metrics used are still very much a work in progress.
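The grading measure for the challenge was the Approximate Median Significance (AMS). A sketch of the formula, as I read it from the HiggsML challenge documentation (including its b_reg = 10 regularization term), shows how it rewards purity over raw signal count:

```python
# Approximate Median Significance (AMS), following the HiggsML challenge
# documentation; b_reg = 10 is the regularization term defined there.
import math

def ams(s, b, b_reg=10.0):
    """AMS for s expected signal and b expected background events passing
    a selection; reduces to roughly s/sqrt(b) when s << b."""
    return math.sqrt(2.0 * ((s + b + b_reg) * math.log(1.0 + s / (b + b_reg)) - s))

# A tight selection can beat a loose one even while discarding most signal:
print(f"loose cut: {ams(300.0, 1e6):.2f}")    # ~0.30: drowned in background
print(f"tight cut: {ams(120.0, 400.0):.2f}")  # ~5.7: discovery-level
```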

I should also stress that these methods are not yet being used to detect particles. They are only being used to reduce the data set, through classification, to a sample that can be analyzed with more traditional methods. And in practice a wide variety of techniques is in use on the ATLAS and CMS experiments (two of the detectors at the LHC), including neural nets and boosted decision trees, plus pattern recognition on events, energy and momentum regression, identification of individual components within events, and others.
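In that spirit, here is a minimal sketch of classification-as-data-reduction, again using a generic boosted-tree classifier from scikit-learn with synthetic features and an illustrative cut, not any experiment's actual selection:

```python
# Score events with a boosted-tree classifier and keep only the
# highest-scoring slice for the downstream traditional analysis.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

rng = np.random.default_rng(1)
n = 20_000
y = (rng.random(n) < 0.05).astype(int)           # rare signal class
X = rng.normal(size=(n, 10)) + y[:, None] * 0.8  # signal slightly shifted

# Train on one half, filter the other half (never score training events).
clf = GradientBoostingClassifier().fit(X[: n // 2], y[: n // 2])
scores = clf.predict_proba(X[n // 2 :])[:, 1]

# Keep the top-scoring 5% of events; the rest never reach the physics analysis.
keep = scores >= np.quantile(scores, 0.95)
purity = y[n // 2 :][keep].mean()
print(f"kept {keep.mean():.1%} of events with signal purity {purity:.2f}")
```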

And yet even with all this care, machine-learning methods are not out of the woods. One of the event types of interest is the decay of a Higgs boson to two photons, a so-called di-photon event. The existence of the Higgs is in no doubt, but a recent di-photon analysis looking in a different mass range found (with 3σ significance) an apparent resonance at 750 GeV, which might have heralded a major new physics discovery.

But subsequent measurements this year erased the evidence that a new particle had been detected. Whether the initial false detection points back to weaknesses in the machine-learning algorithms or to human error, it should serve as a reminder that when you are trying to see very weak signals against a large background, eliminating systematic errors is very, very hard. I think it also points to the power of multiple independent viewpoints or, if you like, the power of the crowd. This underpins a core strength of the scientific method: independent and repeatable validation.

You can learn more about the CERN challenge HERE. A more comprehensive discussion of the total solution can be found HERE. And a report on the non-existent 750 GeV resonance can be found HERE.
