Array

Instance

Array
(
    [title] => Recent Forum Threads
    [title_url] => 
    [ignore_sticky] => 0
    [exclude_current] => 0
    [limit] => 10
    [sluglist] => ["jobs-dashboard"]
    [rw_opt] => Array
        (
            [widget_select] => 1
            [pageid_281769] => 1
            [pageid_281772] => 1
        )

    [display_widget_mobile] => 
    [rw_opt_exclude] => Array
        (
            [pageid_274493] => 1
            [cpt_podcast] => 1
            [cpta_podcast] => 1
            [category_16613] => 1
            [category_16631] => 1
            [taxonomy_series] => 1
        )

    [node_id] => Array
        (
            [0] => 2
        )

)

Threads

Recent Article Comments

The Data Crisis is Unfolding – Are We Ready?
Sounds like this trend will be a major driver for HBM memory to cut down on the traffic load. Also,…

— Arthur Hanson on April 12, 2024
MZ Technologies Enables Multi-Die Design with GENIO
it looks Siemens may have intereset to acquries this company

— yanfeng on April 9, 2024
Strong End to 2023 Drives Healthy 2024
I think downward revisions are in the very near future!

— Daniel Nenni on April 9, 2024
Pinning Down an EUV Resist’s Resolution vs. Throughput
Here the red line drawn is for 3s/avg=10%, more as a reference. What was meant to be highlighted was the…

— Fred Chen on April 8, 2024
Pinning Down an EUV Resist’s Resolution vs. Throughput
In practice, how much std deviation of secondary electron is acceptable?

— Jeffstar on April 8, 2024
ASML moving to U.S.- Nvidia to change name to AISi & acquire PSI Quantum
Very entertaining, especially the "rollover Beethoven" nomenclature 😀

— ChrisBoyce on April 7, 2024
Intel and TSMC IDM 2024 Discussions
I am not sure there is much flex between DRAM and Logic (Scotten Jones would know better). The Samsung wafer…

— Mark Webb on April 5, 2024
How MZ Technologies is Making Multi-Die Design a Reality
I think Siemens EDA has a competing product now but maybe.

— Daniel Nenni on April 5, 2024
Intel and TSMC IDM 2024 Discussions
I feel like yes. Samsung fabs are bigger than TSMC fabs and can theoretically flex equipment between logic and memory…

— nghanayem on April 4, 2024
Intel and TSMC IDM 2024 Discussions
Would it be fair to expect Samsung foundry cost to be lower than TSMC? There were reports they were able…

— Fred Chen on April 3, 2024

WP_Term Object
(
    [term_id] => 151
    [name] => General
    [slug] => general
    [term_group] => 0
    [term_taxonomy_id] => 151
    [taxonomy] => category
    [description] => 
    [parent] => 0
    [count] => 441
    [filter] => raw
    [cat_ID] => 151
    [category_count] => 441
    [category_description] => 
    [cat_name] => General
    [category_nicename] => general
    [category_parent] => 0
)

July 19, 2016 by Bernard Murphy

Technology, Shakespeare, Linguistics and Combatting Terror

Technology, Shakespeare, Linguistics and Combatting Terror
by Bernard Murphy on 07-19-2016 at 7:00 am
Categories: General

My brother Sean is working on post-doctoral research in linguistics, especially the use of language in Shakespeare’s plays. Which may seem like a domain far removed from the interests of the technologists who read these blogs, but stick with me. This connects in unexpected ways to analytics of interest to us techies, and ultimately to a topic of interest to every reasonable person worldwide.

Let me start with Sean’s research. His goal has been to understand the different use of language, for example pronouns, between soliloquies in the comedies, history plays and tragedies. I won’t tax the patience of SemiWiki readers by going into the details – if you want to know more, there’s a link at the end of this blog. His approach is based on something called Corpus Linguistics – analysis of a body of writing to find trends and correlations.

Since Shakespeare’s works, prolific though he was, fit comfortably into one large, small-print volume, analysis of an electronic version can be performed easily with desktop software. Think of a statistical analysis package applied to language rather than numbers, looking at frequencies of word usage, or words used in close proximity. There are multiple software packages (from small and probably mostly academic vendors) for this type of analysis.

Automated analysis of language depends on recognition, and recognition at a basic word level can be very straightforward; even recognizing inflected words as variants of the base word is not complex in English. Going further than word recognition requires tagging the text (“this is the subject in this sentence” for example) or some level of natural language recognition, which gets you into the domain of Google’s SyntaxNet and deep-learning technologies.

Corpus Linguistics methods are not limited to published works. Domains within the Internet are obvious candidates for analysis, where Big Data analytics and deep learning methods can be valuable. But to what purpose? There are perhaps lots of interesting market analyses that could be done in this way, but one much more compelling application is to detect impending terrorist attacks.

Sean’s own department (at Lancaster University in the UK) is active in research in this area, as are a number of other universities. Each group is predominantly looking at social media posts from identified terrorists. The Lancaster group are looking at word “collocation”, measuring the closeness of connection between significant words and the name of a person or place. “Attack” and “crowded” would be an obvious example. This can be used to establish positive or negative associations; increasing frequency of such connections then potentially indicates an upcoming attack.

While approaches like this are clearly not foolproof, they can provide valuable supporting evidence when combined with other indicators. Also for me this general domain illustrates opportunities we often miss in sticking to our own silos of expertise. Technologies that we do understand are often used in domains far from those we might expect. And bigger pictures, combining needs and techniques from widely differing domains, can often suggest solutions that silo experts might miss.

You can learn more about Sean’s research HERE and the work on terrorist post analysis HERE.

Comments

There are no comments yet.

You must register or log in to view/post comments.

The Data Crisis is Unfolding – Are We Ready?
Sounds like this trend will be a major driver for HBM memory to cut down on the traffic load. Also,…

— Arthur Hanson on April 12, 2024
MZ Technologies Enables Multi-Die Design with GENIO
it looks Siemens may have intereset to acquries this company

— yanfeng on April 9, 2024
Strong End to 2023 Drives Healthy 2024
I think downward revisions are in the very near future!

— Daniel Nenni on April 9, 2024
Pinning Down an EUV Resist’s Resolution vs. Throughput
Here the red line drawn is for 3s/avg=10%, more as a reference. What was meant to be highlighted was the…

— Fred Chen on April 8, 2024
Pinning Down an EUV Resist’s Resolution vs. Throughput
In practice, how much std deviation of secondary electron is acceptable?

— Jeffstar on April 8, 2024
ASML moving to U.S.- Nvidia to change name to AISi & acquire PSI Quantum
Very entertaining, especially the "rollover Beethoven" nomenclature 😀

— ChrisBoyce on April 7, 2024
Intel and TSMC IDM 2024 Discussions
I am not sure there is much flex between DRAM and Logic (Scotten Jones would know better). The Samsung wafer…

— Mark Webb on April 5, 2024
How MZ Technologies is Making Multi-Die Design a Reality
I think Siemens EDA has a competing product now but maybe.

— Daniel Nenni on April 5, 2024

Search Semiwiki

Recent Forum Threads

Recent Article Comments

Recent Podcast Episodes

Comments

Recent Forum Threads

Recent Article Comments