Overview
We study the landscape of product innovation in the computer software sector, leveraging publicly available opportunity data news articles obtained from Dow Jones, a business news and data provider. We implement a series of Bidirectional Encoder Representations from Transformers (BERT) neural networks, a sophisticated natural language processing method, for a number of tasks. Our work developed a BERT classification model to identify news articles describing innovation broadly, making use of a training set of 600 manually labeled articles and demonstrating an accuracy rate over 96%. This model was then applied to 1 year’s worth of news articles about the computer software industry to predict which articles describe innovation. We applied a different BERT algorithm to this set of predicted innovation articles for the purposes of named entity recognition, which was used here to extract the company and new product names mentioned in these predicted innovation-describing articles.
Teaser Video
Zoom Link
Project Website
Fellow
Digvijay Ghotane
Georgetown University, Department of Public Policy
Interns
Aditi Mahabal
University of Virginia, College of Arts and Sciences
Akilesh S Ramakrishna
University of Virginia, College of Arts and Sciences & Batten School of Leadership and Public Policy
Mentors
Neil Alexander Kattampallil
Research Scientist, Biocomplexity Institute, University of Virginia
Devika Mahoney-Nair
Research Scientist, Biocomplexity Institute, University of Virginia
Gizem Korkmaz
Research Associate Professor, Biocomplexity Institute, University of Virginia
Stakeholder
Gary Anderson
Research & Development Statistics Program, National Center for Science and Engineering Statistics