Award Abstract # 1750911
CAREER: SPARK: A Theoretical Framework for Discovering Complex Patterns in Big Attributed Networks

NSF Org: IIS
Div Of Information & Intelligent Systems
Recipient: RESEARCH FOUNDATION FOR THE STATE UNIVERSITY OF NEW YORK, THE
Initial Amendment Date: February 12, 2018
Latest Amendment Date: June 26, 2019
Award Number: 1750911
Award Instrument: Continuing Grant
Program Manager: Wei Ding
IIS
 Div Of Information & Intelligent Systems
CSE
 Direct For Computer & Info Scie & Enginr
Start Date: May 1, 2018
End Date: December 31, 2019 (Estimated)
Total Intended Award Amount: $537,044.00
Total Awarded Amount to Date: $206,164.00
Funds Obligated to Date: FY 2018 = $24,977.00
FY 2019 = $0.00
History of Investigator:
  • Feng Chen (Principal Investigator)
    feng.chen@utdallas.edu
Recipient Sponsored Research Office: SUNY at Albany
1400 WASHINGTON AVE
ALBANY
NY  US  12222-0100
(518)437-4974
Sponsor Congressional District: 20
Primary Place of Performance: University at albany SUNY
1400 washington ave unh 104
albany
NY  US  12222-0100
Primary Place of Performance
Congressional District:
20
Unique Entity Identifier (UEI): NHH3T1Z96H29
Parent UEI: NHH3T1Z96H29
NSF Program(s): Info Integration & Informatics
Primary Program Source: 01001819DB NSF RESEARCH & RELATED ACTIVIT
01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 1045, 7364
Program Element Code(s): 736400
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Recent advances in sensing and computing techniques have led to a need for massive quantities of data to be aggregated from heterogeneous information sources in fields such as science, engineering, and business that are naturally modeled in the form of big attributed networks. A big attributed network (BAN) is characterized by a combination of (a) high-dimensional and heterogeneous network topologies and (b) high-dimensional and heterogeneous attribute data. Effective analysis of BAN data relies on simultaneous subgraph mining and feature selection for discovering complex patterns that are interesting or significant. However, as yet little has been done to bridge these two important research areas. The focus of this project is therefore to unify a wide range of complex pattern discovery tasks including, for example, the detection and forecasting of societal events (disasters, civil unrest), anomalous patterns (disease outbreaks, cyberattacks), discriminative subnetworks (cancer diagnosis), knowledge patterns (new knowledge building) and storylines (intelligence analysis), and to resolve the fundamental modeling, algorithmic, and interactive challenges associated with ubiquitous BAN data in today's big data era. This project incorporates the resulting research outcomes into the curricula of interdisciplinary courses on topics such as complex pattern detection in BAN data presented at seminars, tutorials, and workshops, and will include outreach activities such as a big data analytics summer camp for local K-12 education in New York's Capital Region.

The research objectives of this project are: (1) the development of a unified, theoretical framework for discovering complex patterns in BAN data in various kinds of tasks; (2) making the inference computationally tractable in extremely large combined spaces composed of vertices, edges, and attributes; and (3) rendering the detected heterogeneous patterns transparent and interpretable and incorporating heterogeneous user feedback into the detection process. The research approach includes the development of: (1) novel principled methods capable of learning complex patterns of interest directly from BAN data; (2) near-linear-time common inference algorithms capable of optimizing a variety of BAN-specific model objectives that are subject to different constraints on subgraph topologies and structured sparsity models; and (3) a computer-interpretable language-based system capable of modeling and interpreting rich user feedback schemes in BAN data. More details can be found at: http://www.cs.albany.edu/~fchen/projects/BAN/.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Tao, Rongrong and Zhou, Baojian and Chen, Feng and Mares, Dvid and Butler, Patrick and Ramakrishnan, Naren and Kennedy, Ryan "Detecting Media Self-Censorship without Explicit Training Data" Proceedings of the 2020 SIAM International Conference on Data Mining , 2020 https://doi.org/10.1137/1.9781611976236.62 Citation Details
Chunpai Wang, Daniel Neill "Calibrated Nonparametric Scan Statistics for Anomalous Pattern Detection in Graphs" Thirty-Sixth AAAI Conference on Artificial Intelligence , 2022 Citation Details
Killamsetty, Krishnateja and Zhou, Xujiang and Chen, Feng and Iyer, Rishabh "RETRIEVE: Coreset Selection for Efficient and Robust Semi-Supervised Learning" Proceedings of the Thirty-Five Neural Information Processing Systems , 2021 Citation Details
Cadena, Jose and Chen, Feng and Vullikanti, Anil "Graph Anomaly Detection Based on Steiner Connectivity and Density" Proceedings of the IEEE , v.106 , 2018 https://doi.org/10.1109/JPROC.2018.2813311 Citation Details

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page