Researchers use environmental justice questions to reveal geographic biases in ChatGPT

A U.S. map shows counties where residents could (blue) or could not (pink) receive local-specific information about environmental justice issues. Photo courtesy of Junghwan Kim.

Virginia Tech researchers have discovered limitations in ChatGPT’s capacity to provide location-specific information about environmental justice issues. Their findings, published in the journal Telematics and Informatics, suggest the potential for geographic biases existing in current generative artificial intelligence (AI) models.

Ismini Lourentzou, assistant professor in the College of Engineering and core faculty at the Sanghani Center for Artificial Intelligence and Data Analytics, is a co-author on the paper. Read full story here.


Congratulations to Sanghani Center’s 2023 Summer and Fall Graduates

Virginia Tech’s 2023 Fall Commencement ceremonies take place today. The Graduate School Commencement Ceremony will be held in Cassell Coliseum at 1:30 p.m. and will be live-streamed.

“We celebrate our Summer and Fall graduates who have worked so hard to achieve their graduate degrees,” said Naren Ramakrishnan, the Thomas L. Phillips Professor of Engineering in the Department of Computer Science at Virginia Tech and director of the Sanghani Center for Artificial Intelligence and Data Analytics. They deserve all the congratulations coming their way and we wish them all the best as they embark on their new journeys.”

The following Sanghani Center students are among those who are receiving degrees:

Ph.D. Graduates

Aman Ahuja, advised by Edward Fox, has earned a Ph.D. in computer science. His research focused on document understanding, search and retrieval, and question-answering to improve the accessibility of long PDF documents, such as books and dissertations. His dissertation, “Analyzing and Navigating Electronic Theses and Dissertations” was awarded the 2023 Innovative Student Thesis Award by the Networked Digital Library of Theses and Dissertations (NDLTD). Ahuja has joined DocuSign in Seattle, Washington, as an applied scientist.

Arka Daw, advised by Anuj Karpatne, has earned a Ph.D. in computer science. His research centers around the emerging field of science-guided machine learning, where machine learning models are integrated with scientific knowledge (or physics) to ensure better interpretability and generalizability while enforcing scientific consistency. The title of his dissertation is “Physics-informed Machine Learning with Uncertainty Quantification.”  Daw is joining Oak Ridge National Lab (ORNL) in Knoxville, Tennessee, as a Distinguished Staff Fellow.

Chris Grubb, advised by Leanna House, has earned a Ph.D. in statistics. His research focuses on developing a statistical learning method of population synthesis that allows for propagation of uncertainty from sample data into synthetic populations of agents. The title of his dissertation is “Inference for Populations: Uncertainty Propagation via Bayesian Population Synthesis.” Grubb has joined Virginia Tech’s Center for Biostatistics and Health Data Science in Roanoke, Virginia, as a research scientist.

Whitney Hayes, co-advised by Ashley Reichelmann and Naren Ramakrishnan, has earned a Ph.D. in sociology. Her research focus is on identity. The title of her dissertation is “Enhancing Identity Theory Measurement: A Case Study in Ways to Advance the Subfield.” Hayes also received a graduate certificate in urban computing offered through the Sanghani Center. She has joined Elevate, a climate action nonprofit based in Chicago, Illinois, and works remotely as a research analyst. 

Brian Keithadvised by Chris North, has earned a Ph.D. in computer science. His research focuses on how to represent, extract, and visualize information narratives to aid analysts in their narrative sensemaking process. The title of his dissertation is “Narrative Maps: A Computational Model to Support Analysts in Narrative Sensemaking.” Keith has joined the Catholic University of the North in Chile as an assistant professor in the Department of Computing and Systems Engineering. 

Shuo Lei, advised by Chang-Tien Lu, has earned a Ph.D. in computer science. Her research focuses on few-shot learning and domain adaptation. The title of her dissertation is “Learning with Limited Labeled Data: Techniques and Applications.” Lei has joined Sony Research in San Jose, California, as a research scientist.

Lei Zhang, advised by Chang-Tien Lu, has earned a Ph.D. in computer science. His research focuses on bi-level optimization, neural architecture search, and graph neural networks. The title of his dissertation is “Bilevel Optimization in the Deep Learning Era: Methods and Applications.”

Ming Zhu, co-advised by Daphne Yao and Ismini Lourentzou, has earned a Ph.D. in computer science. Her research focus is on Machine Learning and Natural Language Processing. The title of her dissertation is “Neural Sequence Modeling for Domain-Specific Language Processing: A Systematic Approach.” Zhu has joined ByteDance in Seattle, Washington, as a research scientist.

Master’s Degree Graduates

Nikhil Abhyankar, advised by Ruoxi Jia, has earned a master’s degree in electrical and computer engineering. His research focus is on machine learning privacy and security. The title of his master’s thesis is “Data Centric Defenses for Privacy Attacks.” Abhyankar has joined the Virginia Tech Department of Computer Science to pursue a Ph.D.

Humaid Desaiadvised by Hoda Eldardiry, has earned a master’s degree in computer science. His research focuses on enhancing the efficiency and resource utilization of Federated Learning in resource-constrained and heterogeneous environments. The title of his master’s thesis is “REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments.” Desai is joining Ellucian in Reston, Virginia, as a software engineer.

Chongyu He, advised by Edward Fox, has earned a master’s degree in computer science. His research primarily revolves around the application of advanced deep learning techniques for cell organelle segmentation in high-resolution microscopy images. The title of He’s master’s thesis is “Deep Learning Approach for Cell Nuclear Pore Detection and Quantification over High Resolution 3D Data.”

Junho Oh, advised by Lynn Abbott, has earned a master’s degree in Computer Engineering. His research focus is machine learning. The title of Oh’s master’s thesis is “Estimation of Global Illumination using Cycle-Consistent Adversarial Networks.”

Akash Sonth, advised by Abhijit Sarkar and Lynn Abbott, has earned a master’s degree in computer engineering. His research focus is on the application of machine learning in driver safety and intelligent transportation. The title of his master’s thesis is “Enhancing Road Safety through Machine Learning for Prediction of Unsafe Driving Behaviors.”  Sonth has joined the Aspen Technology office located in Bedford, Massachusetts, as a data scientist.

Surendrabikram Thapa, co-advised by Anuj Karpatne and Abhijit Sarkar, has earned a master’s degree in computer science. His research focus is multimodal learning, computer vision, and natural language processing applications. The title of his master’s thesis is “Deidentification of Face Videos in Naturalistic Driving Scenarios.” Thapa also received a graduate certificate in data analytics offered by the Sanghani Center. He has joined the Virginia Tech Transportation Institute (VTTI) as a research faculty.


‘Curious Conversations’ podcast: Ismini Lourentzou talks about AI’s potential as an assistant

“Curious Conversations” is produced by the Virginia Tech Office of Research and Innovation.

Ismini Lourentzou joined Virginia Tech’s “Curious Conversations” to chat about artificial intelligence (AI) and machine learning related to personal assistants, as well as her student team’s recent experience with the Alexa Prize TaskBot Challenge 2. 

About Lourentzou

Lourentzou is an assistant professor in the Department of Computer Science and core faculty at the  Sanghani Center for Artificial Intelligence and Data Analytics. She is also an affiliate faculty member of the National Security Institute and the Center for Advanced Innovation in Agriculture.

Read more and listen here.


Aman Ahuja garners 2023 Innovative Student Thesis Award from Networked Digital Library of Theses and Dissertations

Aman Ahuja

The Networked Digital Library of Theses and Dissertations (NDLTD) has awarded its 2023 Innovative Student Thesis Award to Aman Ahuja, who was a Ph.D. student in computer science at the Sanghani Center for Artificial Intelligence and Data Analytics.

Ahuja defended his dissertation this past summer and is currently an applied scientist at DocuSign in Seattle, Washington. His advisor was Edward Fox.

The organization’s annual award supports student efforts to transform the genre of the dissertation through the use of innovative research data management techniques and software to create multimedia Electronic Theses and Dissertations (ETDs). It includes a cash award and travel scholarship funds to attend a future ETD Symposium.  

Following is an excerpt from the email Ahuja received from the chair of the NDLTD Awards Committee notifying him of this honor:

“Your thesis, “Analyzing and Navigating Electronic Theses and Dissertations,” provides a technical framework to expand the access to the content of millions of published theses, like yours, which are constrained in their usability and usefulness by the portable document format. Current digital libraries are institutional repositories with the objective being content archiving, they often lack end-user services needed to make this valuable data useful for the scholarly community. To effectively utilize such data to address the information needs of users, digital libraries should support various end-user services such as document search and browsing, document recommendation, as well as services to make navigation of long PDF documents easier and accessible. Your research and dissertation directly addresses these concerns in creative and beneficial ways.”

Ahuja earned a bachelor’s degree in information systems from Birla Institute of Technology & Science, India, where, as part of his undergraduate studies, he was also a visiting scholar at Carnegie Mellon University in Pittsburgh, Pennsylvania. 


Teaming up to beat the heat

Assistant Professor Theo Lim of the School of Public and International Affairs presents on his research during the 2023 State of the College program. Photo by Andrew Adkins for Virginia Tech.

This summer marked the Earth’s hottest on record.

The Roanoke Valley was no exception to the heat, with news reports naming 2023 as the region’s second-hottest summer. But the rising temperatures were particularly stifling for some neighborhoods in Roanoke —  those impacted by harmful urban planning practices.

Naren Ramakrishnan, the Thomas L. Phillips Professor of Engineering and director of the Sanghani Center for Artificial Intelligence and Data Analytics, and Nathan Self, research associate at the center, are on a team of researchers led by Theodore Lim, who will use a National Science Foundation grant to work with Roanoke communities to combat the impact of rising temperatures and promote healing among those impacted by harmful urban planning practices. Read full story here.


Sanghani Center leads collaborative study to improve both discovery and traceability of illegally-sourced timber

Reference sample collections from World Forest ID

Virginia Tech has received funding from the National Science Foundation for a collaborative research project that brings machine learning and data science research to the domain of Stable Isotope Ratio Analysis (SIRA) to improve discovery and traceability of illicitly-sourced timber products. Illegal timber trade (ITT) is the most profitable natural-resource crime, valued at 50-152 billion U.S. dollars per year.

Naren Ramakrishnan, the Thomas L. Phillips Professor of Engineering and director of the Sanghani Center for Artificial Intelligence and Data Analytics, is serving as principal investigator for the project with the University of Washington, World Forest ID, and Simeone Consulting, LLC.

“To enforce timber regulations and international frameworks, there is a need for accurate, cost-effective, and high-throughput tools that can be used to identify and trace illegally sourced timber products,” Ramakrishnan said. 

The team brings together data scientists, analytical chemists, geospatial and remote sensing scientists, practitioners, international trade and supply chain specialists, and field experts who conduct reference sample expeditions to bring novel data science approaches to analyzing a range of geospatial and remotely sensed datasets.

Patrick Butler, senior research associate, and Brian Mayer, research associate at the Sanghani Center will be part of the Virginia Tech team.

Key foci of this project include machine learning methods for SIRA analytics; location determination from isotopic ratios; and active sampling strategies to close the loop. Foundational machine learning contributions in science-guided machine learning, contrastive and generative learning paradigms, and active sampling algorithms will support not only the specific domain of SIRA but other adjacent domains in environmental conservation, agricultural forecasting, and smart farm modeling. 

“For example, what we learn from our research could be directly applicable to tracing many other illicitly-sourced products and product inputs, including forest risk commodities such as cocoa, soy, and beef,” said L. Monika Moskal, professor at the University of Washington.

The study will have broad and far-reaching impacts on American security and prosperity, as well. 

“Many key U.S. adversaries rely on illegal logging to finance their activities,” said Jade Saunders, executive director at World Forest ID. “Detecting and curbing such activities will moderate sources of regional instability and threats to U.S. interests.”

The project will lead to improving geospatial prediction accuracy of product origin and will enable a cost-benefit analysis to minimize future data collection costs and optimize prediction gain. Finally, this project will also positively affect U.S. economic competitiveness by reducing competition with illicit actors and moderating risks to international trade, Ramakrishnan said.


Human-Centered Future of Work Symposium set for Nov. 3

Sue Ge, director of ICAT’s Center for Future Work Places and Practices, addresses faculty at the center’s spring networking event. Virginia Tech photo

As technology continues to revolutionize industries and alter the nature of everyday life, the future of work can seem unclear. Virginia Tech’s Institute for Creativity, Arts, and Technology (ICAT) is bringing together a wide breadth of expertise to discuss this topic during the Human-Centered Future of Work Symposium.

Sponsored by the Department of Economics and the Kohl Center, AAEC, the symposium feature a policy roundtable discussion that aims to search for the common ground on the human-centered future of work.

One of the panelists is Chris North is a professor of computer science at Virginia Tech and the associate director of the Sanghani Center for Artificial Intelligence and Data Analytics. Read full story here.


New Data and Decision Sciences Building encourages collaboration to address world’s data challenges

The Data and Decision Sciences Building. Photo by Noah Alderman for Virginia Tech.

Virginia Tech’s new Data and Decision Sciences Building has opened its doors to students, faculty, staff, and industry professionals ready to tackle some of the world’s most pressing data challenges. Completed in the summer, the 120,000-gross-square-foot facility houses multiple colleges including the Pamplin College of BusinessCollege of Engineering, and College of Science.

Several faculty from the Department of Computer Science have offices located in the building, along with labs and classrooms that allow students to experience and interact with the latest computational technologies. The new visualization lab features a high-resolution power wall with multi-touch functionality. Coupled with SAGE3 software developed by researchers in the Sanghani Center for Artificial Intelligence and Data Analytics under a $5 million dollar National Science Foundation grant, the high-resolution screen enables the display and organization of large amounts of media, data analytics, and visualizations. Read full story here.


Sanghani Center Student Spotlight: Syuan-Ying Wu

Poster for published paper “MetroScope: An Advanced System for Real-Time Detection and Analysis of Metro-Related Threats and Events via Twitter”

Metro systems are vital to many people’s daily lives, but they face safety or reliability challenges, such as criminal activities or infrastructure disruptions. Real-time threat detection and analysis are crucial to ensure their safety and reliability. 

Syuan-Ying (Justin) Wu, a master’s degree student in computer science whose research focuses on social media analytics and software development, is currently part of a research team that is working with the Washington Metropolitan Area Transit Authority (WMATA) to address these issues.  

With fellow students at the Sanghani Center and his advisor, Chang-Tien Lu, Wu has been instrumental in developing the MetroScope real-time threat/event detection system that can automatically analyze event development; prioritize events based on urgency; send emergency notifications via emails; provide efficient content retrieval; and self-maintain the system.

“This is a great improvement over many existing systems that can detect the event but cannot analyze it or prioritize it,” Wu said. “And our system offers other advantages like not having to continuously monitor system notifications.”

Their collaborative paper, “MetroScope: An Advanced System for Real-Time Detection and Analysis of Metro-Related Threats and Events via Twitter,” was published in the proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval held in Taipei, Taiwan, this past summer.

Wu, who earned a bachelor’s degree in applied mathematics at Fu Jen Catholic University in Taiwan, said this research collaboration with a metropolitan metro system is a good example of what led him to pursue his master’s degree at Virginia Tech and the Sanghani Center. “The exceptional computer science program and distinguished professors have offered me the opportunity to find ways of applying cutting-edge technology to tackle a real-world problem,” he said. “It has been the perfect environment to achieve my goals.”

Projected to graduate this fall, Wu hopes to secure a position as a software engineer. 


Ming Jin receives NSF grant to introduce antifragility into power systems

Ming Jin

Ming Jin, an assistant professor in electrical and computer engineering and core faculty at the Sanghani Center has received a National Science Foundation grant to revolutionize the design of learning-enabled, safety-critical systems, with a special focus on power systems.

The grant was awarded under the Safe Learning-Enabled Systems (SLES), a partnership between the NSF, Open Philanthropy, and Good Ventures.

Jin will collaborate with Javad Lavaei, professor in Industrial Engineering and Operations Research at the University of California Berkeley.

The project introduces antifragility, a concept that goes beyond robustness which can be compared to a sturdy structure that remains unyielding in a storm but does not grow or adapt from the experience; or resilience which is like a rubber band: when stretched, it can recover by going back into its original shape. 

“We are not merely designing systems to withstand challenges of rare and unpredictable events, but to flourish because of them,” Jin said. 

The task of preserving end-to-end safety of the power system will be crucial, Jin said, though it is complex amidst distributional shifts, driven by the growing complexity and unpredictability of the environment. 

The project will addresses safety challenges through three interconnected research thrusts. The first thrust targets the creation of proactive, antifragile systems that anticipate and adapt to changes, using advanced techniques such as meta-safe learning and offline reinforcement learning. The second thrust bolsters system antifragility through multi-agent systems, encouraging exploration, cooperation, and distributed control to ensure resilience and safety, even under significant disturbances. The third thrust is devoted to validation and stress testing, employing multi-objective adversarial learning and real-world case studies to better handle rare or unexpected events.

“Our algorithms are more than just learners; they’re evolvers. By turning continual threats into avenues for enhancement, we are redefining what safety in power systems looks like,” he said.

Four students advised by Jin will work with him on the project: Vanshaj KhattarAhmad Al-TawahaZain ul Abdeen, andBilgehan Sel.