Zhao & Qiao - Graph Neural Networks for Real World Fraud Detection | PyData Amsterdam 2023

Описание к видео Zhao & Qiao - Graph Neural Networks for Real World Fraud Detection | PyData Amsterdam 2023

Fraud is a major problem for financial services companies. As fraudsters change tactics, our detection methods need to get smarter. Graph neural networks (GNNs) are a promising model to improve detection performance. Unlike traditional machine learning models or rule-based engines, GNNs can effectively learn from subtle relationships by aggregating neighborhood information in the financial transaction networks. However, it remains a challenge to adopt this new approach in production.

The goal of this talk is to share best practices for building a production ready GNN solution and hopefully spark your interest to apply GNNs to your own use cases.

In this talk, we focus on suspicious account detection for online marketplaces. These platforms allow users to set up shops and sell products with little friction. Unfortunately, this attracts fraudsters who abuse these platforms. We use GNNs to do supervised learning based on accounts previously flagged as fraudulent, so that we can learn from both account properties and the relationship between accounts. However, productionizing GNNs is a big challenge. Addressing this challenge purely using open source packages is the main focus of this talk.

We first give an overview of GNN-based fraud detection. Then we deep dive into utilizing PySpark and GraphFrames to build a transaction graph in a scalable way and convert it to DGL (Deep Graph Library) format. Next we share our experiences of setting up training and inference graphs in different time intervals, and deploying the end-to-end model pipeline in Airflow.

Attendees are required to have a basic understanding of machine learning. In this informative talk, they will gain insights into fraud detection's challenges and learn best practices to productionize GNNs.

Bios:
Feng Zhao
Feng is a senior data scientist at Adyen. He is passionate about solving real business problems using innovative AI/machine learning approaches. He received his Ph.D. from the National University of Singapore.

Tingting Qiao
Senior data scientist in Adyen, working in the Score team focusing on fraud detection.
Having PhD background in computer vision and natural language processing using deep neural networks. Familiar with prediction models, such as regression, classification models, etc., as well as the latest research techniques, such as adversarial learning, neural networks etc. Several years of experience with popular deep learning frameworks.

===

www.pydata.org

PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.

00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.

Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVi...

Комментарии

Информация по комментариям в разработке