Behavioral graph fraud detection in E-commerce

hgpu.org » Applications » Finance » Behavioral graph fraud detection in E-commerce

Behavioral graph fraud detection in E-commerce

Hang Yin, Zitao Zhang, Zhurong Wang, Yilmazcan Ozyurt, Weiming Liang, Wenyu Dong, Yang Zhao, Yinan Shan

eBay China, Shanghai, China

arXiv:2210.06968 [cs.LG], (13 Oct 2022)

DOI:10.48550/arXiv.2210.06968

@misc{https://doi.org/10.48550/arxiv.2210.06968,

doi={10.48550/ARXIV.2210.06968},

url={https://arxiv.org/abs/2210.06968},

author={Yin, Hang and Zhang, Zitao and Wang, Zhurong and Ozyurt, Yilmazcan and Liang, Weiming and Dong, Wenyu and Zhao, Yang and Shan, Yinan},

keywords={Machine Learning (cs.LG), FOS: Computer and information sciences, FOS: Computer and information sciences},

title={Behavioral graph fraud detection in E-commerce},

publisher={arXiv},

year={2022},

}

Download (PDF)

View

Source

2129

views

In e-commerce industry, graph neural network methods are the new trends for transaction risk modeling.The power of graph algorithms lie in the capability to catch transaction linking network information, which is very hard to be captured by other algorithms.However, in most existing approaches, transaction or user connections are defined by hard link strategies on shared properties, such as same credit card, same device, same ip address, same shipping address, etc. Those types of strategies will result in sparse linkages by entities with strong identification characteristics (ie. device) and over-linkages by entities that could be widely shared (ie. ip address), making it more difficult to learn useful information from graph. To address aforementioned problems, we present a novel behavioral biometric based method to establish transaction linkings based on user behavioral similarities, then train an unsupervised GNN to extract embedding features for downstream fraud prediction tasks. To our knowledge, this is the first time similarity based soft link has been used in graph embedding applications. To speed up similarity calculation, we apply an in-house GPU based HDBSCAN clustering method to remove highly concentrated and isolated nodes before graph construction. Our experiments show that embedding features learned from similarity based behavioral graph have achieved significant performance increase to the baseline fraud detection model in various business scenarios. In new guest buyer transaction scenario, this segment is a challenge for traditional method, we can make precision increase from 0.82 to 0.86 at the same recall of 0.27, which means we can decrease false positive rate using this method.

Tags: Clustering, Finance, Machine learning, Neural networks, Security

October 23, 2022 by hgpu

Rating: 5.0/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org