Cryptocurrency abuse reporting services are a valuable data source about abusive blockchain addresses, prevalent types of cryptocurrency abuse, and their financial impact on victims. However, they may suffer data pollution due to their crowd-sourced nature. This work analyzes the extent and impact of data pollution in cryptocurrency abuse reporting services and proposes a novel LLM-based defense to address the pollution. We collect 289K abuse reports submitted over 6 years to two popular services and use them to answer three research questions. RQ1 analyzes the extent and impact of pollution. We show that spam reports will eventually flood unchecked abuse reporting services, with BitcoinAbuse receiving 75% of spam before stopping operations. We build a public dataset of 19,443 abuse reports labeled with 19 popular abuse types and use it to reveal the inaccuracy of user-reported abuse types. We identified 91 (0.1%) benign addresses reported, responsible for 60% of all the received funds. RQ2 examines whether we can automate identifying valid reports and their classification into abuse types. We propose an unsupervised LLM-based classifier that achieves an F1 score of 0.95 when classifying reports, an F1 of 0.89 when classifying out-of-distribution data, and an F1 of 0.99 when identifying spam reports. Our unsupervised LLM-based classifier clearly outperforms two baselines: a supervised classifier and a naive usage of the LLM. Finally, RQ3 demonstrates the usefulness of our LLM-based classifier for quantifying the financial impact of different cryptocurrency abuse types. We show that victim-reported losses heavily underestimate cybercriminal revenue by estimating a 29 times higher revenue from deposit transactions. We identified that investment scams have the highest financial impact and that extortions have lower conversion rates but compensate for them with massive email campaigns.
翻译:加密货币滥用报告服务是获取有关恶意区块链地址、主流加密货币滥用类型及其对受害者财务影响的重要数据来源。然而,由于其众包性质,这些服务可能遭受数据污染。本研究分析了加密货币滥用报告服务中数据污染的程度与影响,并提出了一种基于大语言模型(LLM)的新型防御方法以应对污染。我们收集了六年间提交至两个主流服务的28.9万份滥用报告,并据此回答三个研究问题。RQ1分析了污染的程度与影响:研究表明,垃圾报告最终将淹没未受监管的滥用报告服务,其中BitcoinAbuse在停止运营前接收的报告中垃圾报告占比达75%。我们构建了包含19,443份标注有19种主流滥用类型的公开数据集,并据此揭示用户上报滥用类型的不准确性。研究发现91个(0.1%)被误报的良性地址却接收了全部资金的60%。RQ2探讨了自动化识别有效报告及其滥用类型分类的可行性:我们提出一种基于LLM的无监督分类器,在报告分类任务中F1分数达0.95,在分布外数据分类中F1分数达0.89,在垃圾报告识别中F1分数达0.99。该无监督LLM分类器显著优于两个基线模型:有监督分类器及原始LLM直接调用。最后,RQ3验证了所提LLM分类器在量化不同加密货币滥用类型财务影响方面的实用性:通过估算存款交易产生的收入,发现犯罪收入实际是受害者上报损失的29倍。研究还表明投资诈骗造成的财务影响最大,而勒索攻击虽转化率较低,但通过大规模邮件活动进行补偿。