Code for "Detection of LLM-Generated Java Code Using Discretized Nested Bigrams" (arXiv:2502.15740). Achieves state-of-the-art performance in distinguishing human vs. LLM-written Java.
-
Updated
May 15, 2025 - Java
Code for "Detection of LLM-Generated Java Code Using Discretized Nested Bigrams" (arXiv:2502.15740). Achieves state-of-the-art performance in distinguishing human vs. LLM-written Java.
Explainable multi-modal ML for detecting malicious PyPI packages. Three-modality detection (metadata + AST static analysis + code stylometry), SHAP-driven Ladisa taxonomy mapping (7 attack vectors), real-time CLI scanner, and live PyPI monitoring. F1=0.9993 on 18.5K packages.
Add a description, image, and links to the code-stylometry topic page so that developers can more easily learn about it.
To associate your repository with the code-stylometry topic, visit your repo's landing page and select "manage topics."