code-stylometry

Here are 2 public repositories matching this topic...

tipaek / NestedBigramsResearch

Code for "Detection of LLM-Generated Java Code Using Discretized Nested Bigrams" (arXiv:2502.15740). Achieves state-of-the-art performance in distinguishing human vs. LLM-written Java.

nlp machine-learning natural-language-processing bigrams feature-engineering discretization authorship-attribution binary-classification abstract-syntax-tree source-code-analysis nested-bigrams llm-generated-code code-stylometry

Updated May 15, 2025
Java

sarowarzahan414 / supplyshield

Star

Explainable multi-modal ML for detecting malicious PyPI packages. Three-modality detection (metadata + AST static analysis + code stylometry), SHAP-driven Ladisa taxonomy mapping (7 attack vectors), real-time CLI scanner, and live PyPI monitoring. F1=0.9993 on 18.5K packages.

machine-learning static-analysis pypi-package open-source-security supply-chain-security code-stylometry pip-security

Updated May 14, 2026
Python

Improve this page

Add a description, image, and links to the code-stylometry topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the code-stylometry topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly