New Two-Stage Framework Improves Zero-Shot Anomaly Detection on High-Dimensional Tabular Data
Global: New Two-Stage Framework Improves Zero-Shot Anomaly Detection on High-Dimensional Tabular Data
Background
Researchers have identified a persistent challenge in supervised deep learning for high‑dimensional tabular datasets, termed “generalization collapse,” where models perform well on known distributions but fail dramatically on out‑of‑distribution (OOD) inputs. This limitation hampers reliable anomaly detection in dynamic environments such as network traffic monitoring.
Method Overview
The authors propose a hierarchical, two‑stage representation learning system called Latent Sculpting. The approach separates structural learning from density estimation, aiming to create a compact latent manifold before applying precise probability modeling.
Stage One: Latent Sculpting
In the first stage, a hybrid architecture combining a one‑dimensional convolutional neural network (1D‑CNN) with a Transformer encoder is trained using a novel Dual‑Centroid Compactness Loss (DCCL). Unlike conventional contrastive losses that depend on triplet mining, DCCL optimizes global cluster centroids to enforce a hyperspherical, low‑entropy cluster for benign traffic, thereby imposing topological constraints on the latent space.
Stage Two: Density Estimation
The second stage conditions a Masked Autoregressive Flow (MAF) on the pre‑structured manifold produced by stage one. This enables the model to learn an exact density estimate over the sculpted latent space, facilitating precise anomaly scoring without further altering the learned structure.
Evaluation
The framework was evaluated on the CIC‑IDS‑2017 benchmark, treated as a proxy for complex, non‑stationary data streams. The study focused on strictly zero‑shot anomaly detection, meaning the model encountered anomalous patterns it had never seen during training.
Results
According to the authors, supervised baselines suffered catastrophic performance collapse on unseen distribution shifts, achieving an F1 score of approximately 0.30. The strongest unsupervised baseline reached an F1 score of 0.76. In contrast, the Latent Sculpting framework attained an F1 score of 0.87 on zero‑shot anomalies and reported an 88.89% detection rate on “Infiltration” scenarios, where state‑of‑the‑art supervised models recorded 0.00% accuracy.
Implications
The findings suggest that explicitly sculpting the latent manifold before density estimation can substantially enhance zero‑shot generalization in anomaly detection. The authors argue that decoupling structure learning from density estimation may provide a scalable pathway toward more robust detection systems in evolving data environments.This report is based on information from arXiv, licensed under Academic Preprint / Open Access. Based on the abstract of the research paper. Full text available via ArXiv.
Ende der Übertragung