Multi-Frequency Fusion for Sturdy Video Face Forgery Detection

Present face video forgery detectors use huge or dual-stream backbones. We present {that a} single, light-weight fusion of two handcrafted cues can obtain greater accuracy with a a lot smaller mannequin. Based mostly on the Xception baseline mannequin (21.9 million parameters), we construct two detectors: LFWS, which provides a 1×1 convolution to mix a low-frequency Wavelet-Denoised Function (WDF) with the phase-only Spatial-Section Shallow Studying (SPSL) map, and LFWL, which merges WDF with Native Binary Patterns (LBP) in the identical manner. This further module provides solely 292 parameters, conserving the full at 21.9 million—smaller than F3Net (22.5 million) and fewer than half the scale of SRM (55.3 million). Even with this minimal overhead, the fused fashions enhance the typical space underneath the curve (AUC) from 74.8% to 78.6% on FaceForensics++ and from 70.5% to 74.9% on DFDC-Preview, positive aspects of three.8% and 4.4% over the Xception baseline. Additionally they constantly outperform F3Net, SRM, and SPSL in eight public benchmarks, with out further information or test-time augmentation. These outcomes present that rigorously paired, handcrafted options, mixed via the light-weight fusion block, can present state-of-the-art robustness at a considerably decrease price. Our findings recommend a have to reevaluate scale-driven design decisions in face video forgery detection.

† Google
‡ Carnegie Mellon College
** Work accomplished whereas at Apple

Main Menu

What's Hot

BeatBanker Trojan Spreads by way of Phishing, Deploys Crypto Miner and RAT on Focused Gadgets

Expertise Is Reshaping Sleep Apnea Therapy

My New E-book On Vulnerability Nearly Killed Me…Actually

Multi-Frequency Fusion for Sturdy Video Face Forgery Detection

Speed up customized LLM deployment: Effective-tune with Oumi and deploy to Amazon Bedrock

Run Tiny AI Fashions Domestically Utilizing BitNet A Newbie Information

From Textual content to Tables: Characteristic Engineering with LLMs for Tabular Knowledge

Evaluating the Finest AI Video Mills for Social Media

Utilizing AI To Repair The Innovation Drawback: The Three Step Resolution

Midjourney V7: Quicker, smarter, extra reasonable

Meta resumes AI coaching utilizing EU person knowledge

BeatBanker Trojan Spreads by way of Phishing, Deploys Crypto Miner and RAT on Focused Gadgets

Expertise Is Reshaping Sleep Apnea Therapy

My New E-book On Vulnerability Nearly Killed Me…Actually

Speed up customized LLM deployment: Effective-tune with Oumi and deploy to Amazon Bedrock

Main Menu

Subscribe to Updates

What's Hot

Multi-Frequency Fusion for Sturdy Video Face Forgery Detection

Related Posts