<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SCL-DeBERTa: Multi-Author Writing Style Change Detection Enhanced by Supervised Contrastive Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kaichuan Lin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Chang Liu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Fuchuan Ye</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Zhongyuan Han</string-name>
          <email>hanzhongyuan@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Foshan University</institution>
          ,
          <addr-line>Foshan</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>Multi-author writing style change detection, a core challenge in the PAN@CLEF evaluations, requires precise localization of author transition points in collaboratively authored documents. This paper presents SCL-DeBERTa, a novel framework that integrates supervised contrastive learning (SCL) with DeBERTa for fine-grained style boundary detection. In the PAN 2025 evaluation, our team, xxsu-team, achieved F1 scores of 0.955 (ranking 3rd), 0.825 (ranking 1st), and 0.829 (ranking 2nd) on the three subtasks, respectively, establishing a new benchmark for multi-author writing style analysis. This performance demonstrates significant advancements in feature extraction robustness and cross-task adaptability.</p>
      </abstract>
      <kwd-group>
        <kwd>PAN 2025</kwd>
        <kwd>SCL-DeBERTa</kwd>
        <kwd>DeBERTa</kwd>
        <kwd>Supervised Contrastive Learning</kwd>
        <kwd>Feature Disentanglement</kwd>
        <kwd>Writing Style Change Detection</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>The identification of writing styles in multi-author documents has emerged as a core task in digital text
forensics, with numerous solutions based on pre-trained language models proposed in recent years.
Early approaches primarily relied on traditional feature engineering, such as lexical statistical features
and syntactic pattern analysis, but these methods exhibited limited generalization ability in cross-topic
scenarios.</p>
      <p>
        With the rise of Transformer models, researchers began exploring the adaptability of different
pre-trained architectures: Team Gladiators[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] systematically evaluated the performance of models such as
ELECTRA, SqueezeBERT, and RoBERTa, finding that RoBERTa performed best in cross-topic tasks due
to its dynamic masking strategy. However, its paragraph order reversal data augmentation strategy,
while alleviating class imbalance issues, failed to effectively capture deep stylistic features. Team
NYCU-NLP[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] innovatively integrated RoBERTa, DeBERTa, and ERNIE models, introducing a semantic
similarity correction mechanism based on LaBSE embeddings to explore the effects of model ensemble
and semantic enhancement on performance. This method achieved an F1 score of 0.863 on the hard
task, but the multi-model ensemble incurred over three times the computational cost, and the manually
set topic similarity threshold limited the method’s generalization ability. Chen et al.[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] optimized
DeBERTa’s embedding space using supervised contrastive learning, forcing paragraph vectors from
the same author to cluster together. Although this method improved sensitivity to stylistic features, it
relied on a complex paragraph pair reconstruction strategy, resulting in over 30% noisy false negatives
when the number of authors was unknown. Turning to large-model adaptation techniques, the xxsu-team[
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
was the first to apply LLAMA-3-8B to this task, compressing model parameters through Low-Rank
Adaptation (LoRA)[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Although it achieved an F1 score of 0.887 on the medium task, int8 quantization
led to representation information loss, causing performance fluctuations.
      </p>
      <p>
        In contrast, we adopted the lightweight DeBERTa as the encoder to embed data samples. Its
disentangled attention mechanism[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] separates content and positional information modeling, demonstrating
stronger topic independence in multi-author style analysis. Additionally, we utilized a supervised
contrastive learning framework to optimize model parameters. In the official PAN 2025 evaluation, our
system achieved F1 scores of 0.955, 0.825, and 0.829 on the easy, medium, and hard subtasks, respectively,
providing a reliable solution for real-world scenarios such as lightweight academic co-authorship
detection and legal document tampering identification.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Data Processing</title>
      <p>The PAN25 writing style analysis evaluation task focuses on paragraph-level stylistic change detection
in multi-author documents, requiring participating systems to precisely identify all writing style
transition boundaries within documents under strictly constrained conditions of author identity and
topic variation.</p>
      <sec id="sec-3-1">
        <title>3.1. Data Sources and Structure</title>
        <p>The dataset used in this study for stylistic change detection consists of multiple document pairs, with
each document corresponding to two files:
• problem-&lt;id&gt;.txt: Contains the original text content.</p>
        <p>• truth-problem-&lt;id&gt;.json: Contains the annotations for stylistic changes.</p>
        <p>The annotation information for each document is stored as a binary array, indicating whether a
stylistic change occurs between adjacent sentences:
changes = [c_1, c_2, . . . , c_{n−1}], where c_i ∈ {0, 1}
(1)
Here, n represents the total number of sentences in the document. The overall distribution of the dataset
is shown in Table 1.</p>
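The file pairing and the consistency constraint in Eq. (1) can be sketched as follows (the text and annotation below are invented toy data, not drawn from the PAN corpus):

```python
import json

# Toy stand-ins for one problem-/truth- file pair.
problem_text = "First sentence by author A.\nA second sentence, same author.\nNow a new author takes over."
truth_json = '{"changes": [0, 1]}'

sentences = problem_text.split("\n")
changes = json.loads(truth_json)["changes"]

# One annotation per adjacent sentence pair: the array has n - 1 entries.
assert len(changes) == len(sentences) - 1

# Positions where a style change occurs between sentence i and i+1.
change_points = [i for i, c in enumerate(changes) if c == 1]
print(change_points)  # [1]
```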
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Sample Structure</title>
        <p>Each generated training sample contains the following elements:
sample = {input_ids, attention_mask, label, doc_id, pair_idx}
(2)
where input_ids are the token IDs, attention_mask is the attention mask, label is the stylistic label, doc_id is the document identifier, and pair_idx is the sentence pair index.</p>
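A minimal sketch of assembling one such sample; the toy whitespace tokenizer stands in for the real DeBERTa tokenizer, and all field values are invented:

```python
# Sketch of building one training sample in the structure of Eq. (2).
def toy_encode(text, max_len=16):
    # Toy whitespace tokenizer: assigns incremental IDs, pads/truncates to max_len.
    vocab = {}
    ids = [vocab.setdefault(tok, len(vocab) + 1) for tok in text.split()]
    ids = ids[:max_len]
    mask = [1] * len(ids) + [0] * (max_len - len(ids))
    ids = ids + [0] * (max_len - len(ids))
    return ids, mask

ids, mask = toy_encode("He said . [SEP] She replied differently .")
sample = {
    "input_ids": ids,
    "attention_mask": mask,
    "label": 1,          # 1 = style change between the two sentences
    "doc_id": "problem-1",
    "pair_idx": 0,       # first adjacent sentence pair in the document
}
assert len(sample["input_ids"]) == len(sample["attention_mask"]) == 16
```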
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Support for Supervised Contrastive Learning</title>
        <p>To support the supervised contrastive learning framework, additional document-level metadata is
included in the samples:
• doc_id: A unique document identifier used to group sentence pairs from the same document.
• pair_idx: The position index of the sentence pair within the document.</p>
        <p>This metadata enables the model to learn representations of stylistic consistency within a document,
enhancing its ability to detect stylistic changes in long documents.</p>
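A sketch of how this metadata can group sentence pairs by document for contrastive batching (the samples below are toy values):

```python
from collections import defaultdict

# Group sentence-pair samples by doc_id so a contrastive batch can draw
# positives from within the same document (toy data).
samples = [
    {"doc_id": "problem-1", "pair_idx": 0, "label": 0},
    {"doc_id": "problem-1", "pair_idx": 1, "label": 1},
    {"doc_id": "problem-2", "pair_idx": 0, "label": 1},
]

by_doc = defaultdict(list)
for s in samples:
    by_doc[s["doc_id"]].append(s)

# Keep pairs within a document ordered by their position index.
for pairs in by_doc.values():
    pairs.sort(key=lambda s: s["pair_idx"])

assert sorted(by_doc) == ["problem-1", "problem-2"]
```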
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Error Handling Mechanism</title>
        <p>[Figure: error-handling flowchart — an input file passes through I/O exception handling, encoding error handling, and a consistency check; valid samples succeed, while invalid samples are automatically filtered out.]</p>
        <p>To ensure the robustness of data processing, the following error handling mechanisms are
implemented:</p>
        <p>• File I/O exception handling.</p>
        <p>• Encoding error handling.</p>
        <p>• Data consistency checks.</p>
        <p>• Automatic filtering of invalid samples.</p>
        <p>These mechanisms ensure that most usable data can still be processed effectively, even in cases of partial
data corruption or non-standard formatting.</p>
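A minimal sketch of these mechanisms; the load_pair helper below is hypothetical, not taken from the actual system:

```python
import json

def load_pair(text, truth_raw):
    """Return (sentences, changes), or None if the pair is unusable."""
    try:
        changes = json.loads(truth_raw)["changes"]   # may raise on malformed input
    except (ValueError, KeyError):
        return None                                  # decoding / format failure
    sentences = text.split("\n")
    if len(changes) != len(sentences) - 1:           # data consistency check
        return None                                  # automatic filtering
    return sentences, changes

assert load_pair("a\nb", '{"changes": [1]}') == (["a", "b"], [1])
assert load_pair("a\nb", "not json") is None              # malformed annotation filtered
assert load_pair("a\nb\nc", '{"changes": [1]}') is None   # inconsistent length filtered
```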
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Method</title>
      <sec id="sec-4-1">
        <title>4.1. Task Definition</title>
        <p>[Figure: SCL-DeBERTa architecture — a DeBERTa encoder produces CLS, token, and hierarchical features; a classification head outputs the style-change probability, while a projection head outputs a style vector optimized with a supervised contrastive loss.]</p>
        <p>The multi-author writing style detection task, established by the PAN Lab at the CLEF conferences, requires
precise localization of author transition points in collaboratively authored documents. Formally, given
a document D = {p_1, p_2, . . . , p_n} composed of n paragraphs, the goal is to identify the boundary set
B = {i | author(p_i) ≠ author(p_{i+1})}.</p>
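As a toy illustration with invented author labels, the boundary set can be computed directly from per-paragraph authorship:

```python
# Recover the boundary set B = {i | author(p_i) != author(p_{i+1})}
# from per-paragraph author labels (invented toy labels).
authors = ["A", "A", "B", "B", "C"]   # authors of paragraphs p_1..p_5

boundaries = {i for i in range(len(authors) - 1) if authors[i] != authors[i + 1]}
assert boundaries == {1, 3}   # style changes after p_2 and after p_4 (0-based pair indices)
```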
      </sec>
      <sec id="sec-4-2">
        <title>4.2. SCL-DeBERTa Architecture</title>
        <p>
          We propose SCL-DeBERTa, a novel framework integrating supervised contrastive learning with
DeBERTa[
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] encoder. As illustrated in Figure 2, the model employs dual-path processing:
h_CLS = DeBERTa(x)[0]
(3)
        </p>
        <p>Classification head: ŷ = σ(W2 · ReLU(W1 · h_CLS))
(4)</p>
        <p>Projection head: z = W4 · ReLU(W3 · h_CLS)
(5)
where W1 ∈ R^{256×768}, W2 ∈ R^{1×256}, W3 ∈ R^{256×768}, W4 ∈ R^{128×256} are learnable parameters, and σ denotes the sigmoid function.</p>
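A minimal numpy sketch of the two heads, using random weights of the stated shapes purely to check dimensions (the sigmoid on the classification head is an assumption, consistent with the BCE loss in Section 4.3; the real model learns W1..W4):

```python
import numpy as np

rng = np.random.default_rng(0)
h_cls = rng.standard_normal(768)            # [CLS] representation from DeBERTa

W1 = rng.standard_normal((256, 768))
W2 = rng.standard_normal((1, 256))
W3 = rng.standard_normal((256, 768))
W4 = rng.standard_normal((128, 256))

relu = lambda x: np.maximum(x, 0)
sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

y_hat = sigmoid(W2 @ relu(W1 @ h_cls))      # classification head: change probability
z = W4 @ relu(W3 @ h_cls)                   # projection head: 128-d style vector

assert y_hat.shape == (1,)
assert z.shape == (128,)
```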
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Supervised Contrastive Learning</title>
        <p>
          We introduce a label-guided contrastive loss[
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] to enhance style discrimination:
ℒ_scl = − (1/N) ∑_{i=1}^{N} (1/|P(i)|) ∑_{p ∈ P(i)} log [ exp(z_i · z_p / τ) / ∑_{a ≠ i} exp(z_i · z_a / τ) ]
(6)
where P(i) = {p | y_p = y_i} denotes the positive samples sharing the same style label, and τ = 0.07 is the
temperature hyperparameter.</p>
        <p>The joint optimization objective combines the classification and contrastive losses:
ℒ_total = ℒ_cls + λ · ℒ_scl, where ℒ_cls = BCE(ŷ, y)
(7)
with λ = 0.3 controlling the contrastive regularization strength.</p>
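Under the assumption of L2-normalized style vectors (a common choice in supervised contrastive learning, though the paper does not state normalization explicitly), the loss can be sketched in numpy as:

```python
import numpy as np

def scl_loss(z, labels, tau=0.07):
    # Supervised contrastive loss over a batch of style vectors z (N x d).
    z = z / np.linalg.norm(z, axis=1, keepdims=True)   # assumed L2 normalization
    n = len(labels)
    sim = z @ z.T / tau                                # temperature-scaled similarities
    total = 0.0
    for i in range(n):
        # Positives: same style label as i (excluding i itself).
        pos = [p for p in range(n) if labels[p] == labels[i] and p != i]
        if not pos:
            continue
        denom = sum(np.exp(sim[i, a]) for a in range(n) if a != i)
        total += -sum(np.log(np.exp(sim[i, p]) / denom) for p in pos) / len(pos)
    return total / n

rng = np.random.default_rng(0)
z = rng.standard_normal((6, 128))        # toy batch of six 128-d style vectors
labels = np.array([0, 0, 0, 1, 1, 1])    # toy style labels
loss = scl_loss(z, labels)
assert loss > 0
```

The total training loss would then combine this with the BCE term, e.g. `loss_total = loss_bce + 0.3 * loss_scl`, mirroring λ = 0.3 above.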
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Experiment</title>
      <sec id="sec-5-1">
        <title>5.1. Dataset</title>
        <p>The PAN 2025 multi-author writing style analysis evaluation task focuses on paragraph-level stylistic
change detection: by analyzing the stylistic consistency of consecutive sentence
pairs, participants must accurately identify all author transition boundaries. The task is designed
to strictly control the synchronization of author identity and topic variation, ensuring that stylistic
discrimination is not affected by topic shifts. Systems are evaluated across three difficulty levels based
on the complexity of document topics.</p>
        <p>1. Easy Subtask: The document topics are highly consistent, with frequent and obvious author
transitions, and significant stylistic differences between paragraphs.
2. Medium Subtask: The document topics are moderately complex, with more subtle author
transitions and smaller stylistic differences between paragraphs.
3. Hard Subtask: The document topics are highly mixed, with extremely subtle author transitions
and almost imperceptible stylistic differences between paragraphs.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Settings</title>
        <p>In this study, we selected DeBERTa as the pre-trained model and utilized a supervised contrastive
learning framework to optimize model parameters. Based on this foundation, we proposed
SCL-DeBERTa, which leverages supervised contrastive learning to improve style differentiation.
By combining classification and contrastive losses, the model achieves precise detection of author
transition boundaries. The trained model was evaluated on three distinct task datasets, resulting in
customized models tailored for each task. The hyperparameter settings used in this experiment are
detailed in Table 2.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Results</title>
      <sec id="sec-6-1">
        <title>6.1. Evaluation on PAN 2025 Benchmark</title>
        <p>As shown in Table 3, SCL-DeBERTa achieved F1 scores of 0.955 (ranking 3rd), 0.825 (ranking 1st),
and 0.829 (ranking 2nd) on the three subtasks.</p>
        <p>This table presents the leaderboard results on the PAN 2025 benchmark. Our team, xxsu-team, with
the proposed SCL-DeBERTa method, achieved top-tier performance across all three subtasks, ranking
3rd, 1st, and 2nd on Task1, Task2, and Task3, respectively. Compared to other strong baselines, our
approach demonstrates superior robustness and adaptability in multi-author writing style analysis,
establishing a new benchmark for this challenging task.</p>
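For reference, the F1 metric over predicted change labels can be sketched as follows (toy predictions, not the official PAN evaluator):

```python
# F1 over binary change predictions: harmonic mean of precision and recall.
def f1(pred, gold):
    tp = sum(p == g == 1 for p, g in zip(pred, gold))
    fp = sum(p == 1 and g == 0 for p, g in zip(pred, gold))
    fn = sum(p == 0 and g == 1 for p, g in zip(pred, gold))
    if tp == 0:
        return 0.0
    prec, rec = tp / (tp + fp), tp / (tp + fn)
    return 2 * prec * rec / (prec + rec)

# One spurious change prediction at position 2: precision 2/3, recall 1.
assert round(f1([1, 0, 1, 1], [1, 0, 0, 1]), 6) == 0.8
```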
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion And Future Work</title>
      <sec id="sec-7-1">
        <title>7.1. Conclusion</title>
        <p>In this study, we proposed SCL-DeBERTa, a novel framework that integrates supervised contrastive
learning with the DeBERTa encoder to address the challenging task of multi-author writing style
detection. By combining classification and contrastive losses, the model effectively enhances style
differentiation and achieves precise detection of author transition boundaries.</p>
      </sec>
      <sec id="sec-7-2">
        <title>7.2. Future Work</title>
        <p>While SCL-DeBERTa has shown promising results, several avenues for future research remain open.
First, we plan to explore the integration of additional pre-trained language models, such as GPT or
LLAMA, to further enhance the model’s stylistic representation capabilities. Second, extending the
framework to handle multilingual datasets could broaden its applicability to diverse linguistic contexts.
Third, we aim to investigate the impact of incorporating external knowledge, such as syntactic or
semantic features, to improve the model’s performance on the Hard subtask. Finally, optimizing the
computational efficiency of the framework, particularly for large-scale datasets, will be a key focus to
ensure its scalability in practical scenarios.</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>8. Acknowledgments</title>
      <p>This work is supported by the National Social Science Foundation of China (24BYY080).</p>
    </sec>
    <sec id="sec-9">
      <title>9. Declaration on Generative AI</title>
      <p>During the preparation of this work, the authors utilized the DeepSeek language model and GPT-4.1 for
grammar and spelling checks. Following the use of these tools, the authors carefully reviewed and edited
the content as necessary and assume full responsibility for the content of this publication.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zangerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mayerl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Potthast</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Stein</surname>
          </string-name>
          ,
          <article-title>Overview of the Multi-Author Writing Style Analysis Task at PAN 2025</article-title>
          , in: G. Faggioli,
          <string-name>
            <given-names>N.</given-names>
            <surname>Ferro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Rosso</surname>
          </string-name>
          , D. Spina (Eds.),
          <source>Working Notes of CLEF 2025 - Conference and Labs of the Evaluation Forum, CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Khosla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Teterwak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sarna</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Isola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Maschinot</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Krishnan</surname>
          </string-name>
          ,
          <article-title>Supervised contrastive learning</article-title>
          ,
          <source>Advances in neural information processing systems</source>
          <volume>33</volume>
          (
          <year>2020</year>
          )
          <fpage>18661</fpage>
          -
          <lpage>18673</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>P.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Gao</surname>
          </string-name>
          , W. Chen, Deberta:
          <article-title>Decoding-enhanced bert with disentangled attention</article-title>
          , arXiv preprint arXiv:2006.03654 (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rai</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Alvi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Samad</surname>
          </string-name>
          ,
          <article-title>Team gladiators at pan: improving author identification: a comparative analysis of pre-trained transformers for multi-author classification</article-title>
          ,
          <source>Working Notes of CLEF</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>T.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <article-title>Team nycu-nlp at pan 2024: integrating transformers with similarity adjustments for multi-author writing style analysis</article-title>
          ,
          <source>Working Notes of CLEF</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>H.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Han</surname>
          </string-name>
          ,
          <article-title>A writing style embedding based on contrastive learning for multi-author writing style analysis</article-title>
          ,
          <source>in: CLEF (Working Notes)</source>
          ,
          <year>2023</year>
          , pp.
          <fpage>2562</fpage>
          -
          <lpage>2567</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Lv</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Qi</surname>
          </string-name>
          ,
          <article-title>Team fosu-stu at pan: Supervised fine-tuning of large language models for multi author writing style analysis</article-title>
          ,
          <source>Working Notes of CLEF</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E. J.</given-names>
            <surname>Hu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Shen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Wallis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Allen-Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Chen</surname>
          </string-name>
          , et al.,
          <article-title>Lora: Low-rank adaptation of large language models</article-title>
          , in:
          <source>ICLR</source>
          ,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>T.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kornblith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Norouzi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Hinton</surname>
          </string-name>
          ,
          <article-title>A simple framework for contrastive learning of visual representations</article-title>
          ,
          <source>in: International conference on machine learning</source>
          ,
          <source>PmLR</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>1597</fpage>
          -
          <lpage>1607</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>