Designing a Self-Supervised Task for Code
You are tasked with pre-training a language model on a massive, unlabeled dataset of computer code. Beyond the common approach of predicting randomly masked parts of the code, propose one distinct self-supervised objective that would be particularly well-suited for this dataset. Briefly justify why your proposed objective would help the model learn meaningful patterns specific to code.
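Before proposing an alternative, it helps to see what the baseline objective actually does. The sketch below is a minimal, illustrative construction of a masked-prediction training pair; the function and variable names are hypothetical, not from any particular framework.

```python
import random

def make_masked_example(tokens, mask_token="<mask>", mask_rate=0.15, seed=0):
    """Build a self-supervised (input, target) pair by masking random tokens.

    The labels are the original tokens themselves, so no human
    annotation is needed -- this is what makes the task self-supervised.
    """
    rng = random.Random(seed)
    n_mask = max(1, int(len(tokens) * mask_rate))
    positions = rng.sample(range(len(tokens)), n_mask)
    inputs = list(tokens)
    targets = {}
    for pos in positions:
        targets[pos] = tokens[pos]  # label = the token that was masked out
        inputs[pos] = mask_token
    return inputs, targets

# Example: a tokenized line of code serves as its own supervision signal.
code = "def add ( a , b ) : return a + b".split()
masked, labels = make_masked_example(code)
```

Any alternative objective you propose (e.g., predicting whether two code fragments come from the same function, or recovering scrambled statement order) should have this same property: the targets are derived mechanically from the unlabeled corpus.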
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Creation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A team is training a language model on a massive, unlabeled corpus of text from the internet. Their training objective is to randomly mask 15% of the words in each input sentence and require the model to predict the original masked words. Which of the following statements best analyzes why this specific training method is considered 'self-supervised'?
Pre-training Strategy for a Specialized Domain
Training Process for Text-to-Text Models