Have we reached the era of self-supervised learning?
Data is flowing in every day. People are working 24/7. Jobs are distributed to every corner of the world. And yet, much of this data is left unannotated, waiting for possible use by a new model, a new training run, or a new upgrade.
Or, it will never happen. It will never happen as long as the world keeps running in a supervised fashion.
The rise of self-supervised learning in recent years has unveiled a new direction. Instead of creating annotations for every task, self-supervised learning splits the work into pretext/pre-training tasks (see my earlier post on pre-training here) and downstream tasks. The pretext tasks focus on extracting representative features from the whole dataset without the guidance of any ground-truth annotations. However, this procedure still requires labels, which are generated automatically from the dataset itself, usually through extensive data augmentation. Hence, we use the terms unsupervised learning (the dataset is unannotated) and self-supervised learning (the tasks are supervised by self-generated labels) interchangeably in this article.
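To make the idea of self-generated labels concrete, here is a minimal sketch (not taken from any of the papers) of how a pretext task can create its own supervision: two random augmentations of the same unlabeled image form a positive pair, while views of other images serve as negatives. The specific augmentation choices below are illustrative assumptions.

```python
from torchvision import transforms

# Illustrative augmentation pipeline: each call produces a different random view.
augment = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ColorJitter(0.4, 0.4, 0.4, 0.1),
    transforms.ToTensor(),
])

def make_views(pil_image):
    """Return two independently augmented views of one unlabeled image.

    The pair (view_1, view_2) acts as a self-generated "positive" label;
    no human annotation is involved.
    """
    return augment(pil_image), augment(pil_image)
```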
Contrastive learning is a major class of self-supervised learning. It uses unlabelled datasets and contrastive losses computed on encoded representations (e.g., the contrastive loss, the InfoNCE loss, the triplet loss, etc.) to train a deep network. Leading contrastive learning frameworks include SimCLR, SimSiam, and the MoCo series.
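Since the InfoNCE loss comes up repeatedly below, here is a short, hedged sketch of it in PyTorch. The function name, argument layout, and default temperature are my own illustrative choices, not code from any of the MoCo papers.

```python
import torch
import torch.nn.functional as F

def info_nce_loss(query, positive_key, negative_keys, temperature=0.07):
    """InfoNCE: pull each query toward its positive key, push it from negatives.

    query:         (N, C) encoded queries
    positive_key:  (N, C) encoded positive keys (one per query)
    negative_keys: (K, C) encoded negative keys shared across the batch
    """
    query = F.normalize(query, dim=1)
    positive_key = F.normalize(positive_key, dim=1)
    negative_keys = F.normalize(negative_keys, dim=1)

    # Positive logits: (N, 1); negative logits: (N, K).
    l_pos = torch.sum(query * positive_key, dim=1, keepdim=True)
    l_neg = query @ negative_keys.T
    logits = torch.cat([l_pos, l_neg], dim=1) / temperature

    # The positive key always sits at index 0, so the "class" label is 0.
    labels = torch.zeros(query.size(0), dtype=torch.long, device=query.device)
    return F.cross_entropy(logits, labels)
```

In effect, InfoNCE is a softmax cross-entropy over similarities, where the matching key plays the role of the correct class.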
MoCo is an abbreviation of "momentum contrast." The core idea was laid out in the first MoCo paper, which frames the computer-vision self-supervised learning problem as follows:
"[quote from original paper] Computer vision, in contrast, further concerns dictionary building, as the raw signal is in a continuous, high-dimensional space and is not structured for human communication… Though driven by various motivations, these (note: recent visual representation learning) methods can be thought of as building dynamic dictionaries… Unsupervised learning trains encoders to perform dictionary look-up: an encoded 'query' should be similar to its matching key and dissimilar to others. Learning is formulated as minimizing a contrastive loss."
In this article, we will take a gentle look back at MoCo v1 through v3:
- v1 — the paper "Momentum contrast for unsupervised visual representation learning" was published at CVPR 2020. It proposes a momentum update for the key ResNet encoder, combined with a queue of encoded samples and the InfoNCE loss (see the sketch after this list).
- v2 — the paper "Improved baselines with momentum contrastive learning" came out shortly after, adopting two architectural improvements from SimCLR: a) replacing the FC projection head with a 2-layer MLP and b) extending the original data augmentation with blur.
- v3 — the paper "An empirical study of training self-supervised vision transformers" was published at ICCV 2021. The framework extends the single key-query pair to two key-query pairs, which are used to form a SimSiam-style symmetric contrastive loss. The backbone is also extended from ResNet-only to both ResNet and ViT.
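As a preview of v1, here is a condensed sketch of one MoCo training step, loosely following the PyTorch-style pseudocode in the paper. The function names, the shapes, and the way the queue tensor is passed in are my own assumptions for illustration; the official implementation differs in detail (e.g., shuffling BN, distributed queue updates).

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def momentum_update(f_q, f_k, m=0.999):
    """Slowly drag the key encoder toward the query encoder."""
    for p_q, p_k in zip(f_q.parameters(), f_k.parameters()):
        p_k.data = m * p_k.data + (1.0 - m) * p_q.data

def moco_step(f_q, f_k, queue, x_q, x_k, t=0.07):
    """One MoCo v1-style step.

    f_q, f_k: query/key encoders (e.g., ResNet-50 with a projection head)
    queue:    (C, K) buffer of previously encoded keys used as negatives
    x_q, x_k: two augmented views of the same batch of images
    """
    q = F.normalize(f_q(x_q), dim=1)              # queries: (N, C)
    with torch.no_grad():
        momentum_update(f_q, f_k)
        k = F.normalize(f_k(x_k), dim=1)          # keys: (N, C), no gradient

    l_pos = (q * k).sum(dim=1, keepdim=True)      # positive logits: (N, 1)
    l_neg = q @ queue                             # negative logits: (N, K)
    logits = torch.cat([l_pos, l_neg], dim=1) / t
    labels = torch.zeros(q.size(0), dtype=torch.long, device=q.device)
    loss = F.cross_entropy(logits, labels)

    # The current keys are then enqueued and the oldest column of `queue`
    # is dequeued (bookkeeping omitted in this sketch).
    return loss, k
```

The momentum coefficient keeps the key encoder evolving slowly, so the keys stored in the queue stay (approximately) consistent with the keys produced in the current step.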