I share my progress implementing a research idea from scratch: building an ensemble model out of the students of label-free self-distillation, without any additional data or augmentation. It turns out this actually works, and interestingly, the more students I employ, the better the accuracy. This leads to the hypothesis that the ensemble effect is not a process of extracting more information from the labels.
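A minimal sketch of the two ingredients, assuming softmax classifiers (the function names and toy shapes are mine, not from the video's codebase): students train against the teacher's softened output distribution rather than ground-truth labels, and the ensemble prediction averages the students' probability distributions.

```python
import numpy as np

def softmax(z, axis=-1):
    # numerically stable softmax
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def distill_targets(teacher_logits, temperature=2.0):
    # Label-free self-distillation: students fit the teacher's
    # softened distribution, never the ground-truth labels.
    return softmax(teacher_logits / temperature)

def ensemble_predict(student_probs):
    # student_probs: (num_students, num_samples, num_classes).
    # Average the students' distributions, then take the argmax class.
    return np.mean(student_probs, axis=0).argmax(axis=-1)

# toy example: 3 students, 4 samples, 5 classes
rng = np.random.default_rng(0)
probs = softmax(rng.normal(size=(3, 4, 5)))
preds = ensemble_predict(probs)
print(preds.shape)  # one predicted class per sample: (4,)
```

Adding more students only changes the first axis of `student_probs`; the averaging step is what the video's experiments vary.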
OUTLINE:
0:00 – Introduction
2:10 – Research Idea
4:15 – Adjusting the Codebase
25:00 – Teacher and Student Models
52:30 – Shipping to the Server
1:03:40 – Results
1:14:50 – Conclusion
Code:
References:
My Video on SimCLRv2:
Born-Again Neural Networks:
Deep Ensembles: A Loss Landscape Perspective:
Links:
YouTube:
Twitter:
Discord:
BitChute:
Minds:
Parler: