SciPost logo

SciPost Submission Page

On the role of non-linear latent features in bipartite generative neural networks

by Tony Bonnaire, Giovanni Catania, Aurélien Decelle, Beatriz Seoane

This Submission thread is now published as

Submission summary

Authors (as registered SciPost users): Giovanni Catania · Aurélien Decelle
Submission information
Preprint Link: scipost_202509_00033v1  (pdf)
Code repository: https://github.com/giovact/FixedPointSolver
Date accepted: Oct. 28, 2025
Date submitted: Sept. 16, 2025, 10:17 p.m.
Submitted by: Aurélien Decelle
Submitted to: SciPost Physics
Ontological classification
Academic field: Physics
Specialties:
  • Statistical and Soft Matter Physics
Approach: Theoretical

Abstract

We investigate the phase diagram and memory retrieval capabilities of Restricted Boltzmann Machines (RBMs), an archetypal model of bipartite energy-based neural networks, as a function of the prior distribution imposed on their hidden units—including binary, multi-state, and ReLU-like activations. Drawing connections to the Hopfield model and employing analytical tools from statistical physics of disordered systems, we explore how the architectural choices and activation functions shape the thermodynamic properties of these models. Our analysis reveals that standard RBMs with binary hidden nodes and extensive connectivity suffer from reduced critical capacity, limiting their effectiveness as associative memories. To address this, we examine several modifications, such as introducing local biases and adopting richer hidden unit priors. These adjustments restore ordered retrieval phases and markedly improve recall performance, even at finite temperatures. Our theoretical findings, supported by finite-size Monte Carlo simulations, highlight the importance of hidden unit design in enhancing the expressive power of RBMs.

Author indications on fulfilling journal expectations

  • Provide a novel and synergetic link between different research areas.
  • Open a new pathway in an existing or a new research direction, with clear potential for multi-pronged follow-up work
  • Detail a groundbreaking theoretical/experimental/computational discovery
  • Present a breakthrough on a previously-identified and long-standing research stumbling block

Author comments upon resubmission

Dear Editors,

we hereby resubmit our manuscript after having taking into account the referee's comments.
The modifications to the manuscript are highlighted in the text.

Thanks.

Best regards.

List of changes

We detailed the list of changes as an answered to the reviewer.
In this resubmission, we highlight the change in color in the manuscripts.

Published as SciPost Phys. 19, 141 (2025)


Reports on this Submission

Report #2 by Anonymous (Referee 2) on 2025-10-18 (Invited Report)

Report

The authors have addressed all my comments and I am happy to recommend this paper for publication.

Recommendation

Publish (easily meets expectations and criteria for this Journal; among top 50%)

  • validity: high
  • significance: high
  • originality: good
  • clarity: top
  • formatting: perfect
  • grammar: perfect

Report #1 by Anonymous (Referee 1) on 2025-9-17 (Invited Report)

  • Cite as: Anonymous, Report on arXiv:scipost_202509_00033v1, delivered 2025-09-17, doi: 10.21468/SciPost.Report.11951

Strengths

In my opinion, the work faces a key but underexplored design choice in the context of bipartite energy-based neural networks.
The work includes both a clear analytical derivation (detailed and reproducible)
and numerical experiments (on synthetic as well as structured datasets) that confirm the analytical predictions.

Weaknesses

I do not see any significant weakness at this stage.

Report

In my opinion the revised version clarifies the theoretical framework and strengthens the connection between theory and simulation results. The paper now provides a detailed and well-structured analysis, comprehensively addressing my previous concerns.
Based on my experience, the revised manuscript represents a high-quality contribution at the interface of statistical physics and machine learning and I strongly recommend acceptance in SciPost Physics.

Recommendation

Publish (surpasses expectations and criteria for this Journal; among top 10%)

  • validity: top
  • significance: top
  • originality: high
  • clarity: top
  • formatting: perfect
  • grammar: perfect

Login to report or comment