Stat Comput. 2026;36(3):131. doi: 10.1007/s11222-026-10866-0. Epub 2026 May 5.
ABSTRACT
In an effort to develop topic modeling methods that can be quickly applied to large data sets, we revisit the problem of maximum-likelihood estimation in topic models. It is known, at least informally, that maximum-likelihood estimation in topic models is closely related to non-negative matrix factorization (NMF). Yet, to our knowledge, this relationship has not been exploited previously to fit topic models. We show that recent advances in NMF optimization methods can be leveraged to fit topic models very efficiently, often resulting in much better fits and in less time than existing algorithms for topic models. We also formally make the connection between the NMF optimization problem and maximum-likelihood estimation for the topic model, and using this result we show that the expectation maximization (EM) algorithm for the topic model is essentially the same as the classic multiplicative updates for NMF. Our methods are implemented in the R package “fastTopics”.
PMID:42100650 | PMC:PMC13144203 | DOI:10.1007/s11222-026-10866-0