Contents
What is factorization machine used for?
The Factorization Machines algorithm is a general-purpose supervised learning algorithm that you can use for both classification and regression tasks. It is an extension of a linear model that is designed to capture interactions between features within high dimensional sparse datasets economically.
What is Field aware factorization machines?
Field Aware Factorization Machines (FFM) The categorical values that each field takes will be termed features. For example, male, female, action, romance, etc are all features. Photo by Author. In FM, every feature has only one latent vector to learn the latent effect with all other features [1].
How do factorization machines work?
Factorization machines were first introduced by Steffen Rendle [1] in 2010. The idea behind FMs is to model interactions between features (explanatory variables) using factorized parameters. The FM model has the ability to the estimate all interactions between features even with extreme sparsity of data.
What can a factorization machine be used for?
A factorization machine is a general-purpose supervised learning algorithm that you can use for both classification and regression tasks. It is an extension of a linear model that is designed to capture interactions between features within high dimensional sparse datasets economically.
How are factorization machines used in sparse settings?
Broadly speaking, factorization machines are able to estimate interactions in sparse settings because they break the independence of the interaction parameters by factorizing them (using latent vectors as expressed in ).
Can a factorization machine be run in regression mode?
The Factorization Machines algorithm can be run in either in binary classification mode or regression mode. In each mode, a dataset can be provided to the test channel along with the train channel dataset. The scoring depends on the mode used.
Is the factorization machine algorithm good for CSV?
For training, the Factorization Machines algorithm currently supports only the recordIO-protobuf format with Float32 tensors. Because their use case is predominantly on sparse data, CSV is not a good candidate. Both File and Pipe mode training are supported for recordIO-wrapped protobuf.