Albert Transformer, … ALBERT uses the Transformer architecture to pre-train training on text data.