Explanations
specific verbs in the past tense
verbs and actions related to achievements or significant events
Negative Logits
ugal     -0.63
acebook  -0.61
ankind   -0.58
ueless   -0.58
Seym     -0.57
millenn  -0.57
selves   -0.56
OTAL     -0.56
ogether  -0.53
halla    -0.53
Positive Logits
Phant    0.59
Bayern   0.56
LLP      0.56
Profile  0.55
Mush     0.54
›        0.54
bra      0.54
IPM      0.53
ranch    0.52
brew     0.52
Activations Density: 0.548%