INDEX
Explanations
mentions of phone calls and communication in various contexts
phrases related to actions and their settings
New Auto-Interp
Negative Logits
)."
-0.73
attRot
-0.70
").
-0.68
".[
-0.68
outwe
-0.66
Compat
-0.66
ForgeModLoader
-0.66
."[
-0.65
.""
-0.63
'."
-0.63
POSITIVE LOGITS
undrum
0.68
veland
0.64
enhagen
0.60
Adds
0.60
@
0.60
ansky
0.59
Stanton
0.57
tonight
0.55
emale
0.54
mural
0.54
Activations Density 1.565%