INDEX
Explanations
intensifiers that emphasize the completeness or totality of an action or situation
New Auto-Interp
Negative Logits
roe
-0.16
ette
-0.16
ooter
-0.16
ning
-0.15
uhl
-0.15
haar
-0.14
duk
-0.14
нÑĥл
-0.14
sted
-0.14
vil
-0.14
POSITIVE LOGITS
/full
0.18
ajan
0.18
entirely
0.15
completely
0.15
yscale
0.14
žÃŃ
0.14
opposite
0.14
coverage
0.14
ayah
0.14
isses
0.14
Activations Density 0.043%