INDEX
Explanations
the word "big" and variations of its usage, indicating a focus on significance or intensity
New Auto-Interp
Negative Logits
urence
-0.16
prus
-0.15
گاÙĩ
-0.15
ncia
-0.14
ope
-0.14
ollower
-0.14
oden
-0.14
itage
-0.14
ponce
-0.14
дам
-0.13
POSITIVE LOGITS
oted
0.22
gie
0.16
wig
0.16
elow
0.16
elters
0.16
tuá»ķi
0.16
ween
0.15
ging
0.15
-ticket
0.15
-picture
0.15
Activations Density 0.039%