INDEX
Explanations
instances of the verb "estar" in its various forms
New Auto-Interp
Negative Logits
↵
-0.20
stares
-0.18
stance
-0.17
sting
-0.17
story
-0.17
strict
-0.16
statements
-0.16
statements
-0.16
↵
-0.16
gó
-0.16
POSITIVE LOGITS
coach
0.18
quo
0.17
vation
0.16
ãģĨãģ¡
0.16
house
0.16
à¸ģารà¸ĵ
0.15
ois
0.15
ihn
0.14
Sche
0.14
bucks
0.14
Activations Density 0.001%