INDEX
Explanations
instances of the word "the," indicating a focus on common articles or determiners in text
the fight or task
New Auto-Interp
Negative Logits
enterOuterAlt
-0.52
StoryboardSegue
-0.46
méri
-0.40
Igual
-0.37
Comunicación
-0.37
trav
-0.36
médecin
-0.36
Governing
-0.35
TintMode
-0.35
Recording
-0.35
POSITIVE LOGITS
InjectAttribute
0.51
Chwiliwch
0.49
>−
0.48
displayquote
0.48
Diweddarwch
0.47
gesteld
0.46
enumii
0.46
eur
0.46
0.45
ogan
0.43
Activations Density 0.043%