INDEX
Explanations
occurrences of the definite article "the" and associated phrases
New Auto-Interp
Negative Logits
oplevel
-0.18
idla
-0.16
apult
-0.16
ignon
-0.16
urette
-0.15
åŁ
-0.15
istra
-0.15
agna
-0.15
illez
-0.15
à¸Ńà¸Ļ
-0.15
POSITIVE LOGITS
same
0.16
seguint
0.16
diffuse
0.15
TMPro
0.15
898
0.14
pleas
0.14
permanently
0.14
mistaken
0.14
Gim
0.14
IBUTES
0.14
Activations Density 0.215%