INDEX
Explanations
occurrences of the word "of" in various contexts
New Auto-Interp
Negative Logits
ka
-0.18
_dash
-0.15
latin
-0.14
egrity
-0.14
ange
-0.14
onical
-0.14
/browse
-0.14
reau
-0.14
igo
-0.14
à¸Ńà¸Ļ
-0.14
POSITIVE LOGITS
lack
0.24
its
0.20
reasons
0.18
being
0.17
sheer
0.16
fears
0.15
Lack
0.15
lack
0.15
their
0.15
lacking
0.14
Activations Density 0.068%