INDEX
Explanations
phrases that reference locations or regions
the repetition of the word "around" in various contexts
New Auto-Interp
Negative Logits
xual
-0.79
inen
-0.77
nis
-0.68
oly
-0.66
ocy
-0.66
qua
-0.65
HO
-0.62
istg
-0.61
ysis
-0.61
gard
-0.60
POSITIVE LOGITS
corners
0.87
clock
0.84
eatures
0.83
abouts
0.80
perty
0.72
lasses
0.69
intersections
0.66
lihood
0.65
atform
0.64
ingu
0.62
Activations Density 0.041%