INDEX
Explanations
phrases indicating physical presence or proximity
New Auto-Interp
Negative Logits
ix
-0.18
éĤ
-0.16
ancia
-0.15
adius
-0.15
esan
-0.15
phere
-0.15
ortex
-0.15
ascimento
-0.14
ancias
-0.14
imate
-0.14
POSITIVE LOGITS
cameras
0.20
-camera
0.20
camera
0.18
ÅĤu
0.17
į°ìĿ´
0.16
AYER
0.16
ultz
0.16
witnesses
0.15
-runner
0.14
uria
0.14
Activations Density 0.027%