INDEX
Explanations
places or objects within specific locations, such as rooms or closets, that may be related to crimes or investigations
New Auto-Interp
Negative Logits
igl
-0.59
kef
-0.58
ppa
-0.55
meier
-0.53
fman
-0.51
itz
-0.51
jah
-0.50
xon
-0.49
Kiw
-0.49
arnaev
-0.49
POSITIVE LOGITS
itself
0.71
consists
0.61
herself
0.57
yourself
0.56
minus
0.54
encompasses
0.53
pedia
0.52
ourselves
0.52
himself
0.51
behaves
0.51
Activations Density 0.935%