INDEX
Explanations
instances of questioning and reflection on personal experiences and observations
New Auto-Interp
Negative Logits
ãģıãĤĵ
-0.15
strand
-0.14
imson
-0.14
isay
-0.14
cope
-0.14
udeau
-0.14
lements
-0.14
erp
-0.14
ought
-0.14
MBER
-0.14
POSITIVE LOGITS
look
0.68
Look
0.57
look
0.56
Look
0.51
LOOK
0.42
_look
0.41
notice
0.40
.look
0.36
LOOK
0.35
.Look
0.34
Activations Density 0.330%