INDEX
Explanations
phrases containing "it is"
the pronoun "It" at the beginning of various statements
New Auto-Interp
Negative Logits
dding
-0.80
Guant
-0.68
hips
-0.67
Guinea
-0.64
Priv
-0.64
911
-0.63
priv
-0.61
friends
-0.61
inqu
-0.59
-----
-0.59
POSITIVE LOGITS
unes
0.99
self
0.93
alian
0.91
achi
0.90
chy
0.90
'll
0.88
asca
0.87
seems
0.86
consists
0.80
ÃĥÃĤ
0.78
Activations Density 0.278%