INDEX
Explanations
phrases indicating the consequences or implications of a particular situation
the repeated phrase "the fact that."
New Auto-Interp
Negative Logits
aukee
-0.74
vc
-0.74
uttering
-0.70
yna
-0.68
Y
-0.66
pec
-0.64
Si
-0.64
gur
-0.64
think
-0.64
TPPStreamerBot
-0.63
POSITIVE LOGITS
they
1.04
someone
0.91
we
0.90
humans
0.90
there
0.85
nobody
0.85
everyone
0.81
he
0.81
these
0.80
somebody
0.78
Activations Density 0.109%