INDEX
Explanations
phrases indicating uncertainty or possibility
expressions of uncertainty or speculation
Negative Logits
Token         Logit
ovember       -0.69
----------    -0.65
pin           -0.63
trap          -0.62
translator    -0.61
Typ           -0.60
La            -0.60
Released      -0.59
Sac           -0.59
idan          -0.57
Positive Logits
Token         Logit
they          1.06
he            0.89
THEY          0.87
anwhile       0.86
she           0.79
it            0.76
they          0.75
we            0.74
there         0.74
none          0.72
Activations Density: 0.305%