INDEX
Explanations
instances of the word "think" and variations of expressing opinion or thought
New Auto-Interp
Negative Logits
-0.63
,
-0.57
øya
-0.57
so
-0.52
ca
-0.49
أن
-0.49
<strong>
-0.49
:
-0.48
an
-0.47
(
-0.47
POSITIVE LOGITS
aarrggbb
1.18
itſelf
1.15
Efq
1.06
Jefus
1.05
myſelf
1.04
themſelves
1.04
Monfieur
1.04
estekak
1.03
Shakspeare
0.99
―――――
0.97
Activations Density 0.106%