INDEX
Explanations
phrases related to expressed thoughts or opinions
instances of the word "that" in various contexts
Negative Logits
aukee       -0.85
ãĤ´ãĥ³      -0.81
pione       -0.79
ä           -0.77
iped        -0.77
arest       -0.74
afe         -0.74
ascript     -0.73
amia        -0.71
ãĤĬ         -0.70
Positive Logits
's          0.97
happens     0.95
wasn        0.91
translates  0.90
settles     0.90
justifies   0.90
applies     0.89
doesn       0.89
proves      0.88
isn         0.88
Activations Density 0.210%