INDEX
Explanations
opinions or thoughts expressed by individuals
New Auto-Interp
Negative Logits
ãĤ´ãĥ³
-1.02
.<
-0.89
.*
-0.89
.(
-0.88
!.
-0.85
ãĢĤ
-0.81
.).
-0.81
.</
-0.80
%.
-0.77
.#
-0.77
POSITIVE LOGITS
[
1.52
,"
1.26
['
1.15
,'"
1.12
),"
1.10
,''
0.99
â̦
0.91
.,"
0.90
%"
0.89
,'
0.86
Activations Density 1.248%