Explanations
statements related to personal experiences and opinions on rights and freedoms
Negative Logits
sort         -0.24
sorts        -0.20
sort         -0.19
terrific     -0.17
SORT         -0.17
kv           -0.16
SORT         -0.15
suddenly     -0.15
-redux       -0.15
incredibly   -0.15
Positive Logits
ionario      0.15
å±ŀ          0.15
ÙħتÙĨ       0.14
evidently    0.14
sir          0.14
_vect        0.14
.obtain      0.14
ãĥ³ãĤ¸       0.13
ãĥĸãĥŃ       0.13
691          0.13
Activations Density 0.127%