INDEX
Explanations
language related to expressing opinion or making a decision
expressions indicating confidence or certainty
Negative Logits
Koran     -0.57
Kissinger -0.57
Poverty   -0.56
cible     -0.56
Kosovo    -0.56
ynamic    -0.53
bestos    -0.53
bris      -0.53
Kabul     -0.52
DERR      -0.51
Positive Logits
.�        1.11
ðŁĺ       0.96
.ãĢį      0.96
¯         0.93
ðŁĻĤ      0.93
âĢ        0.90
.<        0.88
âķIJâķIJ    0.86
soType    0.86
.</       0.86
Activations Density 0.685%
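
Several entries in the positive-logits list (for example ðŁĻĤ and .ãĢį) look like raw vocabulary strings from a byte-level BPE tokenizer rather than readable text. The page does not say which tokenizer produced them, so the sketch below assumes the GPT-2-style byte-to-character table and simply inverts it to recover the underlying bytes. The names `bytes_to_unicode`, `CHAR_TO_BYTE`, and `decode_token` are illustrative, not part of any tool referenced here.

```python
def bytes_to_unicode():
    """Byte -> printable-character table in the style of GPT-2 byte-level BPE.

    Assumption: the dashboard tokens use this mapping; the page does not confirm it.
    """
    # Bytes that are already printable map to the same code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned a code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level vocabulary string back into readable text.

    Characters outside the table (e.g. U+FFFD left by a lossy scrape) are
    passed through as their own UTF-8 bytes; incomplete UTF-8 sequences
    decode to the replacement character instead of raising.
    """
    raw = bytearray()
    for ch in token:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            raw.extend(ch.encode("utf-8"))
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    samples = [
        "\u00f0\u0141\u013b\u0124",              # shown on the page as ðŁĻĤ
        ".\u00e3\u0122\u012f",                   # shown as .ãĢį
        "\u00e2\u0137\u0132\u00e2\u0137\u0132",  # shown as âķIJâķIJ
        "soType",                                # plain ASCII, unchanged
    ]
    for tok in samples:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, ðŁĻĤ decodes to the emoji 🙂, .ãĢį to a full stop followed by the closing corner bracket 」, and âķIJâķIJ to two box-drawing characters ══. Entries such as ðŁĺ and âĢ are incomplete UTF-8 prefixes (the start of an emoji or punctuation sequence), so they cannot be rendered as a full character on their own.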