INDEX
Explanations
proper nouns referring to people or places
references to specific individuals, particularly politicians or public figures
New Auto-Interp
Negative Logits
lished
-0.76
âĢ¢âĢ¢
-0.71
Gaza
-0.64
ãĤ±
-0.63
ãĤ¦
-0.61
CDC
-0.61
é¾
-0.60
franc
-0.59
kcal
-0.57
IPM
-0.56
POSITIVE LOGITS
ttle
0.94
mort
0.88
issan
0.86
acket
0.77
ikh
0.73
ung
0.70
inge
0.68
amic
0.67
azine
0.67
ophile
0.67
Activations Density 0.073%