INDEX
Explanations
phrases related to specific entities or concepts, possibly involving conflicts or controversies
mentions of a specific character or entity symbolized by the character "Ļ"
Negative Logits
disadvant
-0.75
misunder
-0.72
mathemat
-0.66
contrace
-0.65
seiz
-0.65
condem
-0.64
Palestin
-0.64
merce
-0.63
regulation
-0.63
ozy
-0.63
Positive Logits
ï¸ı
1.48
ï¸
0.96
âĹ
0.94
0.92
Balt
0.86
âĸº
0.84
gypt
0.83
¯¯
0.83
âĪ
0.83
âĻ
0.82
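The positive-logit tokens above look like mojibake, but they are consistent with how a byte-level BPE vocabulary displays raw bytes. The sketch below assumes a GPT-2-style byte-to-character table (the page does not name its tokenizer, so this is an assumption), and the helper name `token_to_bytes` is hypothetical. Under that assumption it maps a displayed token back to its bytes, which suggests that several of these tokens are whole or partial UTF-8 encodings of symbol characters rather than literal letters such as "Ļ".

```python
def bytes_to_unicode():
    """Rebuild a GPT-2-style byte -> printable-character table.

    Assumption: the vocabulary uses the GPT-2 byte-level convention, where
    printable Latin-1 bytes map to themselves and every other byte is
    shifted to code point 256 + n in order of appearance.
    """
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


def token_to_bytes(token):
    """Hypothetical helper: convert a displayed token back to raw bytes."""
    inverse = {ch: b for b, ch in bytes_to_unicode().items()}
    return bytes(inverse[ch] for ch in token)


# Tokens are written with escapes so that no invisible characters are lost.
top_token = "\u00ef\u00b8\u0131"   # displayed as the first positive-logit token
arrow_token = "\u00e2\u0138\u00ba" # displayed as the token with logit 0.84
partial_token = "\u00e2\u013b"     # displayed as the token with logit 0.82

print(token_to_bytes(top_token).hex())      # ef b8 8f: UTF-8 for U+FE0F
print(token_to_bytes(arrow_token).hex())    # e2 96 ba: UTF-8 for U+25BA
print(token_to_bytes(partial_token).hex())  # e2 99: first two bytes of a 3-byte sequence
```

If the assumption holds, the top token is the emoji variation selector U+FE0F, and the two-character tokens are incomplete UTF-8 prefixes that cannot be decoded on their own, so they should be read as byte fragments rather than as characters.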
Activations Density 0.449%