INDEX
Explanations
words related to political discussions and opinions
phrases or mentions that convey uncertainty or questioning about a situation
Negative Logits
Tanz       -0.74
captives   -0.71
geries     -0.69
sacrific   -0.66
mathemat   -0.65
filib      -0.65
Palestin   -0.65
hostages   -0.63
princ      -0.60
ITNESS     -0.60
Positive Logits
ï¸ı        0.82
ï¸         0.77
Pg         0.76
uable      0.75
eal        0.73
else       0.72
should     0.71
deserves   0.69
mir        0.69
forth      0.69
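The first two positive-logit tokens (`ï¸ı` and `ï¸`) look like encoding debris, but they may be ordinary byte-level tokens. The sketch below assumes the model uses a GPT-2-style byte-level BPE vocabulary, which this page does not state. Under that assumption, each displayed character stands for one raw byte. The helper names `gpt2_bytes_to_unicode` and `token_to_bytes` are hypothetical and chosen for illustration only.

```python
def gpt2_bytes_to_unicode():
    """Byte -> display character table, as in GPT-2-style byte-level BPE (assumed)."""
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    table = {b: chr(b) for b in printable}
    # Every remaining byte is shifted to code point 256 and above, in byte order.
    n = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + n)
            n += 1
    return table


def token_to_bytes(token):
    """Recover the raw bytes behind a token's display string."""
    inverse = {ch: b for b, ch in gpt2_bytes_to_unicode().items()}
    return bytes(inverse[ch] for ch in token)


# The two tokens from the list, written with escapes to avoid copy-paste damage.
full = token_to_bytes("\u00ef\u00b8\u0131")   # displayed as "ï¸ı"
partial = token_to_bytes("\u00ef\u00b8")      # displayed as "ï¸"

print(full.hex(), repr(full.decode("utf-8")))
print(partial.hex())
```

If the assumption holds, the 0.82 token is the complete UTF-8 encoding of U+FE0F (variation selector-16, which commonly follows emoji), and the 0.77 token is its first two bytes. That is a reason to keep both strings exactly as listed.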
Activations Density 0.242%