INDEX
Explanations
mentions of people in positions of authority or recognition within an organization
New Auto-Interp
Negative Logits
raq
-0.17
inel
-0.16
Beit
-0.15
Hamas
-0.15
Sly
-0.15
ingroup
-0.14
Sting
-0.14
aeda
-0.14
entai
-0.14
Ya
-0.14
POSITIVE LOGITS
Patel
0.24
Indian
0.20
jee
0.20
Indians
0.19
Bind
0.18
Indian
0.17
Gupta
0.17
bind
0.17
Chop
0.17
bind
0.17
Activations Density 0.251%