INDEX
Explanations
mentions of societal issues or controversies
references to incidents involving legal issues or controversies
Negative Logits
yip          -0.78
equivalents  -0.72
rax          -0.71
semble       -0.69
ovy          -0.69
export       -0.69
zbollah      -0.67
flavour      -0.66
inctions     -0.65
capacity     -0.63
Positive Logits
âĢ           1.40
âĢ           1.20
his          1.09
TMZ          1.08
¶            0.97
âľ           0.97
he           0.97
his          0.92
âĸ           0.91
His          0.87
Activations Density 0.748%