INDEX
Explanations
proper nouns and terms related to legal and political issues
references to controversial or significant issues and topics
New Auto-Interp
Negative Logits
ĸļ
-0.63
Pyro
-0.60
Soph
-0.56
¥µ
-0.56
ahime
-0.55
OTAL
-0.55
ptroller
-0.53
umar
-0.52
FI
-0.52
Romeo
-0.52
POSITIVE LOGITS
Haram
0.54
Ħ¢
0.52
poses
0.50
ervatives
0.50
flour
0.49
MIN
0.48
Getty
0.48
supporters
0.48
ervative
0.48
violates
0.47
Activations Density 0.671%