INDEX
Explanations
quotes or statements related to legal matters or political commentary
New Auto-Interp
Negative Logits
federation
-0.71
seiz
-0.70
sacrific
-0.68
Negro
-0.67
ende
-0.67
Orchestra
-0.65
blot
-0.63
iolet
-0.63
strugg
-0.62
vulner
-0.61
POSITIVE LOGITS
¯
1.07
ï¸ı
0.99
cue
0.89
âĢł
0.88
STEM
0.87
which
0.82
s
0.78
°
0.77
£
0.75
âϦ
0.74
Activations Density 0.172%