INDEX
Explanations
words related to legal proceedings and news reports
specific symbols representing citations or formal referencing in the text
New Auto-Interp
Negative Logits
monop
-0.77
tremend
-0.75
bda
-0.70
fart
-0.69
zoning
-0.68
Tsuk
-0.65
Banana
-0.64
scatter
-0.64
juggling
-0.64
rons
-0.63
POSITIVE LOGITS
£
0.79
Hon
0.74
¬
0.73
said
0.73
¯
0.72
Rapids
0.70
¢
0.69
º
0.69
âĹ¼
0.69
Govern
0.68
Activations Density 0.313%