INDEX
Explanations
phrases related to government actions or statements
symbols or characters resembling the "â" character
Negative Logits
zag         -0.67
maxim       -0.67
subur       -0.67
scatter     -0.67
radar       -0.66
minim       -0.64
scene       -0.64
obser       -0.63
spinning    -0.63
ctors       -0.62
Positive Logits
£           0.95
_>          0.88
¢           0.82
âĢł         0.80
¯           0.80
there       0.78
sure        0.77
º           0.75
âĢ          0.75
>[          0.75
Activations Density 0.375%
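
Several of the top positive tokens (âĢł, âĢ, £, ¢, ¯, º) look like byte-level token strings rather than ordinary text, which would fit the second explanation about symbols resembling the "â" character. The page does not name the tokenizer, so the sketch below rests on an assumption: that tokens are displayed with the GPT-2-style byte-to-unicode mapping, in which each raw byte is shown as one printable character. Under that assumption, the helper (the name `token_to_bytes` is illustrative, not from the source) recovers the underlying bytes of a displayed token.

```python
def bytes_to_unicode():
    """Byte -> display character table in the style of GPT-2's byte-level BPE.

    Assumption: the dashboard uses this mapping; the source does not say so.
    Printable bytes map to themselves; all others are shifted to code points >= 256.
    """
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: display character -> raw byte value.
BYTE_DECODER = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token: str) -> bytes:
    """Recover the raw bytes behind a displayed token string."""
    return bytes(BYTE_DECODER[ch] for ch in token)


# "\u00e2\u0122\u0142" is the token displayed as âĢł in the list above.
raw = token_to_bytes("\u00e2\u0122\u0142")
print(raw.hex())            # e280a0
print(raw.decode("utf-8"))  # † (U+2020, dagger)

# "\u00e2\u0122" (displayed as âĢ) is only a two-byte UTF-8 prefix, so it is
# shown as hex rather than decoded.
print(token_to_bytes("\u00e2\u0122").hex())  # e280
```

If the assumption holds, âĢł is the three-byte UTF-8 encoding of a dagger, and âĢ is the shared two-byte prefix of characters in the U+2000 to U+203F punctuation range, which is why it cannot be decoded on its own.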