INDEX
Explanations
references to author and artist names
the word "more" frequently in various contexts
Negative Logits
Token      Logit
riber      -0.71
²¾         -0.69
inical     -0.68
ļ          -0.67
ĩ          -0.67
ļé         -0.65
etimes     -0.64
ibl        -0.62
DD         -0.62
reen       -0.62
Positive Logits
Token        Logit
than         0.83
than         0.81
ado          0.79
cam          0.79
Than         0.78
vine         0.76
importantly  0.71
likely       0.71
invasive     0.71
HUD          0.70
Activations Density: 0.007%