INDEX
Explanations
discussions about prioritizing resources and societal issues
Negative Logits
ève       -0.16
isoft     -0.15
sville    -0.15
bekl      -0.15
Něm       -0.15
Soft      -0.14
illis     -0.14
anken     -0.14
mitt      -0.14
opper     -0.14
Positive Logits
instead    0.25
instead    0.24
real       0.22
focus      0.22
Instead    0.22
Instead    0.21
Focus      0.21
priority   0.21
elsewhere  0.20
focus      0.20
Activations Density 0.187%
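The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature's activation is above zero, could look like the following. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and chosen only so the result reproduces the 0.187% figure. They are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the feature fires.

    Assumption: "fires" means activation > threshold. The dashboard
    itself does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 187 active positions out of 100,000 sampled tokens.
toy = [1.0] * 187 + [0.0] * (100_000 - 187)
density = activation_density(toy)
print(f"{density:.3%}")  # prints "0.187%"
```

Under this reading, a density of 0.187% means the feature is sparse: it fires on roughly 2 out of every 1,000 tokens.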