INDEX
Explanations
references to economic decline and societal issues related to poverty
New Auto-Interp
Negative Logits
tactos
-0.53
IFYING
-0.49
keber
-0.48
+#+#
-0.48
ελ
-0.48
erapeu
-0.47
LLA
-0.47
GOTREF
-0.47
andom
-0.47
balle
-0.46
POSITIVE LOGITS
worst
0.82
'\\;'
0.80
worse
0.79
Worse
0.76
worst
0.76
Worst
0.74
Worse
0.74
Worst
0.69
peor
0.67
OMITBAD
0.65
Activations Density 1.288%