INDEX
Explanations
people's names or surnames
mentions of specific individuals, particularly with the last names "Heller" and "Keller."
New Auto-Interp
Negative Logits
orough
-0.77
resh
-0.71
Es
-0.69
Rich
-0.68
edit
-0.67
lihood
-0.65
gres
-0.64
uld
-0.63
Commission
-0.63
Work
-0.63
POSITIVE LOGITS
gren
0.93
Heller
0.93
kamp
0.90
stein
0.83
mann
0.81
stadt
0.77
Koen
0.75
idge
0.75
anguage
0.75
housing
0.74
Activations Density 0.011%