INDEX
Explanations
names of people or characters
references to individuals named Stefan or Klaus
Negative Logits
Token      Logit
BACK       -0.74
REE        -0.70
rights     -0.69
outh       -0.66
Indians    -0.65
lled       -0.64
naire      -0.64
nces       -0.62
upon       -0.62
hazard     -0.61
Positive Logits
Token      Logit
Stefan     1.21
stadt      0.98
ovic       0.87
etti       0.82
apo        0.80
acci       0.79
ucci       0.77
Rah        0.77
Matte      0.76
Stef       0.75
Activations Density: 0.008%