INDEX
Explanations
emotional states and concerns regarding health and well-being
New Auto-Interp
Negative Logits
Monfieur
-1.16
pleaſure
-1.11
Reſ
-1.08
perſon
-1.05
rungsseite
-1.05
Theſe
-1.05
houſe
-1.02
Houſe
-1.02
Majefty
-1.01
Diſ
-1.00
POSITIVE LOGITS
was
0.99
wasn
0.92
didn
0.80
weren
0.78
had
0.76
earlier
0.74
Wasn
0.71
did
0.71
było
0.71
wasn
0.70
Activations Density 1.105%