INDEX
Explanations
phrases of the form "a [something] manner/way" or similar
New Auto-Interp
Negative Logits
nahilalakip
-1.13
GEBURTSDATUM
-1.02
'\\;'
-0.97
itſelf
-0.94
myſelf
-0.94
doubtnut
-0.93
―――――
-0.93
Theſe
-0.91
ⓧ
-0.91
pinulongan
-0.91
POSITIVE LOGITS
n
0.59
A
0.57
to
0.55
0.55
a
0.54
-
0.53
N
0.51
independent
0.50
addOn
0.49
is
0.49
Activations Density 0.021%