INDEX
Explanations
numerical data and statistics related to study results
New Auto-Interp
Negative Logits
Efq
-1.06
Monfieur
-1.03
raiſ
-0.98
itſelf
-0.95
Theſe
-0.95
Houſe
-0.94
CloseOperation
-0.94
myſelf
-0.93
ſeveral
-0.92
]='\
-0.92
POSITIVE LOGITS
.
0.60
↵
0.55
,
0.55
i
0.49
Chwiliwch
0.48
x
0.44
os
0.44
?
0.44
!
0.42
–
0.42
Activations Density 0.011%