Explanations
non-zero values in numerical data or results
Negative Logits
Token        Logit
myſelf       -1.75
itſelf       -1.69
―――――        -1.50
Paglinawan   -1.40
Efq          -1.36
doubtnut     -1.34
iſt          -1.34
ſelves       -1.34
Monfieur     -1.33
Anſ          -1.32
Positive Logits
Token        Logit
.            1.40
,            1.31
↵            1.22
<eos>        1.21
(blank)      1.20
↵↵           1.11
of           1.02
!            1.00
?            1.00
(blank)      0.96
Activations Density: 0.472%