INDEX
Explanations
terms related to interaction and physical contact
New Auto-Interp
Negative Logits
</b>
-0.51
-0.48
st
-0.48
,
-0.47
</i>
-0.44
…
-0.43
'
-0.43
↵↵
-0.42
<eos>
-0.41
[
-0.41
POSITIVE LOGITS
Efq
1.27
BibitemShut
1.25
myſelf
1.21
Jefus
1.17
للاسماء
1.16
bibfield
1.15
houſe
1.15
bibinfo
1.09
raiſ
1.09
Theſe
1.07
Activations Density 0.598%