INDEX
Explanations
references to specific events or actions in a narrative context
New Auto-Interp
Negative Logits
OnError
-0.18
ÑĪÑĤ
-0.18
recon
-0.16
ref
-0.15
rejo
-0.15
irse
-0.15
thru
-0.15
aux
-0.15
dyn
-0.15
illian
-0.14
POSITIVE LOGITS
ò
0.23
onders
0.21
ù
0.19
ì
0.19
agers
0.17
uzione
0.17
ubb
0.15
McB
0.15
più
0.15
è
0.15
Activations Density 0.058%