INDEX
Explanations
conditional statements related to consequences or threats
New Auto-Interp
Negative Logits
Jefus
-0.66
ſche
-0.65
ſtate
-0.64
pleaſure
-0.63
myſelf
-0.63
itſelf
-0.62
tanleria
-0.60
̈́
-0.60
évaluateur
-0.59
houſe
-0.58
POSITIVE LOGITS
else
0.71
otherwise
0.61
altrimenti
0.60
onCancelled
0.60
perma
0.58
otherwise
0.56
else
0.55
Else
0.55
sonst
0.54
Otherwise
0.53
Activations Density 0.144%