INDEX
Explanations
references to personal or professional backgrounds
Negative Logits
ábado       -0.16
iku         -0.16
Tiger       -0.15
mond        -0.15
alf         -0.15
ase         -0.14
Ì           -0.14
iro         -0.14
LINEAR      -0.14
dopamine    -0.14
Positive Logits
chein       0.16
czy         0.15
weis        0.14
apist       0.14
$criteria   0.14
$MESS       0.14
ULER        0.14
.hm         0.13
AGMA        0.13
thren       0.13
Activations Density: 0.010%