INDEX
Explanations
instances of important statistical or numerical data
New Auto-Interp
Negative Logits
less
-0.17
ometr
-0.14
Boom
-0.14
791
-0.14
oman
-0.14
horn
-0.14
996
-0.14
æĭ
-0.13
litt
-0.13
in
-0.13
POSITIVE LOGITS
ige
0.15
Moder
0.15
ulle
0.15
nave
0.15
iche
0.15
lander
0.14
aires
0.14
بار
0.14
$MESS
0.14
ETA
0.14
Activations Density 0.008%