INDEX
Explanations
specific numbers related to measurements, statistics, and values
New Auto-Interp
Negative Logits
lue
-0.16
tsky
-0.16
actus
-0.15
irie
-0.15
er
-0.15
-в
-0.15
dez
-0.15
amedi
-0.15
ilion
-0.15
aries
-0.15
POSITIVE LOGITS
ervo
0.17
eda
0.16
red
0.16
etter
0.15
platz
0.15
thal
0.15
aw
0.14
akedirs
0.14
iac
0.14
mos
0.14
Activations Density 0.135%