INDEX
Explanations
numerical values related to research and scientific data
New Auto-Interp
Negative Logits
èŃ·
-0.15
ugen
-0.15
raid
-0.15
rello
-0.15
oret
-0.14
rei
-0.14
ano
-0.14
žel
-0.14
umm
-0.14
reon
-0.14
POSITIVE LOGITS
s
0.20
jeme
0.16
e
0.15
eless
0.15
Femme
0.15
arde
0.15
umeric
0.14
peare
0.14
SION
0.14
vÄĽÅĻ
0.14
Activations Density 0.057%