Explanations
nouns and their associated numeric values in a historical or chronological context
Negative Logits
¦         -0.18
iva       -0.16
oster     -0.14
ãĥ«ãĤ¯    -0.14
arton     -0.14
loth      -0.14
ÏĦιν      -0.14
apor      -0.14
dni       -0.14
ivol      -0.14
Positive Logits
igate     0.16
жи        0.15
refr      0.15
acular    0.15
lean      0.15
thon      0.14
_CN       0.14
iles      0.14
sole      0.14
spacer    0.14
Activations Density: 0.033%