INDEX
Explanations
elements in a structured code or markup format
New Auto-Interp
Negative Logits
ãĤº
-0.18
idl
-0.15
hir
-0.14
sko
-0.14
BDS
-0.14
vell
-0.14
isz
-0.14
chaft
-0.14
ubs
-0.13
pillar
-0.13
POSITIVE LOGITS
824
0.16
agon
0.15
·
0.15
uka
0.15
Jensen
0.15
Infer
0.14
032
0.14
ħ§
0.14
.misc
0.14
voj
0.14
Activations Density 0.042%