INDEX
Explanations
references to hierarchical relationships and comparisons between concepts
Negative Logits
zos            -0.17
PT             -0.15
lrt            -0.15
dbcTemplate    -0.15
urb            -0.15
hausen         -0.14
╗              -0.14
maal           -0.13
ton            -0.13
'є             -0.13
Positive Logits
another        0.36
another        0.31
Another        0.30
Another        0.27
others         0.23
另             0.21
otro           0.19
другой         0.19
Others         0.19
otra           0.18
Activations Density 0.043%