INDEX
Explanations
coding or special characters, such as symbols and arrows
occurrences of a specific symbol or character
New Auto-Interp
Negative Logits
shelf
-0.68
plain
-0.67
ifice
-0.66
Gorge
-0.64
Bombs
-0.64
disposable
-0.62
DOT
-0.62
coffin
-0.61
Spit
-0.61
conflic
-0.61
POSITIVE LOGITS
especially
0.96
then
0.88
particularly
0.84
said
0.82
which
0.79
_>
0.79
unknown
0.77
¹
0.77
£
0.76
insert
0.76
Activations Density 0.101%