INDEX
Explanations
phrases related to visual or imaginative experiences
instances of strong emotional or dramatic expressions related to social issues
Negative Logits
dispers       -0.75
bonded        -0.71
scatter       -0.71
scattering    -0.71
detached      -0.70
barrier       -0.70
floating      -0.67
shroud        -0.67
Bermuda       -0.66
confinement   -0.66
Positive Logits
¹     1.07
į     1.00
Į     0.99
£     0.98
¤     0.95
ĸļ    0.94
º     0.93
¡     0.93
Ķ     0.91
Ĭ     0.91
Activations Density: 0.161%