INDEX
Explanations
expressions related to feelings of discomfort or unease
New Auto-Interp
Head Attr Weights
0:0.04
1:0.03
2:0.22
3:0.05
4:0.26
5:0.04
6:0.02
7:0.03
8:0.09
9:0.09
10:0.05
11:0.03
Negative Logits
ardless
-1.43
Lich
-1.42
Aff
-1.27
Tsukuyomi
-1.24
hetical
-1.21
Kore
-1.21
hai
-1.20
Ess
-1.18
Earthquake
-1.18
affinity
-1.17
POSITIVE LOGITS
itiveness
1.60
ewater
1.48
Alto
1.35
beck
1.33
bour
1.33
ths
1.32
opers
1.30
Ã
1.30
frogs
1.20
nir
1.20
Activations Density 0.000%