INDEX
Explanations
instances of desire or intent expressed with "want to" and other modal verbs
New Auto-Interp
Negative Logits
ä¸ļ
-0.17
stru
-0.16
nev
-0.16
uler
-0.15
Hindered
-0.15
anium
-0.14
eldom
-0.14
erie
-0.14
ckt
-0.14
amber
-0.14
POSITIVE LOGITS
rain
0.22
bore
0.21
Rain
0.20
labour
0.20
dwelling
0.20
Ñĥгл
0.19
Rain
0.19
RAIN
0.19
dwell
0.19
sound
0.19
Activations Density 0.111%