INDEX
Explanations
the word "regardless" and its variations, indicating a focus on unconditionality or lack of consideration for circumstances
New Auto-Interp
Negative Logits
jin
-0.17
tra
-0.16
tron
-0.16
onica
-0.15
tings
-0.15
zsche
-0.14
uma
-0.14
овÑĸд
-0.14
oke
-0.14
egrity
-0.14
POSITIVE LOGITS
whether
0.25
how
0.23
whether
0.21
LY
0.20
ly
0.20
fully
0.20
what
0.19
ness
0.17
ä¹İ
0.17
antly
0.17
Activations Density 0.009%