INDEX
Explanations
expressions indicating a lack of concern or indifference
expressions indicating indifference or lack of concern
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.70
oute
-0.66
UES
-0.65
resume
-0.65
confirmation
-0.64
uterte
-0.63
arb
-0.62
GV
-0.62
urat
-0.60
akedown
-0.60
POSITIVE LOGITS
lessly
1.03
taker
0.99
cared
0.98
giving
0.92
fully
0.90
passionately
0.84
sacrific
0.80
fulness
0.74
bear
0.72
tta
0.72
Activations Density 0.014%