INDEX
Explanations
words related to caring or concern
expressions of concern or indifference
New Auto-Interp
Negative Logits
ãĥĥãĥī
-0.73
oute
-0.73
è¦ļéĨĴ
-0.68
ãĥ³ãĤ¸
-0.67
GV
-0.65
UES
-0.65
jam
-0.65
sclerosis
-0.62
resume
-0.61
adr
-0.61
POSITIVE LOGITS
lessly
1.15
taker
1.15
passionately
1.03
cared
0.99
fully
0.90
giving
0.86
bear
0.81
tta
0.80
lessness
0.79
der
0.79
Activations Density 0.015%