INDEX
Explanations
phrases related to acts of kindness and charitable activities
New Auto-Interp
Negative Logits
¯
-0.63
isin
-0.59
ãĤ¼
-0.58
>>\
-0.56
ukong
-0.56
ocent
-0.55
ftime
-0.53
Earthquake
-0.53
Disclosure
-0.51
elson
-0.50
POSITIVE LOGITS
them
1.39
ones
1.30
theirs
1.24
THEM
1.22
apiece
1.19
them
1.15
Them
1.13
They
1.05
respectively
1.04
they
1.04
Activations Density 1.052%