INDEX
Explanations
action verbs related to communication and disclosure
actions related to stating or explaining information
New Auto-Interp
Negative Logits
agos
-0.79
avery
-0.71
ceans
-0.71
boot
-0.69
eries
-0.68
agues
-0.68
aspx
-0.66
enf
-0.66
ï¸ı
-0.66
cour
-0.64
POSITIVE LOGITS
aloud
1.03
loudly
0.94
oneself
0.88
allegiance
0.82
anything
0.82
something
0.80
publicly
0.78
paternity
0.77
goodbye
0.76
wrongdoing
0.76
Activations Density 0.338%