INDEX
Explanations
words associated with emotions or states of being
New Auto-Interp
Negative Logits
ADOS
-0.17
umen
-0.15
OfMonth
-0.14
:frame
-0.14
elden
-0.14
edef
-0.13
coe
-0.13
Elias
-0.13
PRESSION
-0.13
resenter
-0.13
POSITIVE LOGITS
about
0.19
ingly
0.19
fit
0.16
олог
0.15
Complaint
0.14
quant
0.14
phy
0.14
Citizen
0.13
ens
0.13
chw
0.13
Activations Density 0.131%