INDEX
Explanations
adjectives related to comparison and evaluation
references to emotional states or conditions related to societal issues
Negative Logits
Selected    -0.71
aval        -0.67
origin      -0.65
agram       -0.62
asus        -0.62
Variant     -0.62
DNA         -0.61
Citation    -0.61
jew         -0.61
alion       -0.60
POSITIVE LOGITS
calmed      1.34
peaceful    1.34
calm        1.31
harmless    1.29
manageable  1.26
calming     1.24
safer       1.22
toler       1.19
peace       1.18
peacefully  1.17
Activations Density 1.298%