Explanations
adjectives related to describing personality traits or qualities
terms related to character analysis and characterization
Negative Logits
nesday      -0.76
EMS         -0.68
RESULTS     -0.67
VEN         -0.64
xon         -0.64
Indies      -0.64
yg          -0.64
lawfully    -0.62
sterdam     -0.62
Tanz        -0.61
Positive Logits
istically   1.83
istics      1.70
izations    1.65
izes        1.37
isations    1.37
isation     1.36
izing       1.22
ised        1.18
istic       1.17
istical     1.13
Activations Density: 0.034%