INDEX
Explanations
adjectives expressing opinions or evaluations about things or people
words related to characterization and opinions
New Auto-Interp
Negative Logits
ammy
-0.66
hoff
-0.64
cum
-0.64
nir
-0.64
perty
-0.62
neau
-0.60
stration
-0.60
externalActionCode
-0.59
kr
-0.59
isoft
-0.59
POSITIVE LOGITS
phas
0.94
ourselves
0.83
oneself
0.79
encies
0.78
it
0.77
Ī
0.77
enance
0.76
pointers
0.74
themselves
0.74
tones
0.73
Activations Density 0.199%