INDEX
Explanations
statements about what is good for people or entities in a variety of contexts
phrases indicating what is beneficial or harmful for individuals or groups
New Auto-Interp
Negative Logits
orbit
-0.85
aults
-0.83
buster
-0.74
peer
-0.72
soever
-0.72
clair
-0.68
heid
-0.67
ieties
-0.67
ault
-0.67
operated
-0.67
POSITIVE LOGITS
everybody
1.15
us
1.09
ya
1.05
everyone
1.04
me
1.04
morale
1.01
awhile
1.01
sure
0.99
them
0.98
him
0.97
Activations Density 0.133%