INDEX
Explanations
phrases expressing strong opinions or criticism
phrases expressing strong opinions or criticisms about decisions and events
New Auto-Interp
Negative Logits
enhagen
-0.76
toggle
-0.72
electric
-0.70
aceae
-0.70
Variable
-0.70
clustered
-0.69
Enlarge
-0.67
Puzzles
-0.65
directional
-0.65
Inventory
-0.63
POSITIVE LOGITS
disgrace
1.40
unacceptable
1.23
despicable
1.23
irresponsible
1.19
tarn
1.18
disrespect
1.18
shame
1.17
dishon
1.12
Shame
1.12
intolerable
1.10
Activations Density 1.154%