INDEX
Explanations
phrases related to negative criticism or attacks
instances of the word "belittle" and its variations
New Auto-Interp
Negative Logits
inals
-0.79
office
-0.79
Fields
-0.72
anwhile
-0.71
INAL
-0.70
hibition
-0.69
Enhancement
-0.68
Regulation
-0.68
exempt
-0.67
azines
-0.65
POSITIVE LOGITS
ittle
0.90
gian
0.87
bel
0.84
aying
0.83
ayed
0.82
ieving
0.79
ayer
0.78
phe
0.76
ieved
0.74
ousing
0.73
Activations Density 0.010%