INDEX
Explanations
instances of text where something is done or stated publicly
instances of the word "publicly" and its variations
Negative Logits
nesota      -0.83
nian        -0.83
mble        -0.79
Velocity    -0.72
NER         -0.70
ners        -0.69
Origins     -0.69
wolves      -0.68
Upper       -0.67
Trem        -0.67
Positive Logits
shaming         0.97
humiliated      0.93
traded          0.92
funded          0.86
acknowledged    0.86
pronounce       0.84
proclaimed      0.84
humili          0.83
endorsed        0.83
denounced       0.83
Activations Density 0.017%