INDEX
Explanations
verbs indicating actions or emotional reactions
phrases indicating emotions and reactions to change
New Auto-Interp
Negative Logits
selves
-0.70
unison
-0.67
hub
-0.63
they
-0.57
angular
-0.56
ocument
-0.55
VERTISEMENT
-0.54
respectively
-0.54
taboola
-0.54
ADVERTISEMENT
-0.53
POSITIVE LOGITS
himself
1.80
Himself
1.28
his
1.25
herself
1.11
HIS
0.94
His
0.92
his
0.90
His
0.88
subordinates
0.79
wife
0.70
Activations Density 2.161%