INDEX
Explanations
confrontational interactions or situations
instances of the word "confront" and its variations
New Auto-Interp
Negative Logits
arent
-0.71
aim
-0.70
serving
-0.70
erd
-0.69
exempt
-0.69
ramid
-0.69
sample
-0.69
oiler
-0.68
stream
-0.68
urt
-0.67
POSITIVE LOGITS
confront
0.92
confronts
0.86
confronting
0.85
confronted
0.78
confrontation
0.77
Uriel
0.76
DonaldTrump
0.71
Crusade
0.70
enged
0.69
illes
0.69
Activations Density 0.010%