INDEX
Explanations
words related to challenging or opposing others
occurrences of the word "confront."
New Auto-Interp
Negative Logits
rotein
-0.76
exempt
-0.75
ectar
-0.74
adder
-0.74
protein
-0.72
flu
-0.72
serving
-0.71
sample
-0.71
RNA
-0.71
Ĥ¬
-0.69
POSITIVE LOGITS
confront
1.20
confrontation
0.93
confronting
0.92
confronts
0.87
ational
0.87
DonaldTrump
0.82
Tanz
0.79
ings
0.78
encounters
0.77
truths
0.77
Activations Density 0.006%