Explanations
verbs related to denial, acknowledgement, and resistance in text
Negative Logits
blogspot      -0.52
gow           -0.50
chie          -0.49
emies         -0.49
ighth         -0.48
ummies        -0.47
zbek          -0.47
onies         -0.46
itched        -0.45
ollah         -0.45

Positive Logits
temptation     0.64
FontSize       0.50
>)             0.50
anymore        0.49
anything       0.47
nor            0.45
them           0.45
ANCE           0.44
anybody        0.44
ned            0.43
Activations Density: 12.232%