INDEX
Explanations
occurrences of the negation word "not" in various contexts
New Auto-Interp
Negative Logits
raiſ
-0.88
fhew
-0.82
Efq
-0.80
fevere
-0.79
ſever
-0.76
Sarm
-0.74
ſeveral
-0.74
pleaſure
-0.73
uſed
-0.71
feveral
-0.70
POSITIVE LOGITS
not
2.52
not
2.33
Not
2.22
NOT
2.13
Not
2.06
NOT
2.05
nicht
1.34
isNot
1.23
niet
1.22
nicht
1.17
Activations Density 0.222%