INDEX
Explanations
instances of negation or denial in the text
New Auto-Interp
Negative Logits
unal
-0.15
misunderstanding
-0.15
eno
-0.14
iven
-0.14
TestData
-0.14
ambi
-0.14
ertino
-0.14
ernel
-0.14
amb
-0.13
Marino
-0.13
POSITIVE LOGITS
realize
0.41
realization
0.39
realise
0.37
realizes
0.36
realized
0.36
realizing
0.33
realised
0.33
réalis
0.28
æĦıè¯Ĩ
0.25
realiz
0.25
Activations Density 0.077%