INDEX
Explanations
contractions of "do not" followed by a verb
negations and expressions of reluctance or refusal
New Auto-Interp
Negative Logits
populated
-0.68
Alas
-0.66
eleph
-0.66
Adv
-0.66
princ
-0.65
nearest
-0.60
eagerly
-0.59
aims
-0.59
anwhile
-0.58
HF
-0.58
POSITIVE LOGITS
't
1.64
ÃŃ
0.96
nis
0.88
uts
0.84
etsk
0.84
ovan
0.82
iting
0.82
nat
0.81
´
0.80
n
0.79
Activations Density 0.116%