INDEX
Explanations
instances where things are not completely as expected or perfect
phrases indicating a degree of uncertainty or qualification
New Auto-Interp
Negative Logits
uments
-0.79
olan
-0.76
ogi
-0.76
orer
-0.72
udo
-0.69
lessness
-0.69
uty
-0.66
ously
-0.65
çļ
-0.65
no
-0.64
POSITIVE LOGITS
Enough
0.80
anymore
0.78
ifiable
0.74
enough
0.73
icable
0.71
fit
0.70
reconcil
0.69
yet
0.66
bothered
0.66
spoon
0.66
Activations Density 0.017%