INDEX
Explanations
the phrase "not much"
expressions of scarcity or limitation
Negative Logits
idon     -0.89
yne      -0.79
İĭ       -0.74
otom     -0.74
arium    -0.74
kus      -0.73
eneg     -0.72
adia     -0.71
grad     -0.70
eters    -0.70
Positive Logits
avail          0.91
else           0.90
consolation    0.89
ado            0.83
anymore        0.82
luck           0.81
fuss           0.79
remorse        0.74
longer         0.73
happ           0.72
Activations Density: 0.034%