INDEX
Explanations
phrases indicating mediocrity or dissatisfaction
New Auto-Interp
Negative Logits
oux
-0.15
εια
-0.15
iar
-0.15
ampp
-0.15
rum
-0.14
ợ
-0.14
antas
-0.14
STILL
-0.14
Okay
-0.14
echan
-0.14
POSITIVE LOGITS
particularly
0.34
necessarily
0.30
especially
0.27
particularly
0.27
especialmente
0.26
earth
0.25
terribly
0.22
exactly
0.22
especially
0.21
icularly
0.21
Activations Density 0.198%