INDEX
Explanations
instances of the word "excellent" or related terms indicating high quality or praise
New Auto-Interp
Negative Logits
-0.16
ến
-0.15
oleon
-0.14
emic
-0.14
ields
-0.14
oke
-0.14
ksam
-0.14
duk
-0.14
ationally
-0.14
oko
-0.14
POSITIVE LOGITS
-quality
0.22
itude
0.18
iterals
0.16
ARRIER
0.16
-looking
0.15
lah
0.15
mente
0.15
ifar
0.15
ibrary
0.14
bih
0.14
Activations Density 0.023%