INDEX
Explanations
phrases that emphasize evaluation or judgment about a performance or experience
New Auto-Interp
Negative Logits
adoo
-0.14
khúc
-0.14
ом
-0.14
ÙħÙĪÙĨ
-0.14
omb
-0.14
REA
-0.14
sez
-0.13
umbo
-0.13
isel
-0.13
undaki
-0.13
POSITIVE LOGITS
ival
0.16
anything
0.15
something
0.15
gress
0.14
fairly
0.14
exactly
0.14
prim
0.14
quite
0.14
arel
0.13
_lifetime
0.13
Activations Density 0.224%