INDEX
Explanations
phrases indicating feelings of frustration or dissatisfaction
New Auto-Interp
Negative Logits
as
-0.22
als
-0.20
как
-0.17
bindung
-0.17
eous
-0.15
asca
-0.15
ÙĥÙħا
-0.15
cene
-0.15
evidenced
-0.15
as
-0.15
POSITIVE LOGITS
follows
0.36
cribed
0.30
opposed
0.29
cert
0.29
sembl
0.28
paragus
0.26
regards
0.26
cribing
0.25
soon
0.25
ides
0.24
Activations Density 0.365%