INDEX
Explanations
expressions indicating feelings or opinions
the word "about" in various contexts
Negative Logits
Token                       Logit
Peaks                       -0.82
KC                          -0.76
------------------------    -0.72
rang                        -0.71
gallery                     -0.69
ammy                        -0.68
ensation                    -0.68
hesis                       -0.66
CHR                         -0.66
igmatic                     -0.66
Positive Logits
Token                       Logit
respecting                  0.78
whether                     0.71
how                         0.71
donating                    0.70
halfway                     0.69
reforming                   0.67
migrating                   0.64
coerc                       0.64
ducks                       0.64
protecting                  0.64
Activations Density: 0.102%