INDEX
Explanations
personal opinions or evaluations
instances of the word "it" in various contexts
New Auto-Interp
Negative Logits
ILCS
-0.65
odder
-0.62
ãĥ¯ãĥ³
-0.62
èĢħ
-0.61
è¦ļéĨĴ
-0.60
è£ıè¦ļéĨĴ
-0.60
estern
-0.59
shoot
-0.59
pring
-0.57
cribed
-0.56
POSITIVE LOGITS
nonetheless
1.28
does
1.24
nevertheless
1.23
doesn
1.22
isn
1.19
DOES
1.19
ain
1.18
certainly
1.13
shouldn
1.13
hasn
1.12
Activations Density 0.133%