INDEX
Explanations
expressions of personal experiences and feelings
New Auto-Interp
Negative Logits
estic
-0.15
oloj
-0.15
subpoena
-0.14
adir
-0.14
ovich
-0.14
_visible
-0.13
)[-
-0.13
Recorder
-0.13
ĥĿ
-0.13
Visible
-0.13
POSITIVE LOGITS
hear
0.71
heard
0.71
hearing
0.66
read
0.66
hears
0.58
heard
0.58
hear
0.56
Heard
0.56
Hear
0.55
reading
0.54
Activations Density 0.344%