INDEX
Explanations
occurrences of the word "being" in various contexts
Negative Logits
hire         -0.16
меть         -0.16
heimer       -0.16
ordinates    -0.15
archical     -0.15
Inspectable  -0.15
empor        -0.15
lue          -0.15
Available    -0.15
нок          -0.15
Positive Logits
ness         0.24
awan         0.18
actively     0.18
held         0.17
groom        0.17
eld          0.17
360          0.16
eyed         0.16
ing          0.16
currently    0.15
Activations Density 0.026%
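
The page does not say how the density figure is computed. A common reading is that it is the share of sampled token positions on which the feature fires at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of sampled token positions
    where the feature is active. The source does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Toy data (made up): 3 active positions out of 10,000 sampled positions.
toy = [0.0] * 9997 + [1.2, 0.4, 3.1]
print(f"{activation_density(toy):.3f}%")  # prints "0.030%"
```

Under this reading, a density of 0.026% would mean the feature is active on roughly 26 of every 100,000 sampled positions.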