INDEX
Explanations
words indicating states of being and existence
New Auto-Interp
Negative Logits
moot
-0.15
Something
-0.14
íķŃ
-0.14
sight
-0.14
annon
-0.14
inta
-0.14
anything
-0.13
rench
-0.13
lob
-0.13
itez
-0.13
POSITIVE LOGITS
happening
0.37
happen
0.25
happened
0.24
happens
0.24
Happ
0.22
happ
0.20
åıijçĶŁ
0.19
aconte
0.18
wrong
0.17
wrong
0.17
Activations Density 0.080%