INDEX
Explanations
keywords related to locations or events
the verb "is" and its variations as indicators of state or existence
New Auto-Interp
Negative Logits
Antar
-0.60
edIn
-0.58
weights
-0.56
Autob
-0.55
senses
-0.55
Seym
-0.55
eor
-0.54
IMAGES
-0.54
atern
-0.53
Benn
-0.53
POSITIVE LOGITS
rael
1.29
omorphic
1.18
senal
1.04
ometric
1.03
lam
1.02
othermal
1.01
olation
1.01
abella
0.98
olate
0.96
tis
0.94
Activations Density 0.212%