INDEX
Explanations
words related to events or actions that involve recommendations, celebrations, or emotional responses
words related to revealing information or events
New Auto-Interp
Negative Logits
intrusion
-0.65
Dragonbound
-0.64
owe
-0.57
FISA
-0.57
impossibility
-0.56
icipated
-0.56
Doe
-0.55
ISO
-0.55
iane
-0.55
Observer
-0.55
POSITIVE LOGITS
llers
1.58
ller
1.55
lling
1.53
ptions
1.29
ptic
1.27
brate
1.27
lled
1.27
ven
1.27
vered
1.26
brates
1.23
Activations Density 0.160%