INDEX
Explanations
proper nouns or capitalized phrases unrelated to each other
words related to upcoming events or actions
Negative Logits
Token            Logit
expulsion        -0.70
reciproc         -0.68
srfAttach        -0.65
hement           -0.64
shedding         -0.63
shock            -0.63
defamation       -0.63
suppressed       -0.62
imposed          -0.60
instantaneous    -0.59
Positive Logits
Token            Logit
Own              0.91
Else             0.91
tons             0.81
!,               0.72
angered          0.72
Unknown          0.71
icia             0.71
Dying            0.71
Dead             0.70
Happ             0.70
Activations Density: 0.284%