INDEX
Explanations
words related to positions of authority or control
words related to announcements and declarations
Negative Logits
Dragonbound   -0.72
Dahl          -0.72
Beasts        -0.69
Rowling       -0.68
Lemon         -0.67
Penguin       -0.66
Solitaire     -0.66
Lamp          -0.66
Bots          -0.65
Elves         -0.65
Positive Logits
abor          0.95
ased          0.93
ction         0.92
irm           0.90
agog          0.90
asons         0.90
urring        0.89
ission        0.87
vered         0.87
ocal          0.86
Activations Density: 0.158%