INDEX
Explanations
references to historical events or significant developments in various fields
Negative Logits
tics        -0.87
mund        -0.84
gery        -0.79
Ïī          -0.77
crim        -0.70
iety        -0.70
ACTION      -0.70
facing      -0.69
fact        -0.69
politics    -0.69
Positive Logits
baseman        1.13
responders     1.10
batch          1.07
glimpse        0.97
ever           0.95
installment    0.93
foray          0.93
lady           0.93
incarnation    0.92
wave           0.91
Activations Density 8.260%