INDEX
Explanations
phrases related to actions involving a change in structure or status
instances of the prefix "de-" indicating a process of reversal or negation
Negative Logits
Token        Logit
LW           -0.90
Dickinson    -0.75
Ys           -0.66
Dillon       -0.62
uez          -0.62
Thick        -0.61
ANGE         -0.61
galleries    -0.61
SHARES       -0.61
unto         -0.61
Positive Logits
Token        Logit
fact         1.22
emphasis     1.19
escal        1.18
cert         1.00
capt         0.99
vacc         0.97
funding      0.97
register     0.96
branded      0.95
radical      0.95
Activations Density: 0.035%