INDEX
Explanations
words related to holding and maintaining positions or actions
New Auto-Interp
Negative Logits
Debor
-0.66
Heller
-0.60
ISON
-0.60
Finder
-0.60
Amend
-0.60
VM
-0.55
PRES
-0.54
Schr
-0.53
Ͻ
-0.51
Democr
-0.51
POSITIVE LOGITS
estones
0.79
ewater
0.72
icking
0.71
atal
0.69
estone
0.65
aces
0.65
enges
0.65
eworthy
0.64
appers
0.63
cially
0.63
Activations Density 0.020%