INDEX
Explanations
terms related to physical arms or weapons
Negative Logits
Token      Logit
light      -0.68
ples       -0.61
ters       -0.61
DER        -0.58
PER        -0.57
charged    -0.57
rers       -0.56
faced      -0.56
TER        -0.55
flush      -0.55
Positive Logits
Token      Logit
ament      1.17
aments     1.13
aceutical  1.06
ando       1.05
ageddon    1.02
chair      0.96
agnetic    0.93
ally       0.93
heid       0.91
ophon      0.91
Activations Density: 1.431%