INDEX
Explanations
words related to promoting or supporting something
terms related to support and promotion
Negative Logits
Alexandria    -0.69
Wol           -0.66
Mend          -0.64
Sph           -0.63
Supervisor    -0.62
mammoth       -0.61
Cot           -0.61
Poc           -0.61
Watkins       -0.61
BMC           -0.60
Positive Logits
terday        1.17
ardless       1.10
acters        1.05
tenance       1.04
actly         1.03
ificantly     1.02
etheless      1.00
ruction       0.98
antic         0.98
initely       0.98
Activations Density: 0.178%