INDEX
Explanations
words related to deeply rooted ideas or positions
terms related to established or deep-rooted interests and their influence
Negative Logits
Token       Logit
hner        -0.86
oping       -0.84
ynthesis    -0.82
othy        -0.76
kers        -0.75
ioxide      -0.74
asers       -0.74
paces       -0.73
umbn        -0.72
ionic       -0.72
Positive Logits
Token       Logit
entrenched  1.31
ingrained   1.08
vested      0.83
behavi      0.80
embroiled   0.78
incumbent   0.77
incumb      0.76
proble      0.75
entangled   0.75
lishes      0.74
Activations Density: 0.015%