INDEX
Explanations
verbs related to actions of intervention or obstruction
instances of the word "interfere" in various contexts
New Auto-Interp
Negative Logits
sal
-0.84
tu
-0.75
Bio
-0.73
True
-0.73
Template
-0.70
Sam
-0.69
ndra
-0.69
mber
-0.68
chal
-0.67
tim
-0.67
POSITIVE LOGITS
interfere
1.24
interfered
1.14
interfering
1.10
interference
0.89
medd
0.87
adversely
0.83
awei
0.82
newsp
0.79
undermin
0.79
overl
0.77
Activations Density 0.007%