Explanations
phrases related to interference or intervention in various contexts
references to interference, particularly in political or social contexts
Negative Logits
Token      Logit
ndra       -0.81
sal        -0.75
chal       -0.74
":"/       -0.74
Bio        -0.72
True       -0.72
abc        -0.72
href       -0.71
Template   -0.69
mber       -0.68
Positive Logits
Token         Logit
interference  1.11
interfere     1.00
interfering   0.98
interfered    0.98
meddling      0.95
tampering     0.92
medd          0.84
newsp         0.78
intr          0.76
adversely     0.72
Activations Density 0.013%
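
The density figure can be read as the share of sampled tokens on which the feature fires at all. The following is a minimal sketch under that assumption (a token counts as active when its activation is strictly above zero); the function name `activation_density` and the synthetic sample are hypothetical and do not come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means active tokens divided by all sampled tokens.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Synthetic illustration only: 13 active tokens out of 100,000.
sample = [0.0] * 100_000
for i in range(13):
    sample[i] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")  # 0.013%
```

A density this low would mean the feature is highly selective, which fits a feature whose top positive logits are all close variants of "interference".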