INDEX
Explanations
mentions of monitoring activities or systems
terms related to observation and supervision
New Auto-Interp
Negative Logits
illard
-0.83
Anth
-0.80
erella
-0.76
hire
-0.76
Masquerade
-0.74
onne
-0.74
anne
-0.74
ONT
-0.73
bery
-0.73
ALSE
-0.72
POSITIVE LOGITS
monitoring
1.10
monitors
1.10
monitored
1.10
monitor
1.04
itored
0.93
Monitor
0.90
closely
0.90
dog
0.82
developments
0.82
attent
0.79
Activations Density 0.040%